Overview

Dataset statistics

Number of variables49
Number of observations301
Missing cells2333
Missing cells (%)15.8%
Duplicate rows3
Duplicate rows (%)1.0%
Total size in memory115.4 KiB
Average record size in memory392.4 B

Variable types

Unsupported42
Text7

Dataset

Description사업체조사 산업중분류, 대표자남여 및 읍면동별 사업체수(2013년) 정보 입니다.
Author경기도 용인시
URLhttps://www.data.go.kr/data/15053394/fileData.do

Alerts

Dataset has 3 (1.0%) duplicate rowsDuplicates
7. 산업중분류, 대표자남여 및 읍면동별 사업체수 has 207 (68.8%) missing valuesMissing
Unnamed: 1 has 21 (7.0%) missing valuesMissing
Unnamed: 2 has 21 (7.0%) missing valuesMissing
Unnamed: 3 has 21 (7.0%) missing valuesMissing
Unnamed: 4 has 21 (7.0%) missing valuesMissing
Unnamed: 5 has 21 (7.0%) missing valuesMissing
Unnamed: 6 has 21 (7.0%) missing valuesMissing
Unnamed: 7 has 21 (7.0%) missing valuesMissing
Unnamed: 8 has 21 (7.0%) missing valuesMissing
Unnamed: 9 has 21 (7.0%) missing valuesMissing
7. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces has 20 (6.6%) missing valuesMissing
Unnamed: 11 has 21 (7.0%) missing valuesMissing
Unnamed: 12 has 21 (7.0%) missing valuesMissing
Unnamed: 13 has 21 (7.0%) missing valuesMissing
Unnamed: 14 has 21 (7.0%) missing valuesMissing
Unnamed: 15 has 21 (7.0%) missing valuesMissing
Unnamed: 16 has 21 (7.0%) missing valuesMissing
Unnamed: 17 has 21 (7.0%) missing valuesMissing
Unnamed: 18 has 21 (7.0%) missing valuesMissing
Unnamed: 19 has 207 (68.8%) missing valuesMissing
Unnamed: 20 has 22 (7.3%) missing valuesMissing
7. 산업중분류, 대표자남여 및 읍면동별 사업체수.1 has 207 (68.8%) missing valuesMissing
Unnamed: 22 has 21 (7.0%) missing valuesMissing
Unnamed: 23 has 21 (7.0%) missing valuesMissing
Unnamed: 24 has 21 (7.0%) missing valuesMissing
Unnamed: 25 has 21 (7.0%) missing valuesMissing
Unnamed: 26 has 21 (7.0%) missing valuesMissing
Unnamed: 27 has 21 (7.0%) missing valuesMissing
Unnamed: 28 has 21 (7.0%) missing valuesMissing
Unnamed: 29 has 21 (7.0%) missing valuesMissing
Unnamed: 30 has 21 (7.0%) missing valuesMissing
7. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces.1 has 20 (6.6%) missing valuesMissing
Unnamed: 32 has 21 (7.0%) missing valuesMissing
Unnamed: 33 has 21 (7.0%) missing valuesMissing
Unnamed: 34 has 21 (7.0%) missing valuesMissing
Unnamed: 35 has 21 (7.0%) missing valuesMissing
Unnamed: 36 has 21 (7.0%) missing valuesMissing
Unnamed: 37 has 21 (7.0%) missing valuesMissing
Unnamed: 38 has 21 (7.0%) missing valuesMissing
Unnamed: 39 has 21 (7.0%) missing valuesMissing
Unnamed: 40 has 207 (68.8%) missing valuesMissing
Unnamed: 41 has 22 (7.3%) missing valuesMissing
7. 산업중분류, 대표자남여 및 읍면동별 사업체수.2 has 207 (68.8%) missing valuesMissing
Unnamed: 43 has 21 (7.0%) missing valuesMissing
Unnamed: 44 has 21 (7.0%) missing valuesMissing
Unnamed: 45 has 207 (68.8%) missing valuesMissing
Unnamed: 46 has 22 (7.3%) missing valuesMissing
Unnamed: 47 has 208 (69.1%) missing valuesMissing
Unnamed: 48 has 21 (7.0%) missing valuesMissing
7. 산업중분류, 대표자남여 및 읍면동별 사업체수 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
7. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
7. 산업중분류, 대표자남여 및 읍면동별 사업체수.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 26 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 27 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 28 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 29 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 30 is an unsupported type, check if it needs cleaning or further analysisUnsupported
7. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 32 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 33 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 34 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 35 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 36 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 37 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 38 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 39 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 40 is an unsupported type, check if it needs cleaning or further analysisUnsupported
7. 산업중분류, 대표자남여 및 읍면동별 사업체수.2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 44 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 45 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 47 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 23:17:43.196182
Analysis finished2023-12-12 23:17:44.013411
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

7. 산업중분류, 대표자남여 및 읍면동별 사업체수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 1
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB
2023-12-13T08:17:44.270015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
ValueCountFrequency (%)
남자 93
17.5%
여자 93
17.5%
50
 
9.4%
제조업 21
 
4.0%
서비스업 14
 
2.6%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
자동차 3
 
0.6%
Other values (191) 229
43.1%
2023-12-13T08:17:44.728098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB
Missing20
Missing (%)6.6%
Memory size2.5 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 20
Text

MISSING 

Distinct95
Distinct (%)34.1%
Missing22
Missing (%)7.3%
Memory size2.5 KiB
2023-12-13T08:17:45.075346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length7.781362
Min length2

Characters and Unicode

Total characters2171
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)33.3%

Sample

1st row전산업
2nd rowMale
3rd rowFemale
4th row농업, 임업 및 어업 (01 ~ 03)
5th rowMale
ValueCountFrequency (%)
male 93
17.6%
female 93
17.6%
50
 
9.5%
제조업 21
 
4.0%
서비스업 14
 
2.7%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
운송업 3
 
0.6%
Other values (188) 226
42.8%
2023-12-13T08:17:45.578920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 279
 
12.9%
250
 
11.5%
l 186
 
8.6%
a 186
 
8.6%
99
 
4.6%
M 93
 
4.3%
F 93
 
4.3%
m 93
 
4.3%
50
 
2.3%
44
 
2.0%
Other values (172) 798
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 825
38.0%
Lowercase Letter 744
34.3%
Space Separator 250
 
11.5%
Uppercase Letter 186
 
8.6%
Decimal Number 73
 
3.4%
Other Punctuation 38
 
1.8%
Open Punctuation 19
 
0.9%
Close Punctuation 19
 
0.9%
Math Symbol 17
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
7 7
9.6%
0 7
9.6%
9 7
9.6%
8 7
9.6%
1 5
6.8%
2 2
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 279
37.5%
l 186
25.0%
a 186
25.0%
m 93
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 93
50.0%
F 93
50.0%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 930
42.8%
Hangul 825
38.0%
Common 416
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Common
ValueCountFrequency (%)
250
60.1%
, 29
 
7.0%
( 19
 
4.6%
) 19
 
4.6%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
7 7
 
1.7%
Other values (7) 37
 
8.9%
Latin
ValueCountFrequency (%)
e 279
30.0%
l 186
20.0%
a 186
20.0%
M 93
 
10.0%
F 93
 
10.0%
m 93
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
61.9%
Hangul 825
38.0%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 279
20.8%
250
18.6%
l 186
13.8%
a 186
13.8%
M 93
 
6.9%
F 93
 
6.9%
m 93
 
6.9%
, 29
 
2.2%
( 19
 
1.4%
) 19
 
1.4%
Other values (12) 97
 
7.2%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

7. 산업중분류, 대표자남여 및 읍면동별 사업체수.1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 22
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB
2023-12-13T08:17:45.909327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
ValueCountFrequency (%)
남자 93
17.5%
여자 93
17.5%
50
 
9.4%
제조업 21
 
4.0%
서비스업 14
 
2.6%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
자동차 3
 
0.6%
Other values (191) 229
43.1%
2023-12-13T08:17:46.447317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 23
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 25
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 26
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 27
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 28
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 29
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 30
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB
Missing20
Missing (%)6.6%
Memory size2.5 KiB

Unnamed: 32
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 33
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 34
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 35
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 36
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 37
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 38
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 39
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 40
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 41
Text

MISSING 

Distinct95
Distinct (%)34.1%
Missing22
Missing (%)7.3%
Memory size2.5 KiB
2023-12-13T08:17:46.803890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length7.781362
Min length2

Characters and Unicode

Total characters2171
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)33.3%

Sample

1st row전산업
2nd rowMale
3rd rowFemale
4th row농업, 임업 및 어업 (01 ~ 03)
5th rowMale
ValueCountFrequency (%)
male 93
17.6%
female 93
17.6%
50
 
9.5%
제조업 21
 
4.0%
서비스업 14
 
2.7%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
운송업 3
 
0.6%
Other values (188) 226
42.8%
2023-12-13T08:17:47.324924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 279
 
12.9%
250
 
11.5%
l 186
 
8.6%
a 186
 
8.6%
99
 
4.6%
M 93
 
4.3%
F 93
 
4.3%
m 93
 
4.3%
50
 
2.3%
44
 
2.0%
Other values (172) 798
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 825
38.0%
Lowercase Letter 744
34.3%
Space Separator 250
 
11.5%
Uppercase Letter 186
 
8.6%
Decimal Number 73
 
3.4%
Other Punctuation 38
 
1.8%
Open Punctuation 19
 
0.9%
Close Punctuation 19
 
0.9%
Math Symbol 17
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
7 7
9.6%
0 7
9.6%
9 7
9.6%
8 7
9.6%
1 5
6.8%
2 2
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 279
37.5%
l 186
25.0%
a 186
25.0%
m 93
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 93
50.0%
F 93
50.0%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 930
42.8%
Hangul 825
38.0%
Common 416
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Common
ValueCountFrequency (%)
250
60.1%
, 29
 
7.0%
( 19
 
4.6%
) 19
 
4.6%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
7 7
 
1.7%
Other values (7) 37
 
8.9%
Latin
ValueCountFrequency (%)
e 279
30.0%
l 186
20.0%
a 186
20.0%
M 93
 
10.0%
F 93
 
10.0%
m 93
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
61.9%
Hangul 825
38.0%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 279
20.8%
250
18.6%
l 186
13.8%
a 186
13.8%
M 93
 
6.9%
F 93
 
6.9%
m 93
 
6.9%
, 29
 
2.2%
( 19
 
1.4%
) 19
 
1.4%
Other values (12) 97
 
7.2%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

7. 산업중분류, 대표자남여 및 읍면동별 사업체수.2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 43
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB
2023-12-13T08:17:47.692141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
ValueCountFrequency (%)
남자 93
17.5%
여자 93
17.5%
50
 
9.4%
제조업 21
 
4.0%
서비스업 14
 
2.6%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
자동차 3
 
0.6%
Other values (191) 229
43.1%
2023-12-13T08:17:48.174554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 44
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 45
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 46
Text

MISSING 

Distinct95
Distinct (%)34.1%
Missing22
Missing (%)7.3%
Memory size2.5 KiB
2023-12-13T08:17:48.479370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length7.781362
Min length2

Characters and Unicode

Total characters2171
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)33.3%

Sample

1st row전산업
2nd rowMale
3rd rowFemale
4th row농업, 임업 및 어업 (01 ~ 03)
5th rowMale
ValueCountFrequency (%)
male 93
17.6%
female 93
17.6%
50
 
9.5%
제조업 21
 
4.0%
서비스업 14
 
2.7%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
운송업 3
 
0.6%
Other values (188) 226
42.8%
2023-12-13T08:17:48.908287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 279
 
12.9%
250
 
11.5%
l 186
 
8.6%
a 186
 
8.6%
99
 
4.6%
M 93
 
4.3%
F 93
 
4.3%
m 93
 
4.3%
50
 
2.3%
44
 
2.0%
Other values (172) 798
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 825
38.0%
Lowercase Letter 744
34.3%
Space Separator 250
 
11.5%
Uppercase Letter 186
 
8.6%
Decimal Number 73
 
3.4%
Other Punctuation 38
 
1.8%
Open Punctuation 19
 
0.9%
Close Punctuation 19
 
0.9%
Math Symbol 17
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
7 7
9.6%
0 7
9.6%
9 7
9.6%
8 7
9.6%
1 5
6.8%
2 2
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 279
37.5%
l 186
25.0%
a 186
25.0%
m 93
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 93
50.0%
F 93
50.0%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 930
42.8%
Hangul 825
38.0%
Common 416
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Common
ValueCountFrequency (%)
250
60.1%
, 29
 
7.0%
( 19
 
4.6%
) 19
 
4.6%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
7 7
 
1.7%
Other values (7) 37
 
8.9%
Latin
ValueCountFrequency (%)
e 279
30.0%
l 186
20.0%
a 186
20.0%
M 93
 
10.0%
F 93
 
10.0%
m 93
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
61.9%
Hangul 825
38.0%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 279
20.8%
250
18.6%
l 186
13.8%
a 186
13.8%
M 93
 
6.9%
F 93
 
6.9%
m 93
 
6.9%
, 29
 
2.2%
( 19
 
1.4%
) 19
 
1.4%
Other values (12) 97
 
7.2%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 47
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing208
Missing (%)69.1%
Memory size2.5 KiB

Unnamed: 48
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB
2023-12-13T08:17:49.175631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
ValueCountFrequency (%)
남자 93
17.5%
여자 93
17.5%
50
 
9.4%
제조업 21
 
4.0%
서비스업 14
 
2.6%
12
 
2.3%
기타 7
 
1.3%
제외 5
 
0.9%
광업 4
 
0.8%
자동차 3
 
0.6%
Other values (191) 229
43.1%
2023-12-13T08:17:49.555241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Sample

7. 산업중분류, 대표자남여 및 읍면동별 사업체수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 97. Number of Establishments by Industrial Divisions, Sex of Representatives & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 207. 산업중분류, 대표자남여 및 읍면동별 사업체수.1Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 307. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces.1Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 417. 산업중분류, 대표자남여 및 읍면동별 사업체수.2Unnamed: 43Unnamed: 44Unnamed: 45Unnamed: 46Unnamed: 47Unnamed: 48
0단위 : 개<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In eachNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 개<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In eachNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 개<NA>NaNNaN<NA>NaN<NA>
1NaN산업분류 Industrial classification용인시\nYongin-si처인구\nCheoin-gu포곡읍\nPogok-eup모현면\nMohyeon-myeon남사면\nNamsa-myeon이동면\nIdong-myeon원삼면\nWonsam-myeon백암면\nBaegam-myeon양지면\nYangji-myeon중앙동\nJungang-dong역삼동\nYeoksam-dong유림동\nYurim-dong동부동\nDongbu-dong기흥구\nGiheung-gu구갈동\nGugal-dong상갈동\nSanggal-dong기흥동\nGiheung-dong산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification서농동\nSeonong-dong구성동\nGuseong-dong마북동\nMabuk-dong동백동\nDongbaek-dong보정동\nBojeong-dong상하동\nSangha-dong신갈동\nSingal-dong영덕동\nYeongdeok-dong수지구\nSuji-gu풍덕천1동\nPungdoekcheon 1(il)-dong풍덕천2동\nPungdoekcheon 2(i)-dong신봉동\nSinbong-dong죽전1동\nJukjeon 1(il)-dong죽전2동\nJukjeon 2(i)-dong동천동\nDongcheon-dong상현1동\nSanghyeon 1(il)-dong상현2동\nSanghyeon 2(i)-dong산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification성복동\nSeongbok-dong산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification
2NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaN<NA>NaN<NA>
3TT전산업39 92515 2751 8101 8517671 2476188361 5112 8881 3091 3111 12714 3221 8771 673782TT전산업TT전산업5028318982 0911 6977161 6111 64410 3281 8171 1558012 0121 0101 252635865TT전산업TT전산업781TT전산업TT전산업
4NaN남자24 25010 1081 1311 3665678334525471 1061 6768008827488 4901 1011 000533NaNMaleNaN남자3154884831 1909644499541 0135 6521 0546433941 129527716339445NaNMaleNaN남자405NaNMaleNaN남자
5NaN여자15 6755 1676794852004141662894051 2125094293795 832776673249NaNFemaleNaN여자1873434159017332676576314 676763512407883483536296420NaNFemaleNaN여자376NaNFemaleNaN여자
6NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaN<NA>NaN<NA>
7A농업, 임업 및 어업 (01 ~ 03)11101-4--1-3-1-1---A농업, 임업 및 어업 (01 ~ 03)A농업, 임업 및 어업 (01 ~ 03)------1----------A농업, 임업 및 어업 (01 ~ 03)A농업, 임업 및 어업 (01 ~ 03)-A농업, 임업 및 어업 (01 ~ 03)A농업, 임업 및 어업 (01 ~ 03)
8NaN남자11101-4--1-3-1-1---NaNMaleNaN남자------1----------NaNMaleNaN남자-NaNMaleNaN남자
9NaN여자-----------------NaNFemaleNaN여자-----------------NaNFemaleNaN여자-NaNFemaleNaN여자
7. 산업중분류, 대표자남여 및 읍면동별 사업체수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 97. Number of Establishments by Industrial Divisions, Sex of Representatives & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 207. 산업중분류, 대표자남여 및 읍면동별 사업체수.1Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 307. Number of Establishments by Industrial Divisions, Sex of Representatives & Provinces.1Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 417. 산업중분류, 대표자남여 및 읍면동별 사업체수.2Unnamed: 43Unnamed: 44Unnamed: 45Unnamed: 46Unnamed: 47Unnamed: 48
291NaN여자1 8685617940843162532148685349714988424NaNFemaleNaN여자19545311882271045159395596210739714157NaNFemaleNaN여자62NaNFemaleNaN여자
29294협회 및 단체1 046412373620292828435748464036862542094협회 및 단체94협회 및 단체1221234531234433266382518632044211594협회 및 단체94협회 및 단체2294협회 및 단체94협회 및 단체
293NaN남자9063442931172620223847364533322534819NaNMaleNaN남자12191838272333322403423166219351814NaNMaleNaN남자19NaNMaleNaN남자
294NaN여자14068853386510121746961NaNFemaleNaN여자-2574-1112642211931NaNFemaleNaN여자3NaNFemaleNaN여자
29595수리업1 130430445617281525476447474042034492495수리업95수리업1118345060265955280691723383736161795수리업95수리업2795수리업95수리업
296NaN남자9493683849162412234451414030358254321NaNMaleNaN남자10173046511946502235514152931291412NaNMaleNaN남자24NaNMaleNaN남자
297NaN여자18162671432313671062963NaNFemaleNaN여자11449713557143896725NaNFemaleNaN여자3NaNFemaleNaN여자
29896기타 개인 서비스업2 287660954413561431391747370518911171032896기타 개인 서비스업96기타 개인 서비스업237372144109341147473611280691404677558196기타 개인 서비스업96기타 개인 서비스업7696기타 개인 서비스업96기타 개인 서비스업
299NaN남자7402293016920914154923251928537318NaNMaleNaN남자5222837401434292263526174314221930NaNMaleNaN남자20NaNMaleNaN남자
300NaN여자1 547431652843651724125504532606807220NaNFemaleNaN여자185144107692080455107754529732553651NaNFemaleNaN여자56NaNFemaleNaN여자

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 20Unnamed: 22Unnamed: 41Unnamed: 43Unnamed: 46Unnamed: 48# duplicates
0남자Male남자Male남자Male남자93
1여자Female여자Female여자Female여자93
2<NA><NA><NA><NA><NA><NA><NA>21