Overview

Dataset statistics

Number of variables88
Number of observations488
Missing cells5250
Missing cells (%)12.2%
Duplicate rows5
Duplicate rows (%)1.0%
Total size in memory335.6 KiB
Average record size in memory704.3 B

Variable types

Unsupported79
Text9

Dataset

Description사업체조사 산업중분류, 조직형태 및 읍면동별 사업체수, 종사자수(2013년) 정보 입니다.
Author경기도 용인시
URLhttps://www.data.go.kr/data/15053397/fileData.do

Alerts

Dataset has 5 (1.0%) duplicate rowsDuplicates
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수 has 394 (80.7%) missing valuesMissing
Unnamed: 1 has 22 (4.5%) missing valuesMissing
Unnamed: 2 has 21 (4.3%) missing valuesMissing
Unnamed: 3 has 22 (4.5%) missing valuesMissing
Unnamed: 4 has 21 (4.3%) missing valuesMissing
Unnamed: 5 has 22 (4.5%) missing valuesMissing
Unnamed: 6 has 21 (4.3%) missing valuesMissing
Unnamed: 7 has 22 (4.5%) missing valuesMissing
Unnamed: 8 has 21 (4.3%) missing valuesMissing
Unnamed: 9 has 22 (4.5%) missing valuesMissing
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces has 20 (4.1%) missing valuesMissing
Unnamed: 11 has 22 (4.5%) missing valuesMissing
Unnamed: 12 has 21 (4.3%) missing valuesMissing
Unnamed: 13 has 22 (4.5%) missing valuesMissing
Unnamed: 14 has 21 (4.3%) missing valuesMissing
Unnamed: 15 has 22 (4.5%) missing valuesMissing
Unnamed: 16 has 21 (4.3%) missing valuesMissing
Unnamed: 17 has 22 (4.5%) missing valuesMissing
Unnamed: 18 has 21 (4.3%) missing valuesMissing
Unnamed: 19 has 22 (4.5%) missing valuesMissing
Unnamed: 20 has 394 (80.7%) missing valuesMissing
Unnamed: 21 has 23 (4.7%) missing valuesMissing
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.1 has 394 (80.7%) missing valuesMissing
Unnamed: 23 has 22 (4.5%) missing valuesMissing
Unnamed: 24 has 21 (4.3%) missing valuesMissing
Unnamed: 25 has 22 (4.5%) missing valuesMissing
Unnamed: 26 has 21 (4.3%) missing valuesMissing
Unnamed: 27 has 22 (4.5%) missing valuesMissing
Unnamed: 28 has 21 (4.3%) missing valuesMissing
Unnamed: 29 has 22 (4.5%) missing valuesMissing
Unnamed: 30 has 21 (4.3%) missing valuesMissing
Unnamed: 31 has 22 (4.5%) missing valuesMissing
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.1 has 20 (4.1%) missing valuesMissing
Unnamed: 33 has 22 (4.5%) missing valuesMissing
Unnamed: 34 has 21 (4.3%) missing valuesMissing
Unnamed: 35 has 22 (4.5%) missing valuesMissing
Unnamed: 36 has 21 (4.3%) missing valuesMissing
Unnamed: 37 has 22 (4.5%) missing valuesMissing
Unnamed: 38 has 21 (4.3%) missing valuesMissing
Unnamed: 39 has 22 (4.5%) missing valuesMissing
Unnamed: 40 has 21 (4.3%) missing valuesMissing
Unnamed: 41 has 22 (4.5%) missing valuesMissing
Unnamed: 42 has 394 (80.7%) missing valuesMissing
Unnamed: 43 has 23 (4.7%) missing valuesMissing
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.2 has 394 (80.7%) missing valuesMissing
Unnamed: 45 has 22 (4.5%) missing valuesMissing
Unnamed: 46 has 21 (4.3%) missing valuesMissing
Unnamed: 47 has 22 (4.5%) missing valuesMissing
Unnamed: 48 has 21 (4.3%) missing valuesMissing
Unnamed: 49 has 22 (4.5%) missing valuesMissing
Unnamed: 50 has 21 (4.3%) missing valuesMissing
Unnamed: 51 has 22 (4.5%) missing valuesMissing
Unnamed: 52 has 21 (4.3%) missing valuesMissing
Unnamed: 53 has 22 (4.5%) missing valuesMissing
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.2 has 20 (4.1%) missing valuesMissing
Unnamed: 55 has 22 (4.5%) missing valuesMissing
Unnamed: 56 has 21 (4.3%) missing valuesMissing
Unnamed: 57 has 22 (4.5%) missing valuesMissing
Unnamed: 58 has 21 (4.3%) missing valuesMissing
Unnamed: 59 has 22 (4.5%) missing valuesMissing
Unnamed: 60 has 21 (4.3%) missing valuesMissing
Unnamed: 61 has 22 (4.5%) missing valuesMissing
Unnamed: 62 has 21 (4.3%) missing valuesMissing
Unnamed: 63 has 22 (4.5%) missing valuesMissing
Unnamed: 64 has 394 (80.7%) missing valuesMissing
Unnamed: 65 has 23 (4.7%) missing valuesMissing
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.3 has 394 (80.7%) missing valuesMissing
Unnamed: 67 has 22 (4.5%) missing valuesMissing
Unnamed: 68 has 21 (4.3%) missing valuesMissing
Unnamed: 69 has 22 (4.5%) missing valuesMissing
Unnamed: 70 has 21 (4.3%) missing valuesMissing
Unnamed: 71 has 22 (4.5%) missing valuesMissing
Unnamed: 72 has 21 (4.3%) missing valuesMissing
Unnamed: 73 has 22 (4.5%) missing valuesMissing
Unnamed: 74 has 21 (4.3%) missing valuesMissing
Unnamed: 75 has 22 (4.5%) missing valuesMissing
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.3 has 20 (4.1%) missing valuesMissing
Unnamed: 77 has 22 (4.5%) missing valuesMissing
Unnamed: 78 has 21 (4.3%) missing valuesMissing
Unnamed: 79 has 22 (4.5%) missing valuesMissing
Unnamed: 80 has 21 (4.3%) missing valuesMissing
Unnamed: 81 has 22 (4.5%) missing valuesMissing
Unnamed: 82 has 21 (4.3%) missing valuesMissing
Unnamed: 83 has 22 (4.5%) missing valuesMissing
Unnamed: 84 has 394 (80.7%) missing valuesMissing
Unnamed: 85 has 23 (4.7%) missing valuesMissing
Unnamed: 86 has 395 (80.9%) missing valuesMissing
Unnamed: 87 has 22 (4.5%) missing valuesMissing
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 26 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 27 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 28 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 29 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 30 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 31 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 33 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 34 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 35 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 36 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 37 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 38 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 39 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 40 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 41 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 42 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 46 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 47 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 48 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 49 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 50 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 51 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 52 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 53 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 55 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 56 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 57 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 58 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 59 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 60 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 61 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 62 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 63 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 64 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 68 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 69 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 70 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 71 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 72 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 73 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 74 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 75 is an unsupported type, check if it needs cleaning or further analysisUnsupported
5. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 77 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 78 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 79 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 80 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 81 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 82 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 83 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 84 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 86 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 22:10:04.465122
Analysis finished2023-12-12 22:10:05.609652
Duration1.14 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 1
Text

MISSING 

Distinct98
Distinct (%)21.0%
Missing22
Missing (%)4.5%
Memory size3.9 KiB
2023-12-13T07:10:05.867953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length10.51073
Min length2

Characters and Unicode

Total characters4898
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)20.2%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row개 인 사 업 체
4th row회 사 법 인
5th row회 사 이 외 법 인
ValueCountFrequency (%)
372
16.9%
279
12.7%
279
12.7%
186
8.4%
186
8.4%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
Other values (200) 438
19.9%
2023-12-13T07:10:06.634577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
41.2%
376
 
7.7%
290
 
5.9%
279
 
5.7%
193
 
3.9%
192
 
3.9%
189
 
3.9%
120
 
2.4%
100
 
2.0%
97
 
2.0%
Other values (182) 1044
21.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2689
54.9%
Space Separator 2018
41.2%
Decimal Number 73
 
1.5%
Other Punctuation 38
 
0.8%
Lowercase Letter 23
 
0.5%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Math Symbol 17
 
0.3%
Uppercase Letter 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
c 2
8.7%
l 2
8.7%
n 2
8.7%
t 2
8.7%
d 1
 
4.3%
u 1
 
4.3%
r 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2689
54.9%
Common 2185
44.6%
Latin 24
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Common
ValueCountFrequency (%)
2018
92.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (8) 38
 
1.7%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
c 2
8.3%
l 2
8.3%
n 2
8.3%
t 2
8.3%
I 1
 
4.2%
d 1
 
4.2%
u 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2689
54.9%
ASCII 2207
45.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
91.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (20) 60
 
2.7%
Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB
Missing20
Missing (%)4.1%
Memory size3.9 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 21
Text

MISSING 

Distinct97
Distinct (%)20.9%
Missing23
Missing (%)4.7%
Memory size3.9 KiB
2023-12-13T07:10:06.978245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length14.268817
Min length2

Characters and Unicode

Total characters6635
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)20.0%

Sample

1st row전산업
2nd rowProprietorship
3rd rowIncorporated Co.
4th rowNon-biz Corp.
5th rowUnincorp. Assn.
ValueCountFrequency (%)
proprietorship 93
 
9.4%
corp 93
 
9.4%
unincorp 93
 
9.4%
assn 93
 
9.4%
incorporated 93
 
9.4%
co 93
 
9.4%
non-biz 93
 
9.4%
50
 
5.0%
제조업 21
 
2.1%
서비스업 14
 
1.4%
Other values (193) 257
25.9%
2023-12-13T07:10:07.483609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 744
 
11.2%
r 651
 
9.8%
529
 
8.0%
n 465
 
7.0%
p 465
 
7.0%
. 372
 
5.6%
i 372
 
5.6%
s 279
 
4.2%
t 186
 
2.8%
c 186
 
2.8%
Other values (188) 2386
36.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3999
60.3%
Other Letter 825
 
12.4%
Uppercase Letter 651
 
9.8%
Space Separator 529
 
8.0%
Other Punctuation 410
 
6.2%
Dash Punctuation 93
 
1.4%
Decimal Number 73
 
1.1%
Open Punctuation 19
 
0.3%
Close Punctuation 19
 
0.3%
Math Symbol 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Lowercase Letter
ValueCountFrequency (%)
o 744
18.6%
r 651
16.3%
n 465
11.6%
p 465
11.6%
i 372
9.3%
s 279
 
7.0%
t 186
 
4.7%
c 186
 
4.7%
e 186
 
4.7%
a 93
 
2.3%
Other values (4) 372
9.3%
Decimal Number
ValueCountFrequency (%)
6 10
13.7%
5 10
13.7%
4 9
12.3%
3 9
12.3%
0 7
9.6%
8 7
9.6%
7 7
9.6%
9 7
9.6%
1 5
6.8%
2 2
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
C 186
28.6%
I 93
14.3%
A 93
14.3%
P 93
14.3%
U 93
14.3%
N 93
14.3%
Other Punctuation
ValueCountFrequency (%)
. 372
90.7%
, 29
 
7.1%
; 7
 
1.7%
· 2
 
0.5%
Space Separator
ValueCountFrequency (%)
529
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4650
70.1%
Common 1160
 
17.5%
Hangul 825
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Latin
ValueCountFrequency (%)
o 744
16.0%
r 651
14.0%
n 465
10.0%
p 465
10.0%
i 372
 
8.0%
s 279
 
6.0%
t 186
 
4.0%
c 186
 
4.0%
e 186
 
4.0%
C 186
 
4.0%
Other values (10) 930
20.0%
Common
ValueCountFrequency (%)
529
45.6%
. 372
32.1%
- 93
 
8.0%
, 29
 
2.5%
( 19
 
1.6%
) 19
 
1.6%
~ 17
 
1.5%
6 10
 
0.9%
5 10
 
0.9%
4 9
 
0.8%
Other values (9) 53
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5808
87.5%
Hangul 825
 
12.4%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 744
12.8%
r 651
11.2%
529
 
9.1%
n 465
 
8.0%
p 465
 
8.0%
. 372
 
6.4%
i 372
 
6.4%
s 279
 
4.8%
t 186
 
3.2%
c 186
 
3.2%
Other values (28) 1559
26.8%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%
Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 23
Text

MISSING 

Distinct98
Distinct (%)21.0%
Missing22
Missing (%)4.5%
Memory size3.9 KiB
2023-12-13T07:10:08.021958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length10.51073
Min length2

Characters and Unicode

Total characters4898
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)20.2%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row개 인 사 업 체
4th row회 사 법 인
5th row회 사 이 외 법 인
ValueCountFrequency (%)
372
16.9%
279
12.7%
279
12.7%
186
8.4%
186
8.4%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
Other values (200) 438
19.9%
2023-12-13T07:10:08.575451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
41.2%
376
 
7.7%
290
 
5.9%
279
 
5.7%
193
 
3.9%
192
 
3.9%
189
 
3.9%
120
 
2.4%
100
 
2.0%
97
 
2.0%
Other values (182) 1044
21.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2689
54.9%
Space Separator 2018
41.2%
Decimal Number 73
 
1.5%
Other Punctuation 38
 
0.8%
Lowercase Letter 23
 
0.5%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Math Symbol 17
 
0.3%
Uppercase Letter 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
c 2
8.7%
l 2
8.7%
n 2
8.7%
t 2
8.7%
d 1
 
4.3%
u 1
 
4.3%
r 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2689
54.9%
Common 2185
44.6%
Latin 24
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Common
ValueCountFrequency (%)
2018
92.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (8) 38
 
1.7%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
c 2
8.3%
l 2
8.3%
n 2
8.3%
t 2
8.3%
I 1
 
4.2%
d 1
 
4.2%
u 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2689
54.9%
ASCII 2207
45.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
91.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (20) 60
 
2.7%
Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 25
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 26
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 27
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 28
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 29
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 30
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 31
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB
Missing20
Missing (%)4.1%
Memory size3.9 KiB

Unnamed: 33
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 34
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 35
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 36
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 37
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 38
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 39
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 40
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 41
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 42
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 43
Text

MISSING 

Distinct97
Distinct (%)20.9%
Missing23
Missing (%)4.7%
Memory size3.9 KiB
2023-12-13T07:10:08.999419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length14.268817
Min length2

Characters and Unicode

Total characters6635
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)20.0%

Sample

1st row전산업
2nd rowProprietorship
3rd rowIncorporated Co.
4th rowNon-biz Corp.
5th rowUnincorp. Assn.
ValueCountFrequency (%)
proprietorship 93
 
9.4%
corp 93
 
9.4%
unincorp 93
 
9.4%
assn 93
 
9.4%
incorporated 93
 
9.4%
co 93
 
9.4%
non-biz 93
 
9.4%
50
 
5.0%
제조업 21
 
2.1%
서비스업 14
 
1.4%
Other values (193) 257
25.9%
2023-12-13T07:10:09.603336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 744
 
11.2%
r 651
 
9.8%
529
 
8.0%
n 465
 
7.0%
p 465
 
7.0%
. 372
 
5.6%
i 372
 
5.6%
s 279
 
4.2%
t 186
 
2.8%
c 186
 
2.8%
Other values (188) 2386
36.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3999
60.3%
Other Letter 825
 
12.4%
Uppercase Letter 651
 
9.8%
Space Separator 529
 
8.0%
Other Punctuation 410
 
6.2%
Dash Punctuation 93
 
1.4%
Decimal Number 73
 
1.1%
Open Punctuation 19
 
0.3%
Close Punctuation 19
 
0.3%
Math Symbol 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Lowercase Letter
ValueCountFrequency (%)
o 744
18.6%
r 651
16.3%
n 465
11.6%
p 465
11.6%
i 372
9.3%
s 279
 
7.0%
t 186
 
4.7%
c 186
 
4.7%
e 186
 
4.7%
a 93
 
2.3%
Other values (4) 372
9.3%
Decimal Number
ValueCountFrequency (%)
6 10
13.7%
5 10
13.7%
4 9
12.3%
3 9
12.3%
0 7
9.6%
8 7
9.6%
7 7
9.6%
9 7
9.6%
1 5
6.8%
2 2
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
C 186
28.6%
I 93
14.3%
A 93
14.3%
P 93
14.3%
U 93
14.3%
N 93
14.3%
Other Punctuation
ValueCountFrequency (%)
. 372
90.7%
, 29
 
7.1%
; 7
 
1.7%
· 2
 
0.5%
Space Separator
ValueCountFrequency (%)
529
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4650
70.1%
Common 1160
 
17.5%
Hangul 825
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Latin
ValueCountFrequency (%)
o 744
16.0%
r 651
14.0%
n 465
10.0%
p 465
10.0%
i 372
 
8.0%
s 279
 
6.0%
t 186
 
4.0%
c 186
 
4.0%
e 186
 
4.0%
C 186
 
4.0%
Other values (10) 930
20.0%
Common
ValueCountFrequency (%)
529
45.6%
. 372
32.1%
- 93
 
8.0%
, 29
 
2.5%
( 19
 
1.6%
) 19
 
1.6%
~ 17
 
1.5%
6 10
 
0.9%
5 10
 
0.9%
4 9
 
0.8%
Other values (9) 53
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5808
87.5%
Hangul 825
 
12.4%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 744
12.8%
r 651
11.2%
529
 
9.1%
n 465
 
8.0%
p 465
 
8.0%
. 372
 
6.4%
i 372
 
6.4%
s 279
 
4.8%
t 186
 
3.2%
c 186
 
3.2%
Other values (28) 1559
26.8%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%
Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 45
Text

MISSING 

Distinct98
Distinct (%)21.0%
Missing22
Missing (%)4.5%
Memory size3.9 KiB
2023-12-13T07:10:10.039495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length10.51073
Min length2

Characters and Unicode

Total characters4898
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)20.2%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row개 인 사 업 체
4th row회 사 법 인
5th row회 사 이 외 법 인
ValueCountFrequency (%)
372
16.9%
279
12.7%
279
12.7%
186
8.4%
186
8.4%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
Other values (200) 438
19.9%
2023-12-13T07:10:10.547054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
41.2%
376
 
7.7%
290
 
5.9%
279
 
5.7%
193
 
3.9%
192
 
3.9%
189
 
3.9%
120
 
2.4%
100
 
2.0%
97
 
2.0%
Other values (182) 1044
21.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2689
54.9%
Space Separator 2018
41.2%
Decimal Number 73
 
1.5%
Other Punctuation 38
 
0.8%
Lowercase Letter 23
 
0.5%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Math Symbol 17
 
0.3%
Uppercase Letter 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
c 2
8.7%
l 2
8.7%
n 2
8.7%
t 2
8.7%
d 1
 
4.3%
u 1
 
4.3%
r 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2689
54.9%
Common 2185
44.6%
Latin 24
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Common
ValueCountFrequency (%)
2018
92.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (8) 38
 
1.7%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
c 2
8.3%
l 2
8.3%
n 2
8.3%
t 2
8.3%
I 1
 
4.2%
d 1
 
4.2%
u 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2689
54.9%
ASCII 2207
45.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
91.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (20) 60
 
2.7%
Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 46
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 47
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 48
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 49
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 50
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 51
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 52
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 53
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB
Missing20
Missing (%)4.1%
Memory size3.9 KiB

Unnamed: 55
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 56
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 57
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 58
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 59
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 60
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 61
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 62
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 63
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 64
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 65
Text

MISSING 

Distinct97
Distinct (%)20.9%
Missing23
Missing (%)4.7%
Memory size3.9 KiB
2023-12-13T07:10:10.849690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length14.268817
Min length2

Characters and Unicode

Total characters6635
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)20.0%

Sample

1st row전산업
2nd rowProprietorship
3rd rowIncorporated Co.
4th rowNon-biz Corp.
5th rowUnincorp. Assn.
ValueCountFrequency (%)
proprietorship 93
 
9.4%
corp 93
 
9.4%
unincorp 93
 
9.4%
assn 93
 
9.4%
incorporated 93
 
9.4%
co 93
 
9.4%
non-biz 93
 
9.4%
50
 
5.0%
제조업 21
 
2.1%
서비스업 14
 
1.4%
Other values (193) 257
25.9%
2023-12-13T07:10:11.293182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 744
 
11.2%
r 651
 
9.8%
529
 
8.0%
n 465
 
7.0%
p 465
 
7.0%
. 372
 
5.6%
i 372
 
5.6%
s 279
 
4.2%
t 186
 
2.8%
c 186
 
2.8%
Other values (188) 2386
36.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3999
60.3%
Other Letter 825
 
12.4%
Uppercase Letter 651
 
9.8%
Space Separator 529
 
8.0%
Other Punctuation 410
 
6.2%
Dash Punctuation 93
 
1.4%
Decimal Number 73
 
1.1%
Open Punctuation 19
 
0.3%
Close Punctuation 19
 
0.3%
Math Symbol 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Lowercase Letter
ValueCountFrequency (%)
o 744
18.6%
r 651
16.3%
n 465
11.6%
p 465
11.6%
i 372
9.3%
s 279
 
7.0%
t 186
 
4.7%
c 186
 
4.7%
e 186
 
4.7%
a 93
 
2.3%
Other values (4) 372
9.3%
Decimal Number
ValueCountFrequency (%)
6 10
13.7%
5 10
13.7%
4 9
12.3%
3 9
12.3%
0 7
9.6%
8 7
9.6%
7 7
9.6%
9 7
9.6%
1 5
6.8%
2 2
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
C 186
28.6%
I 93
14.3%
A 93
14.3%
P 93
14.3%
U 93
14.3%
N 93
14.3%
Other Punctuation
ValueCountFrequency (%)
. 372
90.7%
, 29
 
7.1%
; 7
 
1.7%
· 2
 
0.5%
Space Separator
ValueCountFrequency (%)
529
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4650
70.1%
Common 1160
 
17.5%
Hangul 825
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Latin
ValueCountFrequency (%)
o 744
16.0%
r 651
14.0%
n 465
10.0%
p 465
10.0%
i 372
 
8.0%
s 279
 
6.0%
t 186
 
4.0%
c 186
 
4.0%
e 186
 
4.0%
C 186
 
4.0%
Other values (10) 930
20.0%
Common
ValueCountFrequency (%)
529
45.6%
. 372
32.1%
- 93
 
8.0%
, 29
 
2.5%
( 19
 
1.6%
) 19
 
1.6%
~ 17
 
1.5%
6 10
 
0.9%
5 10
 
0.9%
4 9
 
0.8%
Other values (9) 53
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5808
87.5%
Hangul 825
 
12.4%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 744
12.8%
r 651
11.2%
529
 
9.1%
n 465
 
8.0%
p 465
 
8.0%
. 372
 
6.4%
i 372
 
6.4%
s 279
 
4.8%
t 186
 
3.2%
c 186
 
3.2%
Other values (28) 1559
26.8%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%
Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 67
Text

MISSING 

Distinct98
Distinct (%)21.0%
Missing22
Missing (%)4.5%
Memory size3.9 KiB
2023-12-13T07:10:11.633788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length10.51073
Min length2

Characters and Unicode

Total characters4898
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)20.2%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row개 인 사 업 체
4th row회 사 법 인
5th row회 사 이 외 법 인
ValueCountFrequency (%)
372
16.9%
279
12.7%
279
12.7%
186
8.4%
186
8.4%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
Other values (200) 438
19.9%
2023-12-13T07:10:12.114087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
41.2%
376
 
7.7%
290
 
5.9%
279
 
5.7%
193
 
3.9%
192
 
3.9%
189
 
3.9%
120
 
2.4%
100
 
2.0%
97
 
2.0%
Other values (182) 1044
21.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2689
54.9%
Space Separator 2018
41.2%
Decimal Number 73
 
1.5%
Other Punctuation 38
 
0.8%
Lowercase Letter 23
 
0.5%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Math Symbol 17
 
0.3%
Uppercase Letter 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
c 2
8.7%
l 2
8.7%
n 2
8.7%
t 2
8.7%
d 1
 
4.3%
u 1
 
4.3%
r 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2689
54.9%
Common 2185
44.6%
Latin 24
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Common
ValueCountFrequency (%)
2018
92.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (8) 38
 
1.7%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
c 2
8.3%
l 2
8.3%
n 2
8.3%
t 2
8.3%
I 1
 
4.2%
d 1
 
4.2%
u 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2689
54.9%
ASCII 2207
45.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
91.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (20) 60
 
2.7%
Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 68
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 69
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 70
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 71
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 72
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 73
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 74
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 75
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB
Missing20
Missing (%)4.1%
Memory size3.9 KiB

Unnamed: 77
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 78
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 79
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 80
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 81
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 82
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)4.3%
Memory size3.9 KiB

Unnamed: 83
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)4.5%
Memory size3.9 KiB

Unnamed: 84
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing394
Missing (%)80.7%
Memory size3.9 KiB

Unnamed: 85
Text

MISSING 

Distinct97
Distinct (%)20.9%
Missing23
Missing (%)4.7%
Memory size3.9 KiB
2023-12-13T07:10:12.516267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length14.268817
Min length2

Characters and Unicode

Total characters6635
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)20.0%

Sample

1st row전산업
2nd rowProprietorship
3rd rowIncorporated Co.
4th rowNon-biz Corp.
5th rowUnincorp. Assn.
ValueCountFrequency (%)
proprietorship 93
 
9.4%
corp 93
 
9.4%
unincorp 93
 
9.4%
assn 93
 
9.4%
incorporated 93
 
9.4%
co 93
 
9.4%
non-biz 93
 
9.4%
50
 
5.0%
제조업 21
 
2.1%
서비스업 14
 
1.4%
Other values (193) 257
25.9%
2023-12-13T07:10:13.053084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 744
 
11.2%
r 651
 
9.8%
529
 
8.0%
n 465
 
7.0%
p 465
 
7.0%
. 372
 
5.6%
i 372
 
5.6%
s 279
 
4.2%
t 186
 
2.8%
c 186
 
2.8%
Other values (188) 2386
36.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3999
60.3%
Other Letter 825
 
12.4%
Uppercase Letter 651
 
9.8%
Space Separator 529
 
8.0%
Other Punctuation 410
 
6.2%
Dash Punctuation 93
 
1.4%
Decimal Number 73
 
1.1%
Open Punctuation 19
 
0.3%
Close Punctuation 19
 
0.3%
Math Symbol 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Lowercase Letter
ValueCountFrequency (%)
o 744
18.6%
r 651
16.3%
n 465
11.6%
p 465
11.6%
i 372
9.3%
s 279
 
7.0%
t 186
 
4.7%
c 186
 
4.7%
e 186
 
4.7%
a 93
 
2.3%
Other values (4) 372
9.3%
Decimal Number
ValueCountFrequency (%)
6 10
13.7%
5 10
13.7%
4 9
12.3%
3 9
12.3%
0 7
9.6%
8 7
9.6%
7 7
9.6%
9 7
9.6%
1 5
6.8%
2 2
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
C 186
28.6%
I 93
14.3%
A 93
14.3%
P 93
14.3%
U 93
14.3%
N 93
14.3%
Other Punctuation
ValueCountFrequency (%)
. 372
90.7%
, 29
 
7.1%
; 7
 
1.7%
· 2
 
0.5%
Space Separator
ValueCountFrequency (%)
529
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4650
70.1%
Common 1160
 
17.5%
Hangul 825
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Latin
ValueCountFrequency (%)
o 744
16.0%
r 651
14.0%
n 465
10.0%
p 465
10.0%
i 372
 
8.0%
s 279
 
6.0%
t 186
 
4.0%
c 186
 
4.0%
e 186
 
4.0%
C 186
 
4.0%
Other values (10) 930
20.0%
Common
ValueCountFrequency (%)
529
45.6%
. 372
32.1%
- 93
 
8.0%
, 29
 
2.5%
( 19
 
1.6%
) 19
 
1.6%
~ 17
 
1.5%
6 10
 
0.9%
5 10
 
0.9%
4 9
 
0.8%
Other values (9) 53
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5808
87.5%
Hangul 825
 
12.4%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 744
12.8%
r 651
11.2%
529
 
9.1%
n 465
 
8.0%
p 465
 
8.0%
. 372
 
6.4%
i 372
 
6.4%
s 279
 
4.8%
t 186
 
3.2%
c 186
 
3.2%
Other values (28) 1559
26.8%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 86
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing395
Missing (%)80.9%
Memory size3.9 KiB

Unnamed: 87
Text

MISSING 

Distinct98
Distinct (%)21.0%
Missing22
Missing (%)4.5%
Memory size3.9 KiB
2023-12-13T07:10:13.472434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length10.51073
Min length2

Characters and Unicode

Total characters4898
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)20.2%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row개 인 사 업 체
4th row회 사 법 인
5th row회 사 이 외 법 인
ValueCountFrequency (%)
372
16.9%
279
12.7%
279
12.7%
186
8.4%
186
8.4%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
93
 
4.2%
Other values (200) 438
19.9%
2023-12-13T07:10:14.038923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
41.2%
376
 
7.7%
290
 
5.9%
279
 
5.7%
193
 
3.9%
192
 
3.9%
189
 
3.9%
120
 
2.4%
100
 
2.0%
97
 
2.0%
Other values (182) 1044
21.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2689
54.9%
Space Separator 2018
41.2%
Decimal Number 73
 
1.5%
Other Punctuation 38
 
0.8%
Lowercase Letter 23
 
0.5%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Math Symbol 17
 
0.3%
Uppercase Letter 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
c 2
8.7%
l 2
8.7%
n 2
8.7%
t 2
8.7%
d 1
 
4.3%
u 1
 
4.3%
r 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2689
54.9%
Common 2185
44.6%
Latin 24
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
Common
ValueCountFrequency (%)
2018
92.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (8) 38
 
1.7%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
c 2
8.3%
l 2
8.3%
n 2
8.3%
t 2
8.3%
I 1
 
4.2%
d 1
 
4.2%
u 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2689
54.9%
ASCII 2207
45.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
91.4%
, 29
 
1.3%
) 19
 
0.9%
( 19
 
0.9%
~ 17
 
0.8%
5 10
 
0.5%
6 10
 
0.5%
4 9
 
0.4%
3 9
 
0.4%
; 7
 
0.3%
Other values (20) 60
 
2.7%
Hangul
ValueCountFrequency (%)
376
14.0%
290
 
10.8%
279
 
10.4%
193
 
7.2%
192
 
7.1%
189
 
7.0%
120
 
4.5%
100
 
3.7%
97
 
3.6%
96
 
3.6%
Other values (151) 757
28.2%
None
ValueCountFrequency (%)
· 2
100.0%

Sample

5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 95. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 215. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.1Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 30Unnamed: 315. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.1Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 41Unnamed: 42Unnamed: 435. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.2Unnamed: 45Unnamed: 46Unnamed: 47Unnamed: 48Unnamed: 49Unnamed: 50Unnamed: 51Unnamed: 52Unnamed: 535. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.2Unnamed: 55Unnamed: 56Unnamed: 57Unnamed: 58Unnamed: 59Unnamed: 60Unnamed: 61Unnamed: 62Unnamed: 63Unnamed: 64Unnamed: 655. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.3Unnamed: 67Unnamed: 68Unnamed: 69Unnamed: 70Unnamed: 71Unnamed: 72Unnamed: 73Unnamed: 74Unnamed: 755. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.3Unnamed: 77Unnamed: 78Unnamed: 79Unnamed: 80Unnamed: 81Unnamed: 82Unnamed: 83Unnamed: 84Unnamed: 85Unnamed: 86Unnamed: 87
0단위 : 개, 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In each, PersonNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 개, 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In each, PersonNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 개, 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In each, PersonNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 개, 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : In each, PersonNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>
1NaN산업분류 Industrial classification용인시\nYongin-siNaN처인구\nCheoin-guNaN포곡읍\nPogok-eupNaN모현면\nMohyeon-myeonNaN남사면\nNamsa-myeonNaN이동면\nIdong-myeonNaN원삼면\nWonsam-myeonNaN백암면\nBaegam-myeonNaN양지면\nYangji-myeonNaN산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification중앙동\nJungang-dongNaN역삼동\nYeoksam-dongNaN유림동\nYurim-dongNaN동부동\nDongbu-dongNaN기흥구\nGiheung-guNaN구갈동\nGugal-dongNaN상갈동\nSanggal-dongNaN기흥동\nGiheung-dongNaN서농동\nSeonong-dongNaN산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification구성동\nGuseong-dongNaN마북동\nMabuk-dongNaN동백동\nDongbaek-dongNaN보정동\nBojeong-dongNaN상하동\nSangha-dongNaN신갈동\nSingal-dongNaN영덕동\nYeongdeok-dongNaN수지구\nSuji-guNaN풍덕천1동\nPungdoekcheon 1(il)-dongNaN산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification풍덕천2동\nPungdoekcheon 2(i)-dongNaN신봉동\nSinbong-dongNaN죽전1동\nJukjeon 1(il)-dongNaN죽전2동\nJukjeon 2(i)-dongNaN동천동\nDongcheon-dongNaN상현1동\nSanghyeon 1(il)-dongNaN상현2동\nSanghyeon 2(i)-dongNaN성복동\nSeongbok-dongNaN산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification
2NaN<NA>사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkersNaN<NA>NaN<NA>사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkersNaN<NA>NaN<NA>사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkersNaN<NA>NaN<NA>사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkers사업체수\nEstab.종사자수\nWorkersNaN<NA>NaN<NA>
3NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>
4TT전산업39 925248 45915 27589 8651 81011 7141 8519 4017679 6951 2478 3116184 4968365 1671 5118 907TT전산업TT전산업2 88812 8451 3097 9581 3116 3771 1274 99414 322108 1731 87710 2981 6739 0947827 82950223 036TT전산업TT전산업8313 8588989 5382 0919 6021 69710 7067164 2321 6117 0791 64412 90110 32850 4211 8176 578TT전산업TT전산업1 1556 5078013 5722 01211 8071 0105 0581 2527 1936352 6408653 0657814 001TT전산업TT전산업
5NaN개 인 사 업 체32 15988 99312 25731 4961 5263 9981 4644 2904861 7149592 6914018896331 5641 1682 805NaNProprietorshipNaN개 인 사 업 체2 5196 4351 0812 5831 0912 4059292 12211 33332 1951 4763 8891 3433 6345241 4463831 010NaNProprietorshipNaN개 인 사 업 체7241 8337342 0631 7154 9871 3394 1595861 6051 3553 7691 1543 8008 56925 3021 6064 132NaNProprietorshipNaN개 인 사 업 체9573 3656792 0711 5884 4427922 5539913 0385331 5097772 1886462 004NaNProprietorshipNaN개 인 사 업 체
6NaN회 사 법 인5 224118 9592 11545 0932026 6113134 2172317 5212234 9871523 0651252 7642635 369NaNIncorporated Co.NaN회 사 법 인2263 9371131 6431413 0861261 8932 06459 7822583 9532163 8302116 0208619 840NaNIncorporated Co.NaN회 사 법 인46981946 1952372 8362764 754781 2211512 0564118 0961 04514 0841211 346NaNIncorporated Co.NaN회 사 법 인1211 327697132293 6771681 8311613 0784030950456861 347NaNIncorporated Co.NaN회 사 법 인
7NaN회 사 이 외 법 인94433 75741911 756401 013348012942935559354684777231539NaNNon-biz Corp.NaN회 사 이 외 법 인702 168583 588176202379931413 470412 026361 28317278192 108NaNNon-biz Corp.NaN회 사 이 외 법 인18818241 018431 358351 554211 246401 048207332118 53127801NaNNon-biz Corp.NaN회 사 이 외 법 인301 56019538553 159124632667819618823915475NaNNon-biz Corp.NaN회 사 이 외 법 인
8NaN비 법 인 단 체1 5986 7504841 52042924093213130743074316749194NaNUnincorp. Assn.NaN비 법 인 단 체733055714462266491806112 7261024307834730851478NaNUnincorp. Assn.NaN비 법 인 단 체432264626296421472393116065206592725032 50463299NaNUnincorp. Assn.NaN비 법 인 단 체47255342501405293821174399432043018234175NaNUnincorp. Assn.NaN비 법 인 단 체
9NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>
5. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 95. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 215. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.1Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 30Unnamed: 315. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.1Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 41Unnamed: 42Unnamed: 435. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.2Unnamed: 45Unnamed: 46Unnamed: 47Unnamed: 48Unnamed: 49Unnamed: 50Unnamed: 51Unnamed: 52Unnamed: 535. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.2Unnamed: 55Unnamed: 56Unnamed: 57Unnamed: 58Unnamed: 59Unnamed: 60Unnamed: 61Unnamed: 62Unnamed: 63Unnamed: 64Unnamed: 655. 산업중분류 , 조직형태 및 읍면동별 사업체수, 종사자수.3Unnamed: 67Unnamed: 68Unnamed: 69Unnamed: 70Unnamed: 71Unnamed: 72Unnamed: 73Unnamed: 74Unnamed: 755. Number of Establishments, Workers by Industrial Divisions, Type of Legal Organization & Provinces.3Unnamed: 77Unnamed: 78Unnamed: 79Unnamed: 80Unnamed: 81Unnamed: 82Unnamed: 83Unnamed: 84Unnamed: 85Unnamed: 86Unnamed: 87
47895수리업1 1303 7654301 15844945613617802872154725364717695수리업95수리업6415347874718840894201 79634724915724176112095수리업95수리업183434895013760377266659194554742808116916695수리업95수리업173323503864371173620416641732278195수리업95수리업
479NaN개 인 사 업 체1 0252 12840382742845411716432755144525364195NaNProprietorshipNaN개 인 사 업 체61984686418936793647903154418618411120NaNProprietorshipNaN개 인 사 업 체18343266459349112256254123409925851164114NaNProprietorshipNaN개 인 사 업 체17332247376234853063154616222339NaNProprietorshipNaN개 인 사 업 체
480NaN회 사 법 인1021 62925327210219137117----681NaNIncorporated Co.NaN회 사 법 인3551169938561 0063188716135--NaNIncorporated Co.NaN회 사 법 인--22354411265145711537521296552NaNIncorporated Co.NaN회 사 법 인--13123325137118110442NaNIncorporated Co.NaN회 사 법 인
481NaN회 사 이 외 법 인3824--------12----NaNNon-biz Corp.NaN회 사 이 외 법 인------12----------NaNNon-biz Corp.NaN회 사 이 외 법 인--------------14--NaNNon-biz Corp.NaN회 사 이 외 법 인--------14------NaNNon-biz Corp.NaN회 사 이 외 법 인
482NaN비 법 인 단 체------------------NaNUnincorp. Assn.NaN비 법 인 단 체------------------NaNUnincorp. Assn.NaN비 법 인 단 체------------------NaNUnincorp. Assn.NaN비 법 인 단 체----------------NaNUnincorp. Assn.NaN비 법 인 단 체
48396기타 개인 서비스업2 2874 9636601 40395146441551347568714173156398096기타 개인 서비스업96기타 개인 서비스업17451873116709451878911 9211172491032002840233796기타 개인 서비스업96기타 개인 서비스업73115721441443131092923458114275741987361 63911222396기타 개인 서비스업96기타 개인 서비스업8019069145140271461737719855115811867613896기타 개인 서비스업96기타 개인 서비스업
484NaN개 인 사 업 체2 2494 6186511 28995146428912435480141730523761NaNProprietorshipNaN개 인 사 업 체17350473116709451878711 74711523610219828402337NaNProprietorshipNaN개 인 사 업 체70109711311422901032463458113254701487271 582111222NaNProprietorshipNaN개 인 사 업 체80190671381392524315576193551158118675131NaNProprietorshipNaN개 인 사 업 체
485NaN회 사 법 인30237429----1413----18NaNIncorporated Co.NaN회 사 법 인114------1815221312----NaNIncorporated Co.NaN회 사 법 인23113223527--121450856--NaNIncorporated Co.NaN회 사 법 인--2711931815----17NaNIncorporated Co.NaN회 사 법 인
486NaN회 사 이 외 법 인593474--266--14--14--NaNNon-biz Corp.NaN회 사 이 외 법 인--------119--------NaNNon-biz Corp.NaN회 사 이 외 법 인------119----------NaNNon-biz Corp.NaN회 사 이 외 법 인----------------NaNNon-biz Corp.NaN회 사 이 외 법 인
487NaN비 법 인 단 체315111------------111NaNUnincorp. Assn.NaN비 법 인 단 체--------13--------NaNUnincorp. Assn.NaN비 법 인 단 체13------------1111NaNUnincorp. Assn.NaN비 법 인 단 체----------------NaNUnincorp. Assn.NaN비 법 인 단 체

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 21Unnamed: 23Unnamed: 43Unnamed: 45Unnamed: 65Unnamed: 67Unnamed: 85Unnamed: 87# duplicates
0개 인 사 업 체Proprietorship개 인 사 업 체Proprietorship개 인 사 업 체Proprietorship개 인 사 업 체Proprietorship개 인 사 업 체93
1비 법 인 단 체Unincorp. Assn.비 법 인 단 체Unincorp. Assn.비 법 인 단 체Unincorp. Assn.비 법 인 단 체Unincorp. Assn.비 법 인 단 체93
2회 사 법 인Incorporated Co.회 사 법 인Incorporated Co.회 사 법 인Incorporated Co.회 사 법 인Incorporated Co.회 사 법 인93
3회 사 이 외 법 인Non-biz Corp.회 사 이 외 법 인Non-biz Corp.회 사 이 외 법 인Non-biz Corp.회 사 이 외 법 인Non-biz Corp.회 사 이 외 법 인93
4<NA><NA><NA><NA><NA><NA><NA><NA><NA>22