Overview

Dataset statistics

Number of variables: 43
Number of observations: 301
Missing cells: 1647
Missing cells (%): 12.7%
Duplicate rows: 3
Duplicate rows (%): 1.0%
Total size in memory: 101.2 KiB
Average record size in memory: 344.4 B
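
The percentages and the average record size follow from the raw counts. A minimal Python sketch of that arithmetic, using only the figures reported in this section (the variable names are illustrative, and the memory total is itself rounded, so the per-record result is approximate):

```python
# Figures copied from the dataset statistics in this report.
n_variables = 43
n_observations = 301
missing_cells = 1647
duplicate_rows = 3
total_size_kib = 101.2  # already rounded in the report

total_cells = n_variables * n_observations             # every cell in the table
missing_pct = 100 * missing_cells / total_cells        # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations  # share of duplicate rows
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```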

Variable types

Unsupported: 39
Text: 4

Dataset

Description: Number of workers by industrial division, sex of workers, and eup/myeon/dong (2013), from the Census on Establishments.
Author: Yongin-si, Gyeonggi-do
URL: https://www.data.go.kr/data/15053400/fileData.do

Alerts

Dataset has 3 (1.0%) duplicate rowsDuplicates
8. 산업중분류, 종사자남여 및 읍면동별 종사자수 has 207 (68.8%) missing valuesMissing
Unnamed: 1 has 21 (7.0%) missing valuesMissing
Unnamed: 2 has 21 (7.0%) missing valuesMissing
Unnamed: 3 has 21 (7.0%) missing valuesMissing
Unnamed: 4 has 21 (7.0%) missing valuesMissing
Unnamed: 5 has 21 (7.0%) missing valuesMissing
Unnamed: 6 has 21 (7.0%) missing valuesMissing
Unnamed: 7 has 21 (7.0%) missing valuesMissing
Unnamed: 8 has 21 (7.0%) missing valuesMissing
Unnamed: 9 has 21 (7.0%) missing valuesMissing
8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces has 20 (6.6%) missing valuesMissing
Unnamed: 11 has 21 (7.0%) missing valuesMissing
Unnamed: 12 has 21 (7.0%) missing valuesMissing
Unnamed: 13 has 21 (7.0%) missing valuesMissing
Unnamed: 14 has 21 (7.0%) missing valuesMissing
Unnamed: 15 has 21 (7.0%) missing valuesMissing
Unnamed: 16 has 21 (7.0%) missing valuesMissing
Unnamed: 17 has 21 (7.0%) missing valuesMissing
Unnamed: 18 has 21 (7.0%) missing valuesMissing
Unnamed: 19 has 207 (68.8%) missing valuesMissing
Unnamed: 20 has 22 (7.3%) missing valuesMissing
8. 산업중분류, 종사자남여 및 읍면동별 종사자수.1 has 207 (68.8%) missing valuesMissing
Unnamed: 22 has 21 (7.0%) missing valuesMissing
Unnamed: 23 has 21 (7.0%) missing valuesMissing
Unnamed: 24 has 21 (7.0%) missing valuesMissing
Unnamed: 25 has 21 (7.0%) missing valuesMissing
Unnamed: 26 has 21 (7.0%) missing valuesMissing
Unnamed: 27 has 21 (7.0%) missing valuesMissing
Unnamed: 28 has 21 (7.0%) missing valuesMissing
Unnamed: 29 has 21 (7.0%) missing valuesMissing
Unnamed: 30 has 21 (7.0%) missing valuesMissing
8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces.1 has 20 (6.6%) missing valuesMissing
Unnamed: 32 has 21 (7.0%) missing valuesMissing
Unnamed: 33 has 21 (7.0%) missing valuesMissing
Unnamed: 34 has 21 (7.0%) missing valuesMissing
Unnamed: 35 has 21 (7.0%) missing valuesMissing
Unnamed: 36 has 21 (7.0%) missing valuesMissing
Unnamed: 37 has 21 (7.0%) missing valuesMissing
Unnamed: 38 has 21 (7.0%) missing valuesMissing
Unnamed: 39 has 21 (7.0%) missing valuesMissing
Unnamed: 40 has 21 (7.0%) missing valuesMissing
Unnamed: 41 has 207 (68.8%) missing valuesMissing
Unnamed: 42 has 22 (7.3%) missing valuesMissing
8. 산업중분류, 종사자남여 및 읍면동별 종사자수 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
8. 산업중분류, 종사자남여 및 읍면동별 종사자수.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 26 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 27 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 28 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 29 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 30 is an unsupported type, check if it needs cleaning or further analysisUnsupported
8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces.1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 32 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 33 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 34 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 35 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 36 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 37 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 38 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 39 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 40 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 41 is an unsupported type, check if it needs cleaning or further analysisUnsupported
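
The many `Unnamed: N` columns suggest that the file's real header labels sit in a data row rather than in the first line. A minimal plain-Python sketch of promoting such a row to column names follows. The miniature input, the `header_row` position, and the helper name `load_with_real_header` are all assumptions made for illustration; the actual file layout should be checked before reuse.

```python
import csv
import io

# Hypothetical miniature of the layout: a title row, a unit row, the real
# header row (Korean and romanised names split by a newline), a blank
# spacer row, then data.
RAW = (
    '"8. title",,\n'
    '"단위 : 명",,\n'
    ',"산업분류 Industrial classification","용인시\nYongin-si"\n'
    ',,\n'
    'TT,"전산업","248 459"\n'
)

def load_with_real_header(text, header_row=2):
    """Use row `header_row` as column names and drop fully empty rows."""
    rows = list(csv.reader(io.StringIO(text, newline="")))
    # Keep only the last line of labels such as "용인시\nYongin-si";
    # fall back to a positional name when the header cell is empty.
    names = [cell.split("\n")[-1].strip() or f"col_{i}"
             for i, cell in enumerate(rows[header_row])]
    data = [r for r in rows[header_row + 1:] if any(c.strip() for c in r)]
    return [dict(zip(names, r)) for r in data]

records = load_with_real_header(RAW)
print(records)
```

With real headers in place, each column holds one kind of value, which is usually a precondition for the profiler to infer a supported type.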

Reproduction

Analysis started: 2023-12-12 22:04:24.881423
Analysis finished: 2023-12-12 22:04:25.385373
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

8. 산업중분류, 종사자남여 및 읍면동별 종사자수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 1
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
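
As a rough illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation; the helper name `category_counts` is hypothetical, and the input is one of the sample values listed for this variable.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') per character."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "농업, 임업 및 어업 (01 ~ 03)"  # one sample value of this variable
counts = category_counts(sample)
for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes map to the names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, `Po` is Other Punctuation, `Ps` and `Pe` are Open and Close Punctuation, and `Sm` is Math Symbol.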

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
Value | Count | Frequency (%)
남자 | 93 | 17.5%
여자 | 93 | 17.5%
(value not shown) | 50 | 9.4%
제조업 | 21 | 4.0%
서비스업 | 14 | 2.6%
(value not shown) | 12 | 2.3%
기타 | 7 | 1.3%
제외 | 5 | 0.9%
광업 | 4 | 0.8%
자동차 | 3 | 0.6%
Other values (191) | 229 | 43.1%

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
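Memory size2.5 KiB

8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing20
Missing (%)6.6%
Memory size2.5 KiB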

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 20
Text

MISSING 

Distinct95
Distinct (%)34.1%
Missing22
Missing (%)7.3%
Memory size2.5 KiB

Length

Max length35
Median length32
Mean length7.781362
Min length2

Characters and Unicode

Total characters2171
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?

Unique

Unique93 ?
Unique (%)33.3%

Sample

1st row전산업
2nd rowMale
3rd rowFemale
4th row농업, 임업 및 어업 (01 ~ 03)
5th rowMale
Value | Count | Frequency (%)
male | 93 | 17.6%
female | 93 | 17.6%
(value not shown) | 50 | 9.5%
제조업 | 21 | 4.0%
서비스업 | 14 | 2.7%
(value not shown) | 12 | 2.3%
기타 | 7 | 1.3%
제외 | 5 | 0.9%
광업 | 4 | 0.8%
운송업 | 3 | 0.6%
Other values (188) | 226 | 42.8%

Most occurring characters

ValueCountFrequency (%)
e 279
 
12.9%
250
 
11.5%
l 186
 
8.6%
a 186
 
8.6%
99
 
4.6%
M 93
 
4.3%
F 93
 
4.3%
m 93
 
4.3%
50
 
2.3%
44
 
2.0%
Other values (172) 798
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 825
38.0%
Lowercase Letter 744
34.3%
Space Separator 250
 
11.5%
Uppercase Letter 186
 
8.6%
Decimal Number 73
 
3.4%
Other Punctuation 38
 
1.8%
Open Punctuation 19
 
0.9%
Close Punctuation 19
 
0.9%
Math Symbol 17
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
7 7
9.6%
0 7
9.6%
9 7
9.6%
8 7
9.6%
1 5
6.8%
2 2
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 279
37.5%
l 186
25.0%
a 186
25.0%
m 93
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 93
50.0%
F 93
50.0%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 930
42.8%
Hangul 825
38.0%
Common 416
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Common
ValueCountFrequency (%)
250
60.1%
, 29
 
7.0%
( 19
 
4.6%
) 19
 
4.6%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
7 7
 
1.7%
Other values (7) 37
 
8.9%
Latin
ValueCountFrequency (%)
e 279
30.0%
l 186
20.0%
a 186
20.0%
M 93
 
10.0%
F 93
 
10.0%
m 93
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
61.9%
Hangul 825
38.0%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 279
20.8%
250
18.6%
l 186
13.8%
a 186
13.8%
M 93
 
6.9%
F 93
 
6.9%
m 93
 
6.9%
, 29
 
2.2%
( 19
 
1.4%
) 19
 
1.4%
Other values (12) 97
 
7.2%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

8. 산업중분류, 종사자남여 및 읍면동별 종사자수.1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 22
Text

MISSING 

Distinct96
Distinct (%)34.3%
Missing21
Missing (%)7.0%
Memory size2.5 KiB

Length

Max length35
Median length2
Mean length5.8678571
Min length2

Characters and Unicode

Total characters1643
Distinct characters192
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?

Unique

Unique94 ?
Unique (%)33.6%

Sample

1st row산업분류 Industrial classification
2nd row전산업
3rd row남자
4th row여자
5th row농업, 임업 및 어업 (01 ~ 03)
Value | Count | Frequency (%)
남자 | 93 | 17.5%
여자 | 93 | 17.5%
(value not shown) | 50 | 9.4%
제조업 | 21 | 4.0%
서비스업 | 14 | 2.6%
(value not shown) | 12 | 2.3%
기타 | 7 | 1.3%
제외 | 5 | 0.9%
광업 | 4 | 0.8%
자동차 | 3 | 0.6%
Other values (191) | 229 | 43.1%

Most occurring characters

ValueCountFrequency (%)
251
 
15.3%
190
 
11.6%
100
 
6.1%
95
 
5.8%
93
 
5.7%
50
 
3.0%
44
 
2.7%
29
 
1.8%
, 29
 
1.8%
27
 
1.6%
Other values (182) 735
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1201
73.1%
Space Separator 251
 
15.3%
Decimal Number 73
 
4.4%
Other Punctuation 38
 
2.3%
Lowercase Letter 23
 
1.4%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Math Symbol 17
 
1.0%
Control 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Lowercase Letter
ValueCountFrequency (%)
i 4
17.4%
s 3
13.0%
a 3
13.0%
n 2
8.7%
t 2
8.7%
l 2
8.7%
c 2
8.7%
r 1
 
4.3%
d 1
 
4.3%
u 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
9 7
9.6%
7 7
9.6%
8 7
9.6%
0 7
9.6%
1 5
6.8%
2 2
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Space Separator
ValueCountFrequency (%)
251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1201
73.1%
Common 418
 
25.4%
Latin 24
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
Common
ValueCountFrequency (%)
251
60.0%
, 29
 
6.9%
( 19
 
4.5%
) 19
 
4.5%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
9 7
 
1.7%
Other values (8) 38
 
9.1%
Latin
ValueCountFrequency (%)
i 4
16.7%
s 3
12.5%
a 3
12.5%
n 2
8.3%
t 2
8.3%
l 2
8.3%
c 2
8.3%
r 1
 
4.2%
I 1
 
4.2%
d 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1201
73.1%
ASCII 440
 
26.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
57.0%
, 29
 
6.6%
( 19
 
4.3%
) 19
 
4.3%
~ 17
 
3.9%
5 10
 
2.3%
6 10
 
2.3%
4 9
 
2.0%
3 9
 
2.0%
9 7
 
1.6%
Other values (20) 60
 
13.6%
Hangul
ValueCountFrequency (%)
190
 
15.8%
100
 
8.3%
95
 
7.9%
93
 
7.7%
50
 
4.2%
44
 
3.7%
29
 
2.4%
27
 
2.2%
26
 
2.2%
25
 
2.1%
Other values (151) 522
43.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 23
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 25
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 26
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 27
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 28
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 29
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 30
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
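Memory size2.5 KiB

8. Number of Workers by Industrial Divisions, Sex of Workers & Provinces.1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing20
Missing (%)6.6%
Memory size2.5 KiB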

Unnamed: 32
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 33
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 34
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 35
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 36
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 37
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 38
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 39
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 40
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)7.0%
Memory size2.5 KiB

Unnamed: 41
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)68.8%
Memory size2.5 KiB

Unnamed: 42
Text

MISSING 

Distinct95
Distinct (%)34.1%
Missing22
Missing (%)7.3%
Memory size2.5 KiB

Length

Max length35
Median length32
Mean length7.781362
Min length2

Characters and Unicode

Total characters2171
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?

Unique

Unique93 ?
Unique (%)33.3%

Sample

1st row전산업
2nd rowMale
3rd rowFemale
4th row농업, 임업 및 어업 (01 ~ 03)
5th rowMale
Value | Count | Frequency (%)
male | 93 | 17.6%
female | 93 | 17.6%
(value not shown) | 50 | 9.5%
제조업 | 21 | 4.0%
서비스업 | 14 | 2.7%
(value not shown) | 12 | 2.3%
기타 | 7 | 1.3%
제외 | 5 | 0.9%
광업 | 4 | 0.8%
운송업 | 3 | 0.6%
Other values (188) | 226 | 42.8%

Most occurring characters

ValueCountFrequency (%)
e 279
 
12.9%
250
 
11.5%
l 186
 
8.6%
a 186
 
8.6%
99
 
4.6%
M 93
 
4.3%
F 93
 
4.3%
m 93
 
4.3%
50
 
2.3%
44
 
2.0%
Other values (172) 798
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 825
38.0%
Lowercase Letter 744
34.3%
Space Separator 250
 
11.5%
Uppercase Letter 186
 
8.6%
Decimal Number 73
 
3.4%
Other Punctuation 38
 
1.8%
Open Punctuation 19
 
0.9%
Close Punctuation 19
 
0.9%
Math Symbol 17
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Decimal Number
ValueCountFrequency (%)
5 10
13.7%
6 10
13.7%
4 9
12.3%
3 9
12.3%
7 7
9.6%
0 7
9.6%
9 7
9.6%
8 7
9.6%
1 5
6.8%
2 2
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 279
37.5%
l 186
25.0%
a 186
25.0%
m 93
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
76.3%
; 7
 
18.4%
· 2
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 93
50.0%
F 93
50.0%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 930
42.8%
Hangul 825
38.0%
Common 416
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
Common
ValueCountFrequency (%)
250
60.1%
, 29
 
7.0%
( 19
 
4.6%
) 19
 
4.6%
~ 17
 
4.1%
5 10
 
2.4%
6 10
 
2.4%
4 9
 
2.2%
3 9
 
2.2%
7 7
 
1.7%
Other values (7) 37
 
8.9%
Latin
ValueCountFrequency (%)
e 279
30.0%
l 186
20.0%
a 186
20.0%
M 93
 
10.0%
F 93
 
10.0%
m 93
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
61.9%
Hangul 825
38.0%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 279
20.8%
250
18.6%
l 186
13.8%
a 186
13.8%
M 93
 
6.9%
F 93
 
6.9%
m 93
 
6.9%
, 29
 
2.2%
( 19
 
1.4%
) 19
 
1.4%
Other values (12) 97
 
7.2%
Hangul
ValueCountFrequency (%)
99
 
12.0%
50
 
6.1%
44
 
5.3%
29
 
3.5%
27
 
3.3%
26
 
3.2%
25
 
3.0%
22
 
2.7%
17
 
2.1%
11
 
1.3%
Other values (149) 475
57.6%
None
ValueCountFrequency (%)
· 2
100.0%

Sample

8. 산업중분류, 종사자남여 및 읍면동별 종사자수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 98. Number of Workers by Industrial Divisions, Sex of Workers & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 208. 산업중분류, 종사자남여 및 읍면동별 종사자수.1Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 308. Number of Workers by Industrial Divisions, Sex of Workers & Provinces.1Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 41Unnamed: 42
0단위 : 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : PersonNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>단위 : 명<NA>NaNNaNNaNNaNNaNNaNNaNNaNUnit : PersonNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>
1NaN산업분류 Industrial classification용인시\nYongin-si처인구\nCheoin-gu포곡읍\nPogok-eup모현면\nMohyeon-myeon남사면\nNamsa-myeon이동면\nIdong-myeon원삼면\nWonsam-myeon백암면\nBaegam-myeon양지면\nYangji-myeon중앙동\nJungang-dong역삼동\nYeoksam-dong유림동\nYurim-dong동부동\nDongbu-dong기흥구\nGiheung-gu구갈동\nGugal-dong상갈동\nSanggal-dong기흥동\nGiheung-dong산업분류\nIndustrial classification<NA>NaN산업분류 Industrial classification서농동\nSeonong-dong구성동\nGuseong-dong마북동\nMabuk-dong동백동\nDongbaek-dong보정동\nBojeong-dong상하동\nSangha-dong신갈동\nSingal-dong영덕동\nYeongdeok-dong수지구\nSuji-gu풍덕천1동\nPungdoekcheon 1(il)-dong풍덕천2동\nPungdoekcheon 2(i)-dong신봉동\nSinbong-dong죽전1동\nJukjeon 1(il)-dong죽전2동\nJukjeon 2(i)-dong동천동\nDongcheon-dong상현1동\nSanghyeon 1(il)-dong상현2동\nSanghyeon 2(i)-dong성복동\nSeongbok-dong산업분류\nIndustrial classification<NA>
2NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>
3TT전산업248 45989 86511 7149 4019 6958 3114 4965 1678 90712 8457 9586 3774 994108 17310 2989 0947 829TT전산업TT전산업23 0363 8589 5389 60210 7064 2327 07912 90150 4216 5786 5073 57211 8075 0587 1932 6403 0654 001TT전산업
4NaN남자141 76155 4586 7146 3206 5895 1623 1153 3486 1116 8144 5773 6323 07662 7384 8884 9285 643NaNMaleNaN남자14 7711 8536 3814 1276 2142 1213 6128 20023 5653 1262 7271 2626 3392 0533 9369961 1162 010NaNMale
5NaN여자106 69834 4075 0003 0813 1063 1491 3811 8192 7966 0313 3812 7451 91845 4355 4104 1662 186NaNFemaleNaN여자8 2652 0053 1575 4754 4922 1113 4674 70126 8563 4523 7802 3105 4683 0053 2571 6441 9491 991NaNFemale
6NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>
7A농업, 임업 및 어업 (01 ~ 03)1039929-29--8-31-2-4---A농업, 임업 및 어업 (01 ~ 03)A농업, 임업 및 어업 (01 ~ 03)------4-----------A농업, 임업 및 어업 (01 ~ 03)
8NaN남자848120-24--8-27-2-3---NaNMaleNaN남자------3-----------NaNMale
9NaN여자19189-5----4---1---NaNFemaleNaN여자------1-----------NaNFemale
8. 산업중분류, 종사자남여 및 읍면동별 종사자수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 98. Number of Workers by Industrial Divisions, Sex of Workers & ProvincesUnnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 208. 산업중분류, 종사자남여 및 읍면동별 종사자수.1Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 308. Number of Workers by Industrial Divisions, Sex of Workers & Provinces.1Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38Unnamed: 39Unnamed: 40Unnamed: 41Unnamed: 42
291NaN여자4 9741 425143103247950551025021571021081 95121321986NaNFemaleNaN여자67102147281303802591941 598210146158252179247101179126NaNFemale
29294협회 및 단체2 660907656624596261891621161011021 0141011197694협회 및 단체94협회 및 단체776381971001028711173958681171405514045724494협회 및 단체
293NaN남자1 71258443482034253860105568669646757336NaNMaleNaN남자40443967606862824824346761013880323333NaNMale
294NaN여자948323221842537232957601533368264640NaNFemaleNaN여자3719423040342529257152241391760133911NaNFemale
29595수리업3 7651 158941368072473617615387188891 7967215717695수리업95수리업2034891373776619447481116633506411720464328195수리업
296NaN남자3 211977771177263423114312969165691 56956135157NaNMaleNaN남자1631811153315616143066514327374998161572271NaNMale
297NaN여자554181171989553324182320227162219NaNFemaleNaN여자43822461033441462361315194371010NaNFemale
29896기타 개인 서비스업4 9631 403146155478717568051811694871 9212492004096기타 개인 서비스업96기타 개인 서비스업37115144313292582751981 63922319014527117319811518613896기타 개인 서비스업
299NaN남자1 491482428935429294097373032565784913NaNMaleNaN남자1135478475227477444517241733054345633NaNMale
300NaN여자3 472921104661245827404217964551 35617115127NaNFemaleNaN여자268097229217362011211 19517211810419814314481130105NaNFemale
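
The sample rows show counts written with a space as the thousands separator (for example `248 459`) and a dash where no figure is given, which is one plausible reason the numeric columns are reported as Unsupported. A minimal sketch of a cell parser follows. The helper name `parse_count` is hypothetical, and treating `-` as missing rather than zero is an assumption the report does not confirm.

```python
def parse_count(cell):
    """Convert a cell such as '248 459' to int; return None for '-' or blanks."""
    if cell is None:
        return None
    text = str(cell).strip()
    if text in ("", "-", "NaN", "<NA>"):
        return None  # assumption: a dash means "no value", not zero
    # Spaces (including no-break spaces) act as thousands separators here.
    digits = text.replace(" ", "").replace("\u00a0", "")
    return int(digits) if digits.isdecimal() else None

print(parse_count("248 459"))
print(parse_count("-"))
print(parse_count("남자"))
```

Labels such as `남자` fall through to `None`, so the function can be applied to a whole column without raising on stray text.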

Duplicate rows

Most frequently occurring

Row | Unnamed: 1 | Unnamed: 20 | Unnamed: 22 | Unnamed: 42 | # duplicates
0 | 남자 | Male | 남자 | Male | 93
1 | 여자 | Female | 여자 | Female | 93
2 | <NA> | <NA> | <NA> | <NA> | 21
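
The duplicate table lists only the four Text columns, which suggests (an inference, not something the report states) that duplicates were counted over those columns alone. A minimal plain-Python sketch of that kind of count follows; the rows are hypothetical stand-ins, not the actual data.

```python
from collections import Counter

# Hypothetical rows restricted to the four Text columns
# (Unnamed: 1, Unnamed: 20, Unnamed: 22, Unnamed: 42).
rows = [
    ("남자", "Male", "남자", "Male"),
    ("여자", "Female", "여자", "Female"),
    ("전산업", "전산업", "전산업", "전산업"),
    ("남자", "Male", "남자", "Male"),
    (None, None, None, None),
    (None, None, None, None),
]

counts = Counter(rows)
# Keep only combinations seen more than once, most frequent first.
duplicates = [(combo, n) for combo, n in counts.most_common() if n > 1]
for combo, n in duplicates:
    print(combo, n)
```

Because the label columns repeat `남자`/`Male` and `여자`/`Female` for every industry, a count restricted to them will flag rows whose numeric columns differ, so these duplicates are not necessarily redundant records.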