Overview

Dataset statistics

Number of variables20
Number of observations2003
Missing cells6071
Missing cells (%)15.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory313.1 KiB
Average record size in memory160.1 B

Variable types

Text7
Unsupported10
Categorical3

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21705/F/1/datasetView.do

Alerts

Unnamed: 17 is highly overall correlated with Unnamed: 18 and 1 other fieldsHigh correlation
Unnamed: 18 is highly overall correlated with Unnamed: 17 and 1 other fieldsHigh correlation
Unnamed: 19 is highly overall correlated with Unnamed: 17 and 1 other fieldsHigh correlation
Unnamed: 17 is highly imbalanced (99.1%)Imbalance
Unnamed: 18 is highly imbalanced (99.1%)Imbalance
Unnamed: 19 is highly imbalanced (56.2%)Imbalance
Unnamed: 2 has 2001 (99.9%) missing valuesMissing
Unnamed: 3 has 2001 (99.9%) missing valuesMissing
Unnamed: 8 has 41 (2.0%) missing valuesMissing
Unnamed: 16 has 2001 (99.9%) missing valuesMissing
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 09:16:04.672263
Analysis finished2023-12-11 09:16:06.322493
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2002
Distinct (%)100.0%
Missing1
Missing (%)< 0.1%
Memory size15.8 KiB
2023-12-11T18:16:06.752352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length3.454046
Min length1

Characters and Unicode

Total characters6915
Distinct characters27
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2002 ?
Unique (%)100.0%

Sample

1st rowID관찰조사
2nd rowID_OBSERV_EXAMIN
3rd row1
4th row2
5th row3
ValueCountFrequency (%)
13 1
 
< 0.1%
1328 1
 
< 0.1%
1342 1
 
< 0.1%
1341 1
 
< 0.1%
1340 1
 
< 0.1%
1339 1
 
< 0.1%
1338 1
 
< 0.1%
1337 1
 
< 0.1%
1336 1
 
< 0.1%
1335 1
 
< 0.1%
Other values (1992) 1992
99.5%
2023-12-11T18:16:07.434583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1600
23.1%
2 601
 
8.7%
5 600
 
8.7%
6 600
 
8.7%
3 600
 
8.7%
7 600
 
8.7%
8 600
 
8.7%
9 600
 
8.7%
4 600
 
8.7%
0 492
 
7.1%
Other values (17) 22
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6893
99.7%
Uppercase Letter 16
 
0.2%
Other Letter 4
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I 3
18.8%
D 2
12.5%
E 2
12.5%
M 1
 
6.2%
N 1
 
6.2%
A 1
 
6.2%
X 1
 
6.2%
V 1
 
6.2%
R 1
 
6.2%
S 1
 
6.2%
Other values (2) 2
12.5%
Decimal Number
ValueCountFrequency (%)
1 1600
23.2%
2 601
 
8.7%
5 600
 
8.7%
6 600
 
8.7%
3 600
 
8.7%
7 600
 
8.7%
8 600
 
8.7%
9 600
 
8.7%
4 600
 
8.7%
0 492
 
7.1%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6895
99.7%
Latin 16
 
0.2%
Hangul 4
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 3
18.8%
D 2
12.5%
E 2
12.5%
M 1
 
6.2%
N 1
 
6.2%
A 1
 
6.2%
X 1
 
6.2%
V 1
 
6.2%
R 1
 
6.2%
S 1
 
6.2%
Other values (2) 2
12.5%
Common
ValueCountFrequency (%)
1 1600
23.2%
2 601
 
8.7%
5 600
 
8.7%
6 600
 
8.7%
3 600
 
8.7%
7 600
 
8.7%
8 600
 
8.7%
9 600
 
8.7%
4 600
 
8.7%
0 492
 
7.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6911
99.9%
Hangul 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1600
23.2%
2 601
 
8.7%
5 600
 
8.7%
6 600
 
8.7%
3 600
 
8.7%
7 600
 
8.7%
8 600
 
8.7%
9 600
 
8.7%
4 600
 
8.7%
0 492
 
7.1%
Other values (13) 18
 
0.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct1002
Distinct (%)50.0%
Missing1
Missing (%)< 0.1%
Memory size15.8 KiB
2023-12-11T18:16:07.897172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length6
Mean length6.1868132
Min length6

Characters and Unicode

Total characters12386
Distinct characters30
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row조사지점코드
2nd rowEXAMIN_SPOT_CD
3rd row01-001
4th row01-001
5th row01-002
ValueCountFrequency (%)
13-023 2
 
0.1%
18-007 2
 
0.1%
17-252 2
 
0.1%
19-004 2
 
0.1%
17-258 2
 
0.1%
17-3025 2
 
0.1%
17-3052 2
 
0.1%
17-3070 2
 
0.1%
17-3082 2
 
0.1%
17-3090 2
 
0.1%
Other values (992) 1982
99.0%
2023-12-11T18:16:08.470480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2850
23.0%
- 2000
16.1%
1 1926
15.5%
2 1742
14.1%
3 816
 
6.6%
4 714
 
5.8%
5 620
 
5.0%
6 482
 
3.9%
7 440
 
3.6%
9 400
 
3.2%
Other values (20) 396
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10366
83.7%
Dash Punctuation 2000
 
16.1%
Uppercase Letter 12
 
0.1%
Other Letter 6
 
< 0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 1
8.3%
D 1
8.3%
C 1
8.3%
I 1
8.3%
O 1
8.3%
P 1
8.3%
S 1
8.3%
N 1
8.3%
M 1
8.3%
A 1
8.3%
Other values (2) 2
16.7%
Decimal Number
ValueCountFrequency (%)
0 2850
27.5%
1 1926
18.6%
2 1742
16.8%
3 816
 
7.9%
4 714
 
6.9%
5 620
 
6.0%
6 482
 
4.6%
7 440
 
4.2%
9 400
 
3.9%
8 376
 
3.6%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 2000
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12368
99.9%
Latin 12
 
0.1%
Hangul 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2850
23.0%
- 2000
16.2%
1 1926
15.6%
2 1742
14.1%
3 816
 
6.6%
4 714
 
5.8%
5 620
 
5.0%
6 482
 
3.9%
7 440
 
3.6%
9 400
 
3.2%
Other values (2) 378
 
3.1%
Latin
ValueCountFrequency (%)
T 1
8.3%
D 1
8.3%
C 1
8.3%
I 1
8.3%
O 1
8.3%
P 1
8.3%
S 1
8.3%
N 1
8.3%
M 1
8.3%
A 1
8.3%
Other values (2) 2
16.7%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12380
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2850
23.0%
- 2000
16.2%
1 1926
15.6%
2 1742
14.1%
3 816
 
6.6%
4 714
 
5.8%
5 620
 
5.0%
6 482
 
3.9%
7 440
 
3.6%
9 400
 
3.2%
Other values (14) 390
 
3.2%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Unnamed: 2
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing2001
Missing (%)99.9%
Memory size15.8 KiB
2023-12-11T18:16:08.634314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length7
Min length4

Characters and Unicode

Total characters14
Distinct characters13
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row조사일자
2nd rowEXAMIN_DAY
ValueCountFrequency (%)
조사일자 1
50.0%
examin_day 1
50.0%
2023-12-11T18:16:08.912082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
E 1
 
7.1%
X 1
 
7.1%
M 1
 
7.1%
I 1
 
7.1%
N 1
 
7.1%
Other values (3) 3
21.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 9
64.3%
Other Letter 4
28.6%
Connector Punctuation 1
 
7.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 2
22.2%
E 1
11.1%
X 1
11.1%
M 1
11.1%
I 1
11.1%
N 1
11.1%
D 1
11.1%
Y 1
11.1%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9
64.3%
Hangul 4
28.6%
Common 1
 
7.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 2
22.2%
E 1
11.1%
X 1
11.1%
M 1
11.1%
I 1
11.1%
N 1
11.1%
D 1
11.1%
Y 1
11.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Common
ValueCountFrequency (%)
_ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10
71.4%
Hangul 4
 
28.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 2
20.0%
E 1
10.0%
X 1
10.0%
M 1
10.0%
I 1
10.0%
N 1
10.0%
_ 1
10.0%
D 1
10.0%
Y 1
10.0%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 3
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing2001
Missing (%)99.9%
Memory size15.8 KiB
2023-12-11T18:16:09.100846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length7.5
Mean length7.5
Min length4

Characters and Unicode

Total characters15
Distinct characters13
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row조사요일
2nd rowEXAMIN_DATE
ValueCountFrequency (%)
조사요일 1
50.0%
examin_date 1
50.0%
2023-12-11T18:16:09.419420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 2
13.3%
A 2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
X 1
 
6.7%
M 1
 
6.7%
I 1
 
6.7%
N 1
 
6.7%
Other values (3) 3
20.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 10
66.7%
Other Letter 4
 
26.7%
Connector Punctuation 1
 
6.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 2
20.0%
A 2
20.0%
X 1
10.0%
M 1
10.0%
I 1
10.0%
N 1
10.0%
D 1
10.0%
T 1
10.0%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10
66.7%
Hangul 4
 
26.7%
Common 1
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 2
20.0%
A 2
20.0%
X 1
10.0%
M 1
10.0%
I 1
10.0%
N 1
10.0%
D 1
10.0%
T 1
10.0%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Common
ValueCountFrequency (%)
_ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11
73.3%
Hangul 4
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 2
18.2%
A 2
18.2%
X 1
9.1%
M 1
9.1%
I 1
9.1%
N 1
9.1%
_ 1
9.1%
D 1
9.1%
T 1
9.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct147
Distinct (%)7.3%
Missing1
Missing (%)< 0.1%
Memory size15.8 KiB
2023-12-11T18:16:09.712600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length4
Mean length4.0064935
Min length4

Characters and Unicode

Total characters8021
Distinct characters25
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)1.6%

Sample

1st row조사시작시간
2nd rowEXAMIN_START_TM
3rd row0810
4th row0920
5th row0755
ValueCountFrequency (%)
1005 134
 
6.7%
0735 110
 
5.5%
0905 96
 
4.8%
1235 92
 
4.6%
1305 78
 
3.9%
1505 73
 
3.6%
0805 73
 
3.6%
1405 70
 
3.5%
1605 66
 
3.3%
1000 61
 
3.0%
Other values (137) 1149
57.4%
2023-12-11T18:16:10.199872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2822
35.2%
1 1419
17.7%
5 1365
17.0%
3 709
 
8.8%
7 326
 
4.1%
2 296
 
3.7%
4 295
 
3.7%
8 278
 
3.5%
9 256
 
3.2%
6 234
 
2.9%
Other values (15) 21
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8000
99.7%
Uppercase Letter 13
 
0.2%
Other Letter 6
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2822
35.3%
1 1419
17.7%
5 1365
17.1%
3 709
 
8.9%
7 326
 
4.1%
2 296
 
3.7%
4 295
 
3.7%
8 278
 
3.5%
9 256
 
3.2%
6 234
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
T 3
23.1%
A 2
15.4%
M 2
15.4%
E 1
 
7.7%
X 1
 
7.7%
I 1
 
7.7%
N 1
 
7.7%
S 1
 
7.7%
R 1
 
7.7%
Other Letter
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8002
99.8%
Latin 13
 
0.2%
Hangul 6
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2822
35.3%
1 1419
17.7%
5 1365
17.1%
3 709
 
8.9%
7 326
 
4.1%
2 296
 
3.7%
4 295
 
3.7%
8 278
 
3.5%
9 256
 
3.2%
6 234
 
2.9%
Latin
ValueCountFrequency (%)
T 3
23.1%
A 2
15.4%
M 2
15.4%
E 1
 
7.7%
X 1
 
7.7%
I 1
 
7.7%
N 1
 
7.7%
S 1
 
7.7%
R 1
 
7.7%
Hangul
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8015
99.9%
Hangul 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2822
35.2%
1 1419
17.7%
5 1365
17.0%
3 709
 
8.8%
7 326
 
4.1%
2 296
 
3.7%
4 295
 
3.7%
8 278
 
3.5%
9 256
 
3.2%
6 234
 
2.9%
Other values (10) 15
 
0.2%
Hangul
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct181
Distinct (%)9.0%
Missing1
Missing (%)< 0.1%
Memory size15.8 KiB
2023-12-11T18:16:10.569824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length4
Mean length4.0054945
Min length4

Characters and Unicode

Total characters8019
Distinct characters25
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)2.5%

Sample

1st row조사완료시간
2nd rowEXAMIN_END_TM
3rd row0910
4th row1020
5th row0855
ValueCountFrequency (%)
0955 69
 
3.4%
1105 57
 
2.8%
1455 55
 
2.7%
1130 49
 
2.4%
1355 48
 
2.4%
0855 48
 
2.4%
1000 46
 
2.3%
1100 46
 
2.3%
1055 45
 
2.2%
1110 44
 
2.2%
Other values (171) 1495
74.7%
2023-12-11T18:16:11.009215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2109
26.3%
1 1950
24.3%
5 1635
20.4%
3 563
 
7.0%
9 478
 
6.0%
4 377
 
4.7%
8 302
 
3.8%
2 229
 
2.9%
6 200
 
2.5%
7 157
 
2.0%
Other values (15) 19
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8000
99.8%
Uppercase Letter 11
 
0.1%
Other Letter 6
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2109
26.4%
1 1950
24.4%
5 1635
20.4%
3 563
 
7.0%
9 478
 
6.0%
4 377
 
4.7%
8 302
 
3.8%
2 229
 
2.9%
6 200
 
2.5%
7 157
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
N 2
18.2%
M 2
18.2%
E 2
18.2%
D 1
9.1%
A 1
9.1%
I 1
9.1%
X 1
9.1%
T 1
9.1%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8002
99.8%
Latin 11
 
0.1%
Hangul 6
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2109
26.4%
1 1950
24.4%
5 1635
20.4%
3 563
 
7.0%
9 478
 
6.0%
4 377
 
4.7%
8 302
 
3.8%
2 229
 
2.9%
6 200
 
2.5%
7 157
 
2.0%
Latin
ValueCountFrequency (%)
N 2
18.2%
M 2
18.2%
E 2
18.2%
D 1
9.1%
A 1
9.1%
I 1
9.1%
X 1
9.1%
T 1
9.1%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8013
99.9%
Hangul 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2109
26.3%
1 1950
24.3%
5 1635
20.4%
3 563
 
7.0%
9 478
 
6.0%
4 377
 
4.7%
8 302
 
3.8%
2 229
 
2.9%
6 200
 
2.5%
7 157
 
2.0%
Other values (9) 13
 
0.2%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Unnamed: 6
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing41
Missing (%)2.0%
Memory size15.8 KiB

Unnamed: 9
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 10
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 11
Unsupported

REJECTED  UNSUPPORTED 

Missing11
Missing (%)0.5%
Memory size15.8 KiB

Unnamed: 12
Unsupported

REJECTED  UNSUPPORTED 

Missing2
Missing (%)0.1%
Memory size15.8 KiB

Unnamed: 13
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 14
Unsupported

REJECTED  UNSUPPORTED 

Missing4
Missing (%)0.2%
Memory size15.8 KiB

Unnamed: 15
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)< 0.1%
Memory size15.8 KiB

Unnamed: 16
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing2001
Missing (%)99.9%
Memory size15.8 KiB
2023-12-11T18:16:11.192613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4.5
Mean length4.5
Min length3

Characters and Unicode

Total characters9
Distinct characters9
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row기타특이사항
2nd rowETC
ValueCountFrequency (%)
기타특이사항 1
50.0%
etc 1
50.0%
2023-12-11T18:16:11.458892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
E 1
11.1%
T 1
11.1%
C 1
11.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6
66.7%
Uppercase Letter 3
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
T 1
33.3%
C 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6
66.7%
Latin 3
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Latin
ValueCountFrequency (%)
E 1
33.3%
T 1
33.3%
C 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6
66.7%
ASCII 3
33.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
ASCII
ValueCountFrequency (%)
E 1
33.3%
T 1
33.3%
C 1
33.3%

Unnamed: 17
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.8 KiB
2014
2000 
<NA>
 
1
년도
 
1
YEAR
 
1

Length

Max length4
Median length4
Mean length3.9990015
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row년도
3rd rowYEAR
4th row2014
5th row2014

Common Values

ValueCountFrequency (%)
2014 2000
99.9%
<NA> 1
 
< 0.1%
년도 1
 
< 0.1%
YEAR 1
 
< 0.1%

Length

2023-12-11T18:16:11.615195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T18:16:11.745059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 2000
99.9%
na 1
 
< 0.1%
년도 1
 
< 0.1%
year 1
 
< 0.1%

Unnamed: 18
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.8 KiB
주중
2000 
<NA>
 
1
주중주말구분
 
1
EXAMIN_WDAY_WEND
 
1

Length

Max length16
Median length2
Mean length2.009985
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row주중주말구분
3rd rowEXAMIN_WDAY_WEND
4th row주중
5th row주중

Common Values

ValueCountFrequency (%)
주중 2000
99.9%
<NA> 1
 
< 0.1%
주중주말구분 1
 
< 0.1%
EXAMIN_WDAY_WEND 1
 
< 0.1%

Length

2023-12-11T18:16:11.878452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T18:16:11.999037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주중 2000
99.9%
na 1
 
< 0.1%
주중주말구분 1
 
< 0.1%
examin_wday_wend 1
 
< 0.1%

Unnamed: 19
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.8 KiB
오전
1016 
오후
984 
<NA>
 
1
오전오후구분
 
1
EXAMIN_AM_PM
 
1

Length

Max length12
Median length2
Mean length2.007988
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row오전오후구분
3rd rowEXAMIN_AM_PM
4th row오전
5th row오전

Common Values

ValueCountFrequency (%)
오전 1016
50.7%
오후 984
49.1%
<NA> 1
 
< 0.1%
오전오후구분 1
 
< 0.1%
EXAMIN_AM_PM 1
 
< 0.1%

Length

2023-12-11T18:16:12.115083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T18:16:12.241290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오전 1016
50.7%
오후 984
49.1%
na 1
 
< 0.1%
오전오후구분 1
 
< 0.1%
examin_am_pm 1
 
< 0.1%

Correlations

2023-12-11T18:16:12.323664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19
Unnamed: 21.0000.0000.0000.0000.0000.000
Unnamed: 30.0001.0000.0000.0000.0000.000
Unnamed: 160.0000.0001.0000.0000.0000.000
Unnamed: 170.0000.0000.0001.0001.0001.000
Unnamed: 180.0000.0000.0001.0001.0001.000
Unnamed: 190.0000.0000.0001.0001.0001.000
2023-12-11T18:16:12.678299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 17Unnamed: 18Unnamed: 19
Unnamed: 171.0001.0001.000
Unnamed: 181.0001.0001.000
Unnamed: 191.0001.0001.000
2023-12-11T18:16:12.774786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 17Unnamed: 18Unnamed: 19
Unnamed: 171.0001.0001.000
Unnamed: 181.0001.0001.000
Unnamed: 191.0001.0001.000

Missing values

2023-12-11T18:16:05.344732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T18:16:05.653756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T18:16:05.987405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

유동인구_관찰조사_2014Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19
0<NA><NA><NA><NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA><NA><NA><NA>
1ID관찰조사조사지점코드조사일자조사요일조사시작시간조사완료시간남자유동인구수여성유동인구수20세미만유동인구수20대30대유동인구수40대50대유동인구수60대이상유동인구수정장착용유동인구수캐주얼착용유동인구수물건소지유동인구수빈손통행유동인구수기타특이사항년도주중주말구분오전오후구분
2ID_OBSERV_EXAMINEXAMIN_SPOT_CDEXAMIN_DAYEXAMIN_DATEEXAMIN_START_TMEXAMIN_END_TMMALEFEMALETWYO_BELOTWNT_THRTSFRTS_FFTSSXTS_ABOVESUIT_WEARCSL_WEARTHING_POSSESEMTHD_PASNGETCYEAREXAMIN_WDAY_WENDEXAMIN_AM_PM
3101-001<NA><NA>0810091099101020313<NA>2014주중오전
4201-001<NA><NA>092010201242000204<NA>2014주중오전
5301-002<NA><NA>075508554540320502<NA>2014주중오전
6401-002<NA><NA>090010001101201410<NA>2014주중오전
7501-004<NA><NA>07360830911413621412116<NA>2014주중오전
8601-004<NA><NA>100611108602831141371<NA>2014주중오전
9701-007<NA><NA>130014003302250403<NA>2014주중오후
유동인구_관찰조사_2014Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19
1993199125-238<NA><NA>1000105962301414140451824<NA>2014주중오전
1994199225-238<NA><NA>1600165929851228672141505976<NA>2014주중오후
1995199325-408<NA><NA>082509307546582721938821650<NA>2014주중오전
1996199425-408<NA><NA>09401055455701922368861773<NA>2014주중오전
1997199525-430<NA><NA>073508352425428162968651<NA>2014주중오전
1998199625-430<NA><NA>1005110516441151632484726<NA>2014주중오전
1999199725-815<NA><NA>12351435242423652381236655<NA>2014주중오후
2000199825-815<NA><NA>143519152136131483517421247<NA>2014주중오후
2001199925-816<NA><NA>073509352010223155711413<NA>2014주중오전
2002200025-816<NA><NA>0935112578032531056<NA>2014주중오전