Overview

Dataset statistics

Number of variables14
Number of observations200
Missing cells290
Missing cells (%)10.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.2 KiB
Average record size in memory118.7 B

Variable types

Text6
Numeric5
Categorical2
Unsupported1

Alerts

TERMINAL_CLSS has constant value ""Constant
PROC_CLSS has constant value ""Constant
BLD_CD is highly overall correlated with HOUS_ID and 1 other fieldsHigh correlation
HOUS_ID is highly overall correlated with BLD_CD and 1 other fieldsHigh correlation
Y_AXIS is highly overall correlated with BLD_CD and 1 other fieldsHigh correlation
BLD_CD has 44 (22.0%) missing valuesMissing
EDIT_NEWADDR has 46 (23.0%) missing valuesMissing
ROUTE_CNT has 200 (100.0%) missing valuesMissing
TERMINAL_CD has unique valuesUnique
TERMINAL_NM has unique valuesUnique
ROUTE_CNT is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 06:16:50.076521
Analysis finished2023-12-10 06:17:06.223117
Duration16.15 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

TERMINAL_CD
Text

UNIQUE 

Distinct200
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:17:06.609435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters1200
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)100.0%

Sample

1st rowE00207
2nd rowE00209
3rd rowE00210
4th rowE00211
5th rowE00212
ValueCountFrequency (%)
e00207 1
 
0.5%
e00141 1
 
0.5%
e00160 1
 
0.5%
e00129 1
 
0.5%
e00130 1
 
0.5%
e00131 1
 
0.5%
e00132 1
 
0.5%
e00133 1
 
0.5%
e00134 1
 
0.5%
e00136 1
 
0.5%
Other values (190) 190
95.0%
2023-12-10T15:17:07.417975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 511
42.6%
E 200
 
16.7%
1 113
 
9.4%
2 86
 
7.2%
4 56
 
4.7%
6 44
 
3.7%
3 42
 
3.5%
7 40
 
3.3%
5 40
 
3.3%
8 35
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
83.3%
Uppercase Letter 200
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 511
51.1%
1 113
 
11.3%
2 86
 
8.6%
4 56
 
5.6%
6 44
 
4.4%
3 42
 
4.2%
7 40
 
4.0%
5 40
 
4.0%
8 35
 
3.5%
9 33
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
E 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1000
83.3%
Latin 200
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 511
51.1%
1 113
 
11.3%
2 86
 
8.6%
4 56
 
5.6%
6 44
 
4.4%
3 42
 
4.2%
7 40
 
4.0%
5 40
 
4.0%
8 35
 
3.5%
9 33
 
3.3%
Latin
ValueCountFrequency (%)
E 200
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 511
42.6%
E 200
 
16.7%
1 113
 
9.4%
2 86
 
7.2%
4 56
 
4.7%
6 44
 
3.7%
3 42
 
3.5%
7 40
 
3.3%
5 40
 
3.3%
8 35
 
2.9%

TERMINAL_NM
Text

UNIQUE 

Distinct200
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:17:07.934481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length9.41
Min length4

Characters and Unicode

Total characters1882
Distinct characters153
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)100.0%

Sample

1st row동서울고속버스터미널(동서울종합터미널)
2nd row동창공용버스터미널(세지정류소)
3rd row동해시외버스터미널
4th row두밀리정류소
5th row두원공대
ValueCountFrequency (%)
동서울고속버스터미널(동서울종합터미널 1
 
0.5%
다시공용버스정류장 1
 
0.5%
대둔산공용버스터미널 1
 
0.5%
내촌포천정류소 1
 
0.5%
노성정류소 1
 
0.5%
노원역정류소 1
 
0.5%
녹동신항정류소 1
 
0.5%
녹동터미널(녹동공용버스정류장 1
 
0.5%
녹전정류소 1
 
0.5%
논티정류소 1
 
0.5%
Other values (190) 190
95.0%
2023-12-10T15:17:09.116588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
128
 
6.8%
122
 
6.5%
117
 
6.2%
112
 
6.0%
112
 
6.0%
102
 
5.4%
102
 
5.4%
100
 
5.3%
60
 
3.2%
55
 
2.9%
Other values (143) 872
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1784
94.8%
Close Punctuation 49
 
2.6%
Open Punctuation 49
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
128
 
7.2%
122
 
6.8%
117
 
6.6%
112
 
6.3%
112
 
6.3%
102
 
5.7%
102
 
5.7%
100
 
5.6%
60
 
3.4%
55
 
3.1%
Other values (141) 774
43.4%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1784
94.8%
Common 98
 
5.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
128
 
7.2%
122
 
6.8%
117
 
6.6%
112
 
6.3%
112
 
6.3%
102
 
5.7%
102
 
5.7%
100
 
5.6%
60
 
3.4%
55
 
3.1%
Other values (141) 774
43.4%
Common
ValueCountFrequency (%)
) 49
50.0%
( 49
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1784
94.8%
ASCII 98
 
5.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
128
 
7.2%
122
 
6.8%
117
 
6.6%
112
 
6.3%
112
 
6.3%
102
 
5.7%
102
 
5.7%
100
 
5.6%
60
 
3.4%
55
 
3.1%
Other values (141) 774
43.4%
ASCII
ValueCountFrequency (%)
) 49
50.0%
( 49
50.0%

BLD_CD
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct148
Distinct (%)94.9%
Missing44
Missing (%)22.0%
Infinite0
Infinite (%)0.0%
Mean4.2521288 × 1024
Minimum1.1215103 × 1024
Maximum4.889045 × 1024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:17:09.497009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1215103 × 1024
5-th percentile2.7170102 × 1024
Q14.1820307 × 1024
median4.4755322 × 1024
Q34.6822825 × 1024
95-th percentile4.8155417 × 1024
Maximum4.889045 × 1024
Range3.7675347 × 1024
Interquartile range (IQR)5.0025187 × 1023

Descriptive statistics

Standard deviation7.3756726 × 1023
Coefficient of variation (CV)0.17345835
Kurtosis6.6921813
Mean4.2521288 × 1024
Median Absolute Deviation (MAD)2.3749315 × 1023
Skewness-2.4972745
Sum6.6333209 × 1026
Variance5.4400547 × 1047
MonotonicityNot monotonic
2023-12-10T15:17:09.881062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.91401150010049e+24 2
 
1.0%
2.71701020011856e+24 2
 
1.0%
3.01101150010068e+24 2
 
1.0%
3.01401170010759e+24 2
 
1.0%
4.6770253241161294e+24 2
 
1.0%
4.82501090011264e+24 2
 
1.0%
4.5180119001005104e+24 2
 
1.0%
3.01701120010908e+24 2
 
1.0%
4.1570101001038e+24 1
 
0.5%
4.1730350331080497e+24 1
 
0.5%
Other values (138) 138
69.0%
(Missing) 44
 
22.0%
ValueCountFrequency (%)
1.12151030010546e+24 1
0.5%
1.16501080011446e+24 1
0.5%
1.16801010010831e+24 1
0.5%
1.1710107001015e+24 1
0.5%
2.67102502110124e+24 1
0.5%
2.7140102e+24 1
0.5%
2.71401020010328e+24 1
0.5%
2.71701020011856e+24 2
1.0%
2.72601020011041e+24 1
0.5%
2.91401150010049e+24 2
1.0%
ValueCountFrequency (%)
4.889045023108991e+24 1
0.5%
4.88802502411005e+24 1
0.5%
4.88202502510416e+24 1
0.5%
4.87203302610442e+24 1
0.5%
4.8310109001097906e+24 1
0.5%
4.82501090011264e+24 2
1.0%
4.8240350211009693e+24 1
0.5%
4.81271060010267e+24 1
0.5%
4.8125158001000494e+24 1
0.5%
4.7930350211026e+24 1
0.5%
Distinct192
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:17:10.521934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length22.52
Min length18

Characters and Unicode

Total characters4504
Distinct characters202
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique184 ?
Unique (%)92.0%

Sample

1st row서울특별시 광진구 구의동 546-1번지
2nd row전라남도 나주시 세지면 오봉리 727-173번지
3rd row강원도 동해시 천곡동 485-10번지
4th row경기도 가평군 가평읍 두밀리 742번지
5th row경기도 안성시 죽산면 장원리 716번지
ValueCountFrequency (%)
경기도 30
 
3.2%
경상북도 30
 
3.2%
강원도 26
 
2.7%
전라남도 24
 
2.5%
충청남도 23
 
2.4%
전라북도 17
 
1.8%
충청북도 14
 
1.5%
경상남도 12
 
1.3%
대전광역시 9
 
0.9%
가평군 7
 
0.7%
Other values (621) 758
79.8%
2023-12-10T15:17:11.345890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
750
 
16.7%
206
 
4.6%
200
 
4.4%
188
 
4.2%
1 179
 
4.0%
- 164
 
3.6%
145
 
3.2%
110
 
2.4%
100
 
2.2%
98
 
2.2%
Other values (192) 2364
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2807
62.3%
Decimal Number 783
 
17.4%
Space Separator 750
 
16.7%
Dash Punctuation 164
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
 
7.3%
200
 
7.1%
188
 
6.7%
145
 
5.2%
110
 
3.9%
100
 
3.6%
98
 
3.5%
85
 
3.0%
82
 
2.9%
78
 
2.8%
Other values (180) 1515
54.0%
Decimal Number
ValueCountFrequency (%)
1 179
22.9%
3 91
11.6%
2 78
10.0%
6 75
9.6%
4 70
 
8.9%
7 68
 
8.7%
5 64
 
8.2%
8 57
 
7.3%
9 56
 
7.2%
0 45
 
5.7%
Space Separator
ValueCountFrequency (%)
750
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 164
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2807
62.3%
Common 1697
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
 
7.3%
200
 
7.1%
188
 
6.7%
145
 
5.2%
110
 
3.9%
100
 
3.6%
98
 
3.5%
85
 
3.0%
82
 
2.9%
78
 
2.8%
Other values (180) 1515
54.0%
Common
ValueCountFrequency (%)
750
44.2%
1 179
 
10.5%
- 164
 
9.7%
3 91
 
5.4%
2 78
 
4.6%
6 75
 
4.4%
4 70
 
4.1%
7 68
 
4.0%
5 64
 
3.8%
8 57
 
3.4%
Other values (2) 101
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2807
62.3%
ASCII 1697
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
750
44.2%
1 179
 
10.5%
- 164
 
9.7%
3 91
 
5.4%
2 78
 
4.6%
6 75
 
4.4%
4 70
 
4.1%
7 68
 
4.0%
5 64
 
3.8%
8 57
 
3.4%
Other values (2) 101
 
6.0%
Hangul
ValueCountFrequency (%)
206
 
7.3%
200
 
7.1%
188
 
6.7%
145
 
5.2%
110
 
3.9%
100
 
3.6%
98
 
3.5%
85
 
3.0%
82
 
2.9%
78
 
2.8%
Other values (180) 1515
54.0%

EDIT_NEWADDR
Text

MISSING 

Distinct146
Distinct (%)94.8%
Missing46
Missing (%)23.0%
Memory size1.7 KiB
2023-12-10T15:17:11.999658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length19.506494
Min length14

Characters and Unicode

Total characters3004
Distinct characters197
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)89.6%

Sample

1st row서울특별시 광진구 강변역로 50
2nd row전라남도 나주시 세지면 동창로 142
3rd row경기도 가평군 가평읍 태봉두밀로 598
4th row강원도 홍천군 두촌면 자은로 370
5th row강원도 횡성군 둔내면 경강로둔방10길 12
ValueCountFrequency (%)
경상북도 24
 
3.3%
전라남도 22
 
3.0%
경기도 21
 
2.9%
강원도 18
 
2.5%
전라북도 15
 
2.1%
충청남도 13
 
1.8%
충청북도 11
 
1.5%
경상남도 10
 
1.4%
대전광역시 8
 
1.1%
서구 6
 
0.8%
Other values (472) 580
79.7%
2023-12-10T15:17:12.959149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
574
 
19.1%
143
 
4.8%
131
 
4.4%
1 105
 
3.5%
83
 
2.8%
77
 
2.6%
71
 
2.4%
68
 
2.3%
58
 
1.9%
3 56
 
1.9%
Other values (187) 1638
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1942
64.6%
Space Separator 574
 
19.1%
Decimal Number 472
 
15.7%
Dash Punctuation 15
 
0.5%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
 
7.4%
131
 
6.7%
83
 
4.3%
77
 
4.0%
71
 
3.7%
68
 
3.5%
58
 
3.0%
56
 
2.9%
48
 
2.5%
46
 
2.4%
Other values (174) 1161
59.8%
Decimal Number
ValueCountFrequency (%)
1 105
22.2%
3 56
11.9%
2 53
11.2%
0 43
9.1%
9 42
 
8.9%
4 41
 
8.7%
5 39
 
8.3%
8 36
 
7.6%
7 31
 
6.6%
6 26
 
5.5%
Space Separator
ValueCountFrequency (%)
574
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1942
64.6%
Common 1062
35.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
 
7.4%
131
 
6.7%
83
 
4.3%
77
 
4.0%
71
 
3.7%
68
 
3.5%
58
 
3.0%
56
 
2.9%
48
 
2.5%
46
 
2.4%
Other values (174) 1161
59.8%
Common
ValueCountFrequency (%)
574
54.0%
1 105
 
9.9%
3 56
 
5.3%
2 53
 
5.0%
0 43
 
4.0%
9 42
 
4.0%
4 41
 
3.9%
5 39
 
3.7%
8 36
 
3.4%
7 31
 
2.9%
Other values (3) 42
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1942
64.6%
ASCII 1061
35.3%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
574
54.1%
1 105
 
9.9%
3 56
 
5.3%
2 53
 
5.0%
0 43
 
4.1%
9 42
 
4.0%
4 41
 
3.9%
5 39
 
3.7%
8 36
 
3.4%
7 31
 
2.9%
Other values (2) 41
 
3.9%
Hangul
ValueCountFrequency (%)
143
 
7.4%
131
 
6.7%
83
 
4.3%
77
 
4.0%
71
 
3.7%
68
 
3.5%
58
 
3.0%
56
 
2.9%
48
 
2.5%
46
 
2.4%
Other values (174) 1161
59.8%
None
ValueCountFrequency (%)
· 1
100.0%

BLK_CD_V4
Real number (ℝ)

Distinct192
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean251878.23
Minimum409
Maximum511318
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:17:13.204778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum409
5-th percentile21383.5
Q174519
median303423
Q3353662.75
95-th percentile485628.7
Maximum511318
Range510909
Interquartile range (IQR)279143.75

Descriptive statistics

Standard deviation156071.17
Coefficient of variation (CV)0.61962946
Kurtosis-1.2449579
Mean251878.23
Median Absolute Deviation (MAD)130583.5
Skewness-0.12774103
Sum50375646
Variance2.4358211 × 1010
MonotonicityNot monotonic
2023-12-10T15:17:13.804349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
49693 2
 
1.0%
118832 2
 
1.0%
498133 2
 
1.0%
268880 2
 
1.0%
265930 2
 
1.0%
485265 2
 
1.0%
360227 2
 
1.0%
257379 2
 
1.0%
362889 1
 
0.5%
336890 1
 
0.5%
Other values (182) 182
91.0%
ValueCountFrequency (%)
409 1
0.5%
1905 1
0.5%
2910 1
0.5%
4591 1
0.5%
5516 1
0.5%
6275 1
0.5%
6789 1
0.5%
7212 1
0.5%
12096 1
0.5%
15693 1
0.5%
ValueCountFrequency (%)
511318 1
0.5%
511317 1
0.5%
510962 1
0.5%
508729 1
0.5%
505212 1
0.5%
498133 2
1.0%
498052 1
0.5%
494534 1
0.5%
492539 1
0.5%
485265 2
1.0%

TERMINAL_CLSS
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
시외
200 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시외
2nd row시외
3rd row시외
4th row시외
5th row시외

Common Values

ValueCountFrequency (%)
시외 200
100.0%

Length

2023-12-10T15:17:13.998559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:17:14.133421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시외 200
100.0%

ROUTE_CNT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing200
Missing (%)100.0%
Memory size1.9 KiB
Distinct193
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:17:14.582889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length20.365
Min length14

Characters and Unicode

Total characters4073
Distinct characters203
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique186 ?
Unique (%)93.0%

Sample

1st row서울특별시 광진구 구의동 546-1
2nd row전라남도 나주시 세지면 오봉리 727-173
3rd row강원도 동해시 천곡동 485-10
4th row경기도 가평군 가평읍 두밀리 742
5th row경기도 안성시 죽산면 장원리 716
ValueCountFrequency (%)
경기도 28
 
3.0%
경상북도 27
 
2.9%
강원도 26
 
2.8%
충청남도 23
 
2.4%
전라남도 23
 
2.4%
전라북도 15
 
1.6%
경상남도 12
 
1.3%
충청북도 12
 
1.3%
대전광역시 8
 
0.9%
가평군 7
 
0.7%
Other values (628) 758
80.7%
2023-12-10T15:17:15.287785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
739
 
18.1%
180
 
4.4%
1 177
 
4.3%
- 154
 
3.8%
138
 
3.4%
105
 
2.6%
102
 
2.5%
102
 
2.5%
3 89
 
2.2%
85
 
2.1%
Other values (193) 2202
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2395
58.8%
Decimal Number 785
 
19.3%
Space Separator 739
 
18.1%
Dash Punctuation 154
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
7.5%
138
 
5.8%
105
 
4.4%
102
 
4.3%
102
 
4.3%
85
 
3.5%
79
 
3.3%
75
 
3.1%
67
 
2.8%
59
 
2.5%
Other values (181) 1403
58.6%
Decimal Number
ValueCountFrequency (%)
1 177
22.5%
3 89
11.3%
2 83
10.6%
6 72
9.2%
7 72
9.2%
4 71
9.0%
5 64
 
8.2%
8 55
 
7.0%
9 54
 
6.9%
0 48
 
6.1%
Space Separator
ValueCountFrequency (%)
739
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 154
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2395
58.8%
Common 1678
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
7.5%
138
 
5.8%
105
 
4.4%
102
 
4.3%
102
 
4.3%
85
 
3.5%
79
 
3.3%
75
 
3.1%
67
 
2.8%
59
 
2.5%
Other values (181) 1403
58.6%
Common
ValueCountFrequency (%)
739
44.0%
1 177
 
10.5%
- 154
 
9.2%
3 89
 
5.3%
2 83
 
4.9%
6 72
 
4.3%
7 72
 
4.3%
4 71
 
4.2%
5 64
 
3.8%
8 55
 
3.3%
Other values (2) 102
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2395
58.8%
ASCII 1678
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
739
44.0%
1 177
 
10.5%
- 154
 
9.2%
3 89
 
5.3%
2 83
 
4.9%
6 72
 
4.3%
7 72
 
4.3%
4 71
 
4.2%
5 64
 
3.8%
8 55
 
3.3%
Other values (2) 102
 
6.1%
Hangul
ValueCountFrequency (%)
180
 
7.5%
138
 
5.8%
105
 
4.4%
102
 
4.3%
102
 
4.3%
85
 
3.5%
79
 
3.3%
75
 
3.1%
67
 
2.8%
59
 
2.5%
Other values (181) 1403
58.6%

HOUS_ID
Real number (ℝ)

HIGH CORRELATION 

Distinct192
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2377179 × 1018
Minimum1.1215103 × 1018
Maximum4.889045 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:17:15.545408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1215103 × 1018
5-th percentile2.7170102 × 1018
Q14.182031 × 1018
median4.4180246 × 1018
Q34.6792863 × 1018
95-th percentile4.8132768 × 1018
Maximum4.889045 × 1018
Range3.7675347 × 1018
Interquartile range (IQR)4.9725532 × 1017

Descriptive statistics

Standard deviation7.4323121 × 1017
Coefficient of variation (CV)0.17538478
Kurtosis7.311128
Mean4.2377179 × 1018
Median Absolute Deviation (MAD)2.5300471 × 1017
Skewness-2.6215074
Sum-1.0066548 × 1018
Variance5.5239264 × 1035
MonotonicityNot monotonic
2023-12-10T15:17:15.831844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4518011900000510001 2
 
1.0%
3017011200009080000 2
 
1.0%
4677025324016130001 2
 
1.0%
3014011500004650001 2
 
1.0%
3011011500000680002 2
 
1.0%
4825010900012640000 2
 
1.0%
2914011500000490001 2
 
1.0%
2717010200018560003 2
 
1.0%
4380031021001170015 1
 
0.5%
4773044036000570001 1
 
0.5%
Other values (182) 182
91.0%
ValueCountFrequency (%)
1121510300005460001 1
0.5%
1135010500005890006 1
0.5%
1165010800014460001 1
0.5%
1168010100008310000 1
0.5%
1171010700001500013 1
0.5%
1171010700003200008 1
0.5%
2671025021001240000 1
0.5%
2714010200003280001 1
0.5%
2714010200004810001 1
0.5%
2717010200018560003 2
1.0%
ValueCountFrequency (%)
4889045023008990005 1
0.5%
4888025024010050000 1
0.5%
4882025025004160002 1
0.5%
4872038021002240014 1
0.5%
4872033026004410031 1
0.5%
4831010900009790002 1
0.5%
4827039021001110019 1
0.5%
4825010900012640000 2
1.0%
4824035021000970002 1
0.5%
4812710600002670000 1
0.5%

PROC_CLSS
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
1
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 200
100.0%

Length

2023-12-10T15:17:16.053899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:17:16.230749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 200
100.0%

X_AXIS
Real number (ℝ)

Distinct192
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean371236.46
Minimum259130
Maximum535968
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:17:16.401677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum259130
5-th percentile274011.3
Q1318229.75
median351265.5
Q3432259.75
95-th percentile494784.25
Maximum535968
Range276838
Interquartile range (IQR)114030

Descriptive statistics

Standard deviation70505.442
Coefficient of variation (CV)0.18992058
Kurtosis-0.87320849
Mean371236.46
Median Absolute Deviation (MAD)53114.5
Skewness0.42652476
Sum74247291
Variance4.9710174 × 109
MonotonicityNot monotonic
2023-12-10T15:17:16.651822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
302812 2
 
1.0%
345841 2
 
1.0%
321167 2
 
1.0%
345249 2
 
1.0%
349711 2
 
1.0%
479594 2
 
1.0%
298151 2
 
1.0%
450317 2
 
1.0%
431816 1
 
0.5%
432104 1
 
0.5%
Other values (182) 182
91.0%
ValueCountFrequency (%)
259130 1
0.5%
261255 1
0.5%
261794 1
0.5%
263893 1
0.5%
267888 1
0.5%
269050 1
0.5%
269191 1
0.5%
272961 1
0.5%
273137 1
0.5%
273580 1
0.5%
ValueCountFrequency (%)
535968 1
0.5%
529831 1
0.5%
525967 1
0.5%
523995 1
0.5%
523301 1
0.5%
510676 1
0.5%
508826 1
0.5%
499581 1
0.5%
498720 1
0.5%
497772 1
0.5%

Y_AXIS
Real number (ℝ)

HIGH CORRELATION 

Distinct192
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean425622.15
Minimum201664
Maximum654421
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:17:16.938560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201664
5-th percentile254336.75
Q1343833.5
median421704
Q3514789.75
95-th percentile580470.45
Maximum654421
Range452757
Interquartile range (IQR)170956.25

Descriptive statistics

Standard deviation107922.41
Coefficient of variation (CV)0.2535639
Kurtosis-0.90383956
Mean425622.15
Median Absolute Deviation (MAD)91515
Skewness-0.13554869
Sum85124429
Variance1.1647247 × 1010
MonotonicityNot monotonic
2023-12-10T15:17:17.173637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
322252 2
 
1.0%
418070 2
 
1.0%
215368 2
 
1.0%
412641 2
 
1.0%
416881 2
 
1.0%
292611 2
 
1.0%
285206 2
 
1.0%
365176 2
 
1.0%
480141 1
 
0.5%
427481 1
 
0.5%
Other values (182) 182
91.0%
ValueCountFrequency (%)
201664 1
0.5%
206644 1
0.5%
207405 1
0.5%
215368 2
1.0%
219172 1
0.5%
223503 1
0.5%
231322 1
0.5%
237217 1
0.5%
245041 1
0.5%
254826 1
0.5%
ValueCountFrequency (%)
654421 1
0.5%
617513 1
0.5%
616901 1
0.5%
612947 1
0.5%
612832 1
0.5%
605753 1
0.5%
593750 1
0.5%
585189 1
0.5%
584040 1
0.5%
583785 1
0.5%
Distinct198
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:17:17.525722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length7.09
Min length4

Characters and Unicode

Total characters1418
Distinct characters148
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)98.0%

Sample

1st row동서울고속버스터미널
2nd row동창공용버스터미널
3rd row동해시외버스터미널
4th row두밀리정류소
5th row두원공대
ValueCountFrequency (%)
광주종합버스터미널 2
 
1.0%
남부터미널 2
 
1.0%
내장산공용버스터미널 1
 
0.5%
나로도공용터미널 1
 
0.5%
동서울고속버스터미널 1
 
0.5%
노원역정류소 1
 
0.5%
녹동신항정류소 1
 
0.5%
녹동터미널 1
 
0.5%
녹전정류소 1
 
0.5%
논티정류소 1
 
0.5%
Other values (188) 188
94.0%
2023-12-10T15:17:18.082459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
117
 
8.3%
113
 
8.0%
93
 
6.6%
84
 
5.9%
79
 
5.6%
79
 
5.6%
75
 
5.3%
75
 
5.3%
44
 
3.1%
40
 
2.8%
Other values (138) 619
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1418
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
 
8.3%
113
 
8.0%
93
 
6.6%
84
 
5.9%
79
 
5.6%
79
 
5.6%
75
 
5.3%
75
 
5.3%
44
 
3.1%
40
 
2.8%
Other values (138) 619
43.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1418
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
 
8.3%
113
 
8.0%
93
 
6.6%
84
 
5.9%
79
 
5.6%
79
 
5.6%
75
 
5.3%
75
 
5.3%
44
 
3.1%
40
 
2.8%
Other values (138) 619
43.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1418
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
117
 
8.3%
113
 
8.0%
93
 
6.6%
84
 
5.9%
79
 
5.6%
79
 
5.6%
75
 
5.3%
75
 
5.3%
44
 
3.1%
40
 
2.8%
Other values (138) 619
43.7%

Interactions

2023-12-10T15:17:03.589434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:51.197493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:57.554254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:59.399603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:01.616857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:04.816145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:53.356105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:58.845325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:00.522865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:02.987756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:04.937490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:54.305824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:58.980713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:00.652394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:03.153276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:05.094972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:55.576890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:59.139148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:01.255741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:03.317575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:05.283114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:56.587204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:16:59.269226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:01.480890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:17:03.461564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:17:18.241370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
BLD_CDBLK_CD_V4HOUS_IDX_AXISY_AXIS
BLD_CD1.0000.6251.0000.4980.721
BLK_CD_V40.6251.0000.5330.6010.653
HOUS_ID1.0000.5331.0000.4800.710
X_AXIS0.4980.6010.4801.0000.467
Y_AXIS0.7210.6530.7100.4671.000
2023-12-10T15:17:18.427744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
BLD_CDBLK_CD_V4HOUS_IDX_AXISY_AXIS
BLD_CD1.000-0.0361.0000.205-0.557
BLK_CD_V4-0.0361.000-0.0820.1490.127
HOUS_ID1.000-0.0821.0000.167-0.586
X_AXIS0.2050.1490.1671.0000.154
Y_AXIS-0.5570.127-0.5860.1541.000

Missing values

2023-12-10T15:17:05.590050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:17:05.940004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-10T15:17:06.139369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

TERMINAL_CDTERMINAL_NMBLD_CDEDIT_ADDREDIT_NEWADDRBLK_CD_V4TERMINAL_CLSSROUTE_CNTADDRESSHOUS_IDPROC_CLSSX_AXISY_AXISEDIT_TERMINAL_NM
0E00207동서울고속버스터미널(동서울종합터미널)1121510300105460001009810서울특별시 광진구 구의동 546-1번지서울특별시 광진구 강변역로 50211864시외<NA>서울특별시 광진구 구의동 546-111215103000054600011320140548421동서울고속버스터미널
1E00209동창공용버스터미널(세지정류소)4617031021107270173097938전라남도 나주시 세지면 오봉리 727-173번지전라남도 나주시 세지면 동창로 14262753시외<NA>전라남도 나주시 세지면 오봉리 727-17346170310210072701731285827258757동창공용버스터미널
2E00210동해시외버스터미널<NA>강원도 동해시 천곡동 485-10번지<NA>184778시외<NA>강원도 동해시 천곡동 485-1042170101000048500101497772547797동해시외버스터미널
3E00211두밀리정류소4182025026107420000002868경기도 가평군 가평읍 두밀리 742번지경기도 가평군 가평읍 태봉두밀로 598303267시외<NA>경기도 가평군 가평읍 두밀리 74241820250260074200001350717578877두밀리정류소
4E00212두원공대<NA>경기도 안성시 죽산면 장원리 716번지<NA>511317시외<NA>경기도 안성시 죽산면 장원리 71641550400230071600001348840496340두원공대
5E00213두촌정류소4272032024108170001010797강원도 홍천군 두촌면 자은리 817-1번지강원도 홍천군 두촌면 자은로 370307710시외<NA>강원도 홍천군 두촌면 자은리 817-142720320240081700011402052585189두촌정류소
6E00214둔내버스터미널4273033022100890007010748강원도 횡성군 둔내면 둔방내리 89-7번지강원도 횡성군 둔내면 경강로둔방10길 12418386시외<NA>강원도 횡성군 둔내면 둔방내리 89-742730330220008900071419036545734둔내버스터미널
7E00215둔산시외버스정류소(고속)3017011200109080000018415대전광역시 서구 둔산동 908번지대전광역시 서구 청사로 281118832시외<NA>대전광역시 서구 둔산2동 90830170112000090800001345841418070둔산시외버스정류소
8E00216둔산시외버스정류장3017011200109080000018415대전광역시 서구 둔산동 908번지대전광역시 서구 청사로 281118832시외<NA>대전광역시 서구 둔산2동 90830170112000090800001345841418070둔산시외버스정류장
9E00217둔포정류소4420036021104770008068879충청남도 아산시 둔포면 둔포리 477-8번지충청남도 아산시 둔포면 둔포중앙로 115441183시외<NA>충청남도 아산시 둔포면 둔포리 477-844200360210047700081314559481407둔포정류소
TERMINAL_CDTERMINAL_NMBLD_CDEDIT_ADDREDIT_NEWADDRBLK_CD_V4TERMINAL_CLSSROUTE_CNTADDRESSHOUS_IDPROC_CLSSX_AXISY_AXISEDIT_TERMINAL_NM
190E00474영산포터미널(영산포공용터미널)4617012800101910000017217전라남도 나주시 이창동 191번지전라남도 나주시 예향로 380363733시외<NA>전라남도 나주시 이창동 19146170128000019100001282691266986영산포터미널
191E00475영암고속터미널(영암버스터미널)4683025025100040001044503전라남도 영암군 영암읍 남풍리 4-1번지전라남도 영암군 영암읍 동문로 868200시외<NA>전라남도 영암군 영암읍 남풍리 4-146830250250000400011281492245041영암고속터미널
192E00477영월시외버스터미널4275025022101870006018617강원도 영월군 영월읍 하송리 187-1번지강원도 영월군 영월읍 중앙1로 2368035시외<NA>강원도 영월군 영월읍 하송리 187-142750250220018700011441506509125영월시외버스터미널
193E00482영해버스정류장4777036030106370001028935경상북도 영덕군 영해면 성내리 637-1번지경상북도 영덕군 영해면 예주시장길 5337756시외<NA>경상북도 영덕군 영해면 성내리 637-147770360300063700011525967438433영해버스정류장
194E00483예미정류소4277025921106910004002482강원도 정선군 신동읍 예미리 691-4번지강원도 정선군 신동읍 의림로 286187222시외<NA>강원도 정선군 신동읍 예미리 691-442770259210069100041457383513025예미정류소
195E00485예천시외버스공용정류장(예천버스터미널)4790025027100650002049512경상북도 예천군 예천읍 대심리 66-1번지경상북도 예천군 예천읍 충효로 165442748시외<NA>경상북도 예천군 예천읍 대심리 66-147900250270006600011439826449785예천시외버스공용정류장
196E00486오산시외버스터미널<NA>경기도 오산시 오산동 603-1번지<NA>141753시외<NA>경기도 오산시 오산동 603-141370101000060300011317335506078오산시외버스터미널
197E00488오수공용버스터미널(오수공용정류장)4575035521103620001127739전라북도 임실군 오수면 오수리 362-1번지전라북도 임실군 오수면 오수로 173325008시외<NA>전라북도 임실군 오수면 오수리 362-145750355210036200011339124326892오수공용버스터미널
198E00489오음리정류소<NA>강원도 화천군 간동면 오음리 612-9번지<NA>309149시외<NA>강원도 화천군 간동면 오음리 612-942790310240061200091384153605753오음리정류소
199E00490오창시외버스정류소4371025345008210004004630충청북도 청주시 청원구 오창읍 양청리 821-4번지충청북도 청주시 청원구 오창읍 중심상업로 7442760시외<NA>충청북도 청주시 청원구 오창읍 중심상업로 743114253450082100041349154456841오창시외버스정류소