Overview

Dataset statistics

Number of variables17
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory55.2 KiB
Average record size in memory141.3 B

Variable types

Categorical7
Text5
Numeric5

Alerts

"서울" has constant value ""Constant
"" has constant value ""Constant
"".1 has constant value ""Constant
"".2 has constant value ""Constant
"421" is highly overall correlated with "1808" and 1 other fieldsHigh correlation
"간선" is highly overall correlated with "421" and 1 other fieldsHigh correlation
"1808" is highly overall correlated with "421" and 1 other fieldsHigh correlation
311938 is highly overall correlated with 127.0014981High correlation
547522 is highly overall correlated with 37.5256914High correlation
127.0014981 is highly overall correlated with 311938High correlation
37.5256914 is highly overall correlated with 547522High correlation
"간선" is highly imbalanced (90.3%)Imbalance
18 has 5 (1.2%) zerosZeros

Reproduction

Analysis started2023-12-10 06:40:14.314158
Analysis finished2023-12-10 06:40:20.650085
Duration6.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"서울"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"서울"
400 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"서울"
2nd row"서울"
3rd row"서울"
4th row"서울"
5th row"서울"

Common Values

ValueCountFrequency (%)
"서울" 400
100.0%

Length

2023-12-10T15:40:20.765976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:20.920419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울 400
100.0%

"421"
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"421"
100 
"507"
92 
"463"
83 
"104"
58 
"604"
49 
Other values (2)
18 

Length

Max length5
Median length5
Mean length4.9875
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"421"
2nd row"421"
3rd row"421"
4th row"421"
5th row"421"

Common Values

ValueCountFrequency (%)
"421" 100
25.0%
"507" 92
23.0%
"463" 83
20.8%
"104" 58
14.5%
"604" 49
12.2%
"143" 13
 
3.2%
"02" 5
 
1.2%

Length

2023-12-10T15:40:21.079753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:21.224331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
421 100
25.0%
507 92
23.0%
463 83
20.8%
104 58
14.5%
604 49
12.2%
143 13
 
3.2%
02 5
 
1.2%

""
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
""
400 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row""
2nd row""
3rd row""
4th row""
5th row""

Common Values

ValueCountFrequency (%)
"" 400
100.0%

Length

2023-12-10T15:40:21.364897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:21.527777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
400
100.0%
Distinct105
Distinct (%)26.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:40:21.868490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters4000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)5.8%

Sample

1st row"11170650"
2nd row"11170650"
3rd row"11170650"
4th row"11170520"
5th row"11170530"
ValueCountFrequency (%)
11140590 22
 
5.5%
11140570 19
 
4.8%
11140540 18
 
4.5%
11680640 10
 
2.5%
11170530 9
 
2.2%
11170625 9
 
2.2%
11140550 9
 
2.2%
11440565 9
 
2.2%
11650652 9
 
2.2%
11305645 7
 
1.8%
Other values (95) 279
69.8%
2023-12-10T15:40:22.383250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1000
25.0%
" 800
20.0%
0 714
17.8%
5 510
12.8%
6 314
 
7.8%
4 241
 
6.0%
2 118
 
2.9%
7 103
 
2.6%
3 71
 
1.8%
9 67
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3200
80.0%
Other Punctuation 800
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1000
31.2%
0 714
22.3%
5 510
15.9%
6 314
 
9.8%
4 241
 
7.5%
2 118
 
3.7%
7 103
 
3.2%
3 71
 
2.2%
9 67
 
2.1%
8 62
 
1.9%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1000
25.0%
" 800
20.0%
0 714
17.8%
5 510
12.8%
6 314
 
7.8%
4 241
 
6.0%
2 118
 
2.9%
7 103
 
2.6%
3 71
 
1.8%
9 67
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1000
25.0%
" 800
20.0%
0 714
17.8%
5 510
12.8%
6 314
 
7.8%
4 241
 
6.0%
2 118
 
2.9%
7 103
 
2.6%
3 71
 
1.8%
9 67
 
1.7%
Distinct322
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:40:22.831865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.7725
Min length6

Characters and Unicode

Total characters3109
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique276 ?
Unique (%)69.0%

Sample

1st row"209258"
2nd row"209224"
3rd row"209186"
4th row"208581"
5th row"208711"
ValueCountFrequency (%)
206648 7
 
1.8%
206775 5
 
1.2%
206778 5
 
1.2%
206749 5
 
1.2%
207208 5
 
1.2%
206582 5
 
1.2%
206743 4
 
1.0%
206789 4
 
1.0%
206799 4
 
1.0%
207056 3
 
0.8%
Other values (312) 353
88.2%
2023-12-10T15:40:23.544698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
25.7%
2 522
16.8%
1 271
 
8.7%
0 248
 
8.0%
7 237
 
7.6%
6 203
 
6.5%
3 198
 
6.4%
9 172
 
5.5%
8 165
 
5.3%
4 150
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2309
74.3%
Other Punctuation 800
 
25.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 522
22.6%
1 271
11.7%
0 248
10.7%
7 237
10.3%
6 203
 
8.8%
3 198
 
8.6%
9 172
 
7.4%
8 165
 
7.1%
4 150
 
6.5%
5 143
 
6.2%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3109
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
" 800
25.7%
2 522
16.8%
1 271
 
8.7%
0 248
 
8.0%
7 237
 
7.6%
6 203
 
6.5%
3 198
 
6.4%
9 172
 
5.5%
8 165
 
5.3%
4 150
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3109
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
25.7%
2 522
16.8%
1 271
 
8.7%
0 248
 
8.0%
7 237
 
7.6%
6 203
 
6.5%
3 198
 
6.4%
9 172
 
5.5%
8 165
 
5.3%
4 150
 
4.8%

"1808"
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"1808"
100 
"1406"
92 
"987"
83 
"771"
58 
"1169"
49 
Other values (2)
18 

Length

Max length6
Median length6
Mean length5.6025
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"1808"
2nd row"1808"
3rd row"1808"
4th row"1808"
5th row"1808"

Common Values

ValueCountFrequency (%)
"1808" 100
25.0%
"1406" 92
23.0%
"987" 83
20.8%
"771" 58
14.5%
"1169" 49
12.2%
"872" 13
 
3.2%
"715" 5
 
1.2%

Length

2023-12-10T15:40:23.782740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:24.049214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1808 100
25.0%
1406 92
23.0%
987 83
20.8%
771 58
14.5%
1169 49
12.2%
872 13
 
3.2%
715 5
 
1.2%

"988"
Text

Distinct300
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:40:24.651462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length5.97
Min length5

Characters and Unicode

Total characters2388
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)58.8%

Sample

1st row"877"
2nd row"877"
3rd row"868"
4th row"623"
5th row"649"
ValueCountFrequency (%)
375 6
 
1.5%
361 5
 
1.2%
368 5
 
1.2%
334 5
 
1.2%
392 5
 
1.2%
347 5
 
1.2%
359 4
 
1.0%
402 4
 
1.0%
365 4
 
1.0%
373 4
 
1.0%
Other values (290) 353
88.2%
2023-12-10T15:40:25.596204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
33.5%
1 325
13.6%
3 198
 
8.3%
4 172
 
7.2%
2 144
 
6.0%
7 135
 
5.7%
5 131
 
5.5%
9 128
 
5.4%
8 121
 
5.1%
0 118
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1588
66.5%
Other Punctuation 800
33.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 325
20.5%
3 198
12.5%
4 172
10.8%
2 144
9.1%
7 135
8.5%
5 131
8.2%
9 128
 
8.1%
8 121
 
7.6%
0 118
 
7.4%
6 116
 
7.3%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2388
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
" 800
33.5%
1 325
13.6%
3 198
 
8.3%
4 172
 
7.2%
2 144
 
6.0%
7 135
 
5.7%
5 131
 
5.5%
9 128
 
5.4%
8 121
 
5.1%
0 118
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2388
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
33.5%
1 325
13.6%
3 198
 
8.3%
4 172
 
7.2%
2 144
 
6.0%
7 135
 
5.7%
5 131
 
5.5%
9 128
 
5.4%
8 121
 
5.1%
0 118
 
4.9%

"".1
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
""
400 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row""
2nd row""
3rd row""
4th row""
5th row""

Common Values

ValueCountFrequency (%)
"" 400
100.0%

Length

2023-12-10T15:40:25.858038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:26.048702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
400
100.0%

"".2
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
""
400 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row""
2nd row""
3rd row""
4th row""
5th row""

Common Values

ValueCountFrequency (%)
"" 400
100.0%

Length

2023-12-10T15:40:26.246037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:26.440488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
400
100.0%

18
Real number (ℝ)

ZEROS 

Distinct113
Distinct (%)28.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.955
Minimum0
Maximum113
Zeros5
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:40:26.656430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q119
median40
Q366
95-th percentile96.05
Maximum113
Range113
Interquartile range (IQR)47

Descriptive statistics

Standard deviation29.617325
Coefficient of variation (CV)0.67381015
Kurtosis-0.85477447
Mean43.955
Median Absolute Deviation (MAD)23
Skewness0.40345306
Sum17582
Variance877.18594
MonotonicityNot monotonic
2023-12-10T15:40:26.918287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 6
 
1.5%
2 6
 
1.5%
24 6
 
1.5%
7 6
 
1.5%
4 6
 
1.5%
9 6
 
1.5%
21 6
 
1.5%
14 5
 
1.2%
51 5
 
1.2%
52 5
 
1.2%
Other values (103) 343
85.8%
ValueCountFrequency (%)
0 5
1.2%
1 5
1.2%
2 6
1.5%
3 6
1.5%
4 6
1.5%
5 5
1.2%
6 5
1.2%
7 6
1.5%
8 5
1.2%
9 6
1.5%
ValueCountFrequency (%)
113 1
0.2%
112 1
0.2%
111 1
0.2%
110 1
0.2%
109 1
0.2%
108 1
0.2%
107 1
0.2%
106 1
0.2%
105 1
0.2%
103 1
0.2%
Distinct253
Distinct (%)63.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:40:27.376953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length9.805
Min length4

Characters and Unicode

Total characters3922
Distinct characters291
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)38.0%

Sample

1st row"한국폴리텍1대학"
2nd row"이태원역.보광동입구"
3rd row"이태원역1번출구.해밀턴호텔"
4th row"녹사평역"
5th row"전쟁기념관"
ValueCountFrequency (%)
퇴계로3가.한옥마을.한국의집 9
 
2.2%
남대문시장.회현역 8
 
2.0%
퇴계로6가 6
 
1.5%
충무로역8번출구.대한극장앞 5
 
1.2%
퇴계로5가.제일병원 5
 
1.2%
명동입구 5
 
1.2%
한겨레신문사 4
 
1.0%
충무로역2번출구.대한극장앞 4
 
1.0%
광희문.광희동사거리 4
 
1.0%
숙대입구역 4
 
1.0%
Other values (246) 351
86.7%
2023-12-10T15:40:28.107373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
 
20.4%
. 138
 
3.5%
103
 
2.6%
96
 
2.4%
84
 
2.1%
75
 
1.9%
70
 
1.8%
61
 
1.6%
52
 
1.3%
50
 
1.3%
Other values (281) 2393
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2867
73.1%
Other Punctuation 938
 
23.9%
Decimal Number 81
 
2.1%
Uppercase Letter 25
 
0.6%
Space Separator 5
 
0.1%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
103
 
3.6%
96
 
3.3%
84
 
2.9%
75
 
2.6%
70
 
2.4%
61
 
2.1%
52
 
1.8%
50
 
1.7%
50
 
1.7%
49
 
1.7%
Other values (258) 2177
75.9%
Uppercase Letter
ValueCountFrequency (%)
T 7
28.0%
K 3
12.0%
G 3
12.0%
S 3
12.0%
C 2
 
8.0%
V 2
 
8.0%
L 2
 
8.0%
E 1
 
4.0%
A 1
 
4.0%
B 1
 
4.0%
Decimal Number
ValueCountFrequency (%)
3 18
22.2%
2 14
17.3%
5 12
14.8%
1 12
14.8%
4 11
13.6%
6 6
 
7.4%
8 5
 
6.2%
9 3
 
3.7%
Other Punctuation
ValueCountFrequency (%)
" 800
85.3%
. 138
 
14.7%
Space Separator
ValueCountFrequency (%)
5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2867
73.1%
Common 1030
 
26.3%
Latin 25
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
103
 
3.6%
96
 
3.3%
84
 
2.9%
75
 
2.6%
70
 
2.4%
61
 
2.1%
52
 
1.8%
50
 
1.7%
50
 
1.7%
49
 
1.7%
Other values (258) 2177
75.9%
Common
ValueCountFrequency (%)
" 800
77.7%
. 138
 
13.4%
3 18
 
1.7%
2 14
 
1.4%
5 12
 
1.2%
1 12
 
1.2%
4 11
 
1.1%
6 6
 
0.6%
5
 
0.5%
8 5
 
0.5%
Other values (3) 9
 
0.9%
Latin
ValueCountFrequency (%)
T 7
28.0%
K 3
12.0%
G 3
12.0%
S 3
12.0%
C 2
 
8.0%
V 2
 
8.0%
L 2
 
8.0%
E 1
 
4.0%
A 1
 
4.0%
B 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2867
73.1%
ASCII 1055
 
26.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
75.8%
. 138
 
13.1%
3 18
 
1.7%
2 14
 
1.3%
5 12
 
1.1%
1 12
 
1.1%
4 11
 
1.0%
T 7
 
0.7%
6 6
 
0.6%
5
 
0.5%
Other values (13) 32
 
3.0%
Hangul
ValueCountFrequency (%)
103
 
3.6%
96
 
3.3%
84
 
2.9%
75
 
2.6%
70
 
2.4%
61
 
2.1%
52
 
1.8%
50
 
1.7%
50
 
1.7%
49
 
1.7%
Other values (258) 2177
75.9%

311938
Real number (ℝ)

HIGH CORRELATION 

Distinct329
Distinct (%)82.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean310289.1
Minimum297433
Maximum316216
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:40:28.370507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum297433
5-th percentile303059.45
Q1307620.5
median311677
Q3313855
95-th percentile315251.9
Maximum316216
Range18783
Interquartile range (IQR)6234.5

Descriptive statistics

Standard deviation4490.0286
Coefficient of variation (CV)0.014470468
Kurtosis-0.17834535
Mean310289.1
Median Absolute Deviation (MAD)2617.5
Skewness-0.8718532
Sum1.2411564 × 108
Variance20160357
MonotonicityNot monotonic
2023-12-10T15:40:28.617961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
311121 5
 
1.2%
311537 5
 
1.2%
311844 5
 
1.2%
309933 5
 
1.2%
310538 5
 
1.2%
311159 4
 
1.0%
310464 4
 
1.0%
311401 4
 
1.0%
308558 3
 
0.8%
312180 3
 
0.8%
Other values (319) 357
89.2%
ValueCountFrequency (%)
297433 1
0.2%
297473 1
0.2%
297528 1
0.2%
297535 1
0.2%
297591 1
0.2%
297814 1
0.2%
298150 1
0.2%
298375 1
0.2%
298630 1
0.2%
299500 1
0.2%
ValueCountFrequency (%)
316216 2
0.5%
316173 1
0.2%
316014 1
0.2%
315995 1
0.2%
315938 1
0.2%
315791 1
0.2%
315776 1
0.2%
315751 1
0.2%
315665 2
0.5%
315506 1
0.2%

547522
Real number (ℝ)

HIGH CORRELATION 

Distinct331
Distinct (%)82.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean549291.02
Minimum537626
Maximum561101
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:40:28.847816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum537626
5-th percentile540901
Q1546074.25
median550144
Q3551695
95-th percentile558144.9
Maximum561101
Range23475
Interquartile range (IQR)5620.75

Descriptive statistics

Standard deviation4907.8886
Coefficient of variation (CV)0.0089349515
Kurtosis-0.046141954
Mean549291.02
Median Absolute Deviation (MAD)2203
Skewness-0.055134845
Sum2.1971641 × 108
Variance24087370
MonotonicityNot monotonic
2023-12-10T15:40:29.074959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
551549 5
 
1.2%
551176 5
 
1.2%
551470 5
 
1.2%
551427 5
 
1.2%
551600 5
 
1.2%
551438 4
 
1.0%
551478 4
 
1.0%
551446 4
 
1.0%
551174 3
 
0.8%
551626 3
 
0.8%
Other values (321) 357
89.2%
ValueCountFrequency (%)
537626 1
0.2%
537891 1
0.2%
538003 1
0.2%
538417 1
0.2%
538875 1
0.2%
538957 1
0.2%
539076 1
0.2%
539084 1
0.2%
539342 1
0.2%
539370 1
0.2%
ValueCountFrequency (%)
561101 1
0.2%
561088 1
0.2%
560967 1
0.2%
560902 1
0.2%
560850 1
0.2%
560839 1
0.2%
560782 1
0.2%
560761 1
0.2%
560356 1
0.2%
559819 1
0.2%
Distinct334
Distinct (%)83.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:40:29.679535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9775
Min length7

Characters and Unicode

Total characters3191
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique293 ?
Unique (%)73.2%

Sample

1st row"166682"
2nd row"105094"
3rd row"105070"
4th row"104949"
5th row"104780"
ValueCountFrequency (%)
104965 5
 
1.2%
105128 5
 
1.2%
105055 5
 
1.2%
104797 5
 
1.2%
105197 5
 
1.2%
105107 4
 
1.0%
104953 4
 
1.0%
105060 4
 
1.0%
105188 3
 
0.8%
105276 3
 
0.8%
Other values (324) 357
89.2%
2023-12-10T15:40:30.830133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
25.1%
1 542
17.0%
0 425
13.3%
5 236
 
7.4%
6 231
 
7.2%
2 205
 
6.4%
4 189
 
5.9%
8 154
 
4.8%
7 146
 
4.6%
9 140
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2391
74.9%
Other Punctuation 800
 
25.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 542
22.7%
0 425
17.8%
5 236
9.9%
6 231
9.7%
2 205
 
8.6%
4 189
 
7.9%
8 154
 
6.4%
7 146
 
6.1%
9 140
 
5.9%
3 123
 
5.1%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3191
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
" 800
25.1%
1 542
17.0%
0 425
13.3%
5 236
 
7.4%
6 231
 
7.2%
2 205
 
6.4%
4 189
 
5.9%
8 154
 
4.8%
7 146
 
4.6%
9 140
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3191
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
25.1%
1 542
17.0%
0 425
13.3%
5 236
 
7.4%
6 231
 
7.2%
2 205
 
6.4%
4 189
 
5.9%
8 154
 
4.8%
7 146
 
4.6%
9 140
 
4.4%

127.0014981
Real number (ℝ)

HIGH CORRELATION 

Distinct334
Distinct (%)83.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.98264
Minimum126.83722
Maximum127.05057
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:40:31.075467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.83722
5-th percentile126.90119
Q1126.95245
median126.99799
Q3127.02217
95-th percentile127.03958
Maximum127.05057
Range0.2133497
Interquartile range (IQR)0.069716225

Descriptive statistics

Standard deviation0.050575756
Coefficient of variation (CV)0.00039828874
Kurtosis-0.16085329
Mean126.98264
Median Absolute Deviation (MAD)0.03018225
Skewness-0.86433307
Sum50793.056
Variance0.0025579071
MonotonicityNot monotonic
2023-12-10T15:40:31.295151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.985185 5
 
1.2%
126.9783688 5
 
1.2%
126.9999448 5
 
1.2%
126.9964764 5
 
1.2%
126.9917779 5
 
1.2%
126.9949458 4
 
1.0%
126.9843452 4
 
1.0%
126.9922118 4
 
1.0%
127.0038267 3
 
0.8%
126.9998172 3
 
0.8%
Other values (324) 357
89.2%
ValueCountFrequency (%)
126.8372247 1
0.2%
126.8377669 1
0.2%
126.8383419 1
0.2%
126.8384862 1
0.2%
126.8389842 1
0.2%
126.8414791 1
0.2%
126.845197 1
0.2%
126.8476914 1
0.2%
126.8505273 1
0.2%
126.8603672 1
0.2%
ValueCountFrequency (%)
127.0505744 2
0.5%
127.0501015 1
0.2%
127.0482624 1
0.2%
127.0480501 1
0.2%
127.0474759 1
0.2%
127.0457703 1
0.2%
127.0455426 1
0.2%
127.0452513 1
0.2%
127.0444182 2
0.5%
127.0424023 1
0.2%

37.5256914
Real number (ℝ)

HIGH CORRELATION 

Distinct333
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.541459
Minimum37.435654
Maximum37.648144
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:40:31.512435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.435654
5-th percentile37.46635
Q137.511991
median37.549042
Q337.563312
95-th percentile37.621568
Maximum37.648144
Range0.2124905
Interquartile range (IQR)0.05132075

Descriptive statistics

Standard deviation0.044380379
Coefficient of variation (CV)0.0011821698
Kurtosis-0.044491209
Mean37.541459
Median Absolute Deviation (MAD)0.02002165
Skewness-0.054688218
Sum15016.584
Variance0.0019696181
MonotonicityNot monotonic
2023-12-10T15:40:31.745692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5607384 5
 
1.2%
37.5584181 5
 
1.2%
37.562423 5
 
1.2%
37.5619341 5
 
1.2%
37.5611823 5
 
1.2%
37.5612813 4
 
1.0%
37.5609024 4
 
1.0%
37.5608976 4
 
1.0%
37.5633117 3
 
0.8%
37.5626562 3
 
0.8%
Other values (323) 357
89.2%
ValueCountFrequency (%)
37.4356535 1
0.2%
37.438042 1
0.2%
37.4390472 1
0.2%
37.4427869 1
0.2%
37.4469074 1
0.2%
37.4476491 1
0.2%
37.4487282 1
0.2%
37.4487967 1
0.2%
37.451139 1
0.2%
37.4513934 1
0.2%
ValueCountFrequency (%)
37.648144 1
0.2%
37.6480289 1
0.2%
37.6469012 1
0.2%
37.6463035 1
0.2%
37.645827 1
0.2%
37.645729 1
0.2%
37.6452888 1
0.2%
37.6450981 1
0.2%
37.6414603 1
0.2%
37.6366255 1
0.2%

"간선"
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"간선"
395 
"순환"
 
5

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"간선"
2nd row"간선"
3rd row"간선"
4th row"간선"
5th row"간선"

Common Values

ValueCountFrequency (%)
"간선" 395
98.8%
"순환" 5
 
1.2%

Length

2023-12-10T15:40:31.939511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:40:32.070003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
간선 395
98.8%
순환 5
 
1.2%

Interactions

2023-12-10T15:40:19.358348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:15.358680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:16.095314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:17.951872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.685969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.484613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:15.499902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:16.247571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.089993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.816415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.641306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:15.660616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:17.466319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.229412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.966136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.796863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:15.802811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:17.616685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.382328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.104538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.947020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:15.962262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:17.778658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:18.539955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:40:19.251676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:40:32.183509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"421""1808"18311938547522127.001498137.5256914"간선"
"421"1.0001.0000.4020.6870.6880.6840.6871.000
"1808"1.0001.0000.4020.6870.6880.6840.6871.000
180.4020.4021.0000.6460.7480.6460.7440.161
3119380.6870.6870.6461.0000.7950.9990.7980.265
5475220.6880.6880.7480.7951.0000.7871.0000.059
127.00149810.6840.6840.6460.9990.7871.0000.7890.244
37.52569140.6870.6870.7440.7981.0000.7891.0000.078
"간선"1.0001.0000.1610.2650.0590.2440.0781.000
2023-12-10T15:40:32.372177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"421""간선""1808"
"421"1.0000.9941.000
"간선"0.9941.0000.994
"1808"1.0000.9941.000
2023-12-10T15:40:32.517389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
18311938547522127.001498137.5256914"421""1808""간선"
181.0000.073-0.0800.074-0.0780.2160.2160.122
3119380.0731.0000.2681.0000.2760.4380.4380.201
547522-0.0800.2681.0000.2611.0000.4390.4390.045
127.00149810.0741.0000.2611.0000.2680.4330.4330.185
37.5256914-0.0780.2761.0000.2681.0000.4360.4360.059
"421"0.2160.4380.4390.4330.4361.0001.0000.994
"1808"0.2160.4380.4390.4330.4361.0001.0000.994
"간선"0.1220.2010.0450.1850.0590.9940.9941.000

Missing values

2023-12-10T15:40:20.168493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:40:20.501272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"서울""421""""11170700""166342""1808""988""".1"".218"보광동신동아아파트"311938547522"166679"127.001498137.5256914"간선"
0"서울""421""""11170650""209258""1808""877"""""20"한국폴리텍1대학"311502548068"166682"126.996537.530569"간선"
1"서울""421""""11170650""209224""1808""877"""""21"이태원역.보광동입구"311307548429"105094"126.99425137.533803"간선"
2"서울""421""""11170650""209186""1808""868"""""22"이태원역1번출구.해밀턴호텔"311202548514"105070"126.99305237.534558"간선"
3"서울""421""""11170520""208581""1808""623"""""24"녹사평역"310472548578"104949"126.98478637.535064"간선"
4"서울""421""""11170530""208711""1808""649"""""25"전쟁기념관"309769548553"104780"126.97683637.53477"간선"
5"서울""421""""11170625""209119""1808""778"""""26"삼각지역11번출구"309571548735"166689"126.97457337.536391"간선"
6"서울""421""""11170530""159587""1808""649"""""27"숙대입구역"309397549500"126812"126.9725137.543266"간선"
7"서울""421""""11170530""208664""1808""651"""""28"갈월동"309376550357"126805"126.97216737.550985"간선"
8"서울""421""""11140540""206477""1808""339"""""29"서울스퀘어앞"309484550840"81018"126.97332937.555347"간선"
9"서울""421""""11140540""206582""1808""347"""""30"남대문시장.회현역"309933551176"104797"126.97836937.558418"간선"
"서울""421""""11170700""166342""1808""988""".1"".218"보광동신동아아파트"311938547522"166679"127.001498137.5256914"간선"
390"서울""143""""11290650""217041""872""3752"""""3"KEB.하나은행.정릉중앙지점"312652557048"105379"127.00843937.611583"간선"
391"서울""143""""11290650""217070""872""3748"""""4"정릉시장입구"312766556717"105419"127.0097737.608612"간선"
392"서울""143""""11290630""216627""872""3676"""""5"정릉우체국앞"312907556331"105486"127.01141337.605148"간선"
393"서울""143""""11290620""216198""872""3664"""""6"숭덕초교"313193556124"105654"127.01467637.60331"간선"
394"서울""143""""11290620""216219""872""3659"""""7"정릉길음시장"313597555994"105831"127.01926637.602177"간선"
395"서울""143""""11290580""215217""872""3522"""""8"미아리고개.미아리예술극장"313813555615"105934"127.02175737.598782"간선"
396"서울""143""""11290575""218356""872""4148"""""9"돈암사거리.성신여대입구"313515555084"105790"127.01844537.59397"간선"
397"서울""143""""11290555""218141""872""4086"""""10"삼선교.한성대학교"312692554631"105395"127.00918137.589811"간선"
398"서울""143""""11110650""344214""872""229"""""11"혜화동로터리"312022554229"151977"127.00164437.586126"간선"
399"서울""143""""11110650""61780""872""234"""""12"명륜3가.성대입구"311731553859"105159"126.99839437.582764"간선"