Overview

Dataset statistics

Number of variables10
Number of observations1000
Missing cells76
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory83.1 KiB
Average record size in memory85.1 B

Variable types

Categorical1
Numeric5
Text4

Alerts

CTY_NM has constant value ""Constant
LNM_ADDR has 31 (3.1%) missing valuesMissing
RDNMADR_NM has 23 (2.3%) missing valuesMissing
RSTRNT_TEL_NO has 22 (2.2%) missing valuesMissing
RSTRNT_ID has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:08:50.319201
Analysis finished2023-12-10 10:08:58.215324
Duration7.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

CTY_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
ソウル特別市
1000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowソウル特別市
2nd rowソウル特別市
3rd rowソウル特別市
4th rowソウル特別市
5th rowソウル特別市

Common Values

ValueCountFrequency (%)
ソウル特別市 1000
100.0%

Length

2023-12-10T19:08:58.339647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:08:58.518137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ソウル特別市 1000
100.0%

RSTRNT_ID
Real number (ℝ)

UNIQUE 

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5648.17
Minimum1088
Maximum9743
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-10T19:08:58.723938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1088
5-th percentile1574.9
Q13256.5
median5734
Q37843.5
95-th percentile9637.2
Maximum9743
Range8655
Interquartile range (IQR)4587

Descriptive statistics

Standard deviation2713.1837
Coefficient of variation (CV)0.48036508
Kurtosis-1.2686303
Mean5648.17
Median Absolute Deviation (MAD)2132.5
Skewness0.015609224
Sum5648170
Variance7361365.5
MonotonicityStrictly increasing
2023-12-10T19:08:59.023134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1088 1
 
0.1%
7729 1
 
0.1%
7707 1
 
0.1%
7708 1
 
0.1%
7710 1
 
0.1%
7711 1
 
0.1%
7713 1
 
0.1%
7715 1
 
0.1%
7717 1
 
0.1%
7719 1
 
0.1%
Other values (990) 990
99.0%
ValueCountFrequency (%)
1088 1
0.1%
1115 1
0.1%
1116 1
0.1%
1117 1
0.1%
1121 1
0.1%
1123 1
0.1%
1126 1
0.1%
1130 1
0.1%
1134 1
0.1%
1156 1
0.1%
ValueCountFrequency (%)
9743 1
0.1%
9742 1
0.1%
9741 1
0.1%
9740 1
0.1%
9739 1
0.1%
9738 1
0.1%
9734 1
0.1%
9733 1
0.1%
9732 1
0.1%
9731 1
0.1%
Distinct929
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-10T19:08:59.551079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length36
Mean length13.892
Min length1

Characters and Unicode

Total characters13892
Distinct characters267
Distinct categories9 ?
Distinct scripts6 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique875 ?
Unique (%)87.5%

Sample

1st rowpenand coffee
2nd rowtodamtodam
3rd rowttowachamsutdwaeji galbi
4th row東成閣
5th rowhurendeu chicken
ValueCountFrequency (%)
sikdang 72
 
4.2%
chicken 29
 
1.7%
coffee 23
 
1.3%
galbi 21
 
1.2%
hof 17
 
1.0%
kalguksu 16
 
0.9%
jokbal 13
 
0.8%
gamjatang 12
 
0.7%
sundae 11
 
0.6%
baguette 11
 
0.6%
Other values (1130) 1487
86.9%
2023-12-10T19:09:00.362758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 1450
 
10.4%
a 1407
 
10.1%
g 1173
 
8.4%
o 1126
 
8.1%
e 1102
 
7.9%
i 727
 
5.2%
716
 
5.2%
u 696
 
5.0%
s 595
 
4.3%
k 507
 
3.6%
Other values (257) 4393
31.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12715
91.5%
Space Separator 716
 
5.2%
Other Letter 346
 
2.5%
Uppercase Letter 69
 
0.5%
Decimal Number 23
 
0.2%
Dash Punctuation 11
 
0.1%
Other Punctuation 10
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
2.6%
8
 
2.3%
6
 
1.7%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
Other values (197) 288
83.2%
Lowercase Letter
ValueCountFrequency (%)
n 1450
11.4%
a 1407
11.1%
g 1173
 
9.2%
o 1126
 
8.9%
e 1102
 
8.7%
i 727
 
5.7%
u 696
 
5.5%
s 595
 
4.7%
k 507
 
4.0%
j 497
 
3.9%
Other values (15) 3435
27.0%
Uppercase Letter
ValueCountFrequency (%)
A 9
13.0%
E 6
 
8.7%
R 6
 
8.7%
C 5
 
7.2%
I 5
 
7.2%
U 5
 
7.2%
H 4
 
5.8%
T 4
 
5.8%
D 4
 
5.8%
Y 4
 
5.8%
Other values (8) 17
24.6%
Decimal Number
ValueCountFrequency (%)
0 4
17.4%
1 4
17.4%
2 4
17.4%
9 3
13.0%
4 3
13.0%
7 2
8.7%
6 1
 
4.3%
8 1
 
4.3%
5 1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
& 5
50.0%
. 3
30.0%
, 1
 
10.0%
' 1
 
10.0%
Space Separator
ValueCountFrequency (%)
716
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12784
92.0%
Common 762
 
5.5%
Han 318
 
2.3%
Hiragana 20
 
0.1%
Hangul 5
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
9
 
2.8%
8
 
2.5%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
Other values (172) 260
81.8%
Latin
ValueCountFrequency (%)
n 1450
11.3%
a 1407
11.0%
g 1173
 
9.2%
o 1126
 
8.8%
e 1102
 
8.6%
i 727
 
5.7%
u 696
 
5.4%
s 595
 
4.7%
k 507
 
4.0%
j 497
 
3.9%
Other values (33) 3504
27.4%
Common
ValueCountFrequency (%)
716
94.0%
- 11
 
1.4%
& 5
 
0.7%
0 4
 
0.5%
1 4
 
0.5%
2 4
 
0.5%
9 3
 
0.4%
. 3
 
0.4%
4 3
 
0.4%
7 2
 
0.3%
Other values (7) 7
 
0.9%
Hiragana
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13546
97.5%
CJK 312
 
2.2%
Hiragana 20
 
0.1%
CJK Compat Ideographs 6
 
< 0.1%
Hangul 5
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 1450
 
10.7%
a 1407
 
10.4%
g 1173
 
8.7%
o 1126
 
8.3%
e 1102
 
8.1%
i 727
 
5.4%
716
 
5.3%
u 696
 
5.1%
s 595
 
4.4%
k 507
 
3.7%
Other values (50) 4047
29.9%
CJK
ValueCountFrequency (%)
9
 
2.9%
8
 
2.6%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
Other values (168) 254
81.4%
Hiragana
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

LNM_ADDR
Text

MISSING 

Distinct911
Distinct (%)94.0%
Missing31
Missing (%)3.1%
Memory size7.9 KiB
2023-12-10T19:09:00.694435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length23
Mean length19.101135
Min length14

Characters and Unicode

Total characters18509
Distinct characters293
Distinct categories4 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique868 ?
Unique (%)89.6%

Sample

1st rowソウル特別市 鍾路区 寬勲洞198-10
2nd rowソウル特別市 鍾路区 樂園洞218-1
3rd rowソウル特別市 鍾路区 臥竜洞166
4th rowソウル特別市 鍾路区 唐珠洞18
5th rowソウル特別市 龍山区 梨泰院洞225-112
ValueCountFrequency (%)
ソウル特別市 969
33.3%
中区 253
 
8.7%
鍾路区 159
 
5.5%
永登浦区 89
 
3.1%
龍山区 55
 
1.9%
西大門区 45
 
1.5%
東大門区 42
 
1.4%
広津区 36
 
1.2%
城東区 35
 
1.2%
麻浦区 34
 
1.2%
Other values (931) 1194
41.0%
2023-12-10T19:09:01.220377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1942
 
10.5%
969
 
5.2%
969
 
5.2%
969
 
5.2%
969
 
5.2%
969
 
5.2%
969
 
5.2%
969
 
5.2%
835
 
4.5%
- 801
 
4.3%
Other values (283) 8148
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11849
64.0%
Decimal Number 3917
 
21.2%
Space Separator 1942
 
10.5%
Dash Punctuation 801
 
4.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
969
 
8.2%
969
 
8.2%
969
 
8.2%
969
 
8.2%
969
 
8.2%
969
 
8.2%
969
 
8.2%
835
 
7.0%
289
 
2.4%
288
 
2.4%
Other values (271) 3654
30.8%
Decimal Number
ValueCountFrequency (%)
1 801
20.4%
2 626
16.0%
3 508
13.0%
4 403
10.3%
5 343
8.8%
6 294
 
7.5%
0 247
 
6.3%
7 241
 
6.2%
8 231
 
5.9%
9 223
 
5.7%
Space Separator
ValueCountFrequency (%)
1942
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 801
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 8934
48.3%
Common 6660
36.0%
Katakana 2907
 
15.7%
Hangul 8
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
969
 
10.8%
969
 
10.8%
969
 
10.8%
969
 
10.8%
835
 
9.3%
289
 
3.2%
288
 
3.2%
260
 
2.9%
192
 
2.1%
165
 
1.8%
Other values (260) 3029
33.9%
Common
ValueCountFrequency (%)
1942
29.2%
- 801
12.0%
1 801
12.0%
2 626
 
9.4%
3 508
 
7.6%
4 403
 
6.1%
5 343
 
5.2%
6 294
 
4.4%
0 247
 
3.7%
7 241
 
3.6%
Other values (2) 454
 
6.8%
Hangul
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Katakana
ValueCountFrequency (%)
969
33.3%
969
33.3%
969
33.3%

Most occurring blocks

ValueCountFrequency (%)
CJK 8802
47.6%
ASCII 6660
36.0%
Katakana 2907
 
15.7%
CJK Compat Ideographs 132
 
0.7%
Hangul 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1942
29.2%
- 801
12.0%
1 801
12.0%
2 626
 
9.4%
3 508
 
7.6%
4 403
 
6.1%
5 343
 
5.2%
6 294
 
4.4%
0 247
 
3.7%
7 241
 
3.6%
Other values (2) 454
 
6.8%
Katakana
ValueCountFrequency (%)
969
33.3%
969
33.3%
969
33.3%
CJK
ValueCountFrequency (%)
969
 
11.0%
969
 
11.0%
969
 
11.0%
969
 
11.0%
835
 
9.5%
289
 
3.3%
288
 
3.3%
260
 
3.0%
192
 
2.2%
165
 
1.9%
Other values (243) 2897
32.9%
CJK Compat Ideographs
ValueCountFrequency (%)
75
56.8%
9
 
6.8%
7
 
5.3%
6
 
4.5%
5
 
3.8%
5
 
3.8%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (7) 13
 
9.8%
Hangul
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

RDNMADR_NM
Text

MISSING 

Distinct914
Distinct (%)93.6%
Missing23
Missing (%)2.3%
Memory size7.9 KiB
2023-12-10T19:09:01.538509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length21.055271
Min length15

Characters and Unicode

Total characters20571
Distinct characters128
Distinct categories4 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique867 ?
Unique (%)88.7%

Sample

1st rowソウル特別市 鍾路区 インサドン5ギル12
2nd rowソウル特別市 鍾路区 スピョロ121
3rd rowソウル特別市 鍾路区 トンファムンロ11ガギル48
4th rowソウル特別市 鍾路区 セムンアンロ9ギル29-2
5th rowソウル特別市 龍山区 フェナムロ13ギル10
ValueCountFrequency (%)
ソウル特別市 977
33.3%
中区 254
 
8.7%
鍾路区 157
 
5.4%
永登浦区 86
 
2.9%
龍山区 52
 
1.8%
西大門区 47
 
1.6%
東大門区 44
 
1.5%
広津区 37
 
1.3%
城東区 37
 
1.3%
麻浦区 33
 
1.1%
Other values (930) 1207
41.2%
2023-12-10T19:09:02.168538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1954
 
9.5%
1686
 
8.2%
1092
 
5.3%
1045
 
5.1%
1043
 
5.1%
977
 
4.7%
977
 
4.7%
977
 
4.7%
977
 
4.7%
890
 
4.3%
Other values (118) 8953
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15187
73.8%
Decimal Number 3197
 
15.5%
Space Separator 1954
 
9.5%
Dash Punctuation 233
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1686
 
11.1%
1092
 
7.2%
1045
 
6.9%
1043
 
6.9%
977
 
6.4%
977
 
6.4%
977
 
6.4%
977
 
6.4%
890
 
5.9%
605
 
4.0%
Other values (106) 4918
32.4%
Decimal Number
ValueCountFrequency (%)
1 761
23.8%
2 466
14.6%
3 357
11.2%
4 344
10.8%
5 249
 
7.8%
7 222
 
6.9%
8 209
 
6.5%
0 208
 
6.5%
6 206
 
6.4%
9 175
 
5.5%
Space Separator
ValueCountFrequency (%)
1954
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 233
100.0%

Most occurring scripts

ValueCountFrequency (%)
Katakana 9377
45.6%
Han 5809
28.2%
Common 5384
26.2%
Hangul 1
 
< 0.1%

Most frequent character per script

Katakana
ValueCountFrequency (%)
1686
18.0%
1092
11.6%
1045
11.1%
1043
11.1%
890
9.5%
605
 
6.5%
283
 
3.0%
219
 
2.3%
206
 
2.2%
166
 
1.8%
Other values (60) 2142
22.8%
Han
ValueCountFrequency (%)
977
16.8%
977
16.8%
977
16.8%
977
16.8%
269
 
4.6%
163
 
2.8%
157
 
2.7%
119
 
2.0%
92
 
1.6%
91
 
1.6%
Other values (35) 1010
17.4%
Common
ValueCountFrequency (%)
1954
36.3%
1 761
 
14.1%
2 466
 
8.7%
3 357
 
6.6%
4 344
 
6.4%
5 249
 
4.6%
- 233
 
4.3%
7 222
 
4.1%
8 209
 
3.9%
0 208
 
3.9%
Other values (2) 381
 
7.1%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Katakana 9377
45.6%
CJK 5750
28.0%
ASCII 5384
26.2%
CJK Compat Ideographs 59
 
0.3%
Hangul 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1954
36.3%
1 761
 
14.1%
2 466
 
8.7%
3 357
 
6.6%
4 344
 
6.4%
5 249
 
4.6%
- 233
 
4.3%
7 222
 
4.1%
8 209
 
3.9%
0 208
 
3.9%
Other values (2) 381
 
7.1%
Katakana
ValueCountFrequency (%)
1686
18.0%
1092
11.6%
1045
11.1%
1043
11.1%
890
9.5%
605
 
6.5%
283
 
3.0%
219
 
2.3%
206
 
2.2%
166
 
1.8%
Other values (60) 2142
22.8%
CJK
ValueCountFrequency (%)
977
17.0%
977
17.0%
977
17.0%
977
17.0%
269
 
4.7%
163
 
2.8%
157
 
2.7%
119
 
2.1%
92
 
1.6%
91
 
1.6%
Other values (33) 951
16.5%
CJK Compat Ideographs
ValueCountFrequency (%)
52
88.1%
7
 
11.9%
Hangul
ValueCountFrequency (%)
1
100.0%

RSTRNT_TEL_NO
Real number (ℝ)

MISSING 

Distinct970
Distinct (%)99.2%
Missing22
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean3.142388 × 108
Minimum18111111
Maximum5.0409531 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-10T19:09:02.427516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18111111
5-th percentile23592293
Q127162094
median27814119
Q32.2266342 × 108
95-th percentile2.27629 × 108
Maximum5.0409531 × 1010
Range5.039142 × 1010
Interquartile range (IQR)1.9550133 × 108

Descriptive statistics

Standard deviation2.8649085 × 109
Coefficient of variation (CV)9.1169788
Kurtosis286.11331
Mean3.142388 × 108
Median Absolute Deviation (MAD)2838111.5
Skewness16.575536
Sum3.0732555 × 1011
Variance8.2077007 × 1018
MonotonicityNot monotonic
2023-12-10T19:09:02.673034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
237898088 3
 
0.3%
27765348 2
 
0.2%
27773131 2
 
0.2%
222679396 2
 
0.2%
27523177 2
 
0.2%
25711110 2
 
0.2%
27775668 2
 
0.2%
27635588 1
 
0.1%
27201910 1
 
0.1%
236763933 1
 
0.1%
Other values (960) 960
96.0%
(Missing) 22
 
2.2%
ValueCountFrequency (%)
18111111 1
0.1%
22738413 1
0.1%
22794658 1
0.1%
23022421 1
0.1%
23049346 1
0.1%
23055554 1
0.1%
23058555 1
0.1%
23098359 1
0.1%
23101936 1
0.1%
23109665 1
0.1%
ValueCountFrequency (%)
50409530969 1
0.1%
50409530696 1
0.1%
50409530371 1
0.1%
7088880908 1
0.1%
7088623830 1
0.1%
7088452114 1
0.1%
7087221542 1
0.1%
7082258075 1
0.1%
7075700871 1
0.1%
7075568204 1
0.1%
Distinct170
Distinct (%)17.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-10T19:09:03.173458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length12
Mean length6.39
Min length3

Characters and Unicode

Total characters6390
Distinct characters75
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)4.8%

Sample

1st rowアングク駅
2nd rowアングク駅
3rd rowアングク駅
4th rowクァンファムン駅
5th rowイテウォン駅
ValueCountFrequency (%)
ウルチロサガ駅 74
 
7.4%
フェヒョン駅 63
 
6.3%
クァンファムン駅 56
 
5.6%
ミョンドン駅 53
 
5.3%
ヨンドゥンポ駅 33
 
3.3%
アングク駅 28
 
2.8%
ウルチロサムガ駅 27
 
2.7%
ソウル駅(京義線 25
 
2.5%
ソデムン駅 25
 
2.5%
トンデムンヨクサムンファゴンウォン駅 22
 
2.2%
Other values (160) 594
59.4%
2023-12-10T19:09:04.082957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1131
17.7%
1000
 
15.6%
265
 
4.1%
228
 
3.6%
225
 
3.5%
210
 
3.3%
194
 
3.0%
192
 
3.0%
187
 
2.9%
167
 
2.6%
Other values (65) 2591
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6324
99.0%
Close Punctuation 25
 
0.4%
Open Punctuation 25
 
0.4%
Other Punctuation 16
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1131
17.9%
1000
 
15.8%
265
 
4.2%
228
 
3.6%
225
 
3.6%
210
 
3.3%
194
 
3.1%
192
 
3.0%
187
 
3.0%
167
 
2.6%
Other values (62) 2525
39.9%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Other Punctuation
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Katakana 5249
82.1%
Han 1075
 
16.8%
Common 66
 
1.0%

Most frequent character per script

Katakana
ValueCountFrequency (%)
1131
21.5%
265
 
5.0%
228
 
4.3%
225
 
4.3%
210
 
4.0%
194
 
3.7%
192
 
3.7%
187
 
3.6%
167
 
3.2%
157
 
3.0%
Other values (58) 2293
43.7%
Han
ValueCountFrequency (%)
1000
93.0%
25
 
2.3%
25
 
2.3%
25
 
2.3%
Common
ValueCountFrequency (%)
) 25
37.9%
( 25
37.9%
16
24.2%

Most occurring blocks

ValueCountFrequency (%)
Katakana 5265
82.4%
CJK 1075
 
16.8%
ASCII 50
 
0.8%

Most frequent character per block

Katakana
ValueCountFrequency (%)
1131
21.5%
265
 
5.0%
228
 
4.3%
225
 
4.3%
210
 
4.0%
194
 
3.7%
192
 
3.6%
187
 
3.6%
167
 
3.2%
157
 
3.0%
Other values (59) 2309
43.9%
CJK
ValueCountFrequency (%)
1000
93.0%
25
 
2.3%
25
 
2.3%
25
 
2.3%
ASCII
ValueCountFrequency (%)
) 25
50.0%
( 25
50.0%

SUBWAYST_NM.1
Real number (ℝ)

Distinct936
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean413.77836
Minimum0.009446
Maximum699.54752
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-10T19:09:04.346476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.009446
5-th percentile98.42562
Q1259.30608
median430.40847
Q3575.79551
95-th percentile676.28204
Maximum699.54752
Range699.53807
Interquartile range (IQR)316.48943

Descriptive statistics

Standard deviation184.4123
Coefficient of variation (CV)0.44567895
Kurtosis-1.1090928
Mean413.77836
Median Absolute Deviation (MAD)153.88229
Skewness-0.24843532
Sum413778.36
Variance34007.898
MonotonicityNot monotonic
2023-12-10T19:09:04.589042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
59.77552 6
 
0.6%
257.409397 6
 
0.6%
612.785909 4
 
0.4%
462.410271 4
 
0.4%
677.616009 4
 
0.4%
676.21183 3
 
0.3%
461.551236 3
 
0.3%
109.059737 2
 
0.2%
559.154933 2
 
0.2%
213.346422 2
 
0.2%
Other values (926) 964
96.4%
ValueCountFrequency (%)
0.009446 1
0.1%
18.673123 1
0.1%
18.967334 1
0.1%
25.132352 1
0.1%
25.359498 1
0.1%
39.684891 1
0.1%
41.966893 1
0.1%
46.021078 1
0.1%
46.034054 1
0.1%
47.776278 1
0.1%
ValueCountFrequency (%)
699.547518 1
0.1%
698.483019 1
0.1%
698.192538 1
0.1%
696.924441 1
0.1%
696.670583 1
0.1%
696.317406 1
0.1%
695.72837 1
0.1%
693.632945 1
0.1%
693.52106 1
0.1%
692.39651 1
0.1%

RSTRNT_LA
Real number (ℝ)

Distinct935
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.555564
Minimum37.453428
Maximum37.68503
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-10T19:09:04.861787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.453428
5-th percentile37.492714
Q137.539114
median37.562906
Q337.570897
95-th percentile37.603853
Maximum37.68503
Range0.2316017
Interquartile range (IQR)0.031782475

Descriptive statistics

Standard deviation0.031949077
Coefficient of variation (CV)0.00085071488
Kurtosis1.1729098
Mean37.555564
Median Absolute Deviation (MAD)0.0109993
Skewness-0.17347138
Sum37555.564
Variance0.0010207435
MonotonicityNot monotonic
2023-12-10T19:09:05.142146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5214704 6
 
0.6%
37.504154 6
 
0.6%
37.5208744 4
 
0.4%
37.5644687 4
 
0.4%
37.5121721 4
 
0.4%
37.5641656 3
 
0.3%
37.5196767 3
 
0.3%
37.5210051 2
 
0.2%
37.5703041 2
 
0.2%
37.562291 2
 
0.2%
Other values (925) 964
96.4%
ValueCountFrequency (%)
37.4534281 1
0.1%
37.4559852 1
0.1%
37.4672522 2
0.2%
37.4685471 1
0.1%
37.4755582 1
0.1%
37.4789287 1
0.1%
37.4801437 1
0.1%
37.4802585 1
0.1%
37.4802634 1
0.1%
37.4804543 1
0.1%
ValueCountFrequency (%)
37.6850298 1
0.1%
37.6846059 1
0.1%
37.6773506 1
0.1%
37.6713881 1
0.1%
37.665656 1
0.1%
37.6638325 1
0.1%
37.6516194 1
0.1%
37.6513711 1
0.1%
37.6475992 1
0.1%
37.6357937 1
0.1%

RSTRNT_LO
Real number (ℝ)

Distinct936
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.98345
Minimum126.80745
Maximum127.15365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-10T19:09:05.410103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.80745
5-th percentile126.88923
Q1126.94847
median126.98542
Q3127.01215
95-th percentile127.07999
Maximum127.15365
Range0.3461964
Interquartile range (IQR)0.063678875

Descriptive statistics

Standard deviation0.057537291
Coefficient of variation (CV)0.00045310859
Kurtosis0.49320298
Mean126.98345
Median Absolute Deviation (MAD)0.0304145
Skewness-0.18079765
Sum126983.45
Variance0.0033105398
MonotonicityNot monotonic
2023-12-10T19:09:05.711608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9249738 6
 
0.6%
126.879538 6
 
0.6%
126.9313093 4
 
0.4%
127.0653425 4
 
0.4%
126.9800463 4
 
0.4%
126.9811637 3
 
0.3%
126.928942 3
 
0.3%
126.9238388 2
 
0.2%
126.9824573 2
 
0.2%
126.9078716 2
 
0.2%
Other values (926) 964
96.4%
ValueCountFrequency (%)
126.8074545 1
0.1%
126.8079647 1
0.1%
126.8080279 2
0.2%
126.8081111 1
0.1%
126.8091358 1
0.1%
126.8099671 1
0.1%
126.8120516 1
0.1%
126.8121299 1
0.1%
126.82014 1
0.1%
126.8409761 1
0.1%
ValueCountFrequency (%)
127.1536509 1
0.1%
127.1528684 1
0.1%
127.1343437 1
0.1%
127.1313787 1
0.1%
127.1294834 1
0.1%
127.1267408 1
0.1%
127.1265552 1
0.1%
127.1263565 1
0.1%
127.1263319 1
0.1%
127.126184 1
0.1%

Interactions

2023-12-10T19:08:56.547528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:51.646560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:52.811721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:53.838357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:55.501370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:56.716546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:51.914930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:52.974695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:54.083668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:55.761106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:56.924736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:52.185199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:53.204715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:54.417776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:55.943722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:57.114688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:52.406319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:53.474537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:54.617038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:56.182424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:57.300222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:52.601087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:53.639578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:54.801723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:08:56.369307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:09:05.999096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RSTRNT_IDRSTRNT_TEL_NOSUBWAYST_NM.1RSTRNT_LARSTRNT_LO
RSTRNT_ID1.0000.0000.1010.1910.313
RSTRNT_TEL_NO0.0001.0000.1620.0430.145
SUBWAYST_NM.10.1010.1621.0000.2130.299
RSTRNT_LA0.1910.0430.2131.0000.709
RSTRNT_LO0.3130.1450.2990.7091.000
2023-12-10T19:09:06.255905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RSTRNT_IDRSTRNT_TEL_NOSUBWAYST_NM.1RSTRNT_LARSTRNT_LO
RSTRNT_ID1.000-0.050-0.067-0.1650.056
RSTRNT_TEL_NO-0.0501.000-0.001-0.045-0.043
SUBWAYST_NM.1-0.067-0.0011.0000.0260.010
RSTRNT_LA-0.165-0.0450.0261.0000.369
RSTRNT_LO0.056-0.0430.0100.3691.000

Missing values

2023-12-10T19:08:57.586469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:08:57.842411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-10T19:08:58.073752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

CTY_NMRSTRNT_IDRSTRNT_NMLNM_ADDRRDNMADR_NMRSTRNT_TEL_NOSUBWAYST_NMSUBWAYST_NM.1RSTRNT_LARSTRNT_LO
0ソウル特別市1088penand coffeeソウル特別市 鍾路区 寬勲洞198-10ソウル特別市 鍾路区 インサドン5ギル1227250062アングク駅429.32820737.572857126.985577
1ソウル特別市1115todamtodamソウル特別市 鍾路区 樂園洞218-1ソウル特別市 鍾路区 スピョロ12127447934アングク駅588.60089337.571998126.988855
2ソウル特別市1116ttowachamsutdwaeji galbiソウル特別市 鍾路区 臥竜洞166ソウル特別市 鍾路区 トンファムンロ11ガギル48236760399アングク駅480.51917337.574727126.990652
3ソウル特別市1117東成閣ソウル特別市 鍾路区 唐珠洞18ソウル特別市 鍾路区 セムンアンロ9ギル29-227350107クァンファムン駅135.40508237.571444126.975272
4ソウル特別市1121hurendeu chickenソウル特別市 龍山区 梨泰院洞225-112ソウル特別市 龍山区 フェナムロ13ギル1027964642イテウォン駅691.73399537.539777126.989591
5ソウル特別市1123donghosutbul barbecueソウル特別市 蘆原区 上渓洞651ソウル特別市 蘆原区 トンイルロ154129321090マドゥル駅82.55110737.665656127.057086
6ソウル特別市1126beuraeseoriソウル特別市 江南区 三成洞159ソウル特別市 江南区 ポンウンサロ524234308585ポンウンサ駅358.33778237.512878127.057291
7ソウル特別市1130amarelloソウル特別市 江東区 城內洞552-12ソウル特別市 江東区 ソンネロ6ギル14-2624886394カンドングチョン駅122.3476937.529883127.12154
8ソウル特別市1134cafe rinneseu gardenソウル特別市 永登浦区 汝矣島洞24-2ソウル特別市 永登浦区 ヨウィナルロ77-127834877ヨイナル駅585.87504737.524395126.927161
9ソウル特別市1156梅河ソウル特別市 恩平区 仏光洞281-52ソウル特別市 恩平区 トンイルロ66ギル11<NA>プルグァン駅138.9523737.61083126.931351
CTY_NMRSTRNT_IDRSTRNT_NMLNM_ADDRRDNMADR_NMRSTRNT_TEL_NOSUBWAYST_NMSUBWAYST_NM.1RSTRNT_LARSTRNT_LO
990ソウル特別市9731cheongdammyeonokソウル特別市 江南区 三成洞8-1ソウル特別市 江南区 ソンルンロ66425483777ソンジョンヌン駅571.35410237.515896127.042335
991ソウル特別市9732duruchigi yuseonsaengソウル特別市 江南区 三成洞166-6ソウル特別市 江南区 ポンウンサロ114ギル4225631159ポンウンサ駅462.41027137.512172127.065342
992ソウル特別市9733うつわソウル特別市 江南区 三成洞9-6ソウル特別市 江南区 ハクドンロ56ギル3225114956ソンジョンヌン駅486.3521637.51519127.042846
993ソウル特別市9734samhwanso hanmariソウル特別市 江南区 三成洞107-5ソウル特別市 江南区 ヨンドンデロ112ギル1025452429ポンウンサ駅71.13774637.514992127.061034
994ソウル特別市9738coffee olliソウル特別市 江南区 論峴洞164-11ソウル特別市 江南区 カンナムデロ506260840015ノンヒョン駅443.28190837.50749127.023517
995ソウル特別市9739insaengkacheuソウル特別市 江南区 新沙洞615-1ソウル特別市 江南区 アブグジョンロ21625453442アックジョン駅292.37846137.528095127.031045
996ソウル特別市974060nyeon jeontong sinchon hwang sogopchangソウル特別市 江南区 論峴洞144ソウル特別市 江南区 カンナムデロ128ギル1025114632ノンヒョン駅198.03923337.509822127.023083
997ソウル特別市9741hanul dakgalbiソウル特別市 江南区 論峴洞144-1ソウル特別市 江南区 ハクドンロ2ギル2925488970ノンヒョン駅221.58882537.509577127.023119
998ソウル特別市9742sammiソウル特別市 江南区 論峴洞143-10ソウル特別市 江南区 ハクドンロ2ギル3025499485ノンヒョン駅222.15376837.509475127.022935
999ソウル特別市9743mr. dolsoeソウル特別市 江南区 三成洞166-6ソウル特別市 江南区 ポンウンサロ114ギル4225670933ポンウンサ駅462.41027137.512172127.065342