Overview

Dataset statistics

Number of variables8
Number of observations7964
Missing cells1057
Missing cells (%)1.7%
Duplicate rows19
Duplicate rows (%)0.2%
Total size in memory513.4 KiB
Average record size in memory66.0 B

Variable types

Categorical2
Text4
Numeric2

Alerts

Dataset has 19 (0.2%) duplicate rowsDuplicates
WGS84위도 is highly overall correlated with 시군명High correlation
WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with WGS84위도 and 1 other fieldsHigh correlation
시설구분명 is highly imbalanced (95.9%)Imbalance
소재지도로명주소 has 481 (6.0%) missing valuesMissing
WGS84위도 has 269 (3.4%) missing valuesMissing
WGS84경도 has 269 (3.4%) missing valuesMissing

Reproduction

Analysis started2023-12-10 21:30:36.427738
Analysis finished2023-12-10 21:30:38.926522
Duration2.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size62.3 KiB
고양시
715 
수원시
675 
용인시
595 
성남시
560 
부천시
543 
Other values (26)
4876 

Length

Max length4
Median length3
Mean length3.0995731
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
고양시 715
 
9.0%
수원시 675
 
8.5%
용인시 595
 
7.5%
성남시 560
 
7.0%
부천시 543
 
6.8%
안산시 474
 
6.0%
화성시 459
 
5.8%
남양주시 438
 
5.5%
시흥시 320
 
4.0%
안양시 319
 
4.0%
Other values (21) 2866
36.0%

Length

2023-12-11T06:30:38.994538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고양시 715
 
9.0%
수원시 675
 
8.5%
용인시 595
 
7.5%
성남시 560
 
7.0%
부천시 543
 
6.8%
안산시 474
 
6.0%
화성시 459
 
5.8%
남양주시 438
 
5.5%
시흥시 320
 
4.0%
안양시 319
 
4.0%
Other values (21) 2866
36.0%

시설구분명
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size62.3 KiB
체육도장업
7859 
테니스
 
60
육상시설
 
34
양궁장
 
5
하키시설
 
2
Other values (3)
 
4

Length

Max length5
Median length5
Mean length4.9781517
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row체육도장업
2nd row육상시설
3rd row체육도장업
4th row체육도장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체육도장업 7859
98.7%
테니스 60
 
0.8%
육상시설 34
 
0.4%
양궁장 5
 
0.1%
하키시설 2
 
< 0.1%
투기 2
 
< 0.1%
골프연습장 1
 
< 0.1%
사격장 1
 
< 0.1%

Length

2023-12-11T06:30:39.144373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:30:39.266295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육도장업 7859
98.7%
테니스 60
 
0.8%
육상시설 34
 
0.4%
양궁장 5
 
0.1%
하키시설 2
 
< 0.1%
투기 2
 
< 0.1%
골프연습장 1
 
< 0.1%
사격장 1
 
< 0.1%
Distinct6539
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Memory size62.3 KiB
2023-12-11T06:30:39.513750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length30
Mean length8.4293069
Min length1

Characters and Unicode

Total characters67131
Distinct characters658
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5570 ?
Unique (%)69.9%

Sample

1st row해동검도 가평본관
2nd row가평종합운동장
3rd row중앙체육관
4th row충효체육관
5th row혜성운암태권도장
ValueCountFrequency (%)
태권도장 1030
 
8.0%
태권도 637
 
5.0%
용인대 521
 
4.1%
경희대 373
 
2.9%
체육관 231
 
1.8%
석사 122
 
1.0%
국가대표 98
 
0.8%
한국체대 94
 
0.7%
복싱 82
 
0.6%
복싱클럽 43
 
0.3%
Other values (6022) 9610
74.8%
2023-12-11T06:30:40.001572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6135
 
9.1%
5434
 
8.1%
5359
 
8.0%
4887
 
7.3%
3009
 
4.5%
2705
 
4.0%
2240
 
3.3%
1896
 
2.8%
1760
 
2.6%
1214
 
1.8%
Other values (648) 32492
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59362
88.4%
Space Separator 4887
 
7.3%
Uppercase Letter 1719
 
2.6%
Lowercase Letter 465
 
0.7%
Open Punctuation 169
 
0.3%
Close Punctuation 169
 
0.3%
Other Punctuation 167
 
0.2%
Decimal Number 153
 
0.2%
Dash Punctuation 37
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6135
 
10.3%
5434
 
9.2%
5359
 
9.0%
3009
 
5.1%
2705
 
4.6%
2240
 
3.8%
1896
 
3.2%
1760
 
3.0%
1214
 
2.0%
1191
 
2.0%
Other values (575) 28419
47.9%
Uppercase Letter
ValueCountFrequency (%)
T 244
14.2%
M 211
12.3%
A 162
 
9.4%
K 146
 
8.5%
S 112
 
6.5%
G 89
 
5.2%
J 81
 
4.7%
Y 76
 
4.4%
C 65
 
3.8%
E 62
 
3.6%
Other values (15) 471
27.4%
Lowercase Letter
ValueCountFrequency (%)
s 52
11.2%
e 51
11.0%
i 45
9.7%
o 34
 
7.3%
m 32
 
6.9%
t 32
 
6.9%
r 32
 
6.9%
a 29
 
6.2%
n 29
 
6.2%
k 21
 
4.5%
Other values (12) 108
23.2%
Decimal Number
ValueCountFrequency (%)
2 59
38.6%
1 42
27.5%
3 24
15.7%
8 11
 
7.2%
7 5
 
3.3%
5 4
 
2.6%
4 4
 
2.6%
0 3
 
2.0%
9 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 65
38.9%
& 53
31.7%
, 21
 
12.6%
' 15
 
9.0%
6
 
3.6%
· 3
 
1.8%
: 3
 
1.8%
/ 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 168
99.4%
[ 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 168
99.4%
] 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
4887
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59340
88.4%
Common 5585
 
8.3%
Latin 2184
 
3.3%
Han 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6135
 
10.3%
5434
 
9.2%
5359
 
9.0%
3009
 
5.1%
2705
 
4.6%
2240
 
3.8%
1896
 
3.2%
1760
 
3.0%
1214
 
2.0%
1191
 
2.0%
Other values (561) 28397
47.9%
Latin
ValueCountFrequency (%)
T 244
 
11.2%
M 211
 
9.7%
A 162
 
7.4%
K 146
 
6.7%
S 112
 
5.1%
G 89
 
4.1%
J 81
 
3.7%
Y 76
 
3.5%
C 65
 
3.0%
E 62
 
2.8%
Other values (37) 936
42.9%
Common
ValueCountFrequency (%)
4887
87.5%
( 168
 
3.0%
) 168
 
3.0%
. 65
 
1.2%
2 59
 
1.1%
& 53
 
0.9%
1 42
 
0.8%
- 37
 
0.7%
3 24
 
0.4%
, 21
 
0.4%
Other values (16) 61
 
1.1%
Han
ValueCountFrequency (%)
6
27.3%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (4) 4
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59340
88.4%
ASCII 7760
 
11.6%
CJK 22
 
< 0.1%
None 9
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6135
 
10.3%
5434
 
9.2%
5359
 
9.0%
3009
 
5.1%
2705
 
4.6%
2240
 
3.8%
1896
 
3.2%
1760
 
3.0%
1214
 
2.0%
1191
 
2.0%
Other values (561) 28397
47.9%
ASCII
ValueCountFrequency (%)
4887
63.0%
T 244
 
3.1%
M 211
 
2.7%
( 168
 
2.2%
) 168
 
2.2%
A 162
 
2.1%
K 146
 
1.9%
S 112
 
1.4%
G 89
 
1.1%
J 81
 
1.0%
Other values (61) 1492
 
19.2%
None
ValueCountFrequency (%)
6
66.7%
· 3
33.3%
CJK
ValueCountFrequency (%)
6
27.3%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (4) 4
18.2%
Distinct3295
Distinct (%)41.6%
Missing37
Missing (%)0.5%
Memory size62.3 KiB
2023-12-11T06:30:40.361757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length5.655229
Min length5

Characters and Unicode

Total characters44829
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1657 ?
Unique (%)20.9%

Sample

1st row12413
2nd row12416
3rd row477853
4th row477801
5th row477804
ValueCountFrequency (%)
445160 31
 
0.4%
410831 27
 
0.3%
445360 25
 
0.3%
472901 24
 
0.3%
415060 21
 
0.3%
15010 21
 
0.3%
482060 21
 
0.3%
482050 19
 
0.2%
483020 18
 
0.2%
472865 18
 
0.2%
Other values (3285) 7702
97.2%
2023-12-11T06:30:40.834161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 8524
19.0%
1 6893
15.4%
8 5428
12.1%
0 4997
11.1%
2 4497
10.0%
3 3486
7.8%
6 3105
 
6.9%
5 3082
 
6.9%
7 2626
 
5.9%
9 1975
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44613
99.5%
Dash Punctuation 216
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 8524
19.1%
1 6893
15.5%
8 5428
12.2%
0 4997
11.2%
2 4497
10.1%
3 3486
7.8%
6 3105
 
7.0%
5 3082
 
6.9%
7 2626
 
5.9%
9 1975
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 216
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44829
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 8524
19.0%
1 6893
15.4%
8 5428
12.1%
0 4997
11.1%
2 4497
10.0%
3 3486
7.8%
6 3105
 
6.9%
5 3082
 
6.9%
7 2626
 
5.9%
9 1975
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44829
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 8524
19.0%
1 6893
15.4%
8 5428
12.1%
0 4997
11.1%
2 4497
10.0%
3 3486
7.8%
6 3105
 
6.9%
5 3082
 
6.9%
7 2626
 
5.9%
9 1975
 
4.4%
Distinct7646
Distinct (%)96.0%
Missing1
Missing (%)< 0.1%
Memory size62.3 KiB
2023-12-11T06:30:41.194819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length52
Mean length26.790908
Min length10

Characters and Unicode

Total characters213336
Distinct characters562
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7354 ?
Unique (%)92.4%

Sample

1st row경기도 가평군 가평읍 읍내리 506-6번지
2nd row경기도 가평군 가평읍 대곡리 316번지
3rd row경기도 가평군 설악면 신천리 407-9번지
4th row경기도 가평군 가평읍 읍내리 493-6번지
5th row경기도 가평군 가평읍 대곡리 285-9번지
ValueCountFrequency (%)
경기도 7963
 
17.6%
고양시 715
 
1.6%
수원시 672
 
1.5%
2층 663
 
1.5%
용인시 595
 
1.3%
성남시 559
 
1.2%
3층 554
 
1.2%
부천시 543
 
1.2%
안산시 475
 
1.1%
화성시 462
 
1.0%
Other values (9780) 31929
70.7%
2023-12-11T06:30:41.736964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38068
 
17.8%
8333
 
3.9%
8310
 
3.9%
8234
 
3.9%
8125
 
3.8%
8025
 
3.8%
1 7781
 
3.6%
7724
 
3.6%
6629
 
3.1%
2 6483
 
3.0%
Other values (552) 105624
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 124095
58.2%
Decimal Number 43759
 
20.5%
Space Separator 38068
 
17.8%
Dash Punctuation 5688
 
2.7%
Other Punctuation 774
 
0.4%
Uppercase Letter 378
 
0.2%
Open Punctuation 192
 
0.1%
Close Punctuation 192
 
0.1%
Math Symbol 135
 
0.1%
Lowercase Letter 47
 
< 0.1%
Other values (2) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8333
 
6.7%
8310
 
6.7%
8234
 
6.6%
8125
 
6.5%
8025
 
6.5%
7724
 
6.2%
6629
 
5.3%
3632
 
2.9%
2799
 
2.3%
2492
 
2.0%
Other values (487) 59792
48.2%
Uppercase Letter
ValueCountFrequency (%)
B 100
26.5%
A 84
22.2%
L 24
 
6.3%
S 23
 
6.1%
P 23
 
6.1%
T 17
 
4.5%
K 17
 
4.5%
C 15
 
4.0%
G 13
 
3.4%
M 10
 
2.6%
Other values (15) 52
13.8%
Lowercase Letter
ValueCountFrequency (%)
e 17
36.2%
l 8
17.0%
a 8
17.0%
c 4
 
8.5%
b 2
 
4.3%
s 2
 
4.3%
h 1
 
2.1%
g 1
 
2.1%
z 1
 
2.1%
p 1
 
2.1%
Other values (2) 2
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 7781
17.8%
2 6483
14.8%
3 5289
12.1%
0 5202
11.9%
4 4181
9.6%
5 3792
8.7%
6 3127
7.1%
7 2981
 
6.8%
8 2590
 
5.9%
9 2333
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 662
85.5%
. 57
 
7.4%
@ 38
 
4.9%
/ 10
 
1.3%
& 4
 
0.5%
· 2
 
0.3%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Math Symbol
ValueCountFrequency (%)
~ 133
98.5%
2
 
1.5%
Space Separator
ValueCountFrequency (%)
38068
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5688
100.0%
Open Punctuation
ValueCountFrequency (%)
( 192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 192
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124094
58.2%
Common 88809
41.6%
Latin 432
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8333
 
6.7%
8310
 
6.7%
8234
 
6.6%
8125
 
6.5%
8025
 
6.5%
7724
 
6.2%
6629
 
5.3%
3632
 
2.9%
2799
 
2.3%
2492
 
2.0%
Other values (486) 59791
48.2%
Latin
ValueCountFrequency (%)
B 100
23.1%
A 84
19.4%
L 24
 
5.6%
S 23
 
5.3%
P 23
 
5.3%
T 17
 
3.9%
e 17
 
3.9%
K 17
 
3.9%
C 15
 
3.5%
G 13
 
3.0%
Other values (31) 99
22.9%
Common
ValueCountFrequency (%)
38068
42.9%
1 7781
 
8.8%
2 6483
 
7.3%
- 5688
 
6.4%
3 5289
 
6.0%
0 5202
 
5.9%
4 4181
 
4.7%
5 3792
 
4.3%
6 3127
 
3.5%
7 2981
 
3.4%
Other values (14) 6217
 
7.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 124093
58.2%
ASCII 89228
41.8%
Number Forms 7
 
< 0.1%
None 3
 
< 0.1%
Math Operators 2
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38068
42.7%
1 7781
 
8.7%
2 6483
 
7.3%
- 5688
 
6.4%
3 5289
 
5.9%
0 5202
 
5.8%
4 4181
 
4.7%
5 3792
 
4.2%
6 3127
 
3.5%
7 2981
 
3.3%
Other values (47) 6636
 
7.4%
Hangul
ValueCountFrequency (%)
8333
 
6.7%
8310
 
6.7%
8234
 
6.6%
8125
 
6.5%
8025
 
6.5%
7724
 
6.2%
6629
 
5.3%
3632
 
2.9%
2799
 
2.3%
2492
 
2.0%
Other values (485) 59790
48.2%
Number Forms
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Math Operators
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct6869
Distinct (%)91.8%
Missing481
Missing (%)6.0%
Memory size62.3 KiB
2023-12-11T06:30:42.051736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length56
Mean length31.312976
Min length13

Characters and Unicode

Total characters234315
Distinct characters600
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6287 ?
Unique (%)84.0%

Sample

1st row경기도 가평군 가평읍 보납로 1
2nd row경기도 가평군 가평읍 문화로 131
3rd row경기도 가평군 설악면 신천중앙로 87-16
4th row경기도 가평군 가평읍 석봉로 168
5th row경기도 가평군 가평읍 석봉로153번길 12
ValueCountFrequency (%)
경기도 7483
 
15.5%
2층 717
 
1.5%
고양시 672
 
1.4%
수원시 640
 
1.3%
용인시 553
 
1.1%
성남시 540
 
1.1%
부천시 525
 
1.1%
3층 524
 
1.1%
안산시 445
 
0.9%
남양주시 427
 
0.9%
Other values (8090) 35832
74.1%
2023-12-11T06:30:42.510797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42420
 
18.1%
7953
 
3.4%
7942
 
3.4%
7891
 
3.4%
7835
 
3.3%
7793
 
3.3%
7149
 
3.1%
1 6963
 
3.0%
, 6929
 
3.0%
( 6536
 
2.8%
Other values (590) 124904
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 131450
56.1%
Space Separator 42420
 
18.1%
Decimal Number 38606
 
16.5%
Other Punctuation 6991
 
3.0%
Open Punctuation 6536
 
2.8%
Close Punctuation 6536
 
2.8%
Dash Punctuation 1129
 
0.5%
Uppercase Letter 364
 
0.2%
Math Symbol 217
 
0.1%
Lowercase Letter 58
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7953
 
6.1%
7942
 
6.0%
7891
 
6.0%
7835
 
6.0%
7793
 
5.9%
7149
 
5.4%
3514
 
2.7%
2993
 
2.3%
2840
 
2.2%
2707
 
2.1%
Other values (524) 72833
55.4%
Uppercase Letter
ValueCountFrequency (%)
B 107
29.4%
A 63
17.3%
S 17
 
4.7%
C 17
 
4.7%
L 16
 
4.4%
E 16
 
4.4%
G 16
 
4.4%
K 15
 
4.1%
P 15
 
4.1%
T 13
 
3.6%
Other values (13) 69
19.0%
Lowercase Letter
ValueCountFrequency (%)
e 24
41.4%
l 9
 
15.5%
h 4
 
6.9%
a 4
 
6.9%
t 3
 
5.2%
p 3
 
5.2%
s 2
 
3.4%
o 2
 
3.4%
k 1
 
1.7%
z 1
 
1.7%
Other values (5) 5
 
8.6%
Decimal Number
ValueCountFrequency (%)
1 6963
18.0%
2 6447
16.7%
0 5054
13.1%
3 4941
12.8%
4 3803
9.9%
5 3155
8.2%
6 2432
 
6.3%
7 2140
 
5.5%
8 1930
 
5.0%
9 1741
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 6929
99.1%
. 41
 
0.6%
@ 9
 
0.1%
& 6
 
0.1%
/ 3
 
< 0.1%
· 2
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
50.0%
2
25.0%
1
 
12.5%
1
 
12.5%
Math Symbol
ValueCountFrequency (%)
~ 214
98.6%
2
 
0.9%
+ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
42420
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6536
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6536
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1129
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 131449
56.1%
Common 102435
43.7%
Latin 430
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7953
 
6.1%
7942
 
6.0%
7891
 
6.0%
7835
 
6.0%
7793
 
5.9%
7149
 
5.4%
3514
 
2.7%
2993
 
2.3%
2840
 
2.2%
2707
 
2.1%
Other values (523) 72832
55.4%
Latin
ValueCountFrequency (%)
B 107
24.9%
A 63
14.7%
e 24
 
5.6%
S 17
 
4.0%
C 17
 
4.0%
L 16
 
3.7%
E 16
 
3.7%
G 16
 
3.7%
K 15
 
3.5%
P 15
 
3.5%
Other values (32) 124
28.8%
Common
ValueCountFrequency (%)
42420
41.4%
1 6963
 
6.8%
, 6929
 
6.8%
( 6536
 
6.4%
) 6536
 
6.4%
2 6447
 
6.3%
0 5054
 
4.9%
3 4941
 
4.8%
4 3803
 
3.7%
5 3155
 
3.1%
Other values (14) 9651
 
9.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 131449
56.1%
ASCII 102852
43.9%
Number Forms 8
 
< 0.1%
None 3
 
< 0.1%
Math Operators 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42420
41.2%
1 6963
 
6.8%
, 6929
 
6.7%
( 6536
 
6.4%
) 6536
 
6.4%
2 6447
 
6.3%
0 5054
 
4.9%
3 4941
 
4.8%
4 3803
 
3.7%
5 3155
 
3.1%
Other values (49) 10068
 
9.8%
Hangul
ValueCountFrequency (%)
7953
 
6.1%
7942
 
6.0%
7891
 
6.0%
7835
 
6.0%
7793
 
5.9%
7149
 
5.4%
3514
 
2.7%
2993
 
2.3%
2840
 
2.2%
2707
 
2.1%
Other values (523) 72832
55.4%
Number Forms
ValueCountFrequency (%)
4
50.0%
2
25.0%
1
 
12.5%
1
 
12.5%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Math Operators
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct5902
Distinct (%)76.7%
Missing269
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean37.438629
Minimum36.95861
Maximum38.158096
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size70.1 KiB
2023-12-11T06:30:42.669481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.95861
5-th percentile37.100425
Q137.292522
median37.404272
Q337.623213
95-th percentile37.77472
Maximum38.158096
Range1.1994864
Interquartile range (IQR)0.33069009

Descriptive statistics

Standard deviation0.21032539
Coefficient of variation (CV)0.005617871
Kurtosis-0.44267333
Mean37.438629
Median Absolute Deviation (MAD)0.13566901
Skewness0.18856324
Sum288090.25
Variance0.04423677
MonotonicityNot monotonic
2023-12-11T06:30:42.791213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3660200932 6
 
0.1%
37.5880541645 6
 
0.1%
37.2933002148 6
 
0.1%
37.3084437746 5
 
0.1%
37.1563738117 5
 
0.1%
37.2971310229 5
 
0.1%
37.6719368348 5
 
0.1%
37.4008981603 5
 
0.1%
37.4725250423 5
 
0.1%
37.1363065699 5
 
0.1%
Other values (5892) 7642
96.0%
(Missing) 269
 
3.4%
ValueCountFrequency (%)
36.9586098016 2
< 0.1%
36.9605410794 1
< 0.1%
36.9632453036 1
< 0.1%
36.9643269427 1
< 0.1%
36.9643606434 2
< 0.1%
36.9646954649 1
< 0.1%
36.9653633742 1
< 0.1%
36.9662549734 1
< 0.1%
36.9768403552 1
< 0.1%
36.9776918612 1
< 0.1%
ValueCountFrequency (%)
38.1580962294 1
< 0.1%
38.10702468 2
< 0.1%
38.0994814482 2
< 0.1%
38.0981274274 2
< 0.1%
38.0922736597 2
< 0.1%
38.0344901657 1
< 0.1%
38.0327840029 1
< 0.1%
38.0322174448 1
< 0.1%
38.0320288045 1
< 0.1%
38.0305162095 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct5902
Distinct (%)76.7%
Missing269
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean126.99957
Minimum126.58183
Maximum127.71417
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size70.1 KiB
2023-12-11T06:30:42.937846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.58183
5-th percentile126.7393
Q1126.83235
median127.02408
Q3127.1239
95-th percentile127.28419
Maximum127.71417
Range1.1323392
Interquartile range (IQR)0.29154956

Descriptive statistics

Standard deviation0.18823861
Coefficient of variation (CV)0.0014821988
Kurtosis0.28203379
Mean126.99957
Median Absolute Deviation (MAD)0.13822035
Skewness0.47918513
Sum977261.69
Variance0.035433774
MonotonicityNot monotonic
2023-12-11T06:30:43.075580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9642912126 6
 
0.1%
127.2132623546 6
 
0.1%
126.8648009663 6
 
0.1%
126.8273443791 5
 
0.1%
127.0779658056 5
 
0.1%
126.9937659478 5
 
0.1%
126.7589777549 5
 
0.1%
126.9083869732 5
 
0.1%
126.8033941135 5
 
0.1%
127.0825583853 5
 
0.1%
Other values (5892) 7642
96.0%
(Missing) 269
 
3.4%
ValueCountFrequency (%)
126.5818311267 1
< 0.1%
126.5829862284 2
< 0.1%
126.5837814903 1
< 0.1%
126.5845685345 1
< 0.1%
126.5847887606 1
< 0.1%
126.5862207582 1
< 0.1%
126.5864818952 2
< 0.1%
126.5937185429 1
< 0.1%
126.5943919211 1
< 0.1%
126.5951969768 1
< 0.1%
ValueCountFrequency (%)
127.7141703278 1
< 0.1%
127.7083286196 1
< 0.1%
127.6809393215 1
< 0.1%
127.6808831865 1
< 0.1%
127.6612332091 1
< 0.1%
127.6472836094 1
< 0.1%
127.6464884895 1
< 0.1%
127.6453885949 1
< 0.1%
127.6451846932 1
< 0.1%
127.6450606674 1
< 0.1%

Interactions

2023-12-11T06:30:38.354111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:30:38.153190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:30:38.457662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:30:38.249331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:30:43.155207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명시설구분명WGS84위도WGS84경도
시군명1.0000.1080.9620.941
시설구분명0.1081.0000.1020.042
WGS84위도0.9620.1021.0000.619
WGS84경도0.9410.0420.6191.000
2023-12-11T06:30:43.423651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분명시군명
시설구분명1.0000.042
시군명0.0421.000
2023-12-11T06:30:43.492631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
WGS84위도WGS84경도시군명시설구분명
WGS84위도1.000-0.1990.7790.049
WGS84경도-0.1991.0000.7040.020
시군명0.7790.7041.0000.042
시설구분명0.0490.0200.0421.000

Missing values

2023-12-11T06:30:38.580998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:30:38.706369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:30:38.837267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명시설구분명시설명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
0가평군체육도장업해동검도 가평본관12413경기도 가평군 가평읍 읍내리 506-6번지경기도 가평군 가평읍 보납로 137.831465127.510671
1가평군육상시설가평종합운동장12416경기도 가평군 가평읍 대곡리 316번지경기도 가평군 가평읍 문화로 13137.824874127.507218
2가평군체육도장업중앙체육관477853경기도 가평군 설악면 신천리 407-9번지경기도 가평군 설악면 신천중앙로 87-1637.676004127.493139
3가평군체육도장업충효체육관477801경기도 가평군 가평읍 읍내리 493-6번지경기도 가평군 가평읍 석봉로 16837.830122127.510977
4가평군체육도장업혜성운암태권도장477804경기도 가평군 가평읍 대곡리 285-9번지경기도 가평군 가평읍 석봉로153번길 1237.828734127.50975
5가평군체육도장업태산태권도전문도장477815경기도 가평군 청평면 청평리 417번지경기도 가평군 청평면 여울길 21-237.737304127.420531
6가평군체육도장업가평경기태권도체육관477801경기도 가평군 가평읍 읍내리 460번지경기도 가평군 가평읍 연인2길 1237.830176127.512193
7가평군체육도장업승우체육관477804경기도 가평군 가평읍 대곡리 258-22번지경기도 가평군 가평읍 광장로 537.82621127.511227
8가평군체육도장업청평 권투체육관477815경기도 가평군 청평면 청평리 438-26번지경기도 가평군 청평면 청평중앙로 4-137.735787127.415675
9가평군체육도장업크로스나인12437경기도 가평군 조종면 현리 296-8경기도 가평군 조종면 청군로 1292, 2층37.819549127.34643
시군명시설구분명시설명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
7954화성시테니스조암 공공하수처리장 테니스장<NA>경기도 화성시 석우동 4<NA>37.224133127.076335
7955화성시테니스동탄2 제1호 체육공원 테니스장18472경기도 화성시 영천동 670-2경기도 화성시 동탄순환대로29길 3837.20587127.106867
7956화성시테니스진안공공지18389경기도 화성시 진안동 933경기도 화성시 병점중앙로 214-6237.2179127.0314
7957화성시테니스금반저류지18633경기도 화성시 양감면 신왕리 640-7<NA>37.081705126.946436
7958화성시사격장경기도종합사격장18626경기도 화성시 양감면 사창리 810-2번지경기도 화성시 양감면 사격장길 14237.093594126.956236
7959화성시체육도장업한체대왕배태권도장18486경기도 화성시 오산동 1031-3번지경기도 화성시 동탄대로14길 5-20 (오산동)37.185411127.10214
7960화성시체육도장업TKD 뮤직점핑 줄넘기18477경기도 화성시 청계동 514-2번지경기도 화성시 동탄대로시범길 253 (청계동)37.202215127.10275
7961화성시체육도장업ECN태권도장18476경기도 화성시 청계동 525번지 시범호반베르디움 207,208호경기도 화성시 동탄대로시범길 122, 207,208호 (청계동, 시범호반베르디움)37.197684127.111777
7962화성시체육도장업경희대예솔태권도장18483경기도 화성시 청계동 556번지 청계숲사랑으로부영 상가동 201호경기도 화성시 동탄순환대로22길 46, 201호 (청계동, 청계숲사랑으로부영 상가동)37.197783127.120205
7963화성시체육도장업MTA태권도18396경기도 화성시 반월동 627-3번지 3층경기도 화성시 동탄원천로 382-36, 3층 (반월동)37.222058127.057942

Duplicate rows

Most frequently occurring

시군명시설구분명시설명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도# duplicates
0과천시체육도장업JL태권도장13824경기도 과천시 갈현동 253 M타워 과천경기도 과천시 과천대로2길 12, M타워 과천 201~203호 (갈현동)37.407113126.9803382
1광명시체육도장업소하 태권도 체육관423823경기도 광명시 소하동 883-29번지경기도 광명시 기아로5번길 13 (소하동)37.437692126.880292
2군포시체육도장업용인대 가야태권도15869경기도 군포시 산본동 1155-4번지 도장교육센터경기도 군포시 번영로 373 (산본동, 도장교육센터)37.349095126.9246342
3군포시체육도장업품 태권도장15800경기도 군포시 산본동 1082-1번지 1층경기도 군포시 산본로482번길 3, 1층 (산본동)37.372186126.9279032
4남양주시체육도장업도제원 태권도장472821경기도 남양주시 퇴계원면 퇴계원리 211-5번지경기도 남양주시 퇴계원면 경춘북로558번길 637.647747127.145712
5부천시체육도장업베스트 체대입시422807경기도 부천시 소사본동 75-3번지경기도 부천시 경인옛로 27 (소사본동)37.480984126.7949352
6부천시체육도장업파이널 유도 멀티짐14726경기도 부천시 송내동 586번지 4층경기도 부천시 경인로 132, 4층 (송내동)37.483743126.7698672
7수원시체육도장업매탄체육관443800경기도 수원시 영통구 매탄동 172-51번지경기도 수원시 영통구 중부대로246번길 52 (매탄동)37.272817127.0415732
8수원시테니스화산체육공원 테니스장18358경기도 화성시 송산동 5-1번지경기도 화성시 태안로 26337.214515127.0211312
9안산시체육도장업EQ태권도425856경기도 안산시 단원구 초지동 743-1번지 비젼타운 509호경기도 안산시 단원구 원포공원1로 67, 509호 (초지동,비젼타운)37.302532126.8103812