Overview

Dataset statistics

Number of variables: 12
Number of observations: 2242
Missing cells: 369
Missing cells (%): 1.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 219.1 KiB
Average record size in memory: 100.1 B
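
The two derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and that 1 KiB means 1024 bytes; both are assumptions about how the profiler computes these values, not statements made by the report.

```python
# Cross-check the derived dataset statistics from the reported raw counts.
n_variables = 12
n_observations = 2242
missing_cells = 369
total_size_kib = 219.1  # as reported; assumed to be 1024-byte KiB

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the recomputed values round to the reported 1.4% and 100.1 B.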

Variable types

Categorical: 4
Text: 5
Numeric: 3

Alerts

집계년도 has constant value "2021" (Constant)
분기 has constant value "하반기" (second half of the year) (Constant)
소재지우편번호 is highly overall correlated with WGS84위도 and 1 other field (High correlation)
WGS84위도 is highly overall correlated with 소재지우편번호 and 1 other field (High correlation)
WGS84경도 is highly overall correlated with 시군명 (High correlation)
시군명 is highly overall correlated with 소재지우편번호 and 2 other fields (High correlation)
업태명 is highly imbalanced (60.7%) (Imbalance)
전화번호 has 342 (15.3%) missing values (Missing)
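
The imbalance percentage in the 업태명 alert is a single score that summarizes how unevenly the categories are populated. One common way to define such a score is one minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition (the report does not state its formula), and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts (0 means perfectly balanced)."""
    counts = [c for c in counts if c > 0]
    k = len(counts)
    if k <= 1:
        return 0.0  # with a single class, this definition reports no imbalance
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts)
    return 1.0 - entropy / math.log2(k)

# Toy counts for illustration, not the report's data.
print(imbalance_score([50, 50]))
print(round(imbalance_score([90, 10]), 3))
```

The reported 60.7% cannot be recomputed from the listings in this report alone, because the rarer categories are lumped into an "Other values" row and their individual counts are not shown.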

Reproduction

Analysis started: 2023-12-10 21:54:10.939530
Analysis finished: 2023-12-10 21:54:13.342254
Duration: 2.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
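
A report of this kind is typically produced by loading the data into a pandas DataFrame and passing it to `ProfileReport`. The sketch below shows that general pattern only. The function name `build_report`, the file names, and the title are hypothetical, and the settings actually used for this report (stored in config.json) are not reproduced here.

```python
def build_report(csv_path, html_path, title="Profiling Report"):
    """Profile a CSV file and write an HTML report (needs pandas and ydata-profiling)."""
    # Imports are local so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # hypothetical input; encoding or dtype options may be needed
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile

# Example call with hypothetical file names:
# build_report("restaurants.csv", "report.html")
```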

Variables

집계년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Value | Count
2021 | 2242

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value | Count | Frequency (%)
2021 | 2242 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

분기
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Value | Count
하반기 | 2242

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 하반기
2nd row: 하반기
3rd row: 하반기
4th row: 하반기
5th row: 하반기

Common Values

Value | Count | Frequency (%)
하반기 | 2242 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시군명
Categorical

HIGH CORRELATION 

Distinct: 31
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Value | Count
성남시 | 251
수원시 | 189
고양시 | 152
용인시 | 137
안양시 | 136
Other values (26) | 1377

Length

Max length: 4
Median length: 3
Mean length: 3.0950045
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
성남시 | 251 | 11.2%
수원시 | 189 | 8.4%
고양시 | 152 | 6.8%
용인시 | 137 | 6.1%
안양시 | 136 | 6.1%
남양주시 | 131 | 5.8%
안산시 | 107 | 4.8%
시흥시 | 95 | 4.2%
포천시 | 79 | 3.5%
김포시 | 72 | 3.2%
Other values (21) | 893 | 39.8%

Length

[Figure: histogram of lengths of the category]

업소명
Text

Distinct: 2115
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Length

Max length: 20
Median length: 17
Mean length: 6.2609277
Min length: 1

Characters and Unicode

Total characters: 14037
Distinct characters: 678
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2035
Unique (%): 90.8%

Sample

1st row: 산이좋은사람들
2nd row: 종점가든
3rd row: 가평(서울방향)휴게소 한식당
4th row: 가평(춘천방향)휴게소 한식당
5th row: 가평축협 한우명가

Value | Count | Frequency (%)
주식회사 | 10 | 0.4%
장수촌 | 8 | 0.3%
신선설농탕 | 6 | 0.2%
돈까스클럽 | 6 | 0.2%
남원추어탕 | 6 | 0.2%
육대장 | 6 | 0.2%
무봉리토종순대국 | 6 | 0.2%
사조참치 | 5 | 0.2%
계절밥상 | 5 | 0.2%
양촌리 | 5 | 0.2%
Other values (2319) | 2503 | 97.5%

Most occurring characters

ValueCountFrequency (%)
337
 
2.4%
296
 
2.1%
234
 
1.7%
233
 
1.7%
208
 
1.5%
207
 
1.5%
202
 
1.4%
183
 
1.3%
176
 
1.3%
172
 
1.2%
Other values (668) 11789
84.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13118
93.5%
Space Separator 337
 
2.4%
Close Punctuation 165
 
1.2%
Open Punctuation 163
 
1.2%
Uppercase Letter 83
 
0.6%
Decimal Number 73
 
0.5%
Lowercase Letter 50
 
0.4%
Other Punctuation 45
 
0.3%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
296
 
2.3%
234
 
1.8%
233
 
1.8%
208
 
1.6%
207
 
1.6%
202
 
1.5%
183
 
1.4%
176
 
1.3%
172
 
1.3%
172
 
1.3%
Other values (608) 11035
84.1%
Uppercase Letter
ValueCountFrequency (%)
A 12
14.5%
C 8
 
9.6%
T 7
 
8.4%
E 6
 
7.2%
O 6
 
7.2%
B 6
 
7.2%
S 6
 
7.2%
I 4
 
4.8%
R 4
 
4.8%
N 3
 
3.6%
Other values (12) 21
25.3%
Lowercase Letter
ValueCountFrequency (%)
e 8
16.0%
a 6
12.0%
o 5
10.0%
p 5
10.0%
i 4
8.0%
l 4
8.0%
s 4
8.0%
h 3
 
6.0%
n 3
 
6.0%
r 3
 
6.0%
Other values (5) 5
10.0%
Decimal Number
ValueCountFrequency (%)
2 18
24.7%
1 12
16.4%
3 10
13.7%
0 9
12.3%
5 8
11.0%
4 5
 
6.8%
9 4
 
5.5%
6 3
 
4.1%
8 3
 
4.1%
7 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
& 16
35.6%
. 12
26.7%
· 6
 
13.3%
, 4
 
8.9%
' 2
 
4.4%
! 2
 
4.4%
? 1
 
2.2%
/ 1
 
2.2%
@ 1
 
2.2%
Space Separator
ValueCountFrequency (%)
337
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%
Open Punctuation
ValueCountFrequency (%)
( 163
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13115
93.4%
Common 783
 
5.6%
Latin 133
 
0.9%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
296
 
2.3%
234
 
1.8%
233
 
1.8%
208
 
1.6%
207
 
1.6%
202
 
1.5%
183
 
1.4%
176
 
1.3%
172
 
1.3%
172
 
1.3%
Other values (603) 11032
84.1%
Latin
ValueCountFrequency (%)
A 12
 
9.0%
C 8
 
6.0%
e 8
 
6.0%
T 7
 
5.3%
E 6
 
4.5%
a 6
 
4.5%
O 6
 
4.5%
B 6
 
4.5%
S 6
 
4.5%
o 5
 
3.8%
Other values (27) 63
47.4%
Common
ValueCountFrequency (%)
337
43.0%
) 165
21.1%
( 163
20.8%
2 18
 
2.3%
& 16
 
2.0%
. 12
 
1.5%
1 12
 
1.5%
3 10
 
1.3%
0 9
 
1.1%
5 8
 
1.0%
Other values (12) 33
 
4.2%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13111
93.4%
ASCII 910
 
6.5%
None 9
 
0.1%
CJK 6
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
337
37.0%
) 165
18.1%
( 163
17.9%
2 18
 
2.0%
& 16
 
1.8%
. 12
 
1.3%
A 12
 
1.3%
1 12
 
1.3%
3 10
 
1.1%
0 9
 
1.0%
Other values (48) 156
17.1%
Hangul
ValueCountFrequency (%)
296
 
2.3%
234
 
1.8%
233
 
1.8%
208
 
1.6%
207
 
1.6%
202
 
1.5%
183
 
1.4%
176
 
1.3%
172
 
1.3%
172
 
1.3%
Other values (601) 11028
84.1%
None
ValueCountFrequency (%)
· 6
66.7%
3
33.3%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct: 1894
Distinct (%): 99.7%
Missing: 342
Missing (%): 15.3%
Memory size: 17.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.006842
Min length: 9

Characters and Unicode

Total characters: 22813
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1888
Unique (%): 99.4%

Sample

1st row: 031-585-8645
2nd row: 031-584-0716
3rd row: 031-584-1425
4th row: 031-584-1426
5th row: 031-581-1592

Value | Count | Frequency (%)
031-238-3883 | 2 | 0.1%
031-776-0988 | 2 | 0.1%
031-264-9959 | 2 | 0.1%
031-715-2708 | 2 | 0.1%
031-703-8892 | 2 | 0.1%
031-286-5001 | 2 | 0.1%
031-424-0094 | 1 | 0.1%
031-469-0041 | 1 | 0.1%
031-8086-9030 | 1 | 0.1%
031-458-8878 | 1 | 0.1%
Other values (1884) | 1884 | 99.2%

Most occurring characters

ValueCountFrequency (%)
- 3799
16.7%
3 3288
14.4%
0 3149
13.8%
1 2865
12.6%
2 1697
7.4%
7 1515
 
6.6%
5 1504
 
6.6%
8 1456
 
6.4%
9 1293
 
5.7%
6 1193
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 19014
83.3%
Dash Punctuation 3799
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 3288
17.3%
0 3149
16.6%
1 2865
15.1%
2 1697
8.9%
7 1515
8.0%
5 1504
7.9%
8 1456
7.7%
9 1293
 
6.8%
6 1193
 
6.3%
4 1054
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 3799
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22813
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3799
16.7%
3 3288
14.4%
0 3149
13.8%
1 2865
12.6%
2 1697
7.4%
7 1515
 
6.6%
5 1504
 
6.6%
8 1456
 
6.4%
9 1293
 
5.7%
6 1193
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22813
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3799
16.7%
3 3288
14.4%
0 3149
13.8%
1 2865
12.6%
2 1697
7.4%
7 1515
 
6.6%
5 1504
 
6.6%
8 1456
 
6.4%
9 1293
 
5.7%
6 1193
 
5.2%
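
The 전화번호 length and character figures can be cross-checked against each other. The sketch below assumes that the mean length is averaged over non-missing values only; that is an assumption about the profiler, although it is the reading under which the reported numbers agree.

```python
# Cross-check the 전화번호 length statistics from the reported counts.
n_observations = 2242
n_missing = 342
total_characters = 22813
digit_characters = 19014   # "Decimal Number" category
dash_characters = 3799     # "Dash Punctuation" category

n_present = n_observations - n_missing
mean_length = total_characters / n_present
dashes_per_value = dash_characters / n_present

# The two character categories should account for every character.
assert digit_characters + dash_characters == total_characters
print(f"non-missing values: {n_present}")
print(f"mean length: {mean_length:.6f}")
print(f"dashes per value: {dashes_per_value:.4f}")
```

A dash count just under two per value suggests that nearly all numbers follow a three-part pattern such as 031-585-8645, with a single exception.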

업태명
Categorical

IMBALANCE 

Distinct: 21
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Value | Count
한식 | 1652
일식 | 150
식육(숯불구이) | 101
중국식 | 58
경양식 | 54
Other values (16) | 227

Length

Max length: 8
Median length: 2
Mean length: 2.4063336
Min length: 2

Unique

Unique: 4
Unique (%): 0.2%

Sample

1st row: 경양식
2nd row: 한식
3rd row: 한식
4th row: 한식
5th row: 한식

Common Values

Value | Count | Frequency (%)
한식 | 1652 | 73.7%
일식 | 150 | 6.7%
식육(숯불구이) | 101 | 4.5%
중국식 | 58 | 2.6%
경양식 | 54 | 2.4%
뷔페식 | 44 | 2.0%
중식 | 37 | 1.7%
양식 | 29 | 1.3%
기타 | 28 | 1.2%
분식 | 23 | 1.0%
Other values (11) | 66 | 2.9%

Length

[Figure: histogram of lengths of the category]

주메뉴명
Text

Distinct: 940
Distinct (%): 42.1%
Missing: 7
Missing (%): 0.3%
Memory size: 17.6 KiB

Length

Max length: 18
Median length: 17
Mean length: 4.5525727
Min length: 1

Characters and Unicode

Total characters: 10175
Distinct characters: 326
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 708
Unique (%): 31.7%

Sample

1st row: 돈까스
2nd row: 잣칼국수
3rd row: 황태해장국
4th row: 황태해장국
5th row: 등심

Value | Count | Frequency (%)
한정식 | 71 | 2.8%
갈비 | 61 | 2.4%
돼지갈비 | 53 | 2.1%
추어탕 | 51 | 2.0%
부대찌개 | 45 | 1.8%
감자탕 | 44 | 1.7%
칼국수 | 37 | 1.5%
삼겹살 | 37 | 1.5%
순대국 | 36 | 1.4%
생선회 | 33 | 1.3%
Other values (852) | 2069 | 81.6%

Most occurring characters

ValueCountFrequency (%)
, 542
 
5.3%
370
 
3.6%
339
 
3.3%
318
 
3.1%
313
 
3.1%
259
 
2.5%
243
 
2.4%
222
 
2.2%
203
 
2.0%
199
 
2.0%
Other values (316) 7167
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9263
91.0%
Other Punctuation 557
 
5.5%
Space Separator 313
 
3.1%
Close Punctuation 20
 
0.2%
Open Punctuation 20
 
0.2%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
370
 
4.0%
339
 
3.7%
318
 
3.4%
259
 
2.8%
243
 
2.6%
222
 
2.4%
203
 
2.2%
199
 
2.1%
197
 
2.1%
176
 
1.9%
Other values (308) 6737
72.7%
Other Punctuation
ValueCountFrequency (%)
, 542
97.3%
. 14
 
2.5%
/ 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
313
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9263
91.0%
Common 910
 
8.9%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
370
 
4.0%
339
 
3.7%
318
 
3.4%
259
 
2.8%
243
 
2.6%
222
 
2.4%
203
 
2.2%
199
 
2.1%
197
 
2.1%
176
 
1.9%
Other values (308) 6737
72.7%
Common
ValueCountFrequency (%)
, 542
59.6%
313
34.4%
) 20
 
2.2%
( 20
 
2.2%
. 14
 
1.5%
/ 1
 
0.1%
Latin
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9263
91.0%
ASCII 912
 
9.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 542
59.4%
313
34.3%
) 20
 
2.2%
( 20
 
2.2%
. 14
 
1.5%
L 1
 
0.1%
A 1
 
0.1%
/ 1
 
0.1%
Hangul
ValueCountFrequency (%)
370
 
4.0%
339
 
3.7%
318
 
3.4%
259
 
2.8%
243
 
2.6%
222
 
2.4%
203
 
2.2%
199
 
2.1%
197
 
2.1%
176
 
1.9%
Other values (308) 6737
72.7%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1266
Distinct (%): 56.6%
Missing: 4
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 13963.008
Minimum: 10000
Maximum: 18614
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.8 KiB

Quantile statistics

Minimum: 10000
5-th percentile: 10301
Q1: 12010.5
Median: 13620
Q3: 16223
95-th percentile: 17935.15
Maximum: 18614
Range: 8614
Interquartile range (IQR): 4212.5

Descriptive statistics

Standard deviation: 2402.9502
Coefficient of variation (CV): 0.17209402
Kurtosis: -1.0902067
Mean: 13963.008
Median Absolute Deviation (MAD): 1929.5
Skewness: 0.15626268
Sum: 31249211
Variance: 5774169.5
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
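
Several of the 소재지우편번호 statistics are simple functions of the others, so they can be checked for internal consistency. The sketch below uses the standard definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean times the non-missing count). That the profiler follows exactly these definitions is an assumption, and small differences are expected because the report shows rounded values.

```python
# Values as reported for 소재지우편번호.
minimum, maximum = 10000, 18614
q1, q3 = 12010.5, 16223
mean, std = 13963.008, 2402.9502
n_present = 2242 - 4  # observations minus missing values

value_range = maximum - minimum
iqr = q3 - q1
cv = std / mean
variance = std ** 2
total = mean * n_present

print(f"range = {value_range}, IQR = {iqr}")
print(f"CV = {cv:.6f}")
print(f"variance ~ {variance:.1f}, sum ~ {total:.0f}")
```

Each recomputed value agrees with the reported one to within rounding.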
ValueCountFrequency (%)
10301 15
 
0.7%
15865 15
 
0.7%
15040 13
 
0.6%
11103 12
 
0.5%
13505 11
 
0.5%
15062 11
 
0.5%
14240 10
 
0.4%
15590 10
 
0.4%
14066 10
 
0.4%
10383 10
 
0.4%
Other values (1256) 2121
94.6%
ValueCountFrequency (%)
10000 1
 
< 0.1%
10011 1
 
< 0.1%
10012 1
 
< 0.1%
10016 1
 
< 0.1%
10017 2
0.1%
10018 2
0.1%
10020 3
0.1%
10023 1
 
< 0.1%
10024 2
0.1%
10039 2
0.1%
ValueCountFrequency (%)
18614 1
< 0.1%
18611 1
< 0.1%
18608 1
< 0.1%
18595 1
< 0.1%
18593 1
< 0.1%
18589 2
0.1%
18578 1
< 0.1%
18577 1
< 0.1%
18574 1
< 0.1%
18555 2
0.1%

소재지지번주소
Text

Distinct: 2111
Distinct (%): 94.8%
Missing: 16
Missing (%): 0.7%
Memory size: 17.6 KiB

Length

Max length: 50
Median length: 31
Mean length: 21.78212
Min length: 15

Characters and Unicode

Total characters: 48487
Distinct characters: 320
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2021
Unique (%): 90.8%

Sample

1st row: 경기도 가평군 조종면 운악리 486-13번지
2nd row: 경기도 가평군 설악면 가일리 249-3번지
3rd row: 경기도 가평군 설악면 미사리 145-3번지
4th row: 경기도 가평군 설악면 미사리 149-4번지
5th row: 경기도 가평군 가평읍 달전리 382-1번지

Value | Count | Frequency (%)
경기도 | 2226 | 21.4%
성남시 | 244 | 2.3%
수원시 | 189 | 1.8%
고양시 | 151 | 1.4%
용인시 | 137 | 1.3%
안양시 | 136 | 1.3%
분당구 | 132 | 1.3%
남양주시 | 131 | 1.3%
안산시 | 106 | 1.0%
시흥시 | 95 | 0.9%
Other values (2780) | 6871 | 66.0%

Most occurring characters

ValueCountFrequency (%)
8192
 
16.9%
2313
 
4.8%
2295
 
4.7%
2289
 
4.7%
2253
 
4.6%
2227
 
4.6%
2225
 
4.6%
1951
 
4.0%
1 1864
 
3.8%
- 1710
 
3.5%
Other values (310) 21168
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29492
60.8%
Decimal Number 9051
 
18.7%
Space Separator 8192
 
16.9%
Dash Punctuation 1710
 
3.5%
Lowercase Letter 27
 
0.1%
Uppercase Letter 12
 
< 0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2313
 
7.8%
2295
 
7.8%
2289
 
7.8%
2253
 
7.6%
2227
 
7.6%
2225
 
7.5%
1951
 
6.6%
1052
 
3.6%
714
 
2.4%
564
 
1.9%
Other values (285) 11609
39.4%
Decimal Number
ValueCountFrequency (%)
1 1864
20.6%
2 1108
12.2%
3 956
10.6%
4 893
9.9%
5 883
9.8%
7 753
8.3%
6 743
 
8.2%
8 665
 
7.3%
0 605
 
6.7%
9 581
 
6.4%
Lowercase Letter
ValueCountFrequency (%)
m 6
22.2%
e 3
11.1%
c 3
11.1%
a 3
11.1%
l 3
11.1%
t 3
11.1%
i 3
11.1%
u 3
11.1%
Uppercase Letter
ValueCountFrequency (%)
P 3
25.0%
S 3
25.0%
C 3
25.0%
I 3
25.0%
Space Separator
ValueCountFrequency (%)
8192
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1710
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29492
60.8%
Common 18956
39.1%
Latin 39
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2313
 
7.8%
2295
 
7.8%
2289
 
7.8%
2253
 
7.6%
2227
 
7.6%
2225
 
7.5%
1951
 
6.6%
1052
 
3.6%
714
 
2.4%
564
 
1.9%
Other values (285) 11609
39.4%
Common
ValueCountFrequency (%)
8192
43.2%
1 1864
 
9.8%
- 1710
 
9.0%
2 1108
 
5.8%
3 956
 
5.0%
4 893
 
4.7%
5 883
 
4.7%
7 753
 
4.0%
6 743
 
3.9%
8 665
 
3.5%
Other values (3) 1189
 
6.3%
Latin
ValueCountFrequency (%)
m 6
15.4%
e 3
7.7%
c 3
7.7%
a 3
7.7%
l 3
7.7%
P 3
7.7%
t 3
7.7%
i 3
7.7%
u 3
7.7%
S 3
7.7%
Other values (2) 6
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29492
60.8%
ASCII 18995
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8192
43.1%
1 1864
 
9.8%
- 1710
 
9.0%
2 1108
 
5.8%
3 956
 
5.0%
4 893
 
4.7%
5 883
 
4.6%
7 753
 
4.0%
6 743
 
3.9%
8 665
 
3.5%
Other values (15) 1228
 
6.5%
Hangul
ValueCountFrequency (%)
2313
 
7.8%
2295
 
7.8%
2289
 
7.8%
2253
 
7.6%
2227
 
7.6%
2225
 
7.5%
1951
 
6.6%
1052
 
3.6%
714
 
2.4%
564
 
1.9%
Other values (285) 11609
39.4%

소재지도로명주소
Text

Distinct: 2130
Distinct (%): 95.0%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Length

Max length: 30
Median length: 27
Mean length: 19.628011
Min length: 13

Characters and Unicode

Total characters: 44006
Distinct characters: 339
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2043
Unique (%): 91.1%

Sample

1st row: 경기도 가평군 조종면 와곡길 3-16
2nd row: 경기도 가평군 설악면 유명산길 76
3rd row: 경기도 가평군 설악면 미사리로 544
4th row: 경기도 가평군 설악면 미사리로540번길 51
5th row: 경기도 가평군 가평읍 달전로 19

Value | Count | Frequency (%)
경기도 | 2242 | 21.5%
성남시 | 251 | 2.4%
수원시 | 189 | 1.8%
고양시 | 152 | 1.5%
용인시 | 137 | 1.3%
안양시 | 136 | 1.3%
분당구 | 132 | 1.3%
남양주시 | 131 | 1.3%
안산시 | 107 | 1.0%
시흥시 | 95 | 0.9%
Other values (2399) | 6868 | 65.8%

Most occurring characters

ValueCountFrequency (%)
8198
18.6%
2336
 
5.3%
2334
 
5.3%
2309
 
5.2%
2292
 
5.2%
2053
 
4.7%
1 1715
 
3.9%
2 1085
 
2.5%
1064
 
2.4%
3 910
 
2.1%
Other values (329) 19710
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27427
62.3%
Space Separator 8198
 
18.6%
Decimal Number 7957
 
18.1%
Dash Punctuation 423
 
1.0%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2336
 
8.5%
2334
 
8.5%
2309
 
8.4%
2292
 
8.4%
2053
 
7.5%
1064
 
3.9%
880
 
3.2%
670
 
2.4%
656
 
2.4%
568
 
2.1%
Other values (316) 12265
44.7%
Decimal Number
ValueCountFrequency (%)
1 1715
21.6%
2 1085
13.6%
3 910
11.4%
4 736
9.2%
5 680
 
8.5%
6 625
 
7.9%
8 569
 
7.2%
7 567
 
7.1%
0 541
 
6.8%
9 529
 
6.6%
Space Separator
ValueCountFrequency (%)
8198
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 423
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27427
62.3%
Common 16579
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2336
 
8.5%
2334
 
8.5%
2309
 
8.4%
2292
 
8.4%
2053
 
7.5%
1064
 
3.9%
880
 
3.2%
670
 
2.4%
656
 
2.4%
568
 
2.1%
Other values (316) 12265
44.7%
Common
ValueCountFrequency (%)
8198
49.4%
1 1715
 
10.3%
2 1085
 
6.5%
3 910
 
5.5%
4 736
 
4.4%
5 680
 
4.1%
6 625
 
3.8%
8 569
 
3.4%
7 567
 
3.4%
0 541
 
3.3%
Other values (3) 953
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27427
62.3%
ASCII 16579
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8198
49.4%
1 1715
 
10.3%
2 1085
 
6.5%
3 910
 
5.5%
4 736
 
4.4%
5 680
 
4.1%
6 625
 
3.8%
8 569
 
3.4%
7 567
 
3.4%
0 541
 
3.3%
Other values (3) 953
 
5.7%
Hangul
ValueCountFrequency (%)
2336
 
8.5%
2334
 
8.5%
2309
 
8.4%
2292
 
8.4%
2053
 
7.5%
1064
 
3.9%
880
 
3.2%
670
 
2.4%
656
 
2.4%
568
 
2.1%
Other values (316) 12265
44.7%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 2127
Distinct (%): 94.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.453345
Minimum: 36.957325
Maximum: 38.155576
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.8 KiB

Quantile statistics

Minimum: 36.957325
5-th percentile: 37.138463
Q1: 37.300781
Median: 37.404136
Q3: 37.630718
95-th percentile: 37.828845
Maximum: 38.155576
Range: 1.1982516
Interquartile range (IQR): 0.32993642

Descriptive statistics

Standard deviation: 0.21821125
Coefficient of variation (CV): 0.0058262153
Kurtosis: -0.016981287
Mean: 37.453345
Median Absolute Deviation (MAD): 0.13206577
Skewness: 0.40112893
Sum: 83970.399
Variance: 0.047616149
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
37.33698869 6
 
0.3%
37.43993803 5
 
0.2%
37.37625723 4
 
0.2%
37.40061147 4
 
0.2%
37.43255774 4
 
0.2%
37.39269609 3
 
0.1%
37.42688503 3
 
0.1%
37.6712668 3
 
0.1%
37.39719971 3
 
0.1%
37.39321724 3
 
0.1%
Other values (2117) 2204
98.3%
ValueCountFrequency (%)
36.9573249 1
< 0.1%
36.95969101 1
< 0.1%
36.96268988 1
< 0.1%
36.96308191 1
< 0.1%
36.96383349 1
< 0.1%
36.97500236 1
< 0.1%
36.97713875 1
< 0.1%
36.97860048 1
< 0.1%
36.98188362 1
< 0.1%
36.98232154 1
< 0.1%
ValueCountFrequency (%)
38.15557649 1
< 0.1%
38.10482945 1
< 0.1%
38.10074666 1
< 0.1%
38.10017265 1
< 0.1%
38.09727915 1
< 0.1%
38.09102702 1
< 0.1%
38.08958494 1
< 0.1%
38.08922176 1
< 0.1%
38.07603251 1
< 0.1%
38.07431002 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 2127
Distinct (%): 94.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.04187
Minimum: 126.52624
Maximum: 127.77451
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.8 KiB

Quantile statistics

Minimum: 126.52624
5-th percentile: 126.72314
Q1: 126.86912
Median: 127.05236
Q3: 127.14863
95-th percentile: 127.46716
Maximum: 127.77451
Range: 1.2482702
Interquartile range (IQR): 0.27951045

Descriptive statistics

Standard deviation: 0.22080393
Coefficient of variation (CV): 0.0017380405
Kurtosis: 0.40985866
Mean: 127.04187
Median Absolute Deviation (MAD): 0.12602605
Skewness: 0.47598296
Sum: 284827.88
Variance: 0.048754374
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
127.303304 6
 
0.3%
127.1776552 5
 
0.2%
127.1166533 4
 
0.2%
127.1069322 4
 
0.2%
127.1591196 4
 
0.2%
127.1119578 3
 
0.1%
126.9919457 3
 
0.1%
126.7904577 3
 
0.1%
127.1134889 3
 
0.1%
126.9631428 3
 
0.1%
Other values (2117) 2204
98.3%
ValueCountFrequency (%)
126.5262357 1
< 0.1%
126.5319561 1
< 0.1%
126.5340623 1
< 0.1%
126.5346179 1
< 0.1%
126.5417591 1
< 0.1%
126.5482864 1
< 0.1%
126.5523216 1
< 0.1%
126.5575644 1
< 0.1%
126.5590327 1
< 0.1%
126.5605522 1
< 0.1%
ValueCountFrequency (%)
127.7745059 1
< 0.1%
127.7423233 1
< 0.1%
127.7413818 1
< 0.1%
127.7412612 1
< 0.1%
127.7378219 1
< 0.1%
127.7119898 1
< 0.1%
127.6900811 1
< 0.1%
127.6818023 1
< 0.1%
127.6772675 1
< 0.1%
127.6723598 1
< 0.1%
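
The reported minimum and maximum of WGS84위도 and WGS84경도 define a bounding box, and its size in kilometres gives a feel for the area the records cover. The sketch below uses the haversine formula on a spherical Earth of radius 6371 km, which is an approximation chosen here for illustration and not something the report computes. The helper name `haversine_km` is hypothetical.

```python
import math

def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in km between two points given in degrees (spherical Earth)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))

# Reported extremes of WGS84위도 (latitude) and WGS84경도 (longitude).
lat_min, lat_max = 36.957325, 38.155576
lon_min, lon_max = 126.52624, 127.77451
mid_lat = (lat_min + lat_max) / 2

north_south_km = haversine_km(lat_min, lon_min, lat_max, lon_min)
east_west_km = haversine_km(mid_lat, lon_min, mid_lat, lon_max)
print(f"north-south extent: {north_south_km:.0f} km")
print(f"east-west extent: {east_west_km:.0f} km")
```

The east-west extent is measured along the mid-latitude of the box, since the distance spanned by one degree of longitude shrinks away from the equator.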

Interactions

[Figures: 9 interaction plots, not reproduced]

Correlations

Variable | 시군명 | 업태명 | 소재지우편번호 | WGS84위도 | WGS84경도
시군명 | 1.000 | 0.532 | 0.993 | 0.954 | 0.940
업태명 | 0.532 | 1.000 | 0.346 | 0.206 | 0.320
소재지우편번호 | 0.993 | 0.346 | 1.000 | 0.921 | 0.855
WGS84위도 | 0.954 | 0.206 | 0.921 | 1.000 | 0.602
WGS84경도 | 0.940 | 0.320 | 0.855 | 0.602 | 1.000

Variable | 업태명 | 시군명
업태명 | 1.000 | 0.161
시군명 | 0.161 | 1.000

Variable | 소재지우편번호 | WGS84위도 | WGS84경도 | 시군명 | 업태명
소재지우편번호 | 1.000 | -0.898 | 0.029 | 0.934 | 0.134
WGS84위도 | -0.898 | 1.000 | -0.075 | 0.745 | 0.077
WGS84경도 | 0.029 | -0.075 | 1.000 | 0.698 | 0.123
시군명 | 0.934 | 0.745 | 0.698 | 1.000 | 0.161
업태명 | 0.134 | 0.077 | 0.123 | 0.161 | 1.000
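
For a pair of categorical variables such as 업태명 and 시군명, association is often measured with Cramér's V, which rescales the chi-squared statistic of the contingency table to the range 0 to 1. The report does not name the measure behind each matrix, so treating any entry above as Cramér's V is an assumption. The sketch below implements the plain (uncorrected) form; a profiler may apply a bias correction that gives somewhat different numbers.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return 0.0  # undefined with fewer than two levels; reported as 0 here
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))

# Toy labels for illustration, not the report's data.
city = ["A", "A", "B", "B", "C", "C"]
kind = ["x", "x", "y", "y", "x", "y"]
print(round(cramers_v(city, kind), 3))
```

A value of 0 means the two variables are independent in the sample, and 1 means that one variable fully determines the other.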

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
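
The per-variable missing counts reported in the Variables section can be added up and compared with the dataset-level total of 369 missing cells. In the sketch below, 전화번호 and 소재지우편번호 are named next to their counts in the report, while the attribution of the counts 7 and 16 to 주메뉴명 and 소재지지번주소 is inferred from the column order and sample values, so treat those two labels as assumptions.

```python
# Missing counts as reported in the per-variable sections.
missing_by_column = {
    "전화번호": 342,
    "주메뉴명": 7,         # column attribution inferred from column order
    "소재지우편번호": 4,
    "소재지지번주소": 16,    # column attribution inferred from column order
}
n_observations = 2242

total_missing = sum(missing_by_column.values())
print(f"total missing cells: {total_missing}")
for name, count in sorted(missing_by_column.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {count} ({100 * count / n_observations:.1f}%)")
```

The four counts sum to the reported total, and the phone number column accounts for most of the missing cells.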

Sample

First rows

Index | 집계년도 | 분기 | 시군명 | 업소명 | 전화번호 | 업태명 | 주메뉴명 | 소재지우편번호 | 소재지지번주소 | 소재지도로명주소 | WGS84위도 | WGS84경도
0 | 2021 | 하반기 | 가평군 | 산이좋은사람들 | 031-585-8645 | 경양식 | 돈까스 | 12432 | 경기도 가평군 조종면 운악리 486-13번지 | 경기도 가평군 조종면 와곡길 3-16 | 37.866032 | 127.349181
1 | 2021 | 하반기 | 가평군 | 종점가든 | 031-584-0716 | 한식 | 잣칼국수 | 12473 | 경기도 가평군 설악면 가일리 249-3번지 | 경기도 가평군 설악면 유명산길 76 | 37.59532 | 127.490311
2 | 2021 | 하반기 | 가평군 | 가평(서울방향)휴게소 한식당 | 031-584-1425 | 한식 | 황태해장국 | 12462 | 경기도 가평군 설악면 미사리 145-3번지 | 경기도 가평군 설악면 미사리로 544 | 37.703079 | 127.544991
3 | 2021 | 하반기 | 가평군 | 가평(춘천방향)휴게소 한식당 | 031-584-1426 | 한식 | 황태해장국 | 12462 | 경기도 가평군 설악면 미사리 149-4번지 | 경기도 가평군 설악면 미사리로540번길 51 | 37.701586 | 127.5465
4 | 2021 | 하반기 | 가평군 | 가평축협 한우명가 | 031-581-1592 | 한식 | 등심 | 12422 | 경기도 가평군 가평읍 달전리 382-1번지 | 경기도 가평군 가평읍 달전로 19 | 37.815844 | 127.516128
5 | 2021 | 하반기 | 가평군 | 가평축협 한우명가(설악지점) | 031-585-4200 | 한식 | 등심 | 12465 | 경기도 가평군 설악면 신천리 121-9번지 | 경기도 가평군 설악면 한서로 3 | 37.676616 | 127.494141
6 | 2021 | 하반기 | 가평군 | 늘 봄 | 031-582-1441 | 한식 | 양념돼지갈비 | 12413 | 경기도 가평군 가평읍 읍내리 680-10번지 | 경기도 가평군 가평읍 가화로 173-16 | 37.834764 | 127.51
7 | 2021 | 하반기 | 가평군 | 두메산골식당 | 031-584-9380 | 한식 | 불고기 | 12446 | 경기도 가평군 상면 덕현리 402-10번지 | 경기도 가평군 상면 청군로 430 | 37.759855 | 127.395542
8 | 2021 | 하반기 | 가평군 | 들풀 | 031-585-4322 | 한식 | 청국장 | 12464 | 경기도 가평군 설악면 창의리 420-6번지 | 경기도 가평군 설악면 한서로124번길 16-12 | 37.671055 | 127.503194
9 | 2021 | 하반기 | 가평군 | 미락무교동낚지 | 031-582-7644 | 한식 | 낚지볶음 | 12424 | 경기도 가평군 가평읍 하색리 63-1번지 | 경기도 가평군 가평읍 경춘로 2055 | 37.815348 | 127.501577

Last rows

Index | 집계년도 | 분기 | 시군명 | 업소명 | 전화번호 | 업태명 | 주메뉴명 | 소재지우편번호 | 소재지지번주소 | 소재지도로명주소 | WGS84위도 | WGS84경도
2232 | 2021 | 하반기 | 화성시 | 영천두툼한숯불갈비 | 031-224-4421 | 한식 | 돼지양념갈비숯불구이 | 18404 | 경기도 화성시 진안동 882-5번지 | 경기도 화성시 병점로81번길 17 | 37.21286 | 127.042031
2233 | 2021 | 하반기 | 화성시 | 오엔푸드한국인의밥상 | 031-222-3700 | 한식 | 한정식 | 18345 | 경기도 화성시 안녕동 186-71번지 | 경기도 화성시 효행로481번길 26 | 37.207324 | 126.98906
2234 | 2021 | 하반기 | 화성시 | 와우리장작구이(화성점) | 031-223-3382 | 기타 | 오리, 삼겹살 장작구이 | 18324 | 경기도 화성시 안녕동 180-401번지 | 경기도 화성시 세자로 445 | 37.204746 | 126.983789
2235 | 2021 | 하반기 | 화성시 | 왕골남서문곰탕 | 031-366-5516 | 한식 | 곰탕 | 18516 | 경기도 화성시 정남면 보통리 12-26번지 | 경기도 화성시 정남면 세자로 330-3 | 37.195528 | 126.981561
2236 | 2021 | 하반기 | 화성시 | 이서방 왕족발보쌈 | 031-613-8885 | 한식 | 마늘보쌈 | 18434 | 경기도 화성시 반송동 23-3번지 | 경기도 화성시 동탄반송3길 36-10 | 37.209245 | 127.066193
2237 | 2021 | 하반기 | 화성시 | 이야기가있는아라 | 031-227-2269 | 경양식 | 생선회 | 18303 | 경기도 화성시 봉담읍 동화리 421-8번지 | 경기도 화성시 봉담읍 동화새터길 8 | 37.222713 | 126.952878
2238 | 2021 | 하반기 | 화성시 | 일식담 | 031-372-0852 | 일식 | 생선회, 탕 | 18454 | 경기도 화성시 반송동 92-5번지 | 경기도 화성시 노작로 165 | 37.202718 | 127.073137
2239 | 2021 | 하반기 | 화성시 | 장쟁이쌈선생 | 031-239-1530 | 한식 | 쌈밥 | 18327 | 경기도 화성시 안녕동 180-128번지 | 경기도 화성시 세자로441번길 3 | 37.204133 | 126.983034
2240 | 2021 | 하반기 | 화성시 | 제암종가집가든 | 031-354-5020 | 한식 | 갈비 | 18595 | 경기도 화성시 향남읍 제암리 449-2번지 | 경기도 화성시 향남읍 제암고주로 9 | 37.122223 | 126.891765
2241 | 2021 | 하반기 | 화성시 | 권가네갈비 | 031-374-9233 | 식육(숯불구이) | 등심, 갈비 | 18510 | 경기도 화성시 장지동 512-2번지 | 경기도 화성시 장지남길3번길 6-2 | 37.153702 | 127.117467