Overview

Dataset statistics

Number of variables15
Number of observations258
Missing cells793
Missing cells (%)20.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.4 KiB
Average record size in memory124.5 B

Variable types

Categorical4
Text4
DateTime2
Numeric3
Boolean1
Unsupported1

Dataset

Description휴게음식점(떡카페) 현황
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=F7V0HSTZZ1D1HBG91ZGH14613893&infSeq=1

Alerts

위생업태명 has constant value ""Constant
위생업종명 is highly overall correlated with 년도 and 5 other fieldsHigh correlation
영업상태명 is highly overall correlated with 위생업종명High correlation
다중이용업소여부 is highly overall correlated with 위생업종명High correlation
시군명 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
년도 is highly overall correlated with 위생업종명High correlation
WGS84위도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
다중이용업소여부 is highly imbalanced (89.9%)Imbalance
폐업일자 has 168 (65.1%) missing valuesMissing
년도 has 182 (70.5%) missing valuesMissing
다중이용업소여부 has 182 (70.5%) missing valuesMissing
총시설규모(㎡) has 258 (100.0%) missing valuesMissing
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-10 20:53:49.789736
Analysis finished2024-05-10 20:53:58.720750
Duration8.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)10.9%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
부천시
26 
화성시
24 
수원시
21 
평택시
18 
남양주시
18 
Other values (23)
151 

Length

Max length4
Median length3
Mean length3.124031
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row고양시
2nd row고양시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
부천시 26
 
10.1%
화성시 24
 
9.3%
수원시 21
 
8.1%
평택시 18
 
7.0%
남양주시 18
 
7.0%
성남시 18
 
7.0%
안산시 14
 
5.4%
김포시 14
 
5.4%
용인시 13
 
5.0%
고양시 13
 
5.0%
Other values (18) 79
30.6%

Length

2024-05-10T20:53:58.913299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부천시 26
 
10.1%
화성시 24
 
9.3%
수원시 21
 
8.1%
평택시 18
 
7.0%
남양주시 18
 
7.0%
성남시 18
 
7.0%
안산시 14
 
5.4%
김포시 14
 
5.4%
용인시 13
 
5.0%
고양시 13
 
5.0%
Other values (18) 79
30.6%
Distinct242
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-05-10T20:53:59.258837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length7.1434109
Min length2

Characters and Unicode

Total characters1843
Distinct characters330
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique232 ?
Unique (%)89.9%

Sample

1st row메고지고 떡창고
2nd row푸른열매달
3rd row메고지고 원흥점
4th row카민스
5th row늘품
ValueCountFrequency (%)
메고지고 13
 
3.7%
떡창고 10
 
2.8%
떡탐 6
 
1.7%
5
 
1.4%
메고지고떡창고 4
 
1.1%
원조공주떡집 4
 
1.1%
잔기지떡 4
 
1.1%
떡탐(이마트 3
 
0.9%
cake 3
 
0.9%
모찌이야기 3
 
0.9%
Other values (276) 296
84.3%
2024-05-10T20:54:00.173681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
118
 
6.4%
93
 
5.0%
72
 
3.9%
63
 
3.4%
61
 
3.3%
42
 
2.3%
33
 
1.8%
28
 
1.5%
26
 
1.4%
) 24
 
1.3%
Other values (320) 1283
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1579
85.7%
Space Separator 93
 
5.0%
Lowercase Letter 80
 
4.3%
Uppercase Letter 31
 
1.7%
Close Punctuation 24
 
1.3%
Open Punctuation 24
 
1.3%
Other Punctuation 8
 
0.4%
Decimal Number 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
118
 
7.5%
72
 
4.6%
63
 
4.0%
61
 
3.9%
42
 
2.7%
33
 
2.1%
28
 
1.8%
26
 
1.6%
22
 
1.4%
21
 
1.3%
Other values (275) 1093
69.2%
Lowercase Letter
ValueCountFrequency (%)
e 11
13.8%
a 9
11.2%
o 8
 
10.0%
i 6
 
7.5%
r 5
 
6.2%
l 4
 
5.0%
n 4
 
5.0%
c 4
 
5.0%
s 4
 
5.0%
t 4
 
5.0%
Other values (9) 21
26.2%
Uppercase Letter
ValueCountFrequency (%)
C 7
22.6%
E 5
16.1%
S 2
 
6.5%
L 2
 
6.5%
F 2
 
6.5%
O 2
 
6.5%
A 2
 
6.5%
K 2
 
6.5%
R 1
 
3.2%
T 1
 
3.2%
Other values (5) 5
16.1%
Other Punctuation
ValueCountFrequency (%)
' 3
37.5%
& 2
25.0%
, 2
25.0%
. 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
1 1
25.0%
5 1
25.0%
3 1
25.0%
Space Separator
ValueCountFrequency (%)
93
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1578
85.6%
Common 153
 
8.3%
Latin 111
 
6.0%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
118
 
7.5%
72
 
4.6%
63
 
4.0%
61
 
3.9%
42
 
2.7%
33
 
2.1%
28
 
1.8%
26
 
1.6%
22
 
1.4%
21
 
1.3%
Other values (274) 1092
69.2%
Latin
ValueCountFrequency (%)
e 11
 
9.9%
a 9
 
8.1%
o 8
 
7.2%
C 7
 
6.3%
i 6
 
5.4%
r 5
 
4.5%
E 5
 
4.5%
l 4
 
3.6%
n 4
 
3.6%
c 4
 
3.6%
Other values (24) 48
43.2%
Common
ValueCountFrequency (%)
93
60.8%
) 24
 
15.7%
( 24
 
15.7%
' 3
 
2.0%
& 2
 
1.3%
, 2
 
1.3%
2 1
 
0.7%
. 1
 
0.7%
1 1
 
0.7%
5 1
 
0.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1578
85.6%
ASCII 264
 
14.3%
CJK 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
118
 
7.5%
72
 
4.6%
63
 
4.0%
61
 
3.9%
42
 
2.7%
33
 
2.1%
28
 
1.8%
26
 
1.6%
22
 
1.4%
21
 
1.3%
Other values (274) 1092
69.2%
ASCII
ValueCountFrequency (%)
93
35.2%
) 24
 
9.1%
( 24
 
9.1%
e 11
 
4.2%
a 9
 
3.4%
o 8
 
3.0%
C 7
 
2.7%
i 6
 
2.3%
r 5
 
1.9%
E 5
 
1.9%
Other values (35) 72
27.3%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct231
Distinct (%)89.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2007-08-10 00:00:00
Maximum2024-04-30 00:00:00
2024-05-10T20:54:00.637525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:54:01.061276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
영업
115 
폐업
67 
운영중
53 
폐업 등
23 

Length

Max length4
Median length2
Mean length2.3837209
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row운영중
4th row운영중
5th row운영중

Common Values

ValueCountFrequency (%)
영업 115
44.6%
폐업 67
26.0%
운영중 53
20.5%
폐업 등 23
 
8.9%

Length

2024-05-10T20:54:01.490633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-10T20:54:01.848828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 115
40.9%
폐업 90
32.0%
운영중 53
18.9%
23
 
8.2%

폐업일자
Date

MISSING 

Distinct78
Distinct (%)86.7%
Missing168
Missing (%)65.1%
Memory size2.1 KiB
Minimum2015-09-22 00:00:00
Maximum2024-04-28 00:00:00
2024-05-10T20:54:02.332944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:54:02.798239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

년도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct7
Distinct (%)9.2%
Missing182
Missing (%)70.5%
Infinite0
Infinite (%)0.0%
Mean2016.3947
Minimum2007
Maximum2018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-05-10T20:54:03.163010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2007
5-th percentile2015
Q12016
median2016
Q32018
95-th percentile2018
Maximum2018
Range11
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.781976
Coefficient of variation (CV)0.00088374365
Kurtosis12.770198
Mean2016.3947
Median Absolute Deviation (MAD)1
Skewness-2.9153805
Sum153246
Variance3.1754386
MonotonicityNot monotonic
2024-05-10T20:54:03.529388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2016 28
 
10.9%
2018 22
 
8.5%
2017 14
 
5.4%
2015 9
 
3.5%
2013 1
 
0.4%
2007 1
 
0.4%
2009 1
 
0.4%
(Missing) 182
70.5%
ValueCountFrequency (%)
2007 1
 
0.4%
2009 1
 
0.4%
2013 1
 
0.4%
2015 9
 
3.5%
2016 28
10.9%
2017 14
5.4%
2018 22
8.5%
ValueCountFrequency (%)
2018 22
8.5%
2017 14
5.4%
2016 28
10.9%
2015 9
 
3.5%
2013 1
 
0.4%
2009 1
 
0.4%
2007 1
 
0.4%

다중이용업소여부
Boolean

HIGH CORRELATION  IMBALANCE  MISSING 

Distinct2
Distinct (%)2.6%
Missing182
Missing (%)70.5%
Memory size648.0 B
False
75 
True
 
1
(Missing)
182 
ValueCountFrequency (%)
False 75
29.1%
True 1
 
0.4%
(Missing) 182
70.5%
2024-05-10T20:54:03.890494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing258
Missing (%)100.0%
Memory size2.4 KiB

위생업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
<NA>
182 
휴게음식점
76 

Length

Max length5
Median length4
Mean length4.2945736
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row휴게음식점
4th row휴게음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
<NA> 182
70.5%
휴게음식점 76
29.5%

Length

2024-05-10T20:54:04.186901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-10T20:54:04.522970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 182
70.5%
휴게음식점 76
29.5%

위생업태명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
떡카페
258 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row떡카페
2nd row떡카페
3rd row떡카페
4th row떡카페
5th row떡카페

Common Values

ValueCountFrequency (%)
떡카페 258
100.0%

Length

2024-05-10T20:54:04.960966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-10T20:54:05.286280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
떡카페 258
100.0%
Distinct251
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-05-10T20:54:05.917149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length50
Mean length38.100775
Min length20

Characters and Unicode

Total characters9830
Distinct characters340
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique244 ?
Unique (%)94.6%

Sample

1st row경기도 고양시 일산동구 위시티2로11번길 11, 근생제2동 116호 (식사동, 위시티 휴먼빌)
2nd row경기도 고양시 덕양구 호국로 836, 1층 일부호 (성사동)
3rd row경기도 고양시 덕양구 권율대로 668, 티오피클래식 C동 1층 137호 (원흥동)
4th row경기도 고양시 일산서구 대화로 136, 대화동 대성프라자 127(일부)호 (대화동)
5th row경기도 고양시 일산서구 일산로695번길 12, 1층일부호 (대화동)
ValueCountFrequency (%)
경기도 258
 
12.5%
1층 134
 
6.5%
일부호 44
 
2.1%
부천시 26
 
1.3%
화성시 24
 
1.2%
일부 23
 
1.1%
수원시 21
 
1.0%
지하1층 20
 
1.0%
평택시 19
 
0.9%
남양주시 18
 
0.9%
Other values (862) 1473
71.5%
2024-05-10T20:54:07.211634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1803
 
18.3%
1 549
 
5.6%
335
 
3.4%
, 298
 
3.0%
287
 
2.9%
274
 
2.8%
271
 
2.8%
267
 
2.7%
) 249
 
2.5%
( 249
 
2.5%
Other values (330) 5248
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5586
56.8%
Space Separator 1803
 
18.3%
Decimal Number 1540
 
15.7%
Other Punctuation 298
 
3.0%
Close Punctuation 249
 
2.5%
Open Punctuation 249
 
2.5%
Dash Punctuation 55
 
0.6%
Uppercase Letter 47
 
0.5%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
335
 
6.0%
287
 
5.1%
274
 
4.9%
271
 
4.9%
267
 
4.8%
243
 
4.4%
203
 
3.6%
196
 
3.5%
140
 
2.5%
122
 
2.2%
Other values (300) 3248
58.1%
Uppercase Letter
ValueCountFrequency (%)
A 9
19.1%
B 8
17.0%
T 6
12.8%
E 4
8.5%
C 4
8.5%
L 3
 
6.4%
K 3
 
6.4%
H 2
 
4.3%
D 2
 
4.3%
I 2
 
4.3%
Other values (3) 4
8.5%
Decimal Number
ValueCountFrequency (%)
1 549
35.6%
2 232
15.1%
0 179
 
11.6%
3 129
 
8.4%
4 86
 
5.6%
6 82
 
5.3%
5 79
 
5.1%
7 78
 
5.1%
8 70
 
4.5%
9 56
 
3.6%
Space Separator
ValueCountFrequency (%)
1803
100.0%
Other Punctuation
ValueCountFrequency (%)
, 298
100.0%
Close Punctuation
ValueCountFrequency (%)
) 249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 249
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5586
56.8%
Common 4195
42.7%
Latin 49
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
335
 
6.0%
287
 
5.1%
274
 
4.9%
271
 
4.9%
267
 
4.8%
243
 
4.4%
203
 
3.6%
196
 
3.5%
140
 
2.5%
122
 
2.2%
Other values (300) 3248
58.1%
Common
ValueCountFrequency (%)
1803
43.0%
1 549
 
13.1%
, 298
 
7.1%
) 249
 
5.9%
( 249
 
5.9%
2 232
 
5.5%
0 179
 
4.3%
3 129
 
3.1%
4 86
 
2.1%
6 82
 
2.0%
Other values (6) 339
 
8.1%
Latin
ValueCountFrequency (%)
A 9
18.4%
B 8
16.3%
T 6
12.2%
E 4
8.2%
C 4
8.2%
L 3
 
6.1%
K 3
 
6.1%
H 2
 
4.1%
D 2
 
4.1%
I 2
 
4.1%
Other values (4) 6
12.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5586
56.8%
ASCII 4244
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1803
42.5%
1 549
 
12.9%
, 298
 
7.0%
) 249
 
5.9%
( 249
 
5.9%
2 232
 
5.5%
0 179
 
4.2%
3 129
 
3.0%
4 86
 
2.0%
6 82
 
1.9%
Other values (20) 388
 
9.1%
Hangul
ValueCountFrequency (%)
335
 
6.0%
287
 
5.1%
274
 
4.9%
271
 
4.9%
267
 
4.8%
243
 
4.4%
203
 
3.6%
196
 
3.5%
140
 
2.5%
122
 
2.2%
Other values (300) 3248
58.1%
Distinct256
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-05-10T20:54:07.851636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length39
Mean length28.887597
Min length16

Characters and Unicode

Total characters7453
Distinct characters305
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)98.4%

Sample

1st row경기도 고양시 일산동구 식사동 1533 위시티 휴먼빌 근생제2동 116호
2nd row경기도 고양시 덕양구 성사동 519-38
3rd row경기도 고양시 덕양구 원흥동 628번지 티오피클래식 C동 1층 137호
4th row경기도 고양시 일산서구 대화동 867-2번지 대화동 대성프라자 127호일부
5th row경기도 고양시 일산서구 대화동 2133-11번지
ValueCountFrequency (%)
경기도 258
 
16.0%
1층 62
 
3.9%
부천시 26
 
1.6%
일부 25
 
1.6%
화성시 24
 
1.5%
일부호 23
 
1.4%
수원시 21
 
1.3%
남양주시 18
 
1.1%
성남시 18
 
1.1%
평택시 18
 
1.1%
Other values (687) 1115
69.3%
2024-05-10T20:54:09.030277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1504
20.2%
1 416
 
5.6%
297
 
4.0%
277
 
3.7%
267
 
3.6%
264
 
3.5%
262
 
3.5%
- 177
 
2.4%
2 157
 
2.1%
0 128
 
1.7%
Other values (295) 3704
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4309
57.8%
Space Separator 1504
 
20.2%
Decimal Number 1400
 
18.8%
Dash Punctuation 177
 
2.4%
Uppercase Letter 30
 
0.4%
Open Punctuation 12
 
0.2%
Close Punctuation 12
 
0.2%
Other Punctuation 6
 
0.1%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
297
 
6.9%
277
 
6.4%
267
 
6.2%
264
 
6.1%
262
 
6.1%
123
 
2.9%
118
 
2.7%
113
 
2.6%
112
 
2.6%
96
 
2.2%
Other values (266) 2380
55.2%
Uppercase Letter
ValueCountFrequency (%)
A 4
13.3%
T 4
13.3%
E 4
13.3%
B 3
10.0%
L 3
10.0%
C 3
10.0%
K 2
6.7%
I 2
6.7%
O 2
6.7%
H 1
 
3.3%
Other values (2) 2
6.7%
Decimal Number
ValueCountFrequency (%)
1 416
29.7%
2 157
 
11.2%
0 128
 
9.1%
4 121
 
8.6%
3 114
 
8.1%
6 103
 
7.4%
7 97
 
6.9%
5 93
 
6.6%
8 92
 
6.6%
9 79
 
5.6%
Space Separator
ValueCountFrequency (%)
1504
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 177
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4309
57.8%
Common 3112
41.8%
Latin 32
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
297
 
6.9%
277
 
6.4%
267
 
6.2%
264
 
6.1%
262
 
6.1%
123
 
2.9%
118
 
2.7%
113
 
2.6%
112
 
2.6%
96
 
2.2%
Other values (266) 2380
55.2%
Common
ValueCountFrequency (%)
1504
48.3%
1 416
 
13.4%
- 177
 
5.7%
2 157
 
5.0%
0 128
 
4.1%
4 121
 
3.9%
3 114
 
3.7%
6 103
 
3.3%
7 97
 
3.1%
5 93
 
3.0%
Other values (6) 202
 
6.5%
Latin
ValueCountFrequency (%)
A 4
12.5%
T 4
12.5%
E 4
12.5%
B 3
9.4%
L 3
9.4%
C 3
9.4%
K 2
6.2%
I 2
6.2%
O 2
6.2%
e 2
6.2%
Other values (3) 3
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4309
57.8%
ASCII 3144
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1504
47.8%
1 416
 
13.2%
- 177
 
5.6%
2 157
 
5.0%
0 128
 
4.1%
4 121
 
3.8%
3 114
 
3.6%
6 103
 
3.3%
7 97
 
3.1%
5 93
 
3.0%
Other values (19) 234
 
7.4%
Hangul
ValueCountFrequency (%)
297
 
6.9%
277
 
6.4%
267
 
6.2%
264
 
6.1%
262
 
6.1%
123
 
2.9%
118
 
2.7%
113
 
2.6%
112
 
2.6%
96
 
2.2%
Other values (266) 2380
55.2%
Distinct225
Distinct (%)87.5%
Missing1
Missing (%)0.4%
Memory size2.1 KiB
2024-05-10T20:54:09.942423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.4046693
Min length5

Characters and Unicode

Total characters1389
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)77.8%

Sample

1st row10323
2nd row412-807
3rd row412040
4th row411802
5th row411803
ValueCountFrequency (%)
14546 5
 
1.9%
443270 3
 
1.2%
415080 3
 
1.2%
445160 3
 
1.2%
14637 3
 
1.2%
18004 2
 
0.8%
10068 2
 
0.8%
10111 2
 
0.8%
14547 2
 
0.8%
12097 2
 
0.8%
Other values (215) 230
89.5%
2024-05-10T20:54:11.214791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 292
21.0%
4 218
15.7%
0 147
10.6%
8 128
9.2%
6 112
 
8.1%
5 111
 
8.0%
2 100
 
7.2%
7 97
 
7.0%
3 96
 
6.9%
9 72
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1373
98.8%
Dash Punctuation 16
 
1.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 292
21.3%
4 218
15.9%
0 147
10.7%
8 128
9.3%
6 112
 
8.2%
5 111
 
8.1%
2 100
 
7.3%
7 97
 
7.1%
3 96
 
7.0%
9 72
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1389
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 292
21.0%
4 218
15.7%
0 147
10.6%
8 128
9.2%
6 112
 
8.1%
5 111
 
8.0%
2 100
 
7.2%
7 97
 
7.0%
3 96
 
6.9%
9 72
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1389
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 292
21.0%
4 218
15.7%
0 147
10.6%
8 128
9.2%
6 112
 
8.1%
5 111
 
8.0%
2 100
 
7.2%
7 97
 
7.0%
3 96
 
6.9%
9 72
 
5.2%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct233
Distinct (%)90.7%
Missing1
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean37.415293
Minimum36.978016
Maximum38.028699
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-05-10T20:54:11.658417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.978016
5-th percentile37.014531
Q137.28064
median37.393641
Q337.621114
95-th percentile37.745479
Maximum38.028699
Range1.0506832
Interquartile range (IQR)0.34047377

Descriptive statistics

Standard deviation0.21822969
Coefficient of variation (CV)0.0058326333
Kurtosis-0.55342221
Mean37.415293
Median Absolute Deviation (MAD)0.14941784
Skewness0.042885967
Sum9615.7304
Variance0.047624197
MonotonicityNot monotonic
2024-05-10T20:54:12.118104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5043171668 5
 
1.9%
37.4840373391 3
 
1.2%
37.2826692976 2
 
0.8%
37.7442353403 2
 
0.8%
37.5040675095 2
 
0.8%
37.500072002 2
 
0.8%
37.4790162948 2
 
0.8%
37.5025517408 2
 
0.8%
37.2844317618 2
 
0.8%
37.754599675 2
 
0.8%
Other values (223) 233
90.3%
ValueCountFrequency (%)
36.9780159068 1
0.4%
36.9796898655 1
0.4%
36.9855507312 1
0.4%
36.9945235041 2
0.8%
36.9947076171 1
0.4%
36.9968025274 1
0.4%
36.9993858937 1
0.4%
37.0019012295 2
0.8%
37.002609684 1
0.4%
37.0099562684 1
0.4%
ValueCountFrequency (%)
38.0286990957 1
0.4%
37.8913934011 1
0.4%
37.8904045853 1
0.4%
37.8826801368 1
0.4%
37.8172434759 1
0.4%
37.8160340242 1
0.4%
37.7936598761 1
0.4%
37.7917991052 1
0.4%
37.790887378 1
0.4%
37.7810498855 1
0.4%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct233
Distinct (%)90.7%
Missing1
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean127.00473
Minimum126.6239
Maximum127.6534
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-05-10T20:54:12.792111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.6239
5-th percentile126.72389
Q1126.82935
median127.05118
Q3127.11441
95-th percentile127.2695
Maximum127.6534
Range1.0294998
Interquartile range (IQR)0.28505987

Descriptive statistics

Standard deviation0.1933029
Coefficient of variation (CV)0.0015220134
Kurtosis0.2770692
Mean127.00473
Median Absolute Deviation (MAD)0.10960133
Skewness0.31671107
Sum32640.215
Variance0.03736601
MonotonicityNot monotonic
2024-05-10T20:54:13.195444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.7620745903 5
 
1.9%
126.7826449277 3
 
1.2%
126.9703199061 2
 
0.8%
127.0929016943 2
 
0.8%
126.7639551478 2
 
0.8%
126.7752058095 2
 
0.8%
126.8036505413 2
 
0.8%
126.7753741701 2
 
0.8%
127.4550559819 2
 
0.8%
126.7813346334 2
 
0.8%
Other values (223) 233
90.3%
ValueCountFrequency (%)
126.6239008954 1
0.4%
126.6246417693 1
0.4%
126.6261996286 1
0.4%
126.6282980439 1
0.4%
126.6454578129 1
0.4%
126.6472242776 1
0.4%
126.6610641872 1
0.4%
126.6766676379 1
0.4%
126.6774205372 1
0.4%
126.6847967272 1
0.4%
ValueCountFrequency (%)
127.6534006536 1
0.4%
127.6373729111 1
0.4%
127.5015837002 1
0.4%
127.4915838373 1
0.4%
127.4674290634 1
0.4%
127.4550559819 2
0.8%
127.445649637 1
0.4%
127.4420084844 1
0.4%
127.4418828082 1
0.4%
127.4350452541 1
0.4%

Interactions

2024-05-10T20:53:56.167110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:54.493751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:55.301663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:56.482827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:54.756508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:55.579311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:56.789401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:55.026065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-10T20:53:55.902824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-10T20:54:13.445629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명폐업일자년도다중이용업소여부WGS84위도WGS84경도
시군명1.0000.2020.9250.0840.1300.9770.945
영업상태명0.2021.0001.0000.4190.0000.0000.129
폐업일자0.9251.0001.0001.000NaN0.9430.000
년도0.0840.4191.0001.0000.0000.0000.343
다중이용업소여부0.1300.000NaN0.0001.0000.0000.494
WGS84위도0.9770.0000.9430.0000.0001.0000.666
WGS84경도0.9450.1290.0000.3430.4940.6661.000
2024-05-10T20:54:13.754870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위생업종명영업상태명다중이용업소여부시군명
위생업종명1.0001.0001.0001.000
영업상태명1.0001.0000.0000.090
다중이용업소여부1.0000.0001.0000.067
시군명1.0000.0900.0671.000
2024-05-10T20:54:14.036810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도WGS84위도WGS84경도시군명영업상태명다중이용업소여부위생업종명
년도1.0000.270-0.0860.3680.4900.0001.000
WGS84위도0.2701.000-0.2460.8250.0000.0001.000
WGS84경도-0.086-0.2461.0000.7010.0750.4701.000
시군명0.3680.8250.7011.0000.0900.0671.000
영업상태명0.4900.0000.0750.0901.0000.0001.000
다중이용업소여부0.0000.0000.4700.0670.0001.0001.000
위생업종명1.0001.0001.0001.0001.0001.0001.000

Missing values

2024-05-10T20:53:57.227040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-10T20:53:57.911932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-10T20:53:58.488058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자년도다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
0고양시메고지고 떡창고20220524영업<NA><NA><NA><NA><NA>떡카페경기도 고양시 일산동구 위시티2로11번길 11, 근생제2동 116호 (식사동, 위시티 휴먼빌)경기도 고양시 일산동구 식사동 1533 위시티 휴먼빌 근생제2동 116호1032337.677467126.811836
1고양시푸른열매달2022-02-16영업<NA><NA><NA><NA><NA>떡카페경기도 고양시 덕양구 호국로 836, 1층 일부호 (성사동)경기도 고양시 덕양구 성사동 519-38412-80737.659609126.839834
2고양시메고지고 원흥점20180709운영중<NA>2018N<NA>휴게음식점떡카페경기도 고양시 덕양구 권율대로 668, 티오피클래식 C동 1층 137호 (원흥동)경기도 고양시 덕양구 원흥동 628번지 티오피클래식 C동 1층 137호41204037.649448126.873898
3고양시카민스20180124운영중<NA>2018N<NA>휴게음식점떡카페경기도 고양시 일산서구 대화로 136, 대화동 대성프라자 127(일부)호 (대화동)경기도 고양시 일산서구 대화동 867-2번지 대화동 대성프라자 127호일부41180237.670596126.735292
4고양시늘품20180402운영중<NA>2018N<NA>휴게음식점떡카페경기도 고양시 일산서구 일산로695번길 12, 1층일부호 (대화동)경기도 고양시 일산서구 대화동 2133-11번지41180337.680484126.753755
5고양시디저트판다20180502운영중<NA>2018N<NA>휴게음식점떡카페경기도 고양시 일산서구 원일로 32, 일산허스상가 가동 103호 (일산동)경기도 고양시 일산서구 일산동 960-22번지 일산허스상가 가동 103호41185737.687059126.765882
6고양시원조공주떡집2024-03-19폐업2024-03-28<NA><NA><NA><NA>떡카페경기도 고양시 일산서구 호수로 817, 현대백화점 킨텍스점 지하1층 일부호 (대화동)경기도 고양시 일산서구 대화동 2602 현대백화점 킨텍스점 지하1층 일부호411-76637.667979126.751624
7고양시설봄20200625폐업20210621<NA><NA><NA><NA>떡카페경기도 고양시 일산동구 중앙로 1205, 차움라이프센터 102(일부)호 (장항동)경기도 고양시 일산동구 장항동 889 차움라이프센터 102호 일부(107호)1041437.654123126.77574
8고양시우긋2021-11-18폐업2023-04-28<NA><NA><NA><NA>떡카페경기도 고양시 일산서구 일산로635번길 32-1, 1층 전체호 (대화동)경기도 고양시 일산서구 대화동 2110-10 1층 전체호1037037.680336126.757397
9고양시예당떡방2020-03-06폐업2023-08-07<NA><NA><NA><NA>떡카페경기도 고양시 일산서구 호수로 817, 현대백화점킨텍스점 지하1층 일부호 (대화동)경기도 고양시 일산서구 대화동 2602 현대백화점킨텍스점 지하1층 일부호1039137.667979126.751624
시군명사업장명인허가일자영업상태명폐업일자년도다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
248화성시지나케이크2018-12-04폐업2024-01-29<NA><NA><NA><NA>떡카페경기도 화성시 동탄하나2길 24-1, 1층 일부호 (능동)경기도 화성시 능동 1071-8 1층 일부호1842337.214995127.061376
249화성시미담잔기지떡20201116폐업20211029<NA><NA><NA><NA>떡카페경기도 화성시 동탄대로14길 5-34, 1층 (오산동)경기도 화성시 오산동 1033-1 1층1848637.184972127.103145
250화성시떡탐2021-03-10폐업2024-01-09<NA><NA><NA><NA>떡카페경기도 화성시 동탄중앙로 376, 화성 동탄 이마트 1층 일부호 (석우동)경기도 화성시 석우동 44 화성 동탄 이마트 1층 일부호1845137.21444127.079683
251화성시궁 잔기지떡 화성시청점20200422폐업20211020<NA><NA><NA><NA>떡카페경기도 화성시 남양읍 역골로 82-1, 1동 1층경기도 화성시 남양읍 남양리 2274-11826737.206249126.823463
252화성시떡의작품 봉담2지구점2023-01-06폐업2023-04-27<NA><NA><NA><NA>떡카페경기도 화성시 봉담읍 상리중심상가길 6-4, ST프라자 1동 1층 111호경기도 화성시 봉담읍 상리 690-51830937.224366126.946392
253화성시모찌이야기20151002폐업 등201802132015N<NA>휴게음식점떡카페경기도 화성시 지산1길 9, 101호 (영천동)경기도 화성시 영천동 산 27-50번지 101호44513037.207204127.112033
254화성시이종순전통음식'떡해랑'20160621폐업 등201705242016N<NA>휴게음식점떡카페경기도 화성시 여울로2길 25, 1층 일부호 (능동)경기도 화성시 능동 1141-4번지 1층 일부호44532037.204046127.052537
255화성시아뜰리에드앨리스20150629폐업 등201605122015N<NA>휴게음식점떡카페경기도 화성시 노작로4길 22-32 (반송동)경기도 화성시 반송동 112-1번지 101호44516037.199073127.077672
256화성시효나리떡&카페20160328폐업 등201712292016N<NA>휴게음식점떡카페경기도 화성시 향남읍 발안남로55번길 19, 1층 102호경기도 화성시 향남읍 발안리 127-16번지 1층 102호44592337.130484126.905241
257화성시Rice Cake Gallery 소연20151119폐업 등201807192015N<NA>휴게음식점떡카페경기도 화성시 노작로4길 18-13, 1층 (반송동)경기도 화성시 반송동 113-5번지 1층44516037.197973127.077813