Overview

Dataset statistics

Number of variables: 14
Number of observations: 8296
Missing cells: 21279
Missing cells (%): 18.3%
Duplicate rows: 3
Duplicate rows (%): < 0.1%
Total size in memory: 948.0 KiB
Average record size in memory: 117.0 B
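
The percentage and per-record figures follow from the raw counts. A minimal sketch of that arithmetic, using only the numbers listed above (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 14
n_observations = 8296
missing_cells = 21279
total_size_kib = 948.0

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations

# Share of cells that are missing, in percent.
missing_pct = 100 * missing_cells / total_cells

# Average memory per record, converting KiB to bytes first.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values the report shows.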

Variable types

Categorical: 4
Text: 4
DateTime: 1
Unsupported: 2
Numeric: 3

Dataset

Description: 휴게음식점(패스트푸드) 현황 (Status of light-meal restaurants, fast food)
Author: 행정안전부 (Ministry of the Interior and Safety)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=HA1WV8A6FUGNEFEPC97913459044&infSeq=1

Alerts

위생업태명 has constant value "패스트푸드" [Constant]
Dataset has 3 (< 0.1%) duplicate rows [Duplicates]
위생업종명 is highly overall correlated with 소재지우편번호 and 4 other fields [High correlation]
시군명 is highly overall correlated with 소재지우편번호 and 3 other fields [High correlation]
영업상태명 is highly overall correlated with 위생업종명 [High correlation]
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fields [High correlation]
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fields [High correlation]
WGS84경도 is highly overall correlated with 시군명 and 1 other field [High correlation]
폐업일자 has 3540 (42.7%) missing values [Missing]
다중이용업소여부 has 8296 (100.0%) missing values [Missing]
총시설규모(㎡) has 8296 (100.0%) missing values [Missing]
소재지도로명주소 has 479 (5.8%) missing values [Missing]
소재지우편번호 has 220 (2.7%) missing values [Missing]
WGS84위도 has 224 (2.7%) missing values [Missing]
WGS84경도 has 224 (2.7%) missing values [Missing]
다중이용업소여부 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
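
The Constant and 100% Missing alerts point at columns that carry no information. One possible way to detect such columns before analysis is sketched below. It assumes rows are plain dictionaries and that a missing cell is represented by `None`; the function name `uninformative_columns` and the toy rows are illustrative, not taken from the report.

```python
from typing import Any, Optional


def uninformative_columns(rows: list[dict[str, Optional[Any]]]) -> dict[str, str]:
    """Return {column: reason} for columns that are all-missing or constant.

    Assumption: a missing cell is represented by None.
    """
    reasons: dict[str, str] = {}
    if not rows:
        return reasons
    for column in rows[0]:
        values = [row[column] for row in rows]
        present = [v for v in values if v is not None]
        if not present:
            # No row has a value, as with 다중이용업소여부 in the alerts.
            reasons[column] = "all missing"
        elif len(present) == len(values) and len(set(present)) == 1:
            # Every row has the same value, as with 위생업태명 in the alerts.
            reasons[column] = "constant"
    return reasons


# Toy rows that mimic three of the report's columns.
rows = [
    {"시군명": "가평군", "위생업태명": "패스트푸드", "다중이용업소여부": None},
    {"시군명": "화성시", "위생업태명": "패스트푸드", "다중이용업소여부": None},
    {"시군명": "부천시", "위생업태명": "패스트푸드", "다중이용업소여부": None},
]
flagged = uninformative_columns(rows)
print(flagged)
```

Whether to drop the flagged columns is a judgment call; a constant column can still document what the extract covers.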

Reproduction

Analysis started: 2024-05-10 20:42:46.659420
Analysis finished: 2024-05-10 20:42:54.801190
Duration: 8.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
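
The reported duration can be checked from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2024-05-10 20:42:46.659420")
finished = datetime.fromisoformat("2024-05-10 20:42:54.801190")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```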

Variables

시군명 (city/county name)
Categorical

HIGH CORRELATION

Distinct: 32
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

부천시: 1048
고양시: 908
수원시: 722
성남시: 620
용인시: 556
Other values (27): 4442

Length

Max length: 4
Median length: 3
Mean length: 3.0742527
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
부천시 | 1048 | 12.6%
고양시 | 908 | 10.9%
수원시 | 722 | 8.7%
성남시 | 620 | 7.5%
용인시 | 556 | 6.7%
안산시 | 456 | 5.5%
화성시 | 408 | 4.9%
안양시 | 356 | 4.3%
남양주시 | 330 | 4.0%
평택시 | 318 | 3.8%
Other values (22) | 2574 | 31.0%
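
The "Other values" row is the remainder after the ten most frequent values. A short sketch of that bookkeeping, using the counts from the table above and the 8296 observations from the overview (variable names are illustrative):

```python
n_rows = 8296  # number of observations from the overview

# Ten most frequent 시군명 values, copied from the Common Values table.
top10 = {
    "부천시": 1048, "고양시": 908, "수원시": 722, "성남시": 620, "용인시": 556,
    "안산시": 456, "화성시": 408, "안양시": 356, "남양주시": 330, "평택시": 318,
}

# Everything not in the top ten is grouped into "Other values".
other_count = n_rows - sum(top10.values())
other_pct = round(100 * other_count / n_rows, 1)

# Frequency of a single value, for comparison with the table.
bucheon_pct = round(100 * top10["부천시"] / n_rows, 1)

print(other_count, other_pct, bucheon_pct)
```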

Length

Histogram of lengths of the category

사업장명 (business name)
Text

Distinct: 6323
Distinct (%): 76.2%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

Length

Max length: 32
Median length: 27
Mean length: 8.0273626
Min length: 1

Characters and Unicode

Total characters: 66595
Distinct characters: 917
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5325
Unique (%): 64.2%

Sample

1st row: 캠프통포레스트 바지선
2nd row: 빨간커피통
3rd row: 더달달
4th row: 세븐일레븐 가평군청점
5th row: 롯데리아 가평점

Value | Count | Frequency (%)
롯데리아 | 267 | 2.5%
피자스쿨 | 151 | 1.4%
버거킹 | 107 | 1.0%
이삭토스트 | 105 | 1.0%
맘스터치 | 59 | 0.6%
써브웨이 | 58 | 0.5%
도미노피자 | 57 | 0.5%
피자 | 52 | 0.5%
한국맥도날드(유 | 52 | 0.5%
59쌀피자 | 50 | 0.5%
Other values (6489) | 9709 | 91.0%

Most occurring characters

ValueCountFrequency (%)
3771
 
5.7%
2778
 
4.2%
2687
 
4.0%
2615
 
3.9%
2375
 
3.6%
1346
 
2.0%
) 1319
 
2.0%
( 1301
 
2.0%
1298
 
1.9%
1144
 
1.7%
Other values (907) 45961
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58170
87.3%
Space Separator 2375
 
3.6%
Uppercase Letter 1679
 
2.5%
Close Punctuation 1320
 
2.0%
Open Punctuation 1302
 
2.0%
Decimal Number 824
 
1.2%
Lowercase Letter 729
 
1.1%
Other Punctuation 172
 
0.3%
Dash Punctuation 14
 
< 0.1%
Math Symbol 7
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3771
 
6.5%
2778
 
4.8%
2687
 
4.6%
2615
 
4.5%
1346
 
2.3%
1298
 
2.2%
1144
 
2.0%
813
 
1.4%
775
 
1.3%
756
 
1.3%
Other values (824) 40187
69.1%
Uppercase Letter
ValueCountFrequency (%)
C 216
12.9%
T 178
10.6%
D 160
 
9.5%
S 153
 
9.1%
G 135
 
8.0%
K 122
 
7.3%
F 89
 
5.3%
P 78
 
4.6%
B 62
 
3.7%
A 61
 
3.6%
Other values (16) 425
25.3%
Lowercase Letter
ValueCountFrequency (%)
e 92
12.6%
a 74
 
10.2%
o 58
 
8.0%
i 49
 
6.7%
r 45
 
6.2%
s 44
 
6.0%
n 43
 
5.9%
c 33
 
4.5%
f 32
 
4.4%
z 30
 
4.1%
Other values (15) 229
31.4%
Decimal Number
ValueCountFrequency (%)
5 227
27.5%
2 170
20.6%
9 129
15.7%
1 111
13.5%
0 72
 
8.7%
3 31
 
3.8%
7 28
 
3.4%
6 20
 
2.4%
4 19
 
2.3%
8 17
 
2.1%
Other Punctuation
ValueCountFrequency (%)
& 60
34.9%
. 55
32.0%
/ 19
 
11.0%
, 17
 
9.9%
' 11
 
6.4%
· 4
 
2.3%
? 3
 
1.7%
1
 
0.6%
1
 
0.6%
! 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
+ 5
71.4%
> 1
 
14.3%
< 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 1319
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1301
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2375
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58166
87.3%
Common 6017
 
9.0%
Latin 2408
 
3.6%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3771
 
6.5%
2778
 
4.8%
2687
 
4.6%
2615
 
4.5%
1346
 
2.3%
1298
 
2.2%
1144
 
2.0%
813
 
1.4%
775
 
1.3%
756
 
1.3%
Other values (821) 40183
69.1%
Latin
ValueCountFrequency (%)
C 216
 
9.0%
T 178
 
7.4%
D 160
 
6.6%
S 153
 
6.4%
G 135
 
5.6%
K 122
 
5.1%
e 92
 
3.8%
F 89
 
3.7%
P 78
 
3.2%
a 74
 
3.1%
Other values (41) 1111
46.1%
Common
ValueCountFrequency (%)
2375
39.5%
) 1319
21.9%
( 1301
21.6%
5 227
 
3.8%
2 170
 
2.8%
9 129
 
2.1%
1 111
 
1.8%
0 72
 
1.2%
& 60
 
1.0%
. 55
 
0.9%
Other values (22) 198
 
3.3%
Han
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58166
87.3%
ASCII 8418
 
12.6%
None 6
 
< 0.1%
CJK 4
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3771
 
6.5%
2778
 
4.8%
2687
 
4.6%
2615
 
4.5%
1346
 
2.3%
1298
 
2.2%
1144
 
2.0%
813
 
1.4%
775
 
1.3%
756
 
1.3%
Other values (821) 40183
69.1%
ASCII
ValueCountFrequency (%)
2375
28.2%
) 1319
15.7%
( 1301
15.5%
5 227
 
2.7%
C 216
 
2.6%
T 178
 
2.1%
2 170
 
2.0%
D 160
 
1.9%
S 153
 
1.8%
G 135
 
1.6%
Other values (69) 2184
25.9%
None
ValueCountFrequency (%)
· 4
66.7%
1
 
16.7%
1
 
16.7%
CJK
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

인허가일자 (license date)
DateTime

Distinct: 4333
Distinct (%): 52.2%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB
Minimum: 1970-11-29 00:00:00
Maximum: 2024-04-25 00:00:00
Histogram with fixed size bins (bins=50)

영업상태명 (business status)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

폐업 등 (closed, etc.): 3500
운영중 (operating): 2139
영업 (open): 1401
폐업 (closed): 1256

Length

Max length: 4
Median length: 3
Mean length: 3.1016152
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업
2nd row: 영업
3rd row: 영업
4th row: 영업
5th row: 영업

Common Values

Value | Count | Frequency (%)
폐업 등 | 3500 | 42.2%
운영중 | 2139 | 25.8%
영업 | 1401 | 16.9%
폐업 | 1256 | 15.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
폐업 | 4756 | 40.3%
등 | 3500 | 29.7%
운영중 | 2139 | 18.1%
영업 | 1401 | 11.9%

폐업일자 (closure date)
Text

MISSING

Distinct: 3294
Distinct (%): 69.3%
Missing: 3540
Missing (%): 42.7%
Memory size: 64.9 KiB

Length

Max length: 10
Median length: 8
Mean length: 8.3864592
Min length: 6

Characters and Unicode

Total characters: 39886
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2285
Unique (%): 48.0%

Sample

1st row: 2023-11-23
2nd row: 2023-08-28
3rd row: 2023-11-20
4th row: 20211108
5th row: 20210628
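
The sample rows mix two date layouts (`2023-11-23` and `20211108`), which is probably why this column was profiled as Text rather than DateTime. A minimal sketch of one way to normalise the two layouts is shown below. The function name `parse_closure_date` is illustrative, and only the two layouts visible in the sample are handled; the minimum length of 6 suggests other layouts exist, and those are returned as `None` rather than guessed.

```python
from datetime import date, datetime
from typing import Optional

# Layouts seen in the sample rows, keyed by string length so that a
# shorter value is never silently parsed with the wrong layout.
FORMAT_BY_LENGTH = {10: "%Y-%m-%d", 8: "%Y%m%d"}


def parse_closure_date(raw: Optional[str]) -> Optional[date]:
    """Parse a 폐업일자 string; return None when missing or unrecognised."""
    if raw is None:
        return None
    text = raw.strip()
    fmt = FORMAT_BY_LENGTH.get(len(text))
    if fmt is None:
        return None  # unknown layout, e.g. a 6-character value
    try:
        return datetime.strptime(text, fmt).date()
    except ValueError:
        return None  # right length but not a valid date


print(parse_closure_date("2023-11-23"))
print(parse_closure_date("20211108"))
print(parse_closure_date("201611"))
```

Selecting the format by length rather than trying each format in turn avoids `strptime` accepting a short string such as `201611` under `%Y%m%d`.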
Value | Count | Frequency (%)
20161004 | 17 | 0.4%
20151012 | 12 | 0.3%
20210706 | 9 | 0.2%
20171121 | 7 | 0.1%
20230102 | 7 | 0.1%
20120928 | 6 | 0.1%
20151005 | 6 | 0.1%
20090121 | 6 | 0.1%
20140403 | 6 | 0.1%
20050603 | 6 | 0.1%
Other values (3284) | 4674 | 98.3%

Most occurring characters

ValueCountFrequency (%)
0 12461
31.2%
2 8505
21.3%
1 7015
17.6%
- 1840
 
4.6%
3 1767
 
4.4%
6 1491
 
3.7%
7 1427
 
3.6%
4 1390
 
3.5%
5 1378
 
3.5%
9 1328
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38046
95.4%
Dash Punctuation 1840
 
4.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12461
32.8%
2 8505
22.4%
1 7015
18.4%
3 1767
 
4.6%
6 1491
 
3.9%
7 1427
 
3.8%
4 1390
 
3.7%
5 1378
 
3.6%
9 1328
 
3.5%
8 1284
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 1840
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39886
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12461
31.2%
2 8505
21.3%
1 7015
17.6%
- 1840
 
4.6%
3 1767
 
4.4%
6 1491
 
3.7%
7 1427
 
3.6%
4 1390
 
3.5%
5 1378
 
3.5%
9 1328
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39886
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12461
31.2%
2 8505
21.3%
1 7015
17.6%
- 1840
 
4.6%
3 1767
 
4.4%
6 1491
 
3.7%
7 1427
 
3.6%
4 1390
 
3.5%
5 1378
 
3.5%
9 1328
 
3.3%

다중이용업소여부 (multi-use facility flag)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8296
Missing (%): 100.0%
Memory size: 73.0 KiB

총시설규모(㎡) (total facility size, ㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8296
Missing (%): 100.0%
Memory size: 73.0 KiB

위생업종명 (sanitation business category)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

휴게음식점 (light-meal restaurant): 5639
<NA>: 2657

Length

Max length: 5
Median length: 5
Mean length: 4.6797252
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
휴게음식점 | 5639 | 68.0%
<NA> | 2657 | 32.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
휴게음식점 | 5639 | 68.0%
na | 2657 | 32.0%

위생업태명 (sanitation business type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

패스트푸드 (fast food): 8296

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 패스트푸드
2nd row: 패스트푸드
3rd row: 패스트푸드
4th row: 패스트푸드
5th row: 패스트푸드

Common Values

Value | Count | Frequency (%)
패스트푸드 | 8296 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
패스트푸드 | 8296 | 100.0%

소재지도로명주소 (road-name address)
Text

MISSING

Distinct: 6667
Distinct (%): 85.3%
Missing: 479
Missing (%): 5.8%
Memory size: 64.9 KiB

Length

Max length: 74
Median length: 59
Mean length: 30.691186
Min length: 13

Characters and Unicode

Total characters: 239913
Distinct characters: 632
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5745
Unique (%): 73.5%

Sample

1st row: 경기도 가평군 청평면 경춘로 89-1, 마동 1층
2nd row: 경기도 가평군 조종면 현창로38번길 16, 1층
3rd row: 경기도 가평군 가평읍 석봉로 192, 1층
4th row: 경기도 가평군 가평읍 가화로 106 (외 1필지)
5th row: 경기도 가평군 가평읍 가화로 37, 1층

Value | Count | Frequency (%)
경기도 | 7816 | 15.4%
1층 | 1832 | 3.6%
부천시 | 960 | 1.9%
고양시 | 885 | 1.7%
수원시 | 685 | 1.3%
성남시 | 585 | 1.2%
용인시 | 531 | 1.0%
원미구 | 478 | 0.9%
안산시 | 431 | 0.8%
화성시 | 385 | 0.8%
Other values (7677) | 36232 | 71.3%

Most occurring characters

ValueCountFrequency (%)
43096
 
18.0%
1 12634
 
5.3%
8249
 
3.4%
8197
 
3.4%
8190
 
3.4%
8159
 
3.4%
7573
 
3.2%
6755
 
2.8%
, 6739
 
2.8%
2 5393
 
2.2%
Other values (622) 124928
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 136758
57.0%
Space Separator 43096
 
18.0%
Decimal Number 40318
 
16.8%
Other Punctuation 6803
 
2.8%
Open Punctuation 5373
 
2.2%
Close Punctuation 5373
 
2.2%
Dash Punctuation 1297
 
0.5%
Uppercase Letter 731
 
0.3%
Math Symbol 86
 
< 0.1%
Lowercase Letter 71
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8249
 
6.0%
8197
 
6.0%
8190
 
6.0%
8159
 
6.0%
7573
 
5.5%
6755
 
4.9%
4564
 
3.3%
3330
 
2.4%
3223
 
2.4%
2668
 
2.0%
Other values (552) 75850
55.5%
Uppercase Letter
ValueCountFrequency (%)
A 137
18.7%
B 127
17.4%
C 61
 
8.3%
I 43
 
5.9%
S 42
 
5.7%
K 37
 
5.1%
D 33
 
4.5%
F 30
 
4.1%
T 29
 
4.0%
G 27
 
3.7%
Other values (15) 165
22.6%
Lowercase Letter
ValueCountFrequency (%)
e 16
22.5%
c 8
11.3%
b 6
 
8.5%
n 5
 
7.0%
a 4
 
5.6%
r 4
 
5.6%
i 4
 
5.6%
t 3
 
4.2%
m 3
 
4.2%
l 3
 
4.2%
Other values (9) 15
21.1%
Decimal Number
ValueCountFrequency (%)
1 12634
31.3%
2 5393
13.4%
0 4314
 
10.7%
3 3829
 
9.5%
4 2854
 
7.1%
5 2697
 
6.7%
6 2350
 
5.8%
7 2272
 
5.6%
8 2065
 
5.1%
9 1910
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 6739
99.1%
. 50
 
0.7%
/ 6
 
0.1%
@ 3
 
< 0.1%
· 3
 
< 0.1%
& 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 82
95.3%
> 2
 
2.3%
< 2
 
2.3%
Letter Number
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
Space Separator
ValueCountFrequency (%)
43096
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5373
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5373
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1297
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 136758
57.0%
Common 102346
42.7%
Latin 809
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8249
 
6.0%
8197
 
6.0%
8190
 
6.0%
8159
 
6.0%
7573
 
5.5%
6755
 
4.9%
4564
 
3.3%
3330
 
2.4%
3223
 
2.4%
2668
 
2.0%
Other values (552) 75850
55.5%
Latin
ValueCountFrequency (%)
A 137
16.9%
B 127
15.7%
C 61
 
7.5%
I 43
 
5.3%
S 42
 
5.2%
K 37
 
4.6%
D 33
 
4.1%
F 30
 
3.7%
T 29
 
3.6%
G 27
 
3.3%
Other values (37) 243
30.0%
Common
ValueCountFrequency (%)
43096
42.1%
1 12634
 
12.3%
, 6739
 
6.6%
2 5393
 
5.3%
( 5373
 
5.2%
) 5373
 
5.2%
0 4314
 
4.2%
3 3829
 
3.7%
4 2854
 
2.8%
5 2697
 
2.6%
Other values (13) 10044
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 136758
57.0%
ASCII 103145
43.0%
Number Forms 7
 
< 0.1%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43096
41.8%
1 12634
 
12.2%
, 6739
 
6.5%
2 5393
 
5.2%
( 5373
 
5.2%
) 5373
 
5.2%
0 4314
 
4.2%
3 3829
 
3.7%
4 2854
 
2.8%
5 2697
 
2.6%
Other values (56) 10843
 
10.5%
Hangul
ValueCountFrequency (%)
8249
 
6.0%
8197
 
6.0%
8190
 
6.0%
8159
 
6.0%
7573
 
5.5%
6755
 
4.9%
4564
 
3.3%
3330
 
2.4%
3223
 
2.4%
2668
 
2.0%
Other values (552) 75850
55.5%
Number Forms
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
None
ValueCountFrequency (%)
· 3
100.0%

소재지지번주소 (lot-number address)
Text

Distinct: 7956
Distinct (%): 95.9%
Missing: 0
Missing (%): 0.0%
Memory size: 64.9 KiB

Length

Max length: 77
Median length: 54
Mean length: 29.152001
Min length: 14

Characters and Unicode

Total characters: 241845
Distinct characters: 612
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7726
Unique (%): 93.1%

Sample

1st row: 경기도 가평군 설악면 사룡리 282-2 지선 2층
2nd row: 경기도 가평군 청평면 대성리 399-10 외 6필지, 마동 1층
3rd row: 경기도 가평군 조종면 현리 262-49 보석타운
4th row: 경기도 가평군 가평읍 읍내리 617-6 외 1필지, 1층
5th row: 경기도 가평군 가평읍 읍내리 474-15 외 1필지

Value | Count | Frequency (%)
경기도 | 8295 | 16.4%
1층 | 1579 | 3.1%
부천시 | 1048 | 2.1%
고양시 | 908 | 1.8%
수원시 | 722 | 1.4%
성남시 | 620 | 1.2%
용인시 | 556 | 1.1%
원미구 | 541 | 1.1%
안산시 | 456 | 0.9%
화성시 | 409 | 0.8%
Other values (10322) | 35378 | 70.0%

Most occurring characters

ValueCountFrequency (%)
44583
 
18.4%
1 14554
 
6.0%
8852
 
3.7%
8622
 
3.6%
8621
 
3.6%
8542
 
3.5%
8361
 
3.5%
7613
 
3.1%
- 6241
 
2.6%
5716
 
2.4%
Other values (602) 120140
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137702
56.9%
Decimal Number 49198
 
20.3%
Space Separator 44583
 
18.4%
Dash Punctuation 6241
 
2.6%
Other Punctuation 1434
 
0.6%
Uppercase Letter 855
 
0.4%
Open Punctuation 838
 
0.3%
Close Punctuation 835
 
0.3%
Math Symbol 84
 
< 0.1%
Lowercase Letter 67
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8852
 
6.4%
8622
 
6.3%
8621
 
6.3%
8542
 
6.2%
8361
 
6.1%
7613
 
5.5%
5716
 
4.2%
4768
 
3.5%
3323
 
2.4%
3224
 
2.3%
Other values (533) 70060
50.9%
Uppercase Letter
ValueCountFrequency (%)
B 160
18.7%
A 152
17.8%
C 65
 
7.6%
S 60
 
7.0%
G 48
 
5.6%
I 43
 
5.0%
F 40
 
4.7%
K 39
 
4.6%
D 33
 
3.9%
T 33
 
3.9%
Other values (15) 182
21.3%
Lowercase Letter
ValueCountFrequency (%)
e 15
22.4%
c 7
10.4%
n 5
 
7.5%
a 5
 
7.5%
r 4
 
6.0%
i 4
 
6.0%
t 3
 
4.5%
l 3
 
4.5%
m 3
 
4.5%
d 3
 
4.5%
Other values (10) 15
22.4%
Decimal Number
ValueCountFrequency (%)
1 14554
29.6%
2 5550
 
11.3%
0 5217
 
10.6%
3 4339
 
8.8%
4 3794
 
7.7%
5 3691
 
7.5%
6 3404
 
6.9%
7 3196
 
6.5%
8 2776
 
5.6%
9 2677
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 1334
93.0%
. 71
 
5.0%
@ 15
 
1.0%
/ 10
 
0.7%
& 2
 
0.1%
· 2
 
0.1%
Letter Number
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
Space Separator
ValueCountFrequency (%)
44583
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6241
100.0%
Open Punctuation
ValueCountFrequency (%)
( 838
100.0%
Close Punctuation
ValueCountFrequency (%)
) 835
100.0%
Math Symbol
ValueCountFrequency (%)
~ 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 137699
56.9%
Common 103213
42.7%
Latin 930
 
0.4%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8852
 
6.4%
8622
 
6.3%
8621
 
6.3%
8542
 
6.2%
8361
 
6.1%
7613
 
5.5%
5716
 
4.2%
4768
 
3.5%
3323
 
2.4%
3224
 
2.3%
Other values (532) 70057
50.9%
Latin
ValueCountFrequency (%)
B 160
17.2%
A 152
16.3%
C 65
 
7.0%
S 60
 
6.5%
G 48
 
5.2%
I 43
 
4.6%
F 40
 
4.3%
K 39
 
4.2%
D 33
 
3.5%
T 33
 
3.5%
Other values (38) 257
27.6%
Common
ValueCountFrequency (%)
44583
43.2%
1 14554
 
14.1%
- 6241
 
6.0%
2 5550
 
5.4%
0 5217
 
5.1%
3 4339
 
4.2%
4 3794
 
3.7%
5 3691
 
3.6%
6 3404
 
3.3%
7 3196
 
3.1%
Other values (11) 8644
 
8.4%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 137698
56.9%
ASCII 104133
43.1%
Number Forms 8
 
< 0.1%
CJK 3
 
< 0.1%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
44583
42.8%
1 14554
 
14.0%
- 6241
 
6.0%
2 5550
 
5.3%
0 5217
 
5.0%
3 4339
 
4.2%
4 3794
 
3.6%
5 3691
 
3.5%
6 3404
 
3.3%
7 3196
 
3.1%
Other values (55) 9564
 
9.2%
Hangul
ValueCountFrequency (%)
8852
 
6.4%
8622
 
6.3%
8621
 
6.3%
8542
 
6.2%
8361
 
6.1%
7613
 
5.5%
5716
 
4.2%
4768
 
3.5%
3323
 
2.4%
3224
 
2.3%
Other values (531) 70056
50.9%
CJK
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

소재지우편번호 (postal code)
Real number (ℝ)

HIGH CORRELATION  MISSING

Distinct: 2364
Distinct (%): 29.3%
Missing: 220
Missing (%): 2.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 14273.987
Minimum: 10017
Maximum: 34921
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 73.0 KiB

Quantile statistics

Minimum: 10017
5-th percentile: 10366
Q1: 12067
median: 14545
Q3: 16507
95-th percentile: 18145
Maximum: 34921
Range: 24904
Interquartile range (IQR): 4440

Descriptive statistics

Standard deviation: 2528.436
Coefficient of variation (CV): 0.17713594
Kurtosis: -0.60857174
Mean: 14273.987
Median Absolute Deviation (MAD): 2127
Skewness: -0.027766773
Sum: 1.1527672 × 10^8
Variance: 6392988.5
Monotonicity: Not monotonic
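
Several of these statistics are derived from the others, and the maximum (34921) sits far above the 95th percentile (18145). The sketch below recomputes the range, IQR, and CV from the values above, then applies Tukey's 1.5 × IQR fence as one possible outlier check. The fence rule is an illustrative choice here, not something the report itself applies.

```python
# Values copied from the quantile and descriptive statistics above.
minimum, q1, q3, maximum = 10017, 12067, 16507, 34921
mean, std = 14273.987, 2528.436

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation

# Tukey fences: an assumed, conventional rule of thumb for outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
max_is_outlier = maximum > upper_fence

print(value_range, iqr, round(cv, 6))
print(lower_fence, upper_fence, max_is_outlier)
```

Under this rule the maximum is flagged, which matches the Sample section: the row with postal code 34921 carries a 대전광역시 address rather than one in 경기도.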
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
14637 | 56 | 0.7%
10401 | 53 | 0.6%
14545 | 50 | 0.6%
13992 | 49 | 0.6%
10500 | 38 | 0.5%
14548 | 35 | 0.4%
15865 | 34 | 0.4%
15360 | 34 | 0.4%
17006 | 32 | 0.4%
14546 | 31 | 0.4%
Other values (2354) | 7664 | 92.4%
(Missing) | 220 | 2.7%
ValueCountFrequency (%)
10017 3
 
< 0.1%
10018 9
0.1%
10019 1
 
< 0.1%
10020 1
 
< 0.1%
10023 1
 
< 0.1%
10031 3
 
< 0.1%
10039 4
< 0.1%
10040 1
 
< 0.1%
10044 1
 
< 0.1%
10048 1
 
< 0.1%
ValueCountFrequency (%)
34921 1
 
< 0.1%
18634 1
 
< 0.1%
18624 2
< 0.1%
18623 1
 
< 0.1%
18614 2
< 0.1%
18611 2
< 0.1%
18608 1
 
< 0.1%
18606 4
< 0.1%
18603 3
< 0.1%
18602 3
< 0.1%

WGS84위도 (WGS84 latitude)
Real number (ℝ)

HIGH CORRELATION  MISSING

Distinct: 5569
Distinct (%): 69.0%
Missing: 224
Missing (%): 2.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.43812
Minimum: 36.327974
Maximum: 38.101919
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 73.0 KiB

Quantile statistics

Minimum: 36.327974
5-th percentile: 37.070059
Q1: 37.291564
median: 37.410359
Q3: 37.623844
95-th percentile: 37.778275
Maximum: 38.101919
Range: 1.7739449
Interquartile range (IQR): 0.33228014

Descriptive statistics

Standard deviation: 0.20958607
Coefficient of variation (CV): 0.0055981998
Kurtosis: -0.30892575
Mean: 37.43812
Median Absolute Deviation (MAD): 0.13591909
Skewness: 0.079536842
Sum: 302200.5
Variance: 0.043926322
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
37.5041752477 | 32 | 0.4%
37.8163995178 | 28 | 0.3%
37.4840373391 | 25 | 0.3%
37.6026921337 | 19 | 0.2%
37.5043171668 | 18 | 0.2%
37.653754696 | 17 | 0.2%
37.3171248925 | 17 | 0.2%
37.6679786706 | 16 | 0.2%
37.4888795916 | 16 | 0.2%
37.4029764684 | 15 | 0.2%
Other values (5559) | 7869 | 94.9%
(Missing) | 224 | 2.7%
ValueCountFrequency (%)
36.3279744725 1
< 0.1%
36.9594313452 1
< 0.1%
36.9596360398 1
< 0.1%
36.9598214981 1
< 0.1%
36.960419103 1
< 0.1%
36.9605120801 1
< 0.1%
36.9606650409 1
< 0.1%
36.9608411003 1
< 0.1%
36.9632477671 1
< 0.1%
36.9643606434 2
< 0.1%
ValueCountFrequency (%)
38.1019193582 1
< 0.1%
38.1016023516 1
< 0.1%
38.1005066857 1
< 0.1%
38.0989737349 1
< 0.1%
38.0909917096 1
< 0.1%
38.0908290855 1
< 0.1%
38.0479480115 1
< 0.1%
38.0305133462 1
< 0.1%
38.0293251219 1
< 0.1%
38.0282034881 1
< 0.1%

WGS84경도 (WGS84 longitude)
Real number (ℝ)

HIGH CORRELATION  MISSING

Distinct: 5569
Distinct (%): 69.0%
Missing: 224
Missing (%): 2.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.97919
Minimum: 126.54699
Maximum: 127.68065
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 73.0 KiB

Quantile statistics

Minimum: 126.54699
5-th percentile: 126.74536
Q1: 126.80407
median: 126.97229
Q3: 127.10966
95-th percentile: 127.28818
Maximum: 127.68065
Range: 1.1336578
Interquartile range (IQR): 0.30559454

Descriptive statistics

Standard deviation: 0.19287127
Coefficient of variation (CV): 0.0015189203
Kurtosis: 0.45209316
Mean: 126.97919
Median Absolute Deviation (MAD): 0.15245713
Skewness: 0.69602003
Sum: 1024976
Variance: 0.037199325
Monotonicity: Not monotonic
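
The minimum latitude (36.327974) belongs to the last sample row (index 8295), whose address is in 대전광역시. To gauge how far that record lies from the bulk of the data, the sketch below computes the great-circle distance between it and the point formed by the mean latitude and mean longitude. Using the two means as a "centre" is a rough assumption, and the haversine helper and Earth radius constant are illustrative choices.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, a common approximation


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two WGS84 points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Mean latitude and longitude from the two variable summaries.
mean_point = (37.43812, 126.97919)
# Coordinates of sample row 8295 (the minimum latitude in the data).
southernmost = (36.327974, 127.428058)

distance_km = haversine_km(*mean_point, *southernmost)
print(f"{distance_km:.0f} km from the mean point")
```

A distance on the order of 130 km is large next to a latitude standard deviation of about 0.21 degrees, so this row is a reasonable candidate for review.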
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
126.7566905848 | 32 | 0.4%
127.5289081709 | 28 | 0.3%
126.7826449277 | 25 | 0.3%
127.1437935543 | 19 | 0.2%
126.7620745903 | 18 | 0.2%
126.7686925095 | 17 | 0.2%
126.8500756669 | 17 | 0.2%
126.7516242854 | 16 | 0.2%
126.7552924952 | 16 | 0.2%
126.9221889785 | 15 | 0.2%
Other values (5559) | 7869 | 94.9%
(Missing) | 224 | 2.7%
ValueCountFrequency (%)
126.5469890908 1
< 0.1%
126.569437195 1
< 0.1%
126.574551471 2
< 0.1%
126.5829862284 1
< 0.1%
126.5832333559 1
< 0.1%
126.5833687361 1
< 0.1%
126.5835139492 1
< 0.1%
126.5842926171 1
< 0.1%
126.5934833222 2
< 0.1%
126.5962032106 1
< 0.1%
ValueCountFrequency (%)
127.6806468826 1
< 0.1%
127.6572227621 1
< 0.1%
127.6459893506 1
< 0.1%
127.6445962471 1
< 0.1%
127.6437253737 1
< 0.1%
127.6423443633 1
< 0.1%
127.6407297023 2
< 0.1%
127.6396258835 1
< 0.1%
127.6394888035 1
< 0.1%
127.6373729111 1
< 0.1%

Interactions

[Figures omitted: nine interaction plots, presumably the 3 × 3 pairings of the numeric variables 소재지우편번호, WGS84위도, and WGS84경도.]

Correlations

 | 시군명 | 영업상태명 | 소재지우편번호 | WGS84위도 | WGS84경도
시군명 | 1.000 | 0.619 | 0.997 | 0.970 | 0.947
영업상태명 | 0.619 | 1.000 | 0.237 | 0.482 | 0.262
소재지우편번호 | 0.997 | 0.237 | 1.000 | 0.950 | 0.569
WGS84위도 | 0.970 | 0.482 | 0.950 | 1.000 | 0.495
WGS84경도 | 0.947 | 0.262 | 0.569 | 0.495 | 1.000

 | 위생업종명 | 시군명 | 영업상태명
위생업종명 | 1.000 | 1.000 | 1.000
시군명 | 1.000 | 1.000 | 0.370
영업상태명 | 1.000 | 0.370 | 1.000

 | 소재지우편번호 | WGS84위도 | WGS84경도 | 시군명 | 영업상태명 | 위생업종명
소재지우편번호 | 1.000 | -0.904 | 0.247 | 0.989 | 0.195 | 1.000
WGS84위도 | -0.904 | 1.000 | -0.306 | 0.851 | 0.231 | 1.000
WGS84경도 | 0.247 | -0.306 | 1.000 | 0.722 | 0.159 | 1.000
시군명 | 0.989 | 0.851 | 0.722 | 1.000 | 0.370 | 1.000
영업상태명 | 0.195 | 0.231 | 0.159 | 0.370 | 1.000 | 1.000
위생업종명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
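
The High correlation alerts can be related to the last (6 × 6) matrix above. The sketch below lists, for each variable, the other variables whose absolute coefficient meets a cutoff. The cutoff of 0.5 is an assumption; it is not stated in this report, although with it the counts line up with the Alerts section (for example, 시군명 has four partners, which corresponds to "소재지우편번호 and 3 other fields").

```python
THRESHOLD = 0.5  # assumed cutoff; not stated in the report

names = ["소재지우편번호", "WGS84위도", "WGS84경도", "시군명", "영업상태명", "위생업종명"]

# Coefficients copied row by row from the 6 x 6 matrix above.
matrix = [
    [1.000, -0.904, 0.247, 0.989, 0.195, 1.000],
    [-0.904, 1.000, -0.306, 0.851, 0.231, 1.000],
    [0.247, -0.306, 1.000, 0.722, 0.159, 1.000],
    [0.989, 0.851, 0.722, 1.000, 0.370, 1.000],
    [0.195, 0.231, 0.159, 0.370, 1.000, 1.000],
    [1.000, 1.000, 1.000, 1.000, 1.000, 1.000],
]


def high_correlations(names, matrix, threshold):
    """Map each variable to the others with |coefficient| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            result[name] = partners
    return result


partners_by_variable = high_correlations(names, matrix, THRESHOLD)
partner_counts = {name: len(p) for name, p in partners_by_variable.items()}
print(partner_counts)
```

The absolute value matters here: the coefficient between 소재지우편번호 and WGS84위도 is strongly negative (-0.904) and would be missed by a signed comparison.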

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 다중이용업소여부 | 총시설규모(㎡) | 위생업종명 | 위생업태명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
0 | 가평군 | 캠프통포레스트 바지선 | 20210705 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | <NA> | 경기도 가평군 설악면 사룡리 282-2 지선 2층 | 12460 | 37.703431 | 127.490866
1 | 가평군 | 빨간커피통 | 20121019 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 청평면 경춘로 89-1, 마동 1층 | 경기도 가평군 청평면 대성리 399-10 외 6필지, 마동 1층 | 12457 | 37.683448 | 127.377106
2 | 가평군 | 더달달 | 2023-10-16 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 조종면 현창로38번길 16, 1층 | 경기도 가평군 조종면 현리 262-49 보석타운 | 12437 | 37.819589 | 127.349271
3 | 가평군 | 세븐일레븐 가평군청점 | 20100202 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 석봉로 192, 1층 | 경기도 가평군 가평읍 읍내리 617-6 외 1필지, 1층 | 12413 | 37.832213 | 127.510319
4 | 가평군 | 롯데리아 가평점 | 20060504 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 가화로 106 (외 1필지) | 경기도 가평군 가평읍 읍내리 474-15 외 1필지 | 12419 | 37.829202 | 127.514182
5 | 가평군 | 치치꼬꼬 | 2015-06-24 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 가화로 37, 1층 | 경기도 가평군 가평읍 대곡리 219-1 ,1층 | 12420 | 37.822839 | 127.515408
6 | 가평군 | 가평(서울방향)휴게소 롯데리아 | 2021-07-01 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 설악면 미사리로 544 | 경기도 가평군 설악면 미사리 145-3 외 37필지 | 12462 | 37.701738 | 127.543471
7 | 가평군 | 베스킨라빈스(가평점) | 20111111 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 가화로 124 | 경기도 가평군 가평읍 읍내리 471-3 외1필지 | 12419 | 37.830734 | 127.513512
8 | 가평군 | 메가엠지씨커피 가평점 | 19990312 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 가화로 110-1, 주1동 1층 | 경기도 가평군 가평읍 읍내리 474-3 주1동, 1층 | 12419 | 37.829626 | 127.514011
9 | 가평군 | 이쎈피자 | 19911120 | 영업 | <NA> | <NA> | <NA> | <NA> | 패스트푸드 | 경기도 가평군 가평읍 중촌로 10 | 경기도 가평군 가평읍 읍내리 613 | 12414 | 37.832997 | 127.509319

Last rows

Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 다중이용업소여부 | 총시설규모(㎡) | 위생업종명 | 위생업태명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
8286 | 화성시 | 향남발안공단점 | 20140328 | 폐업 등 | 20180424 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 향남읍 발안공단로5길 93, 1층 | 경기도 화성시 향남읍 구문천리 152-8번지 | 18623 | 37.081945 | 126.905722
8287 | 화성시 | 피자스쿨(동탄능동점) | 20090427 | 폐업 등 | 20140722 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 동탄지성로 135 (능동,(동탄엠플렉스 105호)) | 경기도 화성시 능동 1114-5번지 (동탄엠플렉스 105호) | 18431 | 37.209232 | 127.060429
8288 | 화성시 | 장금이치킨 | 20101215 | 폐업 등 | 20120315 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 향남읍 행정서남로 58-2 | 경기도 화성시 향남읍 행정리 399-9번지 | 18597 | 37.129421 | 126.910957
8289 | 화성시 | 황철수피자점 | 20011228 | 폐업 등 | 20180811 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 향남읍 3.1만세로 1130 | 경기도 화성시 향남읍 평리 30-23번지 | 18593 | 37.13389 | 126.910382
8290 | 화성시 | 요기꺼리 봉담점 | 20081215 | 폐업 등 | 20090609 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | <NA> | 경기도 화성시 봉담읍 동화리 1번지 봉담택지개발지구 A블럭3로트외1필지 204호 | 18298 | 37.230417 | 126.968293
8291 | 화성시 | 새벽닭옛날통닭 | 20141104 | 폐업 등 | 20151201 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 송산면 송산포도로 104, 1층 | 경기도 화성시 송산면 사강리 628-2번지 | 18550 | 37.214209 | 126.737477
8292 | 화성시 | 앵커맨북광장점 | 20141118 | 폐업 등 | 20160425 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 동탄지성로 14 (반송동, 삼현빌딩 105호 일부) | 경기도 화성시 반송동 91-8번지 삼현빌딩 105호 일부 | 18453 | 37.205149 | 127.072961
8293 | 화성시 | 던킨도너츠(동탄나루점) | 20080218 | 폐업 등 | 20130401 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 동탄솔빛로 68 (반송동, 218-1번지 계림네이쳐 103호) | 경기도 화성시 반송동 218-1번지 (계림네이쳐 103호) | 18442 | 37.19442 | 127.074937
8294 | 화성시 | 로티맘진안점 | 20081208 | 폐업 등 | 20150921 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 경기도 화성시 효행로 1063 (진안동,(메디프랜드105호)) | 경기도 화성시 진안동 914-4번지 (메디프랜드105호) | 18398 | 37.214971 | 127.043117
8295 | <NA> | (주)파워라인지 | 20100929 | 폐업 등 | 20120406 | <NA> | <NA> | 휴게음식점 | 패스트푸드 | 대전광역시 중구 대종로480번길 27 (은행동,(2층)) | 대전광역시 중구 은행동 33-6번지 (2층) | 34921 | 36.327974 | 127.428058

Duplicate rows

Most frequently occurring

Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 위생업종명 | 위생업태명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | # duplicates
0 | 남양주시 | (주)이마트 남양주점 | 2005-11-23 | 영업 | <NA> | <NA> | 패스트푸드 | 경기도 남양주시 늘을2로 27 (호평동) | 경기도 남양주시 호평동 637 | 12149 | 37.655012 | 127.243082 | 5
1 | 안산시 | 항공전(세븐일레븐) | 20100503 | 폐업 등 | 20100505 | 휴게음식점 | 패스트푸드 | <NA> | 경기도 안산시 상록구 사동 1639번지 항공전음식부스 | 15596 | 37.280434 | 126.833076 | 2
2 | 용인시 | 롯데쇼핑(주)롯데마트신갈점 | 20141124 | 영업 | <NA> | <NA> | 패스트푸드 | 경기도 용인시 기흥구 중부대로 375 (신갈동, 지하1층) | 경기도 용인시 기흥구 신갈동 63 지하1층 | 17064 | 37.272311 | 127.109116 | 2
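
The listing above groups identical rows and counts how often each occurs. A minimal sketch of that grouping with the standard library follows. The rows are reduced to three fields for brevity, and the function name `duplicate_groups` and the toy data are illustrative.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Toy data: (시군명, 사업장명, 인허가일자), shortened from the rows above.
rows = (
    [("남양주시", "(주)이마트 남양주점", "2005-11-23")] * 5
    + [("안산시", "항공전(세븐일레븐)", "20100503")] * 2
    + [("가평군", "더달달", "2023-10-16")]
)

groups = duplicate_groups(rows)
# Rows that could be dropped while keeping one copy of each group.
redundant_rows = sum(n - 1 for n in groups.values())

print(groups)
print(redundant_rows)
```

Because the comparison is on whole tuples, two records for the same business that differ in any compared field (for example a date written as `20051123` instead of `2005-11-23`) would not be grouped together.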