Overview

Dataset statistics

Number of variables14
Number of observations10000
Missing cells44105
Missing cells (%)31.5%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory1.2 MiB
Average record size in memory126.0 B

Variable types

Categorical3
Text3
DateTime2
Unsupported4
Numeric2

Dataset

Description휴게음식점(기타) 현황_인허가
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=UFJN30HK4YJXCDPMH9DU14298267&infSeq=1

Alerts

위생업태명 has constant value ""Constant
Dataset has 1 (< 0.1%) duplicate rowsDuplicates
WGS84위도 is highly overall correlated with 시군명High correlation
WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with WGS84위도 and 1 other fieldsHigh correlation
폐업일자 has 3648 (36.5%) missing valuesMissing
다중이용업소여부 has 10000 (100.0%) missing valuesMissing
총시설규모(㎡) has 10000 (100.0%) missing valuesMissing
위생업종명 has 10000 (100.0%) missing valuesMissing
소재지도로명주소 has 305 (3.0%) missing valuesMissing
소재지우편번호 has 10000 (100.0%) missing valuesMissing
다중이용업소여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported
위생업종명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-23 01:44:00.185621
Analysis finished2024-03-23 01:44:07.621482
Duration7.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
1245 
부천시
1206 
고양시
804 
성남시
609 
시흥시
571 
Other values (26)
5565 

Length

Max length4
Median length3
Mean length3.0763
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시
2nd row수원시
3rd row용인시
4th row고양시
5th row파주시

Common Values

ValueCountFrequency (%)
수원시 1245
12.4%
부천시 1206
12.1%
고양시 804
 
8.0%
성남시 609
 
6.1%
시흥시 571
 
5.7%
용인시 566
 
5.7%
화성시 540
 
5.4%
남양주시 522
 
5.2%
평택시 519
 
5.2%
파주시 411
 
4.1%
Other values (21) 3007
30.1%

Length

2024-03-23T01:44:07.805840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 1245
12.4%
부천시 1206
12.1%
고양시 804
 
8.0%
성남시 609
 
6.1%
시흥시 571
 
5.7%
용인시 566
 
5.7%
화성시 540
 
5.4%
남양주시 522
 
5.2%
평택시 519
 
5.2%
파주시 411
 
4.1%
Other values (21) 3007
30.1%
Distinct8905
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T01:44:08.354003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length7.7182
Min length1

Characters and Unicode

Total characters77182
Distinct characters1083
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8289 ?
Unique (%)82.9%

Sample

1st row엄마손칼국수
2nd row허브로틴 스마일
3rd row곰카페(G.O.M.cafe)
4th row복준 타코야끼
5th row보광훼미리마트
ValueCountFrequency (%)
세븐일레븐 75
 
0.6%
씨유 64
 
0.5%
카페 56
 
0.4%
gs25 50
 
0.4%
베스킨라빈스 41
 
0.3%
pc방 41
 
0.3%
주식회사 33
 
0.3%
지에스25 33
 
0.3%
pc 31
 
0.2%
휴게음식점 29
 
0.2%
Other values (9846) 12575
96.5%
2024-03-23T01:44:09.468755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3032
 
3.9%
2843
 
3.7%
2409
 
3.1%
1953
 
2.5%
) 1607
 
2.1%
( 1602
 
2.1%
1172
 
1.5%
772
 
1.0%
759
 
1.0%
736
 
1.0%
Other values (1073) 60297
78.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63809
82.7%
Uppercase Letter 3212
 
4.2%
Space Separator 3032
 
3.9%
Lowercase Letter 2303
 
3.0%
Close Punctuation 1620
 
2.1%
Open Punctuation 1615
 
2.1%
Decimal Number 1219
 
1.6%
Other Punctuation 252
 
0.3%
Dash Punctuation 107
 
0.1%
Math Symbol 5
 
< 0.1%
Other values (4) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2843
 
4.5%
2409
 
3.8%
1953
 
3.1%
1172
 
1.8%
772
 
1.2%
759
 
1.2%
736
 
1.2%
734
 
1.2%
714
 
1.1%
667
 
1.0%
Other values (986) 51050
80.0%
Uppercase Letter
ValueCountFrequency (%)
C 652
20.3%
P 562
17.5%
S 243
 
7.6%
G 185
 
5.8%
E 167
 
5.2%
A 166
 
5.2%
O 124
 
3.9%
T 105
 
3.3%
B 101
 
3.1%
R 95
 
3.0%
Other values (16) 812
25.3%
Lowercase Letter
ValueCountFrequency (%)
e 330
14.3%
o 236
 
10.2%
a 219
 
9.5%
c 145
 
6.3%
n 140
 
6.1%
i 131
 
5.7%
s 120
 
5.2%
r 106
 
4.6%
l 102
 
4.4%
t 97
 
4.2%
Other values (15) 677
29.4%
Other Punctuation
ValueCountFrequency (%)
& 107
42.5%
, 59
23.4%
. 38
 
15.1%
' 25
 
9.9%
/ 8
 
3.2%
: 4
 
1.6%
! 3
 
1.2%
% 3
 
1.2%
# 2
 
0.8%
· 1
 
0.4%
Other values (2) 2
 
0.8%
Decimal Number
ValueCountFrequency (%)
2 425
34.9%
5 242
19.9%
1 123
 
10.1%
4 94
 
7.7%
0 86
 
7.1%
3 83
 
6.8%
9 53
 
4.3%
8 45
 
3.7%
7 35
 
2.9%
6 33
 
2.7%
Math Symbol
ValueCountFrequency (%)
+ 3
60.0%
> 1
 
20.0%
< 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 1607
99.2%
] 13
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 1602
99.2%
[ 13
 
0.8%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
3032
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 107
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63799
82.7%
Common 7856
 
10.2%
Latin 5517
 
7.1%
Han 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2843
 
4.5%
2409
 
3.8%
1953
 
3.1%
1172
 
1.8%
772
 
1.2%
759
 
1.2%
736
 
1.2%
734
 
1.2%
714
 
1.1%
667
 
1.0%
Other values (976) 51040
80.0%
Latin
ValueCountFrequency (%)
C 652
 
11.8%
P 562
 
10.2%
e 330
 
6.0%
S 243
 
4.4%
o 236
 
4.3%
a 219
 
4.0%
G 185
 
3.4%
E 167
 
3.0%
A 166
 
3.0%
c 145
 
2.6%
Other values (43) 2612
47.3%
Common
ValueCountFrequency (%)
3032
38.6%
) 1607
20.5%
( 1602
20.4%
2 425
 
5.4%
5 242
 
3.1%
1 123
 
1.6%
& 107
 
1.4%
- 107
 
1.4%
4 94
 
1.2%
0 86
 
1.1%
Other values (24) 431
 
5.5%
Han
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63799
82.7%
ASCII 13369
 
17.3%
CJK 8
 
< 0.1%
Number Forms 2
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
None 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3032
22.7%
) 1607
 
12.0%
( 1602
 
12.0%
C 652
 
4.9%
P 562
 
4.2%
2 425
 
3.2%
e 330
 
2.5%
S 243
 
1.8%
5 242
 
1.8%
o 236
 
1.8%
Other values (73) 4438
33.2%
Hangul
ValueCountFrequency (%)
2843
 
4.5%
2409
 
3.8%
1953
 
3.1%
1172
 
1.8%
772
 
1.2%
759
 
1.2%
736
 
1.2%
734
 
1.2%
714
 
1.1%
667
 
1.0%
Other values (976) 51040
80.0%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
· 1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct4269
Distinct (%)42.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1972-08-21 00:00:00
Maximum2024-03-12 00:00:00
2024-03-23T01:44:10.024710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:44:10.727393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
6352 
영업
3648 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row폐업
3rd row영업
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
폐업 6352
63.5%
영업 3648
36.5%

Length

2024-03-23T01:44:11.353559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T01:44:11.797414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 6352
63.5%
영업 3648
36.5%

폐업일자
Date

MISSING 

Distinct3345
Distinct (%)52.7%
Missing3648
Missing (%)36.5%
Memory size156.2 KiB
Minimum1994-10-01 00:00:00
Maximum2024-03-12 00:00:00
2024-03-23T01:44:12.321662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:44:12.818521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

다중이용업소여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

위생업종명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

위생업태명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
기타 휴게음식점
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타 휴게음식점
2nd row기타 휴게음식점
3rd row기타 휴게음식점
4th row기타 휴게음식점
5th row기타 휴게음식점

Common Values

ValueCountFrequency (%)
기타 휴게음식점 10000
100.0%

Length

2024-03-23T01:44:13.370097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T01:44:13.753452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 10000
50.0%
휴게음식점 10000
50.0%
Distinct8809
Distinct (%)90.9%
Missing305
Missing (%)3.0%
Memory size156.2 KiB
2024-03-23T01:44:14.607441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length115
Median length59
Mean length35.696854
Min length13

Characters and Unicode

Total characters346081
Distinct characters709
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8484 ?
Unique (%)87.5%

Sample

1st row경기도 수원시 장안구 정조로945번길 21 (영화동)
2nd row경기도 수원시 영통구 매탄로 142, 양성프라자 1층 101호 (매탄동)
3rd row경기도 용인시 처인구 포곡읍 에버랜드로 74, 1층 일부호
4th row경기도 고양시 일산동구 경의로 309, 백마상가 지하1층 B03(일부)호 (마두동)
5th row경기도 파주시 문산읍 봉미로 32 (선유리)
ValueCountFrequency (%)
경기도 9695
 
13.4%
1층 3088
 
4.3%
수원시 1212
 
1.7%
부천시 1149
 
1.6%
일부호 1116
 
1.5%
지하1층 812
 
1.1%
고양시 797
 
1.1%
일부 731
 
1.0%
성남시 591
 
0.8%
시흥시 561
 
0.8%
Other values (10729) 52376
72.6%
2024-03-23T01:44:15.832787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62541
 
18.1%
1 16830
 
4.9%
10988
 
3.2%
10813
 
3.1%
10213
 
3.0%
10053
 
2.9%
10037
 
2.9%
, 9499
 
2.7%
9367
 
2.7%
( 8903
 
2.6%
Other values (699) 186837
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 197659
57.1%
Space Separator 62541
 
18.1%
Decimal Number 54671
 
15.8%
Other Punctuation 9589
 
2.8%
Open Punctuation 8903
 
2.6%
Close Punctuation 8902
 
2.6%
Dash Punctuation 1928
 
0.6%
Uppercase Letter 1662
 
0.5%
Lowercase Letter 109
 
< 0.1%
Math Symbol 99
 
< 0.1%
Other values (2) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10988
 
5.6%
10813
 
5.5%
10213
 
5.2%
10053
 
5.1%
10037
 
5.1%
9367
 
4.7%
6219
 
3.1%
5785
 
2.9%
4673
 
2.4%
4641
 
2.3%
Other values (626) 114870
58.1%
Uppercase Letter
ValueCountFrequency (%)
B 306
18.4%
A 263
15.8%
C 120
 
7.2%
E 108
 
6.5%
K 107
 
6.4%
S 87
 
5.2%
T 83
 
5.0%
G 72
 
4.3%
L 62
 
3.7%
R 50
 
3.0%
Other values (16) 404
24.3%
Lowercase Letter
ValueCountFrequency (%)
e 26
23.9%
c 11
10.1%
l 9
 
8.3%
m 7
 
6.4%
a 7
 
6.4%
k 6
 
5.5%
p 6
 
5.5%
t 6
 
5.5%
s 5
 
4.6%
u 5
 
4.6%
Other values (8) 21
19.3%
Decimal Number
ValueCountFrequency (%)
1 16830
30.8%
2 7512
13.7%
0 6302
 
11.5%
3 4826
 
8.8%
4 3869
 
7.1%
5 3677
 
6.7%
7 3314
 
6.1%
6 3124
 
5.7%
8 2688
 
4.9%
9 2529
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 9499
99.1%
. 70
 
0.7%
@ 8
 
0.1%
: 6
 
0.1%
/ 3
 
< 0.1%
& 2
 
< 0.1%
· 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 96
97.0%
< 1
 
1.0%
> 1
 
1.0%
+ 1
 
1.0%
Letter Number
ValueCountFrequency (%)
9
52.9%
6
35.3%
2
 
11.8%
Space Separator
ValueCountFrequency (%)
62541
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8903
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8902
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1928
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 197657
57.1%
Common 146634
42.4%
Latin 1788
 
0.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10988
 
5.6%
10813
 
5.5%
10213
 
5.2%
10053
 
5.1%
10037
 
5.1%
9367
 
4.7%
6219
 
3.1%
5785
 
2.9%
4673
 
2.4%
4641
 
2.3%
Other values (625) 114868
58.1%
Latin
ValueCountFrequency (%)
B 306
17.1%
A 263
14.7%
C 120
 
6.7%
E 108
 
6.0%
K 107
 
6.0%
S 87
 
4.9%
T 83
 
4.6%
G 72
 
4.0%
L 62
 
3.5%
R 50
 
2.8%
Other values (37) 530
29.6%
Common
ValueCountFrequency (%)
62541
42.7%
1 16830
 
11.5%
, 9499
 
6.5%
( 8903
 
6.1%
) 8902
 
6.1%
2 7512
 
5.1%
0 6302
 
4.3%
3 4826
 
3.3%
4 3869
 
2.6%
5 3677
 
2.5%
Other values (16) 13773
 
9.4%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 197656
57.1%
ASCII 148404
42.9%
Number Forms 17
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
62541
42.1%
1 16830
 
11.3%
, 9499
 
6.4%
( 8903
 
6.0%
) 8902
 
6.0%
2 7512
 
5.1%
0 6302
 
4.2%
3 4826
 
3.3%
4 3869
 
2.6%
5 3677
 
2.5%
Other values (59) 15543
 
10.5%
Hangul
ValueCountFrequency (%)
10988
 
5.6%
10813
 
5.5%
10213
 
5.2%
10053
 
5.1%
10037
 
5.1%
9367
 
4.7%
6219
 
3.1%
5785
 
2.9%
4673
 
2.4%
4641
 
2.3%
Other values (624) 114867
58.1%
Number Forms
ValueCountFrequency (%)
9
52.9%
6
35.3%
2
 
11.8%
CJK
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct8855
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T01:44:16.493606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length57
Mean length28.4289
Min length14

Characters and Unicode

Total characters284289
Distinct characters674
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8397 ?
Unique (%)84.0%

Sample

1st row경기도 수원시 장안구 영화동 342-1
2nd row경기도 수원시 영통구 매탄동 1233-2 양성프라자
3rd row경기도 용인시 처인구 포곡읍 전대리 102-2
4th row경기도 고양시 일산동구 마두동 745 백마상가
5th row경기도 파주시 문산읍 선유리 904번지
ValueCountFrequency (%)
경기도 10000
 
16.5%
1층 1339
 
2.2%
수원시 1245
 
2.0%
부천시 1206
 
2.0%
고양시 804
 
1.3%
일부 790
 
1.3%
성남시 609
 
1.0%
시흥시 571
 
0.9%
용인시 566
 
0.9%
화성시 540
 
0.9%
Other values (11982) 43102
70.9%
2024-03-23T01:44:17.943237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56943
20.0%
1 14493
 
5.1%
10836
 
3.8%
10627
 
3.7%
10261
 
3.6%
10259
 
3.6%
10122
 
3.6%
- 6888
 
2.4%
2 6348
 
2.2%
5831
 
2.1%
Other values (664) 141681
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 163125
57.4%
Space Separator 56943
 
20.0%
Decimal Number 53592
 
18.9%
Dash Punctuation 6888
 
2.4%
Uppercase Letter 1383
 
0.5%
Other Punctuation 850
 
0.3%
Open Punctuation 656
 
0.2%
Close Punctuation 652
 
0.2%
Lowercase Letter 108
 
< 0.1%
Math Symbol 76
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10836
 
6.6%
10627
 
6.5%
10261
 
6.3%
10259
 
6.3%
10122
 
6.2%
5831
 
3.6%
4872
 
3.0%
3750
 
2.3%
3543
 
2.2%
3374
 
2.1%
Other values (591) 89650
55.0%
Uppercase Letter
ValueCountFrequency (%)
B 202
14.6%
A 192
13.9%
C 99
 
7.2%
E 94
 
6.8%
K 87
 
6.3%
S 86
 
6.2%
G 82
 
5.9%
T 72
 
5.2%
L 67
 
4.8%
R 44
 
3.2%
Other values (16) 358
25.9%
Lowercase Letter
ValueCountFrequency (%)
e 26
24.1%
l 11
10.2%
c 10
 
9.3%
t 7
 
6.5%
m 7
 
6.5%
k 6
 
5.6%
a 6
 
5.6%
p 5
 
4.6%
b 4
 
3.7%
n 4
 
3.7%
Other values (8) 22
20.4%
Decimal Number
ValueCountFrequency (%)
1 14493
27.0%
2 6348
11.8%
0 5331
 
9.9%
3 4773
 
8.9%
4 4399
 
8.2%
6 4287
 
8.0%
5 4178
 
7.8%
7 3650
 
6.8%
8 3260
 
6.1%
9 2873
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 760
89.4%
. 64
 
7.5%
@ 11
 
1.3%
· 6
 
0.7%
/ 6
 
0.7%
& 2
 
0.2%
? 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 74
97.4%
< 1
 
1.3%
> 1
 
1.3%
Letter Number
ValueCountFrequency (%)
8
50.0%
6
37.5%
2
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 655
99.8%
{ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 651
99.8%
} 1
 
0.2%
Space Separator
ValueCountFrequency (%)
56943
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6888
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 163122
57.4%
Common 119657
42.1%
Latin 1507
 
0.5%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10836
 
6.6%
10627
 
6.5%
10261
 
6.3%
10259
 
6.3%
10122
 
6.2%
5831
 
3.6%
4872
 
3.0%
3750
 
2.3%
3543
 
2.2%
3374
 
2.1%
Other values (589) 89647
55.0%
Latin
ValueCountFrequency (%)
B 202
13.4%
A 192
 
12.7%
C 99
 
6.6%
E 94
 
6.2%
K 87
 
5.8%
S 86
 
5.7%
G 82
 
5.4%
T 72
 
4.8%
L 67
 
4.4%
R 44
 
2.9%
Other values (37) 482
32.0%
Common
ValueCountFrequency (%)
56943
47.6%
1 14493
 
12.1%
- 6888
 
5.8%
2 6348
 
5.3%
0 5331
 
4.5%
3 4773
 
4.0%
4 4399
 
3.7%
6 4287
 
3.6%
5 4178
 
3.5%
7 3650
 
3.1%
Other values (16) 8367
 
7.0%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 163119
57.4%
ASCII 121142
42.6%
Number Forms 16
 
< 0.1%
None 6
 
< 0.1%
CJK 3
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
56943
47.0%
1 14493
 
12.0%
- 6888
 
5.7%
2 6348
 
5.2%
0 5331
 
4.4%
3 4773
 
3.9%
4 4399
 
3.6%
6 4287
 
3.5%
5 4178
 
3.4%
7 3650
 
3.0%
Other values (59) 9852
 
8.1%
Hangul
ValueCountFrequency (%)
10836
 
6.6%
10627
 
6.5%
10261
 
6.3%
10259
 
6.3%
10122
 
6.2%
5831
 
3.6%
4872
 
3.0%
3750
 
2.3%
3543
 
2.2%
3374
 
2.1%
Other values (586) 89644
55.0%
Number Forms
ValueCountFrequency (%)
8
50.0%
6
37.5%
2
 
12.5%
None
ValueCountFrequency (%)
· 6
100.0%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

소재지우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct7420
Distinct (%)74.8%
Missing76
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean37.434136
Minimum36.935029
Maximum38.099935
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-23T01:44:18.372742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.935029
5-th percentile37.063111
Q137.286474
median37.407133
Q337.602702
95-th percentile37.752257
Maximum38.099935
Range1.1649062
Interquartile range (IQR)0.31622779

Descriptive statistics

Standard deviation0.20703534
Coefficient of variation (CV)0.0055306562
Kurtosis-0.34356735
Mean37.434136
Median Absolute Deviation (MAD)0.13600397
Skewness0.060275962
Sum371496.37
Variance0.042863632
MonotonicityNot monotonic
2024-03-23T01:44:19.047123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.545418129 87
 
0.9%
37.5043171668 84
 
0.8%
37.3927822598 81
 
0.8%
37.6679786706 79
 
0.8%
37.6649345 54
 
0.5%
37.200675377 50
 
0.5%
37.2656360259 50
 
0.5%
37.5025517408 41
 
0.4%
37.6690309166 39
 
0.4%
37.6685436 37
 
0.4%
Other values (7410) 9322
93.2%
(Missing) 76
 
0.8%
ValueCountFrequency (%)
36.9350292 1
< 0.1%
36.9418894 1
< 0.1%
36.9494147 1
< 0.1%
36.9565773 1
< 0.1%
36.9572990341 1
< 0.1%
36.9589137 1
< 0.1%
36.9592702 1
< 0.1%
36.959508969 1
< 0.1%
36.959554 1
< 0.1%
36.9597362144 1
< 0.1%
ValueCountFrequency (%)
38.0999354099 1
< 0.1%
38.0910048 1
< 0.1%
38.0909435 1
< 0.1%
38.089984794 1
< 0.1%
38.0898087 1
< 0.1%
38.0763315 1
< 0.1%
38.0677045003 1
< 0.1%
38.0673819661 1
< 0.1%
38.0654485 2
< 0.1%
38.0653995 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct7418
Distinct (%)74.7%
Missing76
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean126.98243
Minimum126.53735
Maximum127.73809
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-23T01:44:19.521871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.53735
5-th percentile126.73205
Q1126.79033
median127.00083
Q3127.11209
95-th percentile127.29867
Maximum127.73809
Range1.200739
Interquartile range (IQR)0.32176308

Descriptive statistics

Standard deviation0.19593611
Coefficient of variation (CV)0.0015430176
Kurtosis0.10172041
Mean126.98243
Median Absolute Deviation (MAD)0.1514412
Skewness0.496001
Sum1260173.6
Variance0.038390961
MonotonicityNot monotonic
2024-03-23T01:44:20.020138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.2237611843 87
 
0.9%
126.7620745903 84
 
0.8%
127.1120948679 81
 
0.8%
126.7516242854 79
 
0.8%
126.7418539 54
 
0.5%
127.0979238519 50
 
0.5%
127.000033255 50
 
0.5%
126.7753741701 41
 
0.4%
126.7456041294 39
 
0.4%
126.7442442 37
 
0.4%
Other values (7408) 9322
93.2%
(Missing) 76
 
0.8%
ValueCountFrequency (%)
126.5373525818 1
< 0.1%
126.5373874904 1
< 0.1%
126.5430528165 1
< 0.1%
126.5548362839 1
< 0.1%
126.5572444891 1
< 0.1%
126.5595216374 1
< 0.1%
126.5604013 1
< 0.1%
126.5634256955 1
< 0.1%
126.5687539636 1
< 0.1%
126.5695802096 1
< 0.1%
ValueCountFrequency (%)
127.7380916069 1
< 0.1%
127.7278774409 1
< 0.1%
127.7227544608 1
< 0.1%
127.7162503137 1
< 0.1%
127.7131269 1
< 0.1%
127.7056804 1
< 0.1%
127.6952428872 1
< 0.1%
127.674265596 1
< 0.1%
127.6729802351 1
< 0.1%
127.6509996 2
< 0.1%

Interactions

2024-03-23T01:44:05.551608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:44:04.985724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:44:05.888493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:44:05.256640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T01:44:20.323230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명WGS84위도WGS84경도
시군명1.0000.2130.9600.936
영업상태명0.2131.0000.1010.139
WGS84위도0.9600.1011.0000.710
WGS84경도0.9360.1390.7101.000
2024-03-23T01:44:20.597672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명
시군명1.0000.181
영업상태명0.1811.000
2024-03-23T01:44:20.817936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
WGS84위도WGS84경도시군명영업상태명
WGS84위도1.000-0.2420.7690.078
WGS84경도-0.2421.0000.6900.106
시군명0.7690.6901.0000.181
영업상태명0.0780.1060.1811.000

Missing values

2024-03-23T01:44:06.296772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T01:44:06.924905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T01:44:07.429565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
11025수원시엄마손칼국수20060127폐업20121128<NA><NA><NA>기타 휴게음식점경기도 수원시 장안구 정조로945번길 21 (영화동)경기도 수원시 장안구 영화동 342-1<NA>37.291254127.010875
11916수원시허브로틴 스마일2021-07-12폐업2023-03-28<NA><NA><NA>기타 휴게음식점경기도 수원시 영통구 매탄로 142, 양성프라자 1층 101호 (매탄동)경기도 수원시 영통구 매탄동 1233-2 양성프라자<NA>37.263299127.044213
16314용인시곰카페(G.O.M.cafe)20210914영업<NA><NA><NA><NA>기타 휴게음식점경기도 용인시 처인구 포곡읍 에버랜드로 74, 1층 일부호경기도 용인시 처인구 포곡읍 전대리 102-2<NA>37.281227127.225023
739고양시복준 타코야끼2023-02-06폐업2023-09-08<NA><NA><NA>기타 휴게음식점경기도 고양시 일산동구 경의로 309, 백마상가 지하1층 B03(일부)호 (마두동)경기도 고양시 일산동구 마두동 745 백마상가<NA>37.657448126.793507
18912파주시보광훼미리마트20080430폐업20101209<NA><NA><NA>기타 휴게음식점경기도 파주시 문산읍 봉미로 32 (선유리)경기도 파주시 문산읍 선유리 904번지<NA>37.862799126.785227
18887파주시미니스톱 파주장현점20160725폐업20190117<NA><NA><NA>기타 휴게음식점경기도 파주시 적성면 율곡로 2615 (1층)경기도 파주시 적성면 장현리 510-6번지 1층<NA>37.980457126.964706
21439화성시텐퍼센트스페셜티 커피 동탄나루마을점20220126영업<NA><NA><NA><NA>기타 휴게음식점경기도 화성시 동탄솔빛로 65, 정림프라자 1층 106호 (반송동)경기도 화성시 반송동 219-3 정림프라자 1층 106호<NA>37.194589127.074204
19920평택시평택항국제여객터미널휴게실20011119폐업20090422<NA><NA><NA>기타 휴게음식점경기도 평택시 포승읍 평택항만길 86 (만호리)경기도 평택시 포승읍 만호리 570번지<NA>36.95927126.847617
14647안양시미품당2023-07-31폐업2023-08-10<NA><NA><NA>기타 휴게음식점경기도 안양시 동안구 시민대로 180, G.SQURE, 롯데백화점 평촌점 지하1층 (호계동)경기도 안양시 동안구 호계동 1039 G.SQURE, 롯데백화점 평촌점<NA>37.389951126.950384
6452부천시더프리미엄2023-03-17폐업2023-03-30<NA><NA><NA>기타 휴게음식점경기도 부천시 원미구 길주로 180, 현대백화점 중동점 지하1층일부호 (중동)경기도 부천시 원미구 중동 1164 현대백화점 중동점 지하1층일부호<NA>37.504317126.762075
시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
14503안양시씨유 금정역 SKV1점20201124영업<NA><NA><NA><NA>기타 휴게음식점경기도 안양시 동안구 엘에스로 142, 지상 1층 C103, C104호 (호계동)경기도 안양시 동안구 호계동 555-37<NA>37.374468126.94712
17292용인시코리아골프장15홀휴게소20000714폐업20010810<NA><NA><NA>기타 휴게음식점<NA>경기도 용인시 처인구 이동읍 서리 산 238-1번지<NA>37.222798127.160714
7990부천시세븐일레븐(부천소사점)20030121폐업20030806<NA><NA><NA>기타 휴게음식점경기도 부천시 경인로 214 (심곡본동)경기도 부천시 심곡본동 667-16번지<NA>37.482986126.779109
13755안산시카페브래그(cafe BRAGG)20191230폐업20200612<NA><NA><NA>기타 휴게음식점경기도 안산시 단원구 석수로 138, 상가2동 102호 (선부동, 안산 메트로타운 푸르지오힐스테이트)경기도 안산시 단원구 선부동 1177번지 안산 메트로타운 푸르지오힐스테이트<NA>37.348951126.805452
17392용인시이모네떡볶이20160715폐업20170926<NA><NA><NA>기타 휴게음식점경기도 용인시 수지구 대지로 49 (죽전동, 죽전퍼스트하임상가동 102-1호)경기도 용인시 수지구 죽전동 488번지 죽전퍼스트하임상가동 102-1호<NA>37.329623127.114326
21518화성시소금20211102영업<NA><NA><NA><NA>기타 휴게음식점경기도 화성시 동탄감배산로 143, 동탄역 유림노르웨이숲 203동 1층 105호 (오산동)경기도 화성시 오산동 978<NA>37.198642127.088571
11592수원시마포집 손칼국수20071206폐업20090102<NA><NA><NA>기타 휴게음식점경기도 수원시 권선구 효탑로 51경기도 수원시 권선구 탑동 430-2<NA>37.268069126.972802
6607부천시부부(BOOBOO)2014-12-01폐업2016-02-11<NA><NA><NA>기타 휴게음식점경기도 부천시 원미구 소향로 127 (중동, 프라움시티 101호)경기도 부천시 원미구 중동 1161-2 프라움시티 101호<NA>37.50246126.761991
11544수원시쌍떼20040528폐업20050725<NA><NA><NA>기타 휴게음식점경기도 수원시 팔달구 행궁로 98경기도 수원시 팔달구 교동 103-1<NA>37.273786127.015545
6821부천시꾼떡2010-12-03폐업2015-07-20<NA><NA><NA>기타 휴게음식점경기도 부천시 소사구 소사동로72번길 22, 1층 102호 (소사본동, 주공뜨란채아파트상가A동)경기도 부천시 소사구 소사본동 411-1 ,주공뜨란채아파트상가A동 1층 102호<NA>37.471984126.803365

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태명폐업일자위생업태명소재지도로명주소소재지지번주소WGS84위도WGS84경도# duplicates
0양주시(주) 롯데쇼핑20140623폐업20200601기타 휴게음식점경기도 양주시 평화로 1547, 지하1층 (회정동)경기도 양주시 회정동 351-2번지 외2필지 지하1층37.827551127.0525942