Overview

Dataset statistics

Number of variables14
Number of observations5472
Missing cells15518
Missing cells (%)20.3%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory625.3 KiB
Average record size in memory117.0 B

Variable types

Categorical4
Text3
DateTime2
Unsupported2
Numeric3

Dataset

Description유흥주점 영업(간이주점) 현황_인허가
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=LZ7TKBWJAWARNRBBNASQ14277276&infSeq=1

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
위생업종명 is highly overall correlated with 소재지우편번호 and 5 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
영업상태명 is highly overall correlated with 위생업종명 and 1 other fieldsHigh correlation
위생업태명 is highly overall correlated with 영업상태명 and 1 other fieldsHigh correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
폐업일자 has 4197 (76.7%) missing valuesMissing
다중이용업소여부 has 5472 (100.0%) missing valuesMissing
총시설규모(㎡) has 5472 (100.0%) missing valuesMissing
소재지도로명주소 has 174 (3.2%) missing valuesMissing
WGS84위도 has 84 (1.5%) missing valuesMissing
WGS84경도 has 84 (1.5%) missing valuesMissing
다중이용업소여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 22:03:18.797844
Analysis finished2023-12-10 22:03:21.170403
Duration2.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
성남시
555 
부천시
501 
안산시
473 
평택시
434 
수원시
417 
Other values (26)
3092 

Length

Max length4
Median length3
Mean length3.0917398
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
성남시 555
 
10.1%
부천시 501
 
9.2%
안산시 473
 
8.6%
평택시 434
 
7.9%
수원시 417
 
7.6%
안양시 329
 
6.0%
의정부시 302
 
5.5%
시흥시 231
 
4.2%
파주시 230
 
4.2%
화성시 213
 
3.9%
Other values (21) 1787
32.7%

Length

2023-12-11T07:03:21.223150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성남시 555
 
10.1%
부천시 501
 
9.2%
안산시 473
 
8.6%
평택시 434
 
7.9%
수원시 417
 
7.6%
안양시 329
 
6.0%
의정부시 302
 
5.5%
시흥시 231
 
4.2%
파주시 230
 
4.2%
화성시 213
 
3.9%
Other values (21) 1787
32.7%
Distinct4347
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
2023-12-11T07:03:21.465025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length5.2191155
Min length1

Characters and Unicode

Total characters28559
Distinct characters824
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3697 ?
Unique (%)67.6%

Sample

1st row개미와베짱이
2nd row쿨 술마시는 노래타운
3rd row오아시스노래주점
4th row장녹수
5th row21세기 술노래룸
ValueCountFrequency (%)
단란주점 32
 
0.5%
라이브 27
 
0.5%
노래빠 26
 
0.4%
준코뮤직타운 24
 
0.4%
7080 24
 
0.4%
노래광장 20
 
0.3%
황진이 20
 
0.3%
노래장 19
 
0.3%
노래짱 16
 
0.3%
노래빵 15
 
0.3%
Other values (4325) 5766
96.3%
2023-12-11T07:03:22.090642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1925
 
6.7%
1924
 
6.7%
815
 
2.9%
739
 
2.6%
731
 
2.6%
675
 
2.4%
0 633
 
2.2%
573
 
2.0%
528
 
1.8%
520
 
1.8%
Other values (814) 19496
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25062
87.8%
Decimal Number 1505
 
5.3%
Uppercase Letter 729
 
2.6%
Space Separator 520
 
1.8%
Lowercase Letter 254
 
0.9%
Open Punctuation 215
 
0.8%
Close Punctuation 214
 
0.7%
Other Punctuation 45
 
0.2%
Letter Number 9
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1925
 
7.7%
1924
 
7.7%
815
 
3.3%
739
 
2.9%
731
 
2.9%
675
 
2.7%
573
 
2.3%
528
 
2.1%
471
 
1.9%
423
 
1.7%
Other values (738) 16258
64.9%
Uppercase Letter
ValueCountFrequency (%)
O 60
 
8.2%
E 52
 
7.1%
A 47
 
6.4%
S 47
 
6.4%
N 42
 
5.8%
I 41
 
5.6%
B 39
 
5.3%
K 39
 
5.3%
T 37
 
5.1%
M 34
 
4.7%
Other values (16) 291
39.9%
Lowercase Letter
ValueCountFrequency (%)
e 28
 
11.0%
a 24
 
9.4%
l 19
 
7.5%
r 19
 
7.5%
o 18
 
7.1%
i 18
 
7.1%
s 17
 
6.7%
u 16
 
6.3%
y 13
 
5.1%
g 10
 
3.9%
Other values (14) 72
28.3%
Decimal Number
ValueCountFrequency (%)
0 633
42.1%
7 312
20.7%
8 307
20.4%
2 82
 
5.4%
1 60
 
4.0%
9 48
 
3.2%
3 26
 
1.7%
5 16
 
1.1%
4 13
 
0.9%
6 8
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 32
71.1%
& 5
 
11.1%
, 3
 
6.7%
% 2
 
4.4%
/ 1
 
2.2%
' 1
 
2.2%
· 1
 
2.2%
Open Punctuation
ValueCountFrequency (%)
( 213
99.1%
[ 2
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 212
99.1%
] 2
 
0.9%
Math Symbol
ValueCountFrequency (%)
+ 3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
520
100.0%
Letter Number
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25057
87.7%
Common 2505
 
8.8%
Latin 992
 
3.5%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1925
 
7.7%
1924
 
7.7%
815
 
3.3%
739
 
2.9%
731
 
2.9%
675
 
2.7%
573
 
2.3%
528
 
2.1%
471
 
1.9%
423
 
1.7%
Other values (733) 16253
64.9%
Latin
ValueCountFrequency (%)
O 60
 
6.0%
E 52
 
5.2%
A 47
 
4.7%
S 47
 
4.7%
N 42
 
4.2%
I 41
 
4.1%
B 39
 
3.9%
K 39
 
3.9%
T 37
 
3.7%
M 34
 
3.4%
Other values (41) 554
55.8%
Common
ValueCountFrequency (%)
0 633
25.3%
520
20.8%
7 312
12.5%
8 307
12.3%
( 213
 
8.5%
) 212
 
8.5%
2 82
 
3.3%
1 60
 
2.4%
9 48
 
1.9%
. 32
 
1.3%
Other values (15) 86
 
3.4%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25055
87.7%
ASCII 3486
 
12.2%
Number Forms 9
 
< 0.1%
CJK 4
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Math Operators 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1925
 
7.7%
1924
 
7.7%
815
 
3.3%
739
 
2.9%
731
 
2.9%
675
 
2.7%
573
 
2.3%
528
 
2.1%
471
 
1.9%
423
 
1.7%
Other values (731) 16251
64.9%
ASCII
ValueCountFrequency (%)
0 633
18.2%
520
14.9%
7 312
 
9.0%
8 307
 
8.8%
( 213
 
6.1%
) 212
 
6.1%
2 82
 
2.4%
1 60
 
1.7%
O 60
 
1.7%
E 52
 
1.5%
Other values (63) 1035
29.7%
Number Forms
ValueCountFrequency (%)
9
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct3714
Distinct (%)67.9%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
Minimum1967-09-15 00:00:00
Maximum2023-12-04 00:00:00
2023-12-11T07:03:22.211131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:22.347816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
영업
3434 
폐업 등
804 
운영중
763 
폐업
471 

Length

Max length4
Median length2
Mean length2.4332968
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 3434
62.8%
폐업 등 804
 
14.7%
운영중 763
 
13.9%
폐업 471
 
8.6%

Length

2023-12-11T07:03:22.488746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:03:22.612037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 3434
54.7%
폐업 1275
 
20.3%
804
 
12.8%
운영중 763
 
12.2%

폐업일자
Date

MISSING 

Distinct948
Distinct (%)74.4%
Missing4197
Missing (%)76.7%
Memory size42.9 KiB
Minimum1987-06-30 00:00:00
Maximum2023-12-04 00:00:00
2023-12-11T07:03:22.708848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:22.828429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

다중이용업소여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5472
Missing (%)100.0%
Memory size48.2 KiB

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5472
Missing (%)100.0%
Memory size48.2 KiB

위생업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
<NA>
3955 
유흥주점영업
1517 

Length

Max length6
Median length4
Mean length4.5544591
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3955
72.3%
유흥주점영업 1517
 
27.7%

Length

2023-12-11T07:03:22.955385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:03:23.086078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 3955
72.3%
유흥주점영업 1517
 
27.7%

위생업태명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
간이주점
1848 
룸살롱
1832 
단란주점
1045 
기타
293 
노래클럽
194 
Other values (8)
260 

Length

Max length12
Median length4
Mean length3.5727339
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row스텐드바
2nd row룸살롱
3rd row룸살롱
4th row룸살롱
5th row룸살롱

Common Values

ValueCountFrequency (%)
간이주점 1848
33.8%
룸살롱 1832
33.5%
단란주점 1045
19.1%
기타 293
 
5.4%
노래클럽 194
 
3.5%
카바레 110
 
2.0%
<NA> 50
 
0.9%
스텐드바 49
 
0.9%
비어(바)살롱 26
 
0.5%
고고(디스코)클럽 20
 
0.4%
Other values (3) 5
 
0.1%

Length

2023-12-11T07:03:23.238696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
간이주점 1848
33.8%
룸살롱 1832
33.5%
단란주점 1045
19.1%
기타 293
 
5.4%
노래클럽 194
 
3.5%
카바레 110
 
2.0%
na 50
 
0.9%
스텐드바 49
 
0.9%
비어(바)살롱 26
 
0.5%
고고(디스코)클럽 20
 
0.4%
Other values (3) 5
 
0.1%
Distinct4773
Distinct (%)90.1%
Missing174
Missing (%)3.2%
Memory size42.9 KiB
2023-12-11T07:03:23.544396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length51
Mean length30.251793
Min length14

Characters and Unicode

Total characters160274
Distinct characters462
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4349 ?
Unique (%)82.1%

Sample

1st row경기도 가평군 가평읍 가화로 129, 지하1층
2nd row경기도 가평군 청평면 구청평로 92, 2층
3rd row경기도 가평군 조종면 조종희망로5번길 7, 1층
4th row경기도 가평군 가평읍 굴다리길 2, 가동 2층
5th row경기도 가평군 가평읍 가화로 113-1
ValueCountFrequency (%)
경기도 5298
 
15.9%
성남시 552
 
1.7%
지하1층 542
 
1.6%
부천시 479
 
1.4%
안산시 468
 
1.4%
2층 465
 
1.4%
평택시 430
 
1.3%
수원시 412
 
1.2%
단원구 317
 
1.0%
안양시 315
 
0.9%
Other values (4178) 24015
72.1%
2023-12-11T07:03:24.046645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28076
 
17.5%
1 6552
 
4.1%
5555
 
3.5%
5453
 
3.4%
5379
 
3.4%
5358
 
3.3%
4829
 
3.0%
4804
 
3.0%
2 4693
 
2.9%
) 4378
 
2.7%
Other values (452) 85197
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89512
55.8%
Space Separator 28076
 
17.5%
Decimal Number 28008
 
17.5%
Close Punctuation 4378
 
2.7%
Open Punctuation 4378
 
2.7%
Other Punctuation 4209
 
2.6%
Dash Punctuation 1417
 
0.9%
Uppercase Letter 254
 
0.2%
Math Symbol 30
 
< 0.1%
Letter Number 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5555
 
6.2%
5453
 
6.1%
5379
 
6.0%
5358
 
6.0%
4829
 
5.4%
4804
 
5.4%
2818
 
3.1%
2617
 
2.9%
2355
 
2.6%
2198
 
2.5%
Other values (406) 48146
53.8%
Uppercase Letter
ValueCountFrequency (%)
B 167
65.7%
A 20
 
7.9%
C 9
 
3.5%
M 7
 
2.8%
S 6
 
2.4%
I 5
 
2.0%
H 5
 
2.0%
E 5
 
2.0%
G 4
 
1.6%
L 4
 
1.6%
Other values (9) 22
 
8.7%
Decimal Number
ValueCountFrequency (%)
1 6552
23.4%
2 4693
16.8%
3 3257
11.6%
0 2842
10.1%
5 2227
 
8.0%
4 2185
 
7.8%
6 1747
 
6.2%
7 1661
 
5.9%
9 1527
 
5.5%
8 1317
 
4.7%
Letter Number
ValueCountFrequency (%)
5
55.6%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 4179
99.3%
. 28
 
0.7%
' 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 25
83.3%
3
 
10.0%
> 2
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
l 2
66.7%
a 1
33.3%
Space Separator
ValueCountFrequency (%)
28076
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4378
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4378
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1417
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 89512
55.8%
Common 70496
44.0%
Latin 266
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5555
 
6.2%
5453
 
6.1%
5379
 
6.0%
5358
 
6.0%
4829
 
5.4%
4804
 
5.4%
2818
 
3.1%
2617
 
2.9%
2355
 
2.6%
2198
 
2.5%
Other values (406) 48146
53.8%
Latin
ValueCountFrequency (%)
B 167
62.8%
A 20
 
7.5%
C 9
 
3.4%
M 7
 
2.6%
S 6
 
2.3%
5
 
1.9%
I 5
 
1.9%
H 5
 
1.9%
E 5
 
1.9%
G 4
 
1.5%
Other values (16) 33
 
12.4%
Common
ValueCountFrequency (%)
28076
39.8%
1 6552
 
9.3%
2 4693
 
6.7%
) 4378
 
6.2%
( 4378
 
6.2%
, 4179
 
5.9%
3 3257
 
4.6%
0 2842
 
4.0%
5 2227
 
3.2%
4 2185
 
3.1%
Other values (10) 7729
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 89512
55.8%
ASCII 70750
44.1%
Number Forms 9
 
< 0.1%
Math Operators 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28076
39.7%
1 6552
 
9.3%
2 4693
 
6.6%
) 4378
 
6.2%
( 4378
 
6.2%
, 4179
 
5.9%
3 3257
 
4.6%
0 2842
 
4.0%
5 2227
 
3.1%
4 2185
 
3.1%
Other values (30) 7983
 
11.3%
Hangul
ValueCountFrequency (%)
5555
 
6.2%
5453
 
6.1%
5379
 
6.0%
5358
 
6.0%
4829
 
5.4%
4804
 
5.4%
2818
 
3.1%
2617
 
2.9%
2355
 
2.6%
2198
 
2.5%
Other values (406) 48146
53.8%
Number Forms
ValueCountFrequency (%)
5
55.6%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Math Operators
ValueCountFrequency (%)
3
100.0%
Distinct5088
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size42.9 KiB
2023-12-11T07:03:24.375402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length47
Mean length25.440424
Min length15

Characters and Unicode

Total characters139210
Distinct characters413
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4809 ?
Unique (%)87.9%

Sample

1st row경기도 가평군 가평읍 읍내리 449-1 지하1층
2nd row경기도 가평군 청평면 청평리 619-17 2층
3rd row경기도 가평군 조종면 현리 264-19 1층
4th row경기도 가평군 가평읍 대곡리 232-1 외 1필지 , 2층 가동
5th row경기도 가평군 가평읍 읍내리 468-21
ValueCountFrequency (%)
경기도 5472
 
18.2%
성남시 555
 
1.9%
지하1층 525
 
1.8%
부천시 501
 
1.7%
안산시 473
 
1.6%
평택시 434
 
1.4%
수원시 417
 
1.4%
지층 363
 
1.2%
2층 331
 
1.1%
안양시 329
 
1.1%
Other values (5755) 20597
68.7%
2023-12-11T07:03:25.050149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27930
20.1%
1 7249
 
5.2%
5560
 
4.0%
5548
 
4.0%
5545
 
4.0%
5498
 
3.9%
5134
 
3.7%
- 4821
 
3.5%
2 3955
 
2.8%
3365
 
2.4%
Other values (403) 64605
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74319
53.4%
Decimal Number 30545
21.9%
Space Separator 27930
 
20.1%
Dash Punctuation 4821
 
3.5%
Other Punctuation 558
 
0.4%
Open Punctuation 408
 
0.3%
Close Punctuation 407
 
0.3%
Uppercase Letter 190
 
0.1%
Math Symbol 24
 
< 0.1%
Letter Number 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5560
 
7.5%
5548
 
7.5%
5545
 
7.5%
5498
 
7.4%
5134
 
6.9%
3365
 
4.5%
2276
 
3.1%
2067
 
2.8%
1582
 
2.1%
1477
 
2.0%
Other values (356) 36267
48.8%
Uppercase Letter
ValueCountFrequency (%)
B 124
65.3%
A 16
 
8.4%
C 11
 
5.8%
S 6
 
3.2%
H 5
 
2.6%
I 5
 
2.6%
G 4
 
2.1%
E 3
 
1.6%
M 3
 
1.6%
J 2
 
1.1%
Other values (9) 11
 
5.8%
Decimal Number
ValueCountFrequency (%)
1 7249
23.7%
2 3955
12.9%
3 3257
10.7%
0 3060
10.0%
4 2867
 
9.4%
5 2410
 
7.9%
7 2249
 
7.4%
8 1991
 
6.5%
6 1987
 
6.5%
9 1520
 
5.0%
Letter Number
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 520
93.2%
. 36
 
6.5%
' 2
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 19
79.2%
3
 
12.5%
> 2
 
8.3%
Open Punctuation
ValueCountFrequency (%)
( 407
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 406
99.8%
] 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
b 2
66.7%
a 1
33.3%
Space Separator
ValueCountFrequency (%)
27930
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4821
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74319
53.4%
Common 64693
46.5%
Latin 198
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5560
 
7.5%
5548
 
7.5%
5545
 
7.5%
5498
 
7.4%
5134
 
6.9%
3365
 
4.5%
2276
 
3.1%
2067
 
2.8%
1582
 
2.1%
1477
 
2.0%
Other values (356) 36267
48.8%
Latin
ValueCountFrequency (%)
B 124
62.6%
A 16
 
8.1%
C 11
 
5.6%
S 6
 
3.0%
H 5
 
2.5%
I 5
 
2.5%
G 4
 
2.0%
E 3
 
1.5%
M 3
 
1.5%
J 2
 
1.0%
Other values (15) 19
 
9.6%
Common
ValueCountFrequency (%)
27930
43.2%
1 7249
 
11.2%
- 4821
 
7.5%
2 3955
 
6.1%
3 3257
 
5.0%
0 3060
 
4.7%
4 2867
 
4.4%
5 2410
 
3.7%
7 2249
 
3.5%
8 1991
 
3.1%
Other values (12) 4904
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74319
53.4%
ASCII 64883
46.6%
Number Forms 5
 
< 0.1%
Math Operators 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27930
43.0%
1 7249
 
11.2%
- 4821
 
7.4%
2 3955
 
6.1%
3 3257
 
5.0%
0 3060
 
4.7%
4 2867
 
4.4%
5 2410
 
3.7%
7 2249
 
3.5%
8 1991
 
3.1%
Other values (32) 5094
 
7.9%
Hangul
ValueCountFrequency (%)
5560
 
7.5%
5548
 
7.5%
5545
 
7.5%
5498
 
7.4%
5134
 
6.9%
3365
 
4.5%
2276
 
3.1%
2067
 
2.8%
1582
 
2.1%
1477
 
2.0%
Other values (356) 36267
48.8%
Math Operators
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct687
Distinct (%)12.6%
Missing35
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean14387.396
Minimum10018
Maximum18623
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.2 KiB
2023-12-11T07:03:25.182645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10018
5-th percentile10824
Q112150
median14546
Q316455
95-th percentile18139
Maximum18623
Range8605
Interquartile range (IQR)4305

Descriptive statistics

Standard deviation2407.3452
Coefficient of variation (CV)0.1673232
Kurtosis-1.0871099
Mean14387.396
Median Absolute Deviation (MAD)1944
Skewness0.039260297
Sum78224273
Variance5795311
MonotonicityNot monotonic
2023-12-11T07:03:25.299950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13246 147
 
2.7%
15361 123
 
2.2%
14548 90
 
1.6%
14580 74
 
1.4%
10071 68
 
1.2%
17774 66
 
1.2%
11693 65
 
1.2%
11927 63
 
1.2%
14066 62
 
1.1%
16489 61
 
1.1%
Other values (677) 4618
84.4%
ValueCountFrequency (%)
10018 13
 
0.2%
10019 8
 
0.1%
10024 2
 
< 0.1%
10025 4
 
0.1%
10040 1
 
< 0.1%
10059 2
 
< 0.1%
10071 68
1.2%
10073 2
 
< 0.1%
10098 6
 
0.1%
10129 3
 
0.1%
ValueCountFrequency (%)
18623 4
 
0.1%
18611 5
 
0.1%
18606 22
0.4%
18600 2
 
< 0.1%
18594 1
 
< 0.1%
18593 26
0.5%
18591 1
 
< 0.1%
18577 1
 
< 0.1%
18572 1
 
< 0.1%
18567 5
 
0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3674
Distinct (%)68.2%
Missing84
Missing (%)1.5%
Infinite0
Infinite (%)0.0%
Mean37.43745
Minimum36.92055
Maximum38.186374
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.2 KiB
2023-12-11T07:03:25.434956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.92055
5-th percentile37.042555
Q137.279885
median37.408397
Q337.598441
95-th percentile37.89626
Maximum38.186374
Range1.2658245
Interquartile range (IQR)0.31855659

Descriptive statistics

Standard deviation0.247294
Coefficient of variation (CV)0.0066055247
Kurtosis-0.27516582
Mean37.43745
Median Absolute Deviation (MAD)0.13687008
Skewness0.38253777
Sum201712.98
Variance0.061154323
MonotonicityNot monotonic
2023-12-11T07:03:25.558428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3182972549 15
 
0.3%
37.0458281259 14
 
0.3%
37.7148553223 14
 
0.3%
37.2741372443 11
 
0.2%
37.5614176573 11
 
0.2%
37.0474611388 11
 
0.2%
37.3937521667 10
 
0.2%
37.2005247567 10
 
0.2%
37.1484696706 10
 
0.2%
37.317162652 9
 
0.2%
Other values (3664) 5273
96.4%
(Missing) 84
 
1.5%
ValueCountFrequency (%)
36.9205495128 1
 
< 0.1%
36.9591698642 1
 
< 0.1%
36.9597125094 2
< 0.1%
36.9597668961 1
 
< 0.1%
36.9601292292 2
< 0.1%
36.9602259744 1
 
< 0.1%
36.9603626517 1
 
< 0.1%
36.9604993278 2
< 0.1%
36.9605436675 4
0.1%
36.9605461615 2
< 0.1%
ValueCountFrequency (%)
38.1863740477 1
< 0.1%
38.1861681768 1
< 0.1%
38.1855437589 1
< 0.1%
38.1854088791 1
< 0.1%
38.1853366651 1
< 0.1%
38.178049339 1
< 0.1%
38.1588879213 1
< 0.1%
38.1588157231 1
< 0.1%
38.1334129524 1
< 0.1%
38.1330661468 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3674
Distinct (%)68.2%
Missing84
Missing (%)1.5%
Infinite0
Infinite (%)0.0%
Mean126.99931
Minimum126.55416
Maximum127.6506
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.2 KiB
2023-12-11T07:03:25.701017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.55416
5-th percentile126.74788
Q1126.83762
median127.03025
Q3127.11439
95-th percentile127.3366
Maximum127.6506
Range1.0964451
Interquartile range (IQR)0.27677021

Descriptive statistics

Standard deviation0.19020033
Coefficient of variation (CV)0.0014976485
Kurtosis0.64408782
Mean126.99931
Median Absolute Deviation (MAD)0.11583605
Skewness0.56355244
Sum684272.27
Variance0.036176164
MonotonicityNot monotonic
2023-12-11T07:03:25.846888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.8404047569 15
 
0.3%
127.0450553425 14
 
0.3%
126.7614890719 14
 
0.3%
126.9514301863 11
 
0.2%
127.1912058889 11
 
0.2%
127.0454279562 11
 
0.2%
126.9615971924 10
 
0.2%
126.8280665927 10
 
0.2%
127.0757913493 10
 
0.2%
126.8389926577 9
 
0.2%
Other values (3664) 5273
96.4%
(Missing) 84
 
1.5%
ValueCountFrequency (%)
126.5541581093 1
< 0.1%
126.5567860205 1
< 0.1%
126.55991476 1
< 0.1%
126.5606611859 2
< 0.1%
126.5608313578 1
< 0.1%
126.5824929244 1
< 0.1%
126.5864818952 1
< 0.1%
126.5976057487 1
< 0.1%
126.5978179671 1
< 0.1%
126.5978810854 2
< 0.1%
ValueCountFrequency (%)
127.6506032269 1
< 0.1%
127.64127488 1
< 0.1%
127.6398542064 1
< 0.1%
127.6397188594 1
< 0.1%
127.6390083568 1
< 0.1%
127.6384547077 1
< 0.1%
127.6372797797 1
< 0.1%
127.63706457 1
< 0.1%
127.636962632 1
< 0.1%
127.6362086004 1
< 0.1%

Interactions

2023-12-11T07:03:20.563901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.113212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.334649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.631491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.182754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.408168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.708017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.261729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:20.483757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:03:25.945129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명위생업태명소재지우편번호WGS84위도WGS84경도
시군명1.0000.4630.5060.9930.9660.962
영업상태명0.4631.0000.8110.2730.2320.210
위생업태명0.5060.8111.0000.3170.2390.227
소재지우편번호0.9930.2730.3171.0000.9420.903
WGS84위도0.9660.2320.2390.9421.0000.767
WGS84경도0.9620.2100.2270.9030.7671.000
2023-12-11T07:03:26.028356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위생업종명시군명영업상태명위생업태명
위생업종명1.0001.0001.0001.000
시군명1.0001.0000.2560.190
영업상태명1.0000.2561.0000.501
위생업태명1.0000.1900.5011.000
2023-12-11T07:03:26.119764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호WGS84위도WGS84경도시군명영업상태명위생업종명위생업태명
소재지우편번호1.000-0.924-0.0450.9420.1661.0000.139
WGS84위도-0.9241.000-0.0560.7940.1401.0000.102
WGS84경도-0.045-0.0561.0000.7780.1271.0000.097
시군명0.9420.7940.7781.0000.2561.0000.190
영업상태명0.1660.1400.1270.2561.0001.0000.501
위생업종명1.0001.0001.0001.0001.0001.0001.000
위생업태명0.1390.1020.0970.1900.5011.0001.000

Missing values

2023-12-11T07:03:20.813399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:03:20.961695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:03:21.085027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
0가평군개미와베짱이1982-05-15영업<NA><NA><NA><NA>스텐드바경기도 가평군 가평읍 가화로 129, 지하1층경기도 가평군 가평읍 읍내리 449-1 지하1층1241337.83114127.512818
1가평군쿨 술마시는 노래타운2008-05-16영업<NA><NA><NA><NA>룸살롱경기도 가평군 청평면 구청평로 92, 2층경기도 가평군 청평면 청평리 619-17 2층1245337.735327127.415131
2가평군오아시스노래주점1988-04-15영업<NA><NA><NA><NA>룸살롱경기도 가평군 조종면 조종희망로5번길 7, 1층경기도 가평군 조종면 현리 264-19 1층1243737.818903127.349079
3가평군장녹수20051216영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 굴다리길 2, 가동 2층경기도 가평군 가평읍 대곡리 232-1 외 1필지 , 2층 가동1242037.826219127.514965
4가평군21세기 술노래룸2006-03-27영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 가화로 113-1경기도 가평군 가평읍 읍내리 468-211241837.829824127.513341
5가평군탑클래스(TOPclass)뮤직타운20071031영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 보납로 8-1, 2층경기도 가평군 가평읍 읍내리 495-38 2층1241837.831041127.511364
6가평군하모니노래장20130115영업<NA><NA><NA><NA>룸살롱경기도 가평군 조종면 조종새싹로4번길 13, 1층경기도 가평군 조종면 현리 264-18 1층1243737.819093127.349635
7가평군퍼스트2010-05-17영업<NA><NA><NA><NA>룸살롱경기도 가평군 조종면 조종새싹로4번길 14, 2층경기도 가평군 조종면 현리 263-19 외 2필지, 2층1243737.818958127.349783
8가평군고구려2011-05-12영업<NA><NA><NA><NA>룸살롱경기도 가평군 청평면 청평중앙로 62, 지하1층경기도 가평군 청평면 청평리 432-16 지하1층1245237.738494127.42112
9가평군비틀즈20030124영업<NA><NA><NA><NA>룸살롱경기도 가평군 청평면 청평중앙로 59, 지하1층경기도 가평군 청평면 청평리 465-17 지하1층1245237.73862127.420756
시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
5462화성시룸영상20000330폐업 등20030411<NA><NA>유흥주점영업간이주점경기도 화성시 향남읍 평3길 5-3경기도 화성시 향남읍 평리 76-23번지1859337.130839126.90955
5463화성시진성옥19720712폐업 등20140407<NA><NA>유흥주점영업간이주점<NA>경기도 화성시 향남읍 발안리 133번지18594<NA><NA>
5464화성시초원가요주점19960319폐업 등20150826<NA><NA>유흥주점영업간이주점경기도 화성시 우정읍 띨뿌리길 19경기도 화성시 우정읍 매향리 753-41번지1857237.037276126.767923
5465화성시19960822폐업 등20121205<NA><NA>유흥주점영업간이주점<NA>경기도 화성시 봉담읍 상리 27-6번지1831337.220049126.948429
5466화성시나비유흥주점20000822폐업 등20151016<NA><NA>유흥주점영업간이주점경기도 화성시 동부대로925번길 26경기도 화성시 오산동 873-3번지<NA>37.185967127.087222
5467화성시비어호프19861101폐업 등20051027<NA><NA>유흥주점영업간이주점경기도 화성시 남양읍 남양시장로 67-6경기도 화성시 남양읍 남양리 1245번지1825837.209574126.816852
5468화성시마당쇠19850819폐업 등20090424<NA><NA>유흥주점영업간이주점경기도 화성시 서신면 매화3길 6경기도 화성시 서신면 매화리 324-4번지1855537.168703126.705394
5469화성시로즈음악홀20100528폐업 등20141006<NA><NA>유흥주점영업간이주점경기도 화성시 향남읍 평3길 15경기도 화성시 향남읍 평리 80-5번지 외10필지 에쓰뻬랑스 202-a호1859337.131311126.908567
5470화성시V 노래 클럽20100528폐업 등20120216<NA><NA>유흥주점영업간이주점경기도 화성시 남양읍 역골로 9-13경기도 화성시 남양읍 남양리 2077-4번지 외 1필지 1동 205호1827137.200525126.828067
5471화성시옹진옥19721102폐업 등20151016<NA><NA>유흥주점영업간이주점경기도 화성시 병점동로 12 (병점동)경기도 화성시 병점동 381번지18411<NA><NA>

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태명폐업일자위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도# duplicates
0화성시친구찾기주점19791122폐업 등20080109유흥주점영업간이주점<NA>경기도 화성시 우정읍 조암리 270-5번지1856737.083325126.8186842