Overview

Dataset statistics

Number of variables14
Number of observations4689
Missing cells13332
Missing cells (%)20.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory535.9 KiB
Average record size in memory117.0 B

Variable types

Categorical4
Text3
DateTime2
Unsupported2
Numeric3

Dataset

Description유흥주점 영업(카바레) 현황_인허가
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=L26ACMJY5OCQ4LNZFFK114238109&infSeq=1

Alerts

위생업태명 is highly overall correlated with 영업상태명 and 1 other fieldsHigh correlation
영업상태명 is highly overall correlated with 위생업종명 and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
위생업종명 is highly overall correlated with 소재지우편번호 and 5 other fieldsHigh correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
폐업일자 has 3729 (79.5%) missing valuesMissing
다중이용업소여부 has 4689 (100.0%) missing valuesMissing
총시설규모(㎡) has 4689 (100.0%) missing valuesMissing
소재지도로명주소 has 111 (2.4%) missing valuesMissing
다중이용업소여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 22:27:52.968361
Analysis finished2023-12-10 22:27:55.883665
Duration2.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
평택시
497 
부천시
484 
안산시
445 
수원시
358 
성남시
336 
Other values (25)
2569 

Length

Max length4
Median length3
Mean length3.074003
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
평택시 497
 
10.6%
부천시 484
 
10.3%
안산시 445
 
9.5%
수원시 358
 
7.6%
성남시 336
 
7.2%
시흥시 270
 
5.8%
화성시 211
 
4.5%
안양시 187
 
4.0%
용인시 173
 
3.7%
의정부시 160
 
3.4%
Other values (20) 1568
33.4%

Length

2023-12-11T07:27:55.941750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
평택시 497
 
10.6%
부천시 484
 
10.3%
안산시 445
 
9.5%
수원시 358
 
7.6%
성남시 336
 
7.2%
시흥시 270
 
5.8%
화성시 211
 
4.5%
안양시 187
 
4.0%
용인시 173
 
3.7%
의정부시 160
 
3.4%
Other values (20) 1568
33.4%
Distinct3901
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
2023-12-11T07:27:56.295464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length5.5263382
Min length1

Characters and Unicode

Total characters25913
Distinct characters778
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3424 ?
Unique (%)73.0%

Sample

1st row동그라미단란주점
2nd row술마시는 쥬 노래장
3rd row춘향이노래장
4th row별천지노래장
5th row장녹수
ValueCountFrequency (%)
단란주점 32
 
0.6%
라이브 24
 
0.5%
준코뮤직타운 22
 
0.4%
7080 22
 
0.4%
노래광장 20
 
0.4%
노래빠 19
 
0.4%
노래짱 16
 
0.3%
노래장 15
 
0.3%
황진이 14
 
0.3%
노래주점 13
 
0.3%
Other values (3895) 4947
96.2%
2023-12-11T07:27:56.788093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1766
 
6.8%
1764
 
6.8%
768
 
3.0%
669
 
2.6%
652
 
2.5%
631
 
2.4%
0 620
 
2.4%
531
 
2.0%
513
 
2.0%
478
 
1.8%
Other values (768) 17521
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22654
87.4%
Decimal Number 1447
 
5.6%
Uppercase Letter 670
 
2.6%
Space Separator 457
 
1.8%
Lowercase Letter 242
 
0.9%
Open Punctuation 198
 
0.8%
Close Punctuation 197
 
0.8%
Other Punctuation 38
 
0.1%
Letter Number 6
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1766
 
7.8%
1764
 
7.8%
768
 
3.4%
669
 
3.0%
652
 
2.9%
631
 
2.8%
531
 
2.3%
513
 
2.3%
478
 
2.1%
382
 
1.7%
Other values (692) 14500
64.0%
Uppercase Letter
ValueCountFrequency (%)
O 56
 
8.4%
E 48
 
7.2%
S 45
 
6.7%
A 43
 
6.4%
I 40
 
6.0%
N 39
 
5.8%
K 37
 
5.5%
T 35
 
5.2%
H 30
 
4.5%
B 30
 
4.5%
Other values (16) 267
39.9%
Lowercase Letter
ValueCountFrequency (%)
e 25
 
10.3%
a 22
 
9.1%
r 19
 
7.9%
s 17
 
7.0%
i 17
 
7.0%
l 17
 
7.0%
o 17
 
7.0%
u 16
 
6.6%
y 13
 
5.4%
b 10
 
4.1%
Other values (14) 69
28.5%
Decimal Number
ValueCountFrequency (%)
0 620
42.8%
7 311
21.5%
8 297
20.5%
2 68
 
4.7%
1 51
 
3.5%
9 47
 
3.2%
3 25
 
1.7%
4 13
 
0.9%
6 9
 
0.6%
5 6
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 21
55.3%
& 5
 
13.2%
, 5
 
13.2%
% 2
 
5.3%
! 2
 
5.3%
· 1
 
2.6%
/ 1
 
2.6%
' 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 196
99.0%
[ 2
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 195
99.0%
] 2
 
1.0%
Space Separator
ValueCountFrequency (%)
457
100.0%
Letter Number
ValueCountFrequency (%)
6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22650
87.4%
Common 2341
 
9.0%
Latin 918
 
3.5%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1766
 
7.8%
1764
 
7.8%
768
 
3.4%
669
 
3.0%
652
 
2.9%
631
 
2.8%
531
 
2.3%
513
 
2.3%
478
 
2.1%
382
 
1.7%
Other values (688) 14496
64.0%
Latin
ValueCountFrequency (%)
O 56
 
6.1%
E 48
 
5.2%
S 45
 
4.9%
A 43
 
4.7%
I 40
 
4.4%
N 39
 
4.2%
K 37
 
4.0%
T 35
 
3.8%
H 30
 
3.3%
B 30
 
3.3%
Other values (41) 515
56.1%
Common
ValueCountFrequency (%)
0 620
26.5%
457
19.5%
7 311
13.3%
8 297
12.7%
( 196
 
8.4%
) 195
 
8.3%
2 68
 
2.9%
1 51
 
2.2%
9 47
 
2.0%
3 25
 
1.1%
Other values (15) 74
 
3.2%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22648
87.4%
ASCII 3252
 
12.5%
Number Forms 6
 
< 0.1%
CJK 4
 
< 0.1%
Compat Jamo 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1766
 
7.8%
1764
 
7.8%
768
 
3.4%
669
 
3.0%
652
 
2.9%
631
 
2.8%
531
 
2.3%
513
 
2.3%
478
 
2.1%
382
 
1.7%
Other values (686) 14494
64.0%
ASCII
ValueCountFrequency (%)
0 620
19.1%
457
14.1%
7 311
 
9.6%
8 297
 
9.1%
( 196
 
6.0%
) 195
 
6.0%
2 68
 
2.1%
O 56
 
1.7%
1 51
 
1.6%
E 48
 
1.5%
Other values (64) 953
29.3%
Number Forms
ValueCountFrequency (%)
6
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
· 1
100.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct3407
Distinct (%)72.7%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
Minimum1961-09-28 00:00:00
Maximum2023-12-04 00:00:00
2023-12-11T07:27:56.939653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:57.097468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
영업
3436 
폐업 등
489 
폐업
471 
운영중
 
293

Length

Max length4
Median length2
Mean length2.2710599
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 3436
73.3%
폐업 등 489
 
10.4%
폐업 471
 
10.0%
운영중 293
 
6.2%

Length

2023-12-11T07:27:57.241707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:27:57.344933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 3436
66.4%
폐업 960
 
18.5%
489
 
9.4%
운영중 293
 
5.7%

폐업일자
Date

MISSING 

Distinct742
Distinct (%)77.3%
Missing3729
Missing (%)79.5%
Memory size36.8 KiB
Minimum1990-03-09 00:00:00
Maximum2023-12-04 00:00:00
2023-12-11T07:27:57.474696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:57.626468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

다중이용업소여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4689
Missing (%)100.0%
Memory size41.3 KiB

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4689
Missing (%)100.0%
Memory size41.3 KiB

위생업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
<NA>
3942 
유흥주점영업
747 

Length

Max length6
Median length4
Mean length4.318618
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3942
84.1%
유흥주점영업 747
 
15.9%

Length

2023-12-11T07:27:57.780882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:27:57.906736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 3942
84.1%
유흥주점영업 747
 
15.9%

위생업태명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
룸살롱
1828 
단란주점
1047 
카바레
857 
간이주점
329 
기타
293 
Other values (8)
335 

Length

Max length12
Median length3
Mean length3.3510343
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row단란주점
2nd row카바레
3rd row룸살롱
4th row카바레
5th row룸살롱

Common Values

ValueCountFrequency (%)
룸살롱 1828
39.0%
단란주점 1047
22.3%
카바레 857
18.3%
간이주점 329
 
7.0%
기타 293
 
6.2%
노래클럽 193
 
4.1%
스텐드바 49
 
1.0%
<NA> 35
 
0.7%
고고(디스코)클럽 26
 
0.6%
비어(바)살롱 26
 
0.6%
Other values (3) 6
 
0.1%

Length

2023-12-11T07:27:58.037161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
룸살롱 1828
39.0%
단란주점 1047
22.3%
카바레 857
18.3%
간이주점 329
 
7.0%
기타 293
 
6.2%
노래클럽 193
 
4.1%
스텐드바 49
 
1.0%
na 35
 
0.7%
고고(디스코)클럽 26
 
0.6%
비어(바)살롱 26
 
0.6%
Other values (3) 6
 
0.1%
Distinct4336
Distinct (%)94.7%
Missing111
Missing (%)2.4%
Memory size36.8 KiB
2023-12-11T07:27:58.301880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length49
Mean length30.687637
Min length13

Characters and Unicode

Total characters140488
Distinct characters456
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4110 ?
Unique (%)89.8%

Sample

1st row경기도 가평군 조종면 조종희망로5번길 12-4, 2층
2nd row경기도 가평군 가평읍 가화로 122, 지하1층
3rd row경기도 가평군 가평읍 연인2길 2
4th row경기도 가평군 가평읍 오리나무길 15
5th row경기도 가평군 가평읍 굴다리길 2, 가동 2층
ValueCountFrequency (%)
경기도 4578
 
15.7%
지하1층 491
 
1.7%
부천시 482
 
1.7%
평택시 459
 
1.6%
2층 443
 
1.5%
안산시 441
 
1.5%
수원시 353
 
1.2%
성남시 334
 
1.1%
단원구 303
 
1.0%
시흥시 268
 
0.9%
Other values (3990) 21037
72.1%
2023-12-11T07:27:58.762551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24707
 
17.6%
1 5608
 
4.0%
4864
 
3.5%
4713
 
3.4%
4652
 
3.3%
4624
 
3.3%
4206
 
3.0%
2 4189
 
3.0%
4127
 
2.9%
) 3880
 
2.8%
Other values (446) 74918
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 78182
55.7%
Space Separator 24707
 
17.6%
Decimal Number 24625
 
17.5%
Close Punctuation 3880
 
2.8%
Open Punctuation 3879
 
2.8%
Other Punctuation 3801
 
2.7%
Dash Punctuation 1131
 
0.8%
Uppercase Letter 243
 
0.2%
Math Symbol 28
 
< 0.1%
Letter Number 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4864
 
6.2%
4713
 
6.0%
4652
 
6.0%
4624
 
5.9%
4206
 
5.4%
4127
 
5.3%
2458
 
3.1%
2308
 
3.0%
1897
 
2.4%
1872
 
2.4%
Other values (401) 42461
54.3%
Uppercase Letter
ValueCountFrequency (%)
B 157
64.6%
A 19
 
7.8%
C 9
 
3.7%
M 7
 
2.9%
S 6
 
2.5%
H 5
 
2.1%
E 5
 
2.1%
I 5
 
2.1%
K 4
 
1.6%
L 4
 
1.6%
Other values (9) 22
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 5608
22.8%
2 4189
17.0%
3 2888
11.7%
0 2625
10.7%
4 1972
 
8.0%
5 1906
 
7.7%
6 1517
 
6.2%
7 1441
 
5.9%
9 1370
 
5.6%
8 1109
 
4.5%
Letter Number
ValueCountFrequency (%)
5
55.6%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 3776
99.3%
. 24
 
0.6%
' 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 27
96.4%
> 1
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
l 2
66.7%
a 1
33.3%
Space Separator
ValueCountFrequency (%)
24707
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3880
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3879
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1131
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 78182
55.7%
Common 62051
44.2%
Latin 255
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4864
 
6.2%
4713
 
6.0%
4652
 
6.0%
4624
 
5.9%
4206
 
5.4%
4127
 
5.3%
2458
 
3.1%
2308
 
3.0%
1897
 
2.4%
1872
 
2.4%
Other values (401) 42461
54.3%
Latin
ValueCountFrequency (%)
B 157
61.6%
A 19
 
7.5%
C 9
 
3.5%
M 7
 
2.7%
S 6
 
2.4%
H 5
 
2.0%
E 5
 
2.0%
I 5
 
2.0%
5
 
2.0%
K 4
 
1.6%
Other values (16) 33
 
12.9%
Common
ValueCountFrequency (%)
24707
39.8%
1 5608
 
9.0%
2 4189
 
6.8%
) 3880
 
6.3%
( 3879
 
6.3%
, 3776
 
6.1%
3 2888
 
4.7%
0 2625
 
4.2%
4 1972
 
3.2%
5 1906
 
3.1%
Other values (9) 6621
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 78182
55.7%
ASCII 62297
44.3%
Number Forms 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24707
39.7%
1 5608
 
9.0%
2 4189
 
6.7%
) 3880
 
6.2%
( 3879
 
6.2%
, 3776
 
6.1%
3 2888
 
4.6%
0 2625
 
4.2%
4 1972
 
3.2%
5 1906
 
3.1%
Other values (30) 6867
 
11.0%
Hangul
ValueCountFrequency (%)
4864
 
6.2%
4713
 
6.0%
4652
 
6.0%
4624
 
5.9%
4206
 
5.4%
4127
 
5.3%
2458
 
3.1%
2308
 
3.0%
1897
 
2.4%
1872
 
2.4%
Other values (401) 42461
54.3%
Number Forms
ValueCountFrequency (%)
5
55.6%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Distinct4437
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
2023-12-11T07:27:59.075481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length48
Mean length25.479207
Min length15

Characters and Unicode

Total characters119472
Distinct characters410
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4245 ?
Unique (%)90.5%

Sample

1st row경기도 가평군 조종면 현리 264-66 외 1필지 (264-67), 2층
2nd row경기도 가평군 가평읍 읍내리 471-2 지하1층
3rd row경기도 가평군 가평읍 읍내리 468-8
4th row경기도 가평군 가평읍 대곡리 239-1
5th row경기도 가평군 가평읍 대곡리 232-1 외 1필지 , 2층 가동
ValueCountFrequency (%)
경기도 4689
 
18.2%
평택시 497
 
1.9%
부천시 484
 
1.9%
지하1층 476
 
1.8%
안산시 445
 
1.7%
수원시 358
 
1.4%
성남시 336
 
1.3%
2층 310
 
1.2%
단원구 307
 
1.2%
지층 295
 
1.1%
Other values (5196) 17549
68.2%
2023-12-11T07:27:59.492894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24462
20.5%
1 6301
 
5.3%
4840
 
4.1%
4764
 
4.0%
4740
 
4.0%
4717
 
3.9%
4360
 
3.6%
- 4264
 
3.6%
2 3409
 
2.9%
3 2946
 
2.5%
Other values (400) 54669
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62667
52.5%
Decimal Number 26732
22.4%
Space Separator 24462
 
20.5%
Dash Punctuation 4264
 
3.6%
Other Punctuation 470
 
0.4%
Open Punctuation 335
 
0.3%
Close Punctuation 334
 
0.3%
Uppercase Letter 179
 
0.1%
Math Symbol 22
 
< 0.1%
Letter Number 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4840
 
7.7%
4764
 
7.6%
4740
 
7.6%
4717
 
7.5%
4360
 
7.0%
2307
 
3.7%
1802
 
2.9%
1689
 
2.7%
1361
 
2.2%
1162
 
1.9%
Other values (355) 30925
49.3%
Uppercase Letter
ValueCountFrequency (%)
B 113
63.1%
A 14
 
7.8%
C 12
 
6.7%
S 6
 
3.4%
I 5
 
2.8%
H 5
 
2.8%
G 4
 
2.2%
E 4
 
2.2%
M 3
 
1.7%
L 2
 
1.1%
Other values (9) 11
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 6301
23.6%
2 3409
12.8%
3 2946
11.0%
0 2764
10.3%
4 2487
 
9.3%
5 2019
 
7.6%
7 1993
 
7.5%
6 1756
 
6.6%
8 1749
 
6.5%
9 1308
 
4.9%
Letter Number
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 447
95.1%
. 22
 
4.7%
' 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 334
99.7%
[ 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 333
99.7%
] 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
~ 21
95.5%
> 1
 
4.5%
Space Separator
ValueCountFrequency (%)
24462
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4264
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62667
52.5%
Common 56619
47.4%
Latin 186
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4840
 
7.7%
4764
 
7.6%
4740
 
7.6%
4717
 
7.5%
4360
 
7.0%
2307
 
3.7%
1802
 
2.9%
1689
 
2.7%
1361
 
2.2%
1162
 
1.9%
Other values (355) 30925
49.3%
Latin
ValueCountFrequency (%)
B 113
60.8%
A 14
 
7.5%
C 12
 
6.5%
S 6
 
3.2%
I 5
 
2.7%
H 5
 
2.7%
G 4
 
2.2%
E 4
 
2.2%
M 3
 
1.6%
L 2
 
1.1%
Other values (14) 18
 
9.7%
Common
ValueCountFrequency (%)
24462
43.2%
1 6301
 
11.1%
- 4264
 
7.5%
2 3409
 
6.0%
3 2946
 
5.2%
0 2764
 
4.9%
4 2487
 
4.4%
5 2019
 
3.6%
7 1993
 
3.5%
6 1756
 
3.1%
Other values (11) 4218
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62667
52.5%
ASCII 56800
47.5%
Number Forms 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24462
43.1%
1 6301
 
11.1%
- 4264
 
7.5%
2 3409
 
6.0%
3 2946
 
5.2%
0 2764
 
4.9%
4 2487
 
4.4%
5 2019
 
3.6%
7 1993
 
3.5%
6 1756
 
3.1%
Other values (31) 4399
 
7.7%
Hangul
ValueCountFrequency (%)
4840
 
7.7%
4764
 
7.6%
4740
 
7.6%
4717
 
7.5%
4360
 
7.0%
2307
 
3.7%
1802
 
2.9%
1689
 
2.7%
1361
 
2.2%
1162
 
1.9%
Other values (355) 30925
49.3%
Number Forms
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct673
Distinct (%)14.4%
Missing22
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean14683.763
Minimum10018
Maximum18623
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size41.3 KiB
2023-12-11T07:27:59.630301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10018
5-th percentile10822
Q112662
median14644
Q316571
95-th percentile18141
Maximum18623
Range8605
Interquartile range (IQR)3909

Descriptive statistics

Standard deviation2413.4505
Coefficient of variation (CV)0.16436185
Kurtosis-1.0288542
Mean14683.763
Median Absolute Deviation (MAD)1932
Skewness-0.15886595
Sum68529122
Variance5824743.3
MonotonicityNot monotonic
2023-12-11T07:27:59.899485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15361 113
 
2.4%
14548 91
 
1.9%
14580 75
 
1.6%
10071 70
 
1.5%
17774 63
 
1.3%
15062 59
 
1.3%
16489 57
 
1.2%
15066 49
 
1.0%
17758 47
 
1.0%
14066 42
 
0.9%
Other values (663) 4001
85.3%
ValueCountFrequency (%)
10018 14
 
0.3%
10019 8
 
0.2%
10024 3
 
0.1%
10025 4
 
0.1%
10040 1
 
< 0.1%
10059 2
 
< 0.1%
10071 70
1.5%
10073 2
 
< 0.1%
10098 7
 
0.1%
10129 3
 
0.1%
ValueCountFrequency (%)
18623 3
 
0.1%
18611 5
 
0.1%
18606 22
0.5%
18600 2
 
< 0.1%
18593 28
0.6%
18591 1
 
< 0.1%
18584 1
 
< 0.1%
18577 1
 
< 0.1%
18567 6
 
0.1%
18565 6
 
0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct3314
Distinct (%)71.4%
Missing46
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean37.409421
Minimum36.92055
Maximum38.185409
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size41.3 KiB
2023-12-11T07:28:00.091153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.92055
5-th percentile36.99568
Q137.266115
median37.383616
Q337.503594
95-th percentile37.870442
Maximum38.185409
Range1.2648594
Interquartile range (IQR)0.23747935

Descriptive statistics

Standard deviation0.24268142
Coefficient of variation (CV)0.0064871739
Kurtosis-0.25875461
Mean37.409421
Median Absolute Deviation (MAD)0.11875836
Skewness0.37094329
Sum173691.94
Variance0.058894272
MonotonicityNot monotonic
2023-12-11T07:28:00.233191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.0458281259 14
 
0.3%
37.7148553223 14
 
0.3%
37.2741372443 11
 
0.2%
37.0474611388 11
 
0.2%
37.3182972549 11
 
0.2%
37.5614176573 11
 
0.2%
37.1484696706 10
 
0.2%
37.6441407006 9
 
0.2%
37.1158099993 9
 
0.2%
37.317162652 8
 
0.2%
Other values (3304) 4535
96.7%
(Missing) 46
 
1.0%
ValueCountFrequency (%)
36.9205495128 1
 
< 0.1%
36.9591698642 1
 
< 0.1%
36.9596657576 1
 
< 0.1%
36.9597125094 2
< 0.1%
36.9597522368 2
< 0.1%
36.9597668961 1
 
< 0.1%
36.9598955481 3
0.1%
36.9600856541 1
 
< 0.1%
36.9601292292 1
 
< 0.1%
36.9602259744 1
 
< 0.1%
ValueCountFrequency (%)
38.1854088791 1
< 0.1%
38.1019576622 1
< 0.1%
38.1012036186 1
< 0.1%
38.1001948371 1
< 0.1%
38.0914439791 1
< 0.1%
38.0911359589 1
< 0.1%
38.0910451365 2
< 0.1%
38.0908114997 1
< 0.1%
38.0613727686 1
< 0.1%
38.058140506 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct3314
Distinct (%)71.4%
Missing46
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean126.98784
Minimum126.55384
Maximum127.68043
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size41.3 KiB
2023-12-11T07:28:00.366273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.55384
5-th percentile126.73597
Q1126.82807
median127.01395
Q3127.09282
95-th percentile127.3678
Maximum127.68043
Range1.1265914
Interquartile range (IQR)0.26475186

Descriptive statistics

Standard deviation0.19758097
Coefficient of variation (CV)0.0015559046
Kurtosis0.72349814
Mean126.98784
Median Absolute Deviation (MAD)0.13528689
Skewness0.69133198
Sum589604.56
Variance0.039038238
MonotonicityNot monotonic
2023-12-11T07:28:00.788859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.0450553425 14
 
0.3%
126.7614890719 14
 
0.3%
126.9514301863 11
 
0.2%
127.0454279562 11
 
0.2%
126.8404047569 11
 
0.2%
127.1912058889 11
 
0.2%
127.0757913493 10
 
0.2%
126.6260787927 9
 
0.2%
126.9127510897 9
 
0.2%
126.8389926577 8
 
0.2%
Other values (3304) 4535
96.7%
(Missing) 46
 
1.0%
ValueCountFrequency (%)
126.5538399572 1
< 0.1%
126.5541581093 1
< 0.1%
126.5567860205 1
< 0.1%
126.55991476 1
< 0.1%
126.5606611859 2
< 0.1%
126.5608313578 1
< 0.1%
126.5824929244 1
< 0.1%
126.5864818952 1
< 0.1%
126.5976057487 1
< 0.1%
126.5978179671 1
< 0.1%
ValueCountFrequency (%)
127.6804313198 1
< 0.1%
127.6506032269 1
< 0.1%
127.64127488 1
< 0.1%
127.6398542064 1
< 0.1%
127.6397188594 1
< 0.1%
127.6390083568 1
< 0.1%
127.6384547077 1
< 0.1%
127.6372797797 1
< 0.1%
127.63706457 1
< 0.1%
127.636962632 1
< 0.1%

Interactions

2023-12-11T07:27:55.102384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:54.351837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:54.809798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:55.187485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:54.629813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:54.909624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:55.275940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:54.719368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:27:55.014915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:28:00.892242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명위생업태명소재지우편번호WGS84위도WGS84경도
시군명1.0000.3340.4860.9990.9810.979
영업상태명0.3341.0000.8380.2210.2000.192
위생업태명0.4860.8381.0000.3210.2540.247
소재지우편번호0.9990.2210.3211.0000.9370.909
WGS84위도0.9810.2000.2540.9371.0000.752
WGS84경도0.9790.1920.2470.9090.7521.000
2023-12-11T07:28:00.991127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위생업태명영업상태명시군명위생업종명
위생업태명1.0000.5330.1751.000
영업상태명0.5331.0000.1781.000
시군명0.1750.1781.0001.000
위생업종명1.0001.0001.0001.000
2023-12-11T07:28:01.077270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호WGS84위도WGS84경도시군명영업상태명위생업종명위생업태명
소재지우편번호1.000-0.9270.0510.9350.1341.0000.141
WGS84위도-0.9271.000-0.1530.7830.1201.0000.109
WGS84경도0.051-0.1531.0000.7760.1161.0000.106
시군명0.9350.7830.7761.0000.1781.0000.175
영업상태명0.1340.1200.1160.1781.0001.0000.533
위생업종명1.0001.0001.0001.0001.0001.0001.000
위생업태명0.1410.1090.1060.1750.5331.0001.000

Missing values

2023-12-11T07:27:55.397758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:27:55.591264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:27:55.786986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
0가평군동그라미단란주점2008-11-10영업<NA><NA><NA><NA>단란주점경기도 가평군 조종면 조종희망로5번길 12-4, 2층경기도 가평군 조종면 현리 264-66 외 1필지 (264-67), 2층1243737.819111127.349544
1가평군술마시는 쥬 노래장20010920영업<NA><NA><NA><NA>카바레경기도 가평군 가평읍 가화로 122, 지하1층경기도 가평군 가평읍 읍내리 471-2 지하1층1241937.830563127.513671
2가평군춘향이노래장2004-09-15영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 연인2길 2경기도 가평군 가평읍 읍내리 468-81241837.830401127.513239
3가평군별천지노래장1983-09-15영업<NA><NA><NA><NA>카바레경기도 가평군 가평읍 오리나무길 15경기도 가평군 가평읍 대곡리 239-11242037.826228127.514329
4가평군장녹수20051216영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 굴다리길 2, 가동 2층경기도 가평군 가평읍 대곡리 232-1 외 1필지 , 2층 가동1242037.826219127.514965
5가평군개미와베짱이1982-05-15영업<NA><NA><NA><NA>스텐드바경기도 가평군 가평읍 가화로 129, 지하1층경기도 가평군 가평읍 읍내리 449-1 지하1층1241337.83114127.512818
6가평군꽃밭에서2008-04-08영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 굴다리길 4, 지하층경기도 가평군 가평읍 대곡리 231-2 지하층1242037.826156127.514761
7가평군탑클래스(TOPclass)뮤직타운20071031영업<NA><NA><NA><NA>룸살롱경기도 가평군 가평읍 보납로 8-1, 2층경기도 가평군 가평읍 읍내리 495-38 2층1241837.831041127.511364
8가평군퍼스트2010-05-17영업<NA><NA><NA><NA>룸살롱경기도 가평군 조종면 조종새싹로4번길 14, 2층경기도 가평군 조종면 현리 263-19 외 2필지, 2층1243737.818958127.349783
9가평군고구려2011-05-12영업<NA><NA><NA><NA>룸살롱경기도 가평군 청평면 청평중앙로 62, 지하1층경기도 가평군 청평면 청평리 432-16 지하1층1245237.738494127.42112
시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
4679화성시목동비지니스클럽19770719폐업 등20140407<NA><NA>유흥주점영업카바레<NA>경기도 화성시 향남읍 평리 903-3번지<NA><NA><NA>
4680화성시엉터리유흥주점19840823폐업 등20050509<NA><NA>유흥주점영업카바레경기도 화성시 향남읍 배터길 11경기도 화성시 향남읍 평리 95-10번지1859337.132436126.906262
4681화성시장희빈19990624폐업 등20151016<NA><NA>유흥주점영업카바레<NA>경기도 화성시 오산동 868-1번지<NA>37.18622127.086614
4682화성시뉴보물섬19970409폐업 등20090522<NA><NA>유흥주점영업카바레<NA>경기도 화성시 진안동 527-16번지1840137.208207127.035247
4683화성시목마19830122폐업 등20151016<NA><NA>유흥주점영업카바레경기도 화성시 떡전골로 1122 (진안동)경기도 화성시 진안동 537-9번지1839037.207771127.033613
4684화성시엘란가요주점19791122폐업 등20150826<NA><NA>유흥주점영업카바레<NA>경기도 화성시 우정읍 조암리 270-5번지1856737.083325126.818684
4685화성시궁전노래빠19890508폐업 등20170622<NA><NA>유흥주점영업카바레경기도 화성시 향남읍 배터길 4-5경기도 화성시 향남읍 평리 102번지 지하1859337.132181126.907547
4686화성시로마 유흥주점19951216폐업 등20160905<NA><NA>유흥주점영업카바레경기도 화성시 장안면 3.1만세로 280경기도 화성시 장안면 어은리 46-11번지1858437.097264126.836755
4687화성시콩빠19950912폐업 등20180807<NA><NA>유흥주점영업카바레경기도 화성시 향남읍 평1길 13-1경기도 화성시 향남읍 평리 85-28번지1859337.132339126.909104
4688화성시풍차유흥주점19980120폐업 등20150826<NA><NA>유흥주점영업카바레경기도 화성시 우정읍 조암서로22번길 14-8경기도 화성시 우정읍 조암리 354-2번지1856737.082947126.817259