Overview

Dataset statistics

Number of variables: 14
Number of observations: 114
Missing cells: 116
Missing cells (%): 7.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.3 KiB
Average record size in memory: 119.2 B
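
As a quick consistency check, the missing-cell percentage can be recomputed from the counts in this block. The sketch below is illustrative only; the per-column missing counts are copied from the Alerts section of this report, and the variable names are made up for the example.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 14
n_observations = 114
missing_cells = 116

# Per-column missing counts as listed in the Alerts section.
missing_by_column = {"총시설규모(㎡)": 114, "다중이용업소여부": 2}

total_cells = n_variables * n_observations
missing_share = missing_cells / total_cells

print(f"{total_cells} cells in total, {missing_share:.1%} missing")
print("Alert counts add up:", sum(missing_by_column.values()) == missing_cells)
```

Almost all missing cells come from the single fully empty column 총시설규모(㎡), so the 7.3% figure says little about the other thirteen columns.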

Variable types

Categorical: 5
Text: 3
Numeric: 4
Boolean: 1
Unsupported: 1

Alerts

폐업일자 is highly overall correlated with 소재지우편번호 and 8 other fields (High correlation)
시군명 is highly overall correlated with 소재지우편번호 and 6 other fields (High correlation)
위생업태명 is highly overall correlated with 소재지우편번호 and 8 other fields (High correlation)
영업상태명 is highly overall correlated with 폐업일자 and 2 other fields (High correlation)
위생업종명 is highly overall correlated with 소재지우편번호 and 8 other fields (High correlation)
다중이용업소여부 is highly overall correlated with WGS84위도 and 4 other fields (High correlation)
소재지우편번호 is highly overall correlated with WGS84경도 and 4 other fields (High correlation)
인허가일자 is highly overall correlated with 폐업일자 and 2 other fields (High correlation)
WGS84위도 is highly overall correlated with 시군명 and 4 other fields (High correlation)
WGS84경도 is highly overall correlated with 소재지우편번호 and 4 other fields (High correlation)
영업상태명 is highly imbalanced (87.3%) (Imbalance)
폐업일자 is highly imbalanced (90.9%) (Imbalance)
다중이용업소여부 is highly imbalanced (56.6%) (Imbalance)
위생업종명 is highly imbalanced (87.3%) (Imbalance)
위생업태명 is highly imbalanced (87.3%) (Imbalance)
다중이용업소여부 has 2 (1.8%) missing values (Missing)
총시설규모(㎡) has 114 (100.0%) missing values (Missing)
사업장명 has unique values (Unique)
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
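
The report does not state how the imbalance percentages are computed. They appear consistent with one minus the Shannon entropy of the class shares, normalized by the logarithm of the number of classes. The sketch below works under that assumption; `imbalance_score` is a hypothetical helper name, and the class counts are copied from the variable summaries in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts from the variable summaries (missing values excluded
# for 다중이용업소여부, which is an assumption about how they are handled).
examples = {
    "영업상태명": [112, 2],
    "폐업일자": [112, 1, 1],
    "다중이용업소여부": [102, 10],
}
for name, counts in examples.items():
    print(f"{name}: {imbalance_score(counts):.1%}")
```

Under this assumption the three scores come out at roughly 87.3%, 90.9% and 56.6%, matching the alert values. A score near 100% means one class holds almost every row.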

Reproduction

Analysis started: 2023-12-10 23:01:20.866575
Analysis finished: 2023-12-10 23:01:23.450289
Duration: 2.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
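
The reported duration can be checked directly from the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction block.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 23:01:20.866575", FMT)
finished = datetime.strptime("2023-12-10 23:01:23.450289", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```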

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 19.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

부천시: 17
성남시: 15
화성시: 11
안양시: 10
양평군: 5
Other values (17): 56

Length

Max length: 4
Median length: 3
Mean length: 3.1052632
Min length: 3

Unique

Unique: 1
Unique (%): 0.9%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
부천시 | 17 | 14.9%
성남시 | 15 | 13.2%
화성시 | 11 | 9.6%
안양시 | 10 | 8.8%
양평군 | 5 | 4.4%
수원시 | 5 | 4.4%
평택시 | 5 | 4.4%
동두천시 | 5 | 4.4%
가평군 | 5 | 4.4%
하남시 | 4 | 3.5%
Other values (12) | 32 | 28.1%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
부천시 | 17 | 14.9%
성남시 | 15 | 13.2%
화성시 | 11 | 9.6%
안양시 | 10 | 8.8%
양평군 | 5 | 4.4%
수원시 | 5 | 4.4%
평택시 | 5 | 4.4%
동두천시 | 5 | 4.4%
가평군 | 5 | 4.4%
안산시 | 4 | 3.5%
Other values (12) | 32 | 28.1%

사업장명
Text

UNIQUE 

Distinct: 114
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 18
Median length: 11
Mean length: 5.9649123
Min length: 1

Characters and Unicode

Total characters: 680
Distinct characters: 201
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
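
To illustrate the character properties mentioned here, the following sketch counts the Unicode general category of each character in a string with Python's standard `unicodedata` module. The helper name `category_counts` is made up for this example, and the sample string is one of the 사업장명 values shown in this report. The two-letter codes map onto the category names used in the tables: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Nl` is Letter Number, `Ps` and `Pe` are Open and Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample value from the 사업장명 column of this report.
name = "힐링 술마시는 노래장"
print(category_counts(name))

# A Roman numeral from the Number Forms block is a Letter Number ('Nl').
print(unicodedata.category("\u2161"))
```

Scripts and blocks are not exposed by `unicodedata`, so reproducing those two breakdowns would need a separate data source.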

Unique

Unique: 114
Unique (%): 100.0%

Sample

1st row: 술마시는명지노래장
2nd row: 비타민노래광장
3rd row: 유튜브클럽
4th row: 별노래방
5th row: 힐링 술마시는 노래장

Value | Count | Frequency (%)
노래빠(bar | 3 | 2.3%
노래장 | 3 | 2.3%
링코노래타운 | 2 | 1.5%
노래짱 | 2 | 1.5%
술마시는명지노래장 | 1 | 0.8%
뮤직타운 | 1 | 0.8%
빨간여우 | 1 | 0.8%
명동노래빠 | 1 | 0.8%
딱좋아노래빠 | 1 | 0.8%
강남라이브클럽 | 1 | 0.8%
Other values (117) | 117 | 88.0%

Most occurring characters

ValueCountFrequency (%)
67
 
9.9%
67
 
9.9%
19
 
2.8%
19
 
2.8%
17
 
2.5%
17
 
2.5%
17
 
2.5%
0 16
 
2.4%
16
 
2.4%
15
 
2.2%
Other values (191) 410
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 581
85.4%
Decimal Number 38
 
5.6%
Space Separator 19
 
2.8%
Uppercase Letter 18
 
2.6%
Lowercase Letter 9
 
1.3%
Close Punctuation 6
 
0.9%
Open Punctuation 6
 
0.9%
Letter Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
11.5%
67
 
11.5%
19
 
3.3%
17
 
2.9%
17
 
2.9%
17
 
2.9%
16
 
2.8%
15
 
2.6%
14
 
2.4%
12
 
2.1%
Other values (161) 320
55.1%
Uppercase Letter
ValueCountFrequency (%)
A 3
16.7%
B 3
16.7%
R 3
16.7%
J 2
11.1%
K 2
11.1%
C 2
11.1%
O 1
 
5.6%
M 1
 
5.6%
U 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
0 16
42.1%
7 9
23.7%
8 6
 
15.8%
2 2
 
5.3%
9 2
 
5.3%
4 1
 
2.6%
1 1
 
2.6%
3 1
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
e 2
22.2%
u 1
11.1%
c 1
11.1%
i 1
11.1%
o 1
11.1%
h 1
11.1%
l 1
11.1%
b 1
11.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 581
85.4%
Common 69
 
10.1%
Latin 30
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
11.5%
67
 
11.5%
19
 
3.3%
17
 
2.9%
17
 
2.9%
17
 
2.9%
16
 
2.8%
15
 
2.6%
14
 
2.4%
12
 
2.1%
Other values (161) 320
55.1%
Latin
ValueCountFrequency (%)
A 3
 
10.0%
B 3
 
10.0%
R 3
 
10.0%
J 2
 
6.7%
2
 
6.7%
e 2
 
6.7%
K 2
 
6.7%
C 2
 
6.7%
O 1
 
3.3%
u 1
 
3.3%
Other values (9) 9
30.0%
Common
ValueCountFrequency (%)
19
27.5%
0 16
23.2%
7 9
13.0%
) 6
 
8.7%
( 6
 
8.7%
8 6
 
8.7%
2 2
 
2.9%
9 2
 
2.9%
4 1
 
1.4%
1 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 581
85.4%
ASCII 96
 
14.1%
Number Forms 3
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
67
 
11.5%
67
 
11.5%
19
 
3.3%
17
 
2.9%
17
 
2.9%
17
 
2.9%
16
 
2.8%
15
 
2.6%
14
 
2.4%
12
 
2.1%
Other values (161) 320
55.1%
ASCII
ValueCountFrequency (%)
19
19.8%
0 16
16.7%
7 9
 
9.4%
) 6
 
6.2%
( 6
 
6.2%
8 6
 
6.2%
A 3
 
3.1%
B 3
 
3.1%
R 3
 
3.1%
2 2
 
2.1%
Other values (18) 23
24.0%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 79
Distinct (%): 69.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 374464.32
Minimum: 11326
Maximum: 486903
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 11326
5-th percentile: 14544.95
Q1: 427605
Median: 445894.5
Q3: 465820
95-th percentile: 482903.65
Maximum: 486903
Range: 475577
Interquartile range (IQR): 38215

Descriptive statistics

Standard deviation: 172726.47
Coefficient of variation (CV): 0.46126282
Kurtosis: 0.6750846
Mean: 374464.32
Median Absolute Deviation (MAD): 19925.5
Skewness: -1.6112313
Sum: 42688933
Variance: 2.9834433 × 10^10
Monotonicity: Not monotonic
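
Several of these figures are derived from others and can be recomputed as a sanity check. The sketch below uses the standard definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared) with the rounded values printed above, so small rounding differences are expected.

```python
# Values copied from the 소재지우편번호 summary in this report.
minimum, maximum = 11326, 486903
q1, q3 = 427605, 465820
mean, std = 374464.32, 172726.47

value_range = maximum - minimum  # reported Range: 475577
iqr = q3 - q1                    # reported IQR: 38215
cv = std / mean                  # reported CV: 0.46126282
variance = std ** 2              # reported Variance: about 2.98e10

print(value_range, iqr, round(cv, 6), f"{variance:.4e}")
```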
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
445851 | 4 | 3.5%
465820 | 4 | 3.5%
462835 | 4 | 3.5%
461811 | 3 | 2.6%
445160 | 3 | 2.6%
476841 | 3 | 2.6%
431849 | 3 | 2.6%
477842 | 2 | 1.8%
482030 | 2 | 1.8%
14634 | 2 | 1.8%
Other values (69) | 84 | 73.7%

Value | Count | Frequency (%)
11326 | 1 | 0.9%
12437 | 1 | 0.9%
13646 | 1 | 0.9%
14420 | 1 | 0.9%
14542 | 1 | 0.9%
14543 | 1 | 0.9%
14546 | 1 | 0.9%
14548 | 2 | 1.8%
14580 | 2 | 1.8%
14582 | 2 | 1.8%

Value | Count | Frequency (%)
486903 | 2 | 1.8%
483040 | 1 | 0.9%
483030 | 1 | 0.9%
483020 | 2 | 1.8%
482841 | 1 | 0.9%
482030 | 2 | 1.8%
480848 | 1 | 0.9%
480843 | 1 | 0.9%
480842 | 1 | 0.9%
480841 | 1 | 0.9%
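
The value tables show both five-digit codes (for example 11326) and six-digit codes (for example 486903). This suggests the column mixes two postal code formats, so the mean and standard deviation are not meaningful as numbers. The sketch below separates the two by digit count; `postal_code_format` is a hypothetical helper, and it assumes the codes were read as integers with no leading zeros, which holds for the values shown here.

```python
def postal_code_format(code):
    """Classify a postal code by its digit count: 5-digit, 6-digit, or unexpected."""
    digits = str(int(code))
    if len(digits) == 5:
        return "5-digit"
    if len(digits) == 6:
        return "6-digit"
    return "unexpected"


# Codes taken from the value tables of this report.
for code in (11326, 12437, 445851, 486903):
    print(code, postal_code_format(code))
```

Treating the column as categorical text, or profiling the two formats separately, would likely give a more useful summary than the numeric statistics.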

소재지도로명주소
Text

Distinct: 113
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 70
Median length: 41
Mean length: 32.789474
Min length: 18

Characters and Unicode

Total characters: 3738
Distinct characters: 195
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 112
Unique (%): 98.2%

Sample

1st row: 경기도 가평군 북면 화악산로 17, 1층
2nd row: 경기도 가평군 가평읍 가화로 124, 2층
3rd row: 경기도 가평군 가평읍 가화로 138, 1층
4th row: 경기도 가평군 북면 화악산로 18-1, 1층
5th row: 경기도 가평군 조종면 조종새싹로4번길 15-9, 1층

Value | Count | Frequency (%)
경기도 | 114 | 14.6%
2층 | 22 | 2.8%
지하1층 | 22 | 2.8%
부천시 | 17 | 2.2%
성남시 | 15 | 1.9%
화성시 | 11 | 1.4%
수정구 | 10 | 1.3%
안양시 | 10 | 1.3%
3층 | 10 | 1.3%
동안구 | 9 | 1.2%
Other values (334) | 539 | 69.2%

Most occurring characters

ValueCountFrequency (%)
665
 
17.8%
1 155
 
4.1%
, 132
 
3.5%
120
 
3.2%
117
 
3.1%
116
 
3.1%
112
 
3.0%
108
 
2.9%
104
 
2.8%
( 101
 
2.7%
Other values (185) 2008
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2045
54.7%
Space Separator 665
 
17.8%
Decimal Number 652
 
17.4%
Other Punctuation 132
 
3.5%
Open Punctuation 101
 
2.7%
Close Punctuation 101
 
2.7%
Dash Punctuation 33
 
0.9%
Uppercase Letter 8
 
0.2%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
5.9%
117
 
5.7%
116
 
5.7%
112
 
5.5%
108
 
5.3%
104
 
5.1%
88
 
4.3%
61
 
3.0%
49
 
2.4%
49
 
2.4%
Other values (167) 1121
54.8%
Decimal Number
ValueCountFrequency (%)
1 155
23.8%
2 99
15.2%
0 78
12.0%
3 76
11.7%
4 56
 
8.6%
5 50
 
7.7%
6 40
 
6.1%
9 37
 
5.7%
7 34
 
5.2%
8 27
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
B 7
87.5%
A 1
 
12.5%
Space Separator
ValueCountFrequency (%)
665
100.0%
Other Punctuation
ValueCountFrequency (%)
, 132
100.0%
Open Punctuation
ValueCountFrequency (%)
( 101
100.0%
Close Punctuation
ValueCountFrequency (%)
) 101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2045
54.7%
Common 1684
45.1%
Latin 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
5.9%
117
 
5.7%
116
 
5.7%
112
 
5.5%
108
 
5.3%
104
 
5.1%
88
 
4.3%
61
 
3.0%
49
 
2.4%
49
 
2.4%
Other values (167) 1121
54.8%
Common
ValueCountFrequency (%)
665
39.5%
1 155
 
9.2%
, 132
 
7.8%
( 101
 
6.0%
) 101
 
6.0%
2 99
 
5.9%
0 78
 
4.6%
3 76
 
4.5%
4 56
 
3.3%
5 50
 
3.0%
Other values (5) 171
 
10.2%
Latin
ValueCountFrequency (%)
B 7
77.8%
1
 
11.1%
A 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2045
54.7%
ASCII 1692
45.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
665
39.3%
1 155
 
9.2%
, 132
 
7.8%
( 101
 
6.0%
) 101
 
6.0%
2 99
 
5.9%
0 78
 
4.6%
3 76
 
4.5%
4 56
 
3.3%
5 50
 
3.0%
Other values (7) 179
 
10.6%
Hangul
ValueCountFrequency (%)
120
 
5.9%
117
 
5.7%
116
 
5.7%
112
 
5.5%
108
 
5.3%
104
 
5.1%
88
 
4.3%
61
 
3.0%
49
 
2.4%
49
 
2.4%
Other values (167) 1121
54.8%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지지번주소
Text

Distinct: 113
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 61
Median length: 36
Mean length: 27.596491
Min length: 17

Characters and Unicode

Total characters: 3146
Distinct characters: 170
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 112
Unique (%): 98.2%

Sample

1st row: 경기도 가평군 북면 목동리 886-10번지 , 1층
2nd row: 경기도 가평군 가평읍 읍내리 471-3번지 외 1필지, 2층
3rd row: 경기도 가평군 가평읍 읍내리 443-17번지 1층
4th row: 경기도 가평군 북면 목동리 886-17번지 , 886-18(1층)
5th row: 경기도 가평군 조종면 현리 263-9번지 술마시는노래방

Value | Count | Frequency (%)
경기도 | 114 | 17.4%
지하1층 | 17 | 2.6%
부천시 | 17 | 2.6%
성남시 | 15 | 2.3%
2층 | 11 | 1.7%
화성시 | 11 | 1.7%
수정구 | 10 | 1.5%
안양시 | 10 | 1.5%
동안구 | 9 | 1.4%
신흥동 | 7 | 1.1%
Other values (278) | 434 | 66.3%

Most occurring characters

ValueCountFrequency (%)
541
 
17.2%
1 164
 
5.2%
147
 
4.7%
117
 
3.7%
116
 
3.7%
115
 
3.7%
114
 
3.6%
106
 
3.4%
105
 
3.3%
- 103
 
3.3%
Other values (160) 1518
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1784
56.7%
Decimal Number 690
 
21.9%
Space Separator 541
 
17.2%
Dash Punctuation 103
 
3.3%
Other Punctuation 17
 
0.5%
Uppercase Letter 4
 
0.1%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
 
8.2%
117
 
6.6%
116
 
6.5%
115
 
6.4%
114
 
6.4%
106
 
5.9%
105
 
5.9%
55
 
3.1%
42
 
2.4%
41
 
2.3%
Other values (142) 826
46.3%
Decimal Number
ValueCountFrequency (%)
1 164
23.8%
2 91
13.2%
3 80
11.6%
0 72
10.4%
4 64
 
9.3%
5 55
 
8.0%
7 47
 
6.8%
6 43
 
6.2%
8 38
 
5.5%
9 36
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
B 3
75.0%
A 1
 
25.0%
Space Separator
ValueCountFrequency (%)
541
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1784
56.7%
Common 1357
43.1%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
 
8.2%
117
 
6.6%
116
 
6.5%
115
 
6.4%
114
 
6.4%
106
 
5.9%
105
 
5.9%
55
 
3.1%
42
 
2.4%
41
 
2.3%
Other values (142) 826
46.3%
Common
ValueCountFrequency (%)
541
39.9%
1 164
 
12.1%
- 103
 
7.6%
2 91
 
6.7%
3 80
 
5.9%
0 72
 
5.3%
4 64
 
4.7%
5 55
 
4.1%
7 47
 
3.5%
6 43
 
3.2%
Other values (5) 97
 
7.1%
Latin
ValueCountFrequency (%)
B 3
60.0%
1
 
20.0%
A 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1784
56.7%
ASCII 1361
43.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
541
39.8%
1 164
 
12.0%
- 103
 
7.6%
2 91
 
6.7%
3 80
 
5.9%
0 72
 
5.3%
4 64
 
4.7%
5 55
 
4.0%
7 47
 
3.5%
6 43
 
3.2%
Other values (7) 101
 
7.4%
Hangul
ValueCountFrequency (%)
147
 
8.2%
117
 
6.6%
116
 
6.5%
115
 
6.4%
114
 
6.4%
106
 
5.9%
105
 
5.9%
55
 
3.1%
42
 
2.4%
41
 
2.3%
Other values (142) 826
46.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

인허가일자
Real number (ℝ)

HIGH CORRELATION 

Distinct: 108
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20146508
Minimum: 19860619
Maximum: 20180823
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 19860619
5-th percentile: 20010921
Q1: 20151074
Median: 20161110
Q3: 20171210
95-th percentile: 20180717
Maximum: 20180823
Range: 320204
Interquartile range (IQR): 20135.75

Descriptive statistics

Standard deviation: 64282.825
Coefficient of variation (CV): 0.0031907676
Kurtosis: 11.019224
Mean: 20146508
Median Absolute Deviation (MAD): 10092
Skewness: -3.3294994
Sum: 2.2967019 × 10^9
Variance: 4.1322816 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
20161212 | 3 | 2.6%
20160307 | 2 | 1.8%
20180328 | 2 | 1.8%
20170512 | 2 | 1.8%
20180615 | 2 | 1.8%
20170613 | 1 | 0.9%
20160913 | 1 | 0.9%
20020627 | 1 | 0.9%
20000501 | 1 | 0.9%
20160908 | 1 | 0.9%
Other values (98) | 98 | 86.0%

Value | Count | Frequency (%)
19860619 | 1 | 0.9%
19870123 | 1 | 0.9%
19871125 | 1 | 0.9%
19871212 | 1 | 0.9%
20000501 | 1 | 0.9%
20010920 | 1 | 0.9%
20010922 | 1 | 0.9%
20020627 | 1 | 0.9%
20021212 | 1 | 0.9%
20051111 | 1 | 0.9%

Value | Count | Frequency (%)
20180823 | 1 | 0.9%
20180816 | 1 | 0.9%
20180809 | 1 | 0.9%
20180807 | 1 | 0.9%
20180731 | 1 | 0.9%
20180726 | 1 | 0.9%
20180712 | 1 | 0.9%
20180705 | 1 | 0.9%
20180620 | 1 | 0.9%
20180615 | 2 | 1.8%
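
The values in this column look like dates written as YYYYMMDD integers, which is why the profiler treated them as real numbers. Statistics such as the range (320204) are then differences between integers, not spans of time. The sketch below converts the reported minimum and maximum into calendar dates, assuming the YYYYMMDD reading is correct; `parse_yyyymmdd` is a hypothetical helper name.

```python
from datetime import date, datetime


def parse_yyyymmdd(value):
    """Convert an integer such as 19860619 into a datetime.date."""
    return datetime.strptime(str(int(value)), "%Y%m%d").date()


# Minimum and maximum taken from the 인허가일자 summary of this report.
earliest = parse_yyyymmdd(19860619)
latest = parse_yyyymmdd(20180823)

print(earliest, latest, (latest - earliest).days, "days apart")
```

Converting the column to a date type before profiling would let the report show a proper time range instead of numeric quantiles.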

영업상태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

운영중: 112
폐업 등: 2

Length

Max length: 4
Median length: 3
Mean length: 3.0175439
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 운영중
2nd row: 운영중
3rd row: 운영중
4th row: 운영중
5th row: 운영중

Common Values

Value | Count | Frequency (%)
운영중 | 112 | 98.2%
폐업 등 | 2 | 1.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
운영중 | 112 | 96.6%
폐업 | 2 | 1.7%
(blank) | 2 | 1.7%

폐업일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

<NA>: 112
20170314: 1
20170501: 1

Length

Max length: 8
Median length: 4
Mean length: 4.0701754
Min length: 4

Unique

Unique: 2
Unique (%): 1.8%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 112 | 98.2%
20170314 | 1 | 0.9%
20170501 | 1 | 0.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 112 | 98.2%
20170314 | 1 | 0.9%
20170501 | 1 | 0.9%

다중이용업소여부
Boolean

HIGH CORRELATION  IMBALANCE  MISSING 

Distinct: 2
Distinct (%): 1.8%
Missing: 2
Missing (%): 1.8%
Memory size: 360.0 B

True: 102
False: 10
(Missing): 2

Value | Count | Frequency (%)
True | 102 | 89.5%
False | 10 | 8.8%
(Missing) | 2 | 1.8%

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 114
Missing (%): 100.0%
Memory size: 1.1 KiB

위생업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

유흥주점영업: 112
<NA>: 2

Length

Max length: 6
Median length: 6
Mean length: 5.9649123
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
유흥주점영업 | 112 | 98.2%
<NA> | 2 | 1.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
유흥주점영업 | 112 | 98.2%
na | 2 | 1.8%

위생업태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

노래클럽: 112
<NA>: 2

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 노래클럽
2nd row: 노래클럽
3rd row: 노래클럽
4th row: 노래클럽
5th row: 노래클럽

Common Values

Value | Count | Frequency (%)
노래클럽 | 112 | 98.2%
<NA> | 2 | 1.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
노래클럽 | 112 | 98.2%
na | 2 | 1.8%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 106
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.460251
Minimum: 36.960669
Maximum: 38.028527
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 36.960669
5-th percentile: 37.115434
Q1: 37.320063
Median: 37.442709
Q3: 37.506728
95-th percentile: 37.892073
Maximum: 38.028527
Range: 1.0678583
Interquartile range (IQR): 0.18666491

Descriptive statistics

Standard deviation: 0.22369124
Coefficient of variation (CV): 0.0059714292
Kurtosis: 0.22877023
Mean: 37.460251
Median Absolute Deviation (MAD): 0.094583252
Skewness: 0.42441586
Sum: 4270.4687
Variance: 0.050037771
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
37.3934663589 | 3 | 2.6%
37.4649011867 | 3 | 2.6%
37.1997037609 | 2 | 1.8%
37.1154343513 | 2 | 1.8%
37.4080421705 | 2 | 1.8%
37.795646755 | 2 | 1.8%
37.8868313174 | 1 | 0.9%
37.3688749664 | 1 | 0.9%
38.0285273965 | 1 | 0.9%
38.0271977481 | 1 | 0.9%
Other values (96) | 96 | 84.2%

Value | Count | Frequency (%)
36.9606690486 | 1 | 0.9%
36.9876400409 | 1 | 0.9%
36.9923351093 | 1 | 0.9%
37.01828572 | 1 | 0.9%
37.0793423067 | 1 | 0.9%
37.1154343513 | 2 | 1.8%
37.1997037609 | 2 | 1.8%
37.1998585243 | 1 | 0.9%
37.2007752169 | 1 | 0.9%
37.2011454311 | 1 | 0.9%

Value | Count | Frequency (%)
38.0285273965 | 1 | 0.9%
38.0271977481 | 1 | 0.9%
37.9101376891 | 1 | 0.9%
37.9045642072 | 1 | 0.9%
37.8950968626 | 1 | 0.9%
37.8920770782 | 1 | 0.9%
37.8920711288 | 1 | 0.9%
37.8868313174 | 1 | 0.9%
37.88675004 | 1 | 0.9%
37.8317891047 | 1 | 0.9%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 106
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.04928
Minimum: 126.74713
Maximum: 127.59483
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 126.74713
5-th percentile: 126.76733
Q1: 126.84843
Median: 127.05142
Q3: 127.14652
95-th percentile: 127.513
Maximum: 127.59483
Range: 0.8477056
Interquartile range (IQR): 0.29808651

Descriptive statistics

Standard deviation: 0.21951957
Coefficient of variation (CV): 0.0017278301
Kurtosis: 0.098843972
Mean: 127.04928
Median Absolute Deviation (MAD): 0.13989162
Skewness: 0.71995226
Sum: 14483.618
Variance: 0.04818884
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
126.9621754003 | 3 | 2.6%
127.1406487769 | 3 | 2.6%
126.8279386505 | 2 | 1.8%
126.9115271029 | 2 | 1.8%
127.2572838474 | 2 | 1.8%
127.08010284 | 2 | 1.8%
127.5499363005 | 1 | 0.9%
126.9538920639 | 1 | 0.9%
127.0685906594 | 1 | 0.9%
127.0697492506 | 1 | 0.9%
Other values (96) | 96 | 84.2%

Value | Count | Frequency (%)
126.7471274321 | 1 | 0.9%
126.7508482445 | 1 | 0.9%
126.7523469055 | 1 | 0.9%
126.7553573099 | 1 | 0.9%
126.7565370124 | 1 | 0.9%
126.7618757445 | 1 | 0.9%
126.7702615032 | 1 | 0.9%
126.772343084 | 1 | 0.9%
126.7756713671 | 1 | 0.9%
126.7761272631 | 1 | 0.9%

Value | Count | Frequency (%)
127.5948330311 | 1 | 0.9%
127.5946185167 | 1 | 0.9%
127.5937806751 | 1 | 0.9%
127.5503115348 | 1 | 0.9%
127.5499363005 | 1 | 0.9%
127.5134661294 | 1 | 0.9%
127.5127503949 | 1 | 0.9%
127.4923304913 | 1 | 0.9%
127.4911201345 | 1 | 0.9%
127.4895467225 | 1 | 0.9%

Interactions

[Figure: interaction plots, 16 panels]

Correlations

Variable | 시군명 | 소재지우편번호 | 인허가일자 | 영업상태명 | 폐업일자 | 다중이용업소여부 | WGS84위도 | WGS84경도
시군명 | 1.000 | 0.980 | 0.764 | 0.000 | 0.000 | 0.878 | 0.970 | 0.964
소재지우편번호 | 0.980 | 1.000 | 0.266 | 0.000 | NaN | 0.000 | 0.713 | 0.857
인허가일자 | 0.764 | 0.266 | 1.000 | 0.000 | NaN | 0.427 | 0.522 | 0.000
영업상태명 | 0.000 | 0.000 | 0.000 | 1.000 | NaN | 0.000 | 0.000 | 0.000
폐업일자 | 0.000 | NaN | NaN | NaN | 1.000 | NaN | 0.000 | 0.000
다중이용업소여부 | 0.878 | 0.000 | 0.427 | 0.000 | NaN | 1.000 | 0.750 | 0.381
WGS84위도 | 0.970 | 0.713 | 0.522 | 0.000 | 0.000 | 0.750 | 1.000 | 0.806
WGS84경도 | 0.964 | 0.857 | 0.000 | 0.000 | 0.000 | 0.381 | 0.806 | 1.000

Variable | 폐업일자 | 시군명 | 위생업태명 | 영업상태명 | 위생업종명 | 다중이용업소여부
폐업일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
시군명 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.669
위생업태명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
영업상태명 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000
위생업종명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
다중이용업소여부 | 1.000 | 0.669 | 1.000 | 0.000 | 1.000 | 1.000

Variable | 소재지우편번호 | 인허가일자 | WGS84위도 | WGS84경도 | 시군명 | 영업상태명 | 폐업일자 | 다중이용업소여부 | 위생업종명 | 위생업태명
소재지우편번호 | 1.000 | 0.123 | 0.271 | 0.726 | 0.851 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000
인허가일자 | 0.123 | 1.000 | -0.037 | 0.218 | 0.415 | 0.000 | 1.000 | 0.405 | 1.000 | 1.000
WGS84위도 | 0.271 | -0.037 | 1.000 | 0.154 | 0.789 | 0.000 | 1.000 | 0.567 | 1.000 | 1.000
WGS84경도 | 0.726 | 0.218 | 0.154 | 1.000 | 0.766 | 0.000 | 1.000 | 0.280 | 1.000 | 1.000
시군명 | 0.851 | 0.415 | 0.789 | 0.766 | 1.000 | 0.000 | 1.000 | 0.669 | 1.000 | 1.000
영업상태명 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
폐업일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
다중이용업소여부 | 0.000 | 0.405 | 0.567 | 0.280 | 0.669 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
위생업종명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
위생업태명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
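
The report does not say which coefficient each matrix uses. For pairs of categorical columns, one common choice is Cramér's V, and the sketch below shows the plain, uncorrected form as an illustration only; the profiler may apply a corrected variant or a different measure. `cramers_v` is a hypothetical helper, and the two example tables are made up, not taken from the dataset.

```python
import math


def cramers_v(table):
    """Plain (uncorrected) Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if n > 0 and k > 0 else 0.0


# Hypothetical 2x2 tables, not taken from the dataset.
print(cramers_v([[10, 0], [0, 10]]))  # perfectly associated
print(cramers_v([[5, 5], [5, 5]]))    # independent
```

Coefficients of 1.000 involving 폐업일자, 위생업종명 and 위생업태명 should be read with care: each of these columns has only two rows outside its dominant value, so a handful of records determines the result.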

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

# | 시군명 | 사업장명 | 소재지우편번호 | 소재지도로명주소 | 소재지지번주소 | 인허가일자 | 영업상태명 | 폐업일자 | 다중이용업소여부 | 총시설규모(㎡) | 위생업종명 | 위생업태명 | WGS84위도 | WGS84경도
0 | 가평군 | 술마시는명지노래장 | 477842 | 경기도 가평군 북면 화악산로 17, 1층 | 경기도 가평군 북면 목동리 886-10번지 , 1층 | 20170613 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.886831 | 127.549936
1 | 가평군 | 비타민노래광장 | 477801 | 경기도 가평군 가평읍 가화로 124, 2층 | 경기도 가평군 가평읍 읍내리 471-3번지 외 1필지, 2층 | 20160706 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.830716 | 127.513466
2 | 가평군 | 유튜브클럽 | 477801 | 경기도 가평군 가평읍 가화로 138, 1층 | 경기도 가평군 가평읍 읍내리 443-17번지 1층 | 20180514 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.831789 | 127.51275
3 | 가평군 | 별노래방 | 477842 | 경기도 가평군 북면 화악산로 18-1, 1층 | 경기도 가평군 북면 목동리 886-17번지 , 886-18(1층) | 20170103 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.88675 | 127.550312
4 | 가평군 | 힐링 술마시는 노래장 | 12437 | 경기도 가평군 조종면 조종새싹로4번길 15-9, 1층 | 경기도 가평군 조종면 현리 263-9번지 술마시는노래방 | 20171129 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.819319 | 127.349805
5 | 고양시 | 보물섬가요주점 | 412827 | 경기도 고양시 덕양구 화신로260번길 37 (화정동, 진솔그린프라자 지하1층(5호일부,6,7,8,9,10,11호,16호일부)) | 경기도 고양시 덕양구 화정동 979번지 진솔그린프라자 지하1층(5호일부,6,7,8,9,10,11호,16호일부) | 20021212 | 운영중 | <NA> | N | <NA> | 유흥주점영업 | 노래클럽 | 37.632483 | 126.832526
6 | 광명시 | 뿌리 | 423858 | 경기도 광명시 오리로 937 (광명동,지층) | 경기도 광명시 광명동 158-306번지 지층 | 19870123 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.477967 | 126.858649
7 | 광명시 | 랑데뷰노래바 | 423848 | 경기도 광명시 범안로 1042, 지층 1호 (하안동) | 경기도 광명시 하안동 36-1번지 지층 1호 | 20170612 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.461833 | 126.879028
8 | 광명시 | 옥타곤노래클럽 | 423837 | 경기도 광명시 오리로854번길 16-7, 4층 (철산동) | 경기도 광명시 철산동 429번지 | 20170908 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.474769 | 126.869115
9 | 광주시 | 신나는 유흥주점 | 464807 | 경기도 광주시 중앙로 95-35, 낙원빌딩 2층 (역동) | 경기도 광주시 역동 8-5번지 낙원빌딩 2층 | 20180102 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.408042 | 127.257284

# | 시군명 | 사업장명 | 소재지우편번호 | 소재지도로명주소 | 소재지지번주소 | 인허가일자 | 영업상태명 | 폐업일자 | 다중이용업소여부 | 총시설규모(㎡) | 위생업종명 | 위생업태명 | WGS84위도 | WGS84경도
104 | 화성시 | 허브노래빠 | 445170 | 경기도 화성시 삼성1로5길 5, 205호 (석우동) | 경기도 화성시 석우동 3-1번지 205호 | 20160316 | 운영중 | <NA> | N | <NA> | 유흥주점영업 | 노래클럽 | 37.224918 | 127.073278
105 | 화성시 | 상상 | 445160 | 경기도 화성시 동탄중심상가2길 15, 수성프라자 503호 (반송동) | 경기도 화성시 반송동 88-8번지 수성프라자 503호 | 20180410 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.205674 | 127.073556
106 | 화성시 | 한마음노래광장 | 445851 | 경기도 화성시 남양읍 역골로 9-14 (2층 202호) | 경기도 화성시 남양읍 남양리 2076-13번지 2층 202호 | 20170405 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.200775 | 126.827726
107 | 화성시 | 골든노래클럽 | 445938 | 경기도 화성시 향남읍 상신하길로298번길 7-13 (3층 304, 305-1호) | 경기도 화성시 향남읍 하길리 1472-5번지 3층 304, 305-1호 | 20161012 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.115434 | 126.911527
108 | 화성시 | 빅마마라이브 | 445320 | 경기도 화성시 동탄원천로 354-28, 206호 (능동, 이너매스) | 경기도 화성시 능동 1064-5번지 206호 | 20160909 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.218275 | 127.058731
109 | 화성시 | 7080라이브아리조나 | 445851 | 경기도 화성시 남양읍 남양로 695 (4층 401, 401-1호) | 경기도 화성시 남양읍 남양리 2078-4번지 4층 401, 401-1호 | 20170828 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.199859 | 126.828208
110 | 화성시 | e노래클럽 | 445851 | 경기도 화성시 남양읍 남양로 691-1 (1층일부) | 경기도 화성시 남양읍 남양리 2078-2번지 1층일부 | 20170516 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.199704 | 126.827939
111 | 화성시 | 해노래클럽 | 445938 | 경기도 화성시 향남읍 상신하길로298번길 7-13 (골든프라자 4층 404호) | 경기도 화성시 향남읍 하길리 1472-5번지 골든프라자 4층 404호 | 20161014 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.115434 | 126.911527
112 | 화성시 | 레옹 | 445160 | 경기도 화성시 동탄중심상가1길 27, 2층 일부호 (반송동, 프라임빌딩) | 경기도 화성시 반송동 104-6번지 2층일부호 | 20170512 | 운영중 | <NA> | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.201145 | 127.072556
113 | 화성시 | K노래클럽 | 445851 | 경기도 화성시 남양읍 남양로 691-1 (1층일부) | 경기도 화성시 남양읍 남양리 2078-2번지 1층일부 | 20170106 | 폐업 등 | 20170501 | Y | <NA> | 유흥주점영업 | 노래클럽 | 37.199704 | 126.827939