Overview

Dataset statistics

Number of variables14
Number of observations188
Missing cells353
Missing cells (%)13.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.8 KiB
Average record size in memory118.7 B

Variable types

Categorical4
Text3
Numeric5
Boolean1
Unsupported1

Alerts

위생업종명 is highly overall correlated with 인허가일자 and 8 other fieldsHigh correlation
위생업태명 is highly overall correlated with 인허가일자 and 8 other fieldsHigh correlation
다중이용업소여부 is highly overall correlated with 위생업종명 and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 4 other fieldsHigh correlation
영업상태명 is highly overall correlated with 폐업일자 and 2 other fieldsHigh correlation
인허가일자 is highly overall correlated with 위생업종명 and 1 other fieldsHigh correlation
폐업일자 is highly overall correlated with 영업상태명 and 2 other fieldsHigh correlation
소재지우편번호 is highly overall correlated with WGS84경도 and 3 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 시군명 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
위생업종명 is highly imbalanced (91.5%)Imbalance
위생업태명 is highly imbalanced (91.5%)Imbalance
폐업일자 has 163 (86.7%) missing valuesMissing
다중이용업소여부 has 2 (1.1%) missing valuesMissing
총시설규모(㎡) has 188 (100.0%) missing valuesMissing
소재지도로명주소 has unique valuesUnique
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 21:41:17.191341
Analysis finished2023-12-10 21:41:20.775050
Duration3.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
성남시
24 
파주시
17 
화성시
15 
김포시
15 
하남시
13 
Other values (20)
104 

Length

Max length4
Median length3
Mean length3.1117021
Min length3

Unique

Unique6 ?
Unique (%)3.2%

Sample

1st row고양시
2nd row고양시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
성남시 24
12.8%
파주시 17
 
9.0%
화성시 15
 
8.0%
김포시 15
 
8.0%
하남시 13
 
6.9%
수원시 13
 
6.9%
의정부시 13
 
6.9%
용인시 12
 
6.4%
안산시 12
 
6.4%
평택시 9
 
4.8%
Other values (15) 45
23.9%

Length

2023-12-11T06:41:20.832107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성남시 24
12.8%
파주시 17
 
9.0%
화성시 15
 
8.0%
김포시 15
 
8.0%
하남시 13
 
6.9%
수원시 13
 
6.9%
의정부시 13
 
6.9%
용인시 12
 
6.4%
안산시 12
 
6.4%
평택시 9
 
4.8%
Other values (15) 45
23.9%
Distinct185
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-11T06:41:21.127887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length8.5159574
Min length2

Characters and Unicode

Total characters1601
Distinct characters334
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)96.8%

Sample

1st row화류
2nd row피카(FIKA)
3rd row유로코피자 일산서구점
4th row(주)난리피자푸드
5th row수비드하우스
ValueCountFrequency (%)
맘스터치 4
 
1.6%
올리앤 4
 
1.6%
하남미사점 3
 
1.2%
라라코스트 3
 
1.2%
돈까스클럽 2
 
0.8%
슈퍼스테이크 2
 
0.8%
판교점 2
 
0.8%
서현점 2
 
0.8%
혜화동돈까스 2
 
0.8%
텐바 1
 
0.4%
Other values (231) 231
90.2%
2023-12-11T06:41:21.545257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
68
 
4.2%
66
 
4.1%
64
 
4.0%
( 45
 
2.8%
) 45
 
2.8%
37
 
2.3%
31
 
1.9%
31
 
1.9%
29
 
1.8%
29
 
1.8%
Other values (324) 1156
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1167
72.9%
Uppercase Letter 130
 
8.1%
Lowercase Letter 111
 
6.9%
Space Separator 68
 
4.2%
Open Punctuation 45
 
2.8%
Close Punctuation 45
 
2.8%
Decimal Number 26
 
1.6%
Other Punctuation 9
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
 
5.7%
64
 
5.5%
37
 
3.2%
31
 
2.7%
31
 
2.7%
29
 
2.5%
29
 
2.5%
15
 
1.3%
14
 
1.2%
14
 
1.2%
Other values (268) 837
71.7%
Uppercase Letter
ValueCountFrequency (%)
A 12
 
9.2%
T 11
 
8.5%
C 10
 
7.7%
N 10
 
7.7%
O 9
 
6.9%
E 9
 
6.9%
U 7
 
5.4%
P 7
 
5.4%
I 7
 
5.4%
G 6
 
4.6%
Other values (12) 42
32.3%
Lowercase Letter
ValueCountFrequency (%)
a 13
11.7%
e 11
9.9%
o 11
9.9%
t 9
 
8.1%
i 9
 
8.1%
c 8
 
7.2%
n 7
 
6.3%
z 6
 
5.4%
r 6
 
5.4%
s 5
 
4.5%
Other values (10) 26
23.4%
Decimal Number
ValueCountFrequency (%)
0 7
26.9%
7 6
23.1%
1 4
15.4%
9 3
11.5%
8 2
 
7.7%
5 2
 
7.7%
3 1
 
3.8%
2 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
& 5
55.6%
. 3
33.3%
' 1
 
11.1%
Space Separator
ValueCountFrequency (%)
68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1167
72.9%
Latin 241
 
15.1%
Common 193
 
12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
 
5.7%
64
 
5.5%
37
 
3.2%
31
 
2.7%
31
 
2.7%
29
 
2.5%
29
 
2.5%
15
 
1.3%
14
 
1.2%
14
 
1.2%
Other values (268) 837
71.7%
Latin
ValueCountFrequency (%)
a 13
 
5.4%
A 12
 
5.0%
T 11
 
4.6%
e 11
 
4.6%
o 11
 
4.6%
C 10
 
4.1%
N 10
 
4.1%
O 9
 
3.7%
t 9
 
3.7%
i 9
 
3.7%
Other values (32) 136
56.4%
Common
ValueCountFrequency (%)
68
35.2%
( 45
23.3%
) 45
23.3%
0 7
 
3.6%
7 6
 
3.1%
& 5
 
2.6%
1 4
 
2.1%
. 3
 
1.6%
9 3
 
1.6%
8 2
 
1.0%
Other values (4) 5
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1167
72.9%
ASCII 434
 
27.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
68
 
15.7%
( 45
 
10.4%
) 45
 
10.4%
a 13
 
3.0%
A 12
 
2.8%
T 11
 
2.5%
e 11
 
2.5%
o 11
 
2.5%
C 10
 
2.3%
N 10
 
2.3%
Other values (46) 198
45.6%
Hangul
ValueCountFrequency (%)
66
 
5.7%
64
 
5.5%
37
 
3.2%
31
 
2.7%
31
 
2.7%
29
 
2.5%
29
 
2.5%
15
 
1.3%
14
 
1.2%
14
 
1.2%
Other values (268) 837
71.7%

인허가일자
Real number (ℝ)

HIGH CORRELATION 

Distinct171
Distinct (%)91.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20155252
Minimum19960503
Maximum20180829
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T06:41:21.680231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19960503
5-th percentile20084190
Q120151020
median20160920
Q320170913
95-th percentile20180525
Maximum20180829
Range220326
Interquartile range (IQR)19892.75

Descriptive statistics

Standard deviation34567.436
Coefficient of variation (CV)0.0017150585
Kurtosis13.731972
Mean20155252
Median Absolute Deviation (MAD)9910.5
Skewness-3.4816387
Sum3.7891874 × 109
Variance1.1949077 × 109
MonotonicityNot monotonic
2023-12-11T06:41:21.813189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20150910 3
 
1.6%
20171113 3
 
1.6%
20160822 3
 
1.6%
20170711 2
 
1.1%
20160720 2
 
1.1%
20160701 2
 
1.1%
20170817 2
 
1.1%
20151020 2
 
1.1%
20150807 2
 
1.1%
20151218 2
 
1.1%
Other values (161) 165
87.8%
ValueCountFrequency (%)
19960503 1
0.5%
19980720 1
0.5%
19980909 1
0.5%
20020608 1
0.5%
20021126 1
0.5%
20050727 1
0.5%
20060102 1
0.5%
20060621 1
0.5%
20080326 1
0.5%
20080620 1
0.5%
ValueCountFrequency (%)
20180829 1
0.5%
20180823 1
0.5%
20180801 1
0.5%
20180712 1
0.5%
20180709 1
0.5%
20180621 1
0.5%
20180620 1
0.5%
20180618 1
0.5%
20180611 1
0.5%
20180531 1
0.5%

영업상태명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
운영중
163 
폐업 등
25 

Length

Max length4
Median length3
Mean length3.1329787
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운영중
2nd row운영중
3rd row운영중
4th row운영중
5th row운영중

Common Values

ValueCountFrequency (%)
운영중 163
86.7%
폐업 등 25
 
13.3%

Length

2023-12-11T06:41:21.947668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:41:22.113291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영중 163
76.5%
폐업 25
 
11.7%
25
 
11.7%

폐업일자
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct24
Distinct (%)96.0%
Missing163
Missing (%)86.7%
Infinite0
Infinite (%)0.0%
Mean20171088
Minimum20150831
Maximum20180816
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T06:41:22.220452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20150831
5-th percentile20153028
Q120161221
median20171103
Q320180307
95-th percentile20180766
Maximum20180816
Range29985
Interquartile range (IQR)19086

Descriptive statistics

Standard deviation9601.5954
Coefficient of variation (CV)0.0004760078
Kurtosis-0.52930926
Mean20171088
Median Absolute Deviation (MAD)9310
Skewness-0.69038431
Sum5.0427721 × 108
Variance92190634
MonotonicityNot monotonic
2023-12-11T06:41:22.373319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
20180123 2
 
1.1%
20161221 1
 
0.5%
20180816 1
 
0.5%
20160420 1
 
0.5%
20180523 1
 
0.5%
20171219 1
 
0.5%
20170828 1
 
0.5%
20150831 1
 
0.5%
20171226 1
 
0.5%
20160331 1
 
0.5%
Other values (14) 14
 
7.4%
(Missing) 163
86.7%
ValueCountFrequency (%)
20150831 1
0.5%
20151202 1
0.5%
20160331 1
0.5%
20160420 1
0.5%
20160921 1
0.5%
20161011 1
0.5%
20161221 1
0.5%
20170517 1
0.5%
20170524 1
0.5%
20170712 1
0.5%
ValueCountFrequency (%)
20180816 1
0.5%
20180807 1
0.5%
20180601 1
0.5%
20180523 1
0.5%
20180503 1
0.5%
20180413 1
0.5%
20180307 1
0.5%
20180123 2
1.1%
20180119 1
0.5%
20171226 1
0.5%

다중이용업소여부
Boolean

HIGH CORRELATION  MISSING 

Distinct2
Distinct (%)1.1%
Missing2
Missing (%)1.1%
Memory size508.0 B
False
131 
True
55 
(Missing)
 
2
ValueCountFrequency (%)
False 131
69.7%
True 55
29.3%
(Missing) 2
 
1.1%
2023-12-11T06:41:22.521346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing188
Missing (%)100.0%
Memory size1.8 KiB

위생업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
일반음식점
186 
<NA>
 
2

Length

Max length5
Median length5
Mean length4.9893617
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 186
98.9%
<NA> 2
 
1.1%

Length

2023-12-11T06:41:22.648207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:41:22.773789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 186
98.9%
na 2
 
1.1%

위생업태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
패밀리레스트랑
186 
<NA>
 
2

Length

Max length7
Median length7
Mean length6.9680851
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row패밀리레스트랑
2nd row패밀리레스트랑
3rd row패밀리레스트랑
4th row패밀리레스트랑
5th row패밀리레스트랑

Common Values

ValueCountFrequency (%)
패밀리레스트랑 186
98.9%
<NA> 2
 
1.1%

Length

2023-12-11T06:41:22.887688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:41:22.990481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
패밀리레스트랑 186
98.9%
na 2
 
1.1%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct131
Distinct (%)69.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean414539.42
Minimum11033
Maximum483020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T06:41:23.123134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11033
5-th percentile14679.2
Q1415623.25
median445926
Q3463830
95-th percentile480551.5
Maximum483020
Range471987
Interquartile range (IQR)48206.75

Descriptive statistics

Standard deviation115782.83
Coefficient of variation (CV)0.27930475
Kurtosis7.9865875
Mean414539.42
Median Absolute Deviation (MAD)19956
Skewness-3.0718904
Sum77933411
Variance1.3405663 × 1010
MonotonicityNot monotonic
2023-12-11T06:41:23.279062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
461210 6
 
3.2%
480090 6
 
3.2%
445160 6
 
3.2%
465150 5
 
2.7%
415060 5
 
2.7%
413110 4
 
2.1%
465030 4
 
2.1%
463420 4
 
2.1%
425868 3
 
1.6%
415080 3
 
1.6%
Other values (121) 142
75.5%
ValueCountFrequency (%)
11033 1
0.5%
12942 1
0.5%
13822 1
0.5%
14423 1
0.5%
14534 1
0.5%
14545 1
0.5%
14548 1
0.5%
14637 1
0.5%
14663 1
0.5%
14668 1
0.5%
ValueCountFrequency (%)
483020 1
 
0.5%
482080 1
 
0.5%
482060 1
 
0.5%
482030 1
 
0.5%
480850 1
 
0.5%
480843 1
 
0.5%
480842 2
 
1.1%
480826 1
 
0.5%
480800 1
 
0.5%
480090 6
3.2%
Distinct188
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-11T06:41:23.641458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length77
Median length50
Mean length36.010638
Min length19

Characters and Unicode

Total characters6770
Distinct characters329
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique188 ?
Unique (%)100.0%

Sample

1st row경기도 고양시 일산동구 중앙로1261번길 55 (장항동,장항동근생1 115호)
2nd row경기도 고양시 일산서구 가좌4로 12-15, 1(일부)층 (가좌동, 차우빌 1동 )
3rd row경기도 고양시 일산서구 성저로 78, 1층 (대화동)
4th row경기도 고양시 일산서구 한류월드로 300, 원마운트 2040~2043, 2089호 (대화동)
5th row경기도 고양시 일산서구 가좌로 30, 1(일부)층 (가좌동, 진풍빌딩)
ValueCountFrequency (%)
경기도 188
 
13.6%
1층 66
 
4.8%
성남시 24
 
1.7%
2층 23
 
1.7%
분당구 17
 
1.2%
파주시 17
 
1.2%
김포시 15
 
1.1%
화성시 15
 
1.1%
일부 15
 
1.1%
하남시 13
 
0.9%
Other values (624) 985
71.5%
2023-12-11T06:41:24.104969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1192
 
17.6%
1 358
 
5.3%
, 257
 
3.8%
224
 
3.3%
201
 
3.0%
198
 
2.9%
2 198
 
2.9%
194
 
2.9%
191
 
2.8%
) 187
 
2.8%
Other values (319) 3570
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3637
53.7%
Decimal Number 1209
 
17.9%
Space Separator 1192
 
17.6%
Other Punctuation 258
 
3.8%
Close Punctuation 187
 
2.8%
Open Punctuation 187
 
2.8%
Dash Punctuation 47
 
0.7%
Uppercase Letter 27
 
0.4%
Lowercase Letter 17
 
0.3%
Math Symbol 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
224
 
6.2%
201
 
5.5%
198
 
5.4%
194
 
5.3%
191
 
5.3%
174
 
4.8%
144
 
4.0%
91
 
2.5%
85
 
2.3%
85
 
2.3%
Other values (275) 2050
56.4%
Uppercase Letter
ValueCountFrequency (%)
A 6
22.2%
B 5
18.5%
G 2
 
7.4%
J 2
 
7.4%
C 2
 
7.4%
V 1
 
3.7%
T 1
 
3.7%
K 1
 
3.7%
P 1
 
3.7%
L 1
 
3.7%
Other values (5) 5
18.5%
Lowercase Letter
ValueCountFrequency (%)
l 4
23.5%
e 2
11.8%
a 2
11.8%
i 2
11.8%
f 1
 
5.9%
y 1
 
5.9%
r 1
 
5.9%
c 1
 
5.9%
s 1
 
5.9%
n 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 358
29.6%
2 198
16.4%
0 144
11.9%
3 101
 
8.4%
5 93
 
7.7%
4 90
 
7.4%
6 78
 
6.5%
7 59
 
4.9%
8 46
 
3.8%
9 42
 
3.5%
Other Punctuation
ValueCountFrequency (%)
, 257
99.6%
. 1
 
0.4%
Space Separator
ValueCountFrequency (%)
1192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 187
100.0%
Open Punctuation
ValueCountFrequency (%)
( 187
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3637
53.7%
Common 3087
45.6%
Latin 46
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
224
 
6.2%
201
 
5.5%
198
 
5.4%
194
 
5.3%
191
 
5.3%
174
 
4.8%
144
 
4.0%
91
 
2.5%
85
 
2.3%
85
 
2.3%
Other values (275) 2050
56.4%
Latin
ValueCountFrequency (%)
A 6
 
13.0%
B 5
 
10.9%
l 4
 
8.7%
G 2
 
4.3%
e 2
 
4.3%
a 2
 
4.3%
2
 
4.3%
i 2
 
4.3%
J 2
 
4.3%
C 2
 
4.3%
Other values (17) 17
37.0%
Common
ValueCountFrequency (%)
1192
38.6%
1 358
 
11.6%
, 257
 
8.3%
2 198
 
6.4%
) 187
 
6.1%
( 187
 
6.1%
0 144
 
4.7%
3 101
 
3.3%
5 93
 
3.0%
4 90
 
2.9%
Other values (7) 280
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3637
53.7%
ASCII 3131
46.2%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1192
38.1%
1 358
 
11.4%
, 257
 
8.2%
2 198
 
6.3%
) 187
 
6.0%
( 187
 
6.0%
0 144
 
4.6%
3 101
 
3.2%
5 93
 
3.0%
4 90
 
2.9%
Other values (33) 324
 
10.3%
Hangul
ValueCountFrequency (%)
224
 
6.2%
201
 
5.5%
198
 
5.4%
194
 
5.3%
191
 
5.3%
174
 
4.8%
144
 
4.0%
91
 
2.5%
85
 
2.3%
85
 
2.3%
Other values (275) 2050
56.4%
Number Forms
ValueCountFrequency (%)
2
100.0%
Distinct187
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-11T06:41:24.369544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length42
Mean length29.06383
Min length5

Characters and Unicode

Total characters5464
Distinct characters281
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique186 ?
Unique (%)98.9%

Sample

1st row경기도 고양시 일산동구 장항동 861번지 장항동근생1 115호
2nd row경기도 고양시 일산서구 가좌동 397-1번지 차우빌 1동, 1층일부
3rd row경기도 고양시 일산서구 대화동 2072-8번지
4th row경기도 고양시 일산서구 대화동 2606번지 원마운트 2040~2043, 2089호
5th row경기도 고양시 일산서구 가좌동 461-27번지 진풍빌딩
ValueCountFrequency (%)
경기도 187
 
16.6%
1층 36
 
3.2%
성남시 24
 
2.1%
2층 18
 
1.6%
파주시 17
 
1.5%
분당구 17
 
1.5%
화성시 15
 
1.3%
김포시 15
 
1.3%
일부 15
 
1.3%
수원시 13
 
1.2%
Other values (500) 770
68.3%
2023-12-11T06:41:24.748896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
939
 
17.2%
1 313
 
5.7%
218
 
4.0%
208
 
3.8%
198
 
3.6%
190
 
3.5%
189
 
3.5%
188
 
3.4%
187
 
3.4%
2 177
 
3.2%
Other values (271) 2657
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3095
56.6%
Decimal Number 1180
 
21.6%
Space Separator 939
 
17.2%
Dash Punctuation 144
 
2.6%
Other Punctuation 43
 
0.8%
Uppercase Letter 24
 
0.4%
Lowercase Letter 13
 
0.2%
Open Punctuation 10
 
0.2%
Close Punctuation 9
 
0.2%
Math Symbol 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
218
 
7.0%
208
 
6.7%
198
 
6.4%
190
 
6.1%
189
 
6.1%
188
 
6.1%
187
 
6.0%
94
 
3.0%
83
 
2.7%
78
 
2.5%
Other values (231) 1462
47.2%
Uppercase Letter
ValueCountFrequency (%)
A 6
25.0%
B 4
16.7%
J 2
 
8.3%
C 2
 
8.3%
T 1
 
4.2%
G 1
 
4.2%
V 1
 
4.2%
H 1
 
4.2%
K 1
 
4.2%
P 1
 
4.2%
Other values (4) 4
16.7%
Decimal Number
ValueCountFrequency (%)
1 313
26.5%
2 177
15.0%
0 130
11.0%
3 120
 
10.2%
4 93
 
7.9%
6 90
 
7.6%
5 76
 
6.4%
8 70
 
5.9%
7 63
 
5.3%
9 48
 
4.1%
Lowercase Letter
ValueCountFrequency (%)
l 4
30.8%
i 2
15.4%
s 1
 
7.7%
w 1
 
7.7%
n 1
 
7.7%
a 1
 
7.7%
e 1
 
7.7%
r 1
 
7.7%
y 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 42
97.7%
. 1
 
2.3%
Space Separator
ValueCountFrequency (%)
939
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3095
56.6%
Common 2332
42.7%
Latin 37
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
218
 
7.0%
208
 
6.7%
198
 
6.4%
190
 
6.1%
189
 
6.1%
188
 
6.1%
187
 
6.0%
94
 
3.0%
83
 
2.7%
78
 
2.5%
Other values (231) 1462
47.2%
Latin
ValueCountFrequency (%)
A 6
16.2%
l 4
 
10.8%
B 4
 
10.8%
J 2
 
5.4%
C 2
 
5.4%
i 2
 
5.4%
s 1
 
2.7%
T 1
 
2.7%
w 1
 
2.7%
n 1
 
2.7%
Other values (13) 13
35.1%
Common
ValueCountFrequency (%)
939
40.3%
1 313
 
13.4%
2 177
 
7.6%
- 144
 
6.2%
0 130
 
5.6%
3 120
 
5.1%
4 93
 
4.0%
6 90
 
3.9%
5 76
 
3.3%
8 70
 
3.0%
Other values (7) 180
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3095
56.6%
ASCII 2369
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
939
39.6%
1 313
 
13.2%
2 177
 
7.5%
- 144
 
6.1%
0 130
 
5.5%
3 120
 
5.1%
4 93
 
3.9%
6 90
 
3.8%
5 76
 
3.2%
8 70
 
3.0%
Other values (30) 217
 
9.2%
Hangul
ValueCountFrequency (%)
218
 
7.0%
208
 
6.7%
198
 
6.4%
190
 
6.1%
189
 
6.1%
188
 
6.1%
187
 
6.0%
94
 
3.0%
83
 
2.7%
78
 
2.5%
Other values (231) 1462
47.2%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct182
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.458744
Minimum36.989116
Maximum38.027848
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T06:41:24.922165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.989116
5-th percentile37.124508
Q137.280862
median37.468018
Q337.651514
95-th percentile37.751527
Maximum38.027848
Range1.0387313
Interquartile range (IQR)0.37065225

Descriptive statistics

Standard deviation0.21956483
Coefficient of variation (CV)0.0058615106
Kurtosis-0.79121261
Mean37.458744
Median Absolute Deviation (MAD)0.18663129
Skewness-0.081458695
Sum7042.2439
Variance0.048208713
MonotonicityNot monotonic
2023-12-11T06:41:25.052575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.545418129 4
 
2.1%
37.3830147472 2
 
1.1%
37.4734793704 2
 
1.1%
37.6549390097 2
 
1.1%
37.7457780674 1
 
0.5%
37.7508734813 1
 
0.5%
37.7454473184 1
 
0.5%
37.7392148652 1
 
0.5%
37.7453175447 1
 
0.5%
37.7638382679 1
 
0.5%
Other values (172) 172
91.5%
ValueCountFrequency (%)
36.9891162735 1
0.5%
36.989935634 1
0.5%
36.9934240262 1
0.5%
36.9949017569 1
0.5%
36.9997358206 1
0.5%
37.0001128972 1
0.5%
37.004111327 1
0.5%
37.0218110311 1
0.5%
37.0902207369 1
0.5%
37.1215004001 1
0.5%
ValueCountFrequency (%)
38.0278475587 1
0.5%
37.8926917176 1
0.5%
37.8372621422 1
0.5%
37.8130583508 1
0.5%
37.7946697876 1
0.5%
37.7938799288 1
0.5%
37.7881893153 1
0.5%
37.7711997916 1
0.5%
37.7638382679 1
0.5%
37.7518789148 1
0.5%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct182
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.99311
Minimum126.62608
Maximum127.49105
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T06:41:25.181040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.62608
5-th percentile126.67892
Q1126.80801
median127.06176
Q3127.11395
95-th percentile127.22786
Maximum127.49105
Range0.86497319
Interquartile range (IQR)0.30593755

Descriptive statistics

Standard deviation0.18878629
Coefficient of variation (CV)0.0014865869
Kurtosis-0.82628711
Mean126.99311
Median Absolute Deviation (MAD)0.10120963
Skewness-0.25640894
Sum23874.705
Variance0.035640264
MonotonicityNot monotonic
2023-12-11T06:41:25.328142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.2237611843 4
 
2.1%
127.1214981282 2
 
1.1%
127.1426886465 2
 
1.1%
126.6762528915 2
 
1.1%
127.0959465888 1
 
0.5%
127.0448258106 1
 
0.5%
127.0941209284 1
 
0.5%
127.0483361667 1
 
0.5%
127.095699378 1
 
0.5%
127.0185652891 1
 
0.5%
Other values (172) 172
91.5%
ValueCountFrequency (%)
126.6260787927 1
0.5%
126.6263036106 1
0.5%
126.627783433 1
0.5%
126.6434009504 1
0.5%
126.6610641872 1
0.5%
126.6654193928 1
0.5%
126.6682027313 1
0.5%
126.6762528915 2
1.1%
126.6785604873 1
0.5%
126.6795904732 1
0.5%
ValueCountFrequency (%)
127.4910519874 1
0.5%
127.4472291102 1
0.5%
127.4443188941 1
0.5%
127.2946111152 1
0.5%
127.2517671714 1
0.5%
127.2512950721 1
0.5%
127.2441040616 1
0.5%
127.2411922947 1
0.5%
127.2369941715 1
0.5%
127.2300715287 1
0.5%

Interactions

2023-12-11T06:41:19.637731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.027867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.420279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.843838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.242412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.956458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.105755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.513842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.921223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.317291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:20.045592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.195728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.597290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.002176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.397212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:20.116075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.269524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.673808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.093146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.480783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:20.220630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.347105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:18.764881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.168984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:19.564392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:41:25.423728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명인허가일자영업상태명폐업일자다중이용업소여부소재지우편번호WGS84위도WGS84경도
시군명1.0000.5950.2420.0000.1390.9970.9820.955
인허가일자0.5951.0000.2250.4310.0000.0700.2000.305
영업상태명0.2420.2251.000NaN0.0000.0000.0990.070
폐업일자0.0000.431NaN1.0000.4470.2640.0000.271
다중이용업소여부0.1390.0000.0000.4471.0000.0670.1510.133
소재지우편번호0.9970.0700.0000.2640.0671.0000.8470.914
WGS84위도0.9820.2000.0990.0000.1510.8471.0000.658
WGS84경도0.9550.3050.0700.2710.1330.9140.6581.000
2023-12-11T06:41:25.530542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위생업종명위생업태명다중이용업소여부시군명영업상태명
위생업종명1.0001.0001.0001.0001.000
위생업태명1.0001.0001.0001.0001.000
다중이용업소여부1.0001.0001.0000.1100.000
시군명1.0001.0000.1101.0000.194
영업상태명1.0001.0000.0000.1941.000
2023-12-11T06:41:25.623363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자폐업일자소재지우편번호WGS84위도WGS84경도시군명영업상태명다중이용업소여부위생업종명위생업태명
인허가일자1.0000.487-0.1080.252-0.1490.2710.1690.0001.0001.000
폐업일자0.4871.0000.0450.0680.1680.2891.0000.3201.0001.000
소재지우편번호-0.1080.0451.0000.0350.7550.8780.0000.1161.0001.000
WGS84위도0.2520.0680.0351.000-0.2410.8230.0730.1121.0001.000
WGS84경도-0.1490.1680.755-0.2411.0000.7510.0670.1301.0001.000
시군명0.2710.2890.8780.8230.7511.0000.1940.1101.0001.000
영업상태명0.1691.0000.0000.0730.0670.1941.0000.0001.0001.000
다중이용업소여부0.0000.3200.1160.1120.1300.1100.0001.0001.0001.000
위생업종명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
위생업태명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2023-12-11T06:41:20.372962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:41:20.587368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:41:20.720388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지우편번호소재지도로명주소소재지지번주소WGS84위도WGS84경도
0고양시화류20111012운영중<NA>N<NA>일반음식점패밀리레스트랑410837경기도 고양시 일산동구 중앙로1261번길 55 (장항동,장항동근생1 115호)경기도 고양시 일산동구 장항동 861번지 장항동근생1 115호37.657581126.770854
1고양시피카(FIKA)20170501운영중<NA>N<NA>일반음식점패밀리레스트랑411440경기도 고양시 일산서구 가좌4로 12-15, 1(일부)층 (가좌동, 차우빌 1동 )경기도 고양시 일산서구 가좌동 397-1번지 차우빌 1동, 1층일부37.686458126.718628
2고양시유로코피자 일산서구점20180712운영중<NA>N<NA>일반음식점패밀리레스트랑411803경기도 고양시 일산서구 성저로 78, 1층 (대화동)경기도 고양시 일산서구 대화동 2072-8번지37.685165126.755682
3고양시(주)난리피자푸드20180427운영중<NA>Y<NA>일반음식점패밀리레스트랑411410경기도 고양시 일산서구 한류월드로 300, 원마운트 2040~2043, 2089호 (대화동)경기도 고양시 일산서구 대화동 2606번지 원마운트 2040~2043, 2089호37.664554126.754527
4고양시수비드하우스20170102운영중<NA>N<NA>일반음식점패밀리레스트랑411440경기도 고양시 일산서구 가좌로 30, 1(일부)층 (가좌동, 진풍빌딩)경기도 고양시 일산서구 가좌동 461-27번지 진풍빌딩37.691532126.722726
5과천시AW F&B20160912폐업 등20171103N<NA>일반음식점패밀리레스트랑13822경기도 과천시 경마공원대로 107 (과천동)한국마사회37.443091127.017519
6광주시무쏘20160617운영중<NA>Y<NA>일반음식점패밀리레스트랑464802경기도 광주시 문화로 102, 3층 (경안동)경기도 광주시 경안동 153-2번지 3층37.412213127.251767
7구리시아웃백스테이크하우스구리점20060102운영중<NA>N<NA>일반음식점패밀리레스트랑471823경기도 구리시 경춘로 274, 1,2층 (수택동)경기도 구리시 수택동 534-2번지 성원메스티지상가 1층3호, 2층37.602499127.145449
8구리시(주)이랜드파크 자연별곡20060621운영중<NA><NA><NA><NA><NA>471010경기도 구리시 경춘로 239 (인창동,두진에코맥스(3층)327-345,347-368호)경기도 구리시 인창동 676-1번지 두진에코맥스(3층)327-345,347-368호37.601604127.141646
9구리시애슐리구리20101025운영중<NA><NA><NA><NA><NA>471832경기도 구리시 건원대로 3 (인창동,흥화브라운오피스텔 2층)경기도 구리시 인창동 266-8번지 흥화브라운오피스텔 2층37.601421127.140639
시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지우편번호소재지도로명주소소재지지번주소WGS84위도WGS84경도
178화성시임실N치즈피자20151026운영중<NA>N<NA>일반음식점패밀리레스트랑445130경기도 화성시 지산2길 4, 101호 (영천동)경기도 화성시 영천동 651-1269번지 101호37.206052127.112782
179화성시혜화동돈까스20151216운영중<NA>N<NA>일반음식점패밀리레스트랑445913경기도 화성시 팔탄면 노하길 518, 1층경기도 화성시 팔탄면 율암리 483-14번지 외 2필지37.166595126.88081
180화성시텐바20171205운영중<NA>Y<NA>일반음식점패밀리레스트랑445160경기도 화성시 동탄중심상가2길 7, 5층 501호 (반송동, 씨티플러스)경기도 화성시 반송동 88-11번지 씨티플러스 5층 501호37.205897127.072946
181화성시맘스터치화성남양점20171222운영중<NA>N<NA>일반음식점패밀리레스트랑445851경기도 화성시 남양읍 시청로 10, 1동 1층 일부(103호,104호일부)경기도 화성시 남양읍 남양리 1187번지 1동 1층 일부(103호,104호일부)37.208187126.818634
182화성시피니치니20150923운영중<NA>N<NA>일반음식점패밀리레스트랑445360경기도 화성시 병점2로 145, 102호 (병점동)경기도 화성시 병점동 850번지 102호37.212106127.050427
183화성시계림원동탄반송점20150910운영중<NA>N<NA>일반음식점패밀리레스트랑445160경기도 화성시 동탄반송2길 25-11, 1층 (반송동)경기도 화성시 반송동 31-1번지 1층37.209571127.064461
184화성시한국피자헛 동동탄점20171208운영중<NA>N<NA>일반음식점패밀리레스트랑445130경기도 화성시 지산2길 4-17, 1층 전부호 (영천동)경기도 화성시 영천동 701-12번지 1층 전부호37.205645127.112401
185화성시7번가피자동탄점20150716운영중<NA>N<NA>일반음식점패밀리레스트랑445160경기도 화성시 동탄반송길 16, 1층 일부호 (반송동)경기도 화성시 반송동 34-4번지 1층(102호) 일부호37.209588127.064101
186화성시슈퍼스테이크20150903폐업 등20160420Y<NA>일반음식점패밀리레스트랑445160경기도 화성시 동탄중앙로 220, 상가동 305일부호 (반송동)경기도 화성시 반송동 96번지 상가동 305일부호37.204538127.068491
187화성시난타5000피자 동탄청계점20151020폐업 등20180816N<NA>일반음식점패밀리레스트랑445140경기도 화성시 동탄대로시범길 193, 105호 (청계동)경기도 화성시 청계동 510-31번지 105호37.201414127.108524