Overview

Dataset statistics

Number of variables15
Number of observations1333
Missing cells1958
Missing cells (%)9.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory160.2 KiB
Average record size in memory123.1 B

Variable types

Categorical3
Numeric3
DateTime3
Text5
Boolean1

Dataset

Description2019년 인천광역시 강화군_식품위생업소(일반음식점)현황 데이터로 인허가번호,인허가일자 업소명 소재지 항목등을 제공
Author인천광역시 강화군
URLhttps://www.data.go.kr/data/15047502/fileData.do

Alerts

업종명 has constant value ""Constant
지위승계여부 has constant value ""Constant
우편번호(도로명) is highly overall correlated with 행정동명High correlation
행정동명 is highly overall correlated with 우편번호(도로명)High correlation
소재지(도로명) has 228 (17.1%) missing valuesMissing
소재지전화 has 297 (22.3%) missing valuesMissing
우편번호(도로명) has 233 (17.5%) missing valuesMissing
지위승계여부 has 1195 (89.6%) missing valuesMissing
인허가번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:51:43.361632
Analysis finished2023-12-12 04:51:45.855169
Duration2.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
일반음식점
1333 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 1333
100.0%

Length

2023-12-12T13:51:45.915878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:51:46.005155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 1333
100.0%

인허가번호
Real number (ℝ)

UNIQUE 

Distinct1333
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0039303 × 1010
Minimum1.9810211 × 1010
Maximum2.0160211 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2023-12-12T13:51:46.096986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.9810211 × 1010
5-th percentile1.9850211 × 1010
Q11.9970211 × 1010
median2.0070211 × 1010
Q32.0110211 × 1010
95-th percentile2.0150211 × 1010
Maximum2.0160211 × 1010
Range3.5000015 × 108
Interquartile range (IQR)1.4000003 × 108

Descriptive statistics

Standard deviation91467789
Coefficient of variation (CV)0.0045644196
Kurtosis-0.5373262
Mean2.0039303 × 1010
Median Absolute Deviation (MAD)60000075
Skewness-0.67191641
Sum2.6712391 × 1013
Variance8.3663564 × 1015
MonotonicityNot monotonic
2023-12-12T13:51:46.229719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19810211010 1
 
0.1%
20100211070 1
 
0.1%
20100211068 1
 
0.1%
20100211067 1
 
0.1%
20100211066 1
 
0.1%
20100211064 1
 
0.1%
20100211051 1
 
0.1%
20100211048 1
 
0.1%
20100211046 1
 
0.1%
20100211044 1
 
0.1%
Other values (1323) 1323
99.2%
ValueCountFrequency (%)
19810211002 1
0.1%
19810211003 1
0.1%
19810211005 1
0.1%
19810211007 1
0.1%
19810211008 1
0.1%
19810211009 1
0.1%
19810211010 1
0.1%
19840211027 1
0.1%
19840211028 1
0.1%
19840211029 1
0.1%
ValueCountFrequency (%)
20160211154 1
0.1%
20160211152 1
0.1%
20160211151 1
0.1%
20160211149 1
0.1%
20160211147 1
0.1%
20160211146 1
0.1%
20160211145 1
0.1%
20160211137 1
0.1%
20160211135 1
0.1%
20160211131 1
0.1%
Distinct1021
Distinct (%)76.6%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
Minimum1981-09-21 00:00:00
Maximum2016-07-29 00:00:00
2023-12-12T13:51:46.603968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:46.708828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1295
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
2023-12-12T13:51:46.962589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length4.9692423
Min length1

Characters and Unicode

Total characters6624
Distinct characters597
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1261 ?
Unique (%)94.6%

Sample

1st row원조영양탕
2nd row진주
3rd row푸른집
4th row서울식당
5th row서해안숯불장어구이
ValueCountFrequency (%)
카페 7
 
0.5%
5
 
0.3%
강화점 5
 
0.3%
마니산 4
 
0.3%
우리식당 4
 
0.3%
밥집 4
 
0.3%
강화도령 3
 
0.2%
행복한 3
 
0.2%
치킨 3
 
0.2%
덕성호 3
 
0.2%
Other values (1395) 1451
97.3%
2023-12-12T13:51:47.350800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
225
 
3.4%
202
 
3.0%
159
 
2.4%
138
 
2.1%
131
 
2.0%
109
 
1.6%
104
 
1.6%
95
 
1.4%
91
 
1.4%
91
 
1.4%
Other values (587) 5279
79.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6156
92.9%
Space Separator 159
 
2.4%
Lowercase Letter 97
 
1.5%
Uppercase Letter 82
 
1.2%
Decimal Number 57
 
0.9%
Close Punctuation 27
 
0.4%
Open Punctuation 27
 
0.4%
Other Punctuation 13
 
0.2%
Dash Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
225
 
3.7%
202
 
3.3%
138
 
2.2%
131
 
2.1%
109
 
1.8%
104
 
1.7%
95
 
1.5%
91
 
1.5%
91
 
1.5%
87
 
1.4%
Other values (526) 4883
79.3%
Uppercase Letter
ValueCountFrequency (%)
B 8
 
9.8%
T 7
 
8.5%
E 7
 
8.5%
N 7
 
8.5%
S 7
 
8.5%
A 6
 
7.3%
O 5
 
6.1%
I 5
 
6.1%
C 4
 
4.9%
F 4
 
4.9%
Other values (10) 22
26.8%
Lowercase Letter
ValueCountFrequency (%)
e 16
16.5%
o 10
10.3%
a 8
 
8.2%
f 7
 
7.2%
s 6
 
6.2%
r 6
 
6.2%
n 6
 
6.2%
t 5
 
5.2%
b 5
 
5.2%
l 4
 
4.1%
Other values (9) 24
24.7%
Decimal Number
ValueCountFrequency (%)
2 15
26.3%
1 10
17.5%
0 9
15.8%
7 5
 
8.8%
9 5
 
8.8%
8 5
 
8.8%
5 5
 
8.8%
6 1
 
1.8%
4 1
 
1.8%
3 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 4
30.8%
. 2
15.4%
' 2
15.4%
: 1
 
7.7%
· 1
 
7.7%
@ 1
 
7.7%
, 1
 
7.7%
! 1
 
7.7%
Space Separator
ValueCountFrequency (%)
159
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6155
92.9%
Common 289
 
4.4%
Latin 179
 
2.7%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
225
 
3.7%
202
 
3.3%
138
 
2.2%
131
 
2.1%
109
 
1.8%
104
 
1.7%
95
 
1.5%
91
 
1.5%
91
 
1.5%
87
 
1.4%
Other values (525) 4882
79.3%
Latin
ValueCountFrequency (%)
e 16
 
8.9%
o 10
 
5.6%
B 8
 
4.5%
a 8
 
4.5%
T 7
 
3.9%
E 7
 
3.9%
f 7
 
3.9%
N 7
 
3.9%
S 7
 
3.9%
s 6
 
3.4%
Other values (29) 96
53.6%
Common
ValueCountFrequency (%)
159
55.0%
) 27
 
9.3%
( 27
 
9.3%
2 15
 
5.2%
1 10
 
3.5%
0 9
 
3.1%
- 6
 
2.1%
7 5
 
1.7%
9 5
 
1.7%
8 5
 
1.7%
Other values (12) 21
 
7.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6155
92.9%
ASCII 467
 
7.1%
CJK 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
225
 
3.7%
202
 
3.3%
138
 
2.2%
131
 
2.1%
109
 
1.8%
104
 
1.7%
95
 
1.5%
91
 
1.5%
91
 
1.5%
87
 
1.4%
Other values (525) 4882
79.3%
ASCII
ValueCountFrequency (%)
159
34.0%
) 27
 
5.8%
( 27
 
5.8%
e 16
 
3.4%
2 15
 
3.2%
o 10
 
2.1%
1 10
 
2.1%
0 9
 
1.9%
B 8
 
1.7%
a 8
 
1.7%
Other values (50) 178
38.1%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

소재지(도로명)
Text

MISSING 

Distinct1007
Distinct (%)91.1%
Missing228
Missing (%)17.1%
Memory size10.5 KiB
2023-12-12T13:51:47.598422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length46
Mean length25.620814
Min length19

Characters and Unicode

Total characters28311
Distinct characters153
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique946 ?
Unique (%)85.6%

Sample

1st row인천광역시 강화군 강화읍 강화대로 254-1
2nd row인천광역시 강화군 길상면 해안동로 96-9 (외 2필지(616-10,616-34))
3rd row인천광역시 강화군 길상면 온수길 44
4th row인천광역시 강화군 양도면 강화남로769번길 71
5th row인천광역시 강화군 강화읍 강화대로403번길 4-1, 1층
ValueCountFrequency (%)
인천광역시 1105
18.2%
강화군 1105
18.2%
강화읍 353
 
5.8%
1층 191
 
3.2%
길상면 187
 
3.1%
화도면 162
 
2.7%
해안남로 134
 
2.2%
중앙로 127
 
2.1%
강화대로 96
 
1.6%
선원면 87
 
1.4%
Other values (912) 2515
41.5%
2023-12-12T13:51:48.084635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4967
17.5%
1877
 
6.6%
1706
 
6.0%
1150
 
4.1%
1 1109
 
3.9%
1106
 
3.9%
1106
 
3.9%
1105
 
3.9%
1105
 
3.9%
1105
 
3.9%
Other values (143) 11975
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17728
62.6%
Space Separator 4967
 
17.5%
Decimal Number 4656
 
16.4%
Dash Punctuation 277
 
1.0%
Open Punctuation 224
 
0.8%
Close Punctuation 224
 
0.8%
Other Punctuation 211
 
0.7%
Math Symbol 18
 
0.1%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1877
 
10.6%
1706
 
9.6%
1150
 
6.5%
1106
 
6.2%
1106
 
6.2%
1105
 
6.2%
1105
 
6.2%
1105
 
6.2%
910
 
5.1%
752
 
4.2%
Other values (124) 5806
32.8%
Decimal Number
ValueCountFrequency (%)
1 1109
23.8%
2 692
14.9%
3 466
10.0%
4 437
 
9.4%
0 363
 
7.8%
9 357
 
7.7%
5 332
 
7.1%
7 324
 
7.0%
6 303
 
6.5%
8 273
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 3
50.0%
B 2
33.3%
N 1
 
16.7%
Space Separator
ValueCountFrequency (%)
4967
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 277
100.0%
Open Punctuation
ValueCountFrequency (%)
( 224
100.0%
Close Punctuation
ValueCountFrequency (%)
) 224
100.0%
Other Punctuation
ValueCountFrequency (%)
, 211
100.0%
Math Symbol
ValueCountFrequency (%)
~ 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17728
62.6%
Common 10577
37.4%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1877
 
10.6%
1706
 
9.6%
1150
 
6.5%
1106
 
6.2%
1106
 
6.2%
1105
 
6.2%
1105
 
6.2%
1105
 
6.2%
910
 
5.1%
752
 
4.2%
Other values (124) 5806
32.8%
Common
ValueCountFrequency (%)
4967
47.0%
1 1109
 
10.5%
2 692
 
6.5%
3 466
 
4.4%
4 437
 
4.1%
0 363
 
3.4%
9 357
 
3.4%
5 332
 
3.1%
7 324
 
3.1%
6 303
 
2.9%
Other values (6) 1227
 
11.6%
Latin
ValueCountFrequency (%)
A 3
50.0%
B 2
33.3%
N 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17728
62.6%
ASCII 10583
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4967
46.9%
1 1109
 
10.5%
2 692
 
6.5%
3 466
 
4.4%
4 437
 
4.1%
0 363
 
3.4%
9 357
 
3.4%
5 332
 
3.1%
7 324
 
3.1%
6 303
 
2.9%
Other values (9) 1233
 
11.7%
Hangul
ValueCountFrequency (%)
1877
 
10.6%
1706
 
9.6%
1150
 
6.5%
1106
 
6.2%
1106
 
6.2%
1105
 
6.2%
1105
 
6.2%
1105
 
6.2%
910
 
5.1%
752
 
4.2%
Other values (124) 5806
32.8%
Distinct1203
Distinct (%)90.2%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
2023-12-12T13:51:48.461562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length53
Mean length28.726182
Min length4

Characters and Unicode

Total characters38292
Distinct characters152
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1124 ?
Unique (%)84.3%

Sample

1st row인천광역시 강화군 강화읍 신문리 32번지
2nd row인천광역시 강화군 송해면 상도리 415번지
3rd row인천광역시 강화군 강화읍 갑곳리 364번지
4th row인천광역시 강화군 길상면 온수리 665번지
5th row인천광역시 강화군 길상면 초지리 616번지 4호 외 2필지(616-10,616-34)
ValueCountFrequency (%)
인천광역시 1328
 
16.5%
강화군 1328
 
16.5%
강화읍 404
 
5.0%
길상면 245
 
3.0%
1호 216
 
2.7%
화도면 177
 
2.2%
신문리 117
 
1.5%
갑곳리 115
 
1.4%
2호 108
 
1.3%
삼산면 104
 
1.3%
Other values (950) 3900
48.5%
2023-12-12T13:51:49.136506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9087
23.7%
1939
 
5.1%
1736
 
4.5%
1523
 
4.0%
1365
 
3.6%
1346
 
3.5%
1342
 
3.5%
1341
 
3.5%
1329
 
3.5%
1328
 
3.5%
Other values (142) 15956
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22854
59.7%
Space Separator 9087
 
23.7%
Decimal Number 6060
 
15.8%
Dash Punctuation 122
 
0.3%
Close Punctuation 52
 
0.1%
Open Punctuation 52
 
0.1%
Other Punctuation 46
 
0.1%
Math Symbol 13
 
< 0.1%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1939
 
8.5%
1736
 
7.6%
1523
 
6.7%
1365
 
6.0%
1346
 
5.9%
1342
 
5.9%
1341
 
5.9%
1329
 
5.8%
1328
 
5.8%
1328
 
5.8%
Other values (121) 8277
36.2%
Decimal Number
ValueCountFrequency (%)
1 1285
21.2%
2 852
14.1%
4 623
10.3%
3 623
10.3%
5 618
10.2%
6 548
9.0%
8 431
 
7.1%
9 378
 
6.2%
0 378
 
6.2%
7 324
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
B 2
33.3%
J 1
16.7%
M 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 35
76.1%
. 11
 
23.9%
Space Separator
ValueCountFrequency (%)
9087
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 122
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22854
59.7%
Common 15432
40.3%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1939
 
8.5%
1736
 
7.6%
1523
 
6.7%
1365
 
6.0%
1346
 
5.9%
1342
 
5.9%
1341
 
5.9%
1329
 
5.8%
1328
 
5.8%
1328
 
5.8%
Other values (121) 8277
36.2%
Common
ValueCountFrequency (%)
9087
58.9%
1 1285
 
8.3%
2 852
 
5.5%
4 623
 
4.0%
3 623
 
4.0%
5 618
 
4.0%
6 548
 
3.6%
8 431
 
2.8%
9 378
 
2.4%
0 378
 
2.4%
Other values (7) 609
 
3.9%
Latin
ValueCountFrequency (%)
A 2
33.3%
B 2
33.3%
J 1
16.7%
M 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22854
59.7%
ASCII 15438
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9087
58.9%
1 1285
 
8.3%
2 852
 
5.5%
4 623
 
4.0%
3 623
 
4.0%
5 618
 
4.0%
6 548
 
3.5%
8 431
 
2.8%
9 378
 
2.4%
0 378
 
2.4%
Other values (11) 615
 
4.0%
Hangul
ValueCountFrequency (%)
1939
 
8.5%
1736
 
7.6%
1523
 
6.7%
1365
 
6.0%
1346
 
5.9%
1342
 
5.9%
1341
 
5.9%
1329
 
5.8%
1328
 
5.8%
1328
 
5.8%
Other values (121) 8277
36.2%

영업장면적
Real number (ℝ)

Distinct1082
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108.30651
Minimum0
Maximum2060
Zeros13
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2023-12-12T13:51:49.337959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile22.044
Q145.44
median82.5
Q3130.4
95-th percentile265.412
Maximum2060
Range2060
Interquartile range (IQR)84.96

Descriptive statistics

Standard deviation120.83541
Coefficient of variation (CV)1.11568
Kurtosis73.7042
Mean108.30651
Median Absolute Deviation (MAD)41.46
Skewness6.4502319
Sum144372.58
Variance14601.196
MonotonicityNot monotonic
2023-12-12T13:51:49.499135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65.1 15
 
1.1%
36.92 13
 
1.0%
0.0 13
 
1.0%
47.2 11
 
0.8%
24.0 9
 
0.7%
55.92 9
 
0.7%
32.64 8
 
0.6%
56.0 8
 
0.6%
90.02 8
 
0.6%
30.8 6
 
0.5%
Other values (1072) 1233
92.5%
ValueCountFrequency (%)
0.0 13
1.0%
9.2 1
 
0.1%
11.34 2
 
0.2%
12.58 1
 
0.1%
12.69 1
 
0.1%
12.73 1
 
0.1%
12.88 1
 
0.1%
13.02 1
 
0.1%
13.4 1
 
0.1%
13.5 1
 
0.1%
ValueCountFrequency (%)
2060.0 1
0.1%
1450.41 1
0.1%
1205.6 1
0.1%
1121.18 1
0.1%
753.75 1
0.1%
748.21 1
0.1%
691.6 1
0.1%
667.12 1
0.1%
654.3 1
0.1%
630.48 1
0.1%

소재지전화
Text

MISSING 

Distinct1009
Distinct (%)97.4%
Missing297
Missing (%)22.3%
Memory size10.5 KiB
2023-12-12T13:51:49.819074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.006757
Min length9

Characters and Unicode

Total characters12439
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique985 ?
Unique (%)95.1%

Sample

1st row032-934-3380
2nd row032-933-7766
3rd row032-937-0052
4th row032-037-0946
5th row032-937-7639
ValueCountFrequency (%)
032-932-4884 4
 
0.4%
032-937-2116 3
 
0.3%
032-937-6826 2
 
0.2%
032-933-1254 2
 
0.2%
032-937-9998 2
 
0.2%
032-932-6913 2
 
0.2%
032-933-3488 2
 
0.2%
032-937-6606 2
 
0.2%
032-933-7167 2
 
0.2%
032-937-1994 2
 
0.2%
Other values (999) 1013
97.8%
2023-12-12T13:51:50.364135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 2744
22.1%
- 2071
16.6%
2 1703
13.7%
0 1474
11.8%
9 1468
11.8%
7 775
 
6.2%
4 519
 
4.2%
8 485
 
3.9%
5 416
 
3.3%
1 398
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10368
83.4%
Dash Punctuation 2071
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 2744
26.5%
2 1703
16.4%
0 1474
14.2%
9 1468
14.2%
7 775
 
7.5%
4 519
 
5.0%
8 485
 
4.7%
5 416
 
4.0%
1 398
 
3.8%
6 386
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 2071
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12439
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 2744
22.1%
- 2071
16.6%
2 1703
13.7%
0 1474
11.8%
9 1468
11.8%
7 775
 
6.2%
4 519
 
4.2%
8 485
 
3.9%
5 416
 
3.3%
1 398
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12439
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 2744
22.1%
- 2071
16.6%
2 1703
13.7%
0 1474
11.8%
9 1468
11.8%
7 775
 
6.2%
4 519
 
4.2%
8 485
 
3.9%
5 416
 
3.3%
1 398
 
3.2%
Distinct987
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
Minimum1981-09-24 00:00:00
Maximum2016-07-29 00:00:00
2023-12-12T13:51:50.548017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:50.710713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

우편번호(도로명)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct59
Distinct (%)5.4%
Missing233
Missing (%)17.5%
Infinite0
Infinite (%)0.0%
Mean23038.52
Minimum23001
Maximum23062
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2023-12-12T13:51:50.914147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum23001
5-th percentile23007
Q123027
median23040
Q323052
95-th percentile23061
Maximum23062
Range61
Interquartile range (IQR)25

Descriptive statistics

Standard deviation16.933326
Coefficient of variation (CV)0.00073500062
Kurtosis-0.71044903
Mean23038.52
Median Absolute Deviation (MAD)12.5
Skewness-0.50386911
Sum25342372
Variance286.73754
MonotonicityNot monotonic
2023-12-12T13:51:51.121715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23050 68
 
5.1%
23035 65
 
4.9%
23060 63
 
4.7%
23007 57
 
4.3%
23031 50
 
3.8%
23052 49
 
3.7%
23027 49
 
3.7%
23054 48
 
3.6%
23059 42
 
3.2%
23037 40
 
3.0%
Other values (49) 569
42.7%
(Missing) 233
17.5%
ValueCountFrequency (%)
23001 4
 
0.3%
23002 25
1.9%
23004 5
 
0.4%
23005 6
 
0.5%
23006 6
 
0.5%
23007 57
4.3%
23008 6
 
0.5%
23010 3
 
0.2%
23011 1
 
0.1%
23012 1
 
0.1%
ValueCountFrequency (%)
23062 37
2.8%
23061 20
 
1.5%
23060 63
4.7%
23059 42
3.2%
23058 3
 
0.2%
23057 11
 
0.8%
23056 17
 
1.3%
23055 12
 
0.9%
23054 48
3.6%
23053 4
 
0.3%
Distinct55
Distinct (%)4.1%
Missing5
Missing (%)0.4%
Memory size10.5 KiB
2023-12-12T13:51:51.389136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters9296
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.5%

Sample

1st row417-943
2nd row417-814
3rd row417-801
4th row417-841
5th row417-843
ValueCountFrequency (%)
417-843 133
 
10.0%
417-841 89
 
6.7%
417-911 86
 
6.5%
417-862 79
 
5.9%
417-803 73
 
5.5%
417-861 69
 
5.2%
417-823 62
 
4.7%
417-807 61
 
4.6%
417-943 56
 
4.2%
417-942 52
 
3.9%
Other values (45) 568
42.8%
2023-12-12T13:51:51.828764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1855
20.0%
4 1795
19.3%
7 1424
15.3%
- 1328
14.3%
8 1046
11.3%
3 503
 
5.4%
2 405
 
4.4%
9 381
 
4.1%
0 269
 
2.9%
6 186
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7968
85.7%
Dash Punctuation 1328
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1855
23.3%
4 1795
22.5%
7 1424
17.9%
8 1046
13.1%
3 503
 
6.3%
2 405
 
5.1%
9 381
 
4.8%
0 269
 
3.4%
6 186
 
2.3%
5 104
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 1328
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9296
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1855
20.0%
4 1795
19.3%
7 1424
15.3%
- 1328
14.3%
8 1046
11.3%
3 503
 
5.4%
2 405
 
4.4%
9 381
 
4.1%
0 269
 
2.9%
6 186
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9296
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1855
20.0%
4 1795
19.3%
7 1424
15.3%
- 1328
14.3%
8 1046
11.3%
3 503
 
5.4%
2 405
 
4.4%
9 381
 
4.1%
0 269
 
2.9%
6 186
 
2.0%
Distinct1015
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
Minimum1981-09-21 00:00:00
Maximum2016-07-29 00:00:00
2023-12-12T13:51:51.998269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:52.175551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지위승계여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.7%
Missing1195
Missing (%)89.6%
Memory size2.7 KiB
True
138 
(Missing)
1195 
ValueCountFrequency (%)
True 138
 
10.4%
(Missing) 1195
89.6%
2023-12-12T13:51:52.292023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

행정동명
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
강화읍
404 
길상면
245 
화도면
177 
삼산면
104 
선원면
97 
Other values (9)
306 

Length

Max length4
Median length3
Mean length3.0037509
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강화읍
2nd row송해면
3rd row강화읍
4th row길상면
5th row길상면

Common Values

ValueCountFrequency (%)
강화읍 404
30.3%
길상면 245
18.4%
화도면 177
13.3%
삼산면 104
 
7.8%
선원면 97
 
7.3%
내가면 92
 
6.9%
양도면 53
 
4.0%
불은면 47
 
3.5%
교동면 39
 
2.9%
하점면 32
 
2.4%
Other values (4) 43
 
3.2%

Length

2023-12-12T13:51:52.411605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강화읍 404
30.3%
길상면 245
18.4%
화도면 177
13.3%
삼산면 104
 
7.8%
선원면 97
 
7.3%
내가면 92
 
6.9%
양도면 53
 
4.0%
불은면 47
 
3.5%
교동면 39
 
2.9%
하점면 32
 
2.4%
Other values (4) 43
 
3.2%

업태명
Categorical

Distinct19
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
한식
842 
일식
95 
회집
 
82
분식
 
50
호프/통닭
 
47
Other values (14)
217 

Length

Max length10
Median length2
Mean length2.4733683
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row탕류(보신용)
2nd row한식
3rd row한식
4th row한식
5th row일식

Common Values

ValueCountFrequency (%)
한식 842
63.2%
일식 95
 
7.1%
회집 82
 
6.2%
분식 50
 
3.8%
호프/통닭 47
 
3.5%
경양식 40
 
3.0%
까페 33
 
2.5%
통닭(치킨) 32
 
2.4%
중국식 31
 
2.3%
식육(숯불구이) 23
 
1.7%
Other values (9) 58
 
4.4%

Length

2023-12-12T13:51:52.584882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 842
63.2%
일식 95
 
7.1%
회집 82
 
6.2%
분식 50
 
3.8%
호프/통닭 47
 
3.5%
경양식 40
 
3.0%
까페 33
 
2.5%
통닭(치킨 32
 
2.4%
중국식 31
 
2.3%
식육(숯불구이 23
 
1.7%
Other values (9) 58
 
4.4%

Interactions

2023-12-12T13:51:45.056119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.453480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.767767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:45.171475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.564742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.865363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:45.269159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.659416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:51:44.958115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:51:52.695224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호영업장면적우편번호(도로명)우편번호(지번)행정동명업태명
인허가번호1.0000.0170.2720.4160.1450.347
영업장면적0.0171.0000.0720.0000.0000.510
우편번호(도로명)0.2720.0721.0000.9950.9390.383
우편번호(지번)0.4160.0000.9951.0001.0000.570
행정동명0.1450.0000.9391.0001.0000.403
업태명0.3470.5100.3830.5700.4031.000
2023-12-12T13:51:52.828724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업태명행정동명
업태명1.0000.151
행정동명0.1511.000
2023-12-12T13:51:52.931316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호영업장면적우편번호(도로명)행정동명업태명
인허가번호1.0000.0950.0760.0610.140
영업장면적0.0951.0000.1720.0000.255
우편번호(도로명)0.0760.1721.0000.7720.156
행정동명0.0610.0000.7721.0000.151
업태명0.1400.2550.1560.1511.000

Missing values

2023-12-12T13:51:45.414947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:51:45.609762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T13:51:45.763617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종명인허가번호인허가일자업소명소재지(도로명)소재지(지번)영업장면적소재지전화영업자시작일우편번호(도로명)우편번호(지번)소재지시작일지위승계여부행정동명업태명
0일반음식점198102110101981-11-28원조영양탕<NA>인천광역시 강화군 강화읍 신문리 32번지31.2032-934-33802004-04-13<NA>417-9431981-11-28<NA>강화읍탕류(보신용)
1일반음식점198102110051981-09-24진주<NA>인천광역시 강화군 송해면 상도리 415번지83.77<NA>1981-09-24<NA>417-8141981-09-24<NA>송해면한식
2일반음식점198102110071981-09-30푸른집인천광역시 강화군 강화읍 강화대로 254-1인천광역시 강화군 강화읍 갑곳리 364번지237.99032-933-77661981-09-3023025417-8011981-09-30<NA>강화읍한식
3일반음식점198102110021981-09-21서울식당<NA>인천광역시 강화군 길상면 온수리 665번지47.52032-937-00522010-06-03<NA>417-8411981-09-21<NA>길상면한식
4일반음식점198102110031981-09-22서해안숯불장어구이인천광역시 강화군 길상면 해안동로 96-9 (외 2필지(616-10,616-34))인천광역시 강화군 길상면 초지리 616번지 4호 외 2필지(616-10,616-34)282.0032-037-09462016-01-2823049417-8432011-06-01Y길상면일식
5일반음식점198102110091981-10-10원조골목집인천광역시 강화군 길상면 온수길 44인천광역시 강화군 길상면 온수리 514번지 5호79.83032-937-76392013-03-0723050417-8411981-10-10<NA>길상면한식
6일반음식점198102110081981-09-30해뜨고달뜨고인천광역시 강화군 양도면 강화남로769번길 71인천광역시 강화군 양도면 하일리 446번지94.94032-937-01832004-07-0523057417-8532006-02-01Y양도면한식
7일반음식점198402110311984-05-22하얀집인천광역시 강화군 강화읍 강화대로403번길 4-1, 1층인천광역시 강화군 강화읍 신문리 2번지 44호 1층65.28032-934-32082016-06-2723035417-9432016-06-27Y강화읍<NA>
8일반음식점198402110341984-05-30조금인천광역시 강화군 강화읍 강화대로409번길 3인천광역시 강화군 강화읍 신문리 2번지 6호68.4032-934-19772009-02-1323035417-9431984-05-30<NA>강화읍한식
9일반음식점198402110351984-05-30천일관인천광역시 강화군 강화읍 강화대로419번길 9인천광역시 강화군 강화읍 신문리 147번지148.5032-933-66232016-06-0223035417-8071984-05-30<NA>강화읍중국식
업종명인허가번호인허가일자업소명소재지(도로명)소재지(지번)영업장면적소재지전화영업자시작일우편번호(도로명)우편번호(지번)소재지시작일지위승계여부행정동명업태명
1323일반음식점201602111312016-06-24선두반점인천광역시 강화군 길상면 해안남로 757, 1동인천광역시 강화군 길상면 선두리 909번지 1호129.3032-937-87012016-06-2423051417-8422016-06-24<NA>길상면중국식
1324일반음식점201602111352016-07-07다화당인천광역시 강화군 불은면 중앙로 694, 1층인천광역시 강화군 불은면 삼성리 921번지 1층60.59<NA>2016-07-0723038417-8332016-07-07<NA>불은면한식
1325일반음식점201602111372016-07-07엔드하리인천광역시 강화군 길상면 동검길63번길 66, 1층인천광역시 강화군 길상면 동검리 95번지 1층164.79<NA>2016-07-0723053417-8422016-07-07<NA>길상면기타
1326일반음식점201602111452016-07-14장터숯불갈비인천광역시 강화군 송해면 강화대로 659-1, 1층인천광역시 강화군 송해면 하도리 361번지 6호 1층213.6032-934-33442016-07-1423019417-8132016-07-14<NA>송해면한식
1327일반음식점201602111462016-07-15도토리인천광역시 강화군 화도면 해안남로 1206, 1층인천광역시 강화군 화도면 사기리 436번지 1층52.05032-937-42022016-07-1523059417-8622016-07-15<NA>화도면한식
1328일반음식점201602111472016-07-18외포리족향기인천광역시 강화군 내가면 해안서로 937, 1층인천광역시 강화군 내가면 외포리 686번지 8호90.28032-934-98002016-07-1823054417-8942016-07-18<NA>내가면한식
1329일반음식점201602111492016-07-25인천광역시 강화군 내가면 중앙로1128번길 21인천광역시 강화군 내가면 외포리 56번지 3호94.95032-932-93212016-07-25<NA>417-8932016-07-25<NA>내가면한식
1330일반음식점201602111512016-07-28건평휴게소인천광역시 강화군 양도면 해안서로 508, 1층인천광역시 강화군 양도면 건평리 751번지 1호 1층12.73032-937-78552016-07-2823056417-8512016-07-28<NA>양도면한식
1331일반음식점201602111522016-07-28무진장수산인천광역시 강화군 길상면 해안남로117번길 5-15, 1층인천광역시 강화군 길상면 초지리 1325번지 40호 1층56.51<NA>2016-07-2823052417-8432016-07-28<NA>길상면회집
1332일반음식점201602111542016-07-29아모테인천광역시 강화군 화도면 해안남로 1390, 1층인천광역시 강화군 화도면 사기리 493번지 5호 1층310.98<NA>2016-07-2923059417-8622016-07-29<NA>화도면경양식