Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B
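
The average record size can be rechecked from the total memory size and the number of observations. The sketch below assumes 1 KiB = 1024 bytes and that the figure is a plain division of total memory by row count; the function name is illustrative and not part of ydata-profiling.

```python
# Values copied from the dataset statistics above.
TOTAL_SIZE_KIB = 478.5
N_OBSERVATIONS = 10_000


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average in-memory bytes per row, taking 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


avg_bytes = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the reported average record size.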

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

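Description: Data on food sanitation businesses in Seo-gu, Incheon Metropolitan City. It provides fields such as serial number (연번), business type (업종명), business name (업소명), and street address (소재지 (도로명)).
Author: Seo-gu, Incheon Metropolitan City (인천광역시 서구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15039517&srcSe=7661IVAWM27C61E190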

Alerts

데이터기준일자 has constant value "2022-09-06" (Constant)
연번 is highly overall correlated with 업종명 (High correlation)
업종명 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
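
The Constant and Unique alerts can be illustrated with a simple distinct-value check. This is a minimal sketch and not ydata-profiling's actual implementation: `column_alerts` is a hypothetical helper, and the input is only the first five rows shown in this report's Sample section.

```python
def column_alerts(values):
    """Return 'Constant' / 'Unique' labels for one column (illustrative helper)."""
    distinct = len(set(values))
    alerts = []
    if distinct == 1:
        alerts.append("Constant")  # every row holds the same value
    if distinct == len(values) and len(values) > 1:
        alerts.append("Unique")    # no value is repeated
    return alerts


# First five rows of three columns, copied from the Sample section of this report.
sample = {
    "연번": [8856, 4169, 1591, 9999, 8956],
    "업종명": ["일반음식점", "일반음식점", "휴게음식점", "식품자동판매기영업", "유통전문판매업"],
    "데이터기준일자": ["2022-09-06"] * 5,
}

alerts = {name: column_alerts(col) for name, col in sample.items()}
print(alerts)
```

On the full dataset the same idea applies: 연번 has 10000 distinct values in 10000 rows, and 데이터기준일자 has a single distinct value.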

Reproduction

Analysis started: 2024-01-28 06:33:57.542159
Analysis finished: 2024-01-28 06:33:58.569579
Duration: 1.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
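
The duration can be rechecked from the two analysis timestamps. A minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-01-28 06:33:57.542159")
finished = datetime.fromisoformat("2024-01-28 06:33:58.569579")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} s")
```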

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5175.1538
Minimum: 1
Maximum: 10374
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 515.95
Q1: 2577.75
Median: 5177
Q3: 7759.25
95-th percentile: 9859.05
Maximum: 10374
Range: 10373
Interquartile range (IQR): 5181.5
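
The fractional percentiles (for example 515.95) are consistent with linear interpolation between neighbouring sorted values, which is the pandas default; that the report uses this method is an assumption here. The sketch below shows the interpolation on a toy list and rechecks the two derived rows (range and IQR) from the values printed above. `percentile_linear` is an illustrative helper.

```python
import math


def percentile_linear(sorted_values, q):
    """q-th quantile (0 <= q <= 1) with linear interpolation between neighbours."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Toy data: the 25% position is 2.25, i.e. a quarter of the way from 3 to 4.
toy = list(range(1, 11))
q1_toy = percentile_linear(toy, 0.25)

# Derived rows, recomputed from the reported minimum, maximum, Q1 and Q3.
value_range = 10374 - 1
iqr = 7759.25 - 2577.75
print(q1_toy, value_range, iqr)
```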

Descriptive statistics

Standard deviation: 2995.9993
Coefficient of variation (CV): 0.57891986
Kurtosis: -1.1999832
Mean: 5175.1538
Median Absolute Deviation (MAD): 2591.5
Skewness: 0.0056856767
Sum: 51751538
Variance: 8976011.8
Monotonicity: Not monotonic
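
Two of these statistics are simple functions of the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. A short sketch that rechecks both from the rounded values printed above (small differences in the last digits are expected because the inputs are rounded):

```python
import math

# Rounded values copied from the descriptive statistics.
mean = 5175.1538
std = 2995.9993

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance as the square of the standard deviation

print(f"CV = {cv:.6f}, variance = {variance:.1f}")
print(math.isclose(cv, 0.57891986, abs_tol=1e-6))
```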
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
8856 | 1 | < 0.1%
3127 | 1 | < 0.1%
8540 | 1 | < 0.1%
4437 | 1 | < 0.1%
9330 | 1 | < 0.1%
6724 | 1 | < 0.1%
8499 | 1 | < 0.1%
5946 | 1 | < 0.1%
6115 | 1 | < 0.1%
710 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
10374 | 1 | < 0.1%
10373 | 1 | < 0.1%
10372 | 1 | < 0.1%
10371 | 1 | < 0.1%
10370 | 1 | < 0.1%
10369 | 1 | < 0.1%
10368 | 1 | < 0.1%
10367 | 1 | < 0.1%
10366 | 1 | < 0.1%
10365 | 1 | < 0.1%

업종명
Categorical

HIGH CORRELATION 

Distinct: 19
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
일반음식점 | 5450
휴게음식점 | 1719
즉석판매제조가공업 | 678
집단급식소 | 484
식품제조가공업 | 314
Other values (14) | 1355

Length

Max length: 11
Median length: 5
Mean length: 5.5905
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 휴게음식점
4th row: 식품자동판매기영업
5th row: 유통전문판매업

Common Values

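Value | Count | Frequency (%)
일반음식점 (general restaurant) | 5450 | 54.5%
휴게음식점 (snack bar / cafe-type restaurant) | 1719 | 17.2%
즉석판매제조가공업 (on-site food manufacturing and sales) | 678 | 6.8%
집단급식소 (institutional food service facility) | 484 | 4.8%
식품제조가공업 (food manufacturing and processing) | 314 | 3.1%
유통전문판매업 (distribution-specialized sales) | 232 | 2.3%
식품자동판매기영업 (food vending machine business) | 230 | 2.3%
식품소분업 (food repackaging business) | 199 | 2.0%
위탁급식영업 (contract food service) | 175 | 1.8%
제과점영업 (bakery business) | 138 | 1.4%
Other values (9) | 381 | 3.8%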
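
A value/count/frequency table like the one above can be built with a plain counter. This is a minimal sketch on the five sample rows of 업종명 shown earlier, not the profiler's own code; the variable names are illustrative.

```python
from collections import Counter

# The five sample rows of 업종명 listed in this report.
sample_types = ["일반음식점", "일반음식점", "휴게음식점", "식품자동판매기영업", "유통전문판매업"]

counts = Counter(sample_types)
total = sum(counts.values())

# One (value, count, frequency %) tuple per category, most frequent first.
table = [
    (value, count, round(100 * count / total, 1))
    for value, count in counts.most_common()
]

for value, count, pct in table:
    print(f"{value} | {count} | {pct}%")
```

Applied to the full column, the same computation gives 5450 / 10000 = 54.5% for 일반음식점.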

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
일반음식점 | 5450 | 53.9%
휴게음식점 | 1719 | 17.0%
즉석판매제조가공업 | 678 | 6.7%
집단급식소 | 588 | 5.8%
식품제조가공업 | 314 | 3.1%
유통전문판매업 | 232 | 2.3%
식품자동판매기영업 | 230 | 2.3%
식품소분업 | 199 | 2.0%
위탁급식영업 | 175 | 1.7%
제과점영업 | 138 | 1.4%
Other values (9) | 381 | 3.8%

업소명
Text

Distinct: 8722
Distinct (%): 87.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 33
Median length: 27
Mean length: 7.2019
Min length: 1

Characters and Unicode

Total characters: 72019
Distinct characters: 1080
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
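
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for two 업소명 sample values from this report; the mapping from two-letter codes to the long names used in the report is written out by hand, and `category_counts` is an illustrative helper rather than the profiler's own code.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Nl": "Letter Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Sm": "Math Symbol",
    "So": "Other Symbol",
}


def category_counts(text):
    """Count characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts_a = category_counts("(주)지엔인터내셔널")
counts_b = category_counts("지에스25 불로센트럴")
print(counts_a)
print(counts_b)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for this column.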

Unique

Unique: 7705
Unique (%): 77.0%

Sample

1st row: 국밥대장청라점
2nd row: 원가
3rd row: 투썸플레이스 인천검암점
4th row: 지에스25 불로센트럴
5th row: (주)지엔인터내셔널

Value | Count | Frequency (%)
청라점 | 143 | 1.2%
주식회사 | 95 | 0.8%
세븐일레븐 | 56 | 0.5%
검단점 | 49 | 0.4%
인천청라점 | 44 | 0.4%
씨유 | 42 | 0.3%
이마트24 | 26 | 0.2%
검암점 | 24 | 0.2%
루원시티점 | 24 | 0.2%
검단신도시점 | 23 | 0.2%
Other values (9084) | 11740 | 95.7%

Most occurring characters

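Value | Count | Frequency (%)
(missing) | 2447 | 3.4%
(space) | 2268 | 3.1%
(missing) | 1589 | 2.2%
) | 1258 | 1.7%
( | 1258 | 1.7%
(missing) | 1251 | 1.7%
(missing) | 1201 | 1.7%
(missing) | 1059 | 1.5%
(missing) | 978 | 1.4%
(missing) | 949 | 1.3%
Other values (1070) | 57761 | 80.2%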

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 62548 | 86.8%
Space Separator | 2268 | 3.1%
Uppercase Letter | 2110 | 2.9%
Close Punctuation | 1259 | 1.7%
Open Punctuation | 1259 | 1.7%
Decimal Number | 1176 | 1.6%
Lowercase Letter | 1156 | 1.6%
Other Punctuation | 211 | 0.3%
Dash Punctuation | 17 | < 0.1%
Math Symbol | 7 | < 0.1%
Other values (3) | 8 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 2447 | 3.9%
(missing) | 1589 | 2.5%
(missing) | 1251 | 2.0%
(missing) | 1201 | 1.9%
(missing) | 1059 | 1.7%
(missing) | 978 | 1.6%
(missing) | 949 | 1.5%
(missing) | 880 | 1.4%
(missing) | 875 | 1.4%
(missing) | 850 | 1.4%
Other values (983) | 50469 | 80.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 229 | 10.9%
S | 204 | 9.7%
O | 165 | 7.8%
G | 155 | 7.3%
E | 154 | 7.3%
F | 118 | 5.6%
A | 117 | 5.5%
B | 109 | 5.2%
P | 96 | 4.5%
T | 85 | 4.0%
Other values (16) | 678 | 32.1%

Lowercase Letter
Value | Count | Frequency (%)
e | 187 | 16.2%
o | 107 | 9.3%
a | 103 | 8.9%
s | 75 | 6.5%
f | 75 | 6.5%
c | 68 | 5.9%
i | 66 | 5.7%
t | 64 | 5.5%
n | 60 | 5.2%
r | 49 | 4.2%
Other values (16) | 302 | 26.1%

Other Punctuation
Value | Count | Frequency (%)
& | 118 | 55.9%
, | 31 | 14.7%
. | 26 | 12.3%
' | 14 | 6.6%
· | 7 | 3.3%
# | 5 | 2.4%
/ | 4 | 1.9%
! | 2 | 0.9%
: | 2 | 0.9%
; | 1 | 0.5%

Decimal Number
Value | Count | Frequency (%)
2 | 357 | 30.4%
5 | 171 | 14.5%
1 | 132 | 11.2%
4 | 124 | 10.5%
0 | 117 | 9.9%
9 | 74 | 6.3%
3 | 66 | 5.6%
8 | 55 | 4.7%
7 | 50 | 4.3%
6 | 30 | 2.6%

Math Symbol
Value | Count | Frequency (%)
+ | 2 | 28.6%
< | 2 | 28.6%
> | 2 | 28.6%
~ | 1 | 14.3%

Close Punctuation
Value | Count | Frequency (%)
) | 1258 | 99.9%
] | 1 | 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 1258 | 99.9%
[ | 1 | 0.1%

Other Symbol
Value | Count | Frequency (%)
(missing) | 5 | 83.3%
° | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 2268 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%

Letter Number
Value | Count | Frequency (%)
(missing) | 1 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 62526 | 86.8%
Common | 6199 | 8.6%
Latin | 3267 | 4.5%
Han | 27 | < 0.1%

Most frequent character per script

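Hangul
Value | Count | Frequency (%)
(missing) | 2447 | 3.9%
(missing) | 1589 | 2.5%
(missing) | 1251 | 2.0%
(missing) | 1201 | 1.9%
(missing) | 1059 | 1.7%
(missing) | 978 | 1.6%
(missing) | 949 | 1.5%
(missing) | 880 | 1.4%
(missing) | 875 | 1.4%
(missing) | 850 | 1.4%
Other values (960) | 50447 | 80.7%

Latin
Value | Count | Frequency (%)
C | 229 | 7.0%
S | 204 | 6.2%
e | 187 | 5.7%
O | 165 | 5.1%
G | 155 | 4.7%
E | 154 | 4.7%
F | 118 | 3.6%
A | 117 | 3.6%
B | 109 | 3.3%
o | 107 | 3.3%
Other values (43) | 1722 | 52.7%

Common
Value | Count | Frequency (%)
(space) | 2268 | 36.6%
) | 1258 | 20.3%
( | 1258 | 20.3%
2 | 357 | 5.8%
5 | 171 | 2.8%
1 | 132 | 2.1%
4 | 124 | 2.0%
& | 118 | 1.9%
0 | 117 | 1.9%
9 | 74 | 1.2%
Other values (23) | 322 | 5.2%

Han
Value | Count | Frequency (%)
(missing) | 2 | 7.4%
(missing) | 2 | 7.4%
(missing) | 2 | 7.4%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
(missing) | 1 | 3.7%
Other values (14) | 14 | 51.9%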

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 62518 | 86.8%
ASCII | 9457 | 13.1%
CJK | 26 | < 0.1%
None | 13 | < 0.1%
Compat Jamo | 3 | < 0.1%
Number Forms | 1 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%

Most frequent character per block

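Hangul
Value | Count | Frequency (%)
(missing) | 2447 | 3.9%
(missing) | 1589 | 2.5%
(missing) | 1251 | 2.0%
(missing) | 1201 | 1.9%
(missing) | 1059 | 1.7%
(missing) | 978 | 1.6%
(missing) | 949 | 1.5%
(missing) | 880 | 1.4%
(missing) | 875 | 1.4%
(missing) | 850 | 1.4%
Other values (958) | 50439 | 80.7%

ASCII
Value | Count | Frequency (%)
(space) | 2268 | 24.0%
) | 1258 | 13.3%
( | 1258 | 13.3%
2 | 357 | 3.8%
C | 229 | 2.4%
S | 204 | 2.2%
e | 187 | 2.0%
5 | 171 | 1.8%
O | 165 | 1.7%
G | 155 | 1.6%
Other values (73) | 3205 | 33.9%

None
Value | Count | Frequency (%)
· | 7 | 53.8%
(missing) | 5 | 38.5%
° | 1 | 7.7%

Compat Jamo
Value | Count | Frequency (%)
(missing) | 3 | 100.0%

CJK
Value | Count | Frequency (%)
(missing) | 2 | 7.7%
(missing) | 2 | 7.7%
(missing) | 2 | 7.7%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
(missing) | 1 | 3.8%
Other values (13) | 13 | 50.0%

Number Forms
Value | Count | Frequency (%)
(missing) | 1 | 100.0%

CJK Compat Ideographs
Value | Count | Frequency (%)
(missing) | 1 | 100.0%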

소재지 (도로명)
Text

Distinct: 8859
Distinct (%): 88.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 96
Median length: 76
Mean length: 34.1677
Min length: 9

Characters and Unicode

Total characters: 341677
Distinct characters: 546
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8048
Unique (%): 80.5%

Sample

1st row: 인천광역시 서구 중봉대로612번길 10-8, 반안프라자 103,104호 (청라동)
2nd row: 인천광역시 서구 검단로114번안길 9 (오류동)
3rd row: 인천광역시 서구 승학로 508, 1층일부,2층 (검암동)
4th row: 인천광역시 서구 검단로 748, 일번지프라자 1층 108호 (불로동)
5th row: 인천광역시 서구 소담로 24, 1층일부,2층일부(I-FOOD Park 산업단지내) (금곡동)

Value | Count | Frequency (%)
인천광역시 | 9877 | 14.8%
서구 | 9875 | 14.8%
1층 | 2558 | 3.8%
청라동 | 1429 | 2.1%
가좌동 | 1311 | 2.0%
석남동 | 1244 | 1.9%
일부호 | 764 | 1.1%
일부 | 744 | 1.1%
가정동 | 710 | 1.1%
마전동 | 629 | 0.9%
Other values (5375) | 37653 | 56.4%
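
The table above counts individual words in the address strings. A rough approximation of such a count is sketched below: split each address on whitespace and strip surrounding ASCII punctuation. The profiler's real tokenizer may differ, `word_counts` is a hypothetical helper, and the input is limited to three sample addresses from this report.

```python
import string
from collections import Counter


def word_counts(texts):
    """Count whitespace-separated words after stripping surrounding ASCII punctuation."""
    counts = Counter()
    for text in texts:
        for token in text.split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that were punctuation only
                counts[word] += 1
    return counts


# Sample rows 2 to 4 of 소재지 (도로명), copied from this report.
addresses = [
    "인천광역시 서구 검단로114번안길 9 (오류동)",
    "인천광역시 서구 승학로 508, 1층일부,2층 (검암동)",
    "인천광역시 서구 검단로 748, 일번지프라자 1층 108호 (불로동)",
]

counts = word_counts(addresses)
for word, count in counts.most_common(3):
    print(word, count)
```

Because nearly every address starts with 인천광역시 서구, those two words lead the full table as well.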

Most occurring characters

ValueCountFrequency (%)
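(space) | 56831 | 16.6%
1 | 17476 | 5.1%
(missing) | 11281 | 3.3%
(missing) | 10866 | 3.2%
, | 10822 | 3.2%
) | 10296 | 3.0%
( | 10296 | 3.0%
(missing) | 10167 | 3.0%
(missing) | 10149 | 3.0%
(missing) | 10119 | 3.0%
Other values (536) | 183374 | 53.7%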

Most occurring categories

ValueCountFrequency (%)
Other Letter | 187422 | 54.9%
Decimal Number | 60388 | 17.7%
Space Separator | 56831 | 16.6%
Other Punctuation | 10897 | 3.2%
Close Punctuation | 10298 | 3.0%
Open Punctuation | 10298 | 3.0%
Uppercase Letter | 2524 | 0.7%
Dash Punctuation | 2309 | 0.7%
Lowercase Letter | 387 | 0.1%
Math Symbol | 320 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 11281 | 6.0%
(missing) | 10866 | 5.8%
(missing) | 10167 | 5.4%
(missing) | 10149 | 5.4%
(missing) | 10119 | 5.4%
(missing) | 10101 | 5.4%
(missing) | 9964 | 5.3%
(missing) | 9918 | 5.3%
(missing) | 9901 | 5.3%
(missing) | 5512 | 2.9%
Other values (469) | 89444 | 47.7%

Uppercase Letter
Value | Count | Frequency (%)
B | 489 | 19.4%
A | 335 | 13.3%
E | 328 | 13.0%
C | 187 | 7.4%
L | 181 | 7.2%
P | 159 | 6.3%
R | 141 | 5.6%
S | 107 | 4.2%
K | 93 | 3.7%
I | 84 | 3.3%
Other values (12) | 420 | 16.6%

Lowercase Letter
Value | Count | Frequency (%)
e | 104 | 26.9%
a | 69 | 17.8%
r | 67 | 17.3%
s | 59 | 15.2%
d | 46 | 11.9%
k | 23 | 5.9%
c | 5 | 1.3%
b | 3 | 0.8%
n | 2 | 0.5%
p | 2 | 0.5%
Other values (6) | 7 | 1.8%

Decimal Number
Value | Count | Frequency (%)
1 | 17476 | 28.9%
2 | 8343 | 13.8%
0 | 7054 | 11.7%
3 | 5322 | 8.8%
4 | 4502 | 7.5%
5 | 3954 | 6.5%
6 | 3675 | 6.1%
8 | 3522 | 5.8%
7 | 3437 | 5.7%
9 | 3103 | 5.1%

Other Punctuation
Value | Count | Frequency (%)
, | 10822 | 99.3%
' | 47 | 0.4%
. | 15 | 0.1%
& | 7 | 0.1%
" | 2 | < 0.1%
* | 2 | < 0.1%
@ | 1 | < 0.1%
/ | 1 | < 0.1%

Math Symbol
Value | Count | Frequency (%)
< | 127 | 39.7%
> | 127 | 39.7%
~ | 66 | 20.6%

Close Punctuation
Value | Count | Frequency (%)
) | 10296 | > 99.9%
] | 2 | < 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 10296 | > 99.9%
[ | 2 | < 0.1%

Letter Number
Value | Count | Frequency (%)
(missing) | 2 | 66.7%
(missing) | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 56831 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2309 | 100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul | 187421 | 54.9%
Common | 151341 | 44.3%
Latin | 2914 | 0.9%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(missing) | 11281 | 6.0%
(missing) | 10866 | 5.8%
(missing) | 10167 | 5.4%
(missing) | 10149 | 5.4%
(missing) | 10119 | 5.4%
(missing) | 10101 | 5.4%
(missing) | 9964 | 5.3%
(missing) | 9918 | 5.3%
(missing) | 9901 | 5.3%
(missing) | 5512 | 2.9%
Other values (468) | 89443 | 47.7%

Latin
Value | Count | Frequency (%)
B | 489 | 16.8%
A | 335 | 11.5%
E | 328 | 11.3%
C | 187 | 6.4%
L | 181 | 6.2%
P | 159 | 5.5%
R | 141 | 4.8%
S | 107 | 3.7%
e | 104 | 3.6%
K | 93 | 3.2%
Other values (30) | 790 | 27.1%

Common
Value | Count | Frequency (%)
(space) | 56831 | 37.6%
1 | 17476 | 11.5%
, | 10822 | 7.2%
) | 10296 | 6.8%
( | 10296 | 6.8%
2 | 8343 | 5.5%
0 | 7054 | 4.7%
3 | 5322 | 3.5%
4 | 4502 | 3.0%
5 | 3954 | 2.6%
Other values (17) | 16445 | 10.9%

Han
Value | Count | Frequency (%)
(missing) | 1 | 100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul | 187317 | 54.8%
ASCII | 154252 | 45.1%
Compat Jamo | 104 | < 0.1%
Number Forms | 3 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

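ASCII
Value | Count | Frequency (%)
(space) | 56831 | 36.8%
1 | 17476 | 11.3%
, | 10822 | 7.0%
) | 10296 | 6.7%
( | 10296 | 6.7%
2 | 8343 | 5.4%
0 | 7054 | 4.6%
3 | 5322 | 3.5%
4 | 4502 | 2.9%
5 | 3954 | 2.6%
Other values (55) | 19356 | 12.5%

Hangul
Value | Count | Frequency (%)
(missing) | 11281 | 6.0%
(missing) | 10866 | 5.8%
(missing) | 10167 | 5.4%
(missing) | 10149 | 5.4%
(missing) | 10119 | 5.4%
(missing) | 10101 | 5.4%
(missing) | 9964 | 5.3%
(missing) | 9918 | 5.3%
(missing) | 9901 | 5.3%
(missing) | 5512 | 2.9%
Other values (467) | 89339 | 47.7%

Compat Jamo
Value | Count | Frequency (%)
(missing) | 104 | 100.0%

Number Forms
Value | Count | Frequency (%)
(missing) | 2 | 66.7%
(missing) | 1 | 33.3%

CJK
Value | Count | Frequency (%)
(missing) | 1 | 100.0%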

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
2022-09-06 | 10000

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-09-06
2nd row: 2022-09-06
3rd row: 2022-09-06
4th row: 2022-09-06
5th row: 2022-09-06

Common Values

Value | Count | Frequency (%)
2022-09-06 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022-09-06 | 10000 | 100.0%

Interactions


Correlations

Variable | 연번 | 업종명
연번 | 1.000 | 0.891
업종명 | 0.891 | 1.000

Variable | 연번 | 업종명
연번 | 1.000 | 0.605
업종명 | 0.605 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 업종명 | 업소명 | 소재지 (도로명) | 데이터기준일자
8855 | 8856 | 일반음식점 | 국밥대장청라점 | 인천광역시 서구 중봉대로612번길 10-8, 반안프라자 103,104호 (청라동) | 2022-09-06
4168 | 4169 | 일반음식점 | 원가 | 인천광역시 서구 검단로114번안길 9 (오류동) | 2022-09-06
1590 | 1591 | 휴게음식점 | 투썸플레이스 인천검암점 | 인천광역시 서구 승학로 508, 1층일부,2층 (검암동) | 2022-09-06
9998 | 9999 | 식품자동판매기영업 | 지에스25 불로센트럴 | 인천광역시 서구 검단로 748, 일번지프라자 1층 108호 (불로동) | 2022-09-06
8955 | 8956 | 유통전문판매업 | (주)지엔인터내셔널 | 인천광역시 서구 소담로 24, 1층일부,2층일부(I-FOOD Park 산업단지내) (금곡동) | 2022-09-06
585 | 586 | 휴게음식점 | 이삭토스트 인천청라점 | 인천광역시 서구 청라라임로 51, 118호 (연희동, 청라에일린의 뜰 ) | 2022-09-06
2719 | 2720 | 즉석판매제조가공업 | 클로빗(clovit) | 인천광역시 서구 승학로506번길 25-5, 1층 (검암동) | 2022-09-06
1153 | 1154 | 휴게음식점 | 노브랜드버거 인천완정역점 | 인천광역시 서구 원당대로 660 (당하동, 당하영프라자 101호) | 2022-09-06
9134 | 9135 | 유통전문판매업 | 주식회사델포유 | 인천광역시 서구 원석로196번길 12, 4층 일부호 (원창동) | 2022-09-06
7758 | 7759 | 일반음식점 | 피자헛청라점 | 인천광역시 서구 청라커낼로319번길 3-5, 1층 (경서동) | 2022-09-06

Last rows

Index | 연번 | 업종명 | 업소명 | 소재지 (도로명) | 데이터기준일자
9960 | 9961 | 식품자동판매기영업 | 씨유마전중앙점 | 인천광역시 서구 완정로188번3길 2 (마전동, 1층일부) | 2022-09-06
999 | 1000 | 휴게음식점 | 씨유(CU)가좌공단점 | 인천광역시 서구 보도진로 68, 1층 일부호 (가좌동) | 2022-09-06
9391 | 9392 | 용기.포장지제조업 | 동신관유리공업(주) | 인천광역시 서구 중봉대로393번길 137 (원창동) | 2022-09-06
8224 | 8225 | 일반음식점 | 롯데쇼핑(주)롯데슈퍼신현점 | 인천광역시 서구 염곡로 351 (신현동) | 2022-09-06
7606 | 7607 | 일반음식점 | 배부장찌개가 | 인천광역시 서구 청마로7번길 24, 105호 (당하동) | 2022-09-06
7952 | 7953 | 일반음식점 | 희락갈비 | 인천광역시 서구 서달로149번길 8, 2,3층 (석남동) | 2022-09-06
4294 | 4295 | 일반음식점 | 도깨비호떡 | 인천광역시 서구 청라라임로 85, C동 105호 (연희동, 청라린스트라우스 판매시설) | 2022-09-06
7756 | 7757 | 일반음식점 | 맘스터치 인천불로점 | 인천광역시 서구 검단로 783, 1층 101호 (불로동, 정모빌딩) | 2022-09-06
703 | 704 | 휴게음식점 | 컴포즈커피인천루원SK리더스뷰점 | 인천광역시 서구 가정로 437, B219호 (가정동, 루원시티 SK Leaders' VIEW) | 2022-09-06
6069 | 6070 | 일반음식점 | 보리랑콩이랑 | 인천광역시 서구 보석로18번안길 25 (경서동, 1층전부) | 2022-09-06