Overview

Dataset statistics

Number of variables: 6
Number of observations: 598
Missing cells: 16
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 28.7 KiB
Average record size in memory: 49.2 B
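
The missing-cell percentage above can be checked by hand: it is the number of missing cells divided by the total number of cells (variables times observations). A minimal sketch, using only the figures listed in this section; the variable names are illustrative.

```python
# Figures copied from the dataset statistics above.
n_variables = 6
n_observations = 598
missing_cells = 16

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```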

Variable types

Numeric: 1
Categorical: 2
Text: 3

Dataset

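Description: Data on the status of businesses that generate large volumes of food waste in Namdong-gu, Incheon Metropolitan City, with the columns 연번 (serial number), 업소구분 (business type), 업소명 (business name), 주소 (address), 전화번호 (phone number), and 데이터기준일자 (data reference date).
Author: Incheon Metropolitan City (인천광역시)
URL: https://www.incheon.go.kr/data/DATA010201/view?docId=15034337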

Alerts

데이터기준일자 has constant value "2021-09-07" (Constant)
전화번호 has 16 (2.7%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 17:33:57.010199
Analysis finished: 2024-01-28 17:33:57.697953
Duration: 0.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
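
The reported duration is simply the difference between the two timestamps above. A small sketch that reproduces it with the standard library; the format string is an assumption about how the timestamps are written.

```python
from datetime import datetime

# Assumed timestamp layout: date, time, microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-01-28 17:33:57.010199", FMT)
finished = datetime.strptime("2024-01-28 17:33:57.697953", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```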

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 598
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 299.5
Minimum: 1
Maximum: 598
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 30.85
Q1: 150.25
median: 299.5
Q3: 448.75
95-th percentile: 568.15
Maximum: 598
Range: 597
Interquartile range (IQR): 298.5
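
The quantile figures above are consistent with linear interpolation over a column of consecutive integers. The sketch below assumes that 연번 is exactly 1 to 598 (which matches the reported minimum, maximum, and distinct count) and uses a hypothetical helper, `percentile`, rather than whatever routine the report itself relies on.

```python
def percentile(sorted_values, q):
    """Return the q-th quantile (0 <= q <= 1) using linear interpolation."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 holds the consecutive integers 1..598.
serial_numbers = list(range(1, 599))

p5 = percentile(serial_numbers, 0.05)
q1 = percentile(serial_numbers, 0.25)
median = percentile(serial_numbers, 0.50)
q3 = percentile(serial_numbers, 0.75)
p95 = percentile(serial_numbers, 0.95)
iqr = q3 - q1

print(round(p5, 2), q1, median, q3, round(p95, 2), iqr)
```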

Descriptive statistics

Standard deviation: 172.77201
Coefficient of variation (CV): 0.57686814
Kurtosis: -1.2
Mean: 299.5
Median Absolute Deviation (MAD): 149.5
Skewness: 0
Sum: 179101
Variance: 29850.167
Monotonicity: Strictly increasing
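
Because 연번 appears to be a plain running index, most of the descriptive statistics above can be reproduced from the integers 1 to 598 alone. A minimal sketch under that assumption, using the sample (n - 1) variance, which is what matches the reported value.

```python
import math
import statistics

# Assumption: 연번 holds the consecutive integers 1..598.
serial_numbers = list(range(1, 599))

total = sum(serial_numbers)
mean = statistics.fmean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean  # coefficient of variation

# Median absolute deviation: median of the distances to the median.
median = statistics.median(serial_numbers)
mad = statistics.median([abs(x - median) for x in serial_numbers])

print(total, mean, round(variance, 3), round(std_dev, 5), round(cv, 8), mad)
```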
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
395 | 1 | 0.2%
397 | 1 | 0.2%
398 | 1 | 0.2%
399 | 1 | 0.2%
400 | 1 | 0.2%
401 | 1 | 0.2%
402 | 1 | 0.2%
403 | 1 | 0.2%
404 | 1 | 0.2%
Other values (588) | 588 | 98.3%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
598 | 1 | 0.2%
597 | 1 | 0.2%
596 | 1 | 0.2%
595 | 1 | 0.2%
594 | 1 | 0.2%
593 | 1 | 0.2%
592 | 1 | 0.2%
591 | 1 | 0.2%
590 | 1 | 0.2%
589 | 1 | 0.2%

업소구분
Categorical

Distinct: 6
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 6
Median length: 5
Mean length: 5.006689
Min length: 5

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 집단급식소
5th row: 일반음식점

Common Values

Value | Count | Frequency (%)
집단급식소 | 289 | 48.3%
일반음식점 | 279 | 46.7%
휴게음식점 | 20 | 3.3%
대규모점포 | 6 | 1.0%
관광숙박시설 | 3 | 0.5%
농수산물시장 | 1 | 0.2%
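
Each frequency in the table above is the category count divided by the 598 observations. A short sketch that recomputes the percentages from the listed counts; the dictionary is transcribed from the table and the names are illustrative.

```python
# Counts transcribed from the Common Values table for 업소구분.
counts = {
    "집단급식소": 289,
    "일반음식점": 279,
    "휴게음식점": 20,
    "대규모점포": 6,
    "관광숙박시설": 3,
    "농수산물시장": 1,
}

total = sum(counts.values())

# Share of each category, rounded to one decimal place as in the report.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name}: {counts[name]} ({share}%)")
```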

Length

Histogram of lengths of the category


업체명
Text

Distinct: 593
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 26
Median length: 21
Mean length: 8.9197324
Min length: 2

Characters and Unicode

Total characters: 5334
Distinct characters: 513
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 588
Unique (%): 98.3%

Sample

1st row: 향계
2nd row: 취화선
3rd row: 아웃백스테이크하우스구월점
4th row: 푸른마을유치원
5th row: 명가낙지촌
Value | Count | Frequency (%)
위탁급식소 | 14 | 1.8%
코리아 | 7 | 0.9%
주)스타벅스 | 7 | 0.9%
구월점 | 7 | 0.9%
㈜아워홈 | 6 | 0.8%
㈜동원홈푸드 | 6 | 0.8%
인천논현점 | 5 | 0.6%
인천구월점 | 4 | 0.5%
커피 | 4 | 0.5%
버거킹 | 3 | 0.4%
Other values (678) | 709 | 91.8%

Most occurring characters

ValueCountFrequency (%)
174
 
3.3%
131
 
2.5%
128
 
2.4%
119
 
2.2%
114
 
2.1%
102
 
1.9%
94
 
1.8%
93
 
1.7%
84
 
1.6%
81
 
1.5%
Other values (503) 4214
79.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4842 | 90.8%
Space Separator | 174 | 3.3%
Other Symbol | 94 | 1.8%
Open Punctuation | 70 | 1.3%
Close Punctuation | 70 | 1.3%
Uppercase Letter | 30 | 0.6%
Other Punctuation | 28 | 0.5%
Decimal Number | 24 | 0.4%
Math Symbol | 2 | < 0.1%
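
The category names in the table above are Unicode general categories (for example, Other Letter is `Lo` and Other Symbol is `So`). As an illustration only, and not necessarily how the report computes its counts, the standard-library `unicodedata` module can produce the same kind of breakdown for a single value; `category_counts` is a hypothetical helper name.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in text by their Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One 업체명 value taken from the sample rows of this report.
name = "곰설채설렁탕㈜(곰설채)"
counts = category_counts(name)

# Lo = Other Letter, So = Other Symbol, Ps/Pe = Open/Close Punctuation.
for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo`, the enclosed symbol ㈜ under `So`, and the parentheses under `Ps` and `Pe`, which lines up with the four largest non-space categories listed for this column.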

Most frequent character per category

Other Letter
ValueCountFrequency (%)
131
 
2.7%
128
 
2.6%
119
 
2.5%
114
 
2.4%
102
 
2.1%
93
 
1.9%
84
 
1.7%
81
 
1.7%
79
 
1.6%
75
 
1.5%
Other values (469) 3836
79.2%
Uppercase Letter
ValueCountFrequency (%)
K 7
23.3%
B 4
13.3%
C 3
10.0%
F 3
10.0%
S 2
 
6.7%
R 2
 
6.7%
N 2
 
6.7%
W 1
 
3.3%
J 1
 
3.3%
H 1
 
3.3%
Other values (4) 4
13.3%
Other Punctuation
ValueCountFrequency (%)
* 19
67.9%
& 3
 
10.7%
. 2
 
7.1%
, 2
 
7.1%
1
 
3.6%
/ 1
 
3.6%
Decimal Number
ValueCountFrequency (%)
1 7
29.2%
2 7
29.2%
3 4
16.7%
0 3
12.5%
9 2
 
8.3%
7 1
 
4.2%
Open Punctuation
ValueCountFrequency (%)
( 68
97.1%
[ 2
 
2.9%
Close Punctuation
ValueCountFrequency (%)
) 68
97.1%
] 2
 
2.9%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
174
100.0%
Other Symbol
ValueCountFrequency (%)
94
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4936 | 92.5%
Common | 368 | 6.9%
Latin | 30 | 0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
131
 
2.7%
128
 
2.6%
119
 
2.4%
114
 
2.3%
102
 
2.1%
94
 
1.9%
93
 
1.9%
84
 
1.7%
81
 
1.6%
79
 
1.6%
Other values (470) 3911
79.2%
Common
ValueCountFrequency (%)
174
47.3%
( 68
 
18.5%
) 68
 
18.5%
* 19
 
5.2%
1 7
 
1.9%
2 7
 
1.9%
3 4
 
1.1%
0 3
 
0.8%
& 3
 
0.8%
. 2
 
0.5%
Other values (9) 13
 
3.5%
Latin
ValueCountFrequency (%)
K 7
23.3%
B 4
13.3%
C 3
10.0%
F 3
10.0%
S 2
 
6.7%
R 2
 
6.7%
N 2
 
6.7%
W 1
 
3.3%
J 1
 
3.3%
H 1
 
3.3%
Other values (4) 4
13.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4842 | 90.8%
ASCII | 397 | 7.4%
None | 95 | 1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
174
43.8%
( 68
 
17.1%
) 68
 
17.1%
* 19
 
4.8%
1 7
 
1.8%
K 7
 
1.8%
2 7
 
1.8%
3 4
 
1.0%
B 4
 
1.0%
0 3
 
0.8%
Other values (22) 36
 
9.1%
Hangul
ValueCountFrequency (%)
131
 
2.7%
128
 
2.6%
119
 
2.5%
114
 
2.4%
102
 
2.1%
93
 
1.9%
84
 
1.7%
81
 
1.7%
79
 
1.6%
75
 
1.5%
Other values (469) 3836
79.2%
None
ValueCountFrequency (%)
94
98.9%
1
 
1.1%

주소
Text

Distinct: 567
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 53
Median length: 42
Mean length: 22.847826
Min length: 15

Characters and Unicode

Total characters: 13663
Distinct characters: 198
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 543
Unique (%): 90.8%

Sample

1st row: 인천광역시 남동구 인주대로 664, 204호
2nd row: 인천광역시 남동구 인주대로 664
3rd row: 인천광역시 남동구 인주대로 582
4th row: 인천광역시 남동구 용천로4번길 9-9
5th row: 인천광역시 남동구 인주대로591번길 32
Value | Count | Frequency (%)
인천광역시 | 598 | 22.0%
남동구 | 598 | 22.0%
1층 | 59 | 2.2%
2층 | 35 | 1.3%
인주대로 | 28 | 1.0%
남동대로 | 26 | 1.0%
인하로 | 22 | 0.8%
논고개로 | 20 | 0.7%
백범로 | 14 | 0.5%
예술로 | 14 | 0.5%
Other values (691) | 1308 | 48.1%

Most occurring characters

ValueCountFrequency (%)
2134
 
15.6%
755
 
5.5%
727
 
5.3%
683
 
5.0%
659
 
4.8%
615
 
4.5%
609
 
4.5%
608
 
4.4%
603
 
4.4%
598
 
4.4%
Other values (188) 5672
41.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8186 | 59.9%
Decimal Number | 2779 | 20.3%
Space Separator | 2134 | 15.6%
Other Punctuation | 291 | 2.1%
Dash Punctuation | 72 | 0.5%
Open Punctuation | 58 | 0.4%
Close Punctuation | 58 | 0.4%
Math Symbol | 49 | 0.4%
Uppercase Letter | 35 | 0.3%
Other Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
755
 
9.2%
727
 
8.9%
683
 
8.3%
659
 
8.1%
615
 
7.5%
609
 
7.4%
608
 
7.4%
603
 
7.4%
598
 
7.3%
226
 
2.8%
Other values (163) 2103
25.7%
Decimal Number
ValueCountFrequency (%)
1 569
20.5%
2 468
16.8%
3 265
9.5%
5 264
9.5%
0 247
8.9%
4 235
8.5%
6 220
 
7.9%
7 187
 
6.7%
9 172
 
6.2%
8 152
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 16
45.7%
A 8
22.9%
L 7
20.0%
D 1
 
2.9%
M 1
 
2.9%
S 1
 
2.9%
C 1
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 288
99.0%
. 3
 
1.0%
Space Separator
ValueCountFrequency (%)
2134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Math Symbol
ValueCountFrequency (%)
~ 49
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8187 | 59.9%
Common | 5441 | 39.8%
Latin | 35 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
755
 
9.2%
727
 
8.9%
683
 
8.3%
659
 
8.0%
615
 
7.5%
609
 
7.4%
608
 
7.4%
603
 
7.4%
598
 
7.3%
226
 
2.8%
Other values (164) 2104
25.7%
Common
ValueCountFrequency (%)
2134
39.2%
1 569
 
10.5%
2 468
 
8.6%
, 288
 
5.3%
3 265
 
4.9%
5 264
 
4.9%
0 247
 
4.5%
4 235
 
4.3%
6 220
 
4.0%
7 187
 
3.4%
Other values (7) 564
 
10.4%
Latin
ValueCountFrequency (%)
B 16
45.7%
A 8
22.9%
L 7
20.0%
D 1
 
2.9%
M 1
 
2.9%
S 1
 
2.9%
C 1
 
2.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8186 | 59.9%
ASCII | 5476 | 40.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2134
39.0%
1 569
 
10.4%
2 468
 
8.5%
, 288
 
5.3%
3 265
 
4.8%
5 264
 
4.8%
0 247
 
4.5%
4 235
 
4.3%
6 220
 
4.0%
7 187
 
3.4%
Other values (14) 599
 
10.9%
Hangul
ValueCountFrequency (%)
755
 
9.2%
727
 
8.9%
683
 
8.3%
659
 
8.1%
615
 
7.5%
609
 
7.4%
608
 
7.4%
603
 
7.4%
598
 
7.3%
226
 
2.8%
Other values (163) 2103
25.7%
None
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct: 551
Distinct (%): 94.7%
Missing: 16
Missing (%): 2.7%
Memory size: 4.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.963918
Min length: 9

Characters and Unicode

Total characters: 6963
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 531
Unique (%): 91.2%
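
The percentages for 전화번호 use two different denominators. The missing share is taken over all 598 rows, while the distinct share, the unique share, and the mean length are consistent with using only the 582 non-missing values. This is an inference from the reported numbers rather than a documented rule; the sketch below simply redoes the arithmetic.

```python
# Figures copied from the 전화번호 summary in this report.
n_rows = 598
missing = 16
distinct = 551
unique = 531
total_characters = 6963

# Rows that actually carry a phone number.
present = n_rows - missing

missing_pct = 100 * missing / n_rows      # over all rows
distinct_pct = 100 * distinct / present   # over non-missing rows
unique_pct = 100 * unique / present       # over non-missing rows
mean_length = total_characters / present  # characters per non-missing value

print(f"{missing_pct:.1f}% missing, {distinct_pct:.1f}% distinct, "
      f"{unique_pct:.1f}% unique, mean length {mean_length:.6f}")
```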

Sample

1st row: 032-421-2101
2nd row: 032-461-9733
3rd row: 032-431-1761
4th row: 032-469-1210
5th row: 032-441-4777
Value | Count | Frequency (%)
1522-3232 | 7 | 1.2%
032-421-3300 | 7 | 1.2%
032-422-5670 | 3 | 0.5%
032-468-6888 | 2 | 0.3%
032-469-9929 | 2 | 0.3%
070-5129-4250 | 2 | 0.3%
032-473-7737 | 2 | 0.3%
032-428-2080 | 2 | 0.3%
032-468-8884 | 2 | 0.3%
032-815-1057 | 2 | 0.3%
Other values (541) | 551 | 94.7%

Most occurring characters

Value | Count | Frequency (%)
- | 1153 | 16.6%
2 | 1083 | 15.6%
0 | 1015 | 14.6%
3 | 970 | 13.9%
4 | 656 | 9.4%
1 | 426 | 6.1%
8 | 388 | 5.6%
6 | 362 | 5.2%
7 | 342 | 4.9%
9 | 293 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5810 | 83.4%
Dash Punctuation | 1153 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1083
18.6%
0 1015
17.5%
3 970
16.7%
4 656
11.3%
1 426
 
7.3%
8 388
 
6.7%
6 362
 
6.2%
7 342
 
5.9%
9 293
 
5.0%
5 275
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 1153
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6963
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1153
16.6%
2 1083
15.6%
0 1015
14.6%
3 970
13.9%
4 656
9.4%
1 426
 
6.1%
8 388
 
5.6%
6 362
 
5.2%
7 342
 
4.9%
9 293
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6963
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1153
16.6%
2 1083
15.6%
0 1015
14.6%
3 970
13.9%
4 656
9.4%
1 426
 
6.1%
8 388
 
5.6%
6 362
 
5.2%
7 342
 
4.9%
9 293
 
4.2%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-09-07
2nd row: 2021-09-07
3rd row: 2021-09-07
4th row: 2021-09-07
5th row: 2021-09-07

Common Values

Value | Count | Frequency (%)
2021-09-07 | 598 | 100.0%

Length

Histogram of lengths of the category


Interactions


Correlations

Variable | 연번 | 업소구분
연번 | 1.000 | 0.354
업소구분 | 0.354 | 1.000

Variable | 연번 | 업소구분
연번 | 1.000 | 0.194
업소구분 | 0.194 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 업소구분 | 업체명 | 주소 | 전화번호 | 데이터기준일자
0 | 1 | 일반음식점 | 향계 | 인천광역시 남동구 인주대로 664, 204호 | 032-421-2101 | 2021-09-07
1 | 2 | 일반음식점 | 취화선 | 인천광역시 남동구 인주대로 664 | 032-461-9733 | 2021-09-07
2 | 3 | 일반음식점 | 아웃백스테이크하우스구월점 | 인천광역시 남동구 인주대로 582 | 032-431-1761 | 2021-09-07
3 | 4 | 집단급식소 | 푸른마을유치원 | 인천광역시 남동구 용천로4번길 9-9 | 032-469-1210 | 2021-09-07
4 | 5 | 일반음식점 | 명가낙지촌 | 인천광역시 남동구 인주대로591번길 32 | 032-441-4777 | 2021-09-07
5 | 6 | 일반음식점 | 취홍 | 인천광역시 남동구 예술로192번길 25 | 032-422-3330 | 2021-09-07
6 | 7 | 일반음식점 | 사도시 | 인천광역시 남동구 예술로 206, 201~203호 | 032-426-9788 | 2021-09-07
7 | 8 | 일반음식점 | 곰설채설렁탕㈜(곰설채) | 인천광역시 남동구 인주대로 625, 1~2층 | 032-446-2121 | 2021-09-07
8 | 9 | 일반음식점 | 스시라인 | 인천광역시 남동구 인하로489번길 22 | 032-439-9991 | 2021-09-07
9 | 10 | 일반음식점 | 동해어장&닭상궁 | 인천광역시 남동구 인주대로 606, 2층 | 032-424-3838 | 2021-09-07

(index) | 연번 | 업소구분 | 업체명 | 주소 | 전화번호 | 데이터기준일자
588 | 589 | 휴게음식점 | 투썸플레이스 인천간석사거리점 | 인천광역시 남동구 호구포로 887 | 032-210-9300 | 2021-09-07
589 | 590 | 일반음식점 | 낭만에프앤비(F&B) | 인천광역시 남동구 인하로511번길 40, 2층 | <NA> | 2021-09-07
590 | 591 | 집단급식소 | 상원의료재단 인천힘찬종합병원 | 인천광역시 남동구 논현로 72 | 032-820-9235 | 2021-09-07
591 | 592 | 휴게음식점 | (주)스타벅스 커피 코리아 구월길병원점 | 인천광역시 남동구 남동대로 773 | 1522-3232 | 2021-09-07
592 | 593 | 휴게음식점 | (주)스타벅스 커피 코리아 인천터미널사거리점 | 인천광역시 남동구 예술로 126, 1층 | 1522-3232 | 2021-09-07
593 | 594 | 휴게음식점 | (주)스타벅스 커피 코리아 구월로데오점 | 인천광역시 남동구 인하로 497-22 | 1522-3232 | 2021-09-07
594 | 595 | 휴게음식점 | (주)스타벅스 커피 코리아 인천구월점 | 인천광역시 남동구 예술로 138 | 1522-3232 | 2021-09-07
595 | 596 | 휴게음식점 | (주)스타벅스 코리아 예술회관역점 | 인천광역시 남동구 예술로 174, 1층 | 1522-3232 | 2021-09-07
596 | 597 | 휴게음식점 | (주)스타벅스 코리아 구월아시아드점 | 인천광역시 남동구 인하로 556, 105호 | 1522-3232 | 2021-09-07
597 | 598 | 휴게음식점 | (주)스타벅스 코리아 인천논현점 | 인천광역시 남동구 논고개로 93 | 1522-3232 | 2021-09-07