Overview

Dataset statistics

Number of variables6
Number of observations360
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.1 KiB
Average record size in memory51.4 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description대구광역시 수성구 관내 음식물폐기물다량배출사업장 현황에 관한 데이터로 상호, 주소 ,월배출예상, 연배출예상 등의 항목을 제공합니다.
Author대구광역시 수성구
URLhttps://www.data.go.kr/data/15040163/fileData.do

Alerts

월배출예상 is highly overall correlated with 년배출예상High correlation
년배출예상 is highly overall correlated with 월배출예상High correlation
사업장구분 is highly imbalanced (64.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-23 06:54:56.313370
Analysis finished2024-03-23 06:55:08.087921
Duration11.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct360
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean180.5
Minimum1
Maximum360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-03-23T06:55:08.677814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile18.95
Q190.75
median180.5
Q3270.25
95-th percentile342.05
Maximum360
Range359
Interquartile range (IQR)179.5

Descriptive statistics

Standard deviation104.06729
Coefficient of variation (CV)0.57655006
Kurtosis-1.2
Mean180.5
Median Absolute Deviation (MAD)90
Skewness0
Sum64980
Variance10830
MonotonicityStrictly increasing
2024-03-23T06:55:09.527024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
249 1
 
0.3%
247 1
 
0.3%
246 1
 
0.3%
245 1
 
0.3%
244 1
 
0.3%
243 1
 
0.3%
242 1
 
0.3%
241 1
 
0.3%
240 1
 
0.3%
Other values (350) 350
97.2%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
360 1
0.3%
359 1
0.3%
358 1
0.3%
357 1
0.3%
356 1
0.3%
355 1
0.3%
354 1
0.3%
353 1
0.3%
352 1
0.3%
351 1
0.3%

상호
Text

Distinct359
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-03-23T06:55:10.676220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length6.7444444
Min length1

Characters and Unicode

Total characters2428
Distinct characters400
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique358 ?
Unique (%)99.4%

Sample

1st row동원초등학교
2nd row만촌초등학교
3rd row파동초등학교
4th row범물초등학교
5th row범일중학교
ValueCountFrequency (%)
수성못점 4
 
1.0%
대구 4
 
1.0%
제형면옥 2
 
0.5%
뜨삽에프엠디 2
 
0.5%
주)초원한우 1
 
0.3%
대구보건환경연구원 1
 
0.3%
신참아나고 1
 
0.3%
하늘타리 1
 
0.3%
봉숙이포차 1
 
0.3%
금곡삼계탕 1
 
0.3%
Other values (367) 367
95.3%
2024-03-23T06:55:11.951846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
3.3%
63
 
2.6%
62
 
2.6%
58
 
2.4%
45
 
1.9%
38
 
1.6%
) 38
 
1.6%
( 38
 
1.6%
37
 
1.5%
36
 
1.5%
Other values (390) 1934
79.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2283
94.0%
Close Punctuation 38
 
1.6%
Open Punctuation 38
 
1.6%
Space Separator 25
 
1.0%
Decimal Number 21
 
0.9%
Uppercase Letter 19
 
0.8%
Connector Punctuation 2
 
0.1%
Other Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
3.5%
63
 
2.8%
62
 
2.7%
58
 
2.5%
45
 
2.0%
38
 
1.7%
37
 
1.6%
36
 
1.6%
36
 
1.6%
35
 
1.5%
Other values (366) 1794
78.6%
Uppercase Letter
ValueCountFrequency (%)
T 5
26.3%
D 4
21.1%
B 3
15.8%
L 2
 
10.5%
H 1
 
5.3%
C 1
 
5.3%
O 1
 
5.3%
S 1
 
5.3%
K 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
2 5
23.8%
3 4
19.0%
1 3
14.3%
8 2
 
9.5%
7 2
 
9.5%
0 2
 
9.5%
5 1
 
4.8%
9 1
 
4.8%
4 1
 
4.8%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2282
94.0%
Common 126
 
5.2%
Latin 19
 
0.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
3.5%
63
 
2.8%
62
 
2.7%
58
 
2.5%
45
 
2.0%
38
 
1.7%
37
 
1.6%
36
 
1.6%
36
 
1.6%
35
 
1.5%
Other values (365) 1793
78.6%
Common
ValueCountFrequency (%)
) 38
30.2%
( 38
30.2%
25
19.8%
2 5
 
4.0%
3 4
 
3.2%
1 3
 
2.4%
8 2
 
1.6%
7 2
 
1.6%
_ 2
 
1.6%
0 2
 
1.6%
Other values (5) 5
 
4.0%
Latin
ValueCountFrequency (%)
T 5
26.3%
D 4
21.1%
B 3
15.8%
L 2
 
10.5%
H 1
 
5.3%
C 1
 
5.3%
O 1
 
5.3%
S 1
 
5.3%
K 1
 
5.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2282
94.0%
ASCII 145
 
6.0%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
79
 
3.5%
63
 
2.8%
62
 
2.7%
58
 
2.5%
45
 
2.0%
38
 
1.7%
37
 
1.6%
36
 
1.6%
36
 
1.6%
35
 
1.5%
Other values (365) 1793
78.6%
ASCII
ValueCountFrequency (%)
) 38
26.2%
( 38
26.2%
25
17.2%
T 5
 
3.4%
2 5
 
3.4%
D 4
 
2.8%
3 4
 
2.8%
1 3
 
2.1%
B 3
 
2.1%
8 2
 
1.4%
Other values (14) 18
12.4%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct349
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-03-23T06:55:12.685528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length37
Mean length25.158333
Min length15

Characters and Unicode

Total characters9057
Distinct characters141
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique340 ?
Unique (%)94.4%

Sample

1st row대구광역시 수성구 국채보상로 1000 (만촌동)
2nd row대구광역시 수성구 국채보상로214길 33 (만촌동)
3rd row대구광역시 수성구 파동로51길 34 (파동)
4th row대구광역시 수성구 지범로41길 33 (범물동)
5th row대구광역시 수성구 지범로41길 23 (범물동)
ValueCountFrequency (%)
대구광역시 360
19.2%
수성구 360
19.2%
두산동 107
 
5.7%
들안로 63
 
3.4%
범어동 51
 
2.7%
상동 33
 
1.8%
용학로 33
 
1.8%
동대구로 30
 
1.6%
달구벌대로 29
 
1.5%
지산동 26
 
1.4%
Other values (376) 780
41.7%
2024-03-23T06:55:14.126611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1515
16.7%
814
 
9.0%
459
 
5.1%
433
 
4.8%
414
 
4.6%
410
 
4.5%
378
 
4.2%
361
 
4.0%
361
 
4.0%
( 359
 
4.0%
Other values (131) 3553
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5475
60.5%
Space Separator 1515
 
16.7%
Decimal Number 1214
 
13.4%
Open Punctuation 359
 
4.0%
Close Punctuation 359
 
4.0%
Connector Punctuation 87
 
1.0%
Dash Punctuation 42
 
0.5%
Math Symbol 4
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
814
14.9%
459
 
8.4%
433
 
7.9%
414
 
7.6%
410
 
7.5%
378
 
6.9%
361
 
6.6%
361
 
6.6%
337
 
6.2%
137
 
2.5%
Other values (113) 1371
25.0%
Decimal Number
ValueCountFrequency (%)
1 272
22.4%
2 183
15.1%
3 157
12.9%
4 113
9.3%
6 112
9.2%
5 91
 
7.5%
7 77
 
6.3%
0 71
 
5.8%
9 69
 
5.7%
8 69
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
W 1
50.0%
Space Separator
ValueCountFrequency (%)
1515
100.0%
Open Punctuation
ValueCountFrequency (%)
( 359
100.0%
Close Punctuation
ValueCountFrequency (%)
) 359
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 87
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5475
60.5%
Common 3580
39.5%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
814
14.9%
459
 
8.4%
433
 
7.9%
414
 
7.6%
410
 
7.5%
378
 
6.9%
361
 
6.6%
361
 
6.6%
337
 
6.2%
137
 
2.5%
Other values (113) 1371
25.0%
Common
ValueCountFrequency (%)
1515
42.3%
( 359
 
10.0%
) 359
 
10.0%
1 272
 
7.6%
2 183
 
5.1%
3 157
 
4.4%
4 113
 
3.2%
6 112
 
3.1%
5 91
 
2.5%
_ 87
 
2.4%
Other values (6) 332
 
9.3%
Latin
ValueCountFrequency (%)
S 1
50.0%
W 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5475
60.5%
ASCII 3582
39.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1515
42.3%
( 359
 
10.0%
) 359
 
10.0%
1 272
 
7.6%
2 183
 
5.1%
3 157
 
4.4%
4 113
 
3.2%
6 112
 
3.1%
5 91
 
2.5%
_ 87
 
2.4%
Other values (8) 334
 
9.3%
Hangul
ValueCountFrequency (%)
814
14.9%
459
 
8.4%
433
 
7.9%
414
 
7.6%
410
 
7.5%
378
 
6.9%
361
 
6.6%
361
 
6.6%
337
 
6.2%
137
 
2.5%
Other values (113) 1371
25.0%

사업장구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
일반음식점
287 
집단급식소
67 
휴게음식점
 
4
농수산물시장
 
1
기타
 
1

Length

Max length6
Median length5
Mean length4.9944444
Min length2

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row집단급식소
2nd row집단급식소
3rd row집단급식소
4th row집단급식소
5th row집단급식소

Common Values

ValueCountFrequency (%)
일반음식점 287
79.7%
집단급식소 67
 
18.6%
휴게음식점 4
 
1.1%
농수산물시장 1
 
0.3%
기타 1
 
0.3%

Length

2024-03-23T06:55:15.345357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:55:16.019789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 287
79.7%
집단급식소 67
 
18.6%
휴게음식점 4
 
1.1%
농수산물시장 1
 
0.3%
기타 1
 
0.3%

월배출예상
Real number (ℝ)

HIGH CORRELATION 

Distinct60
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1802.8528
Minimum40
Maximum140000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-03-23T06:55:16.681949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum40
5-th percentile300
Q1600
median1200
Q31800
95-th percentile3267.5
Maximum140000
Range139960
Interquartile range (IQR)1200

Descriptive statistics

Standard deviation7413.3748
Coefficient of variation (CV)4.1120245
Kurtosis339.04908
Mean1802.8528
Median Absolute Deviation (MAD)600
Skewness18.159555
Sum649027
Variance54958126
MonotonicityNot monotonic
2024-03-23T06:55:17.281594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
900 46
 
12.8%
1500 38
 
10.6%
600 33
 
9.2%
1200 29
 
8.1%
300 25
 
6.9%
3000 21
 
5.8%
1800 17
 
4.7%
450 14
 
3.9%
1000 11
 
3.1%
1300 11
 
3.1%
Other values (50) 115
31.9%
ValueCountFrequency (%)
40 1
 
0.3%
90 2
 
0.6%
100 2
 
0.6%
130 1
 
0.3%
150 4
 
1.1%
250 1
 
0.3%
300 25
6.9%
330 2
 
0.6%
400 3
 
0.8%
450 14
3.9%
ValueCountFrequency (%)
140000 1
 
0.3%
12000 1
 
0.3%
11000 1
 
0.3%
6000 4
1.1%
5000 1
 
0.3%
4800 1
 
0.3%
4500 3
0.8%
4200 1
 
0.3%
3600 5
1.4%
3250 1
 
0.3%

년배출예상
Real number (ℝ)

HIGH CORRELATION 

Distinct58
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16965.233
Minimum480
Maximum144000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-03-23T06:55:17.840921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum480
5-th percentile3600
Q17200
median14400
Q321600
95-th percentile36150
Maximum144000
Range143520
Interquartile range (IQR)14400

Descriptive statistics

Standard deviation15255.626
Coefficient of variation (CV)0.89922875
Kurtosis22.791458
Mean16965.233
Median Absolute Deviation (MAD)7200
Skewness3.6853179
Sum6107484
Variance2.3273411 × 108
MonotonicityNot monotonic
2024-03-23T06:55:18.397657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10800 45
 
12.5%
18000 38
 
10.6%
7200 33
 
9.2%
14400 29
 
8.1%
3600 25
 
6.9%
36000 21
 
5.8%
21600 18
 
5.0%
5400 14
 
3.9%
15600 13
 
3.6%
12000 11
 
3.1%
Other values (48) 113
31.4%
ValueCountFrequency (%)
480 1
 
0.3%
720 1
 
0.3%
1080 3
 
0.8%
1200 2
 
0.6%
1800 4
 
1.1%
3000 1
 
0.3%
3600 25
6.9%
3960 2
 
0.6%
4800 3
 
0.8%
5400 14
3.9%
ValueCountFrequency (%)
144000 1
 
0.3%
132000 1
 
0.3%
72000 4
 
1.1%
60000 1
 
0.3%
57600 1
 
0.3%
54000 3
 
0.8%
50400 1
 
0.3%
43200 5
 
1.4%
39000 1
 
0.3%
36000 21
5.8%

Interactions

2024-03-23T06:55:04.987769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:02.813478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:03.986811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:05.375481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:03.181504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:04.367955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:05.852066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:03.551699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:55:04.645627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T06:55:18.784095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장구분월배출예상년배출예상
연번1.0000.4070.0110.112
사업장구분0.4071.0000.0270.007
월배출예상0.0110.0271.0000.000
년배출예상0.1120.0070.0001.000
2024-03-23T06:55:19.375821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번월배출예상년배출예상사업장구분
연번1.000-0.062-0.0560.179
월배출예상-0.0621.0000.9810.032
년배출예상-0.0560.9811.0000.000
사업장구분0.1790.0320.0001.000

Missing values

2024-03-23T06:55:06.865855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:55:07.764260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호사업장도로명주소사업장구분월배출예상년배출예상
01동원초등학교대구광역시 수성구 국채보상로 1000 (만촌동)집단급식소227727324
12만촌초등학교대구광역시 수성구 국채보상로214길 33 (만촌동)집단급식소7008400
23파동초등학교대구광역시 수성구 파동로51길 34 (파동)집단급식소160019200
34범물초등학교대구광역시 수성구 지범로41길 33 (범물동)집단급식소6207440
45범일중학교대구광역시 수성구 지범로41길 23 (범물동)집단급식소110013200
56착한낙지 수성점대구광역시 수성구 들안로 105 (상동)일반음식점90010800
67이경채자인암소식육식당대구광역시 수성구 들안로 74 (두산동)일반음식점120014400
78서민갈비대구광역시 수성구 들안로 8-5 (두산동)일반음식점12000144000
89만복국수 들안로점대구광역시 수성구 들안로 35 (상동)일반음식점7509000
910아트리움대구광역시 수성구 국채보상로186길 151 (범어동)일반음식점120014400
연번상호사업장도로명주소사업장구분월배출예상년배출예상
350351헬로72번가대구광역시 수성구 용학로 62_ 지하1_ 지상1층 (두산동)일반음식점120014400
351352수성제니스 요양병원대구광역시 수성구 동대구로 64_ 6~13층 (지산동)집단급식소150018000
352353고려H한방병원대구광역시 수성구 동대구로 31_ 5_6_7_8층 (두산동)집단급식소250030000
353354손고등어자반정식대구광역시 수성구 청호로 255_ 1층 (황금동)일반음식점200024000
354355누리마을감자탕 범어점대구광역시 수성구 달구벌대로 2395_ 범어골드클리닉타워 (범어동)일반음식점90010800
355356수월한방병원대구광역시 수성구 동대구로 64_ 2~5층 (지산동)집단급식소7509000
356357상구네돼지구이수성못점대구광역시 수성구 수성못2길 37-5_ 1~2층 (두산동)일반음식점180021600
357358청운식육식당대구광역시 수성구 범어천로 5 (황금동)일반음식점150018000
358359미츠대구광역시 수성구 수성못2길 23_ 지하1층 (두산동)일반음식점6007200
359360(주)홍구가야대구광역시 수성구 청수로 121 (황금동)일반음식점225027000