Overview

Dataset statistics

Number of variables4
Number of observations282
Missing cells16
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.2 KiB
Average record size in memory33.5 B

Variable types

Text2
Numeric1
Categorical1

Dataset

Description대구광역시_동구_음식물류폐기물 다량배출사업장 현황_20200511
Author대구광역시 동구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15034503&dataSetDetailId=150345031e2af2349aa86&provdMethod=FILE

Alerts

월배출량 has 16 (5.7%) missing valuesMissing

Reproduction

Analysis started2023-12-10 19:59:50.627147
Analysis finished2023-12-10 19:59:51.333216
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct279
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T04:59:51.599973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length6.5177305
Min length2

Characters and Unicode

Total characters1838
Distinct characters358
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique276 ?
Unique (%)97.9%

Sample

1st row한라산숯불갈비식당
2nd row공주네속초생선찜
3rd row해이루정
4th row해금강
5th row곤지암할매소머리국밥
ValueCountFrequency (%)
샤브향 3
 
0.9%
팔공참한우마실 2
 
0.6%
어사출또 2
 
0.6%
방촌점 2
 
0.6%
율하점 2
 
0.6%
교동면옥 2
 
0.6%
이시아폴리스점 2
 
0.6%
한올면옥 2
 
0.6%
고향식당 2
 
0.6%
신서혁신점 2
 
0.6%
Other values (305) 307
93.6%
2023-12-11T04:59:52.203234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60
 
3.3%
53
 
2.9%
53
 
2.9%
46
 
2.5%
43
 
2.3%
34
 
1.8%
29
 
1.6%
28
 
1.5%
26
 
1.4%
24
 
1.3%
Other values (348) 1442
78.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1733
94.3%
Space Separator 46
 
2.5%
Close Punctuation 17
 
0.9%
Open Punctuation 17
 
0.9%
Uppercase Letter 8
 
0.4%
Decimal Number 7
 
0.4%
Other Symbol 5
 
0.3%
Lowercase Letter 3
 
0.2%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
3.5%
53
 
3.1%
53
 
3.1%
43
 
2.5%
34
 
2.0%
29
 
1.7%
28
 
1.6%
26
 
1.5%
24
 
1.4%
23
 
1.3%
Other values (329) 1360
78.5%
Uppercase Letter
ValueCountFrequency (%)
T 2
25.0%
I 2
25.0%
C 1
12.5%
D 1
12.5%
G 1
12.5%
F 1
12.5%
Decimal Number
ValueCountFrequency (%)
3 2
28.6%
2 2
28.6%
5 1
14.3%
6 1
14.3%
4 1
14.3%
Lowercase Letter
ValueCountFrequency (%)
t 1
33.3%
o 1
33.3%
z 1
33.3%
Space Separator
ValueCountFrequency (%)
46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1737
94.5%
Common 89
 
4.8%
Latin 11
 
0.6%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
3.5%
53
 
3.1%
53
 
3.1%
43
 
2.5%
34
 
2.0%
29
 
1.7%
28
 
1.6%
26
 
1.5%
24
 
1.4%
23
 
1.3%
Other values (329) 1364
78.5%
Common
ValueCountFrequency (%)
46
51.7%
) 17
 
19.1%
( 17
 
19.1%
3 2
 
2.2%
. 2
 
2.2%
2 2
 
2.2%
5 1
 
1.1%
6 1
 
1.1%
4 1
 
1.1%
Latin
ValueCountFrequency (%)
T 2
18.2%
I 2
18.2%
C 1
9.1%
D 1
9.1%
G 1
9.1%
F 1
9.1%
t 1
9.1%
o 1
9.1%
z 1
9.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1732
94.2%
ASCII 100
 
5.4%
None 5
 
0.3%
CJK 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
60
 
3.5%
53
 
3.1%
53
 
3.1%
43
 
2.5%
34
 
2.0%
29
 
1.7%
28
 
1.6%
26
 
1.5%
24
 
1.4%
23
 
1.3%
Other values (328) 1359
78.5%
ASCII
ValueCountFrequency (%)
46
46.0%
) 17
 
17.0%
( 17
 
17.0%
T 2
 
2.0%
3 2
 
2.0%
. 2
 
2.0%
I 2
 
2.0%
2 2
 
2.0%
5 1
 
1.0%
6 1
 
1.0%
Other values (8) 8
 
8.0%
None
ValueCountFrequency (%)
5
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct266
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T04:59:52.580556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length27
Mean length11.368794
Min length5

Characters and Unicode

Total characters3206
Distinct characters117
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique250 ?
Unique (%)88.7%

Sample

1st row동대구로 601
2nd row신암남로15길 70
3rd row동대구로 600
4th row신암남로 133, 1~2층
5th row동북로 308
ValueCountFrequency (%)
1층 28
 
4.0%
효동로2길 21
 
3.0%
2층 17
 
2.4%
팔공로 15
 
2.1%
팔공산로185길 14
 
2.0%
동부로30길 13
 
1.8%
동촌로 13
 
1.8%
갓바위로 12
 
1.7%
안심로 12
 
1.7%
화랑로 12
 
1.7%
Other values (305) 548
77.7%
2023-12-11T04:59:53.140201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
424
 
13.2%
280
 
8.7%
1 244
 
7.6%
2 241
 
7.5%
3 146
 
4.6%
0 135
 
4.2%
133
 
4.1%
, 124
 
3.9%
118
 
3.7%
5 117
 
3.6%
Other values (107) 1244
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1283
40.0%
Decimal Number 1266
39.5%
Space Separator 424
 
13.2%
Other Punctuation 124
 
3.9%
Dash Punctuation 42
 
1.3%
Math Symbol 35
 
1.1%
Uppercase Letter 12
 
0.4%
Open Punctuation 10
 
0.3%
Close Punctuation 10
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
280
21.8%
133
 
10.4%
118
 
9.2%
63
 
4.9%
57
 
4.4%
55
 
4.3%
54
 
4.2%
28
 
2.2%
23
 
1.8%
23
 
1.8%
Other values (88) 449
35.0%
Decimal Number
ValueCountFrequency (%)
1 244
19.3%
2 241
19.0%
3 146
11.5%
0 135
10.7%
5 117
9.2%
4 94
 
7.4%
6 91
 
7.2%
8 75
 
5.9%
7 66
 
5.2%
9 57
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
A 7
58.3%
B 4
33.3%
C 1
 
8.3%
Space Separator
ValueCountFrequency (%)
424
100.0%
Other Punctuation
ValueCountFrequency (%)
, 124
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Math Symbol
ValueCountFrequency (%)
~ 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1911
59.6%
Hangul 1283
40.0%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
280
21.8%
133
 
10.4%
118
 
9.2%
63
 
4.9%
57
 
4.4%
55
 
4.3%
54
 
4.2%
28
 
2.2%
23
 
1.8%
23
 
1.8%
Other values (88) 449
35.0%
Common
ValueCountFrequency (%)
424
22.2%
1 244
12.8%
2 241
12.6%
3 146
 
7.6%
0 135
 
7.1%
, 124
 
6.5%
5 117
 
6.1%
4 94
 
4.9%
6 91
 
4.8%
8 75
 
3.9%
Other values (6) 220
11.5%
Latin
ValueCountFrequency (%)
A 7
58.3%
B 4
33.3%
C 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1923
60.0%
Hangul 1283
40.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
424
22.0%
1 244
12.7%
2 241
12.5%
3 146
 
7.6%
0 135
 
7.0%
, 124
 
6.4%
5 117
 
6.1%
4 94
 
4.9%
6 91
 
4.7%
8 75
 
3.9%
Other values (9) 232
12.1%
Hangul
ValueCountFrequency (%)
280
21.8%
133
 
10.4%
118
 
9.2%
63
 
4.9%
57
 
4.4%
55
 
4.3%
54
 
4.2%
28
 
2.2%
23
 
1.8%
23
 
1.8%
Other values (88) 449
35.0%

월배출량
Real number (ℝ)

MISSING 

Distinct68
Distinct (%)25.6%
Missing16
Missing (%)5.7%
Infinite0
Infinite (%)0.0%
Mean1169.1767
Minimum15
Maximum7500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-11T04:59:53.345453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile150
Q1400
median890
Q31500
95-th percentile3000
Maximum7500
Range7485
Interquartile range (IQR)1100

Descriptive statistics

Standard deviation1167.9331
Coefficient of variation (CV)0.99893632
Kurtosis7.7856756
Mean1169.1767
Median Absolute Deviation (MAD)590
Skewness2.4526665
Sum311001
Variance1364067.6
MonotonicityNot monotonic
2023-12-11T04:59:53.549320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300 31
 
11.0%
600 26
 
9.2%
900 23
 
8.2%
1500 20
 
7.1%
1200 16
 
5.7%
450 13
 
4.6%
2100 11
 
3.9%
1800 11
 
3.9%
750 10
 
3.5%
3000 9
 
3.2%
Other values (58) 96
34.0%
(Missing) 16
 
5.7%
ValueCountFrequency (%)
15 2
0.7%
20 1
 
0.4%
30 3
1.1%
60 1
 
0.4%
80 1
 
0.4%
90 2
0.7%
95 1
 
0.4%
100 1
 
0.4%
120 1
 
0.4%
150 3
1.1%
ValueCountFrequency (%)
7500 1
 
0.4%
7000 1
 
0.4%
6000 2
 
0.7%
5400 1
 
0.4%
5000 1
 
0.4%
4500 4
1.4%
4200 1
 
0.4%
3600 1
 
0.4%
3200 1
 
0.4%
3000 9
3.2%

방법
Categorical

Distinct9
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
사료화
121 
사료
70 
퇴비화
70 
<NA>
14 
퇴비
 
3
Other values (4)
 
4

Length

Max length6
Median length3
Mean length2.8156028
Min length2

Unique

Unique4 ?
Unique (%)1.4%

Sample

1st row사료
2nd row퇴비화
3rd row사료화
4th row사료
5th row퇴비화

Common Values

ValueCountFrequency (%)
사료화 121
42.9%
사료 70
24.8%
퇴비화 70
24.8%
<NA> 14
 
5.0%
퇴비 3
 
1.1%
개 사료 1
 
0.4%
사료 퇴비 1
 
0.4%
농업생산활동 1
 
0.4%
사료퇴비 1
 
0.4%

Length

2023-12-11T04:59:53.754823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:59:53.987890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사료화 121
42.6%
사료 72
25.4%
퇴비화 70
24.6%
na 14
 
4.9%
퇴비 4
 
1.4%
1
 
0.4%
농업생산활동 1
 
0.4%
사료퇴비 1
 
0.4%

Interactions

2023-12-11T04:59:50.980635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T04:59:54.126236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
월배출량방법
월배출량1.0000.510
방법0.5101.000
2023-12-11T04:59:54.261187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
월배출량방법
월배출량1.0000.279
방법0.2791.000

Missing values

2023-12-11T04:59:51.150758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T04:59:51.280369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호주소월배출량방법
0한라산숯불갈비식당동대구로 601300사료
1공주네속초생선찜신암남로15길 702550퇴비화
2해이루정동대구로 6001200사료화
3해금강신암남로 133, 1~2층172사료
4곤지암할매소머리국밥동북로 3081500퇴비화
5금마루아양로 253300사료
6백담구이동부로22길 50300사료
7만복한쭈꾸미 낙지볶음동대구로 4483200사료화
8기타치는 당나귀동부로22길 69-1, 2층750사료화
9청정한우숯불동대구로 442-1, 1층1000사료화
상호주소월배출량방법
272한끼맛있다 이시아점팔공로51길 31-10, 102호<NA><NA>
273짝라이브효동로2길 47-7, 3층<NA><NA>
274라라코스트(동촌점)효동로2길 29<NA><NA>
275두메산골 한우마을화랑로 395<NA><NA>
276유정갈비아양로 6<NA>사료
277영천한우식객동부로30길 68<NA>사료화
278밥을짓다팔공산로9길 6-1<NA><NA>
279뼈대있는 돼지집동부로30길 57<NA><NA>
280풍미향동촌로 402<NA><NA>
281한티재팔공로51길 3, 201~202호<NA><NA>