Overview

Dataset statistics

Number of variables6
Number of observations1208
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory59.1 KiB
Average record size in memory50.1 B

Variable types

Numeric2
Text2
Categorical1
DateTime1

Dataset

Description창원시 관내 음식물쓰레기 다량 배출 사업장명, 사업장구분, 월 배출량, 규모에 관한 자료 입니다.
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15034268

Alerts

데이터기준일 has constant value ""Constant
월배출량(KG) is highly skewed (γ1 = 33.11183439)Skewed
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:18:03.203396
Analysis finished2023-12-11 00:18:04.171049
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1208
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean604.5
Minimum1
Maximum1208
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.7 KiB
2023-12-11T09:18:04.242689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile61.35
Q1302.75
median604.5
Q3906.25
95-th percentile1147.65
Maximum1208
Range1207
Interquartile range (IQR)603.5

Descriptive statistics

Standard deviation348.86387
Coefficient of variation (CV)0.57711145
Kurtosis-1.2
Mean604.5
Median Absolute Deviation (MAD)302
Skewness0
Sum730236
Variance121706
MonotonicityStrictly increasing
2023-12-11T09:18:04.389682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
813 1
 
0.1%
811 1
 
0.1%
810 1
 
0.1%
809 1
 
0.1%
808 1
 
0.1%
807 1
 
0.1%
806 1
 
0.1%
805 1
 
0.1%
804 1
 
0.1%
Other values (1198) 1198
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1208 1
0.1%
1207 1
0.1%
1206 1
0.1%
1205 1
0.1%
1204 1
0.1%
1203 1
0.1%
1202 1
0.1%
1201 1
0.1%
1200 1
0.1%
1199 1
0.1%
Distinct1165
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
2023-12-11T09:18:04.597608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length23
Mean length8.6432119
Min length2

Characters and Unicode

Total characters10441
Distinct characters582
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1122 ?
Unique (%)92.9%

Sample

1st row대산초등학교
2nd row봉림중학교
3rd row경남교육연수원
4th row정성푸드
5th row(주) 신세계푸드 동서식품창원
ValueCountFrequency (%)
주)아워홈 18
 
1.1%
주식회사 17
 
1.0%
창원점 16
 
1.0%
의료법인 13
 
0.8%
주)현대그린푸드 12
 
0.7%
주)한솔 11
 
0.7%
주)비앤에스푸드 11
 
0.7%
구내식당 11
 
0.7%
창원상남점 9
 
0.5%
명륜진사갈비 9
 
0.5%
Other values (1358) 1528
92.3%
2023-12-11T09:18:04.948310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
448
 
4.3%
347
 
3.3%
( 281
 
2.7%
) 280
 
2.7%
269
 
2.6%
240
 
2.3%
236
 
2.3%
233
 
2.2%
180
 
1.7%
176
 
1.7%
Other values (572) 7751
74.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9218
88.3%
Space Separator 448
 
4.3%
Open Punctuation 281
 
2.7%
Close Punctuation 280
 
2.7%
Decimal Number 98
 
0.9%
Uppercase Letter 82
 
0.8%
Other Punctuation 18
 
0.2%
Other Symbol 10
 
0.1%
Connector Punctuation 3
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
347
 
3.8%
269
 
2.9%
240
 
2.6%
236
 
2.6%
233
 
2.5%
180
 
2.0%
176
 
1.9%
158
 
1.7%
133
 
1.4%
125
 
1.4%
Other values (531) 7121
77.3%
Uppercase Letter
ValueCountFrequency (%)
S 13
15.9%
T 11
13.4%
C 8
9.8%
D 8
9.8%
G 6
 
7.3%
B 5
 
6.1%
F 4
 
4.9%
M 4
 
4.9%
K 4
 
4.9%
J 4
 
4.9%
Other values (10) 15
18.3%
Decimal Number
ValueCountFrequency (%)
2 23
23.5%
1 22
22.4%
3 12
12.2%
5 9
 
9.2%
4 8
 
8.2%
9 7
 
7.1%
7 7
 
7.1%
6 5
 
5.1%
0 3
 
3.1%
8 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
& 13
72.2%
. 2
 
11.1%
/ 2
 
11.1%
; 1
 
5.6%
Space Separator
ValueCountFrequency (%)
448
100.0%
Open Punctuation
ValueCountFrequency (%)
( 281
100.0%
Close Punctuation
ValueCountFrequency (%)
) 280
100.0%
Other Symbol
ValueCountFrequency (%)
10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9228
88.4%
Common 1130
 
10.8%
Latin 83
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
347
 
3.8%
269
 
2.9%
240
 
2.6%
236
 
2.6%
233
 
2.5%
180
 
2.0%
176
 
1.9%
158
 
1.7%
133
 
1.4%
125
 
1.4%
Other values (532) 7131
77.3%
Latin
ValueCountFrequency (%)
S 13
15.7%
T 11
13.3%
C 8
9.6%
D 8
9.6%
G 6
 
7.2%
B 5
 
6.0%
F 4
 
4.8%
M 4
 
4.8%
K 4
 
4.8%
J 4
 
4.8%
Other values (11) 16
19.3%
Common
ValueCountFrequency (%)
448
39.6%
( 281
24.9%
) 280
24.8%
2 23
 
2.0%
1 22
 
1.9%
& 13
 
1.2%
3 12
 
1.1%
5 9
 
0.8%
4 8
 
0.7%
9 7
 
0.6%
Other values (9) 27
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9218
88.3%
ASCII 1213
 
11.6%
None 10
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
448
36.9%
( 281
23.2%
) 280
23.1%
2 23
 
1.9%
1 22
 
1.8%
S 13
 
1.1%
& 13
 
1.1%
3 12
 
1.0%
T 11
 
0.9%
5 9
 
0.7%
Other values (30) 101
 
8.3%
Hangul
ValueCountFrequency (%)
347
 
3.8%
269
 
2.9%
240
 
2.6%
236
 
2.6%
233
 
2.5%
180
 
2.0%
176
 
1.9%
158
 
1.7%
133
 
1.4%
125
 
1.4%
Other values (531) 7121
77.3%
None
ValueCountFrequency (%)
10
100.0%

사업장구분
Categorical

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
일반음식점
605 
집단급식소
574 
휴게음식점
 
13
관광숙박시설
 
10
대규모점포
 
6

Length

Max length6
Median length5
Mean length5.0082781
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집단급식소
2nd row집단급식소
3rd row집단급식소
4th row집단급식소
5th row집단급식소

Common Values

ValueCountFrequency (%)
일반음식점 605
50.1%
집단급식소 574
47.5%
휴게음식점 13
 
1.1%
관광숙박시설 10
 
0.8%
대규모점포 6
 
0.5%

Length

2023-12-11T09:18:05.074276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:18:05.195798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 605
50.1%
집단급식소 574
47.5%
휴게음식점 13
 
1.1%
관광숙박시설 10
 
0.8%
대규모점포 6
 
0.5%

월배출량(KG)
Real number (ℝ)

SKEWED 

Distinct198
Distinct (%)16.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2294.381
Minimum9.5
Maximum630000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.7 KiB
2023-12-11T09:18:05.315115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.5
5-th percentile200
Q1600
median1200
Q32000
95-th percentile4800
Maximum630000
Range629990.5
Interquartile range (IQR)1400

Descriptive statistics

Standard deviation18383.441
Coefficient of variation (CV)8.0123747
Kurtosis1129.0216
Mean2294.381
Median Absolute Deviation (MAD)600
Skewness33.111834
Sum2771612.3
Variance3.3795089 × 108
MonotonicityNot monotonic
2023-12-11T09:18:05.469214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
900.0 129
 
10.7%
1200.0 107
 
8.9%
1500.0 104
 
8.6%
600.0 91
 
7.5%
300.0 62
 
5.1%
3000.0 54
 
4.5%
1000.0 44
 
3.6%
2000.0 35
 
2.9%
400.0 26
 
2.2%
2400.0 24
 
2.0%
Other values (188) 532
44.0%
ValueCountFrequency (%)
9.5 1
 
0.1%
15.0 1
 
0.1%
20.0 1
 
0.1%
24.0 1
 
0.1%
25.0 2
0.2%
30.0 4
0.3%
40.0 2
0.2%
50.0 1
 
0.1%
60.0 3
0.2%
64.0 1
 
0.1%
ValueCountFrequency (%)
630000.0 1
0.1%
69525.0 1
0.1%
54750.0 1
0.1%
40000.0 1
0.1%
38000.0 1
0.1%
20000.0 1
0.1%
15000.0 2
0.2%
12166.0 1
0.1%
11500.0 1
0.1%
10650.0 1
0.1%
Distinct757
Distinct (%)62.7%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
2023-12-11T09:18:05.812454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length5.0976821
Min length3

Characters and Unicode

Total characters6158
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique619 ?
Unique (%)51.2%

Sample

1st row110명
2nd row308명
3rd row250명
4th row100명
5th row400명
ValueCountFrequency (%)
100명 36
 
3.0%
200명 27
 
2.2%
180명 22
 
1.8%
150명 21
 
1.7%
250명 17
 
1.4%
500명 17
 
1.4%
300명 14
 
1.2%
120명 13
 
1.1%
130명 12
 
1.0%
450명 12
 
1.0%
Other values (747) 1017
84.2%
2023-12-11T09:18:06.408870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 910
14.8%
2 764
12.4%
634
10.3%
574
9.3%
. 457
7.4%
1 455
7.4%
3 455
7.4%
5 428
7.0%
4 335
 
5.4%
6 302
 
4.9%
Other values (3) 844
13.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4493
73.0%
Other Symbol 634
 
10.3%
Other Letter 574
 
9.3%
Other Punctuation 457
 
7.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 910
20.3%
2 764
17.0%
1 455
10.1%
3 455
10.1%
5 428
9.5%
4 335
 
7.5%
6 302
 
6.7%
7 300
 
6.7%
8 296
 
6.6%
9 248
 
5.5%
Other Symbol
ValueCountFrequency (%)
634
100.0%
Other Letter
ValueCountFrequency (%)
574
100.0%
Other Punctuation
ValueCountFrequency (%)
. 457
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5584
90.7%
Hangul 574
 
9.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 910
16.3%
2 764
13.7%
634
11.4%
. 457
8.2%
1 455
8.1%
3 455
8.1%
5 428
7.7%
4 335
 
6.0%
6 302
 
5.4%
7 300
 
5.4%
Other values (2) 544
9.7%
Hangul
ValueCountFrequency (%)
574
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4950
80.4%
CJK Compat 634
 
10.3%
Hangul 574
 
9.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 910
18.4%
2 764
15.4%
. 457
9.2%
1 455
9.2%
3 455
9.2%
5 428
8.6%
4 335
 
6.8%
6 302
 
6.1%
7 300
 
6.1%
8 296
 
6.0%
CJK Compat
ValueCountFrequency (%)
634
100.0%
Hangul
ValueCountFrequency (%)
574
100.0%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
Minimum2023-07-28 00:00:00
Maximum2023-07-28 00:00:00
2023-12-11T09:18:06.574397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:18:06.693735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-11T09:18:03.753380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:18:03.558784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:18:03.857361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:18:03.654910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:18:06.783704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장구분월배출량(KG)
연번1.0000.2340.000
사업장구분0.2341.0000.276
월배출량(KG)0.0000.2761.000
2023-12-11T09:18:06.894159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번월배출량(KG)사업장구분
연번1.000-0.0300.099
월배출량(KG)-0.0301.0000.216
사업장구분0.0990.2161.000

Missing values

2023-12-11T09:18:03.995724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:18:04.129036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명사업장구분월배출량(KG)규모데이터기준일
01대산초등학교집단급식소700.0110명2023-07-28
12봉림중학교집단급식소1600.0308명2023-07-28
23경남교육연수원집단급식소1000.0250명2023-07-28
34정성푸드집단급식소900.0100명2023-07-28
45(주) 신세계푸드 동서식품창원집단급식소3300.0400명2023-07-28
56북면초등학교집단급식소1500.0976명2023-07-28
67주남오리알일반음식점900.0305.56㎡2023-07-28
78가빈밥상일반음식점900.0255.36㎡2023-07-28
89창원명곡고등학교집단급식소3000.0742명2023-07-28
910고요숲속샘터유치원집단급식소200.0239명2023-07-28
연번사업장명사업장구분월배출량(KG)규모데이터기준일
11981199갈비만찬일반음식점3300.0528㎡2023-07-28
11991200동진여자중학교집단급식소1400.0620명2023-07-28
12001201용원한식뷔페일반음식점1200.0205.4㎡2023-07-28
12011202화로이/바보형제쭈꾸미일반음식점3000.0626.08㎡2023-07-28
12021203육장갈비일반음식점1200.0211㎡2023-07-28
12031204(주)에스피씨지에프에스쿠팡물류창원1센터 구내식당집단급식소2000.05000명2023-07-28
12041205복실이농산창원지사일반음식점4500.0293.6㎡2023-07-28
12051206(주)에스피씨지에프에스쿠팡물류창원2센터 구내식당집단급식소1000.0500명2023-07-28
12061207다나병원집단급식소70.0110명2023-07-28
12071208해원꽃숲유치원집단급식소300.0151명2023-07-28