Overview

Dataset statistics

Number of variables7
Number of observations83
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory60.6 B

Variable types

Numeric3
Text2
Categorical1
DateTime1

Dataset

Description울산광역시 북구에 소재하는 음식물 폐기물 다량배출 사업장 현황에 대한 다양한 데이터입니다. 음식물 폐기물 다량배출 사업장의 상호, 규모, 주소, 업종, 예상배출량 등에 대한 데이터입니다.
Author울산광역시 북구
URLhttps://www.data.go.kr/data/15034306/fileData.do

Alerts

업종 has constant value ""Constant
데이터 기준일 has constant value ""Constant
규모(인) is highly overall correlated with 예상배출량(kg_일)High correlation
예상배출량(kg_일) is highly overall correlated with 규모(인)High correlation
연번 has unique valuesUnique
상호 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2024-04-29 22:26:40.342813
Analysis finished2024-04-29 22:26:43.517228
Duration3.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42
Minimum1
Maximum83
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size879.0 B
2024-04-30T07:26:43.587665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.1
Q121.5
median42
Q362.5
95-th percentile78.9
Maximum83
Range82
Interquartile range (IQR)41

Descriptive statistics

Standard deviation24.103942
Coefficient of variation (CV)0.57390337
Kurtosis-1.2
Mean42
Median Absolute Deviation (MAD)21
Skewness0
Sum3486
Variance581
MonotonicityStrictly increasing
2024-04-30T07:26:43.704621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
54 1
 
1.2%
62 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
56 1
 
1.2%
55 1
 
1.2%
Other values (73) 73
88.0%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
83 1
1.2%
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%
76 1
1.2%
75 1
1.2%
74 1
1.2%

상호
Text

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2024-04-30T07:26:43.878607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length9.0843373
Min length5

Characters and Unicode

Total characters754
Distinct characters179
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row매곡고등학교
2nd row(의)송은의료재단 울산시티병원
3rd row효정중학교
4th row㈜에스티엑스에프앤씨 자동차부품혁신센터
5th row동천고등학교
ValueCountFrequency (%)
㈜현대그린푸드 2
 
2.0%
울산광역시 2
 
2.0%
매곡고등학교 1
 
1.0%
은월초등학교 1
 
1.0%
엄마손푸드[동국실업울산공장 1
 
1.0%
고헌중학교 1
 
1.0%
울산엘리야병원 1
 
1.0%
강북교육지원청 1
 
1.0%
명촌초등학교 1
 
1.0%
호계중학교 1
 
1.0%
Other values (90) 90
88.2%
2024-04-30T07:26:44.181044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
6.5%
48
 
6.4%
34
 
4.5%
23
 
3.1%
22
 
2.9%
20
 
2.7%
19
 
2.5%
19
 
2.5%
18
 
2.4%
16
 
2.1%
Other values (169) 486
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 677
89.8%
Space Separator 20
 
2.7%
Other Symbol 19
 
2.5%
Open Punctuation 14
 
1.9%
Close Punctuation 14
 
1.9%
Uppercase Letter 5
 
0.7%
Decimal Number 3
 
0.4%
Dash Punctuation 1
 
0.1%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
7.2%
48
 
7.1%
34
 
5.0%
23
 
3.4%
22
 
3.2%
19
 
2.8%
18
 
2.7%
16
 
2.4%
15
 
2.2%
13
 
1.9%
Other values (156) 420
62.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
60.0%
S 1
 
20.0%
H 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
[ 11
78.6%
( 3
 
21.4%
Close Punctuation
ValueCountFrequency (%)
] 11
78.6%
) 3
 
21.4%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 696
92.3%
Common 52
 
6.9%
Latin 6
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
7.0%
48
 
6.9%
34
 
4.9%
23
 
3.3%
22
 
3.2%
19
 
2.7%
19
 
2.7%
18
 
2.6%
16
 
2.3%
15
 
2.2%
Other values (157) 433
62.2%
Common
ValueCountFrequency (%)
20
38.5%
[ 11
21.2%
] 11
21.2%
) 3
 
5.8%
( 3
 
5.8%
2 2
 
3.8%
- 1
 
1.9%
1 1
 
1.9%
Latin
ValueCountFrequency (%)
B 3
50.0%
S 1
 
16.7%
H 1
 
16.7%
e 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 677
89.8%
ASCII 58
 
7.7%
None 19
 
2.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
7.2%
48
 
7.1%
34
 
5.0%
23
 
3.4%
22
 
3.2%
19
 
2.8%
18
 
2.7%
16
 
2.4%
15
 
2.2%
13
 
1.9%
Other values (156) 420
62.0%
ASCII
ValueCountFrequency (%)
20
34.5%
[ 11
19.0%
] 11
19.0%
B 3
 
5.2%
) 3
 
5.2%
( 3
 
5.2%
2 2
 
3.4%
- 1
 
1.7%
S 1
 
1.7%
H 1
 
1.7%
Other values (2) 2
 
3.4%
None
ValueCountFrequency (%)
19
100.0%

규모(인)
Real number (ℝ)

HIGH CORRELATION 

Distinct62
Distinct (%)74.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean573.83133
Minimum77
Maximum1800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size879.0 B
2024-04-30T07:26:44.322643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum77
5-th percentile102.7
Q1237.5
median520
Q3811
95-th percentile1262.6
Maximum1800
Range1723
Interquartile range (IQR)573.5

Descriptive statistics

Standard deviation397.41846
Coefficient of variation (CV)0.69257018
Kurtosis0.49293702
Mean573.83133
Median Absolute Deviation (MAD)285
Skewness0.926956
Sum47628
Variance157941.43
MonotonicityNot monotonic
2024-04-30T07:26:44.465645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 4
 
4.8%
130 3
 
3.6%
550 3
 
3.6%
1000 2
 
2.4%
450 2
 
2.4%
200 2
 
2.4%
220 2
 
2.4%
520 2
 
2.4%
270 2
 
2.4%
720 2
 
2.4%
Other values (52) 59
71.1%
ValueCountFrequency (%)
77 1
 
1.2%
100 4
4.8%
127 1
 
1.2%
130 3
3.6%
140 2
2.4%
150 1
 
1.2%
190 1
 
1.2%
200 2
2.4%
206 1
 
1.2%
220 2
2.4%
ValueCountFrequency (%)
1800 1
1.2%
1720 1
1.2%
1500 1
1.2%
1290 1
1.2%
1264 1
1.2%
1250 1
1.2%
1200 1
1.2%
1171 1
1.2%
1100 1
1.2%
1065 1
1.2%

주소
Text

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2024-04-30T07:26:44.672921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length21.771084
Min length17

Characters and Unicode

Total characters1807
Distinct characters101
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row울산광역시 북구 매산로 45(매곡동)
2nd row울산광역시 북구 산업로 1007(연암동)
3rd row울산광역시 북구 염포로 363(양정동)
4th row울산광역시 북구 매곡산업로 35(매곡동)
5th row울산광역시 북구 고래로 54(천곡동)
ValueCountFrequency (%)
울산광역시 83
24.5%
북구 83
24.5%
염포로 6
 
1.8%
산업로 5
 
1.5%
호계로 5
 
1.5%
고래로 3
 
0.9%
화동로 3
 
0.9%
매곡1로 2
 
0.6%
무룡로 2
 
0.6%
모듈화산업로 2
 
0.6%
Other values (137) 145
42.8%
2024-04-30T07:26:44.995698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
270
 
14.9%
110
 
6.1%
96
 
5.3%
84
 
4.6%
83
 
4.6%
83
 
4.6%
83
 
4.6%
83
 
4.6%
83
 
4.6%
) 82
 
4.5%
Other values (91) 750
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1120
62.0%
Space Separator 270
 
14.9%
Decimal Number 242
 
13.4%
Close Punctuation 82
 
4.5%
Open Punctuation 82
 
4.5%
Dash Punctuation 7
 
0.4%
Other Punctuation 2
 
0.1%
Uppercase Letter 1
 
0.1%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
110
 
9.8%
96
 
8.6%
84
 
7.5%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
45
 
4.0%
38
 
3.4%
Other values (74) 332
29.6%
Decimal Number
ValueCountFrequency (%)
1 55
22.7%
3 31
12.8%
2 31
12.8%
4 24
9.9%
5 23
9.5%
0 21
 
8.7%
6 19
 
7.9%
7 14
 
5.8%
9 12
 
5.0%
8 12
 
5.0%
Space Separator
ValueCountFrequency (%)
270
100.0%
Close Punctuation
ValueCountFrequency (%)
) 82
100.0%
Open Punctuation
ValueCountFrequency (%)
( 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1121
62.0%
Common 685
37.9%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
110
 
9.8%
96
 
8.6%
84
 
7.5%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
45
 
4.0%
38
 
3.4%
Other values (75) 333
29.7%
Common
ValueCountFrequency (%)
270
39.4%
) 82
 
12.0%
( 82
 
12.0%
1 55
 
8.0%
3 31
 
4.5%
2 31
 
4.5%
4 24
 
3.5%
5 23
 
3.4%
0 21
 
3.1%
6 19
 
2.8%
Other values (5) 47
 
6.9%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1120
62.0%
ASCII 686
38.0%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
270
39.4%
) 82
 
12.0%
( 82
 
12.0%
1 55
 
8.0%
3 31
 
4.5%
2 31
 
4.5%
4 24
 
3.5%
5 23
 
3.4%
0 21
 
3.1%
6 19
 
2.8%
Other values (6) 48
 
7.0%
Hangul
ValueCountFrequency (%)
110
 
9.8%
96
 
8.6%
84
 
7.5%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
83
 
7.4%
45
 
4.0%
38
 
3.4%
Other values (74) 332
29.6%
None
ValueCountFrequency (%)
1
100.0%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size796.0 B
집단급식소
83 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집단급식소
2nd row집단급식소
3rd row집단급식소
4th row집단급식소
5th row집단급식소

Common Values

ValueCountFrequency (%)
집단급식소 83
100.0%

Length

2024-04-30T07:26:45.111443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:26:45.210557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
집단급식소 83
100.0%

예상배출량(kg_일)
Real number (ℝ)

HIGH CORRELATION 

Distinct49
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116.92012
Minimum16
Maximum500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size879.0 B
2024-04-30T07:26:45.316245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile33.347
Q150
median99
Q3157.5
95-th percentile277.4
Maximum500
Range484
Interquartile range (IQR)107.5

Descriptive statistics

Standard deviation83.338751
Coefficient of variation (CV)0.71278365
Kurtosis4.6190825
Mean116.92012
Median Absolute Deviation (MAD)49
Skewness1.7235118
Sum9704.37
Variance6945.3474
MonotonicityNot monotonic
2024-04-30T07:26:45.444052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
50.0 7
 
8.4%
200.0 6
 
7.2%
100.0 5
 
6.0%
40.0 5
 
6.0%
150.0 4
 
4.8%
60.0 3
 
3.6%
80.0 3
 
3.6%
70.0 2
 
2.4%
160.0 2
 
2.4%
20.0 2
 
2.4%
Other values (39) 44
53.0%
ValueCountFrequency (%)
16.0 1
 
1.2%
20.0 2
 
2.4%
30.0 1
 
1.2%
33.3 1
 
1.2%
33.77 1
 
1.2%
35.0 1
 
1.2%
38.0 1
 
1.2%
40.0 5
6.0%
41.0 1
 
1.2%
45.0 1
 
1.2%
ValueCountFrequency (%)
500.0 1
 
1.2%
320.0 1
 
1.2%
300.0 2
 
2.4%
278.0 1
 
1.2%
272.0 1
 
1.2%
250.0 1
 
1.2%
200.0 6
7.2%
198.0 1
 
1.2%
180.0 2
 
2.4%
170.0 2
 
2.4%

데이터 기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size796.0 B
Minimum2024-04-25 00:00:00
Maximum2024-04-25 00:00:00
2024-04-30T07:26:45.540866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:45.626889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-30T07:26:43.089114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.369229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.759767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:43.172949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.515538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.857600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:43.264721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.661857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:26:42.981165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:26:45.693045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호규모(인)주소예상배출량(kg_일)
연번1.0001.0000.5481.0000.000
상호1.0001.0001.0001.0001.000
규모(인)0.5481.0001.0001.0000.737
주소1.0001.0001.0001.0001.000
예상배출량(kg_일)0.0001.0000.7371.0001.000
2024-04-30T07:26:45.777511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번규모(인)예상배출량(kg_일)
연번1.000-0.417-0.235
규모(인)-0.4171.0000.755
예상배출량(kg_일)-0.2350.7551.000

Missing values

2024-04-30T07:26:43.367032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:26:43.469123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호규모(인)주소업종예상배출량(kg_일)데이터 기준일
01매곡고등학교850울산광역시 북구 매산로 45(매곡동)집단급식소200.02024-04-25
12(의)송은의료재단 울산시티병원1250울산광역시 북구 산업로 1007(연암동)집단급식소278.02024-04-25
23효정중학교554울산광역시 북구 염포로 363(양정동)집단급식소100.02024-04-25
34㈜에스티엑스에프앤씨 자동차부품혁신센터600울산광역시 북구 매곡산업로 35(매곡동)집단급식소130.02024-04-25
45동천고등학교842울산광역시 북구 고래로 54(천곡동)집단급식소300.02024-04-25
56화봉고등학교680울산광역시 북구 화동로 47(화봉동)집단급식소200.02024-04-25
67매곡초등학교1100울산광역시 북구 매곡1로 49(매곡동)집단급식소145.02024-04-25
78연암중학교230울산광역시 북구 사청1길 12(연암동)집단급식소60.02024-04-25
89양정초등학교645울산광역시 북구 양정2길 11(양정동)집단급식소100.02024-04-25
910화봉초등학교780울산광역시 북구 화산로 42(화봉동)집단급식소80.02024-04-25
연번상호규모(인)주소업종예상배출량(kg_일)데이터 기준일
7374㈜동원홈푸드[금강기계공업]140울산광역시 북구 효암로 164(효문동)집단급식소41.02024-04-25
7475㈜에이치에프앤에스[SHB울산공장]220울산광역시 북구 염포로 272-1(효문동)집단급식소38.02024-04-25
7576웰빙푸드[엔브이에이치코리아㈜]550울산광역시 북구 이화산업1길 9(중산동)집단급식소165.02024-04-25
7677큰사랑유치원206울산광역시 북구 제내2길 45-4(신천동)집단급식소40.02024-04-25
7778㈜풀무원푸드앤컬처-현대차시트 2식당450울산광역시 북구 율동2길 13(효문동)집단급식소180.02024-04-25
7879㈜동원홈푸드 엘림종합복지센터320울산광역시 북구 농서로 71-30(상안동)집단급식소100.02024-04-25
7980북울산병원130울산광역시 북구 신답로 37(상안동)집단급식소79.82024-04-25
8081의료법인 월촌의료재단235울산광역시 북구 당수골5길41-6(호계동)집단급식소33.32024-04-25
8182㈜한솔[㈜베스틱]140울산광역시 북구 모듈화산업로 24(효문동)집단급식소40.02024-04-25
8283울산북구 공공산후조리원100울산광역시 북구 호계매곡5로60집단급식소100.02024-04-25