Overview

Dataset statistics

Number of variables4
Number of observations129
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory35.0 B

Variable types

Text1
Numeric2
DateTime1

Dataset

Description부산광역시 사하구 음식물폐기물다량배출사업장 현황에 대한 데이터로 사업장명, 급식인원, 하루 배출량 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15034247/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
급식인원 is highly overall correlated with 배출량High correlation
배출량 is highly overall correlated with 급식인원High correlation
사업장명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:43:01.859518
Analysis finished2023-12-12 15:43:02.671467
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

UNIQUE 

Distinct129
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T00:43:02.853734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length8.5891473
Min length3

Characters and Unicode

Total characters1108
Distinct characters227
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)100.0%

Sample

1st row감천문화요양병원
2nd row을숙도초등학교
3rd row삼성여자고등학교
4th row감천중학교
5th row부일외국어고등학교
ValueCountFrequency (%)
㈜풀무원푸드앤컬처 4
 
2.4%
의료법인 3
 
1.8%
㈜새손 2
 
1.2%
㈜아워홈 2
 
1.2%
주식회사 2
 
1.2%
구내식당 2
 
1.2%
허브휴양 1
 
0.6%
한방병원 1
 
0.6%
늘사랑요양병원 1
 
0.6%
동주여자중학교 1
 
0.6%
Other values (146) 146
88.5%
2023-12-13T00:43:03.300740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
 
5.1%
54
 
4.9%
50
 
4.5%
39
 
3.5%
37
 
3.3%
35
 
3.2%
26
 
2.3%
22
 
2.0%
20
 
1.8%
18
 
1.6%
Other values (217) 751
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1000
90.3%
Space Separator 37
 
3.3%
Other Symbol 22
 
2.0%
Close Punctuation 15
 
1.4%
Open Punctuation 14
 
1.3%
Decimal Number 7
 
0.6%
Uppercase Letter 7
 
0.6%
Other Punctuation 5
 
0.5%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
5.6%
54
 
5.4%
50
 
5.0%
39
 
3.9%
35
 
3.5%
26
 
2.6%
20
 
2.0%
18
 
1.8%
16
 
1.6%
16
 
1.6%
Other values (197) 670
67.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
14.3%
F 1
14.3%
C 1
14.3%
N 1
14.3%
I 1
14.3%
W 1
14.3%
S 1
14.3%
Decimal Number
ValueCountFrequency (%)
6 2
28.6%
4 1
14.3%
0 1
14.3%
7 1
14.3%
3 1
14.3%
1 1
14.3%
Other Punctuation
ValueCountFrequency (%)
& 3
60.0%
: 2
40.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Other Symbol
ValueCountFrequency (%)
22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1022
92.2%
Common 79
 
7.1%
Latin 7
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
5.5%
54
 
5.3%
50
 
4.9%
39
 
3.8%
35
 
3.4%
26
 
2.5%
22
 
2.2%
20
 
2.0%
18
 
1.8%
16
 
1.6%
Other values (198) 686
67.1%
Common
ValueCountFrequency (%)
37
46.8%
) 15
19.0%
( 14
 
17.7%
& 3
 
3.8%
6 2
 
2.5%
: 2
 
2.5%
4 1
 
1.3%
0 1
 
1.3%
7 1
 
1.3%
3 1
 
1.3%
Other values (2) 2
 
2.5%
Latin
ValueCountFrequency (%)
B 1
14.3%
F 1
14.3%
C 1
14.3%
N 1
14.3%
I 1
14.3%
W 1
14.3%
S 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1000
90.3%
ASCII 86
 
7.8%
None 22
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
56
 
5.6%
54
 
5.4%
50
 
5.0%
39
 
3.9%
35
 
3.5%
26
 
2.6%
20
 
2.0%
18
 
1.8%
16
 
1.6%
16
 
1.6%
Other values (197) 670
67.0%
ASCII
ValueCountFrequency (%)
37
43.0%
) 15
17.4%
( 14
 
16.3%
& 3
 
3.5%
6 2
 
2.3%
: 2
 
2.3%
4 1
 
1.2%
0 1
 
1.2%
B 1
 
1.2%
F 1
 
1.2%
Other values (9) 9
 
10.5%
None
ValueCountFrequency (%)
22
100.0%

급식인원
Real number (ℝ)

HIGH CORRELATION 

Distinct74
Distinct (%)57.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean520.83419
Minimum100
Maximum3016.58
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T00:43:03.457831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile107
Q1200
median440
Q3670
95-th percentile1212
Maximum3016.58
Range2916.58
Interquartile range (IQR)470

Descriptive statistics

Standard deviation425.09584
Coefficient of variation (CV)0.81618268
Kurtosis9.2009343
Mean520.83419
Median Absolute Deviation (MAD)240
Skewness2.3052918
Sum67187.61
Variance180706.47
MonotonicityNot monotonic
2023-12-13T00:43:03.615873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200.0 7
 
5.4%
150.0 6
 
4.7%
100.0 6
 
4.7%
250.0 6
 
4.7%
450.0 5
 
3.9%
120.0 4
 
3.1%
900.0 4
 
3.1%
650.0 4
 
3.1%
400.0 3
 
2.3%
600.0 3
 
2.3%
Other values (64) 81
62.8%
ValueCountFrequency (%)
100.0 6
4.7%
105.0 1
 
0.8%
110.0 2
 
1.6%
120.0 4
3.1%
130.0 1
 
0.8%
140.0 3
2.3%
150.0 6
4.7%
160.0 2
 
1.6%
170.0 2
 
1.6%
190.0 1
 
0.8%
ValueCountFrequency (%)
3016.58 1
0.8%
1900.0 1
0.8%
1768.0 1
0.8%
1500.0 2
1.6%
1253.35 1
0.8%
1240.0 1
0.8%
1170.0 1
0.8%
1150.0 1
0.8%
1090.0 1
0.8%
1055.0 1
0.8%

배출량
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)34.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.192248
Minimum0
Maximum360
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T00:43:03.790605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16
Q140
median73
Q3120
95-th percentile200
Maximum360
Range360
Interquartile range (IQR)80

Descriptive statistics

Standard deviation61.449186
Coefficient of variation (CV)0.73864077
Kurtosis3.7883611
Mean83.192248
Median Absolute Deviation (MAD)38
Skewness1.6020817
Sum10731.8
Variance3776.0024
MonotonicityNot monotonic
2023-12-13T00:43:03.958565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
120.0 15
 
11.6%
50.0 9
 
7.0%
100.0 9
 
7.0%
90.0 9
 
7.0%
30.0 8
 
6.2%
70.0 7
 
5.4%
80.0 6
 
4.7%
20.0 6
 
4.7%
60.0 6
 
4.7%
40.0 6
 
4.7%
Other values (35) 48
37.2%
ValueCountFrequency (%)
0.0 1
 
0.8%
3.0 1
 
0.8%
4.0 1
 
0.8%
6.5 1
 
0.8%
10.0 2
 
1.6%
16.0 2
 
1.6%
18.0 2
 
1.6%
20.0 6
4.7%
21.0 1
 
0.8%
25.0 2
 
1.6%
ValueCountFrequency (%)
360.0 1
 
0.8%
298.0 1
 
0.8%
250.0 3
2.3%
200.0 3
2.3%
190.0 2
1.6%
180.0 1
 
0.8%
170.0 1
 
0.8%
160.0 1
 
0.8%
140.0 2
1.6%
133.0 1
 
0.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2023-04-19 00:00:00
Maximum2023-04-19 00:00:00
2023-12-13T00:43:04.104421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:43:04.559633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T00:43:02.249220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:43:02.038089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:43:02.378200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:43:02.156585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:43:04.644514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
급식인원배출량
급식인원1.0000.757
배출량0.7571.000
2023-12-13T00:43:04.747335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
급식인원배출량
급식인원1.0000.623
배출량0.6231.000

Missing values

2023-12-13T00:43:02.537685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:43:02.635403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명급식인원배출량데이터기준일자
0감천문화요양병원150.050.02023-04-19
1을숙도초등학교760.0100.02023-04-19
2삼성여자고등학교1768.0298.02023-04-19
3감천중학교340.0100.02023-04-19
4부일외국어고등학교1900.0125.02023-04-19
5조이 효 요양병원540.095.02023-04-19
6중앙유병원400.0120.02023-04-19
7한국남부발전㈜) 부산발전본부 구내식당120.018.02023-04-19
8옥천초등학교680.0120.02023-04-19
9장평중학교450.0120.02023-04-19
사업장명급식인원배출량데이터기준일자
119㈜아워홈 동진섬유부산점100.020.02023-04-19
120서천초등학교240.050.02023-04-19
121하가람739.0630.02023-04-19
122의료법인교통문화의료재단 우리미소요양병원600.090.02023-04-19
123㈜풀무원푸드앤컬처 창신INC900.0180.02023-04-19
124㈜풀무원푸드앤컬처 서흥 직원식당170.060.02023-04-19
125㈜새손 강남조선소점450.0200.02023-04-19
126푸디스트 주식회사 부산자생한방병원점110.030.02023-04-19
127청솔F&B(삼림식품)100.030.02023-04-19
128부산광역시교육청유아교육진흥원280.035.02023-04-19