Overview

Dataset statistics

Number of variables: 5
Number of observations: 23
Missing cells: 4
Missing cells (%): 3.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 48.7 B
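The missing-cell percentage follows from the counts above: 4 missing cells out of 5 × 23 = 115 cells. A minimal Python check of that arithmetic (the variable names are illustrative):

```python
# Counts taken from the dataset statistics above.
n_variables = 5
n_observations = 23
missing_cells = 4

total_cells = n_variables * n_observations        # every column/row combination
missing_pct = 100 * missing_cells / total_cells   # share of cells that are missing

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```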

Variable types

Categorical: 3
Numeric: 2

Dataset

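Description: Information on orders and sales of volume-based waste bags (종량제봉투) in Gyeyang-gu, Incheon Metropolitan City. It provides the year, bag type, bag material, bag capacity, annual sales volume, and related fields.
Author: Gyeyang-gu, Incheon Metropolitan City (인천광역시 계양구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15084345&srcSe=7661IVAWM27C61E190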

Alerts

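연도 (year) has constant value "2023" [Constant]
봉투용량(L) (bag capacity, L) is highly overall correlated with 봉투재질 (bag material) [High correlation]
봉투구분 (bag type) is highly overall correlated with 봉투재질 (bag material) [High correlation]
봉투재질 (bag material) is highly overall correlated with 봉투용량(L) (bag capacity, L) and 1 other field [High correlation]
봉투용량(L) (bag capacity, L) has 4 (17.4%) missing values [Missing]
연간 판매량 (annual sales volume) has unique values [Unique]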
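The Constant, Missing and Unique alerts can be read off the per-column distinct and missing counts reported in the Variables section. The sketch below uses simplified rules that are assumptions for illustration, not ydata-profiling's actual implementation, and it does not cover the High correlation alerts. `simple_alerts` is a hypothetical helper.

```python
# Per-column counts as reported in the Variables section of this report.
N_ROWS = 23
columns = {
    "연도": {"distinct": 1, "missing": 0},
    "봉투구분": {"distinct": 6, "missing": 0},
    "봉투재질": {"distinct": 3, "missing": 0},
    "봉투용량(L)": {"distinct": 10, "missing": 4},
    "연간 판매량": {"distinct": 23, "missing": 0},
}


def simple_alerts(stats, n_rows):
    """Return alert labels under simplified, assumed rules (not the library's own)."""
    labels = []
    if stats["distinct"] == 1:
        labels.append("Constant")   # a single value in every row
    if stats["missing"] > 0:
        labels.append("Missing")    # at least one missing cell
    if stats["missing"] == 0 and stats["distinct"] == n_rows:
        labels.append("Unique")     # every row holds a different value
    return labels


alerts = {name: simple_alerts(stats, N_ROWS) for name, stats in columns.items()}
for name, labels in alerts.items():
    print(name, labels)
```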

Reproduction

Analysis started: 2024-03-13 06:05:23.350758
Analysis finished: 2024-03-13 06:05:23.991112
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
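A report of this kind can be regenerated with ydata-profiling along the following lines. This is a sketch under stated assumptions: the CSV file name, its encoding, the report title and the output path are hypothetical, and `build_report` is an illustrative helper rather than part of the library. The `dataset` metadata mirrors the Dataset section above.

```python
from pathlib import Path

# Dataset metadata copied from the "Dataset" section of this report.
DATASET_INFO = {
    "description": "Gyeyang-gu volume-based waste bag order and sales data",
    "author": "인천광역시 계양구",
    "url": "https://data.incheon.go.kr/findData/publicDataDetail"
           "?dataId=15084345&srcSe=7661IVAWM27C61E190",
}


def build_report(csv_path, out_path="report.html", encoding="utf-8"):
    """Profile the CSV at csv_path and write an HTML report to out_path."""
    # Imported here so this module loads even without the optional dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    profile = ProfileReport(df, title="Waste bag sales profile", dataset=DATASET_INFO)
    profile.to_file(Path(out_path))
    return profile
```

Calling `build_report("bags_2023.csv")` with a local copy of the data (the file name is hypothetical) would write `report.html` to the working directory.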

Variables

연도 (year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023
2nd row: 2023
3rd row: 2023
4th row: 2023
5th row: 2023

Common Values

Value | Count | Frequency (%)
2023 | 23 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

봉투구분 (bag type)
Categorical

HIGH CORRELATION

Distinct: 6
Distinct (%): 26.1%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 6
Median length: 3
Mean length: 3.5652174
Min length: 2
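The length statistics can be reproduced from the Common Values counts for this variable, taking one string length per row. A small Python sketch, assuming length is measured in characters (which matches the reported figures):

```python
from statistics import mean, median

# Category counts from the Common Values table for 봉투구분.
counts = {
    "음식물(신)": 6,
    "일반": 5,
    "불연성": 4,
    "스티커": 4,
    "사업계": 2,
    "재사용": 2,
}

# One length per row: repeat each category's character count by its frequency.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print("max:", max(lengths))
print("median:", median(lengths))
print("mean:", round(mean(lengths), 7))
print("min:", min(lengths))
```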

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반 (general)
2nd row: 일반 (general)
3rd row: 일반 (general)
4th row: 일반 (general)
5th row: 일반 (general)

Common Values

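Value | Count | Frequency (%)
음식물(신) (food waste, new) | 6 | 26.1%
일반 (general) | 5 | 21.7%
불연성 (non-combustible) | 4 | 17.4%
스티커 (sticker) | 4 | 17.4%
사업계 (business) | 2 | 8.7%
재사용 (reusable) | 2 | 8.7%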

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

봉투재질 (bag material)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

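Max length: 6
Median length: 3
Mean length: 4.0434783
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고밀도 (high-density)
2nd row: 고밀도 (high-density)
3rd row: 고밀도 (high-density)
4th row: 선형저밀도 (linear low-density)
5th row: 선형저밀도 (linear low-density)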

Common Values

Value | Count | Frequency (%)
고밀도 (high-density) | 13 | 56.5%
선형저밀도 (linear low-density) | 6 | 26.1%
폐기물스티커 (waste sticker) | 4 | 17.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

봉투용량(L) (bag capacity, L)
Real number (ℝ)

HIGH CORRELATION, MISSING

Distinct: 10
Distinct (%): 52.6%
Missing: 4
Missing (%): 17.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.894737
Minimum: 1
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1.9
Q1: 5
Median: 10
Q3: 50
95-th percentile: 102.5
Maximum: 125
Range: 124
Interquartile range (IQR): 45
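These quantiles can be recovered from the value counts reported for this variable. The sketch below expands the counts into the 19 non-missing values and applies linear interpolation between neighbouring sorted values, which is an assumption about the method that happens to reproduce the reported figures. `percentile` is an illustrative helper.

```python
import math

# The 19 non-missing 봉투용량(L) values, expanded from the value/count table.
value_counts = {1: 1, 2: 1, 3: 1, 5: 3, 10: 4, 20: 3, 50: 2, 60: 1, 100: 2, 125: 1}
values = sorted(v for v, n in value_counts.items() for _ in range(n))


def percentile(sorted_values, q):
    """Linearly interpolated percentile of pre-sorted data, q in [0, 100]."""
    pos = (len(sorted_values) - 1) * q / 100   # fractional index into the data
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


q1, med, q3 = (percentile(values, q) for q in (25, 50, 75))
print("5th percentile:", round(percentile(values, 5), 1))
print("Q1, median, Q3:", q1, med, q3)
print("95th percentile:", round(percentile(values, 95), 1))
print("IQR:", q3 - q1)
```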

Descriptive statistics

Standard deviation: 38.431606
Coefficient of variation (CV): 1.2049513
Kurtosis: 0.83703999
Mean: 31.894737
Median Absolute Deviation (MAD): 9
Skewness: 1.4080624
Sum: 606
Variance: 1476.9883
Monotonicity: Not monotonic
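Most of the descriptive statistics can be checked against the 19 non-missing values implied by the value counts for this variable. The sketch below assumes the sample (n − 1) forms of variance and standard deviation and a bias-adjusted skewness, since those are the forms that match the reported numbers. Kurtosis is left out.

```python
from statistics import mean, median, stdev, variance

# The 19 non-missing 봉투용량(L) values, expanded from the value/count table.
value_counts = {1: 1, 2: 1, 3: 1, 5: 3, 10: 4, 20: 3, 50: 2, 60: 1, 100: 2, 125: 1}
values = [float(v) for v, n in value_counts.items() for _ in range(n)]

n = len(values)
total = sum(values)
avg = mean(values)
var = variance(values)   # sample variance (divides by n - 1)
std = stdev(values)      # sample standard deviation
cv = std / avg           # coefficient of variation
med = median(values)
mad = median(abs(v - med) for v in values)   # median absolute deviation
# Bias-adjusted sample skewness (assumed form; it matches the reported value).
skew = n / ((n - 1) * (n - 2)) * sum(((v - avg) / std) ** 3 for v in values)

print(f"sum={total:.0f} mean={avg:.6f} var={var:.4f} std={std:.6f}")
print(f"cv={cv:.7f} mad={mad:.0f} skew={skew:.4f}")
```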
[Figure: Histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
10 | 4 | 17.4%
5 | 3 | 13.0%
20 | 3 | 13.0%
50 | 2 | 8.7%
100 | 2 | 8.7%
60 | 1 | 4.3%
125 | 1 | 4.3%
1 | 1 | 4.3%
2 | 1 | 4.3%
3 | 1 | 4.3%
(Missing) | 4 | 17.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
3 | 1 | 4.3%
5 | 3 | 13.0%
10 | 4 | 17.4%
20 | 3 | 13.0%
50 | 2 | 8.7%
60 | 1 | 4.3%
100 | 2 | 8.7%
125 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
125 | 1 | 4.3%
100 | 2 | 8.7%
60 | 1 | 4.3%
50 | 2 | 8.7%
20 | 3 | 13.0%
10 | 4 | 17.4%
5 | 3 | 13.0%
3 | 1 | 4.3%
2 | 1 | 4.3%
1 | 1 | 4.3%

연간 판매량 (annual sales volume)
Real number (ℝ)

UNIQUE

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 386060.13
Minimum: 5
Maximum: 2085180
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 5
5-th percentile: 837.8
Q1: 34806.5
Median: 102310
Q3: 561000
95-th percentile: 1359550.6
Maximum: 2085180
Range: 2085175
Interquartile range (IQR): 526193.5

Descriptive statistics

Standard deviation: 561017.17
Coefficient of variation (CV): 1.453186
Kurtosis: 2.8111946
Mean: 386060.13
Median Absolute Deviation (MAD): 100277
Skewness: 1.8052946
Sum: 8879383
Variance: 3.1474026 × 10^11
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=23)]

Common values

Value | Count | Frequency (%)
335765 | 1 | 4.3%
1365017 | 1 | 4.3%
109749 | 1 | 4.3%
142320 | 1 | 4.3%
262860 | 1 | 4.3%
540360 | 1 | 4.3%
581640 | 1 | 4.3%
102310 | 1 | 4.3%
22935 | 1 | 4.3%
48114 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
5 | 1 | 4.3%
705 | 1 | 4.3%
2033 | 1 | 4.3%
4415 | 1 | 4.3%
15370 | 1 | 4.3%
22935 | 1 | 4.3%
46678 | 1 | 4.3%
48114 | 1 | 4.3%
49445 | 1 | 4.3%
65164 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
2085180 | 1 | 4.3%
1365017 | 1 | 4.3%
1310353 | 1 | 4.3%
1058851 | 1 | 4.3%
656111 | 1 | 4.3%
581640 | 1 | 4.3%
540360 | 1 | 4.3%
335765 | 1 | 4.3%
262860 | 1 | 4.3%
142320 | 1 | 4.3%
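Because every value of 연간 판매량 is unique, the full column of 23 values can be pieced together from this report: the ten smallest values, the ten largest, and three remaining values (102310 and 109749 from the most common values, 74003 from row 9 of the sample rows). The sketch below assumes that reading of the flattened tables and checks it against the reported sum, mean and quartiles, using the standard library's inclusive quantile method.

```python
from statistics import mean, quantiles

smallest_10 = [5, 705, 2033, 4415, 15370, 22935, 46678, 48114, 49445, 65164]
largest_10 = [2085180, 1365017, 1310353, 1058851, 656111,
              581640, 540360, 335765, 262860, 142320]
# Values that appear in neither extreme list (see the lead-in for their origin).
middle_3 = [102310, 109749, 74003]

sales = sorted(smallest_10 + largest_10 + middle_3)

# Quartiles with linear interpolation over the observed range.
q1, med, q3 = quantiles(sales, n=4, method="inclusive")

print("count:", len(sales), "sum:", sum(sales))
print("mean:", round(mean(sales), 2))
print("Q1, median, Q3:", q1, med, q3)
print("IQR:", q3 - q1)
```

The reassembled values add up to the reported sum of 8879383, which supports the reading of the three middle values.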

Interactions

[Figure: interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 봉투구분 | 봉투재질 | 봉투용량(L) | 연간 판매량
봉투구분 (bag type) | 1.000 | 0.976 | 0.463 | 0.608
봉투재질 (bag material) | 0.976 | 1.000 | 1.000 | 0.000
봉투용량(L) (bag capacity, L) | 0.463 | 1.000 | 1.000 | 0.000
연간 판매량 (annual sales volume) | 0.608 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 봉투재질 | 봉투구분
봉투재질 (bag material) | 1.000 | 0.745
봉투구분 (bag type) | 0.745 | 1.000

[Figure: correlation heatmap]

Variable | 봉투용량(L) | 연간 판매량 | 봉투구분 | 봉투재질
봉투용량(L) (bag capacity, L) | 1.000 | -0.316 | 0.297 | 0.874
연간 판매량 (annual sales volume) | -0.316 | 1.000 | 0.390 | 0.000
봉투구분 (bag type) | 0.297 | 0.390 | 1.000 | 0.745
봉투재질 (bag material) | 0.874 | 0.000 | 0.745 | 1.000

Missing values

[Figure: nullity count] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
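The nullity pattern for 봉투용량(L) can also be read directly from the rows listed in the Sample section. The sketch below assumes those 20 displayed rows (indices 0 to 9 and 13 to 22) as input, with `None` standing in for `<NA>`. Rows 10 to 12 are not displayed in the report, so they are absent here.

```python
# 봉투용량(L) by row index, taken from the displayed sample rows.
capacity_by_row = {
    0: 5, 1: 10, 2: 20, 3: 50, 4: 100,
    5: 5, 6: 10, 7: 50, 8: 100, 9: 60,
    13: None, 14: None, 15: None, 16: None,
    17: 1, 18: 2, 19: 3, 20: 5, 21: 10, 22: 20,
}
N_ROWS = 23  # number of observations reported for the full dataset

# Row indices where the capacity is missing.
missing_rows = [row for row, value in capacity_by_row.items() if value is None]
missing_pct = 100 * len(missing_rows) / N_ROWS

print("missing rows:", missing_rows)
print(f"missing share: {missing_pct:.1f}%")
```

All four missing values fall in rows 13 to 16, the 스티커 (sticker) rows, which accounts for the full count of 4 missing cells reported for this column.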

Sample

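First rows

Row | 연도 (year) | 봉투구분 (bag type) | 봉투재질 (bag material) | 봉투용량(L) (bag capacity, L) | 연간 판매량 (annual sales volume)
0 | 2023 | 일반 (general) | 고밀도 (high-density) | 5 | 335765
1 | 2023 | 일반 (general) | 고밀도 (high-density) | 10 | 1365017
2 | 2023 | 일반 (general) | 고밀도 (high-density) | 20 | 1310353
3 | 2023 | 일반 (general) | 선형저밀도 (linear low-density) | 50 | 1058851
4 | 2023 | 일반 (general) | 선형저밀도 (linear low-density) | 100 | 5
5 | 2023 | 불연성 (non-combustible) | 고밀도 (high-density) | 5 | 2033
6 | 2023 | 불연성 (non-combustible) | 고밀도 (high-density) | 10 | 15370
7 | 2023 | 불연성 (non-combustible) | 선형저밀도 (linear low-density) | 50 | 4415
8 | 2023 | 불연성 (non-combustible) | 선형저밀도 (linear low-density) | 100 | 705
9 | 2023 | 사업계 (business) | 선형저밀도 (linear low-density) | 60 | 74003

Last rows

Row | 연도 (year) | 봉투구분 (bag type) | 봉투재질 (bag material) | 봉투용량(L) (bag capacity, L) | 연간 판매량 (annual sales volume)
13 | 2023 | 스티커 (sticker) | 폐기물스티커 (waste sticker) | <NA> | 65164
14 | 2023 | 스티커 (sticker) | 폐기물스티커 (waste sticker) | <NA> | 49445
15 | 2023 | 스티커 (sticker) | 폐기물스티커 (waste sticker) | <NA> | 48114
16 | 2023 | 스티커 (sticker) | 폐기물스티커 (waste sticker) | <NA> | 22935
17 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 1 | 102310
18 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 2 | 581640
19 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 3 | 540360
20 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 5 | 262860
21 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 10 | 142320
22 | 2023 | 음식물(신) (food waste, new) | 고밀도 (high-density) | 20 | 109749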