Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Categorical7
Numeric1

Dataset

Description남동구에서 관리하는 쓰레기봉투물류관리시스템 과거봉투단가에 대한 데이터로 봉투종류, 수수료, 수수료구분, 지역코드, 생성일자, 구분, 사용여부, 유효기간 항목을 제공합니다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15116942&srcSe=7661IVAWM27C61E190

Alerts

수수료단가 has constant value ""Constant
지역코드명 has constant value ""Constant
유효기간 is highly overall correlated with 생성일자High correlation
생성일자 is highly overall correlated with 유효기간High correlation
수수료 is highly overall correlated with 수수료구분High correlation
봉투종류 is highly overall correlated with 사용여부High correlation
수수료구분 is highly overall correlated with 수수료High correlation
사용여부 is highly overall correlated with 봉투종류High correlation
사용여부 is highly imbalanced (91.9%)Imbalance

Reproduction

Analysis started2024-01-28 15:49:15.950366
Analysis finished2024-01-28 15:49:16.422321
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

봉투종류
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사업계용125L
 
5
사업계용 60L
 
5
사업계용 30L
 
5
3000 원권
 
4
15000 원권
 
4
Other values (26)
77 

Length

Max length11
Median length10
Mean length8.43
Min length6

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row1000 원권
2nd row3000 원권
3rd row일반용 5L
4th row일반용 10L
5th row일반용 20L

Common Values

ValueCountFrequency (%)
사업계용125L 5
 
5.0%
사업계용 60L 5
 
5.0%
사업계용 30L 5
 
5.0%
3000 원권 4
 
4.0%
15000 원권 4
 
4.0%
10000 원권 4
 
4.0%
5000 원권 4
 
4.0%
1000 원권 4
 
4.0%
일반용(저) 100L 3
 
3.0%
일반용(스) 10L 3
 
3.0%
Other values (21) 59
59.0%

Length

2024-01-29T00:49:16.480205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
원권 20
 
10.3%
10l 15
 
7.7%
20l 15
 
7.7%
일반용(스 15
 
7.7%
음식물(스 14
 
7.2%
음식물 14
 
7.2%
일반용 13
 
6.7%
5l 12
 
6.2%
사업계용 10
 
5.1%
50l 7
 
3.6%
Other values (13) 60
30.8%

수수료
Real number (ℝ)

HIGH CORRELATION 

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.412
Minimum1.3
Maximum819
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-01-29T00:49:16.580796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.3
5-th percentile1.5
Q15
median11
Q391
95-th percentile819
Maximum819
Range817.7
Interquartile range (IQR)86

Descriptive statistics

Standard deviation217.08541
Coefficient of variation (CV)1.9311587
Kurtosis5.0851202
Mean112.412
Median Absolute Deviation (MAD)9.6
Skewness2.4673667
Sum11241.2
Variance47126.076
MonotonicityNot monotonic
2024-01-29T00:49:16.688802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
5.0 15
15.0%
11.0 15
15.0%
3.0 12
12.0%
27.0 7
 
7.0%
819.0 6
 
6.0%
54.0 6
 
6.0%
1.5 6
 
6.0%
1.3 4
 
4.0%
47.0 3
 
3.0%
94.0 3
 
3.0%
Other values (11) 23
23.0%
ValueCountFrequency (%)
1.3 4
 
4.0%
1.5 6
 
6.0%
3.0 12
12.0%
5.0 15
15.0%
11.0 15
15.0%
19.0 2
 
2.0%
27.0 7
7.0%
47.0 3
 
3.0%
54.0 6
 
6.0%
55.0 2
 
2.0%
ValueCountFrequency (%)
819.0 6
6.0%
645.0 2
 
2.0%
410.0 2
 
2.0%
390.0 2
 
2.0%
246.0 2
 
2.0%
195.0 2
 
2.0%
182.0 2
 
2.0%
157.0 3
3.0%
94.0 3
3.0%
91.0 2
 
2.0%

수수료구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
84 
2
16 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 84
84.0%
2 16
 
16.0%

Length

2024-01-29T00:49:16.788225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:16.860684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 84
84.0%
2 16
 
16.0%

수수료단가
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2024-01-29T00:49:16.944935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:17.013171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

지역코드명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
남동구판매소 코드
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남동구판매소 코드
2nd row남동구판매소 코드
3rd row남동구판매소 코드
4th row남동구판매소 코드
5th row남동구판매소 코드

Common Values

ValueCountFrequency (%)
남동구판매소 코드 100
100.0%

Length

2024-01-29T00:49:17.079306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:17.142289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남동구판매소 100
50.0%
코드 100
50.0%

생성일자
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2009-06-09
39 
2015-06-30
38 
2005-03-14
23 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2009-06-09
2nd row2009-06-09
3rd row2009-06-09
4th row2009-06-09
5th row2009-06-09

Common Values

ValueCountFrequency (%)
2009-06-09 39
39.0%
2015-06-30 38
38.0%
2005-03-14 23
23.0%

Length

2024-01-29T00:49:17.210778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:17.283165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2009-06-09 39
39.0%
2015-06-30 38
38.0%
2005-03-14 23
23.0%

사용여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사용
99 
사용안함
 
1

Length

Max length4
Median length2
Mean length2.02
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row사용
2nd row사용
3rd row사용
4th row사용
5th row사용

Common Values

ValueCountFrequency (%)
사용 99
99.0%
사용안함 1
 
1.0%

Length

2024-01-29T00:49:17.368290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:17.442935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용 99
99.0%
사용안함 1
 
1.0%

유효기간
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2009-06-08
39 
2015-06-29
38 
2005-03-13
14 
2009-06-01

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2009-06-08
2nd row2009-06-08
3rd row2009-06-08
4th row2009-06-08
5th row2009-06-08

Common Values

ValueCountFrequency (%)
2009-06-08 39
39.0%
2015-06-29 38
38.0%
2005-03-13 14
 
14.0%
2009-06-01 9
 
9.0%

Length

2024-01-29T00:49:17.514964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:49:17.585190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2009-06-08 39
39.0%
2015-06-29 38
38.0%
2005-03-13 14
 
14.0%
2009-06-01 9
 
9.0%

Interactions

2024-01-29T00:49:16.208176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T00:49:17.643480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
봉투종류수수료수수료구분생성일자사용여부유효기간
봉투종류1.0000.8530.3440.0001.0000.000
수수료0.8531.0000.9490.0000.0000.000
수수료구분0.3440.9491.0000.1170.0000.251
생성일자0.0000.0000.1171.0000.0001.000
사용여부1.0000.0000.0000.0001.0000.000
유효기간0.0000.0000.2511.0000.0001.000
2024-01-29T00:49:17.725839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료구분유효기간생성일자사용여부봉투종류
수수료구분1.0000.1640.1930.0000.240
유효기간0.1641.0000.9950.0000.000
생성일자0.1930.9951.0000.0000.000
사용여부0.0000.0000.0001.0000.839
봉투종류0.2400.0000.0000.8391.000
2024-01-29T00:49:17.800690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료봉투종류수수료구분생성일자사용여부유효기간
수수료1.0000.4990.7830.0000.0000.000
봉투종류0.4991.0000.2400.0000.8390.000
수수료구분0.7830.2401.0000.1930.0000.164
생성일자0.0000.0000.1931.0000.0000.995
사용여부0.0000.8390.0000.0001.0000.000
유효기간0.0000.0000.1640.9950.0001.000

Missing values

2024-01-29T00:49:16.298521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T00:49:16.386976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

봉투종류수수료수수료구분수수료단가지역코드명생성일자사용여부유효기간
01000 원권19.000남동구판매소 코드2009-06-09사용2009-06-08
13000 원권55.000남동구판매소 코드2009-06-09사용2009-06-08
2일반용 5L3.000남동구판매소 코드2009-06-09사용2009-06-08
3일반용 10L5.000남동구판매소 코드2009-06-09사용2009-06-08
4일반용 20L11.000남동구판매소 코드2009-06-09사용2009-06-08
5일반용 50L27.000남동구판매소 코드2009-06-09사용2009-06-08
6일반용(저) 100L54.000남동구판매소 코드2009-06-09사용2009-06-08
7사업계용 30L47.000남동구판매소 코드2009-06-09사용2009-06-08
8사업계용 60L94.000남동구판매소 코드2009-06-09사용2009-06-08
9사업계용125L157.000남동구판매소 코드2009-06-09사용2009-06-08
봉투종류수수료수수료구분수수료단가지역코드명생성일자사용여부유효기간
90재사용 20L11.000남동구판매소 코드2015-06-30사용2015-06-29
91사업계용 30L195.020남동구판매소 코드2015-06-30사용2015-06-29
92사업계용 60L390.020남동구판매소 코드2015-06-30사용2015-06-29
93사업계용125L645.020남동구판매소 코드2015-06-30사용2015-06-29
941000 원권82.020남동구판매소 코드2015-06-30사용2015-06-29
953000 원권246.020남동구판매소 코드2015-06-30사용2015-06-29
965000 원권410.020남동구판매소 코드2015-06-30사용2015-06-29
9710000 원권819.020남동구판매소 코드2015-06-30사용2015-06-29
9815000 원권819.020남동구판매소 코드2015-06-30사용2015-06-29
99일반용 50L27.000남동구판매소 코드2009-06-09사용안함2009-06-08