Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 7221 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 20 |
Duplicate rows (%) | 0.3% |
Total size in memory | 296.3 KiB |
Average record size in memory | 42.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | 인천광역시 서구 쓰레기종량제봉투 포장단위에 대한 데이터로 봉투명, 구분, 만료기간, 수량 등의 정보가 포함되어 있습니다. |
---|---|
Author | 인천광역시 서구 |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15090818&srcSe=7661IVAWM27C61E190 |
데이터기준일자 has constant value "" | Constant |
Dataset has 20 (0.3%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2024-03-18 02:00:51.302675 |
---|---|
Analysis finished | 2024-03-18 02:00:51.674800 |
Duration | 0.37 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
봉투명
Text
Distinct | 64 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 56.5 KiB |
Value | Count | Frequency (%) |
일반용 | 1398 | 9.8% |
스티커 | 1368 | 9.5% |
재활용 | 1059 | 7.4% |
필증 | 969 | 6.8% |
음식물 | 885 | 6.2% |
불연성 | 516 | 3.6% |
10l | 498 | 3.5% |
사업계용 | 498 | 3.5% |
20l | 492 | 3.4% |
재사용 | 414 | 2.9% |
Other values (48) | 6231 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8598 | 12.8% |
7107 | 10.6% | |
L | 5991 | 8.9% |
용 | 3807 | 5.7% |
) | 3480 | 5.2% |
( | 3480 | 5.2% |
1 | 2415 | 3.6% |
5 | 2082 | 3.1% |
라 | 1878 | 2.8% |
2 | 1692 | 2.5% |
Other values (41) | 26601 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30621 | |
Decimal Number | 16452 | |
Space Separator | 7107 | 10.6% |
Uppercase Letter | 5991 | 8.9% |
Close Punctuation | 3480 | 5.2% |
Open Punctuation | 3480 | 5.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
용 | 3807 | 12.4% |
라 | 1878 | 6.1% |
청 | 1605 | 5.2% |
재 | 1473 | 4.8% |
일 | 1398 | 4.6% |
반 | 1398 | 4.6% |
커 | 1368 | 4.5% |
티 | 1368 | 4.5% |
스 | 1368 | 4.5% |
원 | 1200 | 3.9% |
Other values (29) | 13758 |
Decimal Number
Value | Count | Frequency (%) |
0 | 8598 | |
1 | 2415 | 14.7% |
5 | 2082 | 12.7% |
2 | 1692 | 10.3% |
3 | 1293 | 7.9% |
6 | 312 | 1.9% |
8 | 30 | 0.2% |
9 | 30 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
7107 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 5991 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3480 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3480 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30621 | |
Common | 30519 | |
Latin | 5991 | 8.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
용 | 3807 | 12.4% |
라 | 1878 | 6.1% |
청 | 1605 | 5.2% |
재 | 1473 | 4.8% |
일 | 1398 | 4.6% |
반 | 1398 | 4.6% |
커 | 1368 | 4.5% |
티 | 1368 | 4.5% |
스 | 1368 | 4.5% |
원 | 1200 | 3.9% |
Other values (29) | 13758 |
Common
Value | Count | Frequency (%) |
0 | 8598 | |
7107 | ||
) | 3480 | |
( | 3480 | |
1 | 2415 | 7.9% |
5 | 2082 | 6.8% |
2 | 1692 | 5.5% |
3 | 1293 | 4.2% |
6 | 312 | 1.0% |
8 | 30 | 0.1% |
Latin
Value | Count | Frequency (%) |
L | 5991 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 36510 | |
Hangul | 30621 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8598 | |
7107 | ||
L | 5991 | |
) | 3480 | |
( | 3480 | |
1 | 2415 | 6.6% |
5 | 2082 | 5.7% |
2 | 1692 | 4.6% |
3 | 1293 | 3.5% |
6 | 312 | 0.9% |
Other values (2) | 60 | 0.2% |
Hangul
Value | Count | Frequency (%) |
용 | 3807 | 12.4% |
라 | 1878 | 6.1% |
청 | 1605 | 5.2% |
재 | 1473 | 4.8% |
일 | 1398 | 4.6% |
반 | 1398 | 4.6% |
커 | 1368 | 4.5% |
티 | 1368 | 4.5% |
스 | 1368 | 4.5% |
원 | 1200 | 3.9% |
Other values (29) | 13758 |
구분
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 56.5 KiB |
1 | |
---|---|
2 | |
3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 3 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 2407 | |
2 | 2407 | |
3 | 2407 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 2407 | |
2 | 2407 | |
3 | 2407 |
만료기간
Text
Distinct | 66 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 56.5 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 72210 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2002-01-02 |
---|---|
2nd row | 2002-01-02 |
3rd row | 2002-01-02 |
4th row | 2003-03-02 |
5th row | 2003-03-02 |
Value | Count | Frequency (%) |
9999-99-99 | 195 | 2.7% |
2022-10-24 | 189 | 2.6% |
2022-10-18 | 189 | 2.6% |
2023-09-14 | 189 | 2.6% |
2022-12-11 | 189 | 2.6% |
2022-10-25 | 189 | 2.6% |
2022-10-11 | 186 | 2.6% |
2022-04-13 | 186 | 2.6% |
2022-04-12 | 186 | 2.6% |
2021-12-23 | 186 | 2.6% |
Other values (56) | 5337 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 16410 | |
2 | 15726 | |
- | 14442 | |
1 | 11883 | |
9 | 4524 | 6.3% |
3 | 2091 | 2.9% |
4 | 1821 | 2.5% |
7 | 1485 | 2.1% |
8 | 1299 | 1.8% |
5 | 1290 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 57768 | |
Dash Punctuation | 14442 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 16410 | |
2 | 15726 | |
1 | 11883 | |
9 | 4524 | 7.8% |
3 | 2091 | 3.6% |
4 | 1821 | 3.2% |
7 | 1485 | 2.6% |
8 | 1299 | 2.2% |
5 | 1290 | 2.2% |
6 | 1239 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14442 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 72210 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 16410 | |
2 | 15726 | |
- | 14442 | |
1 | 11883 | |
9 | 4524 | 6.3% |
3 | 2091 | 2.9% |
4 | 1821 | 2.5% |
7 | 1485 | 2.1% |
8 | 1299 | 1.8% |
5 | 1290 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 72210 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 16410 | |
2 | 15726 | |
- | 14442 | |
1 | 11883 | |
9 | 4524 | 6.3% |
3 | 2091 | 2.9% |
4 | 1821 | 2.5% |
7 | 1485 | 2.1% |
8 | 1299 | 1.8% |
5 | 1290 | 1.8% |
수량
Real number (ℝ)
Distinct | 13 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 35.558371 |
Minimum | 1 |
---|---|
Maximum | 1000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 63.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 10 |
Q3 | 50 |
95-th percentile | 200 |
Maximum | 1000 |
Range | 999 |
Interquartile range (IQR) | 49 |
Descriptive statistics
Standard deviation | 68.650515 |
---|---|
Coefficient of variation (CV) | 1.9306428 |
Kurtosis | 85.290681 |
Mean | 35.558371 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 6.9367767 |
Sum | 256767 |
Variance | 4712.8932 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 2421 | |
10 | 1867 | |
100 | 983 | |
20 | 732 | 10.1% |
50 | 511 | 7.1% |
200 | 387 | 5.4% |
5 | 160 | 2.2% |
15 | 40 | 0.6% |
25 | 40 | 0.6% |
30 | 39 | 0.5% |
Other values (3) | 41 | 0.6% |
Value | Count | Frequency (%) |
1 | 2421 | |
4 | 24 | 0.3% |
5 | 160 | 2.2% |
10 | 1867 | |
15 | 40 | 0.6% |
20 | 732 | 10.1% |
25 | 40 | 0.6% |
30 | 39 | 0.5% |
50 | 511 | 7.1% |
100 | 983 |
Value | Count | Frequency (%) |
1000 | 16 | 0.2% |
200 | 387 | 5.4% |
120 | 1 | < 0.1% |
100 | 983 | |
50 | 511 | 7.1% |
30 | 39 | 0.5% |
25 | 40 | 0.6% |
20 | 732 | 10.1% |
15 | 40 | 0.6% |
10 | 1867 |
데이터기준일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 56.5 KiB |
2023-12-06 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-12-06 |
---|---|
2nd row | 2023-12-06 |
3rd row | 2023-12-06 |
4th row | 2023-12-06 |
5th row | 2023-12-06 |
Common Values
Value | Count | Frequency (%) |
2023-12-06 | 7221 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2023-12-06 | 7221 |
봉투명 | 구분 | 만료기간 | 수량 | |
---|---|---|---|---|
봉투명 | 1.000 | 0.000 | 0.000 | 0.644 |
구분 | 0.000 | 1.000 | 0.000 | 0.483 |
만료기간 | 0.000 | 0.000 | 1.000 | 0.142 |
수량 | 0.644 | 0.483 | 0.142 | 1.000 |
수량 | 구분 | |
---|---|---|
수량 | 1.000 | 0.192 |
구분 | 0.192 | 1.000 |
봉투명 | 구분 | 만료기간 | 수량 | 데이터기준일자 | |
---|---|---|---|---|---|
0 | 불연성 10L | 1 | 2002-01-02 | 20 | 2023-12-06 |
1 | 불연성 10L | 2 | 2002-01-02 | 100 | 2023-12-06 |
2 | 불연성 10L | 3 | 2002-01-02 | 1 | 2023-12-06 |
3 | 불연성 10L | 1 | 2003-03-02 | 20 | 2023-12-06 |
4 | 불연성 10L | 2 | 2003-03-02 | 100 | 2023-12-06 |
5 | 불연성 10L | 3 | 2003-03-02 | 1 | 2023-12-06 |
6 | 불연성 10L | 1 | 2003-04-07 | 20 | 2023-12-06 |
7 | 불연성 10L | 2 | 2003-04-07 | 100 | 2023-12-06 |
8 | 불연성 10L | 3 | 2003-04-07 | 1 | 2023-12-06 |
9 | 불연성 20L | 1 | 2003-03-02 | 10 | 2023-12-06 |
봉투명 | 구분 | 만료기간 | 수량 | 데이터기준일자 | |
---|---|---|---|---|---|
7211 | 재사용 10L(청라) | 3 | 9999-99-99 | 1 | 2023-12-06 |
7212 | 재사용 20L(청라) | 1 | 9999-99-99 | 10 | 2023-12-06 |
7213 | 재사용 20L(청라) | 2 | 9999-99-99 | 100 | 2023-12-06 |
7214 | 재사용 20L(청라) | 3 | 9999-99-99 | 1 | 2023-12-06 |
7215 | 재사용 5L | 1 | 9999-99-99 | 20 | 2023-12-06 |
7216 | 재사용 5L | 2 | 9999-99-99 | 100 | 2023-12-06 |
7217 | 재사용 5L | 3 | 9999-99-99 | 1 | 2023-12-06 |
7218 | 재사용 5L(청라) | 1 | 9999-99-99 | 20 | 2023-12-06 |
7219 | 재사용 5L(청라) | 2 | 9999-99-99 | 100 | 2023-12-06 |
7220 | 재사용 5L(청라) | 3 | 9999-99-99 | 1 | 2023-12-06 |
Most frequently occurring
봉투명 | 구분 | 만료기간 | 수량 | 데이터기준일자 | # duplicates | |
---|---|---|---|---|---|---|
0 | 재활용 30L(주황) | 2 | 2021-12-23 | 20 | 2023-12-06 | 2 |
1 | 재활용 30L(주황) | 2 | 2022-04-12 | 20 | 2023-12-06 | 2 |
2 | 재활용 30L(주황) | 2 | 2022-04-13 | 20 | 2023-12-06 | 2 |
3 | 재활용 30L(주황) | 2 | 2022-10-11 | 20 | 2023-12-06 | 2 |
4 | 재활용 30L(주황) | 2 | 2022-10-18 | 20 | 2023-12-06 | 2 |
5 | 재활용 30L(주황) | 2 | 2022-10-24 | 20 | 2023-12-06 | 2 |
6 | 재활용 30L(주황) | 2 | 2022-10-25 | 20 | 2023-12-06 | 2 |
7 | 재활용 30L(주황) | 2 | 2022-12-11 | 20 | 2023-12-06 | 2 |
8 | 재활용 30L(주황) | 2 | 2023-09-14 | 20 | 2023-12-06 | 2 |
9 | 재활용 30L(주황) | 2 | 9999-99-99 | 20 | 2023-12-06 | 2 |