Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 24 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.0 KiB |
Average record size in memory | 85.5 B |
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Categorical | 3 |
Dataset
Description | 서울특별시 강서구 행정동별 연간 쓰레기 배출량 데이터입니다. 2019년도와 2020년도 데이터이고, 일반주택/공동주택/소형음식점/사업장으로 구분되어 있습니다. |
---|---|
Author | 서울특별시 강서구 |
URL | https://www.data.go.kr/data/15091531/fileData.do |
소형음식점 2019 has constant value "" | Constant |
사업장 2019 is highly overall correlated with 공동주택 2020 and 1 other fields | High correlation |
사업장 2020 is highly overall correlated with 공동주택 2020 and 1 other fields | High correlation |
일반주택 2019 is highly overall correlated with 일반주택 2020 | High correlation |
일반주택 2020 is highly overall correlated with 일반주택 2019 | High correlation |
공동주택 2019 is highly overall correlated with 공동주택 2020 | High correlation |
공동주택 2020 is highly overall correlated with 공동주택 2019 and 2 other fields | High correlation |
사업장 2019 is highly imbalanced (68.6%) | Imbalance |
사업장 2020 is highly imbalanced (68.6%) | Imbalance |
행정동 has unique values | Unique |
일반주택 2019 has unique values | Unique |
일반주택 2020 has unique values | Unique |
일반주택 2020 has 1 (4.2%) zeros | Zeros |
공동주택 2019 has 14 (58.3%) zeros | Zeros |
공동주택 2020 has 13 (54.2%) zeros | Zeros |
소형음식점 2020 has 19 (79.2%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 12:21:32.816831 |
---|---|
Analysis finished | 2023-12-12 12:21:36.427520 |
Duration | 3.61 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
행정동
Text
UNIQUE
 
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 324.0 B |
Value | Count | Frequency (%) |
가양1동 | 1 | 4.2% |
가양2동 | 1 | 4.2% |
화곡본동 | 1 | 4.2% |
화곡8동 | 1 | 4.2% |
화곡6동 | 1 | 4.2% |
화곡4동 | 1 | 4.2% |
화곡3동 | 1 | 4.2% |
화곡2동 | 1 | 4.2% |
화곡1동 | 1 | 4.2% |
우장산동 | 1 | 4.2% |
Other values (14) | 14 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 21 | |
화 | 10 | 10.6% |
곡 | 9 | 9.6% |
1 | 5 | 5.3% |
2 | 4 | 4.3% |
3 | 4 | 4.3% |
가 | 3 | 3.2% |
촌 | 3 | 3.2% |
방 | 3 | 3.2% |
양 | 3 | 3.2% |
Other values (23) | 29 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 78 | |
Decimal Number | 16 | 17.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 21 | |
화 | 10 | |
곡 | 9 | |
가 | 3 | 3.8% |
촌 | 3 | 3.8% |
방 | 3 | 3.8% |
양 | 3 | 3.8% |
등 | 3 | 3.8% |
마 | 2 | 2.6% |
산 | 2 | 2.6% |
Other values (17) | 19 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5 | |
2 | 4 | |
3 | 4 | |
8 | 1 | 6.2% |
6 | 1 | 6.2% |
4 | 1 | 6.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 78 | |
Common | 16 | 17.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 21 | |
화 | 10 | |
곡 | 9 | |
가 | 3 | 3.8% |
촌 | 3 | 3.8% |
방 | 3 | 3.8% |
양 | 3 | 3.8% |
등 | 3 | 3.8% |
마 | 2 | 2.6% |
산 | 2 | 2.6% |
Other values (17) | 19 |
Common
Value | Count | Frequency (%) |
1 | 5 | |
2 | 4 | |
3 | 4 | |
8 | 1 | 6.2% |
6 | 1 | 6.2% |
4 | 1 | 6.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 78 | |
ASCII | 16 | 17.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 21 | |
화 | 10 | |
곡 | 9 | |
가 | 3 | 3.8% |
촌 | 3 | 3.8% |
방 | 3 | 3.8% |
양 | 3 | 3.8% |
등 | 3 | 3.8% |
마 | 2 | 2.6% |
산 | 2 | 2.6% |
Other values (17) | 19 |
ASCII
Value | Count | Frequency (%) |
1 | 5 | |
2 | 4 | |
3 | 4 | |
8 | 1 | 6.2% |
6 | 1 | 6.2% |
4 | 1 | 6.2% |
일반주택 2019
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 319805.42 |
Minimum | 1570 |
---|---|
Maximum | 1088770 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 348.0 B |
Quantile statistics
Minimum | 1570 |
---|---|
5-th percentile | 5817.5 |
Q1 | 26937.5 |
median | 278200 |
Q3 | 511540 |
95-th percentile | 849556 |
Maximum | 1088770 |
Range | 1087200 |
Interquartile range (IQR) | 484602.5 |
Descriptive statistics
Standard deviation | 317222.32 |
---|---|
Coefficient of variation (CV) | 0.9919229 |
Kurtosis | -0.18023215 |
Mean | 319805.42 |
Median Absolute Deviation (MAD) | 253505 |
Skewness | 0.78965223 |
Sum | 7675330 |
Variance | 1.0063 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
399610 | 1 | 4.2% |
218430 | 1 | 4.2% |
337970 | 1 | 4.2% |
861220 | 1 | 4.2% |
195140 | 1 | 4.2% |
459490 | 1 | 4.2% |
378630 | 1 | 4.2% |
404790 | 1 | 4.2% |
579110 | 1 | 4.2% |
1088770 | 1 | 4.2% |
Other values (14) | 14 |
Value | Count | Frequency (%) |
1570 | 1 | |
4820 | 1 | |
11470 | 1 | |
13110 | 1 | |
13980 | 1 | |
20210 | 1 | |
29180 | 1 | |
35760 | 1 | |
44250 | 1 | |
70840 | 1 |
Value | Count | Frequency (%) |
1088770 | 1 | |
861220 | 1 | |
783460 | 1 | |
662920 | 1 | |
579110 | 1 | |
567820 | 1 | |
492780 | 1 | |
459490 | 1 | |
404790 | 1 | |
399610 | 1 |
일반주택 2020
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
  ZEROS
 
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1027805.4 |
Minimum | 0 |
---|---|
Maximum | 5893150 |
Zeros | 1 |
Zeros (%) | 4.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 348.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 9111 |
Q1 | 40135 |
median | 828195 |
Q3 | 1430865 |
95-th percentile | 2742046.5 |
Maximum | 5893150 |
Range | 5893150 |
Interquartile range (IQR) | 1390730 |
Descriptive statistics
Standard deviation | 1334043.4 |
---|---|
Coefficient of variation (CV) | 1.2979532 |
Kurtosis | 7.0138318 |
Mean | 1027805.4 |
Median Absolute Deviation (MAD) | 785800 |
Skewness | 2.3130062 |
Sum | 24667330 |
Variance | 1.7796717 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
925380 | 1 | 4.2% |
444500 | 1 | 4.2% |
2618890 | 1 | 4.2% |
2763780 | 1 | 4.2% |
739440 | 1 | 4.2% |
1676460 | 1 | 4.2% |
1370680 | 1 | 4.2% |
994380 | 1 | 4.2% |
1862250 | 1 | 4.2% |
5893150 | 1 | 4.2% |
Other values (14) | 14 |
Value | Count | Frequency (%) |
0 | 1 | |
6060 | 1 | |
26400 | 1 | |
28630 | 1 | |
31240 | 1 | |
35890 | 1 | |
41550 | 1 | |
43240 | 1 | |
76840 | 1 | |
181270 | 1 |
Value | Count | Frequency (%) |
5893150 | 1 | |
2763780 | 1 | |
2618890 | 1 | |
1862250 | 1 | |
1676460 | 1 | |
1532670 | 1 | |
1396930 | 1 | |
1370680 | 1 | |
1060750 | 1 | |
994380 | 1 |
공동주택 2019
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 11 |
---|---|
Distinct (%) | 45.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 62965 |
Minimum | 0 |
---|---|
Maximum | 297520 |
Zeros | 14 |
Zeros (%) | 58.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 348.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 93200 |
95-th percentile | 247871.5 |
Maximum | 297520 |
Range | 297520 |
Interquartile range (IQR) | 93200 |
Descriptive statistics
Standard deviation | 97640.845 |
---|---|
Coefficient of variation (CV) | 1.5507162 |
Kurtosis | 0.2977745 |
Mean | 62965 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.3314726 |
Sum | 1511160 |
Variance | 9.5337347 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 14 | |
220170 | 1 | 4.2% |
297520 | 1 | 4.2% |
252760 | 1 | 4.2% |
50280 | 1 | 4.2% |
207310 | 1 | 4.2% |
129950 | 1 | 4.2% |
80950 | 1 | 4.2% |
203860 | 1 | 4.2% |
52950 | 1 | 4.2% |
Value | Count | Frequency (%) |
0 | 14 | |
15410 | 1 | 4.2% |
50280 | 1 | 4.2% |
52950 | 1 | 4.2% |
80950 | 1 | 4.2% |
129950 | 1 | 4.2% |
203860 | 1 | 4.2% |
207310 | 1 | 4.2% |
220170 | 1 | 4.2% |
252760 | 1 | 4.2% |
Value | Count | Frequency (%) |
297520 | 1 | |
252760 | 1 | |
220170 | 1 | |
207310 | 1 | |
203860 | 1 | |
129950 | 1 | |
80950 | 1 | |
52950 | 1 | |
50280 | 1 | |
15410 | 1 |
공동주택 2020
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 12 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 101796.67 |
Minimum | 0 |
---|---|
Maximum | 790170 |
Zeros | 13 |
Zeros (%) | 54.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 348.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 77570 |
95-th percentile | 548913 |
Maximum | 790170 |
Range | 790170 |
Interquartile range (IQR) | 77570 |
Descriptive statistics
Standard deviation | 207029.42 |
---|---|
Coefficient of variation (CV) | 2.0337544 |
Kurtosis | 5.3517857 |
Mean | 101796.67 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4055701 |
Sum | 2443120 |
Variance | 4.2861181 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 13 | |
422790 | 1 | 4.2% |
571170 | 1 | 4.2% |
5020 | 1 | 4.2% |
63060 | 1 | 4.2% |
5290 | 1 | 4.2% |
790170 | 1 | 4.2% |
254880 | 1 | 4.2% |
140230 | 1 | 4.2% |
108380 | 1 | 4.2% |
Other values (2) | 2 | 8.3% |
Value | Count | Frequency (%) |
0 | 13 | |
5020 | 1 | 4.2% |
5290 | 1 | 4.2% |
14830 | 1 | 4.2% |
63060 | 1 | 4.2% |
67300 | 1 | 4.2% |
108380 | 1 | 4.2% |
140230 | 1 | 4.2% |
254880 | 1 | 4.2% |
422790 | 1 | 4.2% |
Value | Count | Frequency (%) |
790170 | 1 | |
571170 | 1 | |
422790 | 1 | |
254880 | 1 | |
140230 | 1 | |
108380 | 1 | |
67300 | 1 | |
63060 | 1 | |
14830 | 1 | |
5290 | 1 |
소형음식점 2019
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 324.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 24 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 24 |
소형음식점 2020
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 25.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1656.6667 |
Minimum | 0 |
---|---|
Maximum | 14450 |
Zeros | 19 |
Zeros (%) | 79.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 348.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 9644 |
Maximum | 14450 |
Range | 14450 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 3866.8285 |
---|---|
Coefficient of variation (CV) | 2.3341017 |
Kurtosis | 5.152481 |
Mean | 1656.6667 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4150645 |
Sum | 39760 |
Variance | 14952362 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 19 | |
9800 | 1 | 4.2% |
8760 | 1 | 4.2% |
5020 | 1 | 4.2% |
14450 | 1 | 4.2% |
1730 | 1 | 4.2% |
Value | Count | Frequency (%) |
0 | 19 | |
1730 | 1 | 4.2% |
5020 | 1 | 4.2% |
8760 | 1 | 4.2% |
9800 | 1 | 4.2% |
14450 | 1 | 4.2% |
Value | Count | Frequency (%) |
14450 | 1 | 4.2% |
9800 | 1 | 4.2% |
8760 | 1 | 4.2% |
5020 | 1 | 4.2% |
1730 | 1 | 4.2% |
0 | 19 |
사업장 2019
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 12.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 324.0 B |
0 | |
---|---|
28990 | 1 |
227690 | 1 |
Length
Max length | 6 |
---|---|
Median length | 1 |
Mean length | 1.375 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 8.3% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 28990 |
Common Values
Value | Count | Frequency (%) |
0 | 22 | |
28990 | 1 | 4.2% |
227690 | 1 | 4.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 22 | |
28990 | 1 | 4.2% |
227690 | 1 | 4.2% |
사업장 2020
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 12.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 324.0 B |
0 | |
---|---|
1170 | 1 |
497200 | 1 |
Length
Max length | 6 |
---|---|
Median length | 1 |
Mean length | 1.3333333 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 8.3% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 1170 |
Common Values
Value | Count | Frequency (%) |
0 | 22 | |
1170 | 1 | 4.2% |
497200 | 1 | 4.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 22 | |
1170 | 1 | 4.2% |
497200 | 1 | 4.2% |
행정동 | 일반주택 2019 | 일반주택 2020 | 공동주택 2019 | 공동주택 2020 | 소형음식점 2020 | 사업장 2019 | 사업장 2020 | |
---|---|---|---|---|---|---|---|---|
행정동 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
일반주택 2019 | 1.000 | 1.000 | 0.844 | 0.587 | 0.736 | 0.306 | 0.000 | 0.000 |
일반주택 2020 | 1.000 | 0.844 | 1.000 | 0.000 | 0.000 | 0.484 | 0.000 | 0.000 |
공동주택 2019 | 1.000 | 0.587 | 0.000 | 1.000 | 0.955 | 0.000 | 0.000 | 0.000 |
공동주택 2020 | 1.000 | 0.736 | 0.000 | 0.955 | 1.000 | 0.000 | 0.895 | 0.895 |
소형음식점 2020 | 1.000 | 0.306 | 0.484 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
사업장 2019 | 1.000 | 0.000 | 0.000 | 0.000 | 0.895 | 0.000 | 1.000 | 1.000 |
사업장 2020 | 1.000 | 0.000 | 0.000 | 0.000 | 0.895 | 0.000 | 1.000 | 1.000 |
사업장 2019 | 사업장 2020 | |
---|---|---|
사업장 2019 | 1.000 | 1.000 |
사업장 2020 | 1.000 | 1.000 |
일반주택 2019 | 일반주택 2020 | 공동주택 2019 | 공동주택 2020 | 소형음식점 2020 | 사업장 2019 | 사업장 2020 | |
---|---|---|---|---|---|---|---|
일반주택 2019 | 1.000 | 0.803 | 0.013 | 0.056 | -0.326 | 0.000 | 0.000 |
일반주택 2020 | 0.803 | 1.000 | -0.050 | -0.017 | -0.163 | 0.000 | 0.000 |
공동주택 2019 | 0.013 | -0.050 | 1.000 | 0.835 | -0.177 | 0.000 | 0.000 |
공동주택 2020 | 0.056 | -0.017 | 0.835 | 1.000 | -0.294 | 0.563 | 0.563 |
소형음식점 2020 | -0.326 | -0.163 | -0.177 | -0.294 | 1.000 | 0.000 | 0.000 |
사업장 2019 | 0.000 | 0.000 | 0.000 | 0.563 | 0.000 | 1.000 | 1.000 |
사업장 2020 | 0.000 | 0.000 | 0.000 | 0.563 | 0.000 | 1.000 | 1.000 |
행정동 | 일반주택 2019 | 일반주택 2020 | 공동주택 2019 | 공동주택 2020 | 소형음식점 2019 | 소형음식점 2020 | 사업장 2019 | 사업장 2020 | |
---|---|---|---|---|---|---|---|---|---|
0 | 가양1동 | 399610 | 925380 | 220170 | 422790 | 0 | 0 | 0 | 0 |
1 | 가양2동 | 1570 | 31240 | 297520 | 571170 | 0 | 0 | 0 | 0 |
2 | 가양3동 | 4820 | 181270 | 0 | 5020 | 0 | 0 | 0 | 0 |
3 | 공항동 | 13980 | 35890 | 0 | 0 | 0 | 9800 | 0 | 0 |
4 | 김포공항내 | 35760 | 6060 | 0 | 0 | 0 | 0 | 28990 | 1170 |
5 | 등촌1동 | 20210 | 28630 | 0 | 0 | 0 | 0 | 0 | 0 |
6 | 등촌2동 | 13110 | 43240 | 0 | 0 | 0 | 8760 | 0 | 0 |
7 | 등촌3동 | 11470 | 41550 | 252760 | 63060 | 0 | 5020 | 0 | 0 |
8 | 마곡동 | 70840 | 26400 | 0 | 0 | 0 | 0 | 0 | 0 |
9 | 마곡지구 | 44250 | 0 | 50280 | 5290 | 0 | 0 | 0 | 0 |
행정동 | 일반주택 2019 | 일반주택 2020 | 공동주택 2019 | 공동주택 2020 | 소형음식점 2019 | 소형음식점 2020 | 사업장 2019 | 사업장 2020 | |
---|---|---|---|---|---|---|---|---|---|
14 | 염창동 | 29180 | 76840 | 0 | 0 | 0 | 14450 | 0 | 0 |
15 | 우장산동 | 492780 | 1532670 | 52950 | 0 | 0 | 0 | 0 | 0 |
16 | 화곡1동 | 1088770 | 5893150 | 0 | 0 | 0 | 1730 | 0 | 0 |
17 | 화곡2동 | 579110 | 1862250 | 0 | 0 | 0 | 0 | 0 | 0 |
18 | 화곡3동 | 404790 | 994380 | 0 | 67300 | 0 | 0 | 0 | 0 |
19 | 화곡4동 | 378630 | 1370680 | 0 | 0 | 0 | 0 | 0 | 0 |
20 | 화곡6동 | 459490 | 1676460 | 0 | 0 | 0 | 0 | 0 | 0 |
21 | 화곡8동 | 195140 | 739440 | 0 | 0 | 0 | 0 | 0 | 0 |
22 | 화곡본동 | 861220 | 2763780 | 0 | 0 | 0 | 0 | 0 | 0 |
23 | 업체혼합 | 337970 | 2618890 | 15410 | 14830 | 0 | 0 | 0 | 0 |