Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 has constant value "" | Constant |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
국가 기준초과 구분 is highly imbalanced (99.3%) | Imbalance |
측정기 상태 has 9758 (97.6%) zeros | Zeros |
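The 99.3% imbalance figure in the alert above is consistent with an entropy-based score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. The sketch below assumes that definition (the report itself does not state the formula) and uses the class counts 9994 and 6 listed for 국가 기준초과 구분. The function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    H is the base-2 Shannon entropy of the class shares and k is the number
    of classes. A score near 1 means one class dominates; 0 means the
    classes are perfectly balanced.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        # Convention chosen for this sketch: a single class has no imbalance score.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts taken from the "Common Values" table of 국가 기준초과 구분.
score = imbalance_score([9994, 6])
print(f"{score:.1%}")
```

Under this assumed definition the score rounds to the 99.3% shown in the alert, which suggests the flag is almost never set in this sample.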
Reproduction
Analysis started | 2024-04-27 12:04:29.757506 |
---|---|
Analysis finished | 2024-04-27 12:04:38.981524 |
Duration | 9.22 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 445 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.012011 × 10⁹ |
Minimum | 2.0120101 × 10⁹ |
---|---|
Maximum | 2.0120119 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0120101 × 10⁹ |
---|---|
5-th percentile | 2.0120101 × 10⁹ |
Q1 | 2.0120105 × 10⁹ |
median | 2.012011 × 10⁹ |
Q3 | 2.0120114 × 10⁹ |
95-th percentile | 2.0120118 × 10⁹ |
Maximum | 2.0120119 × 10⁹ |
Range | 1812 |
Interquartile range (IQR) | 906 |
Descriptive statistics
Standard deviation | 534.50972 |
---|---|
Coefficient of variation (CV) | 2.6565944 × 10⁻⁷ |
Kurtosis | -1.1923553 |
Mean | 2.012011 × 10⁹ |
Median Absolute Deviation (MAD) | 489.5 |
Skewness | 0.0037260393 |
Sum | 2.012011 × 10¹³ |
Variance | 285700.64 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2012011200 | 39 | 0.4% |
2012010117 | 34 | 0.3% |
2012011018 | 33 | 0.3% |
2012010712 | 33 | 0.3% |
2012010901 | 32 | 0.3% |
2012010209 | 32 | 0.3% |
2012011019 | 32 | 0.3% |
2012010500 | 32 | 0.3% |
2012011405 | 32 | 0.3% |
2012011100 | 31 | 0.3% |
Other values (435) | 9670 |
Minimum values
Value | Count | Frequency (%) |
2012010100 | 16 | |
2012010101 | 21 | |
2012010102 | 24 | |
2012010103 | 27 | |
2012010104 | 25 | |
2012010105 | 18 | |
2012010106 | 24 | |
2012010107 | 25 | |
2012010108 | 23 | |
2012010109 | 14 |
Maximum values
Value | Count | Frequency (%) |
2012011912 | 16 | |
2012011911 | 23 | |
2012011910 | 25 | |
2012011909 | 17 | |
2012011908 | 18 | |
2012011907 | 31 | |
2012011906 | 28 | |
2012011905 | 17 | |
2012011904 | 28 | |
2012011903 | 30 |
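The 측정일시 values above look like hourly timestamps packed into integers as YYYYMMDDHH, so the numeric statistics (for example the range of 1812) describe the integer encoding rather than elapsed time. The following is a minimal sketch, assuming that encoding, that parses the reported minimum and maximum and counts the hours they span. The helper name `parse_hour_stamp` is hypothetical.

```python
from datetime import datetime


def parse_hour_stamp(value):
    """Parse an integer such as 2012010117 as year, month, day, hour (assumed layout)."""
    return datetime.strptime(str(value), "%Y%m%d%H")


# Minimum and maximum taken from the tables above.
first = parse_hour_stamp(2012010100)
last = parse_hour_stamp(2012011912)

# Integer difference, as reported in the "Range" row.
integer_range = 2012011912 - 2012010100

# Number of hourly slots between the two timestamps, inclusive.
hours_covered = int((last - first).total_seconds() // 3600) + 1

print(first, last, integer_range, hours_covered)
```

If the assumption holds, the inclusive hour count equals the 445 distinct values in the report, meaning every hour between the first and last timestamp appears at least once in the sample.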
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9952 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.191102 |
---|---|
Coefficient of variation (CV) | 0.063640774 |
Kurtosis | -1.1847427 |
Mean | 112.9952 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.00086664139 |
Sum | 1129952 |
Variance | 51.711948 |
Monotonicity | Not monotonic |
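Several of the rows above are derived from others, and recomputing them is a quick consistency check on the report. The sketch below uses only the values printed in the 측정소 코드 tables and the observation count of 10000, and assumes the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × n).

```python
import math

# Values copied from the 측정소 코드 tables in this report.
n = 10000
mean, std = 112.9952, 7.191102
minimum, maximum = 101, 125
q1, q3 = 107, 119

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle half of the data
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
total = mean * n                 # sum recovered from the mean

print(value_range, iqr, cv, variance, total)
```

The recomputed figures agree with the reported Range, IQR, CV, Variance, and Sum rows up to rounding.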
Common values
Value | Count | Frequency (%) |
110 | 435 | 4.3% |
111 | 426 | 4.3% |
125 | 423 | 4.2% |
120 | 421 | 4.2% |
114 | 419 | 4.2% |
102 | 418 | 4.2% |
122 | 416 | 4.2% |
101 | 414 | 4.1% |
106 | 409 | 4.1% |
116 | 409 | 4.1% |
Other values (15) | 5810 |
Minimum values
Value | Count | Frequency (%) |
101 | 414 | |
102 | 418 | |
103 | 376 | |
104 | 361 | |
105 | 390 | |
106 | 409 | |
107 | 390 | |
108 | 401 | |
109 | 404 | |
110 | 435 |
Maximum values
Value | Count | Frequency (%) |
125 | 423 | |
124 | 371 | |
123 | 385 | |
122 | 416 | |
121 | 372 | |
120 | 421 | |
119 | 398 | |
118 | 395 | |
117 | 383 | |
116 | 409 |
측정항목 (measurement item)
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3208 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7336852 |
---|---|
Coefficient of variation (CV) | 0.51377334 |
Kurtosis | -1.1992067 |
Mean | 5.3208 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.18749361 |
Sum | 53208 |
Variance | 7.4730347 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
3 | 1732 | |
5 | 1686 | |
6 | 1684 | |
9 | 1669 | |
1 | 1625 | |
8 | 1604 |
Minimum values
Value | Count | Frequency (%) |
1 | 1625 | |
3 | 1732 | |
5 | 1686 | |
6 | 1684 | |
8 | 1604 | |
9 | 1669 |
Maximum values
Value | Count | Frequency (%) |
9 | 1669 | |
8 | 1604 | |
6 | 1684 | |
5 | 1686 | |
3 | 1732 | |
1 | 1625 |
평균값 (average value)
Real number (ℝ)
HIGH CORRELATION
Distinct | 283 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -22.577191 |
Minimum | -9999 |
---|---|
Maximum | 173 |
Zeros | 47 |
Zeros (%) | 0.5% |
Negative | 128 |
Negative (%) | 1.3% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.002 |
Q1 | 0.011 |
median | 0.071 |
Q3 | 35 |
95-th percentile | 88 |
Maximum | 173 |
Range | 10172 |
Interquartile range (IQR) | 34.989 |
Descriptive statistics
Standard deviation | 627.15421 |
---|---|
Coefficient of variation (CV) | -27.778222 |
Kurtosis | 246.91656 |
Mean | -22.577191 |
Median Absolute Deviation (MAD) | 0.229 |
Skewness | -15.711937 |
Sum | -225771.91 |
Variance | 393322.41 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0.007 | 303 | 3.0% |
0.008 | 291 | 2.9% |
0.006 | 267 | 2.7% |
0.009 | 245 | 2.5% |
0.002 | 244 | 2.4% |
0.6 | 210 | 2.1% |
0.01 | 209 | 2.1% |
0.8 | 199 | 2.0% |
0.7 | 195 | 1.9% |
0.011 | 182 | 1.8% |
Other values (273) | 7655 |
Minimum values
Value | Count | Frequency (%) |
-9999.0 | 39 | 0.4% |
-999.9 | 25 | 0.2% |
-9.999 | 64 | 0.6% |
0.0 | 47 | 0.5% |
0.001 | 159 | |
0.002 | 244 | |
0.003 | 174 | |
0.004 | 136 | |
0.005 | 174 | |
0.006 | 267 |
Maximum values
Value | Count | Frequency (%) |
173.0 | 1 | < 0.1% |
172.0 | 1 | < 0.1% |
168.0 | 1 | < 0.1% |
165.0 | 1 | < 0.1% |
163.0 | 1 | < 0.1% |
162.0 | 1 | < 0.1% |
160.0 | 1 | < 0.1% |
159.0 | 1 | < 0.1% |
158.0 | 3 | |
157.0 | 2 |
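The three smallest 평균값 values (-9999, -999.9, -9.999) occur 39 + 25 + 64 = 128 times, which is exactly the reported count of negative values. That pattern suggests they are placeholder codes for invalid readings rather than measurements, although the report does not say so. The sketch below treats them as sentinels under that assumption and shows how much they distort the mean, using only the aggregates printed above. The names `SENTINEL_COUNTS` and `drop_sentinels` are hypothetical.

```python
# Assumed placeholder codes and their counts, from the smallest-values table above.
SENTINEL_COUNTS = {-9999.0: 39, -999.9: 25, -9.999: 64}


def drop_sentinels(values):
    """Return the values with the assumed placeholder codes removed."""
    return [v for v in values if v not in SENTINEL_COUNTS]


# Aggregates reported for 평균값 in this report.
n = 10000
reported_sum = -225771.91

# Remove the sentinels' contribution from the reported sum and count.
n_flagged = sum(SENTINEL_COUNTS.values())
flagged_sum = sum(value * count for value, count in SENTINEL_COUNTS.items())
valid_mean = (reported_sum - flagged_sum) / (n - n_flagged)

print(n_flagged, round(valid_mean, 2))
print(drop_sentinels([-9999.0, 0.007, -9.999, 35.0]))
```

With the assumed sentinels excluded, the mean moves from the reported -22.58 to roughly 19.2, so the reported mean, standard deviation, skewness, and kurtosis for this variable are best read as artifacts of the placeholder codes.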
측정기 상태 (instrument status)
Real number (ℝ)
ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.1067 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9758 |
Zeros (%) | 97.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.82157325 |
---|---|
Coefficient of variation (CV) | 7.699843 |
Kurtosis | 83.727357 |
Mean | 0.1067 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.9285928 |
Sum | 1067 |
Variance | 0.67498261 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 9758 | |
4 | 75 | 0.8% |
1 | 62 | 0.6% |
8 | 44 | 0.4% |
9 | 33 | 0.3% |
2 | 28 | 0.3% |
Minimum values
Value | Count | Frequency (%) |
0 | 9758 | |
1 | 62 | 0.6% |
2 | 28 | 0.3% |
4 | 75 | 0.8% |
8 | 44 | 0.4% |
9 | 33 | 0.3% |
Maximum values
Value | Count | Frequency (%) |
9 | 33 | 0.3% |
8 | 44 | 0.4% |
4 | 75 | 0.8% |
2 | 28 | 0.3% |
1 | 62 | 0.6% |
0 | 9758 |
국가 기준초과 구분 (national standard exceedance flag)
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9994 |
---|---|
1 | 6 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9994 | |
1 | 6 | 0.1% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 10000 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 |
---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.128 | 0.202 | 0.031 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.115 | 0.261 | 0.030 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.151 | 0.075 | 0.068 |
평균값 | 0.128 | 0.115 | 0.151 | 1.000 | 0.720 | 0.000 |
측정기 상태 | 0.202 | 0.261 | 0.075 | 0.720 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.031 | 0.030 | 0.068 | 0.000 | 0.000 | 1.000 |
 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 |
---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.001 | 0.013 | -0.036 | 0.121 | 0.024 |
측정소 코드 | 0.001 | 1.000 | -0.008 | -0.022 | 0.086 | 0.023 |
측정항목 | 0.013 | -0.008 | 1.000 | 0.643 | -0.004 | 0.049 |
평균값 | -0.036 | -0.022 | 0.643 | 1.000 | -0.162 | 0.000 |
측정기 상태 | 0.121 | 0.086 | -0.004 | -0.162 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.024 | 0.023 | 0.049 | 0.000 | 0.000 | 1.000 |
 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
55605 | 2012011610 | 118 | 6 | 0.006 | 0 | 0 | 0 |
20790 | 2012010618 | 116 | 1 | 0.021 | 9 | 0 | 0 |
41773 | 2012011214 | 113 | 3 | 0.038 | 0 | 0 | 0 |
26676 | 2012010809 | 122 | 1 | 0.007 | 0 | 0 | 0 |
54144 | 2012011600 | 125 | 1 | 0.006 | 0 | 0 | 0 |
18926 | 2012010606 | 105 | 5 | 1.1 | 0 | 0 | 0 |
39707 | 2012011200 | 118 | 9 | 35.0 | 0 | 0 | 0 |
55324 | 2012011608 | 121 | 8 | 55.0 | 0 | 0 | 0 |
12823 | 2012010413 | 113 | 3 | 0.013 | 0 | 0 | 0 |
43654 | 2012011303 | 101 | 8 | 41.0 | 0 | 0 | 0 |
 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
38157 | 2012011114 | 110 | 6 | 0.022 | 0 | 0 | 0 |
17238 | 2012010518 | 124 | 1 | 0.006 | 0 | 0 | 0 |
4150 | 2012010203 | 117 | 8 | 82.0 | 0 | 0 | 0 |
18456 | 2012010603 | 102 | 1 | 0.008 | 0 | 0 | 0 |
24417 | 2012010718 | 120 | 6 | 0.003 | 0 | 0 | 0 |
28489 | 2012010821 | 124 | 3 | 0.039 | 0 | 0 | 0 |
21735 | 2012010700 | 123 | 6 | 0.002 | 0 | 0 | 0 |
43584 | 2012011302 | 115 | 1 | 0.011 | 0 | 0 | 0 |
60674 | 2012011720 | 113 | 5 | 0.7 | 0 | 0 | 0 |
46886 | 2012011400 | 115 | 5 | 0.5 | 0 | 0 | 0 |