Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
평균값 is highly overall correlated with 측정기 상태 | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (98.6%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (96.7%) | Imbalance |
평균값 has 353 (3.5%) zeros | Zeros |
측정기 상태 has 6781 (67.8%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:01:55.121855 |
---|---|
Analysis finished | 2024-05-04 04:02:05.607904 |
Duration | 10.49 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 583 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0000113 × 109 |
Minimum | 2.0000101 × 109 |
---|---|
Maximum | 2.0000125 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0000101 × 109 |
---|---|
5-th percentile | 2.0000102 × 109 |
Q1 | 2.0000107 × 109 |
median | 2.0000112 × 109 |
Q3 | 2.0000119 × 109 |
95-th percentile | 2.0000124 × 109 |
Maximum | 2.0000125 × 109 |
Range | 2406 |
Interquartile range (IQR) | 1203 |
Descriptive statistics
Standard deviation | 700.52126 |
---|---|
Coefficient of variation (CV) | 3.5025866 × 10-7 |
Kurtosis | -1.199604 |
Mean | 2.0000113 × 109 |
Median Absolute Deviation (MAD) | 602 |
Skewness | 0.021984439 |
Sum | 2.0000113 × 1013 |
Variance | 490730.03 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000010705 | 30 | 0.3% |
2000010414 | 28 | 0.3% |
2000010706 | 28 | 0.3% |
2000012315 | 27 | 0.3% |
2000011216 | 27 | 0.3% |
2000011313 | 27 | 0.3% |
2000010914 | 26 | 0.3% |
2000010313 | 26 | 0.3% |
2000011618 | 26 | 0.3% |
2000011205 | 25 | 0.2% |
Other values (573) | 9730 |
Value | Count | Frequency (%) |
2000010100 | 18 | |
2000010101 | 14 | |
2000010102 | 21 | |
2000010103 | 20 | |
2000010104 | 15 | |
2000010105 | 17 | |
2000010106 | 18 | |
2000010107 | 14 | |
2000010108 | 19 | |
2000010109 | 17 |
Value | Count | Frequency (%) |
2000012506 | 12 | |
2000012505 | 17 | |
2000012504 | 14 | |
2000012503 | 13 | |
2000012502 | 14 | |
2000012501 | 18 | |
2000012500 | 18 | |
2000012423 | 18 | |
2000012422 | 13 | |
2000012421 | 22 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.7874 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.1973454 |
---|---|
Coefficient of variation (CV) | 0.063813382 |
Kurtosis | -1.2066641 |
Mean | 112.7874 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.041144931 |
Sum | 1127874 |
Variance | 51.801781 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
103 | 463 | 4.6% |
117 | 440 | 4.4% |
107 | 437 | 4.4% |
122 | 420 | 4.2% |
108 | 420 | 4.2% |
102 | 415 | 4.2% |
106 | 414 | 4.1% |
110 | 413 | 4.1% |
112 | 412 | 4.1% |
118 | 411 | 4.1% |
Other values (15) | 5755 |
Value | Count | Frequency (%) |
101 | 385 | |
102 | 415 | |
103 | 463 | |
104 | 402 | |
105 | 401 | |
106 | 414 | |
107 | 437 | |
108 | 420 | |
109 | 384 | |
110 | 413 |
Value | Count | Frequency (%) |
125 | 377 | |
124 | 388 | |
123 | 357 | |
122 | 420 | |
121 | 381 | |
120 | 396 | |
119 | 348 | |
118 | 411 | |
117 | 440 | |
116 | 378 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.319 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7420049 |
---|---|
Coefficient of variation (CV) | 0.51551136 |
Kurtosis | -1.2026796 |
Mean | 5.319 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.19884851 |
Sum | 53190 |
Variance | 7.5185909 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 1745 | |
8 | 1680 | |
1 | 1670 | |
3 | 1657 | |
9 | 1632 | |
6 | 1616 |
Value | Count | Frequency (%) |
1 | 1670 | |
3 | 1657 | |
5 | 1745 | |
6 | 1616 | |
8 | 1680 | |
9 | 1632 |
Value | Count | Frequency (%) |
9 | 1632 | |
8 | 1680 | |
6 | 1616 | |
5 | 1745 | |
3 | 1657 | |
1 | 1670 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 259 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -2036.5449 |
Minimum | -9999 |
---|---|
Maximum | 194 |
Zeros | 353 |
Zeros (%) | 3.5% |
Negative | 2990 |
Negative (%) | 29.9% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9.999 |
median | 0.008 |
Q3 | 0.1 |
95-th percentile | 42 |
Maximum | 194 |
Range | 10193 |
Interquartile range (IQR) | 10.099 |
Descriptive statistics
Standard deviation | 4004.396 |
---|---|
Coefficient of variation (CV) | -1.9662695 |
Kurtosis | 0.20536988 |
Mean | -2036.5449 |
Median Absolute Deviation (MAD) | 0.292 |
Skewness | -1.4824024 |
Sum | -20365449 |
Variance | 16035188 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 2016 | 20.2% |
-9.999 | 725 | 7.2% |
0.0 | 353 | 3.5% |
0.1 | 291 | 2.9% |
0.004 | 263 | 2.6% |
-999.9 | 248 | 2.5% |
0.003 | 231 | 2.3% |
0.002 | 228 | 2.3% |
0.005 | 223 | 2.2% |
0.006 | 216 | 2.2% |
Other values (249) | 5206 |
Value | Count | Frequency (%) |
-9999.0 | 2016 | |
-999.9 | 248 | 2.5% |
-9.999 | 725 | 7.2% |
-0.286 | 1 | < 0.1% |
0.0 | 353 | 3.5% |
0.001 | 144 | 1.4% |
0.002 | 228 | 2.3% |
0.003 | 231 | 2.3% |
0.004 | 263 | 2.6% |
0.005 | 223 | 2.2% |
Value | Count | Frequency (%) |
194.0 | 1 | |
187.0 | 1 | |
176.0 | 1 | |
172.0 | 1 | |
171.0 | 1 | |
169.0 | 1 | |
168.0 | 1 | |
162.0 | 1 | |
155.0 | 1 | |
152.0 | 1 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2708 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 6781 |
Zeros (%) | 67.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 1.8784622 |
---|---|
Coefficient of variation (CV) | 1.478173 |
Kurtosis | -0.88324923 |
Mean | 1.2708 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.89106979 |
Sum | 12708 |
Variance | 3.5286202 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6781 | |
4 | 3046 | |
2 | 114 | 1.1% |
8 | 27 | 0.3% |
1 | 26 | 0.3% |
9 | 6 | 0.1% |
Value | Count | Frequency (%) |
0 | 6781 | |
1 | 26 | 0.3% |
2 | 114 | 1.1% |
4 | 3046 | |
8 | 27 | 0.3% |
9 | 6 | 0.1% |
Value | Count | Frequency (%) |
9 | 6 | 0.1% |
8 | 27 | 0.3% |
4 | 3046 | |
2 | 114 | 1.1% |
1 | 26 | 0.3% |
0 | 6781 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 13 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9987 | |
1 | 13 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9987 | |
1 | 13 | 0.1% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 34 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9966 | |
1 | 34 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9966 | |
1 | 34 | 0.3% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.050 | 0.136 | 0.051 | 0.084 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.424 | 0.339 | 0.051 | 0.080 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.277 | 0.679 | 0.097 | 0.172 |
평균값 | 0.050 | 0.424 | 0.277 | 1.000 | 0.622 | 0.000 | 0.000 |
측정기 상태 | 0.136 | 0.339 | 0.679 | 0.622 | 1.000 | 0.072 | 0.061 |
국가 기준초과 구분 | 0.051 | 0.051 | 0.097 | 0.000 | 0.072 | 1.000 | 0.803 |
지자체 기준초과 구분 | 0.084 | 0.080 | 0.172 | 0.000 | 0.061 | 0.803 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.594 |
국가 기준초과 구분 | 0.594 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.003 | 0.020 | 0.028 | -0.014 | 0.039 | 0.062 |
측정소 코드 | 0.003 | 1.000 | 0.002 | 0.156 | -0.222 | 0.039 | 0.061 |
측정항목 | 0.020 | 0.002 | 1.000 | -0.256 | 0.474 | 0.070 | 0.123 |
평균값 | 0.028 | 0.156 | -0.256 | 1.000 | -0.796 | 0.013 | 0.028 |
측정기 상태 | -0.014 | -0.222 | 0.474 | -0.796 | 1.000 | 0.052 | 0.044 |
국가 기준초과 구분 | 0.039 | 0.039 | 0.070 | 0.013 | 0.052 | 1.000 | 0.594 |
지자체 기준초과 구분 | 0.062 | 0.061 | 0.123 | 0.028 | 0.044 | 0.594 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
19457 | 2000010609 | 118 | 9 | -9999.0 | 4 | 0 | 0 |
23184 | 2000010710 | 115 | 1 | 0.001 | 0 | 0 | 0 |
81744 | 2000012316 | 125 | 1 | 0.001 | 0 | 0 | 0 |
86639 | 2000012501 | 115 | 9 | -9999.0 | 4 | 0 | 0 |
28249 | 2000010820 | 109 | 3 | 0.042 | 0 | 0 | 0 |
33959 | 2000011010 | 110 | 9 | -9999.0 | 4 | 0 | 0 |
23354 | 2000010711 | 118 | 5 | 0.8 | 0 | 0 | 0 |
69260 | 2000012005 | 119 | 5 | 0.0 | 0 | 0 | 0 |
2173 | 2000010114 | 113 | 3 | 0.023 | 0 | 0 | 0 |
52830 | 2000011516 | 106 | 1 | 0.005 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
56526 | 2000011616 | 122 | 1 | 0.006 | 0 | 0 | 0 |
70102 | 2000012011 | 109 | 8 | 3.0 | 0 | 0 | 0 |
55584 | 2000011610 | 115 | 1 | 0.012 | 0 | 0 | 0 |
27023 | 2000010812 | 104 | 9 | -9999.0 | 4 | 0 | 0 |
8189 | 2000010306 | 115 | 9 | -9999.0 | 4 | 0 | 0 |
67865 | 2000011920 | 111 | 9 | -9999.0 | 4 | 0 | 0 |
37618 | 2000011110 | 120 | 8 | 76.0 | 0 | 0 | 0 |
35414 | 2000011020 | 103 | 5 | 1.1 | 0 | 0 | 0 |
85535 | 2000012418 | 106 | 9 | -9999.0 | 4 | 0 | 0 |
43662 | 2000011303 | 103 | 1 | 0.004 | 0 | 0 | 0 |