Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
평균값 is highly overall correlated with 측정기 상태 | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (95.9%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (90.7%) | Imbalance |
평균값 has 310 (3.1%) zeros | Zeros |
측정기 상태 has 6022 (60.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-27 12:08:14.056485 |
---|---|
Analysis finished | 2024-04-27 12:08:22.106176 |
Duration | 8.05 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 570 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9990112 × 109 |
Minimum | 1.9990101 × 109 |
---|---|
Maximum | 1.9990124 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9990101 × 109 |
---|---|
5-th percentile | 1.9990102 × 109 |
Q1 | 1.9990107 × 109 |
median | 1.9990112 × 109 |
Q3 | 1.9990118 × 109 |
95-th percentile | 1.9990123 × 109 |
Maximum | 1.9990124 × 109 |
Range | 2317 |
Interquartile range (IQR) | 1115 |
Descriptive statistics
Standard deviation | 679.9902 |
---|---|
Coefficient of variation (CV) | 3.4016327 × 10-7 |
Kurtosis | -1.1813133 |
Mean | 1.9990112 × 109 |
Median Absolute Deviation (MAD) | 595 |
Skewness | -0.00086168503 |
Sum | 1.9990112 × 1013 |
Variance | 462386.68 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1999011621 | 29 | 0.3% |
1999011401 | 28 | 0.3% |
1999010603 | 28 | 0.3% |
1999010214 | 27 | 0.3% |
1999011315 | 27 | 0.3% |
1999011205 | 27 | 0.3% |
1999011714 | 27 | 0.3% |
1999010202 | 26 | 0.3% |
1999011921 | 26 | 0.3% |
1999011312 | 26 | 0.3% |
Other values (560) | 9729 |
Value | Count | Frequency (%) |
1999010100 | 10 | |
1999010101 | 18 | |
1999010102 | 15 | |
1999010103 | 24 | |
1999010104 | 21 | |
1999010105 | 13 | |
1999010106 | 21 | |
1999010107 | 17 | |
1999010108 | 20 | |
1999010109 | 24 |
Value | Count | Frequency (%) |
1999012417 | 9 | |
1999012416 | 16 | |
1999012415 | 15 | |
1999012414 | 15 | |
1999012413 | 16 | |
1999012412 | 19 | |
1999012411 | 21 | |
1999012410 | 20 | |
1999012409 | 15 | |
1999012408 | 19 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.0235 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2236948 |
---|---|
Coefficient of variation (CV) | 0.063913211 |
Kurtosis | -1.2107436 |
Mean | 113.0235 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0097533089 |
Sum | 1130235 |
Variance | 52.181766 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
122 | 436 | 4.4% |
115 | 428 | 4.3% |
120 | 420 | 4.2% |
117 | 415 | 4.2% |
103 | 415 | 4.2% |
106 | 414 | 4.1% |
102 | 414 | 4.1% |
111 | 411 | 4.1% |
116 | 410 | 4.1% |
107 | 408 | 4.1% |
Other values (15) | 5829 |
Value | Count | Frequency (%) |
101 | 395 | |
102 | 414 | |
103 | 415 | |
104 | 386 | |
105 | 377 | |
106 | 414 | |
107 | 408 | |
108 | 383 | |
109 | 401 | |
110 | 401 |
Value | Count | Frequency (%) |
125 | 398 | |
124 | 403 | |
123 | 384 | |
122 | 436 | |
121 | 378 | |
120 | 420 | |
119 | 406 | |
118 | 388 | |
117 | 415 | |
116 | 410 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3189 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7650619 |
---|---|
Coefficient of variation (CV) | 0.51985597 |
Kurtosis | -1.2372694 |
Mean | 5.3189 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.18916866 |
Sum | 53189 |
Variance | 7.6455673 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 1735 | |
8 | 1691 | |
9 | 1680 | |
1 | 1678 | |
5 | 1638 | |
6 | 1578 |
Value | Count | Frequency (%) |
1 | 1678 | |
3 | 1735 | |
5 | 1638 | |
6 | 1578 | |
8 | 1691 | |
9 | 1680 |
Value | Count | Frequency (%) |
9 | 1680 | |
8 | 1691 | |
6 | 1578 | |
5 | 1638 | |
3 | 1735 | |
1 | 1678 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 342 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -2248.914 |
Minimum | -9999 |
---|---|
Maximum | 1746 |
Zeros | 310 |
Zeros (%) | 3.1% |
Negative | 3695 |
Negative (%) | 37.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -999.9 |
median | 0.006 |
Q3 | 0.054 |
95-th percentile | 61 |
Maximum | 1746 |
Range | 11745 |
Interquartile range (IQR) | 999.954 |
Descriptive statistics
Standard deviation | 4143.5648 |
---|---|
Coefficient of variation (CV) | -1.8424737 |
Kurtosis | -0.21721077 |
Mean | -2248.914 |
Median Absolute Deviation (MAD) | 2.094 |
Skewness | -1.3311283 |
Sum | -22489140 |
Variance | 17169129 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 2219 | |
-9.999 | 1101 | 11.0% |
-999.9 | 372 | 3.7% |
0.0 | 310 | 3.1% |
0.001 | 225 | 2.2% |
0.004 | 172 | 1.7% |
0.002 | 166 | 1.7% |
0.005 | 160 | 1.6% |
0.009 | 158 | 1.6% |
0.008 | 156 | 1.6% |
Other values (332) | 4961 |
Value | Count | Frequency (%) |
-9999.0 | 2219 | |
-999.9 | 372 | 3.7% |
-9.999 | 1101 | |
-0.438 | 1 | < 0.1% |
-0.432 | 1 | < 0.1% |
-0.236 | 1 | < 0.1% |
0.0 | 310 | 3.1% |
0.001 | 225 | 2.2% |
0.002 | 166 | 1.7% |
0.003 | 147 | 1.5% |
Value | Count | Frequency (%) |
1746.0 | 1 | |
1711.0 | 1 | |
1503.0 | 1 | |
1431.0 | 1 | |
1406.0 | 1 | |
1272.0 | 1 | |
1140.0 | 1 | |
872.0 | 1 | |
604.0 | 1 | |
457.0 | 1 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.5774 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 6022 |
Zeros (%) | 60.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 1.9729162 |
---|---|
Coefficient of variation (CV) | 1.2507393 |
Kurtosis | -1.4241996 |
Mean | 1.5774 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.53342752 |
Sum | 15774 |
Variance | 3.8923985 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6022 | |
4 | 3809 | |
2 | 116 | 1.2% |
8 | 27 | 0.3% |
1 | 18 | 0.2% |
9 | 8 | 0.1% |
Value | Count | Frequency (%) |
0 | 6022 | |
1 | 18 | 0.2% |
2 | 116 | 1.2% |
4 | 3809 | |
8 | 27 | 0.3% |
9 | 8 | 0.1% |
Value | Count | Frequency (%) |
9 | 8 | 0.1% |
8 | 27 | 0.3% |
4 | 3809 | |
2 | 116 | 1.2% |
1 | 18 | 0.2% |
0 | 6022 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 44 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9956 | |
1 | 44 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9956 | |
1 | 44 | 0.4% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 118 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9882 | |
1 | 118 | 1.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9882 | |
1 | 118 | 1.2% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.088 | 0.100 | 0.087 | 0.114 |
측정소 코드 | 0.000 | 1.000 | 0.007 | 0.273 | 0.325 | 0.103 | 0.113 |
측정항목 | 0.000 | 0.007 | 1.000 | 0.380 | 0.619 | 0.203 | 0.335 |
평균값 | 0.088 | 0.273 | 0.380 | 1.000 | 0.672 | 0.278 | 0.167 |
측정기 상태 | 0.100 | 0.325 | 0.619 | 0.672 | 1.000 | 0.192 | 0.154 |
국가 기준초과 구분 | 0.087 | 0.103 | 0.203 | 0.278 | 0.192 | 1.000 | 0.810 |
지자체 기준초과 구분 | 0.114 | 0.113 | 0.335 | 0.167 | 0.154 | 0.810 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.601 |
국가 기준초과 구분 | 0.601 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.003 | -0.009 | 0.052 | -0.066 | 0.067 | 0.087 |
측정소 코드 | 0.003 | 1.000 | 0.014 | 0.041 | -0.033 | 0.079 | 0.087 |
측정항목 | -0.009 | 0.014 | 1.000 | -0.351 | 0.447 | 0.146 | 0.241 |
평균값 | 0.052 | 0.041 | -0.351 | 1.000 | -0.836 | 0.452 | 0.281 |
측정기 상태 | -0.066 | -0.033 | 0.447 | -0.836 | 1.000 | 0.138 | 0.111 |
국가 기준초과 구분 | 0.067 | 0.079 | 0.146 | 0.452 | 0.138 | 1.000 | 0.601 |
지자체 기준초과 구분 | 0.087 | 0.087 | 0.241 | 0.281 | 0.111 | 0.601 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
50641 | 1999011501 | 116 | 3 | 0.053 | 0 | 0 | 0 |
40719 | 1999011207 | 112 | 6 | 0.0 | 0 | 0 | 0 |
32675 | 1999011001 | 121 | 9 | -9999.0 | 4 | 0 | 0 |
16908 | 1999010516 | 119 | 1 | -9.999 | 4 | 0 | 0 |
26096 | 1999010805 | 125 | 5 | 0.4 | 0 | 0 | 0 |
32131 | 1999010922 | 106 | 3 | -9.999 | 4 | 0 | 0 |
3106 | 1999010120 | 118 | 8 | 44.0 | 0 | 0 | 0 |
18114 | 1999010600 | 120 | 1 | 0.027 | 0 | 0 | 0 |
35037 | 1999011017 | 115 | 6 | 0.012 | 0 | 0 | 0 |
23517 | 1999010712 | 120 | 6 | 0.018 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
58968 | 1999011709 | 104 | 1 | -9.999 | 4 | 0 | 0 |
24079 | 1999010716 | 114 | 3 | 0.013 | 0 | 0 | 0 |
61564 | 1999011802 | 111 | 8 | -9999.0 | 4 | 0 | 0 |
36023 | 1999011100 | 104 | 9 | -9999.0 | 4 | 0 | 0 |
83545 | 1999012404 | 125 | 3 | 0.021 | 0 | 0 | 0 |
72406 | 1999012102 | 118 | 8 | 46.0 | 0 | 0 | 0 |
19546 | 1999010610 | 108 | 8 | -9999.0 | 4 | 0 | 0 |
75035 | 1999012120 | 106 | 9 | -9999.0 | 4 | 0 | 0 |
38144 | 1999011114 | 108 | 5 | -999.9 | 4 | 0 | 0 |
13103 | 1999010415 | 109 | 9 | -9999.0 | 4 | 0 | 0 |