Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
Alerts
측정항목 is highly correlated overall with 평균값 and 1 other field | High correlation |
평균값 is highly correlated overall with 측정항목 and 1 other field | High correlation |
측정기 상태 is highly correlated overall with 측정항목 and 1 other field | High correlation |
측정기 상태 is highly imbalanced (55.2%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (99.5%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (99.5%) | Imbalance |
평균값 has 1429 (14.3%) zeros | Zeros |
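The imbalance percentages in these alerts appear consistent with a normalized-entropy score, 1 − H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below is a minimal check under that assumption; `imbalance_score` is an illustrative helper, not a ydata-profiling function, and the counts are the category counts reported for each variable in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform split, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Category counts as reported for each categorical variable.
status_counts = [6531, 3312, 110, 41, 6]   # 측정기 상태
national_flag_counts = [9996, 4]           # 국가 기준초과 구분

status_pct = imbalance_score(status_counts) * 100
national_pct = imbalance_score(national_flag_counts) * 100
```

With these counts the score comes out near 55.2% for 측정기 상태 and near 99.5% for the two exceedance flags, which agrees with the alert values.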
Reproduction
Analysis started | 2024-05-11 07:00:55.001612 |
---|---|
Analysis finished | 2024-05-11 07:00:59.675465 |
Duration | 4.67 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 1847 |
---|---|
Distinct (%) | 18.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9940196 × 10⁹ |
Minimum | 1.9940101 × 10⁹ |
---|---|
Maximum | 1.9940319 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9940101 × 10⁹ |
---|---|
5-th percentile | 1.9940104 × 10⁹ |
Q1 | 1.9940119 × 10⁹ |
median | 1.9940207 × 10⁹ |
Q3 | 1.9940227 × 10⁹ |
95-th percentile | 1.9940315 × 10⁹ |
Maximum | 1.9940319 × 10⁹ |
Range | 21803 |
Interquartile range (IQR) | 10798 |
Descriptive statistics
Standard deviation | 7608.6564 |
---|---|
Coefficient of variation (CV) | 3.815738 × 10⁻⁶ |
Kurtosis | -1.2980413 |
Mean | 1.9940196 × 10⁹ |
Median Absolute Deviation (MAD) | 8618 |
Skewness | 0.28148177 |
Sum | 1.9940196 × 10¹³ |
Variance | 57891653 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1994020120 | 13 | 0.1% |
1994020302 | 13 | 0.1% |
1994021020 | 12 | 0.1% |
1994011202 | 12 | 0.1% |
1994021208 | 12 | 0.1% |
1994012920 | 12 | 0.1% |
1994011204 | 12 | 0.1% |
1994011321 | 12 | 0.1% |
1994010812 | 12 | 0.1% |
1994012202 | 12 | 0.1% |
Other values (1837) | 9878 | 98.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1994010100 | 7 | |
1994010101 | 3 | < 0.1% |
1994010102 | 6 | |
1994010103 | 8 | |
1994010104 | 7 | |
1994010105 | 4 | |
1994010106 | 3 | < 0.1% |
1994010107 | 7 | |
1994010108 | 6 | |
1994010109 | 7 |
Maximum 10 values
Value | Count | Frequency (%) |
1994031903 | 5 | |
1994031902 | 4 | < 0.1% |
1994031901 | 5 | |
1994031900 | 4 | < 0.1% |
1994031823 | 7 | |
1994031822 | 7 | |
1994031821 | 5 | |
1994031820 | 5 | |
1994031819 | 10 | |
1994031817 | 5 |
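The 측정일시 values look like hour-resolution timestamps packed into integers as YYYYMMDDHH (in the rows shown, the last two digits stay between 00 and 23). That would explain why the profiler treats the column as a real number and why figures such as Range 21803 are hard to interpret. A minimal sketch, assuming that format; `parse_measurement_time` is an illustrative name, not part of the dataset or of ydata-profiling.

```python
from datetime import datetime

def parse_measurement_time(stamp: int) -> datetime:
    """Parse an integer such as 1994010100 as year, month, day, hour (assumed layout)."""
    return datetime.strptime(str(stamp), "%Y%m%d%H")

# Minimum and maximum values reported for the column.
first = parse_measurement_time(1994010100)
last = parse_measurement_time(1994031903)

# Elapsed time between the two, in whole hours.
span_hours = int((last - first).total_seconds() // 3600)
```

Under this assumption the sample spans 1851 hours (1 January 1994 00:00 to 19 March 1994 03:00), so the 1847 distinct values would cover almost every hour in that window.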
측정소 코드
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.8259 |
Minimum | 103 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 103 |
---|---|
5-th percentile | 103 |
Q1 | 107 |
median | 113 |
Q3 | 117 |
95-th percentile | 124 |
Maximum | 124 |
Range | 21 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 7.1078859 |
---|---|
Coefficient of variation (CV) | 0.062998707 |
Kurtosis | -1.3319684 |
Mean | 112.8259 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.18077134 |
Sum | 1128259 |
Variance | 50.522041 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
116 | 1153 | |
124 | 1138 | |
122 | 1133 | |
105 | 1130 | |
113 | 1124 | |
108 | 1117 | |
103 | 1105 | |
107 | 1064 | |
117 | 1036 |
Minimum 10 values
Value | Count | Frequency (%) |
103 | 1105 | |
105 | 1130 | |
107 | 1064 | |
108 | 1117 | |
113 | 1124 | |
116 | 1153 | |
117 | 1036 | |
122 | 1133 | |
124 | 1138 |
Maximum 10 values
Value | Count | Frequency (%) |
124 | 1138 | |
122 | 1133 | |
117 | 1036 | |
116 | 1153 | |
113 | 1124 | |
108 | 1117 | |
107 | 1064 | |
105 | 1130 | |
103 | 1105 |
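Because 측정소 코드 has only nine distinct values, the summary statistics above can be cross-checked directly from the value counts. The sketch below assumes the reported variance is the sample variance (divisor n − 1), which is what the numbers suggest; `frequency_stats` is an illustrative helper. Since the column appears to be a station identifier, the mean has no physical meaning and the calculation only serves as a consistency check.

```python
import math

# Value -> count, copied from the table above.
station_counts = {116: 1153, 124: 1138, 122: 1133, 105: 1130, 113: 1124,
                  108: 1117, 103: 1105, 107: 1064, 117: 1036}

def frequency_stats(freq):
    """Return (n, sum, mean, sample variance) computed from a value->count table."""
    n = sum(freq.values())
    total = sum(value * count for value, count in freq.items())
    mean = total / n
    squared_dev = sum(count * (value - mean) ** 2 for value, count in freq.items())
    return n, total, mean, squared_dev / (n - 1)

n, total, mean, variance = frequency_stats(station_counts)
std_dev = math.sqrt(variance)
```

The results match the reported Sum (1128259), Mean (112.8259), Variance (about 50.522) and Standard deviation (about 7.108).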
측정항목
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.2781 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.760312 |
---|---|
Coefficient of variation (CV) | 0.52297455 |
Kurtosis | -1.2280978 |
Mean | 5.2781 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.1793247 |
Sum | 52781 |
Variance | 7.6193223 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1 | 1731 | |
5 | 1695 | |
3 | 1686 | |
8 | 1657 | |
9 | 1625 | |
6 | 1606 |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1731 | |
3 | 1686 | |
5 | 1695 | |
6 | 1606 | |
8 | 1657 | |
9 | 1625 |
Maximum 10 values
Value | Count | Frequency (%) |
9 | 1625 | |
8 | 1657 | |
6 | 1606 | |
5 | 1695 | |
3 | 1686 | |
1 | 1731 |
평균값
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 198 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -1914.8286 |
Minimum | -9999 |
---|---|
Maximum | 11.8 |
Zeros | 1429 |
Zeros (%) | 14.3% |
Negative | 2014 |
Negative (%) | 20.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | 0 |
median | 0.013 |
Q3 | 0.043 |
95-th percentile | 1.7 |
Maximum | 11.8 |
Range | 10010.8 |
Interquartile range (IQR) | 0.043 |
Descriptive statistics
Standard deviation | 3921.8488 |
---|---|
Coefficient of variation (CV) | -2.0481462 |
Kurtosis | 0.48277167 |
Mean | -1914.8286 |
Median Absolute Deviation (MAD) | 0.022 |
Skewness | -1.5740346 |
Sum | -19148286 |
Variance | 15380898 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
-9999.0 | 1903 | 19.0% |
0.0 | 1429 | 14.3% |
0.002 | 205 | 2.1% |
0.001 | 203 | 2.0% |
0.003 | 136 | 1.4% |
0.011 | 126 | 1.3% |
0.019 | 124 | 1.2% |
0.01 | 123 | 1.2% |
0.012 | 121 | 1.2% |
0.7 | 117 | 1.2% |
Other values (188) | 5513 | 55.1% |
Minimum 10 values
Value | Count | Frequency (%) |
-9999.0 | 1903 | 19.0% |
-3276.8 | 10 | 0.1% |
-999.9 | 4 | < 0.1% |
-999.8 | 66 | 0.7% |
-999.7 | 13 | 0.1% |
-999.6 | 7 | 0.1% |
-9.999 | 11 | 0.1% |
0.0 | 1429 | 14.3% |
0.001 | 203 | 2.0% |
0.002 | 205 | 2.1% |
Maximum 10 values
Value | Count | Frequency (%) |
11.8 | 1 | < 0.1% |
9.6 | 1 | < 0.1% |
8.1 | 1 | < 0.1% |
7.9 | 1 | < 0.1% |
7.7 | 1 | < 0.1% |
7.4 | 1 | < 0.1% |
7.0 | 1 | < 0.1% |
6.9 | 1 | < 0.1% |
6.8 | 1 | < 0.1% |
6.6 | 1 | < 0.1% |
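The negative values of 평균값 (-9999.0, -3276.8, -999.x, -9.999) look like placeholder or error codes rather than measurements, and they pull the reported mean down to -1914.8286 even though the median is 0.013. The source does not document these codes, so the sketch below treats "negative means invalid" purely as an assumption. The helper name `drop_negative_codes` is illustrative, and the ten values are the 평균값 entries from the first block of sample rows in this report.

```python
from statistics import mean, median

def drop_negative_codes(values):
    """Keep only non-negative readings (assumption: negatives are error codes)."""
    return [v for v in values if v >= 0]

# 평균값 column of the ten rows in the first sample block of this report.
sample = [0.9, 0.002, 0.0, 0.021, 0.039, 0.046, 0.017, 2.5, 1.0, -9999.0]

clean = drop_negative_codes(sample)
raw_mean = mean(sample)        # dominated by the single -9999.0 entry
clean_mean = mean(clean)       # mean of the remaining nine readings
clean_median = median(clean)
```

Whether zeros recorded with a non-zero 측정기 상태 should also be excluded is a separate question that the report alone cannot answer.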
측정기 상태
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 6531 |
---|---|
4 | 3312 |
2 | 110 |
1 | 41 |
9 | 6 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 4 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 6531 | 65.3% |
4 | 3312 | 33.1% |
2 | 110 | 1.1% |
1 | 41 | 0.4% |
9 | 6 | 0.1% |
국가 기준초과 구분
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9996 |
---|---|
1 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9996 | > 99.9% |
1 | 4 | < 0.1% |
지자체 기준초과 구분
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9996 |
---|---|
1 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9996 | > 99.9% |
1 | 4 | < 0.1% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.021 | 0.073 | 0.082 | 0.000 | 0.000 |
측정소 코드 | 0.000 | 1.000 | 0.018 | 0.285 | 0.101 | 0.000 | 0.021 |
측정항목 | 0.021 | 0.018 | 1.000 | 0.199 | 0.644 | 0.000 | 0.034 |
평균값 | 0.073 | 0.285 | 0.199 | 1.000 | 0.616 | 0.000 | 0.000 |
측정기 상태 | 0.082 | 0.101 | 0.644 | 0.616 | 1.000 | 0.190 | 0.126 |
국가 기준초과 구분 | 0.000 | 0.000 | 0.000 | 0.000 | 0.190 | 1.000 | 0.555 |
지자체 기준초과 구분 | 0.000 | 0.021 | 0.034 | 0.000 | 0.126 | 0.555 | 1.000 |
Variable | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|
측정기 상태 | 1.000 | 0.233 | 0.154 |
국가 기준초과 구분 | 0.233 | 1.000 | 0.375 |
지자체 기준초과 구분 | 0.154 | 0.375 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.010 | 0.024 | -0.015 | 0.031 | 0.000 | 0.000 |
측정소 코드 | 0.010 | 1.000 | -0.009 | 0.031 | 0.068 | 0.000 | 0.015 |
측정항목 | 0.024 | -0.009 | 1.000 | -0.724 | 0.504 | 0.000 | 0.024 |
평균값 | -0.015 | 0.031 | -0.724 | 1.000 | 0.614 | 0.000 | 0.000 |
측정기 상태 | 0.031 | 0.068 | 0.504 | 0.614 | 1.000 | 0.233 | 0.154 |
국가 기준초과 구분 | 0.000 | 0.000 | 0.000 | 0.000 | 0.233 | 1.000 | 0.375 |
지자체 기준초과 구분 | 0.000 | 0.015 | 0.024 | 0.000 | 0.154 | 0.375 | 1.000 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
15218 | 1994011217 | 122 | 5 | 0.9 | 0 | 0 | 0 |
43215 | 1994020308 | 107 | 6 | 0.002 | 0 | 0 | 0 |
91660 | 1994031217 | 108 | 8 | 0.0 | 4 | 0 | 0 |
89385 | 1994031023 | 107 | 6 | 0.021 | 0 | 0 | 0 |
11971 | 1994011005 | 117 | 3 | 0.039 | 0 | 0 | 0 |
17215 | 1994011406 | 122 | 3 | 0.046 | 0 | 0 | 0 |
86863 | 1994030900 | 116 | 3 | 0.017 | 0 | 0 | 0 |
63818 | 1994021905 | 122 | 5 | 2.5 | 0 | 0 | 0 |
23060 | 1994011819 | 103 | 5 | 1.0 | 0 | 0 | 0 |
29075 | 1994012310 | 108 | 9 | -9999.0 | 4 | 0 | 0 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
91548 | 1994031215 | 108 | 1 | 0.007 | 0 | 0 | 0 |
21097 | 1994011706 | 117 | 3 | 0.032 | 0 | 0 | 0 |
11235 | 1994010916 | 103 | 6 | 0.011 | 0 | 0 | 0 |
10958 | 1994010910 | 124 | 5 | 2.0 | 0 | 0 | 0 |
84987 | 1994030713 | 122 | 6 | 0.035 | 0 | 0 | 0 |
3154 | 1994010310 | 108 | 8 | 0.0 | 4 | 0 | 0 |
57248 | 1994021404 | 105 | 5 | 1.8 | 0 | 0 | 0 |
86065 | 1994030809 | 122 | 3 | 0.039 | 0 | 0 | 0 |
14739 | 1994011208 | 124 | 6 | 0.001 | 0 | 0 | 0 |
67300 | 1994022122 | 107 | 8 | 0.0 | 4 | 0 | 0 |
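The column names are kept in Korean throughout this report because they are the identifiers used in the source file. For readers who prefer English names, the sketch below shows one possible mapping. The English names are informal translations chosen here, not official field names from the data provider, and `rename_columns` is an illustrative helper.

```python
# Informal English renderings of the seven column names (assumed, not official).
COLUMN_NAMES_EN = {
    "측정일시": "measurement_datetime",
    "측정소 코드": "station_code",
    "측정항목": "measurement_item",
    "평균값": "average_value",
    "측정기 상태": "instrument_status",
    "국가 기준초과 구분": "national_standard_exceeded",
    "지자체 기준초과 구분": "local_standard_exceeded",
}

def rename_columns(row: dict) -> dict:
    """Return a copy of one record with known Korean keys replaced by English ones."""
    return {COLUMN_NAMES_EN.get(key, key): value for key, value in row.items()}

# Last sample row shown above (index 67300).
row = {"측정일시": 1994022122, "측정소 코드": 107, "측정항목": 8, "평균값": 0.0,
       "측정기 상태": "4", "국가 기준초과 구분": "0", "지자체 기준초과 구분": "0"}
row_en = rename_columns(row)
```

Keys that are not in the mapping pass through unchanged, so the helper is safe to apply to records with extra fields.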