Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly imbalanced (88.6%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (97.3%) | Imbalance |
평균값 has 303 (3.0%) zeros | Zeros |
측정기 상태 has 4023 (40.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:04:12.400280 |
---|---|
Analysis finished | 2024-05-04 04:04:22.747800 |
Duration | 10.35 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 2064 |
---|---|
Distinct (%) | 20.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9910215 × 109 |
Minimum | 1.9910101 × 109 |
---|---|
Maximum | 1.9910328 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9910101 × 109 |
---|---|
5-th percentile | 1.9910105 × 109 |
Q1 | 1.9910122 × 109 |
median | 1.9910215 × 109 |
Q3 | 1.9910309 × 109 |
95-th percentile | 1.9910324 × 109 |
Maximum | 1.9910328 × 109 |
Range | 22719 |
Interquartile range (IQR) | 18688 |
Descriptive statistics
Standard deviation | 8442.2561 |
---|---|
Coefficient of variation (CV) | 4.2401632 × 10-6 |
Kurtosis | -1.5757346 |
Mean | 1.9910215 × 109 |
Median Absolute Deviation (MAD) | 9311 |
Skewness | 0.0060656819 |
Sum | 1.9910215 × 1013 |
Variance | 71271687 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1991020516 | 12 | 0.1% |
1991012706 | 12 | 0.1% |
1991030300 | 12 | 0.1% |
1991020200 | 12 | 0.1% |
1991032312 | 12 | 0.1% |
1991011310 | 11 | 0.1% |
1991030105 | 11 | 0.1% |
1991011401 | 11 | 0.1% |
1991030822 | 11 | 0.1% |
1991012014 | 11 | 0.1% |
Other values (2054) | 9885 |
Value | Count | Frequency (%) |
1991010100 | 2 | < 0.1% |
1991010101 | 7 | |
1991010102 | 4 | |
1991010104 | 2 | < 0.1% |
1991010105 | 2 | < 0.1% |
1991010106 | 6 | |
1991010107 | 2 | < 0.1% |
1991010108 | 1 | < 0.1% |
1991010109 | 5 | |
1991010111 | 5 |
Value | Count | Frequency (%) |
1991032819 | 3 | < 0.1% |
1991032818 | 7 | |
1991032817 | 5 | |
1991032816 | 9 | |
1991032815 | 4 | |
1991032814 | 8 | |
1991032813 | 3 | < 0.1% |
1991032812 | 4 | |
1991032811 | 5 | |
1991032810 | 6 |
측정소 코드
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.8054 |
Minimum | 103 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 103 |
---|---|
5-th percentile | 103 |
Q1 | 107 |
median | 113 |
Q3 | 117 |
95-th percentile | 124 |
Maximum | 124 |
Range | 21 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 7.3041814 |
---|---|
Coefficient of variation (CV) | 0.064750281 |
Kurtosis | -1.379135 |
Mean | 112.8054 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.21494886 |
Sum | 1128054 |
Variance | 53.351066 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
103 | 1290 | |
113 | 1266 | |
124 | 1257 | |
117 | 1251 | |
122 | 1238 | |
108 | 1229 | |
107 | 1217 | |
105 | 848 | |
116 | 404 | 4.0% |
Value | Count | Frequency (%) |
103 | 1290 | |
105 | 848 | |
107 | 1217 | |
108 | 1229 | |
113 | 1266 | |
116 | 404 | 4.0% |
117 | 1251 | |
122 | 1238 | |
124 | 1257 |
Value | Count | Frequency (%) |
124 | 1257 | |
122 | 1238 | |
117 | 1251 | |
116 | 404 | 4.0% |
113 | 1266 | |
108 | 1229 | |
107 | 1217 | |
105 | 848 | |
103 | 1290 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3351 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.77171 |
---|---|
Coefficient of variation (CV) | 0.51952353 |
Kurtosis | -1.2233326 |
Mean | 5.3351 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.20668313 |
Sum | 53351 |
Variance | 7.6823762 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1730 | |
1 | 1714 | |
6 | 1663 | |
5 | 1635 | |
3 | 1630 | |
8 | 1628 |
Value | Count | Frequency (%) |
1 | 1714 | |
3 | 1630 | |
5 | 1635 | |
6 | 1663 | |
8 | 1628 | |
9 | 1730 |
Value | Count | Frequency (%) |
9 | 1730 | |
8 | 1628 | |
6 | 1663 | |
5 | 1635 | |
3 | 1630 | |
1 | 1714 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 443 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -3262.5533 |
Minimum | -9999 |
---|---|
Maximum | 276 |
Zeros | 303 |
Zeros (%) | 3.0% |
Negative | 5940 |
Negative (%) | 59.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9999 |
median | -9.999 |
Q3 | 0.03 |
95-th percentile | 2.9 |
Maximum | 276 |
Range | 10275 |
Interquartile range (IQR) | 9999.03 |
Descriptive statistics
Standard deviation | 4622.6302 |
---|---|
Coefficient of variation (CV) | -1.416875 |
Kurtosis | -1.4030349 |
Mean | -3262.5533 |
Median Absolute Deviation (MAD) | 10.285 |
Skewness | -0.76572176 |
Sum | -32625533 |
Variance | 21368710 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 3195 | |
-9.999 | 2074 | |
-999.9 | 671 | 6.7% |
0.0 | 303 | 3.0% |
0.001 | 79 | 0.8% |
0.019 | 58 | 0.6% |
0.012 | 54 | 0.5% |
0.003 | 52 | 0.5% |
0.015 | 52 | 0.5% |
0.02 | 51 | 0.5% |
Other values (433) | 3411 |
Value | Count | Frequency (%) |
-9999.0 | 3195 | |
-999.9 | 671 | 6.7% |
-9.999 | 2074 | |
0.0 | 303 | 3.0% |
0.001 | 79 | 0.8% |
0.002 | 41 | 0.4% |
0.003 | 52 | 0.5% |
0.004 | 39 | 0.4% |
0.005 | 34 | 0.3% |
0.006 | 38 | 0.4% |
Value | Count | Frequency (%) |
276.0 | 1 | |
252.0 | 1 | |
196.0 | 1 | |
180.0 | 1 | |
179.0 | 1 | |
178.0 | 1 | |
160.0 | 1 | |
156.0 | 2 | |
147.0 | 2 | |
146.0 | 1 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.3334 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 4023 |
Zeros (%) | 40.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.1824575 |
---|---|
Coefficient of variation (CV) | 0.93531221 |
Kurtosis | 0.1261697 |
Mean | 2.3334 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.51699821 |
Sum | 23334 |
Variance | 4.7631208 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 4715 | |
0 | 4023 | |
2 | 960 | 9.6% |
9 | 278 | 2.8% |
1 | 20 | 0.2% |
8 | 4 | < 0.1% |
Value | Count | Frequency (%) |
0 | 4023 | |
1 | 20 | 0.2% |
2 | 960 | 9.6% |
4 | 4715 | |
8 | 4 | < 0.1% |
9 | 278 | 2.8% |
Value | Count | Frequency (%) |
9 | 278 | 2.8% |
8 | 4 | < 0.1% |
4 | 4715 | |
2 | 960 | 9.6% |
1 | 20 | 0.2% |
0 | 4023 |
국가 기준초과 구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 152 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9848 | |
1 | 152 | 1.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9848 | |
1 | 152 | 1.5% |
지자체 기준초과 구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 27 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9973 | |
1 | 27 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9973 | |
1 | 27 | 0.3% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.210 | 0.000 | 0.092 | 0.296 | 0.108 | 0.077 |
측정소 코드 | 0.210 | 1.000 | 0.013 | 0.321 | 0.542 | 0.144 | 0.122 |
측정항목 | 0.000 | 0.013 | 1.000 | 0.484 | 0.698 | 0.352 | 0.161 |
평균값 | 0.092 | 0.321 | 0.484 | 1.000 | 0.574 | 0.073 | 0.019 |
측정기 상태 | 0.296 | 0.542 | 0.698 | 0.574 | 1.000 | 0.208 | 0.083 |
국가 기준초과 구분 | 0.108 | 0.144 | 0.352 | 0.073 | 0.208 | 1.000 | 0.198 |
지자체 기준초과 구분 | 0.077 | 0.122 | 0.161 | 0.019 | 0.083 | 0.198 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.127 |
국가 기준초과 구분 | 0.127 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.012 | 0.001 | -0.023 | 0.016 | 0.077 | 0.056 |
측정소 코드 | 0.012 | 1.000 | 0.004 | 0.146 | -0.150 | 0.104 | 0.087 |
측정항목 | 0.001 | 0.004 | 1.000 | -0.684 | 0.490 | 0.253 | 0.116 |
평균값 | -0.023 | 0.146 | -0.684 | 1.000 | -0.856 | 0.098 | 0.039 |
측정기 상태 | 0.016 | -0.150 | 0.490 | -0.856 | 1.000 | 0.150 | 0.059 |
국가 기준초과 구분 | 0.077 | 0.104 | 0.253 | 0.098 | 0.150 | 1.000 | 0.127 |
지자체 기준초과 구분 | 0.056 | 0.087 | 0.116 | 0.039 | 0.059 | 0.127 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
23450 | 1991012108 | 113 | 5 | 2.7 | 0 | 0 | 0 |
47472 | 1991021216 | 103 | 1 | 0.029 | 0 | 0 | 0 |
48773 | 1991021322 | 124 | 9 | -9999.0 | 4 | 0 | 0 |
59334 | 1991022410 | 113 | 1 | 0.018 | 0 | 0 | 0 |
86284 | 1991031805 | 122 | 8 | -9999.0 | 4 | 0 | 0 |
36731 | 1991020200 | 107 | 9 | -9999.0 | 4 | 0 | 0 |
32808 | 1991012911 | 113 | 1 | 0.082 | 0 | 0 | 0 |
3338 | 1991010321 | 113 | 5 | 1.6 | 0 | 0 | 0 |
18448 | 1991011700 | 107 | 8 | -9999.0 | 4 | 0 | 0 |
95501 | 1991032508 | 113 | 9 | -9999.0 | 4 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
37102 | 1991020209 | 103 | 8 | -9999.0 | 4 | 0 | 0 |
79920 | 1991031308 | 103 | 1 | 0.09 | 0 | 0 | 0 |
67645 | 1991030320 | 117 | 3 | 0.015 | 0 | 0 | 0 |
21637 | 1991011918 | 122 | 3 | -9.999 | 2 | 0 | 0 |
79642 | 1991031302 | 122 | 8 | -9999.0 | 4 | 0 | 0 |
47449 | 1991021215 | 113 | 3 | 0.012 | 0 | 0 | 0 |
30277 | 1991012706 | 122 | 3 | -9.999 | 2 | 0 | 0 |
36218 | 1991020112 | 103 | 5 | 1.6 | 0 | 0 | 0 |
28224 | 1991012512 | 103 | 1 | -9.999 | 4 | 0 | 0 |
98900 | 1991032723 | 113 | 5 | 2.4 | 0 | 0 | 0 |