Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
평균값 is highly overall correlated with 측정기 상태 | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (92.6%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (86.6%) | Imbalance |
측정기 상태 has 8350 (83.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:00:23.513485 |
---|---|
Analysis finished | 2024-05-04 04:00:31.714154 |
Duration | 8.2 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 573 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0040113 × 109 |
Minimum | 2.0040101 × 109 |
---|---|
Maximum | 2.0040124 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0040101 × 109 |
---|---|
5-th percentile | 2.0040102 × 109 |
Q1 | 2.0040107 × 109 |
median | 2.0040113 × 109 |
Q3 | 2.0040118 × 109 |
95-th percentile | 2.0040123 × 109 |
Maximum | 2.0040124 × 109 |
Range | 2320 |
Interquartile range (IQR) | 1121 |
Descriptive statistics
Standard deviation | 690.59834 |
---|---|
Coefficient of variation (CV) | 3.4460802 × 10-7 |
Kurtosis | -1.2009555 |
Mean | 2.0040113 × 109 |
Median Absolute Deviation (MAD) | 596 |
Skewness | -0.0064701802 |
Sum | 2.0040113 × 1013 |
Variance | 476926.07 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2004010509 | 29 | 0.3% |
2004011500 | 29 | 0.3% |
2004010709 | 28 | 0.3% |
2004010219 | 27 | 0.3% |
2004012021 | 27 | 0.3% |
2004011716 | 26 | 0.3% |
2004011411 | 26 | 0.3% |
2004010912 | 26 | 0.3% |
2004011606 | 25 | 0.2% |
2004011722 | 25 | 0.2% |
Other values (563) | 9732 |
Value | Count | Frequency (%) |
2004010100 | 15 | |
2004010101 | 18 | |
2004010102 | 19 | |
2004010103 | 23 | |
2004010104 | 21 | |
2004010105 | 21 | |
2004010106 | 12 | |
2004010107 | 24 | |
2004010108 | 23 | |
2004010109 | 18 |
Value | Count | Frequency (%) |
2004012420 | 2 | < 0.1% |
2004012419 | 19 | |
2004012418 | 17 | |
2004012417 | 14 | |
2004012416 | 17 | |
2004012415 | 12 | |
2004012414 | 23 | |
2004012413 | 22 | |
2004012412 | 15 | |
2004012411 | 24 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9588 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2330446 |
---|---|
Coefficient of variation (CV) | 0.064032591 |
Kurtosis | -1.2073095 |
Mean | 112.9588 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0056738554 |
Sum | 1129588 |
Variance | 52.316934 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
102 | 442 | 4.4% |
115 | 423 | 4.2% |
103 | 420 | 4.2% |
117 | 420 | 4.2% |
105 | 420 | 4.2% |
111 | 416 | 4.2% |
122 | 413 | 4.1% |
119 | 412 | 4.1% |
113 | 411 | 4.1% |
112 | 410 | 4.1% |
Other values (15) | 5813 |
Value | Count | Frequency (%) |
101 | 403 | |
102 | 442 | |
103 | 420 | |
104 | 389 | |
105 | 420 | |
106 | 383 | |
107 | 348 | |
108 | 386 | |
109 | 389 | |
110 | 408 |
Value | Count | Frequency (%) |
125 | 402 | |
124 | 372 | |
123 | 400 | |
122 | 413 | |
121 | 398 | |
120 | 402 | |
119 | 412 | |
118 | 399 | |
117 | 420 | |
116 | 355 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.338 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7490565 |
---|---|
Coefficient of variation (CV) | 0.51499747 |
Kurtosis | -1.2117151 |
Mean | 5.338 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.20342397 |
Sum | 53380 |
Variance | 7.5573117 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 1698 | |
9 | 1679 | |
6 | 1678 | |
8 | 1662 | |
1 | 1651 | |
5 | 1632 |
Value | Count | Frequency (%) |
1 | 1651 | |
3 | 1698 | |
5 | 1632 | |
6 | 1678 | |
8 | 1662 | |
9 | 1679 |
Value | Count | Frequency (%) |
9 | 1679 | |
8 | 1662 | |
6 | 1678 | |
5 | 1632 | |
3 | 1698 | |
1 | 1651 |
평균값
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 340 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -1046.6485 |
Minimum | -9999 |
---|---|
Maximum | 10505 |
Zeros | 73 |
Zeros (%) | 0.7% |
Negative | 1417 |
Negative (%) | 14.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | 0.004 |
median | 0.03 |
Q3 | 1.4 |
95-th percentile | 84 |
Maximum | 10505 |
Range | 20504 |
Interquartile range (IQR) | 1.396 |
Descriptive statistics
Standard deviation | 3073.389 |
---|---|
Coefficient of variation (CV) | -2.9364098 |
Kurtosis | 4.6106383 |
Mean | -1046.6485 |
Median Absolute Deviation (MAD) | 0.27 |
Skewness | -2.5587277 |
Sum | -10466485 |
Variance | 9445719.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 1052 | 10.5% |
0.004 | 305 | 3.0% |
0.002 | 299 | 3.0% |
0.001 | 292 | 2.9% |
0.005 | 282 | 2.8% |
0.003 | 271 | 2.7% |
-9.999 | 269 | 2.7% |
0.006 | 195 | 1.9% |
0.007 | 190 | 1.9% |
0.008 | 185 | 1.8% |
Other values (330) | 6660 |
Value | Count | Frequency (%) |
-9999.0 | 1052 | |
-999.9 | 96 | 1.0% |
-9.999 | 269 | 2.7% |
0.0 | 73 | 0.7% |
0.001 | 292 | 2.9% |
0.002 | 299 | 3.0% |
0.003 | 271 | 2.7% |
0.004 | 305 | 3.0% |
0.005 | 282 | 2.8% |
0.006 | 195 | 1.9% |
Value | Count | Frequency (%) |
10505.0 | 1 | |
1819.0 | 1 | |
916.0 | 2 | |
605.0 | 1 | |
527.0 | 1 | |
407.0 | 1 | |
288.0 | 1 | |
276.0 | 1 | |
242.0 | 1 | |
240.0 | 1 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.6737 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 8350 |
Zeros (%) | 83.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.6224955 |
---|---|
Coefficient of variation (CV) | 2.4083353 |
Kurtosis | 5.6179906 |
Mean | 0.6737 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4360923 |
Sum | 6737 |
Variance | 2.6324916 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8350 | |
4 | 1303 | 13.0% |
2 | 176 | 1.8% |
8 | 126 | 1.3% |
1 | 30 | 0.3% |
9 | 15 | 0.1% |
Value | Count | Frequency (%) |
0 | 8350 | |
1 | 30 | 0.3% |
2 | 176 | 1.8% |
4 | 1303 | 13.0% |
8 | 126 | 1.3% |
9 | 15 | 0.1% |
Value | Count | Frequency (%) |
9 | 15 | 0.1% |
8 | 126 | 1.3% |
4 | 1303 | 13.0% |
2 | 176 | 1.8% |
1 | 30 | 0.3% |
0 | 8350 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 90 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9910 | |
1 | 90 | 0.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9910 | |
1 | 90 | 0.9% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 187 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9813 | |
1 | 187 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9813 | |
1 | 187 | 1.9% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.057 | 0.158 | 0.170 | 0.266 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.064 | 0.263 | 0.031 | 0.047 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.152 | 0.616 | 0.260 | 0.417 |
평균값 | 0.057 | 0.064 | 0.152 | 1.000 | 0.349 | 0.019 | 0.010 |
측정기 상태 | 0.158 | 0.263 | 0.616 | 0.349 | 1.000 | 0.115 | 0.075 |
국가 기준초과 구분 | 0.170 | 0.031 | 0.260 | 0.019 | 0.115 | 1.000 | 0.850 |
지자체 기준초과 구분 | 0.266 | 0.047 | 0.417 | 0.010 | 0.075 | 0.850 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.647 |
국가 기준초과 구분 | 0.647 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.002 | -0.004 | -0.055 | 0.035 | 0.129 | 0.203 |
측정소 코드 | -0.002 | 1.000 | 0.005 | 0.088 | -0.088 | 0.024 | 0.036 |
측정항목 | -0.004 | 0.005 | 1.000 | 0.198 | 0.334 | 0.187 | 0.301 |
평균값 | -0.055 | 0.088 | 0.198 | 1.000 | -0.560 | 0.044 | 0.049 |
측정기 상태 | 0.035 | -0.088 | 0.334 | -0.560 | 1.000 | 0.083 | 0.054 |
국가 기준초과 구분 | 0.129 | 0.024 | 0.187 | 0.044 | 0.083 | 1.000 | 0.647 |
지자체 기준초과 구분 | 0.203 | 0.036 | 0.301 | 0.049 | 0.054 | 0.647 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
21596 | 2004010623 | 125 | 5 | 1.9 | 0 | 0 | 0 |
78318 | 2004012218 | 104 | 1 | 0.003 | 0 | 0 | 0 |
38935 | 2004011119 | 115 | 3 | 0.05 | 0 | 0 | 0 |
73963 | 2004012113 | 103 | 3 | 0.01 | 0 | 0 | 0 |
50500 | 2004011500 | 117 | 8 | 65.0 | 0 | 0 | 0 |
71923 | 2004012023 | 113 | 3 | 0.014 | 0 | 0 | 0 |
15395 | 2004010506 | 116 | 9 | -9999.0 | 4 | 0 | 0 |
83657 | 2004012405 | 118 | 9 | -9999.0 | 4 | 0 | 0 |
7190 | 2004010223 | 124 | 5 | 0.4 | 0 | 0 | 0 |
34304 | 2004011012 | 118 | 5 | 0.3 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
71579 | 2004012021 | 105 | 9 | -9999.0 | 4 | 0 | 0 |
80414 | 2004012308 | 103 | 5 | 0.7 | 0 | 0 | 0 |
14093 | 2004010421 | 124 | 9 | 5.0 | 2 | 0 | 0 |
38767 | 2004011118 | 112 | 3 | 0.044 | 0 | 0 | 0 |
44951 | 2004011311 | 117 | 9 | -9999.0 | 4 | 0 | 0 |
32417 | 2004011000 | 103 | 9 | -9999.0 | 4 | 0 | 0 |
61415 | 2004011801 | 111 | 9 | -9999.0 | 4 | 0 | 0 |
74862 | 2004012119 | 103 | 1 | 0.004 | 0 | 0 | 0 |
41281 | 2004011211 | 106 | 3 | -9.999 | 4 | 0 | 0 |
82938 | 2004012400 | 124 | 1 | 0.021 | 2 | 0 | 0 |