Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
Alerts
평균값 (average value) is highly overall correlated with 측정기 상태 (instrument status) | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 (national standard exceedance flag) is highly overall correlated with 지자체 기준초과 구분 (local government standard exceedance flag) | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (89.4%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (84.0%) | Imbalance |
평균값 has 232 (2.3%) zeros | Zeros |
측정기 상태 has 7272 (72.7%) zeros | Zeros |
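The two imbalance percentages can be reproduced from the class counts reported further down (9860 vs. 140 and 9767 vs. 233) if the score is taken to be one minus the normalized Shannon entropy of the class shares. That formula is an assumption checked only against the numbers in this report, not a quotation of the profiling library's source; the function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

national = imbalance_score([9860, 140])  # 국가 기준초과 구분: counts of 0 and 1
local = imbalance_score([9767, 233])     # 지자체 기준초과 구분: counts of 0 and 1
print(f"{national:.1%}  {local:.1%}")    # 89.4%  84.0%
```

A score of 0 corresponds to perfectly balanced classes and a score near 1 to a column dominated by one value, which is why both exceedance flags are reported as highly imbalanced.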
Reproduction
Analysis started | 2024-05-04 04:00:54.450908 |
---|---|
Analysis finished | 2024-05-04 04:01:03.427774 |
Duration | 8.98 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 667 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0010114 × 10⁹ |
Minimum | 2.0010101 × 10⁹ |
---|---|
Maximum | 2.0010128 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0010101 × 10⁹ |
---|---|
5-th percentile | 2.0010102 × 10⁹ |
Q1 | 2.0010107 × 10⁹ |
median | 2.0010114 × 10⁹ |
Q3 | 2.0010121 × 10⁹ |
95-th percentile | 2.0010127 × 10⁹ |
Maximum | 2.0010128 × 10⁹ |
Range | 2718 |
Interquartile range (IQR) | 1397 |
Descriptive statistics
Standard deviation | 800.93825 |
---|---|
Coefficient of variation (CV) | 4.002667 × 10⁻⁷ |
Kurtosis | -1.2004567 |
Mean | 2.0010114 × 10⁹ |
Median Absolute Deviation (MAD) | 699 |
Skewness | -0.00114414 |
Sum | 2.0010114 × 10¹³ |
Variance | 641502.08 |
Monotonicity | Not monotonic |
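Several of the figures above are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the range is the maximum minus the minimum. The short check below recomputes them from the rounded values printed in this section; the variable names are illustrative.

```python
# Values copied from the 측정일시 tables in this section (rounded as printed).
mean = 2.0010114e9
std = 800.93825
minimum, maximum = 2001010100, 2001012818

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
value_range = maximum - minimum   # range of the encoded integers

print(f"{cv:.4e}  {variance:.2f}  {value_range}")  # 4.0027e-07  641502.08  2718
```

The tiny coefficient of variation reflects that the mean is about 2 × 10⁹ while the spread is only a few hundred, which is what happens when a date-like code is summarized as an ordinary number.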
Value | Count | Frequency (%) |
2001012416 | 28 | 0.3% |
2001012518 | 28 | 0.3% |
2001011706 | 25 | 0.2% |
2001010514 | 25 | 0.2% |
2001012308 | 25 | 0.2% |
2001010806 | 25 | 0.2% |
2001011501 | 24 | 0.2% |
2001012506 | 24 | 0.2% |
2001010623 | 24 | 0.2% |
2001010109 | 24 | 0.2% |
Other values (657) | 9748 |
Value | Count | Frequency (%) |
2001010100 | 16 | |
2001010101 | 16 | |
2001010102 | 15 | |
2001010103 | 15 | |
2001010104 | 17 | |
2001010105 | 13 | |
2001010106 | 19 | |
2001010107 | 8 | 0.1% |
2001010108 | 16 | |
2001010109 | 24 |
Value | Count | Frequency (%) |
2001012818 | 3 | < 0.1% |
2001012817 | 17 | |
2001012816 | 17 | |
2001012815 | 10 | |
2001012814 | 12 | |
2001012813 | 13 | |
2001012812 | 11 | |
2001012811 | 12 | |
2001012810 | 10 | |
2001012809 | 14 |
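The values of this variable look like timestamps packed into integers as YYYYMMDDHH (for example, 2001012818 would be 28 January 2001, 18:00). That layout is an assumption inferred from the listed values, not something the report states. Under that assumption, a minimal decoding sketch is:

```python
from datetime import datetime

def parse_stamp(value):
    """Decode an integer such as 2001010100 as YYYYMMDDHH (assumed layout)."""
    return datetime.strptime(str(value), "%Y%m%d%H")

first = parse_stamp(2001010100)  # reported minimum
last = parse_stamp(2001012818)   # reported maximum

# Number of hourly slots from first to last, both ends included.
hours_spanned = int((last - first).total_seconds() // 3600) + 1
print(first, last, hours_spanned)  # 2001-01-01 00:00:00 2001-01-28 18:00:00 667
```

The 667 hourly slots equal the reported Distinct count of 667, which suggests every hour in the period occurs at least once. It also shows that the reported Range of 2718 is a difference between encoded integers, not an elapsed time.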
측정소 코드 (monitoring station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.936 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.1690616 |
---|---|
Coefficient of variation (CV) | 0.063478975 |
Kurtosis | -1.1934948 |
Mean | 112.936 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.013939591 |
Sum | 1129360 |
Variance | 51.395444 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
106 | 443 | 4.4% |
118 | 436 | 4.4% |
111 | 434 | 4.3% |
102 | 432 | 4.3% |
110 | 412 | 4.1% |
122 | 410 | 4.1% |
124 | 408 | 4.1% |
108 | 407 | 4.1% |
107 | 407 | 4.1% |
115 | 405 | 4.0% |
Other values (15) | 5806 |
Value | Count | Frequency (%) |
101 | 387 | |
102 | 432 | |
103 | 370 | |
104 | 368 | |
105 | 404 | |
106 | 443 | |
107 | 407 | |
108 | 407 | |
109 | 400 | |
110 | 412 |
Value | Count | Frequency (%) |
125 | 371 | |
124 | 408 | |
123 | 384 | |
122 | 410 | |
121 | 376 | |
120 | 394 | |
119 | 391 | |
118 | 436 | |
117 | 397 | |
116 | 377 |
측정항목 (measurement item)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.432 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7467309 |
---|---|
Coefficient of variation (CV) | 0.50565738 |
Kurtosis | -1.1958416 |
Mean | 5.432 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.23951616 |
Sum | 54320 |
Variance | 7.5445305 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1789 | |
8 | 1715 | |
5 | 1694 | |
3 | 1651 | |
6 | 1585 | |
1 | 1566 |
Value | Count | Frequency (%) |
1 | 1566 | |
3 | 1651 | |
5 | 1694 | |
6 | 1585 | |
8 | 1715 | |
9 | 1789 |
Value | Count | Frequency (%) |
9 | 1789 | |
8 | 1715 | |
6 | 1585 | |
5 | 1694 | |
3 | 1651 | |
1 | 1566 |
평균값 (average value)
Real number (ℝ)
High correlation, Zeros
Distinct | 387 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -1828.1401 |
Minimum | -9999 |
---|---|
Maximum | 941 |
Zeros | 232 |
Zeros (%) | 2.3% |
Negative | 2419 |
Negative (%) | 24.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | 0 |
median | 0.015 |
Q3 | 1 |
95-th percentile | 77 |
Maximum | 941 |
Range | 10940 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3861.0667 |
---|---|
Coefficient of variation (CV) | -2.1120191 |
Kurtosis | 0.69966428 |
Mean | -1828.1401 |
Median Absolute Deviation (MAD) | 0.885 |
Skewness | -1.6407694 |
Sum | -18281401 |
Variance | 14907836 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 1823 | 18.2% |
-9.999 | 411 | 4.1% |
0.005 | 236 | 2.4% |
0.004 | 235 | 2.4% |
0.0 | 232 | 2.3% |
0.002 | 230 | 2.3% |
0.006 | 208 | 2.1% |
0.001 | 204 | 2.0% |
0.003 | 192 | 1.9% |
0.007 | 162 | 1.6% |
Other values (377) | 6067 |
Value | Count | Frequency (%) |
-9999.0 | 1823 | |
-3276.8 | 5 | 0.1% |
-999.9 | 149 | 1.5% |
-32.768 | 25 | 0.2% |
-9.999 | 411 | 4.1% |
-8.9 | 1 | < 0.1% |
-4.4 | 1 | < 0.1% |
-1.2 | 4 | < 0.1% |
0.0 | 232 | 2.3% |
0.001 | 204 | 2.0% |
Value | Count | Frequency (%) |
941.0 | 1 | |
903.0 | 1 | |
860.0 | 1 | |
829.0 | 1 | |
819.0 | 1 | |
780.0 | 1 | |
757.0 | 1 | |
701.0 | 1 | |
651.0 | 1 | |
598.0 | 1 |
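The most negative values of 평균값 (-9999, -3276.8, -999.9, -32.768, -9.999) look like placeholder codes rather than measurements. Treating them as such is an assumption; the report itself does not say so. The tally below uses only the counts from the minimum-values table above to show how much of the column they would account for.

```python
# Counts copied from the minimum-values table of 평균값.
# Treating these five values as "no data" codes is an assumption.
suspected_sentinels = {
    -9999.0: 1823,
    -3276.8: 5,
    -999.9: 149,
    -32.768: 25,
    -9.999: 411,
}
other_negatives = {-8.9: 1, -4.4: 1, -1.2: 4}

n_rows = 10000              # Number of observations
n_negative_reported = 2419  # "Negative" count reported for 평균값

n_sentinel = sum(suspected_sentinels.values())
n_other = sum(other_negatives.values())

print(n_sentinel, n_other, n_sentinel + n_other == n_negative_reported)  # 2413 6 True
print(f"{n_sentinel / n_rows:.2%}")  # 24.13%
```

If the assumption holds, about a quarter of the rows carry a placeholder, so the reported mean of -1828.1401 and standard deviation of 3861.0667 describe the placeholders more than the measurements; the median of 0.015 is far less affected.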
측정기 상태 (instrument status)
Real number (ℝ)
High correlation, Zeros
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0447 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 7272 |
Zeros (%) | 72.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.7731374 |
---|---|
Coefficient of variation (CV) | 1.6972695 |
Kurtosis | 0.42588548 |
Mean | 1.0447 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.3030879 |
Sum | 10447 |
Variance | 3.1440163 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7272 | |
4 | 2364 | 23.6% |
2 | 280 | 2.8% |
1 | 38 | 0.4% |
9 | 25 | 0.2% |
8 | 21 | 0.2% |
Value | Count | Frequency (%) |
0 | 7272 | |
1 | 38 | 0.4% |
2 | 280 | 2.8% |
4 | 2364 | 23.6% |
8 | 21 | 0.2% |
9 | 25 | 0.2% |
Value | Count | Frequency (%) |
9 | 25 | 0.2% |
8 | 21 | 0.2% |
4 | 2364 | 23.6% |
2 | 280 | 2.8% |
1 | 38 | 0.4% |
0 | 7272 |
국가 기준초과 구분 (national standard exceedance flag)
Categorical
High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9860 |
---|---|
1 | 140 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9860 | |
1 | 140 | 1.4% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9767 |
---|---|
1 | 233 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9767 | |
1 | 233 | 2.3% |
| | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.015 | 0.189 | 0.210 | 0.257 | 0.300 |
측정소 코드 | 0.000 | 1.000 | 0.018 | 0.173 | 0.320 | 0.070 | 0.077 |
측정항목 | 0.015 | 0.018 | 1.000 | 0.417 | 0.725 | 0.312 | 0.432 |
평균값 | 0.189 | 0.173 | 0.417 | 1.000 | 0.654 | 0.006 | 0.011 |
측정기 상태 | 0.210 | 0.320 | 0.725 | 0.654 | 1.000 | 0.147 | 0.139 |
국가 기준초과 구분 | 0.257 | 0.070 | 0.312 | 0.006 | 0.147 | 1.000 | 0.931 |
지자체 기준초과 구분 | 0.300 | 0.077 | 0.432 | 0.011 | 0.139 | 0.931 | 1.000 |
| | 지자체 기준초과 구분 | 국가 기준초과 구분 |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.763 |
국가 기준초과 구분 | 0.763 | 1.000 |
| | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.012 | -0.022 | 0.114 | -0.126 | 0.197 | 0.230 |
측정소 코드 | -0.012 | 1.000 | -0.020 | -0.016 | -0.009 | 0.054 | 0.059 |
측정항목 | -0.022 | -0.020 | 1.000 | -0.131 | 0.497 | 0.224 | 0.311 |
평균값 | 0.114 | -0.016 | -0.131 | 1.000 | -0.723 | 0.057 | 0.075 |
측정기 상태 | -0.126 | -0.009 | 0.497 | -0.723 | 1.000 | 0.106 | 0.100 |
국가 기준초과 구분 | 0.197 | 0.054 | 0.224 | 0.057 | 0.106 | 1.000 | 0.763 |
지자체 기준초과 구분 | 0.230 | 0.059 | 0.311 | 0.075 | 0.100 | 0.763 | 1.000 |
| | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
19998 | 2001010613 | 109 | 1 | 0.003 | 0 | 0 | 0 |
84984 | 2001012414 | 115 | 1 | 0.006 | 0 | 0 | 0 |
22444 | 2001010705 | 116 | 8 | 21.0 | 0 | 0 | 0 |
21614 | 2001010700 | 103 | 5 | 0.6 | 0 | 0 | 0 |
14675 | 2001010501 | 121 | 9 | -9999.0 | 4 | 0 | 0 |
42307 | 2001011218 | 102 | 3 | 0.028 | 0 | 0 | 0 |
39479 | 2001011123 | 105 | 9 | -9999.0 | 4 | 0 | 0 |
28603 | 2001010822 | 118 | 3 | 0.035 | 0 | 0 | 0 |
26183 | 2001010806 | 114 | 9 | -9999.0 | 4 | 0 | 0 |
79435 | 2001012301 | 115 | 3 | 0.038 | 0 | 0 | 0 |
| | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
67278 | 2001011916 | 114 | 1 | 0.022 | 0 | 0 | 0 |
22884 | 2001010708 | 115 | 1 | 0.001 | 0 | 0 | 0 |
4567 | 2001010206 | 112 | 3 | 0.052 | 0 | 0 | 0 |
6917 | 2001010222 | 103 | 9 | -9999.0 | 4 | 0 | 0 |
91334 | 2001012608 | 123 | 5 | 1.9 | 0 | 0 | 0 |
67726 | 2001011919 | 113 | 8 | 103.0 | 0 | 0 | 0 |
63700 | 2001011816 | 117 | 8 | 94.0 | 0 | 0 | 0 |
50466 | 2001011500 | 112 | 1 | 0.003 | 0 | 0 | 0 |
42207 | 2001011217 | 110 | 6 | 0.011 | 0 | 0 | 0 |
14911 | 2001010503 | 111 | 3 | -9.999 | 4 | 0 | 0 |
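The sample rows give a concrete view of the reported correlation between 평균값 and 측정기 상태. Reading each row as row index, 측정일시, 측정소 코드, 측정항목, 평균값, 측정기 상태, and the two exceedance flags, the sketch below checks whether negative values and non-zero status codes fall on the same rows. It covers only these 20 rows, so it illustrates the pattern without proving it for the full dataset.

```python
# (평균값, 측정기 상태) pairs copied from the 20 sample rows above, in order.
sample = [
    (0.003, 0), (0.006, 0), (21.0, 0), (0.6, 0), (-9999.0, 4),
    (0.028, 0), (-9999.0, 4), (0.035, 0), (-9999.0, 4), (0.038, 0),
    (0.022, 0), (0.001, 0), (0.052, 0), (-9999.0, 4), (1.9, 0),
    (103.0, 0), (94.0, 0), (0.003, 0), (0.011, 0), (-9.999, 4),
]

flagged = [(v, s) for v, s in sample if s != 0]   # rows with a non-zero status
negative = [(v, s) for v, s in sample if v < 0]   # rows with a negative value

print(len(sample), len(flagged), flagged == negative)  # 20 5 True
```

In this sample, every row with status 4 holds -9999.0 or -9.999 and every row with status 0 holds a non-negative value. That fits the negative coefficient of -0.723 between the two columns in the last correlation matrix.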