Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download (파일 다운로드) |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
Alerts
측정항목 (measurement item) is highly overall correlated with 평균값 (average value) and 1 other field | High correlation |
평균값 is highly overall correlated with 측정항목 and 1 other field | High correlation |
측정기 상태 (instrument status) is highly overall correlated with 측정항목 and 1 other field | High correlation |
국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (83.9%) | Imbalance |
지자체 기준초과 구분 (local government standard exceedance flag) is highly imbalanced (96.6%) | Imbalance |
평균값 has 642 (6.4%) zeros | Zeros |
측정기 상태 has 5964 (59.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:04:34.259420 |
---|---|
Analysis finished | 2024-05-04 04:04:45.873535 |
Duration | 11.61 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 2075 |
---|---|
Distinct (%) | 20.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9900211 × 10^9 |
Minimum | 1.9900101 × 10^9 |
---|---|
Maximum | 1.9900328 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9900101 × 10^9 |
---|---|
5-th percentile | 1.9900104 × 10^9 |
Q1 | 1.9900122 × 10^9 |
median | 1.9900213 × 10^9 |
Q3 | 1.9900307 × 10^9 |
95-th percentile | 1.9900324 × 10^9 |
Maximum | 1.9900328 × 10^9 |
Range | 22719 |
Interquartile range (IQR) | 18498.25 |
Descriptive statistics
Standard deviation | 8211.8413 |
---|---|
Coefficient of variation (CV) | 4.1265097 × 10^-6 |
Kurtosis | -1.4881098 |
Mean | 1.9900211 × 10^9 |
Median Absolute Deviation (MAD) | 9215.5 |
Skewness | 0.07781517 |
Sum | 1.9900211 × 10^13 |
Variance | 67434337 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1990012707 | 13 | 0.1% |
1990032611 | 13 | 0.1% |
1990011816 | 13 | 0.1% |
1990011412 | 12 | 0.1% |
1990022621 | 12 | 0.1% |
1990022812 | 12 | 0.1% |
1990020917 | 12 | 0.1% |
1990010711 | 11 | 0.1% |
1990010701 | 11 | 0.1% |
1990010510 | 11 | 0.1% |
Other values (2065) | 9880 | 98.8% |
Value | Count | Frequency (%) |
1990010100 | 5 | |
1990010101 | 7 | |
1990010102 | 7 | |
1990010103 | 9 | |
1990010104 | 5 | |
1990010105 | 4 | |
1990010106 | 4 | |
1990010107 | 3 | < 0.1% |
1990010108 | 5 | |
1990010109 | 3 | < 0.1% |
Value | Count | Frequency (%) |
1990032819 | 3 | < 0.1% |
1990032818 | 9 | |
1990032817 | 5 | |
1990032816 | 2 | < 0.1% |
1990032814 | 2 | < 0.1% |
1990032813 | 2 | < 0.1% |
1990032812 | 4 | |
1990032811 | 4 | |
1990032810 | 7 | |
1990032809 | 5 |
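The values in this column (for example 1990012707) look like timestamps packed into integers as YYYYMMDDHH, which would explain why the profiler types the column as a real number and reports a mean near 1.99 × 10^9. The report does not state the encoding, so the layout is an assumption, and `parse_measurement_time` is a hypothetical helper name. A minimal sketch of decoding such values with the standard library:

```python
from datetime import datetime


def parse_measurement_time(value: int) -> datetime:
    """Decode an integer such as 1990012707, assuming a YYYYMMDDHH layout."""
    return datetime.strptime(str(value), "%Y%m%d%H")


# Smallest and largest values listed in the extreme-value tables above.
first = parse_measurement_time(1990010100)
last = parse_measurement_time(1990032819)

print(first)         # 1990-01-01 00:00:00
print(last)          # 1990-03-28 19:00:00
print(last - first)  # elapsed time covered by the sample
```

Once decoded, the integer statistics above (range 22719, standard deviation 8211.8413) lose their meaning; the time span of the sample is the more useful summary.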
측정소 코드 (measurement station code)
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.4052 |
Minimum | 103 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 103 |
---|---|
5-th percentile | 103 |
Q1 | 105 |
median | 113 |
Q3 | 122 |
95-th percentile | 124 |
Maximum | 124 |
Range | 21 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 7.4239426 |
---|---|
Coefficient of variation (CV) | 0.066046256 |
Kurtosis | -1.3896716 |
Mean | 112.4052 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.32927212 |
Sum | 1124052 |
Variance | 55.114924 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
105 | 1280 | |
122 | 1265 | |
117 | 1262 | |
107 | 1258 | |
124 | 1252 | |
113 | 1239 | |
103 | 1229 | |
108 | 1215 |
Value | Count | Frequency (%) |
103 | 1229 | |
105 | 1280 | |
107 | 1258 | |
108 | 1215 | |
113 | 1239 | |
117 | 1262 | |
122 | 1265 | |
124 | 1252 |
Value | Count | Frequency (%) |
124 | 1252 | |
122 | 1265 | |
117 | 1262 | |
113 | 1239 | |
108 | 1215 | |
107 | 1258 | |
105 | 1280 | |
103 | 1229 |
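Because this column has only eight distinct values, its descriptive statistics can be rebuilt from the frequency table alone. The sketch below recomputes the sum, mean, and variance from the counts listed above. Using the n - 1 denominator for the variance is an assumption about the profiler's convention, although it does reproduce the reported figure.

```python
# Value counts copied from the frequency table above: station code -> count.
counts = {105: 1280, 122: 1265, 117: 1262, 107: 1258,
          124: 1252, 113: 1239, 103: 1229, 108: 1215}

n = sum(counts.values())
total = sum(code * c for code, c in counts.items())
mean = total / n

# Sample variance (n - 1 denominator), assumed to match the report.
variance = sum(c * (code - mean) ** 2 for code, c in counts.items()) / (n - 1)

print(n, total, mean)       # 10000 1124052 112.4052
print(round(variance, 4))   # 55.1149
```

Station codes are identifiers rather than measurements, so these figures mainly confirm that the eight stations are sampled almost evenly.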
측정항목 (measurement item)
Real number (ℝ)
Alerts: High correlation
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3349 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7621559 |
---|---|
Coefficient of variation (CV) | 0.51775213 |
Kurtosis | -1.2251333 |
Mean | 5.3349 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.19929497 |
Sum | 53349 |
Variance | 7.6295049 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1708 | |
3 | 1699 | |
1 | 1672 | |
8 | 1655 | |
6 | 1638 | |
5 | 1628 |
Value | Count | Frequency (%) |
1 | 1672 | |
3 | 1699 | |
5 | 1628 | |
6 | 1638 | |
8 | 1655 | |
9 | 1708 |
Value | Count | Frequency (%) |
9 | 1708 | |
8 | 1655 | |
6 | 1638 | |
5 | 1628 | |
3 | 1699 | |
1 | 1672 |
평균값 (average value)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 530 |
---|---|
Distinct (%) | 5.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -3185.5209 |
Minimum | -9999 |
---|---|
Maximum | 272 |
Zeros | 642 |
Zeros (%) | 6.4% |
Negative | 3903 |
Negative (%) | 39.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9999 |
median | 0.007 |
Q3 | 0.058 |
95-th percentile | 5.2 |
Maximum | 272 |
Range | 10271 |
Interquartile range (IQR) | 9999.058 |
Descriptive statistics
Standard deviation | 4637.104 |
---|---|
Coefficient of variation (CV) | -1.4556816 |
Kurtosis | -1.376976 |
Mean | -3185.5209 |
Median Absolute Deviation (MAD) | 1.893 |
Skewness | -0.78683898 |
Sum | -31855209 |
Variance | 21502734 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 3163 | 31.6% |
0.0 | 642 | 6.4% |
-9.999 | 494 | 4.9% |
-999.9 | 243 | 2.4% |
0.001 | 123 | 1.2% |
0.002 | 89 | 0.9% |
0.018 | 77 | 0.8% |
0.031 | 73 | 0.7% |
0.006 | 72 | 0.7% |
0.022 | 70 | 0.7% |
Other values (520) | 4954 | 49.5% |
Value | Count | Frequency (%) |
-9999.0 | 3163 | 31.6% |
-999.9 | 243 | 2.4% |
-10.03 | 1 | < 0.1% |
-9.999 | 494 | 4.9% |
-0.048 | 1 | < 0.1% |
-0.039 | 1 | < 0.1% |
0.0 | 642 | 6.4% |
0.001 | 123 | 1.2% |
0.002 | 89 | 0.9% |
0.003 | 53 | 0.5% |
Value | Count | Frequency (%) |
272.0 | 1 | < 0.1% |
231.0 | 1 | < 0.1% |
206.0 | 2 | < 0.1% |
192.0 | 1 | < 0.1% |
189.0 | 1 | < 0.1% |
187.0 | 1 | < 0.1% |
184.0 | 1 | < 0.1% |
175.0 | 1 | < 0.1% |
173.0 | 1 | < 0.1% |
169.0 | 1 | < 0.1% |
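The values -9999.0, -999.9, and -9.999 together account for 3900 of the 10000 rows and pull the mean down to -3185.5209, even though the median is 0.007. They look like placeholder codes for invalid readings rather than real measurements; the report does not say so, so treating them as sentinels is an assumption. A minimal sketch of excluding them before summarising, using the 평균값 entries from the first sample table at the end of this report:

```python
from statistics import mean

# Assumed sentinel codes: the three most frequent negative values in the report.
SENTINELS = {-9999.0, -999.9, -9.999}

# 평균값 from the ten rows of the first sample table in this report.
readings = [-9999.0, 0.044, -9999.0, -9999.0, -9999.0,
            0.026, 0.027, 0.034, -9999.0, -9999.0]

# Keep only readings that are not one of the assumed placeholder codes.
valid = [x for x in readings if x not in SENTINELS]

print(mean(readings))  # dominated by the -9999 placeholders
print(len(valid), mean(valid))
```

In those sample rows every -9999.0 reading has 측정기 상태 equal to 4, which is consistent with the strong correlation reported between the two columns.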
측정기 상태 (instrument status)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6812 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 5964 |
Zeros (%) | 59.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.2192023 |
---|---|
Coefficient of variation (CV) | 1.3200109 |
Kurtosis | 0.26354074 |
Mean | 1.6812 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.99967827 |
Sum | 16812 |
Variance | 4.924859 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5964 | 59.6% |
4 | 3377 | 33.8% |
2 | 310 | 3.1% |
8 | 209 | 2.1% |
9 | 109 | 1.1% |
1 | 31 | 0.3% |
Value | Count | Frequency (%) |
0 | 5964 | 59.6% |
1 | 31 | 0.3% |
2 | 310 | 3.1% |
4 | 3377 | 33.8% |
8 | 209 | 2.1% |
9 | 109 | 1.1% |
Value | Count | Frequency (%) |
9 | 109 | 1.1% |
8 | 209 | 2.1% |
4 | 3377 | 33.8% |
2 | 310 | 3.1% |
1 | 31 | 0.3% |
0 | 5964 | 59.6% |
국가 기준초과 구분 (national standard exceedance flag)
Categorical
Alerts: Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9764 |
---|---|
1 | 236 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9764 | 97.6% |
1 | 236 | 2.4% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
Alerts: Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9965 |
---|---|
1 | 35 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9965 | |
1 | 35 | 0.4% |
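The imbalance percentages in the alerts (83.9% and 96.6%) can be reproduced from the class counts of the two categorical columns as one minus the normalised Shannon entropy. The report does not define the score, so this formula is an assumption that happens to match both reported values; `imbalance_score` is a hypothetical helper name.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for class counts.

    Assumes at least two classes; 0 means perfectly balanced, 1 means
    a single class holds every row.
    """
    n = sum(counts)
    entropy = -sum((c / n) * log2(c / n) for c in counts if c > 0)
    return 1 - entropy / log2(len(counts))


national = imbalance_score([9764, 236])  # 국가 기준초과 구분
local = imbalance_score([9965, 35])      # 지자체 기준초과 구분

print(f"{national:.1%}")  # 83.9%
print(f"{local:.1%}")     # 96.6%
```

With only 236 and 35 positive rows respectively, any model or summary built on these flags would need to account for the rare class explicitly.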
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.123 | 0.117 | 0.078 | 0.058 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.286 | 0.371 | 0.109 | 0.122 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.281 | 0.821 | 0.440 | 0.163 |
평균값 | 0.123 | 0.286 | 0.281 | 1.000 | 0.818 | 0.050 | 0.000 |
측정기 상태 | 0.117 | 0.371 | 0.821 | 0.818 | 1.000 | 0.175 | 0.060 |
국가 기준초과 구분 | 0.078 | 0.109 | 0.440 | 0.050 | 0.175 | 1.000 | 0.271 |
지자체 기준초과 구분 | 0.058 | 0.122 | 0.163 | 0.000 | 0.060 | 0.271 | 1.000 |
Variable | 지자체 기준초과 구분 | 국가 기준초과 구분 |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.175 |
국가 기준초과 구분 | 0.175 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.001 | -0.012 | 0.016 | -0.020 | 0.056 | 0.041 |
측정소 코드 | 0.001 | 1.000 | -0.004 | 0.131 | -0.147 | 0.078 | 0.088 |
측정항목 | -0.012 | -0.004 | 1.000 | -0.687 | 0.610 | 0.317 | 0.118 |
평균값 | 0.016 | 0.131 | -0.687 | 1.000 | -0.845 | 0.111 | 0.040 |
측정기 상태 | -0.020 | -0.147 | 0.610 | -0.845 | 1.000 | 0.126 | 0.043 |
국가 기준초과 구분 | 0.056 | 0.078 | 0.317 | 0.111 | 0.126 | 1.000 | 0.175 |
지자체 기준초과 구분 | 0.041 | 0.088 | 0.118 | 0.040 | 0.043 | 0.175 | 1.000 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
34013 | 1990013012 | 113 | 9 | -9999.0 | 4 | 0 | 0 |
40915 | 1990020512 | 108 | 3 | 0.044 | 0 | 0 | 0 |
24797 | 1990012212 | 113 | 9 | -9999.0 | 4 | 0 | 0 |
32711 | 1990012909 | 108 | 9 | -9999.0 | 4 | 0 | 0 |
61426 | 1990022307 | 117 | 8 | -9999.0 | 4 | 0 | 0 |
3697 | 1990010405 | 103 | 3 | 0.026 | 0 | 0 | 0 |
3895 | 1990010409 | 105 | 3 | 0.027 | 0 | 0 | 0 |
4837 | 1990010504 | 122 | 3 | 0.034 | 0 | 0 | 0 |
11302 | 1990011019 | 108 | 8 | -9999.0 | 4 | 0 | 0 |
64817 | 1990022606 | 107 | 9 | -9999.0 | 4 | 0 | 0 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
63915 | 1990022511 | 113 | 6 | 0.009 | 0 | 0 | 0 |
185 | 1990010103 | 122 | 9 | -9999.0 | 4 | 0 | 0 |
73364 | 1990030516 | 108 | 5 | 1.4 | 0 | 0 | 0 |
46195 | 1990021002 | 108 | 3 | 0.015 | 0 | 0 | 0 |
9433 | 1990010904 | 113 | 3 | 0.025 | 0 | 0 | 0 |
19461 | 1990011721 | 108 | 6 | 0.0 | 0 | 0 | 0 |
6663 | 1990010618 | 122 | 6 | 0.004 | 0 | 0 | 0 |
50546 | 1990021321 | 103 | 5 | 5.2 | 0 | 0 | 0 |
50787 | 1990021402 | 103 | 6 | 0.0 | 0 | 0 | 0 |
41671 | 1990020604 | 105 | 3 | 0.031 | 0 | 0 | 0 |