Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
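The two memory figures above are related by simple arithmetic. A minimal sketch, assuming the report's KiB means 1024 bytes and that the average is the total divided by the number of observations:

```python
# Figures copied from the "Dataset statistics" table.
total_kib = 693.4   # Total size in memory, in KiB
n_rows = 10_000     # Number of observations

# Assumption: 1 KiB = 1024 bytes, average = total / rows.
bytes_per_row = total_kib * 1024 / n_rows
print(f"{bytes_per_row:.1f} B")
```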
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
Alerts
측정항목 is highly overall correlated with 평균값 and 1 other field | High correlation |
평균값 is highly overall correlated with 측정항목 and 1 other field | High correlation |
측정기 상태 is highly overall correlated with 측정항목 and 1 other field | High correlation |
국가 기준초과 구분 is highly imbalanced (93.7%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (99.2%) | Imbalance |
평균값 has 414 (4.1%) zeros | Zeros |
측정기 상태 has 4954 (49.5%) zeros | Zeros |
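The two imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below is an illustration under that assumption, not the profiler's own code; `imbalance_score` is a hypothetical helper name, and the counts come from the Common Values tables of the two categorical variables.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy of the class shares."""
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1 - entropy / math.log2(len(counts))

# Counts of 0 and 1 in 국가 기준초과 구분 and 지자체 기준초과 구분.
national = imbalance_score([9926, 74])
local = imbalance_score([9993, 7])
print(f"{national:.1%}, {local:.1%}")
```

Under this assumption the scores round to 93.7% and 99.2%, matching the alerts.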
Reproduction
Analysis started | 2024-05-04 04:03:52.203109 |
---|---|
Analysis finished | 2024-05-04 04:04:03.832342 |
Duration | 11.63 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
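The reported duration can be checked from the two timestamps. A minimal sketch using only the standard library; the format string is an assumption about how the timestamps are laid out:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps above
started = datetime.strptime("2024-05-04 04:03:52.203109", FMT)
finished = datetime.strptime("2024-05-04 04:04:03.832342", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```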
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 1685 |
---|---|
Distinct (%) | 16.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9920179 × 10⁹ |
Minimum | 1.9920101 × 10⁹ |
---|---|
Maximum | 1.9920311 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9920101 × 10⁹ |
---|---|
5-th percentile | 1.9920104 × 10⁹ |
Q1 | 1.9920116 × 10⁹ |
median | 1.9920201 × 10⁹ |
Q3 | 1.9920221 × 10⁹ |
95-th percentile | 1.9920307 × 10⁹ |
Maximum | 1.9920311 × 10⁹ |
Range | 21006 |
Interquartile range (IQR) | 10487.25 |
Descriptive statistics
Standard deviation | 6881.9403 |
---|---|
Coefficient of variation (CV) | 3.4547583 × 10⁻⁶ |
Kurtosis | -0.96675957 |
Mean | 1.9920179 × 10⁹ |
Median Absolute Deviation (MAD) | 7798 |
Skewness | 0.53552467 |
Sum | 1.9920179 × 10¹³ |
Variance | 47361102 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1992011017 | 15 | 0.1% |
1992010704 | 13 | 0.1% |
1992011620 | 13 | 0.1% |
1992010117 | 13 | 0.1% |
1992012302 | 13 | 0.1% |
1992021512 | 13 | 0.1% |
1992012007 | 13 | 0.1% |
1992011923 | 12 | 0.1% |
1992013009 | 12 | 0.1% |
1992020218 | 12 | 0.1% |
Other values (1675) | 9871 | 98.7% |
Value | Count | Frequency (%) |
1992010100 | 5 | 0.1% |
1992010101 | 7 | 0.1% |
1992010102 | 6 | 0.1% |
1992010103 | 5 | 0.1% |
1992010104 | 10 | 0.1% |
1992010105 | 4 | < 0.1% |
1992010106 | 6 | 0.1% |
1992010107 | 5 | 0.1% |
1992010108 | 4 | < 0.1% |
1992010109 | 12 | 0.1% |
Value | Count | Frequency (%) |
1992031106 | 4 | < 0.1% |
1992031105 | 7 | 0.1% |
1992031104 | 5 | 0.1% |
1992031103 | 11 | 0.1% |
1992031102 | 4 | < 0.1% |
1992031101 | 2 | < 0.1% |
1992031100 | 2 | < 0.1% |
1992031023 | 9 | 0.1% |
1992031022 | 5 | 0.1% |
1992031021 | 5 | 0.1% |
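The profiler treats 측정일시 as a real number, so figures such as the range (21006) are differences between integers, not spans of time. The values look like timestamps packed as YYYYMMDDHH; that reading is an assumption based on the listed values, not something the report states. A minimal sketch of decoding them, with `parse_stamp` as a hypothetical helper:

```python
from datetime import datetime

def parse_stamp(value: int) -> datetime:
    """Decode an integer assumed to be packed as YYYYMMDDHH (hours 00-23)."""
    return datetime.strptime(str(value), "%Y%m%d%H")

# Minimum and maximum values from the tables above.
first = parse_stamp(1992010100)
last = parse_stamp(1992031106)

numeric_range = 1992031106 - 1992010100  # the report's "Range"
elapsed = last - first                   # actual span of time
print(numeric_range, elapsed)
```

If the assumption holds, statistics such as mean, variance and skewness for this column are better recomputed after converting it to a datetime type.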
측정소 코드 (station code)
Real number (ℝ)
Distinct | 11 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.4342 |
Minimum | 103 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 103 |
---|---|
5-th percentile | 103 |
Q1 | 107 |
median | 113 |
Q3 | 117 |
95-th percentile | 124 |
Maximum | 124 |
Range | 21 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 6.8692932 |
---|---|
Coefficient of variation (CV) | 0.061096118 |
Kurtosis | -1.2173878 |
Mean | 112.4342 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.29942858 |
Sum | 1124342 |
Variance | 47.187189 |
Monotonicity | Not monotonic |
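Several of the derived figures in these tables follow directly from the others. The sketch below recomputes them for 측정소 코드 from the reported mean, standard deviation and quantiles, using the textbook definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × n); that the profiler uses the same definitions is an assumption.

```python
# Figures copied from the 측정소 코드 tables above.
n = 10_000
mean, std = 112.4342, 6.8692932
minimum, q1, q3, maximum = 103, 107, 117, 124

value_range = maximum - minimum   # reported as 21
iqr = q3 - q1                     # reported as 10
cv = std / mean                   # reported as 0.061096118
variance = std ** 2               # reported as 47.187189
total = mean * n                  # reported as 1124342

print(value_range, iqr, round(cv, 9), round(variance, 6), round(total))
```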
Value | Count | Frequency (%) |
113 | 1036 | 10.4% |
108 | 1034 | 10.3% |
116 | 1030 | 10.3% |
117 | 1024 | 10.2% |
122 | 1021 | 10.2% |
107 | 1000 | 10.0% |
124 | 990 | 9.9% |
105 | 990 | 9.9% |
103 | 976 | 9.8% |
106 | 455 | 4.6% |
Value | Count | Frequency (%) |
103 | 976 | 9.8% |
105 | 990 | 9.9% |
106 | 455 | 4.6% |
107 | 1000 | 10.0% |
108 | 1034 | 10.3% |
111 | 444 | 4.4% |
113 | 1036 | 10.4% |
116 | 1030 | 10.3% |
117 | 1024 | 10.2% |
122 | 1021 | 10.2% |
Value | Count | Frequency (%) |
124 | 990 | 9.9% |
122 | 1021 | 10.2% |
117 | 1024 | 10.2% |
116 | 1030 | 10.3% |
113 | 1036 | 10.4% |
111 | 444 | 4.4% |
108 | 1034 | 10.3% |
107 | 1000 | 10.0% |
106 | 455 | 4.6% |
105 | 990 | 9.9% |
측정항목 (measured item)
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3326 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7724981 |
---|---|
Coefficient of variation (CV) | 0.51991489 |
Kurtosis | -1.2341175 |
Mean | 5.3326 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.2037194 |
Sum | 53326 |
Variance | 7.6867459 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1710 | 17.1% |
1 | 1701 | 17.0% |
3 | 1687 | 16.9% |
6 | 1675 | 16.8% |
8 | 1663 | 16.6% |
5 | 1564 | 15.6% |
Value | Count | Frequency (%) |
1 | 1701 | 17.0% |
3 | 1687 | 16.9% |
5 | 1564 | 15.6% |
6 | 1675 | 16.8% |
8 | 1663 | 16.6% |
9 | 1710 | 17.1% |
Value | Count | Frequency (%) |
9 | 1710 | 17.1% |
8 | 1663 | 16.6% |
6 | 1675 | 16.8% |
5 | 1564 | 15.6% |
3 | 1687 | 16.9% |
1 | 1701 | 17.0% |
평균값 (average value)
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 318 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -3206.8111 |
Minimum | -9999 |
---|---|
Maximum | 208 |
Zeros | 414 |
Zeros (%) | 4.1% |
Negative | 4615 |
Negative (%) | 46.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9999 |
median | 0 |
Q3 | 0.039 |
95-th percentile | 2.4 |
Maximum | 208 |
Range | 10207 |
Interquartile range (IQR) | 9999.039 |
Descriptive statistics
Standard deviation | 4630.3133 |
---|---|
Coefficient of variation (CV) | -1.4438996 |
Kurtosis | -1.382164 |
Mean | -3206.8111 |
Median Absolute Deviation (MAD) | 2.9 |
Skewness | -0.78217227 |
Sum | -32068111 |
Variance | 21439801 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 3169 | 31.7% |
-9.999 | 1068 | 10.7% |
0.0 | 414 | 4.1% |
-999.9 | 376 | 3.8% |
0.001 | 142 | 1.4% |
0.002 | 103 | 1.0% |
0.009 | 81 | 0.8% |
0.003 | 80 | 0.8% |
0.011 | 77 | 0.8% |
0.019 | 76 | 0.8% |
Other values (308) | 4414 | 44.1% |
Value | Count | Frequency (%) |
-9999.0 | 3169 | 31.7% |
-999.9 | 376 | 3.8% |
-10.122 | 1 | < 0.1% |
-10.011 | 1 | < 0.1% |
-9.999 | 1068 | 10.7% |
0.0 | 414 | 4.1% |
0.001 | 142 | 1.4% |
0.002 | 103 | 1.0% |
0.003 | 80 | 0.8% |
0.004 | 59 | 0.6% |
Value | Count | Frequency (%) |
208.0 | 1 | < 0.1% |
184.0 | 1 | < 0.1% |
170.0 | 1 | < 0.1% |
168.0 | 1 | < 0.1% |
159.0 | 1 | < 0.1% |
135.0 | 2 | < 0.1% |
113.0 | 1 | < 0.1% |
109.0 | 1 | < 0.1% |
107.0 | 1 | < 0.1% |
84.0 | 1 | < 0.1% |
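The values -9999.0, -999.9 and -9.999 account for nearly half of the observations and pull the mean down to -3206.8. They look like placeholders for missing measurements; that is an assumption, supported only by the fact that they appear alongside 측정기 상태 = 4 in the sample rows of this report. The sketch below shows how excluding them changes a summary statistic, using the (평균값, 측정기 상태) pairs from the first ten sample rows; `SENTINELS` is a hypothetical name.

```python
from statistics import mean

# Assumption: these three values mark missing measurements.
SENTINELS = {-9999.0, -999.9, -9.999}

# (평균값, 측정기 상태) pairs from the first sample table in this report.
rows = [
    (0.051, 0), (0.009, 0), (-9999.0, 4), (-9999.0, 4), (0.121, 0),
    (2.5, 0), (0.008, 0), (0.036, 0), (0.024, 0), (-9.999, 4),
]

valid = [value for value, _status in rows if value not in SENTINELS]
raw_mean = mean(value for value, _status in rows)
clean_mean = mean(valid)
print(len(valid), raw_mean, clean_mean)
```

Because 측정항목 mixes several measured items, any cleaned statistic would also need to be computed per item before it is interpreted.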
측정기 상태 (instrument status)
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9466 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 4954 |
Zeros (%) | 49.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 1.9914178 |
---|---|
Coefficient of variation (CV) | 1.0230237 |
Kurtosis | -1.562241 |
Mean | 1.9466 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.17327247 |
Sum | 19466 |
Variance | 3.965745 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4954 | 49.5% |
4 | 4591 | 45.9% |
2 | 402 | 4.0% |
9 | 28 | 0.3% |
1 | 22 | 0.2% |
8 | 3 | < 0.1% |
Value | Count | Frequency (%) |
0 | 4954 | 49.5% |
1 | 22 | 0.2% |
2 | 402 | 4.0% |
4 | 4591 | 45.9% |
8 | 3 | < 0.1% |
9 | 28 | 0.3% |
Value | Count | Frequency (%) |
9 | 28 | 0.3% |
8 | 3 | < 0.1% |
4 | 4591 | 45.9% |
2 | 402 | 4.0% |
1 | 22 | 0.2% |
0 | 4954 | 49.5% |
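The Frequency (%) column in these tables is each count divided by the 10000 observations. A minimal sketch of that calculation, with a rule for very small shares that matches the "< 0.1%" entries shown here; the rule and the helper name `fmt_share` are assumptions, not the profiler's implementation.

```python
def fmt_share(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place."""
    text = f"{100 * count / total:.1f}%"
    # Assumption: non-zero shares that would print as 0.0% are shown as "< 0.1%".
    return "< 0.1%" if count > 0 and text == "0.0%" else text

# Counts per 측정기 상태 value, from the table above.
counts = {0: 4954, 4: 4591, 2: 402, 9: 28, 1: 22, 8: 3}
total = sum(counts.values())
shares = {value: fmt_share(count, total) for value, count in counts.items()}
print(shares)
```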
국가 기준초과 구분 (national standard exceedance flag)
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9926 |
---|---|
1 | 74 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9926 | 99.3% |
1 | 74 | 0.7% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9993 |
---|---|
1 | 7 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9993 | 99.9% |
1 | 7 | 0.1% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.186 | 0.000 | 0.173 | 0.278 | 0.058 | 0.086 |
측정소 코드 | 0.186 | 1.000 | 0.000 | 0.228 | 0.273 | 0.035 | 0.044 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.363 | 0.730 | 0.242 | 0.076 |
평균값 | 0.173 | 0.228 | 0.363 | 1.000 | 0.561 | 0.029 | 0.000 |
측정기 상태 | 0.278 | 0.273 | 0.730 | 0.561 | 1.000 | 0.117 | 0.020 |
국가 기준초과 구분 | 0.058 | 0.035 | 0.242 | 0.029 | 0.117 | 1.000 | 0.303 |
지자체 기준초과 구분 | 0.086 | 0.044 | 0.076 | 0.000 | 0.020 | 0.303 | 1.000 |
Variable | 지자체 기준초과 구분 | 국가 기준초과 구분 |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.196 |
국가 기준초과 구분 | 0.196 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.046 | 0.004 | 0.025 | -0.028 | 0.041 | 0.062 |
측정소 코드 | 0.046 | 1.000 | 0.001 | 0.080 | -0.153 | 0.038 | 0.047 |
측정항목 | 0.004 | 0.001 | 1.000 | -0.727 | 0.614 | 0.174 | 0.055 |
평균값 | 0.025 | 0.080 | -0.727 | 1.000 | -0.873 | 0.062 | 0.014 |
측정기 상태 | -0.028 | -0.153 | 0.614 | -0.873 | 1.000 | 0.084 | 0.015 |
국가 기준초과 구분 | 0.041 | 0.038 | 0.174 | 0.062 | 0.084 | 1.000 | 0.196 |
지자체 기준초과 구분 | 0.062 | 0.047 | 0.055 | 0.014 | 0.015 | 0.196 | 1.000 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
56137 | 1992020610 | 107 | 3 | 0.051 | 0 | 0 | 0 |
50385 | 1992020123 | 117 | 6 | 0.009 | 0 | 0 | 0 |
46901 | 1992013014 | 113 | 9 | -9999.0 | 4 | 0 | 0 |
47452 | 1992013022 | 124 | 8 | -9999.0 | 4 | 0 | 0 |
40368 | 1992012611 | 116 | 1 | 0.121 | 0 | 0 | 0 |
11438 | 1992010805 | 107 | 5 | 2.5 | 0 | 0 | 0 |
92547 | 1992030512 | 113 | 6 | 0.008 | 0 | 0 | 0 |
72781 | 1992021906 | 113 | 3 | 0.036 | 0 | 0 | 0 |
52777 | 1992020320 | 103 | 3 | 0.024 | 0 | 0 | 0 |
25311 | 1992011623 | 111 | 6 | -9.999 | 4 | 0 | 0 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
35557 | 1992012310 | 117 | 3 | 0.024 | 0 | 0 | 0 |
4710 | 1992010323 | 108 | 1 | 0.043 | 0 | 0 | 0 |
39345 | 1992012520 | 105 | 6 | -9.999 | 4 | 0 | 0 |
42198 | 1992012715 | 108 | 1 | 0.035 | 0 | 0 | 0 |
29268 | 1992011911 | 111 | 1 | -9.999 | 4 | 0 | 0 |
25867 | 1992011707 | 124 | 3 | 0.019 | 0 | 0 | 0 |
95064 | 1992030711 | 105 | 1 | 0.052 | 0 | 0 | 0 |
7515 | 1992010517 | 122 | 6 | 0.005 | 0 | 0 | 0 |
48262 | 1992013111 | 106 | 8 | -9999.0 | 4 | 0 | 0 |
92445 | 1992030510 | 116 | 6 | 0.012 | 2 | 0 | 0 |