Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (97.3%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (94.4%) | Imbalance |
평균값 has 493 (4.9%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:05:14.805090 |
---|---|
Analysis finished | 2024-05-04 04:05:21.073809 |
Duration | 6.27 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 926 |
---|---|
Distinct (%) | 9.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9950134 × 109 |
Minimum | 1.9950101 × 109 |
---|---|
Maximum | 1.9950208 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9950101 × 109 |
---|---|
5-th percentile | 1.9950102 × 109 |
Q1 | 1.995011 × 109 |
median | 1.995012 × 109 |
Q3 | 1.995013 × 109 |
95-th percentile | 1.9950206 × 109 |
Maximum | 1.9950208 × 109 |
Range | 10713 |
Interquartile range (IQR) | 1991 |
Descriptive statistics
Standard deviation | 3662.4289 |
---|---|
Coefficient of variation (CV) | 1.8357916 × 10-6 |
Kurtosis | -0.089794021 |
Mean | 1.9950134 × 109 |
Median Absolute Deviation (MAD) | 995 |
Skewness | 1.2822722 |
Sum | 1.9950134 × 1013 |
Variance | 13413386 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1995011013 | 24 | 0.2% |
1995020807 | 22 | 0.2% |
1995010604 | 21 | 0.2% |
1995010911 | 21 | 0.2% |
1995020208 | 21 | 0.2% |
1995020705 | 21 | 0.2% |
1995011720 | 20 | 0.2% |
1995010306 | 20 | 0.2% |
1995020212 | 19 | 0.2% |
1995010612 | 19 | 0.2% |
Other values (916) | 9792 |
Value | Count | Frequency (%) |
1995010100 | 8 | |
1995010101 | 11 | |
1995010102 | 10 | |
1995010103 | 9 | |
1995010104 | 8 | |
1995010105 | 11 | |
1995010106 | 15 | |
1995010107 | 12 | |
1995010108 | 18 | |
1995010109 | 10 |
Value | Count | Frequency (%) |
1995020813 | 5 | 0.1% |
1995020812 | 11 | |
1995020811 | 7 | 0.1% |
1995020810 | 6 | 0.1% |
1995020809 | 13 | |
1995020808 | 9 | |
1995020807 | 22 | |
1995020806 | 8 | 0.1% |
1995020805 | 13 | |
1995020804 | 9 |
측정소 코드
Real number (ℝ)
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.2664 |
Minimum | 102 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 102 |
---|---|
5-th percentile | 102 |
Q1 | 106 |
median | 111 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 124 |
Range | 22 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 7.1365064 |
---|---|
Coefficient of variation (CV) | 0.063567607 |
Kurtosis | -1.3328116 |
Mean | 112.2664 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.24877064 |
Sum | 1122664 |
Variance | 50.929724 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
109 | 608 | 6.1% |
122 | 603 | 6.0% |
117 | 589 | 5.9% |
106 | 583 | 5.8% |
123 | 562 | 5.6% |
111 | 559 | 5.6% |
104 | 559 | 5.6% |
105 | 554 | 5.5% |
108 | 554 | 5.5% |
121 | 553 | 5.5% |
Other values (8) | 4276 |
Value | Count | Frequency (%) |
102 | 535 | |
103 | 537 | |
104 | 559 | |
105 | 554 | |
106 | 583 | |
107 | 508 | |
108 | 554 | |
109 | 608 | |
110 | 544 | |
111 | 559 |
Value | Count | Frequency (%) |
124 | 530 | |
123 | 562 | |
122 | 603 | |
121 | 553 | |
119 | 527 | |
117 | 589 | |
116 | 548 | |
113 | 547 | |
111 | 559 | |
110 | 544 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3689 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.750158 |
---|---|
Coefficient of variation (CV) | 0.51223864 |
Kurtosis | -1.2067447 |
Mean | 5.3689 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.219871 |
Sum | 53689 |
Variance | 7.5633691 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8 | 1709 | |
9 | 1698 | |
5 | 1669 | |
3 | 1658 | |
1 | 1636 | |
6 | 1630 |
Value | Count | Frequency (%) |
1 | 1636 | |
3 | 1658 | |
5 | 1669 | |
6 | 1630 | |
8 | 1709 | |
9 | 1698 |
Value | Count | Frequency (%) |
9 | 1698 | |
8 | 1709 | |
6 | 1630 | |
5 | 1669 | |
3 | 1658 | |
1 | 1636 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 328 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -2636.4637 |
Minimum | -9999 |
---|---|
Maximum | 289 |
Zeros | 493 |
Zeros (%) | 4.9% |
Negative | 3110 |
Negative (%) | 31.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9999 |
median | 0.012 |
Q3 | 0.046 |
95-th percentile | 3.8 |
Maximum | 289 |
Range | 10288 |
Interquartile range (IQR) | 9999.046 |
Descriptive statistics
Standard deviation | 4394.4123 |
---|---|
Coefficient of variation (CV) | -1.6667828 |
Kurtosis | -0.83641736 |
Mean | -2636.4637 |
Median Absolute Deviation (MAD) | 0.188 |
Skewness | -1.0772543 |
Sum | -26364637 |
Variance | 19310859 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 2625 | |
0.0 | 493 | 4.9% |
-9.999 | 335 | 3.4% |
0.011 | 146 | 1.5% |
-999.9 | 145 | 1.5% |
0.008 | 143 | 1.4% |
0.012 | 137 | 1.4% |
0.009 | 136 | 1.4% |
0.015 | 135 | 1.4% |
0.01 | 132 | 1.3% |
Other values (318) | 5573 |
Value | Count | Frequency (%) |
-9999.0 | 2625 | |
-999.9 | 145 | 1.5% |
-999.8 | 1 | < 0.1% |
-999.7 | 1 | < 0.1% |
-999.5 | 1 | < 0.1% |
-9.999 | 335 | 3.4% |
-9.998 | 1 | < 0.1% |
-3.5 | 1 | < 0.1% |
0.0 | 493 | 4.9% |
0.001 | 112 | 1.1% |
Value | Count | Frequency (%) |
289.0 | 1 | |
243.0 | 1 | |
220.0 | 1 | |
216.0 | 1 | |
202.0 | 1 | |
200.0 | 1 | |
199.0 | 1 | |
198.0 | 1 | |
197.0 | 1 | |
192.0 | 1 |
측정기 상태
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
4 | |
2 | 127 |
1 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 4 |
3rd row | 0 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
0 | 6384 | |
4 | 3486 | |
2 | 127 | 1.3% |
1 | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 6384 | |
4 | 3486 | |
2 | 127 | 1.3% |
1 | 3 | < 0.1% |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 27 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9973 | |
1 | 27 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9973 | |
1 | 27 | 0.3% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 64 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9936 | |
1 | 64 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9936 | |
1 | 64 | 0.6% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.019 | 0.010 | 0.079 | 0.069 | 0.008 | 0.000 |
측정소 코드 | 0.019 | 1.000 | 0.013 | 0.171 | 0.162 | 0.101 | 0.145 |
측정항목 | 0.010 | 0.013 | 1.000 | 0.216 | 0.656 | 0.156 | 0.244 |
평균값 | 0.079 | 0.171 | 0.216 | 1.000 | 0.573 | 0.000 | 0.000 |
측정기 상태 | 0.069 | 0.162 | 0.656 | 0.573 | 1.000 | 0.053 | 0.087 |
국가 기준초과 구분 | 0.008 | 0.101 | 0.156 | 0.000 | 0.053 | 1.000 | 0.841 |
지자체 기준초과 구분 | 0.000 | 0.145 | 0.244 | 0.000 | 0.087 | 0.841 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | 측정기 상태 | |
---|---|---|---|
지자체 기준초과 구분 | 1.000 | 0.636 | 0.058 |
국가 기준초과 구분 | 0.636 | 1.000 | 0.035 |
측정기 상태 | 0.058 | 0.035 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.016 | -0.004 | 0.032 | 0.028 | 0.005 | 0.000 |
측정소 코드 | -0.016 | 1.000 | -0.004 | -0.085 | 0.104 | 0.101 | 0.145 |
측정항목 | -0.004 | -0.004 | 1.000 | -0.587 | 0.485 | 0.112 | 0.175 |
평균값 | 0.032 | -0.085 | -0.587 | 1.000 | 0.599 | 0.029 | 0.048 |
측정기 상태 | 0.028 | 0.104 | 0.485 | 0.599 | 1.000 | 0.035 | 0.058 |
국가 기준초과 구분 | 0.005 | 0.101 | 0.112 | 0.029 | 0.035 | 1.000 | 0.636 |
지자체 기준초과 구분 | 0.000 | 0.145 | 0.175 | 0.048 | 0.058 | 0.636 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
85704 | 1995020301 | 111 | 9 | -9999.0 | 4 | 0 | 0 |
17705 | 1995010719 | 123 | 8 | -9999.0 | 4 | 0 | 0 |
88106 | 1995020323 | 121 | 3 | 0.022 | 0 | 0 | 0 |
74199 | 1995012915 | 102 | 5 | -999.9 | 4 | 0 | 0 |
4956 | 1995010221 | 122 | 9 | -9999.0 | 4 | 0 | 0 |
23055 | 1995010921 | 110 | 5 | 2.2 | 0 | 0 | 0 |
29060 | 1995011205 | 103 | 3 | 0.038 | 0 | 0 | 0 |
13337 | 1995010603 | 110 | 8 | 39.0 | 0 | 0 | 0 |
75638 | 1995013004 | 108 | 3 | 0.005 | 0 | 0 | 0 |
31365 | 1995011302 | 109 | 5 | 1.0 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
97338 | 1995020713 | 106 | 9 | -9999.0 | 4 | 0 | 0 |
90121 | 1995020418 | 110 | 1 | 0.015 | 0 | 0 | 0 |
48005 | 1995011912 | 110 | 8 | 192.0 | 0 | 1 | 1 |
98049 | 1995020719 | 122 | 5 | 0.8 | 0 | 0 | 0 |
33025 | 1995011317 | 121 | 1 | 0.006 | 0 | 0 | 0 |
58471 | 1995012313 | 109 | 1 | 0.019 | 0 | 0 | 0 |
84757 | 1995020216 | 121 | 1 | 0.029 | 0 | 0 | 0 |
36184 | 1995011423 | 102 | 6 | 0.011 | 0 | 0 | 0 |
91568 | 1995020507 | 122 | 3 | 0.045 | 0 | 0 | 0 |
44455 | 1995011803 | 116 | 1 | 0.028 | 0 | 0 | 0 |