Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
평균값 is highly overall correlated with 측정기 상태 | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (97.6%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (95.2%) | Imbalance |
평균값 has 142 (1.4%) zeros | Zeros |
측정기 상태 has 8282 (82.8%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-27 12:06:21.336806 |
---|---|
Analysis finished | 2024-04-27 12:06:30.351659 |
Duration | 9.01 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 581 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0050113 × 109 |
Minimum | 2.0050101 × 109 |
---|---|
Maximum | 2.0050125 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0050101 × 109 |
---|---|
5-th percentile | 2.0050102 × 109 |
Q1 | 2.0050107 × 109 |
median | 2.0050112 × 109 |
Q3 | 2.0050119 × 109 |
95-th percentile | 2.0050123 × 109 |
Maximum | 2.0050125 × 109 |
Range | 2404 |
Interquartile range (IQR) | 1201 |
Descriptive statistics
Standard deviation | 694.46557 |
---|---|
Coefficient of variation (CV) | 3.4636492 × 10-7 |
Kurtosis | -1.1922604 |
Mean | 2.0050113 × 109 |
Median Absolute Deviation (MAD) | 600 |
Skewness | 0.010009571 |
Sum | 2.0050113 × 1013 |
Variance | 482282.43 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2005012310 | 29 | 0.3% |
2005011023 | 29 | 0.3% |
2005012116 | 28 | 0.3% |
2005010403 | 28 | 0.3% |
2005011915 | 28 | 0.3% |
2005011914 | 27 | 0.3% |
2005012121 | 26 | 0.3% |
2005011812 | 26 | 0.3% |
2005010815 | 26 | 0.3% |
2005012321 | 26 | 0.3% |
Other values (571) | 9727 |
Value | Count | Frequency (%) |
2005010100 | 20 | |
2005010101 | 17 | |
2005010102 | 20 | |
2005010103 | 11 | |
2005010104 | 16 | |
2005010105 | 15 | |
2005010106 | 12 | |
2005010107 | 14 | |
2005010108 | 23 | |
2005010109 | 17 |
Value | Count | Frequency (%) |
2005012504 | 4 | < 0.1% |
2005012503 | 15 | |
2005012502 | 18 | |
2005012501 | 19 | |
2005012500 | 13 | |
2005012423 | 19 | |
2005012422 | 15 | |
2005012421 | 16 | |
2005012420 | 15 | |
2005012419 | 19 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.0493 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.226527 |
---|---|
Coefficient of variation (CV) | 0.063923677 |
Kurtosis | -1.2120867 |
Mean | 113.0493 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.01901842 |
Sum | 1130493 |
Variance | 52.222692 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
122 | 442 | 4.4% |
116 | 436 | 4.4% |
118 | 428 | 4.3% |
117 | 418 | 4.2% |
102 | 414 | 4.1% |
111 | 413 | 4.1% |
103 | 412 | 4.1% |
123 | 408 | 4.1% |
110 | 408 | 4.1% |
105 | 407 | 4.1% |
Other values (15) | 5814 |
Value | Count | Frequency (%) |
101 | 399 | |
102 | 414 | |
103 | 412 | |
104 | 381 | |
105 | 407 | |
106 | 366 | |
107 | 403 | |
108 | 400 | |
109 | 384 | |
110 | 408 |
Value | Count | Frequency (%) |
125 | 389 | |
124 | 398 | |
123 | 408 | |
122 | 442 | |
121 | 388 | |
120 | 396 | |
119 | 388 | |
118 | 428 | |
117 | 418 | |
116 | 436 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.2745 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7655586 |
---|---|
Coefficient of variation (CV) | 0.52432622 |
Kurtosis | -1.2413594 |
Mean | 5.2745 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.18146356 |
Sum | 52745 |
Variance | 7.6483146 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1740 | |
8 | 1720 | |
3 | 1712 | |
5 | 1632 | |
6 | 1605 | |
9 | 1591 |
Value | Count | Frequency (%) |
1 | 1740 | |
3 | 1712 | |
5 | 1632 | |
6 | 1605 | |
8 | 1720 | |
9 | 1591 |
Value | Count | Frequency (%) |
9 | 1591 | |
8 | 1720 | |
6 | 1605 | |
5 | 1632 | |
3 | 1712 | |
1 | 1740 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 291 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -938.25209 |
Minimum | -9999 |
---|---|
Maximum | 182 |
Zeros | 142 |
Zeros (%) | 1.4% |
Negative | 1544 |
Negative (%) | 15.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | 0.004 |
median | 0.025 |
Q3 | 1.1 |
95-th percentile | 63 |
Maximum | 182 |
Range | 10181 |
Interquartile range (IQR) | 1.096 |
Descriptive statistics
Standard deviation | 2909.4193 |
---|---|
Coefficient of variation (CV) | -3.1008929 |
Kurtosis | 5.7917308 |
Mean | -938.25209 |
Median Absolute Deviation (MAD) | 0.375 |
Skewness | -2.7878782 |
Sum | -9382520.9 |
Variance | 8464720.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 933 | 9.3% |
-9.999 | 457 | 4.6% |
0.003 | 308 | 3.1% |
0.004 | 284 | 2.8% |
0.002 | 264 | 2.6% |
0.005 | 249 | 2.5% |
0.001 | 236 | 2.4% |
0.006 | 231 | 2.3% |
0.5 | 188 | 1.9% |
0.007 | 185 | 1.8% |
Other values (281) | 6665 |
Value | Count | Frequency (%) |
-9999.0 | 933 | |
-999.9 | 154 | 1.5% |
-9.999 | 457 | |
0.0 | 142 | 1.4% |
0.001 | 236 | 2.4% |
0.002 | 264 | 2.6% |
0.003 | 308 | 3.1% |
0.004 | 284 | 2.8% |
0.005 | 249 | 2.5% |
0.006 | 231 | 2.3% |
Value | Count | Frequency (%) |
182.0 | 1 | < 0.1% |
180.0 | 1 | < 0.1% |
175.0 | 1 | < 0.1% |
170.0 | 2 | |
168.0 | 1 | < 0.1% |
164.0 | 1 | < 0.1% |
163.0 | 1 | < 0.1% |
162.0 | 1 | < 0.1% |
161.0 | 1 | < 0.1% |
160.0 | 3 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.6874 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 8282 |
Zeros (%) | 82.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.5899478 |
---|---|
Coefficient of variation (CV) | 2.3129878 |
Kurtosis | 4.5775396 |
Mean | 0.6874 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.2429648 |
Sum | 6874 |
Variance | 2.527934 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8282 | |
4 | 1457 | 14.6% |
2 | 146 | 1.5% |
8 | 57 | 0.6% |
9 | 30 | 0.3% |
1 | 28 | 0.3% |
Value | Count | Frequency (%) |
0 | 8282 | |
1 | 28 | 0.3% |
2 | 146 | 1.5% |
4 | 1457 | 14.6% |
8 | 57 | 0.6% |
9 | 30 | 0.3% |
Value | Count | Frequency (%) |
9 | 30 | 0.3% |
8 | 57 | 0.6% |
4 | 1457 | 14.6% |
2 | 146 | 1.5% |
1 | 28 | 0.3% |
0 | 8282 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 24 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9976 | |
1 | 24 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9976 | |
1 | 24 | 0.2% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 53 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 53 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 53 | 0.5% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.209 | 0.248 | 0.077 | 0.126 |
측정소 코드 | 0.000 | 1.000 | 0.023 | 0.151 | 0.303 | 0.028 | 0.044 |
측정항목 | 0.000 | 0.023 | 1.000 | 0.388 | 0.526 | 0.118 | 0.187 |
평균값 | 0.209 | 0.151 | 0.388 | 1.000 | 0.630 | 0.000 | 0.000 |
측정기 상태 | 0.248 | 0.303 | 0.526 | 0.630 | 1.000 | 0.156 | 0.148 |
국가 기준초과 구분 | 0.077 | 0.028 | 0.118 | 0.000 | 0.156 | 1.000 | 0.859 |
지자체 기준초과 구분 | 0.126 | 0.044 | 0.187 | 0.000 | 0.148 | 0.859 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.658 |
국가 기준초과 구분 | 0.658 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.002 | 0.002 | -0.103 | 0.194 | 0.059 | 0.097 |
측정소 코드 | 0.002 | 1.000 | -0.010 | -0.004 | 0.003 | 0.021 | 0.033 |
측정항목 | 0.002 | -0.010 | 1.000 | 0.243 | 0.266 | 0.085 | 0.134 |
평균값 | -0.103 | -0.004 | 0.243 | 1.000 | -0.618 | 0.010 | 0.021 |
측정기 상태 | 0.194 | 0.003 | 0.266 | -0.618 | 1.000 | 0.112 | 0.107 |
국가 기준초과 구분 | 0.059 | 0.021 | 0.085 | 0.010 | 0.112 | 1.000 | 0.658 |
지자체 기준초과 구분 | 0.097 | 0.033 | 0.134 | 0.021 | 0.107 | 0.658 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
2189 | 2005010114 | 115 | 9 | -9999.0 | 4 | 0 | 0 |
893 | 2005010105 | 124 | 9 | 7.0 | 0 | 0 | 0 |
80729 | 2005012310 | 105 | 9 | -9999.0 | 4 | 0 | 0 |
48322 | 2005011410 | 104 | 8 | 57.0 | 0 | 0 | 0 |
48599 | 2005011411 | 125 | 9 | -9999.0 | 4 | 0 | 0 |
15382 | 2005010506 | 114 | 8 | 41.0 | 0 | 0 | 0 |
39083 | 2005011120 | 114 | 9 | -9999.0 | 4 | 0 | 0 |
49922 | 2005011420 | 121 | 5 | 0.7 | 0 | 0 | 0 |
12553 | 2005010411 | 118 | 3 | 0.022 | 0 | 0 | 0 |
82391 | 2005012321 | 107 | 9 | 34.0 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
64778 | 2005011823 | 122 | 5 | 0.8 | 0 | 0 | 0 |
66116 | 2005011908 | 120 | 5 | -999.9 | 4 | 0 | 0 |
8017 | 2005010305 | 112 | 3 | 0.039 | 0 | 0 | 0 |
65777 | 2005011906 | 113 | 9 | 22.0 | 0 | 0 | 0 |
20356 | 2005010615 | 118 | 8 | 25.0 | 0 | 0 | 0 |
44249 | 2005011306 | 125 | 9 | -9999.0 | 4 | 0 | 0 |
69740 | 2005012008 | 124 | 5 | 0.6 | 0 | 0 | 0 |
63090 | 2005011812 | 116 | 1 | 0.007 | 0 | 0 | 0 |
1628 | 2005010110 | 122 | 5 | 0.4 | 0 | 0 | 0 |
43339 | 2005011300 | 124 | 3 | 0.05 | 0 | 0 | 0 |