Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (70.3%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (70.3%) | Imbalance |
평균값 has 154 (1.5%) zeros | Zeros |
측정기 상태 has 9229 (92.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 06:57:42.452154 |
---|---|
Analysis finished | 2024-05-11 06:57:47.742018 |
Duration | 5.29 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 566 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0070112 × 109 |
Minimum | 2.0070101 × 109 |
---|---|
Maximum | 2.0070124 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0070101 × 109 |
---|---|
5-th percentile | 2.0070102 × 109 |
Q1 | 2.0070106 × 109 |
median | 2.0070112 × 109 |
Q3 | 2.0070118 × 109 |
95-th percentile | 2.0070123 × 109 |
Maximum | 2.0070124 × 109 |
Range | 2313 |
Interquartile range (IQR) | 1197 |
Descriptive statistics
Standard deviation | 686.81627 |
---|---|
Coefficient of variation (CV) | 3.4220848 × 10-7 |
Kurtosis | -1.2065903 |
Mean | 2.0070112 × 109 |
Median Absolute Deviation (MAD) | 599 |
Skewness | 0.0078732948 |
Sum | 2.0070112 × 1013 |
Variance | 471716.58 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2007010401 | 28 | 0.3% |
2007012316 | 28 | 0.3% |
2007010302 | 28 | 0.3% |
2007010814 | 28 | 0.3% |
2007010114 | 27 | 0.3% |
2007011517 | 27 | 0.3% |
2007010621 | 27 | 0.3% |
2007011216 | 27 | 0.3% |
2007010106 | 27 | 0.3% |
2007010710 | 26 | 0.3% |
Other values (556) | 9727 |
Value | Count | Frequency (%) |
2007010100 | 16 | |
2007010101 | 20 | |
2007010102 | 14 | |
2007010103 | 16 | |
2007010104 | 20 | |
2007010105 | 16 | |
2007010106 | 27 | |
2007010107 | 16 | |
2007010108 | 18 | |
2007010109 | 22 |
Value | Count | Frequency (%) |
2007012413 | 19 | |
2007012412 | 19 | |
2007012411 | 16 | |
2007012410 | 16 | |
2007012409 | 20 | |
2007012408 | 19 | |
2007012407 | 25 | |
2007012406 | 23 | |
2007012405 | 14 | |
2007012404 | 16 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.8982 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.1846641 |
---|---|
Coefficient of variation (CV) | 0.063638429 |
Kurtosis | -1.1895578 |
Mean | 112.8982 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0015054535 |
Sum | 1128982 |
Variance | 51.619399 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
101 | 444 | 4.4% |
117 | 421 | 4.2% |
111 | 420 | 4.2% |
114 | 419 | 4.2% |
113 | 418 | 4.2% |
121 | 418 | 4.2% |
116 | 417 | 4.2% |
115 | 415 | 4.2% |
118 | 415 | 4.2% |
106 | 407 | 4.1% |
Other values (15) | 5806 |
Value | Count | Frequency (%) |
101 | 444 | |
102 | 389 | |
103 | 404 | |
104 | 404 | |
105 | 391 | |
106 | 407 | |
107 | 403 | |
108 | 403 | |
109 | 365 | |
110 | 380 |
Value | Count | Frequency (%) |
125 | 385 | |
124 | 357 | |
123 | 388 | |
122 | 388 | |
121 | 418 | |
120 | 401 | |
119 | 362 | |
118 | 415 | |
117 | 421 | |
116 | 417 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3213 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.769013 |
---|---|
Coefficient of variation (CV) | 0.52036401 |
Kurtosis | -1.231943 |
Mean | 5.3213 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.19981145 |
Sum | 53213 |
Variance | 7.6674331 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1711 | |
8 | 1705 | |
9 | 1672 | |
5 | 1672 | |
3 | 1662 | |
6 | 1578 |
Value | Count | Frequency (%) |
1 | 1711 | |
3 | 1662 | |
5 | 1672 | |
6 | 1578 | |
8 | 1705 | |
9 | 1672 |
Value | Count | Frequency (%) |
9 | 1672 | |
8 | 1705 | |
6 | 1578 | |
5 | 1672 | |
3 | 1662 | |
1 | 1711 |
평균값
Real number (ℝ)
ZEROS
 
Distinct | 327 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -299.64771 |
Minimum | -9999 |
---|---|
Maximum | 231 |
Zeros | 154 |
Zeros (%) | 1.5% |
Negative | 488 |
Negative (%) | 4.9% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0 |
Q1 | 0.009 |
median | 0.057 |
Q3 | 24 |
95-th percentile | 83 |
Maximum | 231 |
Range | 10230 |
Interquartile range (IQR) | 23.991 |
Descriptive statistics
Standard deviation | 1739.6109 |
---|---|
Coefficient of variation (CV) | -5.8055203 |
Kurtosis | 27.076841 |
Mean | -299.64771 |
Median Absolute Deviation (MAD) | 0.443 |
Skewness | -5.3864844 |
Sum | -2996477.1 |
Variance | 3026245.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 311 | 3.1% |
0.007 | 232 | 2.3% |
0.001 | 224 | 2.2% |
0.003 | 223 | 2.2% |
0.002 | 222 | 2.2% |
0.006 | 214 | 2.1% |
0.008 | 209 | 2.1% |
0.009 | 184 | 1.8% |
0.004 | 183 | 1.8% |
0.005 | 180 | 1.8% |
Other values (317) | 7818 |
Value | Count | Frequency (%) |
-9999.0 | 311 | |
-999.9 | 51 | 0.5% |
-9.999 | 126 | |
0.0 | 154 | |
0.001 | 224 | |
0.002 | 222 | |
0.003 | 223 | |
0.004 | 183 | |
0.005 | 180 | |
0.006 | 214 |
Value | Count | Frequency (%) |
231.0 | 2 | |
222.0 | 1 | |
205.0 | 2 | |
200.0 | 1 | |
199.0 | 1 | |
197.0 | 1 | |
190.0 | 1 | |
188.0 | 1 | |
185.0 | 1 | |
183.0 | 1 |
측정기 상태
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.308 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9229 |
Zeros (%) | 92.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.1828254 |
---|---|
Coefficient of variation (CV) | 3.8403422 |
Kurtosis | 22.054152 |
Mean | 0.308 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.4287447 |
Sum | 3080 |
Variance | 1.3990759 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9229 | |
4 | 558 | 5.6% |
1 | 75 | 0.8% |
2 | 65 | 0.7% |
9 | 59 | 0.6% |
8 | 14 | 0.1% |
Value | Count | Frequency (%) |
0 | 9229 | |
1 | 75 | 0.8% |
2 | 65 | 0.7% |
4 | 558 | 5.6% |
8 | 14 | 0.1% |
9 | 59 | 0.6% |
Value | Count | Frequency (%) |
9 | 59 | 0.6% |
8 | 14 | 0.1% |
4 | 558 | 5.6% |
2 | 65 | 0.7% |
1 | 75 | 0.8% |
0 | 9229 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 525 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9475 | |
1 | 525 | 5.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9475 | |
1 | 525 | 5.2% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 525 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9475 | |
1 | 525 | 5.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9475 | |
1 | 525 | 5.2% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.059 | 0.092 | 0.414 | 0.414 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.289 | 0.430 | 0.091 | 0.091 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.219 | 0.355 | 0.455 | 0.455 |
평균값 | 0.059 | 0.289 | 0.219 | 1.000 | 0.572 | 0.016 | 0.016 |
측정기 상태 | 0.092 | 0.430 | 0.355 | 0.572 | 1.000 | 0.142 | 0.142 |
국가 기준초과 구분 | 0.414 | 0.091 | 0.455 | 0.016 | 0.142 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.414 | 0.091 | 0.455 | 0.016 | 0.142 | 1.000 | 1.000 |
국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|
국가 기준초과 구분 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.999 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.004 | 0.006 | 0.032 | 0.050 | 0.318 | 0.318 |
측정소 코드 | -0.004 | 1.000 | 0.004 | 0.031 | -0.019 | 0.070 | 0.070 |
측정항목 | 0.006 | 0.004 | 1.000 | 0.492 | 0.167 | 0.328 | 0.328 |
평균값 | 0.032 | 0.031 | 0.492 | 1.000 | -0.324 | 0.043 | 0.043 |
측정기 상태 | 0.050 | -0.019 | 0.167 | -0.324 | 1.000 | 0.102 | 0.102 |
국가 기준초과 구분 | 0.318 | 0.070 | 0.328 | 0.043 | 0.102 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.318 | 0.070 | 0.328 | 0.043 | 0.102 | 0.999 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
71773 | 2007012022 | 113 | 3 | 0.06 | 0 | 0 | 0 |
84691 | 2007012412 | 116 | 3 | -9.999 | 4 | 0 | 0 |
7817 | 2007010304 | 103 | 9 | 17.0 | 0 | 0 | 0 |
27483 | 2007010815 | 106 | 6 | 0.03 | 0 | 0 | 0 |
40291 | 2007011204 | 116 | 3 | -9.999 | 4 | 0 | 0 |
64425 | 2007011821 | 113 | 6 | 0.007 | 0 | 0 | 0 |
28495 | 2007010821 | 125 | 3 | 0.044 | 0 | 0 | 0 |
52707 | 2007011515 | 110 | 6 | 0.006 | 0 | 0 | 0 |
83660 | 2007012405 | 119 | 5 | 0.7 | 0 | 0 | 0 |
10551 | 2007010322 | 109 | 6 | 0.0 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
2038 | 2007010113 | 115 | 8 | 36.0 | 0 | 0 | 0 |
29243 | 2007010902 | 124 | 9 | 22.0 | 0 | 0 | 0 |
22567 | 2007010706 | 112 | 3 | 0.028 | 0 | 0 | 0 |
71483 | 2007012020 | 114 | 9 | 32.0 | 0 | 0 | 0 |
47819 | 2007011406 | 120 | 9 | 7.0 | 2 | 0 | 0 |
39373 | 2007011122 | 113 | 3 | 0.054 | 0 | 0 | 0 |
71791 | 2007012022 | 116 | 3 | -9.999 | 4 | 0 | 0 |
72338 | 2007012102 | 107 | 5 | 1.6 | 0 | 0 | 0 |
48644 | 2007011412 | 108 | 5 | 0.9 | 0 | 0 | 0 |
26653 | 2007010809 | 118 | 3 | 0.033 | 0 | 0 | 0 |