Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download (파일 다운로드) |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
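Alerts
평균값 (average value) is highly overall correlated with 측정기 상태 (instrument status) | High correlation |
---|---|
측정기 상태 (instrument status) is highly overall correlated with 평균값 (average value) | High correlation |
국가 기준초과 구분 (national standard exceedance flag) is highly overall correlated with 지자체 기준초과 구분 (local government standard exceedance flag) | High correlation |
지자체 기준초과 구분 (local government standard exceedance flag) is highly overall correlated with 국가 기준초과 구분 (national standard exceedance flag) | High correlation |
국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (87.1%) | Imbalance |
지자체 기준초과 구분 (local government standard exceedance flag) is highly imbalanced (80.9%) | Imbalance |
평균값 (average value) has 181 (1.8%) zeros | Zeros |
측정기 상태 (instrument status) has 8190 (81.9%) zeros | Zeros |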
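The two imbalance percentages are consistent with one minus the normalized Shannon entropy of the class frequencies. The sketch below recomputes both figures from the class counts reported for the two exceedance flags in this report (9822/178 and 9707/293). The function name `imbalance_score` is illustrative, and the formula is an assumption about how the profiler derives the number; the report itself does not state it.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))  # entropy of a uniform distribution
    return 1 - entropy / max_entropy

national = imbalance_score([9822, 178])  # 국가 기준초과 구분
local = imbalance_score([9707, 293])     # 지자체 기준초과 구분
print(f"{national:.1%}  {local:.1%}")    # 87.1%  80.9%
```

A score near 100% means almost all rows fall in one class, which is the case for both flags here.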
Reproduction
Analysis started | 2024-05-11 06:58:15.809498 |
---|---|
Analysis finished | 2024-05-11 06:58:20.107853 |
Duration | 4.3 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 572 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0030112 × 10⁹ |
Minimum | 2.0030101 × 10⁹ |
---|---|
Maximum | 2.0030124 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0030101 × 10⁹ |
---|---|
5-th percentile | 2.0030102 × 10⁹ |
Q1 | 2.0030106 × 10⁹ |
median | 2.0030112 × 10⁹ |
Q3 | 2.0030118 × 10⁹ |
95-th percentile | 2.0030123 × 10⁹ |
Maximum | 2.0030124 × 10⁹ |
Range | 2319 |
Interquartile range (IQR) | 1195.25 |
Descriptive statistics
Standard deviation | 688.16185 |
---|---|
Coefficient of variation (CV) | 3.4356365 × 10⁻⁷ |
Kurtosis | -1.1981722 |
Mean | 2.0030112 × 10⁹ |
Median Absolute Deviation (MAD) | 598 |
Skewness | 0.0141841 |
Sum | 2.0030112 × 10¹³ |
Variance | 473566.74 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
2003010911 | 32 | 0.3% |
2003011913 | 30 | 0.3% |
2003011108 | 29 | 0.3% |
2003010809 | 28 | 0.3% |
2003011502 | 27 | 0.3% |
2003011612 | 27 | 0.3% |
2003012410 | 27 | 0.3% |
2003010907 | 26 | 0.3% |
2003010121 | 26 | 0.3% |
2003010101 | 26 | 0.3% |
Other values (562) | 9722 | 97.2% |
Value | Count | Frequency (%) |
---|---|---|
2003010100 | 21 | 0.2% |
2003010101 | 26 | 0.3% |
2003010102 | 15 | 0.1% |
2003010103 | 23 | 0.2% |
2003010104 | 16 | 0.2% |
2003010105 | 13 | 0.1% |
2003010106 | 14 | 0.1% |
2003010107 | 18 | 0.2% |
2003010108 | 20 | 0.2% |
2003010109 | 23 | 0.2% |
Value | Count | Frequency (%) |
---|---|---|
2003012419 | 5 | 0.1% |
2003012418 | 11 | 0.1% |
2003012417 | 21 | 0.2% |
2003012416 | 16 | 0.2% |
2003012415 | 15 | 0.1% |
2003012414 | 15 | 0.1% |
2003012413 | 17 | 0.2% |
2003012412 | 22 | 0.2% |
2003012411 | 24 | 0.2% |
2003012410 | 27 | 0.3% |
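The profiler treats 측정일시 as a real number, but the values look like timestamps encoded as YYYYMMDDHH. Under that assumption (inferred from the values, not stated in the report), the numeric range of 2319 is a difference between encoded integers rather than a duration. The sketch below parses the reported minimum and maximum with the standard library; `parse_stamp` is a hypothetical helper name.

```python
from datetime import datetime

def parse_stamp(value):
    """Parse an integer such as 2003010911 as year, month, day, hour."""
    return datetime.strptime(str(value), "%Y%m%d%H")

first = parse_stamp(2003010100)  # minimum reported above
last = parse_stamp(2003012419)   # maximum reported above

# Whole hours between the first and last measurement.
span_hours = int((last - first).total_seconds() // 3600)
print(first, last, span_hours + 1)
```

The span is 571 hours, so an unbroken hourly series would contain 572 timestamps, which matches the 572 distinct values reported for this variable.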
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9614 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.1693689 |
---|---|
Coefficient of variation (CV) | 0.063467422 |
Kurtosis | -1.1828797 |
Mean | 112.9614 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.0061172693 |
Sum | 1129614 |
Variance | 51.39985 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
116 | 450 | 4.5% |
106 | 440 | 4.4% |
114 | 425 | 4.2% |
101 | 418 | 4.2% |
107 | 418 | 4.2% |
125 | 410 | 4.1% |
112 | 410 | 4.1% |
122 | 407 | 4.1% |
111 | 407 | 4.1% |
117 | 405 | 4.0% |
Other values (15) | 5810 | 58.1% |
Value | Count | Frequency (%) |
---|---|---|
101 | 418 | 4.2% |
102 | 374 | 3.7% |
103 | 393 | 3.9% |
104 | 380 | 3.8% |
105 | 398 | 4.0% |
106 | 440 | 4.4% |
107 | 418 | 4.2% |
108 | 383 | 3.8% |
109 | 384 | 3.8% |
110 | 403 | 4.0% |
Value | Count | Frequency (%) |
---|---|---|
125 | 410 | 4.1% |
124 | 372 | 3.7% |
123 | 381 | 3.8% |
122 | 407 | 4.1% |
121 | 391 | 3.9% |
120 | 374 | 3.7% |
119 | 385 | 3.9% |
118 | 404 | 4.0% |
117 | 405 | 4.0% |
116 | 450 | 4.5% |
측정항목 (measurement item)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.334 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7238183 |
---|---|
Coefficient of variation (CV) | 0.51065209 |
Kurtosis | -1.1752196 |
Mean | 5.334 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.21372359 |
Sum | 53340 |
Variance | 7.4191859 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
6 | 1738 | 17.4% |
5 | 1721 | 17.2% |
8 | 1658 | 16.6% |
1 | 1645 | 16.4% |
3 | 1624 | 16.2% |
9 | 1614 | 16.1% |
Value | Count | Frequency (%) |
---|---|---|
1 | 1645 | 16.4% |
3 | 1624 | 16.2% |
5 | 1721 | 17.2% |
6 | 1738 | 17.4% |
8 | 1658 | 16.6% |
9 | 1614 | 16.1% |
Value | Count | Frequency (%) |
---|---|---|
9 | 1614 | 16.1% |
8 | 1658 | 16.6% |
6 | 1738 | 17.4% |
5 | 1721 | 17.2% |
3 | 1624 | 16.2% |
1 | 1645 | 16.4% |
평균값 (average value)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 379 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -1026.3922 |
Minimum | -9999 |
---|---|
Maximum | 333 |
Zeros | 181 |
Zeros (%) | 1.8% |
Negative | 1688 |
Negative (%) | 16.9% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | 0.003 |
median | 0.028 |
Q3 | 1.6 |
95-th percentile | 100 |
Maximum | 333 |
Range | 10332 |
Interquartile range (IQR) | 1.597 |
Descriptive statistics
Standard deviation | 3037.1718 |
---|---|
Coefficient of variation (CV) | -2.9590752 |
Kurtosis | 4.8323413 |
Mean | -1026.3922 |
Median Absolute Deviation (MAD) | 0.672 |
Skewness | -2.6101415 |
Sum | -10263922 |
Variance | 9224412.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
-9999.0 | 1026 | 10.3% |
-9.999 | 488 | 4.9% |
0.002 | 338 | 3.4% |
0.003 | 255 | 2.5% |
0.004 | 241 | 2.4% |
0.001 | 223 | 2.2% |
0.007 | 197 | 2.0% |
0.005 | 195 | 1.9% |
0.006 | 187 | 1.9% |
0.0 | 181 | 1.8% |
Other values (369) | 6669 | 66.7% |
Value | Count | Frequency (%) |
---|---|---|
-9999.0 | 1026 | 10.3% |
-999.9 | 172 | 1.7% |
-10.0 | 1 | < 0.1% |
-9.999 | 488 | 4.9% |
-0.002 | 1 | < 0.1% |
0.0 | 181 | 1.8% |
0.001 | 223 | 2.2% |
0.002 | 338 | 3.4% |
0.003 | 255 | 2.5% |
0.004 | 241 | 2.4% |
Value | Count | Frequency (%) |
---|---|---|
333.0 | 1 | < 0.1% |
304.0 | 1 | < 0.1% |
290.0 | 1 | < 0.1% |
288.0 | 1 | < 0.1% |
286.0 | 1 | < 0.1% |
285.0 | 1 | < 0.1% |
284.0 | 1 | < 0.1% |
279.0 | 2 | < 0.1% |
277.0 | 1 | < 0.1% |
271.0 | 1 | < 0.1% |
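The values -9999, -999.9 and -9.999 account for most of the 1688 negative readings and drag the mean down to -1026.3922. They look like placeholder codes for missing or invalid measurements, although the report does not confirm this. The sketch below, under that assumption, filters them out before computing a mean. The `SENTINELS` set and the `drop_sentinels` helper are hypothetical names, and the input list is the 평균값 column of ten sample rows shown at the end of this report.

```python
from statistics import mean

# Assumed sentinel codes, taken from the lowest values listed above.
SENTINELS = {-9999.0, -999.9, -9.999}

def drop_sentinels(values):
    """Return only the readings that are not one of the assumed sentinel codes."""
    return [v for v in values if v not in SENTINELS]

# Average values from ten sample rows of this dataset.
raw = [0.005, 0.038, -9.999, 0.056, 0.5, 0.007, 33.0, 0.024, 0.004, 0.026]
valid = drop_sentinels(raw)

print(len(raw), len(valid))
print(mean(raw), mean(valid))
```

Because the sentinel codes coincide with non-zero instrument status values in the sample rows, filtering on 측정기 상태 may be an alternative; that is also an inference from the samples, not a documented rule.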
측정기 상태 (instrument status)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.6901 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 8190 |
Zeros (%) | 81.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.5249572 |
---|---|
Coefficient of variation (CV) | 2.2097627 |
Kurtosis | 2.766508 |
Mean | 0.6901 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.9632255 |
Sum | 6901 |
Variance | 2.3254945 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
0 | 8190 | 81.9% |
4 | 1577 | 15.8% |
2 | 143 | 1.4% |
1 | 62 | 0.6% |
9 | 21 | 0.2% |
8 | 7 | 0.1% |
Value | Count | Frequency (%) |
---|---|---|
0 | 8190 | 81.9% |
1 | 62 | 0.6% |
2 | 143 | 1.4% |
4 | 1577 | 15.8% |
8 | 7 | 0.1% |
9 | 21 | 0.2% |
Value | Count | Frequency (%) |
---|---|---|
9 | 21 | 0.2% |
8 | 7 | 0.1% |
4 | 1577 | 15.8% |
2 | 143 | 1.4% |
1 | 62 | 0.6% |
0 | 8190 | 81.9% |
국가 기준초과 구분 (national standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9822 |
---|---|
1 | 178 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 9822 | 98.2% |
1 | 178 | 1.8% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9707 |
---|---|
1 | 293 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 9707 | 97.1% |
1 | 293 | 2.9% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.021 | 0.000 | 0.025 | 0.097 | 0.245 | 0.262 |
측정소 코드 | 0.021 | 1.000 | 0.000 | 0.281 | 0.354 | 0.043 | 0.065 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.397 | 0.516 | 0.415 | 0.536 |
평균값 | 0.025 | 0.281 | 0.397 | 1.000 | 0.655 | 0.021 | 0.033 |
측정기 상태 | 0.097 | 0.354 | 0.516 | 0.655 | 1.000 | 0.115 | 0.123 |
국가 기준초과 구분 | 0.245 | 0.043 | 0.415 | 0.021 | 0.115 | 1.000 | 0.937 |
지자체 기준초과 구분 | 0.262 | 0.065 | 0.536 | 0.033 | 0.123 | 0.937 | 1.000 |
Variable | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|
국가 기준초과 구분 | 1.000 | 0.773 |
지자체 기준초과 구분 | 0.773 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.007 | 0.008 | 0.000 | -0.012 | 0.187 | 0.201 |
측정소 코드 | -0.007 | 1.000 | -0.016 | 0.098 | -0.193 | 0.033 | 0.049 |
측정항목 | 0.008 | -0.016 | 1.000 | 0.197 | 0.286 | 0.299 | 0.387 |
평균값 | 0.000 | 0.098 | 0.197 | 1.000 | -0.618 | 0.048 | 0.063 |
측정기 상태 | -0.012 | -0.193 | 0.286 | -0.618 | 1.000 | 0.083 | 0.089 |
국가 기준초과 구분 | 0.187 | 0.033 | 0.299 | 0.048 | 0.083 | 1.000 | 0.773 |
지자체 기준초과 구분 | 0.201 | 0.049 | 0.387 | 0.063 | 0.089 | 0.773 | 1.000 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
61788 | 2003011803 | 124 | 1 | 0.005 | 0 | 0 | 0 |
41899 | 2003011215 | 109 | 3 | 0.038 | 0 | 0 | 0 |
70803 | 2003012016 | 101 | 6 | -9.999 | 4 | 0 | 0 |
56959 | 2003011619 | 119 | 3 | 0.056 | 0 | 0 | 0 |
57530 | 2003011623 | 114 | 5 | 0.5 | 0 | 0 | 0 |
67974 | 2003011921 | 105 | 1 | 0.007 | 0 | 0 | 0 |
48850 | 2003011413 | 117 | 8 | 33.0 | 0 | 0 | 0 |
16449 | 2003010513 | 117 | 6 | 0.024 | 0 | 0 | 0 |
43965 | 2003011305 | 103 | 6 | 0.004 | 0 | 0 | 0 |
2523 | 2003010116 | 121 | 6 | 0.026 | 0 | 0 | 0 |
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
67581 | 2003011918 | 114 | 6 | 0.015 | 0 | 0 | 0 |
27329 | 2003010814 | 105 | 9 | -9999.0 | 4 | 0 | 0 |
69838 | 2003012009 | 115 | 8 | 72.0 | 0 | 0 | 0 |
27430 | 2003010814 | 122 | 8 | 83.0 | 0 | 0 | 0 |
40242 | 2003011204 | 108 | 1 | 0.008 | 0 | 0 | 0 |
30317 | 2003010910 | 103 | 9 | 0.0 | 1 | 0 | 0 |
62490 | 2003011808 | 116 | 1 | 0.005 | 1 | 0 | 0 |
26326 | 2003010807 | 113 | 8 | 194.0 | 0 | 1 | 1 |
76426 | 2003012205 | 113 | 8 | 97.0 | 0 | 0 | 0 |
16354 | 2003010513 | 101 | 8 | -9999.0 | 4 | 0 | 0 |