Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 is highly overall correlated with 측정항목 and 2 other fields | High correlation |
국가 기준초과 구분 is highly overall correlated with 측정항목 and 2 other fields | High correlation |
측정항목 is highly overall correlated with 평균값 and 2 other fields | High correlation |
평균값 is highly overall correlated with 측정항목 and 2 other fields | High correlation |
측정기 상태 is highly imbalanced (96.4%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (52.3%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (52.3%) | Imbalance |
Reproduction
Analysis started | 2024-04-27 12:02:59.879053 |
---|---|
Analysis finished | 2024-04-27 12:03:05.239676 |
Duration | 5.36 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 435 |
---|---|
Distinct (%) | 4.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.019011 × 109 |
Minimum | 2.0190101 × 109 |
---|---|
Maximum | 2.0190119 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0190101 × 109 |
---|---|
5-th percentile | 2.0190101 × 109 |
Q1 | 2.0190105 × 109 |
median | 2.019011 × 109 |
Q3 | 2.0190114 × 109 |
95-th percentile | 2.0190118 × 109 |
Maximum | 2.0190119 × 109 |
Range | 1802 |
Interquartile range (IQR) | 899 |
Descriptive statistics
Standard deviation | 519.78839 |
---|---|
Coefficient of variation (CV) | 2.5744704 × 10-7 |
Kurtosis | -1.1908663 |
Mean | 2.019011 × 109 |
Median Absolute Deviation (MAD) | 423 |
Skewness | 0.015573752 |
Sum | 2.019011 × 1013 |
Variance | 270179.97 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2019010320 | 38 | 0.4% |
2019011019 | 34 | 0.3% |
2019011018 | 33 | 0.3% |
2019010418 | 33 | 0.3% |
2019010501 | 33 | 0.3% |
2019011217 | 33 | 0.3% |
2019010518 | 33 | 0.3% |
2019010814 | 32 | 0.3% |
2019010410 | 32 | 0.3% |
2019010721 | 32 | 0.3% |
Other values (425) | 9667 |
Value | Count | Frequency (%) |
2019010100 | 22 | |
2019010101 | 30 | |
2019010102 | 17 | |
2019010103 | 23 | |
2019010104 | 27 | |
2019010105 | 23 | |
2019010106 | 23 | |
2019010107 | 24 | |
2019010108 | 26 | |
2019010109 | 27 |
Value | Count | Frequency (%) |
2019011902 | 8 | 0.1% |
2019011901 | 18 | |
2019011900 | 24 | |
2019011823 | 21 | |
2019011822 | 26 | |
2019011821 | 17 | |
2019011820 | 26 | |
2019011819 | 32 | |
2019011818 | 23 | |
2019011817 | 20 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9987 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2165578 |
---|---|
Coefficient of variation (CV) | 0.063864078 |
Kurtosis | -1.2144324 |
Mean | 112.9987 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0086693438 |
Sum | 1129987 |
Variance | 52.078706 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
119 | 433 | 4.3% |
121 | 428 | 4.3% |
124 | 418 | 4.2% |
101 | 416 | 4.2% |
113 | 416 | 4.2% |
104 | 415 | 4.2% |
111 | 415 | 4.2% |
105 | 414 | 4.1% |
123 | 413 | 4.1% |
118 | 407 | 4.1% |
Other values (15) | 5825 |
Value | Count | Frequency (%) |
101 | 416 | |
102 | 400 | |
103 | 375 | |
104 | 415 | |
105 | 414 | |
106 | 387 | |
107 | 403 | |
108 | 395 | |
109 | 394 | |
110 | 393 |
Value | Count | Frequency (%) |
125 | 356 | |
124 | 418 | |
123 | 413 | |
122 | 407 | |
121 | 428 | |
120 | 385 | |
119 | 433 | |
118 | 407 | |
117 | 376 | |
116 | 385 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3243 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7490881 |
---|---|
Coefficient of variation (CV) | 0.51632855 |
Kurtosis | -1.2030488 |
Mean | 5.3243 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.2006142 |
Sum | 53243 |
Variance | 7.5574853 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 1690 | |
5 | 1686 | |
9 | 1683 | |
1 | 1681 | |
3 | 1647 | |
8 | 1613 |
Value | Count | Frequency (%) |
1 | 1681 | |
3 | 1647 | |
5 | 1686 | |
6 | 1690 | |
8 | 1613 | |
9 | 1683 |
Value | Count | Frequency (%) |
9 | 1683 | |
8 | 1613 | |
6 | 1690 | |
5 | 1686 | |
3 | 1647 | |
1 | 1681 |
평균값
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 318 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.770869 |
Minimum | 0 |
---|---|
Maximum | 985 |
Zeros | 16 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.008 |
median | 0.078 |
Q3 | 27 |
95-th percentile | 97 |
Maximum | 985 |
Range | 985 |
Interquartile range (IQR) | 26.992 |
Descriptive statistics
Standard deviation | 47.553212 |
---|---|
Coefficient of variation (CV) | 2.4052161 |
Kurtosis | 186.95866 |
Mean | 19.770869 |
Median Absolute Deviation (MAD) | 0.077 |
Skewness | 10.177105 |
Sum | 197708.69 |
Variance | 2261.308 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.005 | 529 | 5.3% |
0.004 | 495 | 5.0% |
0.002 | 362 | 3.6% |
0.006 | 356 | 3.6% |
0.003 | 314 | 3.1% |
0.007 | 235 | 2.4% |
0.6 | 210 | 2.1% |
0.7 | 201 | 2.0% |
0.8 | 197 | 2.0% |
0.5 | 195 | 1.9% |
Other values (308) | 6906 |
Value | Count | Frequency (%) |
0.0 | 16 | 0.2% |
0.001 | 78 | 0.8% |
0.002 | 362 | |
0.003 | 314 | |
0.004 | 495 | |
0.005 | 529 | |
0.006 | 356 | |
0.007 | 235 | |
0.008 | 169 | 1.7% |
0.009 | 102 | 1.0% |
Value | Count | Frequency (%) |
985.0 | 11 | |
409.0 | 1 | < 0.1% |
247.0 | 1 | < 0.1% |
235.0 | 1 | < 0.1% |
224.0 | 1 | < 0.1% |
223.0 | 1 | < 0.1% |
218.0 | 1 | < 0.1% |
216.0 | 2 | < 0.1% |
215.0 | 1 | < 0.1% |
212.0 | 2 | < 0.1% |
측정기 상태
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 52 |
9 | 24 |
2 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9923 | |
1 | 52 | 0.5% |
9 | 24 | 0.2% |
2 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9923 | |
1 | 52 | 0.5% |
9 | 24 | 0.2% |
2 | 1 | < 0.1% |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 8975 | |
1 | 1025 | 10.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 8975 | |
1 | 1025 | 10.2% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 8975 | |
1 | 1025 | 10.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 8975 | |
1 | 1025 | 10.2% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.436 | 0.066 | 0.403 | 0.403 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.056 | 0.011 | 0.047 | 0.047 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.256 | 0.043 | 0.721 | 0.721 |
평균값 | 0.436 | 0.056 | 0.256 | 1.000 | 0.473 | 0.524 | 0.524 |
측정기 상태 | 0.066 | 0.011 | 0.043 | 0.473 | 1.000 | 0.136 | 0.136 |
국가 기준초과 구분 | 0.403 | 0.047 | 0.721 | 0.524 | 0.136 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.403 | 0.047 | 0.721 | 0.524 | 0.136 | 1.000 | 1.000 |
지자체 기준초과 구분 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|
지자체 기준초과 구분 | 1.000 | 0.090 | 0.999 |
측정기 상태 | 0.090 | 1.000 | 0.090 |
국가 기준초과 구분 | 0.999 | 0.090 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.008 | 0.004 | 0.054 | 0.040 | 0.310 | 0.310 |
측정소 코드 | 0.008 | 1.000 | 0.009 | 0.001 | 0.007 | 0.036 | 0.036 |
측정항목 | 0.004 | 0.009 | 1.000 | 0.704 | 0.028 | 0.533 | 0.533 |
평균값 | 0.054 | 0.001 | 0.704 | 1.000 | 0.403 | 0.635 | 0.635 |
측정기 상태 | 0.040 | 0.007 | 0.028 | 0.403 | 1.000 | 0.090 | 0.090 |
국가 기준초과 구분 | 0.310 | 0.036 | 0.533 | 0.635 | 0.090 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.310 | 0.036 | 0.533 | 0.635 | 0.090 | 0.999 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
31300 | 2019010916 | 117 | 8 | 65.0 | 0 | 0 | 0 |
50509 | 2019011500 | 119 | 3 | 0.051 | 0 | 0 | 0 |
45452 | 2019011315 | 101 | 5 | 0.8 | 0 | 0 | 0 |
46880 | 2019011400 | 114 | 5 | 1.3 | 0 | 0 | 0 |
11146 | 2019010402 | 108 | 8 | 56.0 | 0 | 0 | 0 |
14097 | 2019010421 | 125 | 6 | 0.002 | 0 | 0 | 0 |
31867 | 2019010920 | 112 | 3 | 0.049 | 0 | 0 | 0 |
19354 | 2019010609 | 101 | 8 | 51.0 | 0 | 0 | 0 |
43939 | 2019011304 | 124 | 3 | 0.062 | 0 | 0 | 0 |
24231 | 2019010717 | 114 | 6 | 0.019 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
56676 | 2019011617 | 122 | 1 | 0.004 | 0 | 0 | 0 |
17169 | 2019010518 | 112 | 6 | 0.032 | 0 | 0 | 0 |
39250 | 2019011121 | 117 | 8 | 111.0 | 0 | 1 | 1 |
37457 | 2019011109 | 118 | 9 | 61.0 | 0 | 1 | 1 |
53038 | 2019011517 | 115 | 8 | 86.0 | 0 | 0 | 0 |
451 | 2019010103 | 101 | 3 | 0.053 | 0 | 0 | 0 |
1757 | 2019010111 | 118 | 9 | 27.0 | 0 | 0 | 0 |
21754 | 2019010701 | 101 | 8 | 32.0 | 0 | 0 | 0 |
45468 | 2019011315 | 104 | 1 | 0.009 | 0 | 0 | 0 |
17414 | 2019010520 | 103 | 5 | 0.4 | 0 | 0 | 0 |