Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
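The average record size follows from the two figures above it. A minimal check, assuming the report uses binary units (1 KiB = 1024 bytes):

```python
# Values copied from the "Dataset statistics" table.
total_size_kib = 693.4
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```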
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
Alerts
측정항목 (measurement item) is highly overall correlated with 평균값 (average value) | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
국가 기준초과 구분 (national standard exceedance flag) is highly overall correlated with 지자체 기준초과 구분 (local government standard exceedance flag) | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (99.5%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (99.5%) | Imbalance |
평균값 is highly skewed (γ1 = -34.2708403) | Skewed |
측정기 상태 (instrument status) has 9808 (98.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-27 12:05:17.134911 |
---|---|
Analysis finished | 2024-04-27 12:05:26.209765 |
Duration | 9.07 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
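The reported duration can be checked from the two timestamps above. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed:

```python
from datetime import datetime

# Assumed layout of the timestamps as printed in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-27 12:05:17.134911", FMT)
finished = datetime.strptime("2024-04-27 12:05:26.209765", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```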
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 435 |
---|---|
Distinct (%) | 4.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.009011 × 10^9 |
Minimum | 2.0090101 × 10^9 |
---|---|
Maximum | 2.0090119 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0090101 × 10^9 |
---|---|
5-th percentile | 2.0090101 × 10^9 |
Q1 | 2.0090105 × 10^9 |
median | 2.009011 × 10^9 |
Q3 | 2.0090114 × 10^9 |
95-th percentile | 2.0090118 × 10^9 |
Maximum | 2.0090119 × 10^9 |
Range | 1802 |
Interquartile range (IQR) | 901 |
Descriptive statistics
Standard deviation | 524.47485 |
---|---|
Coefficient of variation (CV) | 2.6106122 × 10^-7 |
Kurtosis | -1.2058591 |
Mean | 2.009011 × 10^9 |
Median Absolute Deviation (MAD) | 479.5 |
Skewness | 0.0076469006 |
Sum | 2.009011 × 10^13 |
Variance | 275073.87 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2009010108 | 35 | 0.4% |
2009011802 | 34 | 0.3% |
2009010507 | 34 | 0.3% |
2009010518 | 34 | 0.3% |
2009011821 | 34 | 0.3% |
2009011313 | 33 | 0.3% |
2009011805 | 32 | 0.3% |
2009010617 | 32 | 0.3% |
2009010905 | 32 | 0.3% |
2009011522 | 32 | 0.3% |
Other values (425) | 9668 |
Minimum 10 values
Value | Count | Frequency (%) |
2009010100 | 15 | |
2009010101 | 22 | |
2009010102 | 22 | |
2009010103 | 27 | |
2009010104 | 21 | |
2009010105 | 29 | |
2009010106 | 23 | |
2009010107 | 22 | |
2009010108 | 35 | |
2009010109 | 24 |
Maximum 10 values
Value | Count | Frequency (%) |
2009011902 | 22 | |
2009011901 | 24 | |
2009011900 | 22 | |
2009011823 | 24 | |
2009011822 | 30 | |
2009011821 | 34 | |
2009011820 | 25 | |
2009011819 | 25 | |
2009011818 | 26 | |
2009011817 | 14 |
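The values of 측정일시 look like timestamps packed into integers as YYYYMMDDHH, so statistics such as the mean or the range of 1802 describe the integer encoding rather than elapsed time. The sketch below parses the reported minimum and maximum under that assumption; `parse_stamp` is a hypothetical helper, not part of the report.

```python
from datetime import datetime


def parse_stamp(value: int) -> datetime:
    """Parse an integer such as 2009010108, assuming a YYYYMMDDHH layout."""
    return datetime.strptime(str(value), "%Y%m%d%H")


# Smallest and largest values from the extreme-value tables.
first = parse_stamp(2009010100)
last = parse_stamp(2009011902)

# Difference of the raw integers, which is what the report lists as "Range".
numeric_range = 2009011902 - 2009010100

# Elapsed time between the same two stamps, in whole hours.
elapsed_hours = int((last - first).total_seconds() // 3600)

print(numeric_range)
print(elapsed_hours)
```

Under this reading the data span 434 hours, that is 435 hourly stamps, which agrees with the reported Distinct count of 435.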
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9829 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2568777 |
---|---|
Coefficient of variation (CV) | 0.064229876 |
Kurtosis | -1.2075917 |
Mean | 112.9829 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.0044423865 |
Sum | 1129829 |
Variance | 52.662274 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
125 | 430 | 4.3% |
101 | 427 | 4.3% |
107 | 420 | 4.2% |
102 | 416 | 4.2% |
109 | 412 | 4.1% |
124 | 411 | 4.1% |
115 | 410 | 4.1% |
118 | 410 | 4.1% |
113 | 408 | 4.1% |
119 | 407 | 4.1% |
Other values (15) | 5849 |
Minimum 10 values
Value | Count | Frequency (%) |
101 | 427 | |
102 | 416 | |
103 | 382 | |
104 | 396 | |
105 | 405 | |
106 | 388 | |
107 | 420 | |
108 | 394 | |
109 | 412 | |
110 | 371 |
Maximum 10 values
Value | Count | Frequency (%) |
125 | 430 | |
124 | 411 | |
123 | 389 | |
122 | 386 | |
121 | 382 | |
120 | 405 | |
119 | 407 | |
118 | 410 | |
117 | 374 | |
116 | 403 |
측정항목 (measurement item)
Real number (ℝ)
Alerts: High correlation
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3248 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7675749 |
---|---|
Coefficient of variation (CV) | 0.5197519 |
Kurtosis | -1.2256539 |
Mean | 5.3248 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.2042916 |
Sum | 53248 |
Variance | 7.6594709 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1 | 1713 | |
9 | 1686 | |
8 | 1665 | |
6 | 1665 | |
3 | 1652 | |
5 | 1619 |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1713 | |
3 | 1652 | |
5 | 1619 | |
6 | 1665 | |
8 | 1665 | |
9 | 1686 |
Maximum 10 values
Value | Count | Frequency (%) |
9 | 1686 | |
8 | 1665 | |
6 | 1665 | |
5 | 1619 | |
3 | 1652 | |
1 | 1713 |
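측정항목 takes only six codes, and the high correlation with 평균값 may simply reflect that each code is recorded on a different scale. The sketch below groups the ten rows of the first sample table in this report by item code and takes the median per code. Ten rows are far too few for firm conclusions, and the different-scale explanation is an assumption rather than something the report states.

```python
from collections import defaultdict
from statistics import median

# (측정항목, 평균값) pairs copied from the first sample table of this report.
sample = [
    (1, 0.011), (3, 0.056), (3, 0.024), (8, 74.0), (1, 0.008),
    (5, 0.9), (6, 0.001), (6, 0.011), (8, 38.0), (9, 47.0),
]

# Collect the values observed for each item code.
by_item = defaultdict(list)
for item, value in sample:
    by_item[item].append(value)

# Median value per item code, in ascending code order.
medians = {item: median(values) for item, values in sorted(by_item.items())}
for item, med in medians.items():
    print(item, med)
```

In these rows, codes 1, 3 and 6 stay below 0.1 while codes 8 and 9 are in the tens, so per-item summaries are likely more informative than the pooled statistics above.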
평균값 (average value)
Real number (ℝ)
Alerts: High correlation, Skewed
Distinct | 294 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.8392487 |
Minimum | -9999 |
---|---|
Maximum | 2310 |
Zeros | 24 |
Zeros (%) | 0.2% |
Negative | 17 |
Negative (%) | 0.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.011 |
median | 0.078 |
Q3 | 24 |
95-th percentile | 76 |
Maximum | 2310 |
Range | 12309 |
Interquartile range (IQR) | 23.989 |
Descriptive statistics
Standard deviation | 285.88855 |
---|---|
Coefficient of variation (CV) | 36.468871 |
Kurtosis | 1199.149 |
Mean | 7.8392487 |
Median Absolute Deviation (MAD) | 0.077 |
Skewness | -34.27084 |
Sum | 78392.487 |
Variance | 81732.264 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0.006 | 324 | 3.2% |
0.005 | 287 | 2.9% |
0.002 | 281 | 2.8% |
0.007 | 279 | 2.8% |
0.003 | 267 | 2.7% |
0.004 | 250 | 2.5% |
0.008 | 228 | 2.3% |
0.009 | 198 | 2.0% |
0.4 | 185 | 1.8% |
0.5 | 185 | 1.8% |
Other values (284) | 7516 |
Minimum 10 values
Value | Count | Frequency (%) |
-9999.0 | 8 | 0.1% |
-999.9 | 2 | < 0.1% |
-9.999 | 7 | 0.1% |
0.0 | 24 | 0.2% |
0.001 | 132 | |
0.002 | 281 | |
0.003 | 267 | |
0.004 | 250 | |
0.005 | 287 | |
0.006 | 324 |
Maximum 10 values
Value | Count | Frequency (%) |
2310.0 | 1 | |
475.0 | 1 | |
379.0 | 1 | |
322.0 | 1 | |
238.0 | 1 | |
181.0 | 1 | |
178.0 | 1 | |
177.0 | 1 | |
173.0 | 1 | |
171.0 | 1 |
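The three negative values (-9999, -999.9 and -9.999) together account for all 17 negative observations and look like "no data" markers rather than measurements. That reading is an assumption. The sketch below uses only figures from the tables above to show how much the mean shifts once those rows are set aside.

```python
# Figures copied from the 평균값 tables in this report.
n_total = 10_000
reported_sum = 78392.487

# Negative values and their counts from the minimum-values table.
# Treating them as missing-data markers is an assumption.
sentinels = {-9999.0: 8, -999.9: 2, -9.999: 7}

n_sentinel = sum(sentinels.values())
sentinel_sum = sum(value * count for value, count in sentinels.items())

# Mean of the remaining observations after removing the suspected markers.
clean_n = n_total - n_sentinel
clean_mean = (reported_sum - sentinel_sum) / clean_n

print(n_sentinel)
print(round(clean_mean, 2))
```

The mean roughly doubles, from about 7.84 to about 16.07, which suggests the reported skewness of -34.27 is driven largely by these few rows.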
측정기 상태 (instrument status)
Real number (ℝ)
Alerts: Zeros
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.0863 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9808 |
Zeros (%) | 98.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.79618823 |
---|---|
Coefficient of variation (CV) | 9.2258196 |
Kurtosis | 109.32434 |
Mean | 0.0863 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 10.398174 |
Sum | 863 |
Variance | 0.6339157 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 9808 | |
1 | 74 | 0.7% |
9 | 55 | 0.5% |
2 | 31 | 0.3% |
8 | 26 | 0.3% |
4 | 6 | 0.1% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 9808 | |
1 | 74 | 0.7% |
2 | 31 | 0.3% |
4 | 6 | 0.1% |
8 | 26 | 0.3% |
9 | 55 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
9 | 55 | 0.5% |
8 | 26 | 0.3% |
4 | 6 | 0.1% |
2 | 31 | 0.3% |
1 | 74 | 0.7% |
0 | 9808 |
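Because 측정기 상태 has only six distinct values, its summary statistics can be rebuilt from the frequency table alone. The sketch below does that as a consistency check. Using n - 1 as the variance denominator is an assumption about how the report computes it, and it reproduces the reported figure.

```python
from math import sqrt

# Value -> count, copied from the 측정기 상태 frequency table.
counts = {0: 9808, 1: 74, 2: 31, 4: 6, 8: 26, 9: 55}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance (n - 1 in the denominator, an assumed convention).
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = sqrt(variance)

# Rows whose status differs from 0.
nonzero = n - counts[0]

print(n, total, mean)
print(round(variance, 7), round(std, 8))
print(nonzero)
```

The results agree with the reported Sum (863), Mean (0.0863), Variance (0.6339157) and Standard deviation (0.79618823), and they show that 192 rows carry a non-zero status.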
국가 기준초과 구분 (national standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9996 |
---|---|
1 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9996 | |
1 | 4 | < 0.1% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9996 |
---|---|
1 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9996 | |
1 | 4 | < 0.1% |
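Both exceedance flags split 9996 to 4, and the 99.5% imbalance figure in the alerts is consistent with a score of one minus the normalized Shannon entropy of the class frequencies. That this is the exact formula behind the alert is an assumption. The sketch below only shows that it reproduces the reported number.

```python
from math import log2

# Class counts copied from the Common Values table.
counts = {"0": 9996, "1": 4}

n = sum(counts.values())
probs = [count / n for count in counts.values()]

# Shannon entropy in bits, skipping empty classes.
entropy_bits = -sum(p * log2(p) for p in probs if p > 0)

# 0 means perfectly balanced classes, 1 means a single class.
imbalance = 1 - entropy_bits / log2(len(counts))
print(f"{imbalance:.1%}")
```

With only four positive rows, any model or correlation involving these flags rests on very little evidence.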
Correlations
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.000 | 0.105 | 0.045 | 0.045 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.033 | 0.159 | 0.038 | 0.038 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.040 | 0.135 | 0.054 | 0.054 |
평균값 | 0.000 | 0.033 | 0.040 | 1.000 | 0.576 | 0.000 | 0.000 |
측정기 상태 | 0.105 | 0.159 | 0.135 | 0.576 | 1.000 | 0.000 | 0.000 |
국가 기준초과 구분 | 0.045 | 0.038 | 0.054 | 0.000 | 0.000 | 1.000 | 0.981 |
지자체 기준초과 구분 | 0.045 | 0.038 | 0.054 | 0.000 | 0.000 | 0.981 | 1.000 |
Variable | 지자체 기준초과 구분 | 국가 기준초과 구분 |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.875 |
국가 기준초과 구분 | 0.875 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.004 | 0.004 | 0.021 | 0.030 | 0.034 | 0.034 |
측정소 코드 | 0.004 | 1.000 | -0.007 | 0.001 | 0.005 | 0.029 | 0.029 |
측정항목 | 0.004 | -0.007 | 1.000 | 0.683 | 0.072 | 0.039 | 0.039 |
평균값 | 0.021 | 0.001 | 0.683 | 1.000 | 0.060 | 0.000 | 0.000 |
측정기 상태 | 0.030 | 0.005 | 0.072 | 0.060 | 1.000 | 0.000 | 0.000 |
국가 기준초과 구분 | 0.034 | 0.029 | 0.039 | 0.000 | 0.000 | 1.000 | 0.875 |
지자체 기준초과 구분 | 0.034 | 0.029 | 0.039 | 0.000 | 0.000 | 0.875 | 1.000 |
First rows
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
57084 | 2009011620 | 115 | 1 | 0.011 | 0 | 0 | 0 |
22351 | 2009010705 | 101 | 3 | 0.056 | 0 | 0 | 0 |
205 | 2009010101 | 110 | 3 | 0.024 | 0 | 0 | 0 |
18748 | 2009010604 | 125 | 8 | 74.0 | 0 | 0 | 0 |
11934 | 2009010407 | 115 | 1 | 0.008 | 0 | 0 | 0 |
24398 | 2009010718 | 117 | 5 | 0.9 | 0 | 0 | 0 |
46629 | 2009011322 | 122 | 6 | 0.001 | 0 | 0 | 0 |
62421 | 2009011808 | 104 | 6 | 0.011 | 0 | 0 | 0 |
44362 | 2009011307 | 119 | 8 | 38.0 | 0 | 0 | 0 |
18587 | 2009010603 | 123 | 9 | 47.0 | 0 | 0 | 0 |
Last rows
Index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
4818 | 2009010208 | 104 | 1 | 0.006 | 0 | 0 | 0 |
32709 | 2009011002 | 102 | 6 | 0.024 | 0 | 0 | 0 |
26424 | 2009010808 | 105 | 1 | 0.006 | 0 | 0 | 0 |
50416 | 2009011500 | 103 | 8 | 68.0 | 0 | 0 | 0 |
57247 | 2009011621 | 117 | 3 | 0.081 | 0 | 0 | 0 |
15131 | 2009010504 | 122 | 9 | 31.0 | 0 | 0 | 0 |
25908 | 2009010804 | 119 | 1 | 0.006 | 0 | 0 | 0 |
22338 | 2009010704 | 124 | 1 | 0.017 | 0 | 0 | 0 |
18511 | 2009010603 | 111 | 3 | 0.049 | 0 | 0 | 0 |
45360 | 2009011314 | 111 | 1 | 0.006 | 0 | 0 | 0 |