Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
측정기 상태 is highly imbalanced (89.0%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (74.0%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (74.0%) | Imbalance |
평균값 is highly skewed (γ1 = -43.28980571) | Skewed |
Reproduction
Analysis started | 2024-04-27 12:05:33.218146 |
---|---|
Analysis finished | 2024-04-27 12:05:39.720766 |
Duration | 6.5 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 498 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0080111 × 109 |
Minimum | 2.0080101 × 109 |
---|---|
Maximum | 2.0080121 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0080101 × 109 |
---|---|
5-th percentile | 2.0080102 × 109 |
Q1 | 2.0080106 × 109 |
median | 2.0080111 × 109 |
Q3 | 2.0080116 × 109 |
95-th percentile | 2.008012 × 109 |
Maximum | 2.0080121 × 109 |
Range | 2017 |
Interquartile range (IQR) | 1008.25 |
Descriptive statistics
Standard deviation | 597.69576 |
---|---|
Coefficient of variation (CV) | 2.9765561 × 10-7 |
Kurtosis | -1.2022866 |
Mean | 2.0080111 × 109 |
Median Absolute Deviation (MAD) | 505 |
Skewness | 0.016015682 |
Sum | 2.0080111 × 1013 |
Variance | 357240.23 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2008011306 | 34 | 0.3% |
2008011804 | 34 | 0.3% |
2008011220 | 32 | 0.3% |
2008010302 | 30 | 0.3% |
2008011811 | 30 | 0.3% |
2008010902 | 30 | 0.3% |
2008011809 | 29 | 0.3% |
2008010322 | 29 | 0.3% |
2008010611 | 29 | 0.3% |
2008010402 | 29 | 0.3% |
Other values (488) | 9694 |
Value | Count | Frequency (%) |
2008010100 | 19 | |
2008010101 | 22 | |
2008010102 | 26 | |
2008010103 | 11 | 0.1% |
2008010104 | 28 | |
2008010105 | 22 | |
2008010106 | 17 | |
2008010107 | 23 | |
2008010108 | 18 | |
2008010109 | 19 |
Value | Count | Frequency (%) |
2008012117 | 7 | 0.1% |
2008012116 | 17 | |
2008012115 | 19 | |
2008012114 | 22 | |
2008012113 | 19 | |
2008012112 | 19 | |
2008012111 | 16 | |
2008012110 | 21 | |
2008012109 | 22 | |
2008012108 | 18 |
측정소 코드
Real number (ℝ)
Distinct | 24 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.5929 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 112 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.0629284 |
---|---|
Coefficient of variation (CV) | 0.062729785 |
Kurtosis | -1.1420043 |
Mean | 112.5929 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.04893582 |
Sum | 1125929 |
Variance | 49.884958 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
118 | 451 | 4.5% |
119 | 439 | 4.4% |
114 | 435 | 4.3% |
101 | 433 | 4.3% |
107 | 432 | 4.3% |
121 | 431 | 4.3% |
111 | 431 | 4.3% |
103 | 430 | 4.3% |
115 | 429 | 4.3% |
112 | 428 | 4.3% |
Other values (14) | 5661 |
Value | Count | Frequency (%) |
101 | 433 | |
102 | 404 | |
103 | 430 | |
104 | 417 | |
105 | 405 | |
106 | 395 | |
107 | 432 | |
108 | 388 | |
109 | 426 | |
110 | 421 |
Value | Count | Frequency (%) |
125 | 395 | |
124 | 427 | |
122 | 428 | |
121 | 431 | |
120 | 393 | |
119 | 439 | |
118 | 451 | |
117 | 409 | |
116 | 370 | |
115 | 429 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3429 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7501411 |
---|---|
Coefficient of variation (CV) | 0.51472816 |
Kurtosis | -1.2036877 |
Mean | 5.3429 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.20849362 |
Sum | 53429 |
Variance | 7.5632759 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1699 | |
6 | 1687 | |
5 | 1666 | |
1 | 1664 | |
3 | 1650 | |
8 | 1634 |
Value | Count | Frequency (%) |
1 | 1664 | |
3 | 1650 | |
5 | 1666 | |
6 | 1687 | |
8 | 1634 | |
9 | 1699 |
Value | Count | Frequency (%) |
9 | 1699 | |
8 | 1634 | |
6 | 1687 | |
5 | 1666 | |
3 | 1650 | |
1 | 1664 |
평균값
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 357 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.445015 |
Minimum | -9999 |
---|---|
Maximum | 288 |
Zeros | 38 |
Zeros (%) | 0.4% |
Negative | 11 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.002 |
Q1 | 0.012 |
median | 0.099 |
Q3 | 22 |
95-th percentile | 81 |
Maximum | 288 |
Range | 10287 |
Interquartile range (IQR) | 21.988 |
Descriptive statistics
Standard deviation | 226.28532 |
---|---|
Coefficient of variation (CV) | 19.771517 |
Kurtosis | 1913.3001 |
Mean | 11.445015 |
Median Absolute Deviation (MAD) | 0.099 |
Skewness | -43.289806 |
Sum | 114450.15 |
Variance | 51205.046 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.002 | 303 | 3.0% |
0.007 | 268 | 2.7% |
0.006 | 249 | 2.5% |
0.001 | 224 | 2.2% |
0.008 | 215 | 2.1% |
0.005 | 212 | 2.1% |
0.6 | 203 | 2.0% |
0.5 | 199 | 2.0% |
0.01 | 186 | 1.9% |
0.009 | 185 | 1.8% |
Other values (347) | 7756 |
Value | Count | Frequency (%) |
-9999.0 | 5 | 0.1% |
-9.999 | 6 | 0.1% |
0.0 | 38 | 0.4% |
0.001 | 224 | |
0.002 | 303 | |
0.003 | 185 | |
0.004 | 180 | |
0.005 | 212 | |
0.006 | 249 | |
0.007 | 268 |
Value | Count | Frequency (%) |
288.0 | 1 | < 0.1% |
279.0 | 1 | < 0.1% |
269.0 | 1 | < 0.1% |
255.0 | 2 | |
250.0 | 1 | < 0.1% |
247.0 | 3 | |
245.0 | 1 | < 0.1% |
244.0 | 1 | < 0.1% |
242.0 | 1 | < 0.1% |
237.0 | 1 | < 0.1% |
측정기 상태
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
8 | 135 |
9 | 84 |
1 | 67 |
2 | 24 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9690 | |
8 | 135 | 1.4% |
9 | 84 | 0.8% |
1 | 67 | 0.7% |
2 | 24 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9690 | |
8 | 135 | 1.4% |
9 | 84 | 0.8% |
1 | 67 | 0.7% |
2 | 24 | 0.2% |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 440 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9560 | |
1 | 440 | 4.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9560 | |
1 | 440 | 4.4% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 440 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9560 | |
1 | 440 | 4.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9560 | |
1 | 440 | 4.4% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | NaN | 0.083 | 0.475 | 0.475 |
측정소 코드 | 0.000 | 1.000 | 0.000 | NaN | 0.360 | 0.000 | 0.000 |
측정항목 | 0.000 | 0.000 | 1.000 | NaN | 0.144 | 0.415 | 0.415 |
평균값 | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN |
측정기 상태 | 0.083 | 0.360 | 0.144 | NaN | 1.000 | 0.095 | 0.095 |
국가 기준초과 구분 | 0.475 | 0.000 | 0.415 | NaN | 0.095 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.475 | 0.000 | 0.415 | NaN | 0.095 | 1.000 | 1.000 |
지자체 기준초과 구분 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|
지자체 기준초과 구분 | 1.000 | 0.116 | 0.999 |
측정기 상태 | 0.116 | 1.000 | 0.116 |
국가 기준초과 구분 | 0.999 | 0.116 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.013 | -0.001 | -0.033 | 0.035 | 0.365 | 0.365 |
측정소 코드 | -0.013 | 1.000 | -0.005 | 0.003 | 0.157 | 0.000 | 0.000 |
측정항목 | -0.001 | -0.005 | 1.000 | 0.661 | 0.097 | 0.299 | 0.299 |
평균값 | -0.033 | 0.003 | 0.661 | 1.000 | 0.156 | 0.000 | 0.000 |
측정기 상태 | 0.035 | 0.157 | 0.097 | 0.156 | 1.000 | 0.116 | 0.116 |
국가 기준초과 구분 | 0.365 | 0.000 | 0.299 | 0.000 | 0.116 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.365 | 0.000 | 0.299 | 0.000 | 0.116 | 0.999 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
32555 | 2008011010 | 102 | 9 | 28.0 | 0 | 0 | 0 |
41103 | 2008011221 | 111 | 6 | 0.027 | 0 | 0 | 0 |
5953 | 2008010217 | 109 | 3 | 0.042 | 0 | 0 | 0 |
27779 | 2008010900 | 122 | 9 | 40.0 | 0 | 0 | 0 |
47669 | 2008011419 | 101 | 9 | 15.0 | 0 | 0 | 0 |
54961 | 2008011621 | 117 | 3 | 0.026 | 0 | 0 | 0 |
47043 | 2008011414 | 117 | 6 | 0.02 | 0 | 0 | 0 |
48963 | 2008011504 | 101 | 6 | 0.003 | 0 | 0 | 0 |
46707 | 2008011412 | 109 | 6 | 0.01 | 0 | 0 | 0 |
62111 | 2008011823 | 108 | 9 | 96.0 | 8 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
42665 | 2008011308 | 107 | 9 | 7.0 | 0 | 0 | 0 |
50552 | 2008011515 | 102 | 5 | 0.8 | 0 | 0 | 0 |
5414 | 2008010213 | 115 | 5 | 0.5 | 0 | 0 | 0 |
37263 | 2008011118 | 119 | 6 | 0.001 | 0 | 0 | 0 |
36884 | 2008011116 | 104 | 5 | 0.6 | 0 | 0 | 0 |
19874 | 2008010618 | 101 | 5 | 2.1 | 0 | 0 | 0 |
36500 | 2008011113 | 112 | 5 | 0.6 | 0 | 0 | 0 |
45331 | 2008011402 | 120 | 3 | 0.051 | 0 | 0 | 0 |
53618 | 2008011612 | 109 | 5 | 0.7 | 0 | 0 | 0 |
37775 | 2008011122 | 108 | 9 | 9.0 | 8 | 0 | 0 |