Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
국가 기준초과 구분 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
측정항목 is highly overall correlated with 평균값 and 2 other fields | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
측정기 상태 is highly imbalanced (93.7%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (62.6%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (62.6%) | Imbalance |
평균값 is highly skewed (γ1 = -46.7294729) | Skewed |
Reproduction
Analysis started | 2024-04-27 12:02:46.093435 |
---|---|
Analysis finished | 2024-04-27 12:02:52.539130 |
Duration | 6.45 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 448 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.020011 × 109 |
Minimum | 2.0200101 × 109 |
---|---|
Maximum | 2.0200119 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0200101 × 109 |
---|---|
5-th percentile | 2.0200101 × 109 |
Q1 | 2.0200105 × 109 |
median | 2.020011 × 109 |
Q3 | 2.0200114 × 109 |
95-th percentile | 2.0200118 × 109 |
Maximum | 2.0200119 × 109 |
Range | 1815 |
Interquartile range (IQR) | 906 |
Descriptive statistics
Standard deviation | 536.69788 |
---|---|
Coefficient of variation (CV) | 2.6569057 × 10-7 |
Kurtosis | -1.1861743 |
Mean | 2.020011 × 109 |
Median Absolute Deviation (MAD) | 487 |
Skewness | 0.025820608 |
Sum | 2.020011 × 1013 |
Variance | 288044.62 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2020010609 | 36 | 0.4% |
2020010306 | 36 | 0.4% |
2020011709 | 35 | 0.4% |
2020011816 | 35 | 0.4% |
2020011116 | 34 | 0.3% |
2020010900 | 33 | 0.3% |
2020011316 | 33 | 0.3% |
2020011100 | 32 | 0.3% |
2020010507 | 32 | 0.3% |
2020011407 | 32 | 0.3% |
Other values (438) | 9662 |
Value | Count | Frequency (%) |
2020010100 | 16 | |
2020010101 | 21 | |
2020010102 | 22 | |
2020010103 | 22 | |
2020010104 | 26 | |
2020010105 | 27 | |
2020010106 | 22 | |
2020010107 | 30 | |
2020010108 | 20 | |
2020010109 | 23 |
Value | Count | Frequency (%) |
2020011915 | 9 | 0.1% |
2020011914 | 21 | |
2020011913 | 22 | |
2020011912 | 22 | |
2020011911 | 24 | |
2020011910 | 22 | |
2020011909 | 18 | |
2020011908 | 24 | |
2020011907 | 22 | |
2020011906 | 22 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.055 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2534568 |
---|---|
Coefficient of variation (CV) | 0.064158656 |
Kurtosis | -1.2174689 |
Mean | 113.055 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.011458139 |
Sum | 1130550 |
Variance | 52.612636 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
122 | 431 | 4.3% |
119 | 426 | 4.3% |
101 | 419 | 4.2% |
113 | 418 | 4.2% |
107 | 415 | 4.2% |
124 | 412 | 4.1% |
125 | 410 | 4.1% |
106 | 404 | 4.0% |
111 | 404 | 4.0% |
118 | 404 | 4.0% |
Other values (15) | 5857 |
Value | Count | Frequency (%) |
101 | 419 | |
102 | 384 | |
103 | 401 | |
104 | 403 | |
105 | 395 | |
106 | 404 | |
107 | 415 | |
108 | 398 | |
109 | 349 | |
110 | 393 |
Value | Count | Frequency (%) |
125 | 410 | |
124 | 412 | |
123 | 399 | |
122 | 431 | |
121 | 392 | |
120 | 403 | |
119 | 426 | |
118 | 404 | |
117 | 387 | |
116 | 370 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3459 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7437576 |
---|---|
Coefficient of variation (CV) | 0.51324522 |
Kurtosis | -1.1976167 |
Mean | 5.3459 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.21159593 |
Sum | 53459 |
Variance | 7.528206 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 1712 | |
9 | 1683 | |
5 | 1657 | |
3 | 1653 | |
1 | 1652 | |
8 | 1643 |
Value | Count | Frequency (%) |
1 | 1652 | |
3 | 1653 | |
5 | 1657 | |
6 | 1712 | |
8 | 1643 | |
9 | 1683 |
Value | Count | Frequency (%) |
9 | 1683 | |
8 | 1643 | |
6 | 1712 | |
5 | 1657 | |
3 | 1653 | |
1 | 1652 |
평균값
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 219 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.210505 |
Minimum | -9999 |
---|---|
Maximum | 1985 |
Zeros | 26 |
Zeros (%) | 0.3% |
Negative | 4 |
Negative (%) | < 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.009 |
median | 0.066 |
Q3 | 26 |
95-th percentile | 58 |
Maximum | 1985 |
Range | 11984 |
Interquartile range (IQR) | 25.991 |
Descriptive statistics
Standard deviation | 204.49678 |
---|---|
Coefficient of variation (CV) | 20.028077 |
Kurtosis | 2295.603 |
Mean | 10.210505 |
Median Absolute Deviation (MAD) | 0.066 |
Skewness | -46.729473 |
Sum | 102105.05 |
Variance | 41818.933 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.003 | 715 | 7.1% |
0.004 | 661 | 6.6% |
0.002 | 390 | 3.9% |
0.005 | 298 | 3.0% |
0.6 | 282 | 2.8% |
0.7 | 256 | 2.6% |
0.8 | 241 | 2.4% |
0.5 | 198 | 2.0% |
0.9 | 181 | 1.8% |
0.006 | 156 | 1.6% |
Other values (209) | 6622 |
Value | Count | Frequency (%) |
-9999.0 | 4 | < 0.1% |
0.0 | 26 | 0.3% |
0.001 | 49 | 0.5% |
0.002 | 390 | |
0.003 | 715 | |
0.004 | 661 | |
0.005 | 298 | |
0.006 | 156 | 1.6% |
0.007 | 104 | 1.0% |
0.008 | 80 | 0.8% |
Value | Count | Frequency (%) |
1985.0 | 1 | < 0.1% |
985.0 | 7 | |
906.0 | 1 | < 0.1% |
699.0 | 1 | < 0.1% |
683.0 | 1 | < 0.1% |
677.0 | 1 | < 0.1% |
220.0 | 1 | < 0.1% |
140.0 | 1 | < 0.1% |
133.0 | 1 | < 0.1% |
129.0 | 1 | < 0.1% |
측정기 상태
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
9 | 59 |
1 | 53 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 9 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9888 | |
9 | 59 | 0.6% |
1 | 53 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9888 | |
9 | 59 | 0.6% |
1 | 53 | 0.5% |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 722 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9278 | |
1 | 722 | 7.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9278 | |
1 | 722 | 7.2% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 723 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9277 | |
1 | 723 | 7.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9277 | |
1 | 723 | 7.2% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.021 | 0.096 | 0.212 | 0.211 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.020 | 0.100 | 0.108 | 0.108 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.051 | 0.121 | 0.788 | 0.787 |
평균값 | 0.021 | 0.020 | 0.051 | 1.000 | 0.174 | 0.157 | 0.157 |
측정기 상태 | 0.096 | 0.100 | 0.121 | 0.174 | 1.000 | 0.029 | 0.033 |
국가 기준초과 구분 | 0.212 | 0.108 | 0.788 | 0.157 | 0.029 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.211 | 0.108 | 0.787 | 0.157 | 0.033 | 1.000 | 1.000 |
지자체 기준초과 구분 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|
지자체 기준초과 구분 | 1.000 | 0.054 | 0.999 |
측정기 상태 | 0.054 | 1.000 | 0.049 |
국가 기준초과 구분 | 0.999 | 0.049 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.010 | -0.000 | 0.009 | 0.057 | 0.162 | 0.162 |
측정소 코드 | -0.010 | 1.000 | -0.001 | 0.010 | 0.059 | 0.083 | 0.083 |
측정항목 | -0.000 | -0.001 | 1.000 | 0.737 | 0.050 | 0.592 | 0.592 |
평균값 | 0.009 | 0.010 | 0.737 | 1.000 | 0.278 | 0.107 | 0.107 |
측정기 상태 | 0.057 | 0.059 | 0.050 | 0.278 | 1.000 | 0.049 | 0.054 |
국가 기준초과 구분 | 0.162 | 0.083 | 0.592 | 0.107 | 0.049 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.162 | 0.083 | 0.592 | 0.107 | 0.054 | 0.999 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
4485 | 2020010205 | 123 | 6 | 0.002 | 0 | 0 | 0 |
39919 | 2020011202 | 104 | 3 | 0.0 | 9 | 0 | 0 |
36033 | 2020011100 | 106 | 6 | 0.015 | 0 | 0 | 0 |
22858 | 2020010708 | 110 | 8 | 11.0 | 0 | 0 | 0 |
37298 | 2020011108 | 117 | 5 | 1.1 | 0 | 0 | 0 |
41462 | 2020011212 | 111 | 5 | 0.8 | 0 | 0 | 0 |
7084 | 2020010223 | 106 | 8 | 64.0 | 0 | 0 | 0 |
40251 | 2020011204 | 109 | 6 | 0.023 | 0 | 0 | 0 |
31023 | 2020010914 | 121 | 6 | 0.027 | 0 | 0 | 0 |
45547 | 2020011315 | 117 | 3 | 0.012 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
24469 | 2020010719 | 104 | 3 | 0.033 | 0 | 0 | 0 |
57075 | 2020011620 | 113 | 6 | 0.032 | 0 | 0 | 0 |
35203 | 2020011018 | 118 | 3 | 0.059 | 0 | 0 | 0 |
8934 | 2020010311 | 115 | 1 | 0.007 | 0 | 0 | 0 |
2670 | 2020010117 | 121 | 1 | 0.003 | 0 | 0 | 0 |
63622 | 2020011816 | 104 | 8 | 53.0 | 0 | 0 | 0 |
21662 | 2020010700 | 111 | 5 | 0.6 | 0 | 0 | 0 |
21409 | 2020010622 | 119 | 3 | 0.036 | 0 | 0 | 0 |
55755 | 2020011611 | 118 | 6 | 0.017 | 0 | 0 | 0 |
52979 | 2020011517 | 105 | 9 | 18.0 | 0 | 0 | 0 |