Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 3 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
국가 기준초과 구분 is highly overall correlated with 평균값 and 1 other fields | High correlation |
지자체 기준초과 구분 is highly overall correlated with 평균값 and 1 other fields | High correlation |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 and 2 other fields | High correlation |
국가 기준초과 구분 is highly imbalanced (82.8%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (82.8%) | Imbalance |
측정기 상태 has 9852 (98.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 07:01:05.078406 |
---|---|
Analysis finished | 2024-05-11 07:01:12.182184 |
Duration | 7.1 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 468 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.016011 × 109 |
Minimum | 2.0160101 × 109 |
---|---|
Maximum | 2.016012 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0160101 × 109 |
---|---|
5-th percentile | 2.0160102 × 109 |
Q1 | 2.0160105 × 109 |
median | 2.016011 × 109 |
Q3 | 2.0160115 × 109 |
95-th percentile | 2.0160119 × 109 |
Maximum | 2.016012 × 109 |
Range | 1911 |
Interquartile range (IQR) | 990 |
Descriptive statistics
Standard deviation | 558.88052 |
---|---|
Coefficient of variation (CV) | 2.7722096 × 10-7 |
Kurtosis | -1.186872 |
Mean | 2.016011 × 109 |
Median Absolute Deviation (MAD) | 495 |
Skewness | 0.0099920943 |
Sum | 2.016011 × 1013 |
Variance | 312347.44 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2016011223 | 33 | 0.3% |
2016011804 | 32 | 0.3% |
2016010120 | 31 | 0.3% |
2016011613 | 31 | 0.3% |
2016010513 | 31 | 0.3% |
2016011101 | 31 | 0.3% |
2016010518 | 30 | 0.3% |
2016011320 | 30 | 0.3% |
2016011007 | 30 | 0.3% |
2016011206 | 30 | 0.3% |
Other values (458) | 9691 |
Value | Count | Frequency (%) |
2016010100 | 23 | |
2016010101 | 16 | |
2016010102 | 24 | |
2016010103 | 12 | |
2016010104 | 19 | |
2016010105 | 20 | |
2016010106 | 23 | |
2016010107 | 22 | |
2016010108 | 24 | |
2016010109 | 18 |
Value | Count | Frequency (%) |
2016012011 | 23 | |
2016012010 | 26 | |
2016012009 | 20 | |
2016012008 | 22 | |
2016012007 | 19 | |
2016012006 | 17 | |
2016012005 | 23 | |
2016012004 | 15 | |
2016012003 | 26 | |
2016012002 | 21 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.9305 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2074451 |
---|---|
Coefficient of variation (CV) | 0.063821953 |
Kurtosis | -1.2025787 |
Mean | 112.9305 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.0050087726 |
Sum | 1129305 |
Variance | 51.947264 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
101 | 434 | 4.3% |
105 | 432 | 4.3% |
109 | 420 | 4.2% |
121 | 420 | 4.2% |
106 | 420 | 4.2% |
119 | 414 | 4.1% |
111 | 413 | 4.1% |
113 | 403 | 4.0% |
114 | 403 | 4.0% |
124 | 402 | 4.0% |
Other values (15) | 5839 |
Value | Count | Frequency (%) |
101 | 434 | |
102 | 399 | |
103 | 392 | |
104 | 360 | |
105 | 432 | |
106 | 420 | |
107 | 390 | |
108 | 391 | |
109 | 420 | |
110 | 398 |
Value | Count | Frequency (%) |
125 | 373 | |
124 | 402 | |
123 | 392 | |
122 | 394 | |
121 | 420 | |
120 | 395 | |
119 | 414 | |
118 | 375 | |
117 | 401 | |
116 | 395 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3075 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7396157 |
---|---|
Coefficient of variation (CV) | 0.51617819 |
Kurtosis | -1.2010793 |
Mean | 5.3075 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.19155 |
Sum | 53075 |
Variance | 7.5054943 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 1721 | |
3 | 1676 | |
1 | 1671 | |
6 | 1660 | |
8 | 1637 | |
9 | 1635 |
Value | Count | Frequency (%) |
1 | 1671 | |
3 | 1676 | |
5 | 1721 | |
6 | 1660 | |
8 | 1637 | |
9 | 1635 |
Value | Count | Frequency (%) |
9 | 1635 | |
8 | 1637 | |
6 | 1660 | |
5 | 1721 | |
3 | 1676 | |
1 | 1671 |
평균값
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 420 |
---|---|
Distinct (%) | 4.2% |
Missing | 3 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.810341 |
Minimum | -248 |
---|---|
Maximum | 217 |
Zeros | 41 |
Zeros (%) | 0.4% |
Negative | 4 |
Negative (%) | < 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -248 |
---|---|
5-th percentile | 0.004 |
Q1 | 0.011 |
median | 0.067 |
Q3 | 21 |
95-th percentile | 61 |
Maximum | 217 |
Range | 465 |
Interquartile range (IQR) | 20.989 |
Descriptive statistics
Standard deviation | 22.902571 |
---|---|
Coefficient of variation (CV) | 1.7878189 |
Kurtosis | 7.7327693 |
Mean | 12.810341 |
Median Absolute Deviation (MAD) | 0.066 |
Skewness | 2.0339361 |
Sum | 128064.98 |
Variance | 524.52774 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.006 | 524 | 5.2% |
0.005 | 519 | 5.2% |
0.007 | 390 | 3.9% |
0.004 | 260 | 2.6% |
0.008 | 195 | 1.9% |
0.002 | 177 | 1.8% |
0.003 | 153 | 1.5% |
0.018 | 100 | 1.0% |
0.02 | 97 | 1.0% |
0.009 | 93 | 0.9% |
Other values (410) | 7489 |
Value | Count | Frequency (%) |
-248.0 | 1 | < 0.1% |
-4.0 | 1 | < 0.1% |
-1.0 | 2 | < 0.1% |
0.0 | 41 | 0.4% |
0.001 | 50 | 0.5% |
0.002 | 177 | 1.8% |
0.003 | 153 | 1.5% |
0.004 | 260 | |
0.005 | 519 | |
0.006 | 524 |
Value | Count | Frequency (%) |
217.0 | 1 | |
196.0 | 1 | |
193.0 | 1 | |
184.0 | 1 | |
177.0 | 1 | |
173.0 | 1 | |
163.0 | 1 | |
153.0 | 2 | |
148.0 | 1 | |
147.0 | 1 |
측정기 상태
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.0547 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9852 |
Zeros (%) | 98.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.61395896 |
---|---|
Coefficient of variation (CV) | 11.224113 |
Kurtosis | 191.21127 |
Mean | 0.0547 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 13.583548 |
Sum | 547 |
Variance | 0.3769456 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9852 | |
1 | 69 | 0.7% |
9 | 42 | 0.4% |
2 | 26 | 0.3% |
4 | 10 | 0.1% |
8 | 1 | < 0.1% |
Value | Count | Frequency (%) |
0 | 9852 | |
1 | 69 | 0.7% |
2 | 26 | 0.3% |
4 | 10 | 0.1% |
8 | 1 | < 0.1% |
9 | 42 | 0.4% |
Value | Count | Frequency (%) |
9 | 42 | 0.4% |
8 | 1 | < 0.1% |
4 | 10 | 0.1% |
2 | 26 | 0.3% |
1 | 69 | 0.7% |
0 | 9852 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 256 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9744 | |
1 | 256 | 2.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9744 | |
1 | 256 | 2.6% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 256 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9744 | |
1 | 256 | 2.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9744 | |
1 | 256 | 2.6% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.336 | 0.101 | 0.334 | 0.334 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.051 | 0.102 | 0.048 | 0.048 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.506 | 0.127 | 0.385 | 0.385 |
평균값 | 0.336 | 0.051 | 0.506 | 1.000 | 0.161 | 0.429 | 0.429 |
측정기 상태 | 0.101 | 0.102 | 0.127 | 0.161 | 1.000 | 0.229 | 0.229 |
국가 기준초과 구분 | 0.334 | 0.048 | 0.385 | 0.429 | 0.229 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.334 | 0.048 | 0.385 | 0.429 | 0.229 | 1.000 | 1.000 |
국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|
국가 기준초과 구분 | 1.000 | 0.998 |
지자체 기준초과 구분 | 0.998 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.011 | 0.018 | 0.002 | -0.046 | 0.254 | 0.254 |
측정소 코드 | -0.011 | 1.000 | 0.004 | 0.009 | 0.011 | 0.037 | 0.037 |
측정항목 | 0.018 | 0.004 | 1.000 | 0.722 | 0.061 | 0.277 | 0.277 |
평균값 | 0.002 | 0.009 | 0.722 | 1.000 | -0.023 | 0.522 | 0.522 |
측정기 상태 | -0.046 | 0.011 | 0.061 | -0.023 | 1.000 | 0.164 | 0.164 |
국가 기준초과 구분 | 0.254 | 0.037 | 0.277 | 0.522 | 0.164 | 1.000 | 0.998 |
지자체 기준초과 구분 | 0.254 | 0.037 | 0.277 | 0.522 | 0.164 | 0.998 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
51108 | 2016011504 | 119 | 1 | 0.006 | 0 | 0 | 0 |
44758 | 2016011310 | 110 | 8 | 43.0 | 0 | 0 | 0 |
39357 | 2016011122 | 110 | 6 | 0.019 | 0 | 0 | 0 |
64793 | 2016011823 | 124 | 9 | 8.0 | 0 | 0 | 0 |
30306 | 2016010910 | 102 | 1 | 0.008 | 0 | 0 | 0 |
15803 | 2016010509 | 109 | 9 | 9.0 | 0 | 0 | 0 |
49625 | 2016011418 | 121 | 9 | 31.0 | 0 | 0 | 0 |
3595 | 2016010123 | 125 | 3 | 0.044 | 0 | 0 | 0 |
26463 | 2016010808 | 111 | 6 | 0.003 | 0 | 0 | 0 |
58947 | 2016011708 | 125 | 6 | 0.005 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
21939 | 2016010702 | 107 | 6 | 0.023 | 0 | 0 | 0 |
58242 | 2016011704 | 108 | 1 | 0.003 | 0 | 0 | 0 |
65744 | 2016011906 | 108 | 5 | 0.37 | 0 | 0 | 0 |
55373 | 2016011609 | 104 | 9 | 23.0 | 0 | 0 | 0 |
14604 | 2016010501 | 110 | 1 | 0.005 | 0 | 0 | 0 |
36879 | 2016011105 | 122 | 6 | 0.018 | 0 | 0 | 0 |
34812 | 2016011016 | 103 | 1 | 0.005 | 0 | 0 | 0 |
26773 | 2016010810 | 113 | 3 | 0.031 | 0 | 0 | 0 |
9136 | 2016010312 | 123 | 8 | 113.0 | 0 | 1 | 1 |
30682 | 2016010912 | 114 | 8 | 32.0 | 0 | 0 | 0 |