Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 has constant value "" | Constant |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
국가 기준초과 구분 is highly imbalanced (99.5%) | Imbalance |
평균값 is highly skewed (γ1 = -22.51214637) | Skewed |
측정기 상태 has 9819 (98.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 03:57:20.646209 |
---|---|
Analysis finished | 2024-05-04 03:57:30.982943 |
Duration | 10.34 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 438 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.013011 × 109 |
Minimum | 2.0130101 × 109 |
---|---|
Maximum | 2.0130119 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0130101 × 109 |
---|---|
5-th percentile | 2.0130101 × 109 |
Q1 | 2.0130105 × 109 |
median | 2.013011 × 109 |
Q3 | 2.0130114 × 109 |
95-th percentile | 2.0130118 × 109 |
Maximum | 2.0130119 × 109 |
Range | 1805 |
Interquartile range (IQR) | 901 |
Descriptive statistics
Standard deviation | 524.79996 |
---|---|
Coefficient of variation (CV) | 2.6070398 × 10-7 |
Kurtosis | -1.1955111 |
Mean | 2.013011 × 109 |
Median Absolute Deviation (MAD) | 481 |
Skewness | 0.0089265445 |
Sum | 2.013011 × 1013 |
Variance | 275415 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2013010720 | 36 | 0.4% |
2013011121 | 35 | 0.4% |
2013010401 | 35 | 0.4% |
2013011813 | 34 | 0.3% |
2013011102 | 34 | 0.3% |
2013011002 | 34 | 0.3% |
2013010204 | 33 | 0.3% |
2013010819 | 33 | 0.3% |
2013011611 | 33 | 0.3% |
2013011311 | 32 | 0.3% |
Other values (428) | 9661 |
Value | Count | Frequency (%) |
2013010100 | 19 | |
2013010101 | 23 | |
2013010102 | 21 | |
2013010103 | 25 | |
2013010104 | 28 | |
2013010105 | 22 | |
2013010106 | 21 | |
2013010107 | 22 | |
2013010108 | 22 | |
2013010109 | 19 |
Value | Count | Frequency (%) |
2013011905 | 8 | 0.1% |
2013011904 | 23 | |
2013011903 | 21 | |
2013011902 | 17 | |
2013011901 | 24 | |
2013011900 | 28 | |
2013011823 | 21 | |
2013011822 | 29 | |
2013011821 | 18 | |
2013011820 | 20 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.0588 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2508758 |
---|---|
Coefficient of variation (CV) | 0.06413367 |
Kurtosis | -1.2230307 |
Mean | 113.0588 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.000579527 |
Sum | 1130588 |
Variance | 52.5752 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
124 | 440 | 4.4% |
105 | 422 | 4.2% |
107 | 414 | 4.1% |
108 | 414 | 4.1% |
121 | 413 | 4.1% |
122 | 411 | 4.1% |
125 | 411 | 4.1% |
120 | 407 | 4.1% |
103 | 407 | 4.1% |
117 | 407 | 4.1% |
Other values (15) | 5854 |
Value | Count | Frequency (%) |
101 | 391 | |
102 | 382 | |
103 | 407 | |
104 | 395 | |
105 | 422 | |
106 | 406 | |
107 | 414 | |
108 | 414 | |
109 | 379 | |
110 | 406 |
Value | Count | Frequency (%) |
125 | 411 | |
124 | 440 | |
123 | 391 | |
122 | 411 | |
121 | 413 | |
120 | 407 | |
119 | 384 | |
118 | 388 | |
117 | 407 | |
116 | 393 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3262 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7700832 |
---|---|
Coefficient of variation (CV) | 0.52008621 |
Kurtosis | -1.2282088 |
Mean | 5.3262 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.20432084 |
Sum | 53262 |
Variance | 7.6733609 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1717 | |
8 | 1688 | |
9 | 1685 | |
5 | 1664 | |
3 | 1640 | |
6 | 1606 |
Value | Count | Frequency (%) |
1 | 1717 | |
3 | 1640 | |
5 | 1664 | |
6 | 1606 | |
8 | 1688 | |
9 | 1685 |
Value | Count | Frequency (%) |
9 | 1685 | |
8 | 1688 | |
6 | 1606 | |
5 | 1664 | |
3 | 1640 | |
1 | 1717 |
평균값
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 325 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -0.1038639 |
Minimum | -9999 |
---|---|
Maximum | 375 |
Zeros | 46 |
Zeros (%) | 0.5% |
Negative | 40 |
Negative (%) | 0.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.012 |
median | 0.1 |
Q3 | 29 |
95-th percentile | 101 |
Maximum | 375 |
Range | 10374 |
Interquartile range (IQR) | 28.988 |
Descriptive statistics
Standard deviation | 438.6867 |
---|---|
Coefficient of variation (CV) | -4223.6687 |
Kurtosis | 510.17572 |
Mean | -0.1038639 |
Median Absolute Deviation (MAD) | 0.2 |
Skewness | -22.512146 |
Sum | -1038.639 |
Variance | 192446.02 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.007 | 321 | 3.2% |
0.008 | 278 | 2.8% |
0.006 | 268 | 2.7% |
0.002 | 237 | 2.4% |
0.009 | 225 | 2.2% |
0.005 | 201 | 2.0% |
0.01 | 185 | 1.8% |
0.7 | 167 | 1.7% |
0.003 | 161 | 1.6% |
0.001 | 145 | 1.5% |
Other values (315) | 7812 |
Value | Count | Frequency (%) |
-9999.0 | 19 | 0.2% |
-999.9 | 8 | 0.1% |
-9.999 | 13 | 0.1% |
0.0 | 46 | 0.5% |
0.001 | 145 | |
0.002 | 237 | |
0.003 | 161 | |
0.004 | 145 | |
0.005 | 201 | |
0.006 | 268 |
Value | Count | Frequency (%) |
375.0 | 1 | |
265.0 | 1 | |
237.0 | 1 | |
233.0 | 1 | |
210.0 | 1 | |
202.0 | 1 | |
197.0 | 1 | |
195.0 | 2 | |
192.0 | 1 | |
191.0 | 1 |
측정기 상태
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.0733 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9819 |
Zeros (%) | 98.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.71258535 |
---|---|
Coefficient of variation (CV) | 9.7214919 |
Kurtosis | 138.64793 |
Mean | 0.0733 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 11.595104 |
Sum | 733 |
Variance | 0.50777789 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9819 | |
2 | 65 | 0.7% |
1 | 52 | 0.5% |
9 | 51 | 0.5% |
8 | 10 | 0.1% |
4 | 3 | < 0.1% |
Value | Count | Frequency (%) |
0 | 9819 | |
1 | 52 | 0.5% |
2 | 65 | 0.7% |
4 | 3 | < 0.1% |
8 | 10 | 0.1% |
9 | 51 | 0.5% |
Value | Count | Frequency (%) |
9 | 51 | 0.5% |
8 | 10 | 0.1% |
4 | 3 | < 0.1% |
2 | 65 | 0.7% |
1 | 52 | 0.5% |
0 | 9819 |
국가 기준초과 구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9996 | |
1 | 4 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9996 | |
1 | 4 | < 0.1% |
지자체 기준초과 구분
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 10000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.013 | 0.059 | 0.074 | 0.000 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.076 | 0.110 | 0.023 |
측정항목 | 0.013 | 0.000 | 1.000 | 0.082 | 0.185 | 0.055 |
평균값 | 0.059 | 0.076 | 0.082 | 1.000 | 0.483 | 0.000 |
측정기 상태 | 0.074 | 0.110 | 0.185 | 0.483 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.000 | 0.023 | 0.055 | 0.000 | 0.000 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.009 | 0.006 | 0.088 | -0.004 | 0.000 |
측정소 코드 | -0.009 | 1.000 | 0.003 | 0.009 | -0.011 | 0.018 |
측정항목 | 0.006 | 0.003 | 1.000 | 0.700 | 0.088 | 0.039 |
평균값 | 0.088 | 0.009 | 0.700 | 1.000 | 0.008 | 0.000 |
측정기 상태 | -0.004 | -0.011 | 0.088 | 0.008 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.000 | 0.018 | 0.039 | 0.000 | 0.000 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
44952 | 2013011311 | 118 | 1 | 0.018 | 0 | 0 | 0 |
48576 | 2013011411 | 122 | 1 | 0.007 | 0 | 0 | 0 |
38539 | 2013011116 | 124 | 3 | 0.06 | 0 | 0 | 0 |
7822 | 2013010304 | 104 | 8 | 27.0 | 0 | 0 | 0 |
31039 | 2013010914 | 124 | 3 | 0.022 | 0 | 0 | 0 |
1586 | 2013010110 | 115 | 5 | 0.5 | 0 | 0 | 0 |
2301 | 2013010115 | 109 | 6 | 0.032 | 0 | 0 | 0 |
12327 | 2013010410 | 105 | 6 | 0.01 | 0 | 0 | 0 |
36944 | 2013011106 | 108 | 5 | 0.9 | 0 | 0 | 0 |
56479 | 2013011616 | 114 | 3 | 0.032 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
27504 | 2013010815 | 110 | 1 | 0.014 | 0 | 0 | 0 |
26340 | 2013010807 | 116 | 1 | 0.008 | 0 | 0 | 0 |
26644 | 2013010809 | 116 | 8 | 139.0 | 0 | 0 | 0 |
39029 | 2013011120 | 105 | 9 | 36.0 | 0 | 0 | 0 |
15942 | 2013010510 | 108 | 1 | 0.007 | 0 | 0 | 0 |
12842 | 2013010413 | 116 | 5 | 0.3 | 0 | 0 | 0 |
13180 | 2013010415 | 122 | 8 | 49.0 | 0 | 0 | 0 |
19705 | 2013010611 | 110 | 3 | 0.04 | 0 | 0 | 0 |
34105 | 2013011011 | 110 | 3 | 0.035 | 0 | 0 | 0 |
63574 | 2013011815 | 121 | 8 | 38.0 | 0 | 0 | 0 |