Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly imbalanced (96.1%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (91.1%) | Imbalance |
평균값 has 216 (2.2%) zeros | Zeros |
측정기 상태 has 9112 (91.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 03:59:48.316233 |
---|---|
Analysis finished | 2024-05-04 03:59:57.833802 |
Duration | 9.52 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 584 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0060113 × 109 |
Minimum | 2.0060101 × 109 |
---|---|
Maximum | 2.0060125 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0060101 × 109 |
---|---|
5-th percentile | 2.0060102 × 109 |
Q1 | 2.0060106 × 109 |
median | 2.0060113 × 109 |
Q3 | 2.0060119 × 109 |
95-th percentile | 2.0060124 × 109 |
Maximum | 2.0060125 × 109 |
Range | 2407 |
Interquartile range (IQR) | 1285 |
Descriptive statistics
Standard deviation | 704.20647 |
---|---|
Coefficient of variation (CV) | 3.5104811 × 10-7 |
Kurtosis | -1.2110538 |
Mean | 2.0060113 × 109 |
Median Absolute Deviation (MAD) | 604 |
Skewness | -0.00038946948 |
Sum | 2.0060113 × 1013 |
Variance | 495906.75 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2006011919 | 31 | 0.3% |
2006012222 | 29 | 0.3% |
2006010609 | 29 | 0.3% |
2006012106 | 29 | 0.3% |
2006011909 | 28 | 0.3% |
2006010605 | 27 | 0.3% |
2006012009 | 27 | 0.3% |
2006010120 | 27 | 0.3% |
2006010804 | 27 | 0.3% |
2006010314 | 27 | 0.3% |
Other values (574) | 9719 |
Value | Count | Frequency (%) |
2006010100 | 24 | |
2006010101 | 11 | |
2006010102 | 15 | |
2006010103 | 12 | |
2006010104 | 13 | |
2006010105 | 17 | |
2006010106 | 20 | |
2006010107 | 15 | |
2006010108 | 24 | |
2006010109 | 19 |
Value | Count | Frequency (%) |
2006012507 | 14 | |
2006012506 | 16 | |
2006012505 | 14 | |
2006012504 | 19 | |
2006012503 | 24 | |
2006012502 | 13 | |
2006012501 | 20 | |
2006012500 | 12 | |
2006012423 | 17 | |
2006012422 | 16 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.8313 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2200175 |
---|---|
Coefficient of variation (CV) | 0.063989492 |
Kurtosis | -1.1965161 |
Mean | 112.8313 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.033736457 |
Sum | 1128313 |
Variance | 52.128653 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
109 | 448 | 4.5% |
101 | 435 | 4.3% |
107 | 426 | 4.3% |
104 | 421 | 4.2% |
112 | 418 | 4.2% |
102 | 413 | 4.1% |
111 | 407 | 4.1% |
117 | 407 | 4.1% |
114 | 401 | 4.0% |
122 | 401 | 4.0% |
Other values (15) | 5823 |
Value | Count | Frequency (%) |
101 | 435 | |
102 | 413 | |
103 | 393 | |
104 | 421 | |
105 | 377 | |
106 | 399 | |
107 | 426 | |
108 | 401 | |
109 | 448 | |
110 | 399 |
Value | Count | Frequency (%) |
125 | 389 | |
124 | 399 | |
123 | 389 | |
122 | 401 | |
121 | 399 | |
120 | 357 | |
119 | 373 | |
118 | 378 | |
117 | 407 | |
116 | 400 |
측정항목
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3376 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7583305 |
---|---|
Coefficient of variation (CV) | 0.51677355 |
Kurtosis | -1.2254251 |
Mean | 5.3376 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.19910636 |
Sum | 53376 |
Variance | 7.6083871 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 1725 | |
9 | 1698 | |
8 | 1669 | |
1 | 1655 | |
6 | 1647 | |
5 | 1606 |
Value | Count | Frequency (%) |
1 | 1655 | |
3 | 1725 | |
5 | 1606 | |
6 | 1647 | |
8 | 1669 | |
9 | 1698 |
Value | Count | Frequency (%) |
9 | 1698 | |
8 | 1669 | |
6 | 1647 | |
5 | 1606 | |
3 | 1725 | |
1 | 1655 |
평균값
Real number (ℝ)
ZEROS
 
Distinct | 314 |
---|---|
Distinct (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -356.7894 |
Minimum | -9999 |
---|---|
Maximum | 2240 |
Zeros | 216 |
Zeros (%) | 2.2% |
Negative | 461 |
Negative (%) | 4.6% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0 |
Q1 | 0.008 |
median | 0.05 |
Q3 | 18 |
95-th percentile | 68 |
Maximum | 2240 |
Range | 12239 |
Interquartile range (IQR) | 17.992 |
Descriptive statistics
Standard deviation | 1890.9833 |
---|---|
Coefficient of variation (CV) | -5.2999985 |
Kurtosis | 22.031257 |
Mean | -356.7894 |
Median Absolute Deviation (MAD) | 0.05 |
Skewness | -4.8988821 |
Sum | -3567894 |
Variance | 3575817.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-9999.0 | 370 | 3.7% |
0.001 | 269 | 2.7% |
0.006 | 268 | 2.7% |
0.004 | 261 | 2.6% |
0.002 | 259 | 2.6% |
0.005 | 253 | 2.5% |
0.003 | 244 | 2.4% |
0.007 | 235 | 2.4% |
0.008 | 232 | 2.3% |
0.0 | 216 | 2.2% |
Other values (304) | 7393 |
Value | Count | Frequency (%) |
-9999.0 | 370 | |
-999.9 | 12 | 0.1% |
-9.999 | 79 | 0.8% |
0.0 | 216 | |
0.001 | 269 | |
0.002 | 259 | |
0.003 | 244 | |
0.004 | 261 | |
0.005 | 253 | |
0.006 | 268 |
Value | Count | Frequency (%) |
2240.0 | 1 | |
1415.0 | 1 | |
1298.0 | 1 | |
1183.0 | 1 | |
1061.0 | 1 | |
1031.0 | 1 | |
910.0 | 1 | |
728.0 | 1 | |
547.0 | 1 | |
511.0 | 1 |
측정기 상태
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.3489 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9112 |
Zeros (%) | 91.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.2997453 |
---|---|
Coefficient of variation (CV) | 3.7252658 |
Kurtosis | 22.042973 |
Mean | 0.3489 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.4822555 |
Sum | 3489 |
Variance | 1.6893377 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9112 | |
4 | 445 | 4.5% |
2 | 248 | 2.5% |
9 | 80 | 0.8% |
1 | 61 | 0.6% |
8 | 54 | 0.5% |
Value | Count | Frequency (%) |
0 | 9112 | |
1 | 61 | 0.6% |
2 | 248 | 2.5% |
4 | 445 | 4.5% |
8 | 54 | 0.5% |
9 | 80 | 0.8% |
Value | Count | Frequency (%) |
9 | 80 | 0.8% |
8 | 54 | 0.5% |
4 | 445 | 4.5% |
2 | 248 | 2.5% |
1 | 61 | 0.6% |
0 | 9112 |
국가 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 42 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9958 | |
1 | 42 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9958 | |
1 | 42 | 0.4% |
지자체 기준초과 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 113 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9887 | |
1 | 113 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9887 | |
1 | 113 | 1.1% |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.015 | 0.085 | 0.151 | 0.175 | 0.254 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.129 | 0.346 | 0.045 | 0.056 |
측정항목 | 0.015 | 0.000 | 1.000 | 0.138 | 0.510 | 0.188 | 0.320 |
평균값 | 0.085 | 0.129 | 0.138 | 1.000 | 0.597 | 0.113 | 0.068 |
측정기 상태 | 0.151 | 0.346 | 0.510 | 0.597 | 1.000 | 0.077 | 0.057 |
국가 기준초과 구분 | 0.175 | 0.045 | 0.188 | 0.113 | 0.077 | 1.000 | 0.809 |
지자체 기준초과 구분 | 0.254 | 0.056 | 0.320 | 0.068 | 0.057 | 0.809 | 1.000 |
지자체 기준초과 구분 | 국가 기준초과 구분 | |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.600 |
국가 기준초과 구분 | 0.600 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.025 | 0.010 | -0.011 | 0.059 | 0.134 | 0.195 |
측정소 코드 | -0.025 | 1.000 | 0.008 | 0.104 | -0.133 | 0.035 | 0.043 |
측정항목 | 0.010 | 0.008 | 1.000 | 0.460 | 0.229 | 0.135 | 0.230 |
평균값 | -0.011 | 0.104 | 0.460 | 1.000 | -0.332 | 0.187 | 0.114 |
측정기 상태 | 0.059 | -0.133 | 0.229 | -0.332 | 1.000 | 0.055 | 0.041 |
국가 기준초과 구분 | 0.134 | 0.035 | 0.135 | 0.187 | 0.055 | 1.000 | 0.600 |
지자체 기준초과 구분 | 0.195 | 0.043 | 0.230 | 0.114 | 0.041 | 0.600 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
5366 | 2006010211 | 120 | 5 | 0.8 | 0 | 0 | 0 |
7219 | 2006010300 | 104 | 3 | 0.023 | 0 | 0 | 0 |
69889 | 2006012009 | 124 | 3 | 0.047 | 0 | 0 | 0 |
21319 | 2006010622 | 104 | 3 | 0.023 | 0 | 0 | 0 |
8617 | 2006010309 | 112 | 3 | 0.024 | 0 | 0 | 0 |
23995 | 2006010715 | 125 | 3 | 0.018 | 0 | 0 | 0 |
53041 | 2006011517 | 116 | 3 | 0.063 | 0 | 0 | 0 |
9393 | 2006010314 | 116 | 6 | 0.023 | 0 | 0 | 0 |
40060 | 2006011203 | 102 | 8 | 78.0 | 0 | 0 | 0 |
22849 | 2006010708 | 109 | 3 | 0.036 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
27165 | 2006010813 | 103 | 6 | 0.002 | 0 | 0 | 0 |
4026 | 2006010202 | 122 | 1 | 0.007 | 0 | 0 | 0 |
73484 | 2006012109 | 123 | 5 | 0.6 | 0 | 0 | 0 |
78820 | 2006012221 | 112 | 8 | 28.0 | 0 | 0 | 0 |
32043 | 2006010921 | 116 | 6 | 0.0 | 0 | 0 | 0 |
5757 | 2006010214 | 110 | 6 | 0.014 | 0 | 0 | 0 |
69930 | 2006012010 | 106 | 1 | 0.007 | 0 | 0 | 0 |
3842 | 2006010201 | 116 | 5 | 1.1 | 0 | 0 | 0 |
61393 | 2006011801 | 108 | 3 | 0.043 | 0 | 0 | 0 |
7698 | 2006010303 | 109 | 1 | 0.004 | 0 | 0 | 0 |