Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
국가 기준초과 구분 is highly imbalanced (69.0%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (69.0%) | Imbalance |
평균값 has 212 (2.1%) zeros | Zeros |
측정기 상태 has 9622 (96.2%) zeros | Zeros |
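The 69.0% imbalance figure reported for the two exceedance flags is consistent with one minus the normalized Shannon entropy of the class counts (9443 zeros, 557 ones). The sketch below reproduces the number under that assumption; `imbalance_score` is a hypothetical helper written for illustration, not the profiler's own function.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    0.0 means perfectly balanced classes, 1.0 means a single class holds every row.
    Assumption: this is the definition behind the report's "Imbalance" alert.
    """
    if len(counts) < 2:
        return 0.0  # a single class has no defined imbalance; treat as 0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))  # entropy of a uniform distribution
    return 1 - entropy / max_entropy

# Class counts taken from the Common Values tables of both exceedance flags.
score = imbalance_score([9443, 557])
print(f"{score:.1%}")  # 69.0%
```

Because both flags share the same 9443/557 split, they receive the same score.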
Reproduction
Analysis started | 2024-05-04 03:56:00.574412 |
---|---|
Analysis finished | 2024-05-04 03:56:11.520194 |
Duration | 10.95 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 470 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.018011 × 10^9 |
Minimum | 2.0180101 × 10^9 |
---|---|
Maximum | 2.018012 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0180101 × 10^9 |
---|---|
5-th percentile | 2.0180102 × 10^9 |
Q1 | 2.0180105 × 10^9 |
median | 2.018011 × 10^9 |
Q3 | 2.0180115 × 10^9 |
95-th percentile | 2.0180119 × 10^9 |
Maximum | 2.018012 × 10^9 |
Range | 1913 |
Interquartile range (IQR) | 994 |
Descriptive statistics
Standard deviation | 563.51008 |
---|---|
Coefficient of variation (CV) | 2.7924034 × 10^-7 |
Kurtosis | -1.1912847 |
Mean | 2.018011 × 10^9 |
Median Absolute Deviation (MAD) | 497 |
Skewness | -0.00048395523 |
Sum | 2.018011 × 10^13 |
Variance | 317543.62 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018010321 | 38 | 0.4% |
2018010811 | 36 | 0.4% |
2018010712 | 34 | 0.3% |
2018010410 | 32 | 0.3% |
2018010607 | 32 | 0.3% |
2018011503 | 32 | 0.3% |
2018012010 | 31 | 0.3% |
2018010807 | 31 | 0.3% |
2018011213 | 31 | 0.3% |
2018012011 | 30 | 0.3% |
Other values (460) | 9673 |
Value | Count | Frequency (%) |
2018010100 | 27 | |
2018010101 | 19 | |
2018010102 | 17 | |
2018010103 | 17 | |
2018010104 | 20 | |
2018010105 | 16 | |
2018010106 | 22 | |
2018010107 | 29 | |
2018010108 | 24 | |
2018010109 | 25 |
Value | Count | Frequency (%) |
2018012013 | 14 | |
2018012012 | 18 | |
2018012011 | 30 | |
2018012010 | 31 | |
2018012009 | 27 | |
2018012008 | 23 | |
2018012007 | 15 | |
2018012006 | 19 | |
2018012005 | 25 | |
2018012004 | 25 |
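The 측정일시 values appear to be timestamps packed into integers as YYYYMMDDHH, which would explain why the profiler treats the column as a real number and why statistics such as the range (1913) are not time spans. A minimal sketch under that assumption converts the minimum and maximum from the tables above into datetimes; `parse_measurement_time` is a hypothetical helper.

```python
from datetime import datetime

def parse_measurement_time(value: int) -> datetime:
    """Parse an integer such as 2018010321 as year, month, day, hour.

    Assumption: the column encodes hourly timestamps as YYYYMMDDHH.
    """
    return datetime.strptime(str(value), "%Y%m%d%H")

# Minimum and maximum values from the extreme-value tables.
first = parse_measurement_time(2018010100)
last = parse_measurement_time(2018012013)

# Count the hourly slots between the two timestamps, inclusive.
hours_covered = int((last - first).total_seconds() // 3600) + 1

print(first)          # 2018-01-01 00:00:00
print(last)           # 2018-01-20 13:00:00
print(hours_covered)  # 470
```

Under this reading the 470 hourly slots match the 470 distinct values reported, so every hour in the period would be present in the sample.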
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.0226 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2036295 |
---|---|
Coefficient of variation (CV) | 0.063736187 |
Kurtosis | -1.203123 |
Mean | 113.0226 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0023322311 |
Sum | 1130226 |
Variance | 51.892278 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
107 | 424 | 4.2% |
114 | 421 | 4.2% |
113 | 417 | 4.2% |
119 | 416 | 4.2% |
123 | 416 | 4.2% |
116 | 410 | 4.1% |
104 | 409 | 4.1% |
124 | 409 | 4.1% |
105 | 408 | 4.1% |
108 | 407 | 4.1% |
Other values (15) | 5863 |
Value | Count | Frequency (%) |
101 | 397 | |
102 | 392 | |
103 | 381 | |
104 | 409 | |
105 | 408 | |
106 | 398 | |
107 | 424 | |
108 | 407 | |
109 | 380 | |
110 | 379 |
Value | Count | Frequency (%) |
125 | 394 | |
124 | 409 | |
123 | 416 | |
122 | 389 | |
121 | 389 | |
120 | 401 | |
119 | 416 | |
118 | 385 | |
117 | 394 | |
116 | 410 |
측정항목 (measurement item)
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3592 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7586476 |
---|---|
Coefficient of variation (CV) | 0.51474988 |
Kurtosis | -1.2072375 |
Mean | 5.3592 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.22050428 |
Sum | 53592 |
Variance | 7.6101364 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 1717 | |
6 | 1684 | |
1 | 1676 | |
8 | 1660 | |
5 | 1645 | |
3 | 1618 |
Value | Count | Frequency (%) |
1 | 1676 | |
3 | 1618 | |
5 | 1645 | |
6 | 1684 | |
8 | 1660 | |
9 | 1717 |
Value | Count | Frequency (%) |
9 | 1717 | |
8 | 1660 | |
6 | 1684 | |
5 | 1645 | |
3 | 1618 | |
1 | 1676 |
평균값 (average value)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 265 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -18.48203 |
Minimum | -9999 |
---|---|
Maximum | 3487 |
Zeros | 212 |
Zeros (%) | 2.1% |
Negative | 36 |
Negative (%) | 0.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.002 |
Q1 | 0.007 |
median | 0.066 |
Q3 | 26 |
95-th percentile | 78 |
Maximum | 3487 |
Range | 13486 |
Interquartile range (IQR) | 25.993 |
Descriptive statistics
Standard deviation | 594.15545 |
---|---|
Coefficient of variation (CV) | -32.147737 |
Kurtosis | 276.09082 |
Mean | -18.48203 |
Median Absolute Deviation (MAD) | 0.066 |
Skewness | -16.553609 |
Sum | -184820.3 |
Variance | 353020.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.005 | 489 | 4.9% |
0.006 | 447 | 4.5% |
0.004 | 424 | 4.2% |
0.007 | 330 | 3.3% |
0.002 | 270 | 2.7% |
0.003 | 262 | 2.6% |
0.5 | 256 | 2.6% |
0.4 | 234 | 2.3% |
0.6 | 217 | 2.2% |
0.7 | 213 | 2.1% |
Other values (255) | 6858 |
Value | Count | Frequency (%) |
-9999.0 | 35 | 0.4% |
-0.1 | 1 | < 0.1% |
0.0 | 212 | |
0.001 | 54 | 0.5% |
0.002 | 270 | |
0.003 | 262 | |
0.004 | 424 | |
0.005 | 489 | |
0.006 | 447 | |
0.007 | 330 |
Value | Count | Frequency (%) |
3487.0 | 1 | < 0.1% |
3416.0 | 1 | < 0.1% |
161.0 | 1 | < 0.1% |
160.0 | 1 | < 0.1% |
157.0 | 1 | < 0.1% |
155.0 | 4 | |
154.0 | 1 | < 0.1% |
153.0 | 1 | < 0.1% |
151.0 | 2 | |
150.0 | 1 | < 0.1% |
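The value -9999 occurs 35 times and pulls the mean of 평균값 below zero even though the median is 0.066. If -9999 is a placeholder for an invalid or missing reading (an assumption; the report does not say so), its effect can be removed using only the figures printed above. The helper name below is hypothetical.

```python
# Figures copied from the 평균값 section of the report.
N_OBSERVATIONS = 10_000
REPORTED_SUM = -184_820.3
SENTINEL = -9999.0      # assumption: placeholder for "no valid reading"
SENTINEL_COUNT = 35     # rows holding the placeholder

def mean_without_sentinel(total, n, sentinel, sentinel_count):
    """Recompute the mean after dropping rows equal to the sentinel value."""
    valid_sum = total - sentinel * sentinel_count
    valid_n = n - sentinel_count
    return valid_sum / valid_n

reported_mean = REPORTED_SUM / N_OBSERVATIONS
adjusted_mean = mean_without_sentinel(
    REPORTED_SUM, N_OBSERVATIONS, SENTINEL, SENTINEL_COUNT
)

print(f"reported mean:              {reported_mean:.5f}")
print(f"mean without sentinel rows: {adjusted_mean:.3f}")
```

With the 35 placeholder rows excluded, the mean comes out at roughly 16.57 instead of -18.48. The standard deviation, skewness and kurtosis are affected by the same rows, but they cannot be recomputed from the summary figures alone.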
측정기 상태 (instrument status)
Real number (ℝ)
ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.2505 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9622 |
Zeros (%) | 96.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.3725663 |
---|---|
Coefficient of variation (CV) | 5.4793064 |
Kurtosis | 29.147384 |
Mean | 0.2505 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.5333242 |
Sum | 2505 |
Variance | 1.8839381 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9622 | |
8 | 218 | 2.2% |
9 | 59 | 0.6% |
1 | 54 | 0.5% |
4 | 41 | 0.4% |
2 | 6 | 0.1% |
Value | Count | Frequency (%) |
0 | 9622 | |
1 | 54 | 0.5% |
2 | 6 | 0.1% |
4 | 41 | 0.4% |
8 | 218 | 2.2% |
9 | 59 | 0.6% |
Value | Count | Frequency (%) |
9 | 59 | 0.6% |
8 | 218 | 2.2% |
4 | 41 | 0.4% |
2 | 6 | 0.1% |
1 | 54 | 0.5% |
0 | 9622 |
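Because 측정기 상태 takes only six distinct values, its summary statistics can be cross-checked directly from the frequency table. The sketch below assumes the reported variance uses the sample (n - 1) denominator; `stats_from_counts` is a hypothetical helper.

```python
import math

# Value -> count, copied from the frequency table above.
status_counts = {0: 9622, 8: 218, 9: 59, 1: 54, 4: 41, 2: 6}

def stats_from_counts(counts):
    """Return (n, sum, mean, sample variance, sample std) from a value->count map."""
    n = sum(counts.values())
    total = sum(value * count for value, count in counts.items())
    mean = total / n
    # Sum of squared deviations, weighted by how often each value occurs.
    squared_dev = sum(count * (value - mean) ** 2 for value, count in counts.items())
    variance = squared_dev / (n - 1)  # assumption: sample variance (ddof = 1)
    return n, total, mean, variance, math.sqrt(variance)

n, total, mean, variance, std = stats_from_counts(status_counts)
print(n, total, mean)
print(round(variance, 7), round(std, 7))
```

The recomputed sum (2505), mean (0.2505), variance and standard deviation agree with the values in the tables above, which suggests the frequency table is complete.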
국가 기준초과 구분 (national standard exceedance flag)
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9443 | |
1 | 557 | 5.6% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9443 | |
1 | 557 | 5.6% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.040 | 0.149 | 0.477 | 0.477 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.049 | 0.438 | 0.038 | 0.038 |
측정항목 | 0.000 | 0.000 | 1.000 | 0.031 | 0.092 | 0.521 | 0.521 |
평균값 | 0.040 | 0.049 | 0.031 | 1.000 | 0.253 | 0.065 | 0.065 |
측정기 상태 | 0.149 | 0.438 | 0.092 | 0.253 | 1.000 | 0.055 | 0.055 |
국가 기준초과 구분 | 0.477 | 0.038 | 0.521 | 0.065 | 0.055 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.477 | 0.038 | 0.521 | 0.065 | 0.055 | 1.000 | 1.000 |
Variable | 지자체 기준초과 구분 | 국가 기준초과 구분 |
---|---|---|
지자체 기준초과 구분 | 1.000 | 0.999 |
국가 기준초과 구분 | 0.999 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.002 | -0.003 | 0.087 | -0.070 | 0.367 | 0.367 |
측정소 코드 | -0.002 | 1.000 | -0.001 | 0.052 | -0.158 | 0.029 | 0.029 |
측정항목 | -0.003 | -0.001 | 1.000 | 0.684 | 0.005 | 0.377 | 0.377 |
평균값 | 0.087 | 0.052 | 0.684 | 1.000 | -0.209 | 0.058 | 0.058 |
측정기 상태 | -0.070 | -0.158 | 0.005 | -0.209 | 1.000 | 0.040 | 0.040 |
국가 기준초과 구분 | 0.367 | 0.029 | 0.377 | 0.058 | 0.040 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.367 | 0.029 | 0.377 | 0.058 | 0.040 | 0.999 | 1.000 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
1881 | 2018010112 | 114 | 6 | 0.024 | 0 | 0 | 0 |
49119 | 2018011415 | 112 | 6 | 0.016 | 0 | 0 | 0 |
50590 | 2018011501 | 107 | 8 | 57.0 | 0 | 0 | 0 |
42721 | 2018011220 | 121 | 3 | 0.061 | 0 | 0 | 0 |
3500 | 2018010123 | 109 | 5 | 0.8 | 0 | 0 | 0 |
16902 | 2018010516 | 118 | 1 | 0.006 | 0 | 0 | 0 |
1548 | 2018010110 | 109 | 1 | 0.006 | 0 | 0 | 0 |
4133 | 2018010203 | 114 | 9 | 26.0 | 0 | 0 | 0 |
39120 | 2018011120 | 121 | 1 | 0.006 | 0 | 0 | 0 |
29604 | 2018010905 | 110 | 1 | 0.006 | 0 | 0 | 0 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
5512 | 2018010212 | 119 | 8 | 58.0 | 0 | 0 | 0 |
2562 | 2018010117 | 103 | 1 | 0.006 | 0 | 0 | 0 |
23890 | 2018010715 | 107 | 8 | 68.0 | 0 | 0 | 0 |
18673 | 2018010604 | 113 | 3 | 0.033 | 0 | 0 | 0 |
43003 | 2018011222 | 118 | 3 | 0.053 | 0 | 0 | 0 |
14617 | 2018010501 | 112 | 3 | 0.056 | 0 | 0 | 0 |
25650 | 2018010803 | 101 | 1 | 0.006 | 0 | 0 | 0 |
56180 | 2018011614 | 114 | 5 | 1.0 | 0 | 0 | 0 |
11669 | 2018010405 | 120 | 9 | 16.0 | 0 | 0 | 0 |
14328 | 2018010423 | 114 | 1 | 0.008 | 0 | 0 | 0 |