Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 has constant value "" | Constant |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
측정기 상태 is highly imbalanced (95.5%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (99.6%) | Imbalance |
평균값 is highly skewed (γ1 = -55.78194489) | Skewed |
Reproduction
Analysis started | 2024-05-04 03:58:26.160405 |
---|---|
Analysis finished | 2024-05-04 03:58:35.045871 |
Duration | 8.89 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 440 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.010011 × 109 |
Minimum | 2.0100101 × 109 |
---|---|
Maximum | 2.0100119 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0100101 × 109 |
---|---|
5-th percentile | 2.0100101 × 109 |
Q1 | 2.0100105 × 109 |
median | 2.010011 × 109 |
Q3 | 2.0100114 × 109 |
95-th percentile | 2.0100118 × 109 |
Maximum | 2.0100119 × 109 |
Range | 1807 |
Interquartile range (IQR) | 904 |
Descriptive statistics
Standard deviation | 530.60642 |
---|---|
Coefficient of variation (CV) | 2.6398185 × 10-7 |
Kurtosis | -1.2045808 |
Mean | 2.010011 × 109 |
Median Absolute Deviation (MAD) | 484 |
Skewness | 0.0038751439 |
Sum | 2.010011 × 1013 |
Variance | 281543.18 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2010011011 | 36 | 0.4% |
2010010311 | 36 | 0.4% |
2010010921 | 36 | 0.4% |
2010011007 | 34 | 0.3% |
2010010913 | 33 | 0.3% |
2010011818 | 33 | 0.3% |
2010011900 | 33 | 0.3% |
2010010914 | 32 | 0.3% |
2010010323 | 32 | 0.3% |
2010010201 | 31 | 0.3% |
Other values (430) | 9664 |
Value | Count | Frequency (%) |
2010010100 | 19 | |
2010010101 | 19 | |
2010010102 | 19 | |
2010010103 | 21 | |
2010010104 | 21 | |
2010010105 | 27 | |
2010010106 | 22 | |
2010010107 | 25 | |
2010010108 | 20 | |
2010010109 | 18 |
Value | Count | Frequency (%) |
2010011907 | 19 | |
2010011906 | 20 | |
2010011905 | 19 | |
2010011904 | 20 | |
2010011903 | 27 | |
2010011902 | 31 | |
2010011901 | 28 | |
2010011900 | 33 | |
2010011823 | 15 | |
2010011822 | 18 |
측정소 코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.965 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2202346 |
---|---|
Coefficient of variation (CV) | 0.063915679 |
Kurtosis | -1.221894 |
Mean | 112.965 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0016855759 |
Sum | 1129650 |
Variance | 52.131788 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
104 | 431 | 4.3% |
107 | 430 | 4.3% |
121 | 430 | 4.3% |
120 | 424 | 4.2% |
112 | 423 | 4.2% |
123 | 422 | 4.2% |
119 | 414 | 4.1% |
103 | 413 | 4.1% |
101 | 411 | 4.1% |
111 | 409 | 4.1% |
Other values (15) | 5793 |
Value | Count | Frequency (%) |
101 | 411 | |
102 | 383 | |
103 | 413 | |
104 | 431 | |
105 | 398 | |
106 | 381 | |
107 | 430 | |
108 | 390 | |
109 | 374 | |
110 | 405 |
Value | Count | Frequency (%) |
125 | 356 | |
124 | 399 | |
123 | 422 | |
122 | 397 | |
121 | 430 | |
120 | 424 | |
119 | 414 | |
118 | 403 | |
117 | 384 | |
116 | 356 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3319 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7279455 |
---|---|
Coefficient of variation (CV) | 0.51162728 |
Kurtosis | -1.1914248 |
Mean | 5.3319 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.20148428 |
Sum | 53319 |
Variance | 7.4416866 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 1746 | |
3 | 1713 | |
5 | 1641 | |
8 | 1641 | |
9 | 1639 | |
1 | 1620 |
Value | Count | Frequency (%) |
1 | 1620 | |
3 | 1713 | |
5 | 1641 | |
6 | 1746 | |
8 | 1641 | |
9 | 1639 |
Value | Count | Frequency (%) |
9 | 1639 | |
8 | 1641 | |
6 | 1746 | |
5 | 1641 | |
3 | 1713 | |
1 | 1620 |
평균값
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 250 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.677921 |
Minimum | -9999 |
---|---|
Maximum | 224 |
Zeros | 22 |
Zeros (%) | 0.2% |
Negative | 6 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.011 |
median | 0.077 |
Q3 | 27 |
95-th percentile | 68 |
Maximum | 224 |
Range | 10223 |
Interquartile range (IQR) | 26.989 |
Descriptive statistics
Standard deviation | 175.42232 |
---|---|
Coefficient of variation (CV) | 15.021708 |
Kurtosis | 3180.8685 |
Mean | 11.677921 |
Median Absolute Deviation (MAD) | 0.076 |
Skewness | -55.781945 |
Sum | 116779.21 |
Variance | 30772.991 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.006 | 316 | 3.2% |
0.005 | 312 | 3.1% |
0.007 | 300 | 3.0% |
0.008 | 285 | 2.9% |
0.003 | 240 | 2.4% |
0.009 | 234 | 2.3% |
0.5 | 211 | 2.1% |
0.6 | 206 | 2.1% |
0.004 | 205 | 2.1% |
0.002 | 196 | 2.0% |
Other values (240) | 7495 |
Value | Count | Frequency (%) |
-9999.0 | 3 | < 0.1% |
-999.9 | 1 | < 0.1% |
-9.999 | 2 | < 0.1% |
0.0 | 22 | 0.2% |
0.001 | 123 | 1.2% |
0.002 | 196 | |
0.003 | 240 | |
0.004 | 205 | |
0.005 | 312 | |
0.006 | 316 |
Value | Count | Frequency (%) |
224.0 | 1 | |
165.0 | 1 | |
155.0 | 1 | |
128.0 | 1 | |
126.0 | 2 | |
125.0 | 1 | |
122.0 | 1 | |
121.0 | 1 | |
120.0 | 1 | |
119.0 | 1 |
측정기 상태
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 57 |
9 | 29 |
2 | 9 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9905 | |
1 | 57 | 0.6% |
9 | 29 | 0.3% |
2 | 9 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9905 | |
1 | 57 | 0.6% |
9 | 29 | 0.3% |
2 | 9 | 0.1% |
국가 기준초과 구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9997 | |
1 | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9997 | |
1 | 3 | < 0.1% |
지자체 기준초과 구분
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 10000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.007 | 0.025 | 0.081 | 0.029 |
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.019 | 0.112 | 0.000 |
측정항목 | 0.007 | 0.000 | 1.000 | 0.004 | 0.088 | 0.043 |
평균값 | 0.025 | 0.019 | 0.004 | 1.000 | 0.203 | 0.000 |
측정기 상태 | 0.081 | 0.112 | 0.088 | 0.203 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.029 | 0.000 | 0.043 | 0.000 | 0.000 | 1.000 |
국가 기준초과 구분 | 측정기 상태 | |
---|---|---|
국가 기준초과 구분 | 1.000 | 0.000 |
측정기 상태 | 0.000 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | |
---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.002 | 0.002 | 0.039 | 0.048 | 0.022 |
측정소 코드 | -0.002 | 1.000 | 0.007 | 0.001 | 0.067 | 0.000 |
측정항목 | 0.002 | 0.007 | 1.000 | 0.677 | 0.057 | 0.031 |
평균값 | 0.039 | 0.001 | 0.677 | 1.000 | 0.186 | 0.000 |
측정기 상태 | 0.048 | 0.067 | 0.057 | 0.186 | 1.000 | 0.000 |
국가 기준초과 구분 | 0.022 | 0.000 | 0.031 | 0.000 | 0.000 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
37259 | 2010011108 | 110 | 9 | 49.0 | 0 | 0 | 0 |
13131 | 2010010415 | 114 | 6 | 0.025 | 0 | 0 | 0 |
37848 | 2010011112 | 109 | 1 | 0.013 | 0 | 0 | 0 |
45363 | 2010011314 | 111 | 6 | 0.027 | 0 | 0 | 0 |
2679 | 2010010117 | 122 | 6 | 0.002 | 0 | 0 | 0 |
19350 | 2010010609 | 101 | 1 | 0.008 | 0 | 0 | 0 |
40598 | 2010011206 | 117 | 5 | 0.7 | 0 | 0 | 0 |
29086 | 2010010901 | 123 | 8 | 87.0 | 0 | 0 | 0 |
33980 | 2010011010 | 114 | 5 | 0.8 | 0 | 0 | 0 |
12606 | 2010010412 | 102 | 1 | 0.005 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
49276 | 2010011416 | 113 | 8 | 42.0 | 0 | 0 | 0 |
30451 | 2010010911 | 101 | 3 | 0.065 | 0 | 0 | 0 |
5651 | 2010010213 | 117 | 9 | 31.0 | 0 | 0 | 0 |
14268 | 2010010423 | 104 | 1 | 0.006 | 0 | 0 | 0 |
575 | 2010010103 | 121 | 9 | 19.0 | 0 | 0 | 0 |
47761 | 2010011406 | 111 | 3 | 0.056 | 0 | 0 | 0 |
40843 | 2010011208 | 108 | 3 | 0.032 | 0 | 0 | 0 |
32173 | 2010010922 | 113 | 3 | 0.069 | 0 | 0 | 0 |
19343 | 2010010608 | 124 | 9 | 23.0 | 0 | 0 | 0 |
1254 | 2010010108 | 110 | 1 | 0.007 | 0 | 0 | 0 |