Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
지자체 기준초과 구분 (local-government standard exceeded flag) is highly overall correlated with 국가 기준초과 구분 (national standard exceeded flag) | High correlation |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
측정항목 (measurement item) is highly overall correlated with 평균값 (average value) | High correlation |
평균값 is highly overall correlated with 측정항목 | High correlation |
측정기 상태 (instrument status) is highly imbalanced (91.6%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (78.0%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (78.0%) | Imbalance |
평균값 is highly skewed (γ1 = -59.15126768) | Skewed |
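The imbalance percentages in these alerts are consistent with one minus the normalized Shannon entropy of the class shares. The sketch below assumes that definition (the report does not state the formula) and reuses the class counts from the Common Values tables further down; `imbalance_score` is an illustrative name, not a library function.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

status_counts = [9839, 117, 44]  # 측정기 상태: values 0, 9, 1
flag_counts = [9647, 353]        # 국가/지자체 기준초과 구분: values 0, 1

print(f"{imbalance_score(status_counts):.1%}")
print(f"{imbalance_score(flag_counts):.1%}")
```

Under this assumption both results round to the figures shown in the alerts (91.6% and 78.0%). A perfectly balanced column would score 0.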
Reproduction
Analysis started | 2024-04-27 12:02:29.464980 |
---|---|
Analysis finished | 2024-04-27 12:02:38.427120 |
Duration | 8.96 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 449 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.021011 × 10^9 |
Minimum | 2.0210101 × 10^9 |
---|---|
Maximum | 2.0210119 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0210101 × 10^9 |
---|---|
5-th percentile | 2.0210101 × 10^9 |
Q1 | 2.0210105 × 10^9 |
median | 2.021011 × 10^9 |
Q3 | 2.0210115 × 10^9 |
95-th percentile | 2.0210118 × 10^9 |
Maximum | 2.0210119 × 10^9 |
Range | 1816 |
Interquartile range (IQR) | 983 |
Descriptive statistics
Standard deviation | 540.2197 |
---|---|
Coefficient of variation (CV) | 2.6730171 × 10^-7 |
Kurtosis | -1.1948509 |
Mean | 2.021011 × 10^9 |
Median Absolute Deviation (MAD) | 491.5 |
Skewness | -0.0073428005 |
Sum | 2.021011 × 10^13 |
Variance | 291837.32 |
Monotonicity | Not monotonic |
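The coefficient of variation is the standard deviation divided by the mean, so it is tiny here simply because the values are ten-digit integers. A quick check using the rounded figures displayed above (the last digits can differ slightly from the report because the mean is shown rounded):

```python
# Figures copied from the tables above; the mean is rounded as displayed.
std = 540.2197
mean = 2.021011e9

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation

print(f"CV = {cv:.4e}")
print(f"variance = {variance:.2f}")
```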
Value | Count | Frequency (%) |
2021010422 | 42 | 0.4% |
2021011111 | 34 | 0.3% |
2021010113 | 34 | 0.3% |
2021010306 | 33 | 0.3% |
2021010318 | 32 | 0.3% |
2021010214 | 32 | 0.3% |
2021011020 | 32 | 0.3% |
2021011319 | 32 | 0.3% |
2021011504 | 31 | 0.3% |
2021010114 | 31 | 0.3% |
Other values (439) | 9667 |
Value | Count | Frequency (%) |
2021010100 | 15 | |
2021010101 | 19 | |
2021010102 | 25 | |
2021010103 | 20 | |
2021010104 | 28 | |
2021010105 | 18 | |
2021010106 | 14 | |
2021010107 | 25 | |
2021010108 | 19 | |
2021010109 | 25 |
Value | Count | Frequency (%) |
2021011916 | 24 | |
2021011915 | 24 | |
2021011914 | 29 | |
2021011913 | 16 | |
2021011912 | 20 | |
2021011911 | 29 | |
2021011910 | 19 | |
2021011909 | 21 | |
2021011908 | 23 | |
2021011907 | 16 |
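The values in this column look like hourly timestamps packed into integers as YYYYMMDDHH, which would explain why the profiler treats them as real numbers. Assuming that layout (it is inferred from the values, not stated in the report), they can be decoded with the standard library; `parse_hour_stamp` is an illustrative helper.

```python
from datetime import datetime

def parse_hour_stamp(value: int) -> datetime:
    """Decode an integer such as 2021010422 as 2021-01-04 22:00 (assumed layout)."""
    return datetime.strptime(str(value), "%Y%m%d%H")

first = parse_hour_stamp(2021010100)  # minimum in the table above
last = parse_hour_stamp(2021011916)   # maximum in the table above

# Number of hourly slots from first to last, both ends included.
hours = int((last - first).total_seconds() // 3600) + 1
print(first, last, hours)
```

Under this reading the span holds 449 hourly slots, which matches the Distinct count reported for the column. Parsing the column as datetimes before profiling would also make statistics such as the mean and range meaningful.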
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.1735 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.232263 |
---|---|
Coefficient of variation (CV) | 0.063904209 |
Kurtosis | -1.2107478 |
Mean | 113.1735 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.033927399 |
Sum | 1131735 |
Variance | 52.305628 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
124 | 428 | 4.3% |
121 | 427 | 4.3% |
120 | 423 | 4.2% |
106 | 418 | 4.2% |
123 | 417 | 4.2% |
116 | 417 | 4.2% |
102 | 416 | 4.2% |
125 | 412 | 4.1% |
118 | 408 | 4.1% |
119 | 406 | 4.1% |
Other values (15) | 5828 |
Value | Count | Frequency (%) |
101 | 374 | |
102 | 416 | |
103 | 402 | |
104 | 373 | |
105 | 383 | |
106 | 418 | |
107 | 372 | |
108 | 375 | |
109 | 388 | |
110 | 394 |
Value | Count | Frequency (%) |
125 | 412 | |
124 | 428 | |
123 | 417 | |
122 | 387 | |
121 | 427 | |
120 | 423 | |
119 | 406 | |
118 | 408 | |
117 | 400 | |
116 | 417 |
측정항목 (measurement item)
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3416 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7356275 |
---|---|
Coefficient of variation (CV) | 0.51213634 |
Kurtosis | -1.190938 |
Mean | 5.3416 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.20544676 |
Sum | 53416 |
Variance | 7.4836578 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 1712 | 17.1% |
5 | 1684 | 16.8% |
9 | 1680 | 16.8% |
3 | 1670 | 16.7% |
1 | 1634 | 16.3% |
8 | 1620 | 16.2% |
Value | Count | Frequency (%) |
1 | 1634 | 16.3% |
3 | 1670 | 16.7% |
5 | 1684 | 16.8% |
6 | 1712 | 17.1% |
8 | 1620 | 16.2% |
9 | 1680 | 16.8% |
Value | Count | Frequency (%) |
9 | 1680 | 16.8% |
8 | 1620 | 16.2% |
6 | 1712 | 17.1% |
5 | 1684 | 16.8% |
3 | 1670 | 16.7% |
1 | 1634 | 16.3% |
평균값 (average value)
Real number (ℝ)
HIGH CORRELATION, SKEWED
Distinct | 238 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.4306731 |
Minimum | -9999 |
---|---|
Maximum | 1985 |
Zeros | 2 |
Zeros (%) | < 0.1% |
Negative | 10 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.012 |
median | 0.065 |
Q3 | 15 |
95-th percentile | 52 |
Maximum | 1985 |
Range | 11984 |
Interquartile range (IQR) | 14.988 |
Descriptive statistics
Standard deviation | 149.56717 |
---|---|
Coefficient of variation (CV) | 15.85965 |
Kurtosis | 4020.3883 |
Mean | 9.4306731 |
Median Absolute Deviation (MAD) | 0.064 |
Skewness | -59.151268 |
Sum | 94306.731 |
Variance | 22370.34 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.003 | 777 | 7.8% |
0.004 | 604 | 6.0% |
0.4 | 460 | 4.6% |
0.5 | 339 | 3.4% |
0.002 | 318 | 3.2% |
0.6 | 238 | 2.4% |
0.005 | 228 | 2.3% |
0.7 | 209 | 2.1% |
0.3 | 143 | 1.4% |
13.0 | 131 | 1.3% |
Other values (228) | 6553 |
Value | Count | Frequency (%) |
-9999.0 | 2 | < 0.1% |
-1000.0 | 2 | < 0.1% |
-100.0 | 2 | < 0.1% |
-9.999 | 3 | < 0.1% |
-0.016 | 1 | < 0.1% |
0.0 | 2 | < 0.1% |
0.001 | 31 | 0.3% |
0.002 | 318 | |
0.003 | 777 | |
0.004 | 604 |
Value | Count | Frequency (%) |
1985.0 | 3 | |
985.0 | 5 | |
672.0 | 1 | < 0.1% |
666.0 | 1 | < 0.1% |
287.0 | 1 | < 0.1% |
210.0 | 1 | < 0.1% |
209.0 | 1 | < 0.1% |
188.0 | 1 | < 0.1% |
182.0 | 1 | < 0.1% |
172.0 | 1 | < 0.1% |
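The smallest values (-9999, -1000, -100, -9.999) look like placeholder codes for missing or invalid readings rather than real measurements, which would explain the strongly negative skewness. That interpretation is an assumption; the report does not document these codes. The sketch below uses a small made-up sample (not the real data) to show how masking negative readings changes the population skewness; `skewness` and `drop_negative` are illustrative helpers.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

def drop_negative(values):
    """Keep non-negative readings only (assumes negatives are placeholders)."""
    return [v for v in values if v >= 0]

# Made-up sample: a few plausible readings plus one -9999 placeholder.
sample = [0.003, 0.004, 0.4, 0.5, 13.0, 15.0, 52.0, -9999.0]

raw = skewness(sample)
clean = skewness(drop_negative(sample))
print(f"raw skewness:   {raw:.2f}")
print(f"clean skewness: {clean:.2f}")
```

A single placeholder is enough to flip the sign of the skewness in this toy sample. pandas uses a bias-adjusted estimator, so its figures would differ slightly from this population formula.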
측정기 상태 (instrument status)
Categorical
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9839 | 98.4% |
9 | 117 | 1.2% |
1 | 44 | 0.4% |
국가 기준초과 구분 (national standard exceeded flag)
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 1 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9647 | 96.5% |
1 | 353 | 3.5% |
지자체 기준초과 구분 (local-government standard exceeded flag)
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 1 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9647 | 96.5% |
1 | 353 | 3.5% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.000 | 0.080 | 0.097 | 0.352 | 0.352 |
측정소 코드 | 0.000 | 1.000 | 0.033 | 0.089 | 0.153 | 0.026 | 0.026 |
측정항목 | 0.000 | 0.033 | 1.000 | 0.069 | 0.166 | 0.490 | 0.490 |
평균값 | 0.080 | 0.089 | 0.069 | 1.000 | 0.535 | 0.089 | 0.089 |
측정기 상태 | 0.097 | 0.153 | 0.166 | 0.535 | 1.000 | 0.047 | 0.047 |
국가 기준초과 구분 | 0.352 | 0.026 | 0.490 | 0.089 | 0.047 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.352 | 0.026 | 0.490 | 0.089 | 0.047 | 1.000 | 1.000 |
Variable | 지자체 기준초과 구분 | 측정기 상태 | 국가 기준초과 구분 |
---|---|---|---|
지자체 기준초과 구분 | 1.000 | 0.079 | 0.999 |
측정기 상태 | 0.079 | 1.000 | 0.079 |
국가 기준초과 구분 | 0.999 | 0.079 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | -0.004 | -0.002 | 0.031 | 0.057 | 0.270 | 0.270 |
측정소 코드 | -0.004 | 1.000 | 0.008 | 0.009 | 0.092 | 0.020 | 0.020 |
측정항목 | -0.002 | 0.008 | 1.000 | 0.772 | 0.069 | 0.353 | 0.353 |
평균값 | 0.031 | 0.009 | 0.772 | 1.000 | 0.269 | 0.147 | 0.147 |
측정기 상태 | 0.057 | 0.092 | 0.069 | 0.269 | 1.000 | 0.079 | 0.079 |
국가 기준초과 구분 | 0.270 | 0.020 | 0.353 | 0.147 | 0.079 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.270 | 0.020 | 0.353 | 0.147 | 0.079 | 0.999 | 1.000 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
51207 | 2021011505 | 110 | 6 | 0.002 | 0 | 0 | 0 |
23549 | 2021010712 | 125 | 9 | 14.0 | 0 | 0 | 0 |
4479 | 2021010205 | 122 | 6 | 0.022 | 0 | 0 | 0 |
52835 | 2021011516 | 106 | 9 | 59.0 | 0 | 1 | 1 |
34250 | 2021011012 | 109 | 5 | 0.6 | 0 | 0 | 0 |
5429 | 2021010212 | 105 | 9 | 6.0 | 0 | 0 | 0 |
12070 | 2021010408 | 112 | 8 | 37.0 | 0 | 0 | 0 |
65828 | 2021011906 | 122 | 5 | 0.3 | 0 | 0 | 0 |
56800 | 2021011618 | 117 | 8 | 30.0 | 0 | 0 | 0 |
64886 | 2021011900 | 115 | 5 | 0.4 | 0 | 0 | 0 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
35487 | 2021011020 | 115 | 6 | 0.008 | 0 | 0 | 0 |
7227 | 2021010300 | 105 | 6 | 0.028 | 0 | 0 | 0 |
9288 | 2021010313 | 124 | 1 | 0.005 | 0 | 0 | 0 |
8216 | 2021010306 | 120 | 5 | 0.4 | 0 | 0 | 0 |
49968 | 2021011421 | 104 | 1 | 0.003 | 0 | 0 | 0 |
65336 | 2021011903 | 115 | 5 | 0.4 | 0 | 0 | 0 |
59611 | 2021011713 | 111 | 3 | 0.017 | 0 | 0 | 0 |
30775 | 2021010913 | 105 | 3 | 0.01 | 0 | 0 | 0 |
57854 | 2021011701 | 118 | 5 | 0.4 | 0 | 0 | 0 |
11543 | 2021010404 | 124 | 9 | 22.0 | 0 | 0 | 0 |