Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 3 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분 | High correlation |
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분 | High correlation |
측정항목 is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 측정항목 and 측정기 상태 | High correlation |
측정기 상태 is highly overall correlated with 평균값 | High correlation |
측정기 상태 is highly imbalanced (93.8%) | Imbalance |
국가 기준초과 구분 is highly imbalanced (68.9%) | Imbalance |
지자체 기준초과 구분 is highly imbalanced (68.9%) | Imbalance |
평균값 is highly skewed (γ₁ = -22.16042092) | Skewed |
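The imbalance percentages in the alerts above can be approximated from the class counts listed later in this report. The sketch below assumes the score is one minus the Shannon entropy divided by its maximum, log(k) for k classes; that formula is an assumption about how the profiler scores imbalance, and `imbalance_score` is a hypothetical helper name. With the counts from this report it comes out at about 93.8% for 측정기 상태 and about 68.9% for the two exceedance flags.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform split, close to 1 when one class dominates.

    Assumed formula, not taken from the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy of the observed class frequencies (empty classes contribute 0).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


status_counts = [9843, 71, 42, 25, 19]  # 측정기 상태 counts from this report
flag_counts = [9442, 558]               # 국가 / 지자체 기준초과 구분 counts from this report

status_score = imbalance_score(status_counts)
flag_score = imbalance_score(flag_counts)
print(f"{status_score:.1%}  {flag_score:.1%}")
```

A score near 1 means that almost every row falls in a single class, so accuracy-style metrics on these columns would be dominated by the majority value 0.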
Reproduction
Analysis started | 2024-05-11 06:56:00.772207 |
---|---|
Analysis finished | 2024-05-11 06:56:06.021054 |
Duration | 5.25 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
측정일시 (measurement date and time)
Real number (ℝ)
Distinct | 473 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.017011 × 10⁹ |
Minimum | 2.0170101 × 10⁹ |
---|---|
Maximum | 2.017012 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0170101 × 10⁹ |
---|---|
5-th percentile | 2.0170101 × 10⁹ |
Q1 | 2.0170105 × 10⁹ |
median | 2.017011 × 10⁹ |
Q3 | 2.0170115 × 10⁹ |
95-th percentile | 2.0170119 × 10⁹ |
Maximum | 2.017012 × 10⁹ |
Range | 1916 |
Interquartile range (IQR) | 1000 |
Descriptive statistics
Standard deviation | 571.27757 |
---|---|
Coefficient of variation (CV) | 2.8322977 × 10⁻⁷ |
Kurtosis | -1.2059239 |
Mean | 2.017011 × 10⁹ |
Median Absolute Deviation (MAD) | 500 |
Skewness | 0.010999287 |
Sum | 2.017011 × 10¹³ |
Variance | 326358.06 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2017011518 | 35 | 0.4% |
2017012000 | 33 | 0.3% |
2017011615 | 33 | 0.3% |
2017011701 | 32 | 0.3% |
2017010109 | 32 | 0.3% |
2017011521 | 31 | 0.3% |
2017011209 | 31 | 0.3% |
2017010219 | 31 | 0.3% |
2017010105 | 30 | 0.3% |
2017011123 | 30 | 0.3% |
Other values (463) | 9682 | 96.8% |
Value | Count | Frequency (%) |
2017010100 | 16 | 0.2% |
2017010101 | 23 | 0.2% |
2017010102 | 20 | 0.2% |
2017010103 | 24 | 0.2% |
2017010104 | 25 | 0.2% |
2017010105 | 30 | 0.3% |
2017010106 | 23 | 0.2% |
2017010107 | 28 | 0.3% |
2017010108 | 18 | 0.2% |
2017010109 | 32 | 0.3% |
Value | Count | Frequency (%) |
2017012016 | 7 | 0.1% |
2017012015 | 20 | 0.2% |
2017012014 | 16 | 0.2% |
2017012013 | 22 | 0.2% |
2017012012 | 19 | 0.2% |
2017012011 | 26 | 0.3% |
2017012010 | 26 | 0.3% |
2017012009 | 23 | 0.2% |
2017012008 | 20 | 0.2% |
2017012007 | 27 | 0.3% |
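The profiler treated 측정일시 as a real number, so statistics such as the range (1916) and the IQR (1000) are differences between integer encodings, not durations. The values look like hourly timestamps in a `YYYYMMDDHH` layout; that layout is an assumption inferred from the value tables above, and `parse_measurement_hour` is a hypothetical helper. A minimal sketch of decoding them with the standard library:

```python
from datetime import datetime


def parse_measurement_hour(value):
    """Parse an integer such as 2017011518 as YYYYMMDDHH (assumed layout)."""
    return datetime.strptime(str(value), "%Y%m%d%H")


first = parse_measurement_hour(2017010100)  # minimum reported for 측정일시
last = parse_measurement_hour(2017012016)   # maximum reported for 측정일시

# Number of whole hours between the first and last measurement.
span_hours = int((last - first).total_seconds() // 3600)
# Count of hourly slots, including both endpoints.
hourly_slots = span_hours + 1
print(first, last, span_hours, hourly_slots)
```

Under this reading the data span 472 hours, which gives 473 hourly slots and matches the 473 distinct values reported for the column, so every hour in the window appears at least once.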
측정소 코드 (station code)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.041 |
Minimum | 101 |
---|---|
Maximum | 125 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 102 |
Q1 | 107 |
median | 113 |
Q3 | 119 |
95-th percentile | 124 |
Maximum | 125 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.2482531 |
---|---|
Coefficient of variation (CV) | 0.064120568 |
Kurtosis | -1.2079017 |
Mean | 113.041 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.019049945 |
Sum | 1130410 |
Variance | 52.537173 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
124 | 430 | 4.3% |
101 | 430 | 4.3% |
122 | 428 | 4.3% |
112 | 420 | 4.2% |
120 | 417 | 4.2% |
111 | 416 | 4.2% |
116 | 416 | 4.2% |
102 | 413 | 4.1% |
121 | 408 | 4.1% |
104 | 399 | 4.0% |
Other values (15) | 5823 | 58.2% |
Value | Count | Frequency (%) |
101 | 430 | 4.3% |
102 | 413 | 4.1% |
103 | 396 | 4.0% |
104 | 399 | 4.0% |
105 | 380 | 3.8% |
106 | 380 | 3.8% |
107 | 377 | 3.8% |
108 | 385 | 3.9% |
109 | 392 | 3.9% |
110 | 394 | 3.9% |
Value | Count | Frequency (%) |
125 | 387 | 3.9% |
124 | 430 | 4.3% |
123 | 388 | 3.9% |
122 | 428 | 4.3% |
121 | 408 | 4.1% |
120 | 417 | 4.2% |
119 | 389 | 3.9% |
118 | 391 | 3.9% |
117 | 391 | 3.9% |
116 | 416 | 4.2% |
측정항목 (measurement item)
Real number (ℝ)
Alerts: High correlation
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3583 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7596889 |
---|---|
Coefficient of variation (CV) | 0.51503068 |
Kurtosis | -1.2144538 |
Mean | 5.3583 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.22088221 |
Sum | 53583 |
Variance | 7.6158827 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8 | 1709 | 17.1% |
9 | 1692 | 16.9% |
1 | 1673 | 16.7% |
6 | 1658 | 16.6% |
3 | 1639 | 16.4% |
5 | 1629 | 16.3% |
Value | Count | Frequency (%) |
1 | 1673 | 16.7% |
3 | 1639 | 16.4% |
5 | 1629 | 16.3% |
6 | 1658 | 16.6% |
8 | 1709 | 17.1% |
9 | 1692 | 16.9% |
Value | Count | Frequency (%) |
9 | 1692 | 16.9% |
8 | 1709 | 17.1% |
6 | 1658 | 16.6% |
5 | 1629 | 16.3% |
3 | 1639 | 16.4% |
1 | 1673 | 16.7% |
평균값 (average value)
Real number (ℝ)
Alerts: High correlation, Skewed
Distinct | 283 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -4.534255 |
Minimum | -9999 |
---|---|
Maximum | 297 |
Zeros | 33 |
Zeros (%) | 0.3% |
Negative | 22 |
Negative (%) | 0.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | 0.003 |
Q1 | 0.008 |
median | 0.0875 |
Q3 | 22 |
95-th percentile | 79 |
Maximum | 297 |
Range | 10296 |
Interquartile range (IQR) | 21.992 |
Descriptive statistics
Standard deviation | 448.34494 |
---|---|
Coefficient of variation (CV) | -98.879516 |
Kurtosis | 491.22279 |
Mean | -4.534255 |
Median Absolute Deviation (MAD) | 0.1125 |
Skewness | -22.160421 |
Sum | -45342.55 |
Variance | 201013.18 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.005 | 578 | 5.8% |
0.004 | 570 | 5.7% |
0.006 | 348 | 3.5% |
0.003 | 330 | 3.3% |
0.002 | 287 | 2.9% |
0.4 | 234 | 2.3% |
0.007 | 210 | 2.1% |
0.5 | 203 | 2.0% |
0.6 | 184 | 1.8% |
0.3 | 157 | 1.6% |
Other values (273) | 6899 | 69.0% |
Value | Count | Frequency (%) |
-9999.0 | 20 | 0.2% |
-18.0 | 1 | < 0.1% |
-12.0 | 1 | < 0.1% |
0.0 | 33 | 0.3% |
0.001 | 26 | 0.3% |
0.002 | 287 | 2.9% |
0.003 | 330 | 3.3% |
0.004 | 570 | 5.7% |
0.005 | 578 | 5.8% |
0.006 | 348 | 3.5% |
Value | Count | Frequency (%) |
297.0 | 1 | < 0.1% |
244.0 | 1 | < 0.1% |
216.0 | 1 | < 0.1% |
197.0 | 1 | < 0.1% |
194.0 | 1 | < 0.1% |
187.0 | 1 | < 0.1% |
182.0 | 1 | < 0.1% |
178.0 | 2 | < 0.1% |
177.0 | 1 | < 0.1% |
176.0 | 1 | < 0.1% |
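The 20 rows with 평균값 = -9999 drive the negative mean, the large standard deviation, and the extreme skewness reported above. Treating -9999 as a missing-value sentinel is an assumption here (the source does not document it), and both helper names below are hypothetical. Under that assumption, the mean can be recomputed from the aggregates already in this report, without access to the raw rows:

```python
SENTINEL = -9999.0  # assumed missing-value marker; not confirmed by the source


def mask_sentinel(values, sentinel=SENTINEL):
    """Replace sentinel readings with None so later steps can skip them."""
    return [None if v == sentinel else v for v in values]


def mean_without_sentinel(total_sum, n_rows, n_sentinel, sentinel=SENTINEL):
    """Recompute the mean from report aggregates after dropping sentinel rows."""
    return (total_sum - n_sentinel * sentinel) / (n_rows - n_sentinel)


# Sum, row count, and sentinel count are the figures listed for 평균값 in this report.
adjusted_mean = mean_without_sentinel(total_sum=-45342.55, n_rows=10000, n_sentinel=20)
print(round(adjusted_mean, 2))

# Masking a small illustrative list of readings (values chosen for demonstration only).
print(mask_sentinel([0.005, -9999.0, 22.0]))
```

With the sentinel rows excluded the mean moves from -4.53 to roughly 15.49. The two remaining negative readings (-18 and -12) are left in place, since nothing in the report indicates whether they are valid.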
측정기 상태 (instrument status)
Categorical
Alerts: High correlation, Imbalance
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9843 |
---|---|
1 | 71 |
9 | 42 |
8 | 25 |
4 | 19 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 1 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9843 | 98.4% |
1 | 71 | 0.7% |
9 | 42 | 0.4% |
8 | 25 | 0.2% |
4 | 19 | 0.2% |
국가 기준초과 구분 (national standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9442 |
---|---|
1 | 558 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9442 | 94.4% |
1 | 558 | 5.6% |
지자체 기준초과 구분 (local government standard exceedance flag)
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9442 |
---|---|
1 | 558 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9442 | 94.4% |
1 | 558 | 5.6% |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.028 | NaN | 0.105 | 0.368 | 0.368 |
측정소 코드 | 0.000 | 1.000 | 0.000 | NaN | 0.177 | 0.041 | 0.041 |
측정항목 | 0.028 | 0.000 | 1.000 | NaN | 0.065 | 0.505 | 0.505 |
평균값 | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN |
측정기 상태 | 0.105 | 0.177 | 0.065 | NaN | 1.000 | 0.038 | 0.038 |
국가 기준초과 구분 | 0.368 | 0.041 | 0.505 | NaN | 0.038 | 1.000 | 1.000 |
지자체 기준초과 구분 | 0.368 | 0.041 | 0.505 | NaN | 0.038 | 1.000 | 1.000 |
Variable | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|
측정기 상태 | 1.000 | 0.047 | 0.047 |
국가 기준초과 구분 | 0.047 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.047 | 0.999 | 1.000 |
Variable | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
측정일시 | 1.000 | 0.015 | 0.012 | 0.027 | 0.044 | 0.281 | 0.281 |
측정소 코드 | 0.015 | 1.000 | -0.022 | -0.010 | 0.074 | 0.031 | 0.031 |
측정항목 | 0.012 | -0.022 | 1.000 | 0.727 | 0.044 | 0.365 | 0.365 |
평균값 | 0.027 | -0.010 | 0.727 | 1.000 | 0.882 | 0.000 | 0.000 |
측정기 상태 | 0.044 | 0.074 | 0.044 | 0.882 | 1.000 | 0.047 | 0.047 |
국가 기준초과 구분 | 0.281 | 0.031 | 0.365 | 0.000 | 0.047 | 1.000 | 0.999 |
지자체 기준초과 구분 | 0.281 | 0.031 | 0.365 | 0.000 | 0.047 | 0.999 | 1.000 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
4988 | 2017010209 | 107 | 5 | 0.7 | 0 | 0 | 0 |
30917 | 2017010914 | 103 | 9 | 0.0 | 1 | 0 | 0 |
26646 | 2017010809 | 117 | 1 | 0.005 | 0 | 0 | 0 |
4905 | 2017010208 | 118 | 6 | 0.006 | 0 | 0 | 0 |
50517 | 2017011500 | 120 | 6 | 0.023 | 0 | 0 | 0 |
30941 | 2017010914 | 107 | 9 | 50.0 | 0 | 0 | 0 |
30558 | 2017010911 | 119 | 1 | 0.005 | 0 | 0 | 0 |
5966 | 2017010215 | 120 | 5 | 4.1 | 1 | 0 | 0 |
5651 | 2017010213 | 117 | 9 | 85.0 | 0 | 1 | 1 |
53067 | 2017011517 | 120 | 6 | 0.021 | 0 | 0 | 0 |
Row index | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 |
---|---|---|---|---|---|---|---|
37986 | 2017011113 | 107 | 1 | 0.005 | 0 | 0 | 0 |
39179 | 2017011121 | 105 | 9 | 22.0 | 0 | 0 | 0 |
2064 | 2017010113 | 120 | 1 | 0.006 | 0 | 0 | 0 |
47420 | 2017011404 | 104 | 5 | 1.0 | 0 | 0 | 0 |
68507 | 2017012000 | 118 | 9 | 59.0 | 0 | 1 | 1 |
61154 | 2017011723 | 118 | 5 | 1.2 | 0 | 0 | 0 |
3220 | 2017010121 | 112 | 8 | 84.0 | 0 | 0 | 0 |
9124 | 2017010312 | 121 | 8 | 59.0 | 0 | 0 | 0 |
31749 | 2017010919 | 117 | 6 | 0.02 | 0 | 0 | 0 |
30042 | 2017010908 | 108 | 1 | 0.005 | 0 | 0 | 0 |