Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 693.4 KiB |
Average record size in memory | 71.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do |
국가 기준초과 구분 has constant value "" | Constant |
지자체 기준초과 구분 has constant value "" | Constant |
측정항목 is highly overall correlated with 평균값 and 1 other fields | High correlation |
평균값 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
측정기 상태 is highly overall correlated with 측정항목 and 1 other fields | High correlation |
평균값 has 507 (5.1%) zeros | Zeros |
측정기 상태 has 5950 (59.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-07-27 00:30:02.799487 |
---|---|
Analysis finished | 2024-07-27 00:30:13.906772 |
Duration | 11.11 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
측정일시
Real number (ℝ)
Distinct | 2076 |
---|---|
Distinct (%) | 20.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9890212 × 109 |
Minimum | 1.9890101 × 109 |
---|---|
Maximum | 1.9890328 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9890101 × 109 |
---|---|
5-th percentile | 1.9890105 × 109 |
Q1 | 1.9890122 × 109 |
median | 1.9890213 × 109 |
Q3 | 1.9890307 × 109 |
95-th percentile | 1.9890324 × 109 |
Maximum | 1.9890328 × 109 |
Range | 22719 |
Interquartile range (IQR) | 18489 |
Descriptive statistics
Standard deviation | 8200.7491 |
---|---|
Coefficient of variation (CV) | 4.1230075 × 10-6 |
Kurtosis | -1.486408 |
Mean | 1.9890212 × 109 |
Median Absolute Deviation (MAD) | 9208 |
Skewness | 0.067166008 |
Sum | 1.9890212 × 1013 |
Variance | 67252286 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
1989031016 | 14 | 0.1% |
1989032016 | 13 | 0.1% |
1989031810 | 12 | 0.1% |
1989012111 | 12 | 0.1% |
1989022403 | 12 | 0.1% |
1989012119 | 11 | 0.1% |
1989031607 | 11 | 0.1% |
1989011901 | 11 | 0.1% |
1989010705 | 11 | 0.1% |
1989021220 | 11 | 0.1% |
Other values (2066) | 9882 |
Value | Count | Frequency (%) |
1989010100 | 7 | |
1989010101 | 6 | |
1989010102 | 3 | |
1989010103 | 3 | |
1989010104 | 6 | |
1989010105 | 6 | |
1989010106 | 7 | |
1989010107 | 5 | |
1989010108 | 5 | |
1989010109 | 2 | < 0.1% |
Value | Count | Frequency (%) |
1989032819 | 2 | < 0.1% |
1989032818 | 4 | < 0.1% |
1989032817 | 5 | |
1989032816 | 6 | |
1989032815 | 10 | |
1989032814 | 8 | |
1989032813 | 8 | |
1989032812 | 7 | |
1989032811 | 5 | |
1989032810 | 4 | < 0.1% |
측정소 코드
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112.388 |
Minimum | 103 |
---|---|
Maximum | 124 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 103 |
---|---|
5-th percentile | 103 |
Q1 | 105 |
median | 108 |
Q3 | 122 |
95-th percentile | 124 |
Maximum | 124 |
Range | 21 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 7.4461534 |
---|---|
Coefficient of variation (CV) | 0.06625399 |
Kurtosis | -1.3964105 |
Mean | 112.388 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.33006119 |
Sum | 1123880 |
Variance | 55.445201 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
Value | Count | Frequency (%) |
122 | 1293 | |
103 | 1264 | |
105 | 1261 | |
107 | 1256 | |
117 | 1246 | |
124 | 1243 | |
108 | 1230 | |
113 | 1207 |
Value | Count | Frequency (%) |
103 | 1264 | |
105 | 1261 | |
107 | 1256 | |
108 | 1230 | |
113 | 1207 | |
117 | 1246 | |
122 | 1293 | |
124 | 1243 |
Value | Count | Frequency (%) |
124 | 1243 | |
122 | 1293 | |
117 | 1246 | |
113 | 1207 | |
108 | 1230 | |
107 | 1256 | |
105 | 1261 | |
103 | 1264 |
측정항목
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.3214 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.7535904 |
---|---|
Coefficient of variation (CV) | 0.51745601 |
Kurtosis | -1.2151839 |
Mean | 5.3214 |
Median Absolute Deviation (MAD) | 2.5 |
Skewness | -0.19625573 |
Sum | 53214 |
Variance | 7.5822603 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%) |
3 | 1682 | |
1 | 1678 | |
9 | 1675 | |
5 | 1669 | |
6 | 1649 | |
8 | 1647 |
Value | Count | Frequency (%) |
1 | 1678 | |
3 | 1682 | |
5 | 1669 | |
6 | 1649 | |
8 | 1647 | |
9 | 1675 |
Value | Count | Frequency (%) |
9 | 1675 | |
8 | 1647 | |
6 | 1649 | |
5 | 1669 | |
3 | 1682 | |
1 | 1678 |
평균값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 503 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -3338.1404 |
Minimum | -9999 |
---|---|
Maximum | 34.8 |
Zeros | 507 |
Zeros (%) | 5.1% |
Negative | 4048 |
Negative (%) | 40.5% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -9999 |
---|---|
5-th percentile | -9999 |
Q1 | -9999 |
median | 0.006 |
Q3 | 0.067 |
95-th percentile | 4.5 |
Maximum | 34.8 |
Range | 10033.8 |
Interquartile range (IQR) | 9999.067 |
Descriptive statistics
Standard deviation | 4699.8927 |
---|---|
Coefficient of variation (CV) | -1.4079374 |
Kurtosis | -1.4928678 |
Mean | -3338.1404 |
Median Absolute Deviation (MAD) | 2.394 |
Skewness | -0.71044253 |
Sum | -33381404 |
Variance | 22088991 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
-9999.0 | 3322 | |
-9.999 | 556 | 5.6% |
0.0 | 507 | 5.1% |
-999.9 | 166 | 1.7% |
0.001 | 146 | 1.5% |
0.002 | 93 | 0.9% |
0.003 | 71 | 0.7% |
0.005 | 62 | 0.6% |
0.016 | 61 | 0.6% |
0.01 | 61 | 0.6% |
Other values (493) | 4955 |
Value | Count | Frequency (%) |
-9999.0 | 3322 | |
-999.9 | 166 | 1.7% |
-10.051 | 1 | < 0.1% |
-10.002 | 1 | < 0.1% |
-9.999 | 556 | 5.6% |
-0.025 | 1 | < 0.1% |
-0.007 | 1 | < 0.1% |
0.0 | 507 | 5.1% |
0.001 | 146 | 1.5% |
0.002 | 93 | 0.9% |
Value | Count | Frequency (%) |
34.8 | 1 | |
30.8 | 1 | |
29.8 | 1 | |
25.1 | 1 | |
24.4 | 1 | |
23.8 | 1 | |
22.9 | 2 | |
22.6 | 1 | |
22.0 | 1 | |
21.6 | 1 |
측정기 상태
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.5899 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 5950 |
Zeros (%) | 59.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.0550281 |
---|---|
Coefficient of variation (CV) | 1.2925518 |
Kurtosis | -0.021434708 |
Mean | 1.5899 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.8737231 |
Sum | 15899 |
Variance | 4.2231403 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%) |
0 | 5950 | |
4 | 3466 | |
2 | 423 | 4.2% |
9 | 118 | 1.2% |
1 | 31 | 0.3% |
8 | 12 | 0.1% |
Value | Count | Frequency (%) |
0 | 5950 | |
1 | 31 | 0.3% |
2 | 423 | 4.2% |
4 | 3466 | |
8 | 12 | 0.1% |
9 | 118 | 1.2% |
Value | Count | Frequency (%) |
9 | 118 | 1.2% |
8 | 12 | 0.1% |
4 | 3466 | |
2 | 423 | 4.2% |
1 | 31 | 0.3% |
0 | 5950 |
국가 기준초과 구분
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 10000 |
지자체 기준초과 구분
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 10000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | |
---|---|---|---|---|---|
측정일시 | 1.000 | 0.000 | 0.018 | 0.106 | 0.141 |
측정소 코드 | 0.000 | 1.000 | 0.019 | 0.087 | 0.273 |
측정항목 | 0.018 | 0.019 | 1.000 | 0.411 | 0.823 |
평균값 | 0.106 | 0.087 | 0.411 | 1.000 | 0.664 |
측정기 상태 | 0.141 | 0.273 | 0.823 | 0.664 | 1.000 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | |
---|---|---|---|---|---|
측정일시 | 1.000 | 0.009 | -0.003 | 0.039 | -0.037 |
측정소 코드 | 0.009 | 1.000 | -0.008 | 0.015 | -0.027 |
측정항목 | -0.003 | -0.008 | 1.000 | -0.758 | 0.724 |
평균값 | 0.039 | 0.015 | -0.758 | 1.000 | -0.867 |
측정기 상태 | -0.037 | -0.027 | 0.724 | -0.867 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
23300 | 1989012105 | 108 | 5 | 3.3 | 0 | 0 | 0 |
73099 | 1989030510 | 124 | 3 | 0.021 | 0 | 0 | 0 |
46277 | 1989021004 | 103 | 9 | -9999.0 | 4 | 0 | 0 |
69096 | 1989030123 | 113 | 1 | 0.071 | 0 | 0 | 0 |
94120 | 1989032316 | 122 | 8 | -9999.0 | 4 | 0 | 0 |
89142 | 1989031909 | 105 | 1 | 0.254 | 0 | 0 | 0 |
99633 | 1989032811 | 117 | 6 | 0.01 | 0 | 0 | 0 |
45629 | 1989020914 | 113 | 9 | -9999.0 | 4 | 0 | 0 |
2776 | 1989010309 | 122 | 8 | -9999.0 | 4 | 0 | 0 |
34914 | 1989013107 | 108 | 1 | 0.088 | 0 | 0 | 0 |
측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분 | |
---|---|---|---|---|---|---|---|
24654 | 1989012209 | 117 | 1 | 0.136 | 0 | 0 | 0 |
13521 | 1989011217 | 117 | 6 | 0.0 | 0 | 0 | 0 |
48490 | 1989021202 | 105 | 8 | -9999.0 | 4 | 0 | 0 |
8088 | 1989010800 | 113 | 1 | -9.999 | 2 | 0 | 0 |
92664 | 1989032210 | 113 | 1 | 0.038 | 0 | 0 | 0 |
64338 | 1989022520 | 108 | 1 | 0.022 | 0 | 0 | 0 |
16667 | 1989011511 | 105 | 9 | -9999.0 | 4 | 0 | 0 |
77957 | 1989030916 | 103 | 9 | -9999.0 | 4 | 0 | 0 |
86864 | 1989031709 | 117 | 5 | 3.1 | 0 | 0 | 0 |
12303 | 1989011116 | 107 | 6 | 0.008 | 0 | 0 | 0 |