Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.1 KiB |
Average record size in memory | 62.3 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 5 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 노바코스 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=6c5049e0-2e4e-11eb-8f72-932712f5aa3c |
측정일 has constant value "" | Constant |
강수량(mm) has constant value "" | Constant |
강우량(mm) has constant value "" | Constant |
기본키 is highly overall correlated with 측정시간 and 2 other fields | High correlation |
측정시간 is highly overall correlated with 기본키 | High correlation |
지점 is highly overall correlated with 기본키 and 1 other fields | High correlation |
주소 is highly overall correlated with 기본키 and 1 other fields | High correlation |
지점 is highly imbalanced (71.4%) | Imbalance |
주소 is highly imbalanced (71.4%) | Imbalance |
기본키 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:42:43.235733 |
---|---|
Analysis finished | 2023-12-10 13:42:44.569917 |
Duration | 1.33 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기본키
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
측정일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20210101 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20210101 |
---|---|
2nd row | 20210101 |
3rd row | 20210101 |
4th row | 20210101 |
5th row | 20210101 |
Common Values
Value | Count | Frequency (%) |
20210101 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20210101 | 100 |
측정시간
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 95 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1128.65 |
Minimum | 15 |
---|---|
Maximum | 2345 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 45 |
Q1 | 511.25 |
median | 1122.5 |
Q3 | 1733.75 |
95-th percentile | 2230.75 |
Maximum | 2345 |
Range | 2330 |
Interquartile range (IQR) | 1222.5 |
Descriptive statistics
Standard deviation | 715.23742 |
---|---|
Coefficient of variation (CV) | 0.63371056 |
Kurtosis | -1.2427915 |
Mean | 1128.65 |
Median Absolute Deviation (MAD) | 615 |
Skewness | 0.039182625 |
Sum | 112865 |
Variance | 511564.57 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15 | 2 | 2.0% |
45 | 2 | 2.0% |
100 | 2 | 2.0% |
115 | 2 | 2.0% |
30 | 2 | 2.0% |
1530 | 1 | 1.0% |
1745 | 1 | 1.0% |
1730 | 1 | 1.0% |
1715 | 1 | 1.0% |
1700 | 1 | 1.0% |
Other values (85) | 85 |
Value | Count | Frequency (%) |
15 | 2 | |
30 | 2 | |
45 | 2 | |
100 | 2 | |
115 | 2 | |
130 | 1 | |
145 | 1 | |
200 | 1 | |
215 | 1 | |
230 | 1 |
Value | Count | Frequency (%) |
2345 | 1 | |
2330 | 1 | |
2315 | 1 | |
2300 | 1 | |
2245 | 1 | |
2230 | 1 | |
2215 | 1 | |
2200 | 1 | |
2145 | 1 | |
2130 | 1 |
지점
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
A-0010-1185E-8 | |
---|---|
A-0010-3019E-6 | 5 |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A-0010-1185E-8 |
---|---|
2nd row | A-0010-1185E-8 |
3rd row | A-0010-1185E-8 |
4th row | A-0010-1185E-8 |
5th row | A-0010-1185E-8 |
Common Values
Value | Count | Frequency (%) |
A-0010-1185E-8 | 95 | |
A-0010-3019E-6 | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a-0010-1185e-8 | 95 | |
a-0010-3019e-6 | 5 | 5.0% |
주소
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
대구 동구 안심3동 | |
---|---|
충북 청주시 흥덕구 강서1동 | 5 |
Length
Max length | 15 |
---|---|
Median length | 10 |
Mean length | 10.25 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 대구 동구 안심3동 |
---|---|
2nd row | 대구 동구 안심3동 |
3rd row | 대구 동구 안심3동 |
4th row | 대구 동구 안심3동 |
5th row | 대구 동구 안심3동 |
Common Values
Value | Count | Frequency (%) |
대구 동구 안심3동 | 95 | |
충북 청주시 흥덕구 강서1동 | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
대구 | 95 | |
동구 | 95 | |
안심3동 | 95 | |
충북 | 5 | 1.6% |
청주시 | 5 | 1.6% |
흥덕구 | 5 | 1.6% |
강서1동 | 5 | 1.6% |
강수량(mm)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
강우량(mm)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
기본키 | 측정시간 | 지점 | 주소 | |
---|---|---|---|---|
기본키 | 1.000 | 0.977 | 0.816 | 0.816 |
측정시간 | 0.977 | 1.000 | 0.585 | 0.585 |
지점 | 0.816 | 0.585 | 1.000 | 0.986 |
주소 | 0.816 | 0.585 | 0.986 | 1.000 |
주소 | 지점 | |
---|---|---|
주소 | 1.000 | 0.894 |
지점 | 0.894 | 1.000 |
기본키 | 측정시간 | 지점 | 주소 | |
---|---|---|---|---|
기본키 | 1.000 | 0.729 | 0.622 | 0.622 |
측정시간 | 0.729 | 1.000 | 0.433 | 0.433 |
지점 | 0.622 | 0.433 | 1.000 | 0.894 |
주소 | 0.622 | 0.433 | 0.894 | 1.000 |
기본키 | 측정일 | 측정시간 | 지점 | 주소 | 강수량(mm) | 강우량(mm) | |
---|---|---|---|---|---|---|---|
0 | 1 | 20210101 | 15 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
1 | 2 | 20210101 | 30 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
2 | 3 | 20210101 | 45 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
3 | 4 | 20210101 | 100 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
4 | 5 | 20210101 | 115 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
5 | 6 | 20210101 | 130 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
6 | 7 | 20210101 | 145 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
7 | 8 | 20210101 | 200 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
8 | 9 | 20210101 | 215 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
9 | 10 | 20210101 | 230 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
기본키 | 측정일 | 측정시간 | 지점 | 주소 | 강수량(mm) | 강우량(mm) | |
---|---|---|---|---|---|---|---|
90 | 91 | 20210101 | 2245 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
91 | 92 | 20210101 | 2300 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
92 | 93 | 20210101 | 2315 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
93 | 94 | 20210101 | 2330 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
94 | 95 | 20210101 | 2345 | A-0010-1185E-8 | 대구 동구 안심3동 | 0 | 0 |
95 | 96 | 20210101 | 15 | A-0010-3019E-6 | 충북 청주시 흥덕구 강서1동 | 0 | 0 |
96 | 97 | 20210101 | 30 | A-0010-3019E-6 | 충북 청주시 흥덕구 강서1동 | 0 | 0 |
97 | 98 | 20210101 | 45 | A-0010-3019E-6 | 충북 청주시 흥덕구 강서1동 | 0 | 0 |
98 | 99 | 20210101 | 100 | A-0010-3019E-6 | 충북 청주시 흥덕구 강서1동 | 0 | 0 |
99 | 100 | 20210101 | 115 | A-0010-3019E-6 | 충북 청주시 흥덕구 강서1동 | 0 | 0 |