Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 3 |
Duplicate rows (%) | 3.0% |
Total size in memory | 2.7 KiB |
Average record size in memory | 27.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국수자원공사 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=116960f0-1fe6-11eb-8adf-f5453fd1d47b |
Dataset has 3 (3.0%) duplicate rows | Duplicates |
공급량 is highly overall correlated with 시설명 | High correlation |
시설명 is highly overall correlated with 공급량 | High correlation |
Reproduction
Analysis started | 2023-12-10 13:10:08.653055 |
---|---|
Analysis finished | 2023-12-10 13:10:09.659687 |
Duration | 1.01 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시설명
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
갑천가압장 | |
---|---|
강진가압장 | |
계룡가압장 | |
광명가압장 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 갑천가압장 |
---|---|
2nd row | 갑천가압장 |
3rd row | 갑천가압장 |
4th row | 갑천가압장 |
5th row | 갑천가압장 |
Common Values
Value | Count | Frequency (%) |
갑천가압장 | 30 | |
강진가압장 | 30 | |
계룡가압장 | 30 | |
광명가압장 | 10 | 10.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
갑천가압장 | 30 | |
강진가압장 | 30 | |
계룡가압장 | 30 | |
광명가압장 | 10 | 10.0% |
공급날짜
Real number (ℝ)
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20201115 |
Minimum | 20201101 |
---|---|
Maximum | 20201130 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20201101 |
---|---|
5-th percentile | 20201102 |
Q1 | 20201108 |
median | 20201115 |
Q3 | 20201122 |
95-th percentile | 20201129 |
Maximum | 20201130 |
Range | 29 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 8.4622215 |
---|---|
Coefficient of variation (CV) | 4.1889873 × 10-7 |
Kurtosis | -1.1399752 |
Mean | 20201115 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.089331804 |
Sum | 2.0201115 × 109 |
Variance | 71.609192 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20201109 | 5 | 5.0% |
20201116 | 5 | 5.0% |
20201108 | 5 | 5.0% |
20201105 | 4 | 4.0% |
20201110 | 4 | 4.0% |
20201120 | 4 | 4.0% |
20201117 | 4 | 4.0% |
20201126 | 3 | 3.0% |
20201103 | 3 | 3.0% |
20201123 | 3 | 3.0% |
Other values (20) | 60 |
Value | Count | Frequency (%) |
20201101 | 3 | |
20201102 | 3 | |
20201103 | 3 | |
20201104 | 3 | |
20201105 | 4 | |
20201106 | 3 | |
20201107 | 3 | |
20201108 | 5 | |
20201109 | 5 | |
20201110 | 4 |
Value | Count | Frequency (%) |
20201130 | 3 | |
20201129 | 3 | |
20201128 | 3 | |
20201127 | 3 | |
20201126 | 3 | |
20201125 | 3 | |
20201124 | 3 | |
20201123 | 3 | |
20201122 | 3 | |
20201121 | 3 |
공급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 95 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 96049.51 |
Minimum | 972 |
---|---|
Maximum | 385103 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 972 |
---|---|
5-th percentile | 1090.9 |
Q1 | 1297.25 |
median | 64860 |
Q3 | 131805 |
95-th percentile | 365022.25 |
Maximum | 385103 |
Range | 384131 |
Interquartile range (IQR) | 130507.75 |
Descriptive statistics
Standard deviation | 104903.9 |
---|---|
Coefficient of variation (CV) | 1.0921857 |
Kurtosis | 2.1536454 |
Mean | 96049.51 |
Median Absolute Deviation (MAD) | 63701 |
Skewness | 1.615181 |
Sum | 9604951 |
Variance | 1.1004829 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1159 | 2 | 2.0% |
380502 | 2 | 2.0% |
364068 | 2 | 2.0% |
354831 | 2 | 2.0% |
132408 | 2 | 2.0% |
1179 | 1 | 1.0% |
62910 | 1 | 1.0% |
64500 | 1 | 1.0% |
62060 | 1 | 1.0% |
65440 | 1 | 1.0% |
Other values (85) | 85 |
Value | Count | Frequency (%) |
972 | 1 | |
984 | 1 | |
1082 | 1 | |
1086 | 1 | |
1089 | 1 | |
1091 | 1 | |
1097 | 1 | |
1098 | 1 | |
1104 | 1 | |
1113 | 1 |
Value | Count | Frequency (%) |
385103 | 1 | |
383771 | 1 | |
380502 | 2 | |
365122 | 1 | |
365017 | 1 | |
364068 | 2 | |
354831 | 2 | |
137032 | 1 | |
136872 | 1 | |
136216 | 1 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
시설명 | 1.000 | 0.000 | 1.000 |
공급날짜 | 0.000 | 1.000 | 0.000 |
공급량 | 1.000 | 0.000 | 1.000 |
공급날짜 | 공급량 | 시설명 | |
---|---|---|---|
공급날짜 | 1.000 | -0.093 | 0.000 |
공급량 | -0.093 | 1.000 | 1.000 |
시설명 | 0.000 | 1.000 | 1.000 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
0 | 갑천가압장 | 20201122 | 1179 |
1 | 갑천가압장 | 20201101 | 1160 |
2 | 갑천가압장 | 20201105 | 1159 |
3 | 갑천가압장 | 20201108 | 1280 |
4 | 갑천가압장 | 20201115 | 1363 |
5 | 갑천가압장 | 20201110 | 1113 |
6 | 갑천가압장 | 20201113 | 1303 |
7 | 갑천가압장 | 20201121 | 1246 |
8 | 갑천가압장 | 20201107 | 1342 |
9 | 갑천가압장 | 20201124 | 972 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
90 | 광명가압장 | 20201109 | 354831 |
91 | 광명가압장 | 20201120 | 365017 |
92 | 광명가압장 | 20201108 | 364068 |
93 | 광명가압장 | 20201109 | 354831 |
94 | 광명가압장 | 20201110 | 365122 |
95 | 광명가압장 | 20201108 | 364068 |
96 | 광명가압장 | 20201116 | 380502 |
97 | 광명가압장 | 20201105 | 383771 |
98 | 광명가압장 | 20201117 | 385103 |
99 | 광명가압장 | 20201116 | 380502 |
Most frequently occurring
시설명 | 공급날짜 | 공급량 | # duplicates | |
---|---|---|---|---|
0 | 광명가압장 | 20201108 | 364068 | 2 |
1 | 광명가압장 | 20201109 | 354831 | 2 |
2 | 광명가압장 | 20201116 | 380502 | 2 |