Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 2 |
Duplicate rows (%) | 2.0% |
Total size in memory | 2.7 KiB |
Average record size in memory | 27.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국수자원공사 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=116960f0-1fe6-11eb-8adf-f5453fd1d47b |
Dataset has 2 (2.0%) duplicate rows | Duplicates |
공급량 is highly overall correlated with 시설명 | High correlation |
시설명 is highly overall correlated with 공급량 | High correlation |
Reproduction
Analysis started | 2023-12-10 13:09:53.760702 |
---|---|
Analysis finished | 2023-12-10 13:09:54.751815 |
Duration | 0.99 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시설명
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
갑천가압장 | |
---|---|
강진가압장 | |
계룡가압장 | |
광명가압장 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 갑천가압장 |
---|---|
2nd row | 갑천가압장 |
3rd row | 갑천가압장 |
4th row | 갑천가압장 |
5th row | 갑천가압장 |
Common Values
Value | Count | Frequency (%) |
갑천가압장 | 31 | |
강진가압장 | 31 | |
계룡가압장 | 29 | |
광명가압장 | 9 | 9.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
갑천가압장 | 31 | |
강진가압장 | 31 | |
계룡가압장 | 29 | |
광명가압장 | 9 | 9.0% |
공급날짜
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20210315 |
Minimum | 20210301 |
---|---|
Maximum | 20210331 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20210301 |
---|---|
5-th percentile | 20210302 |
Q1 | 20210308 |
median | 20210314 |
Q3 | 20210323 |
95-th percentile | 20210330 |
Maximum | 20210331 |
Range | 30 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 8.8721275 |
---|---|
Coefficient of variation (CV) | 4.3899006 × 10-7 |
Kurtosis | -1.1627818 |
Mean | 20210315 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0.12730589 |
Sum | 2.0210315 × 109 |
Variance | 78.714646 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20210309 | 5 | 5.0% |
20210314 | 5 | 5.0% |
20210320 | 4 | 4.0% |
20210310 | 4 | 4.0% |
20210307 | 4 | 4.0% |
20210311 | 4 | 4.0% |
20210303 | 4 | 4.0% |
20210329 | 3 | 3.0% |
20210321 | 3 | 3.0% |
20210327 | 3 | 3.0% |
Other values (21) | 61 |
Value | Count | Frequency (%) |
20210301 | 3 | |
20210302 | 3 | |
20210303 | 4 | |
20210304 | 3 | |
20210305 | 3 | |
20210306 | 3 | |
20210307 | 4 | |
20210308 | 3 | |
20210309 | 5 | |
20210310 | 4 |
Value | Count | Frequency (%) |
20210331 | 3 | |
20210330 | 3 | |
20210329 | 3 | |
20210328 | 3 | |
20210327 | 3 | |
20210326 | 3 | |
20210325 | 3 | |
20210324 | 3 | |
20210323 | 2 | |
20210322 | 3 |
공급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 96 |
---|---|
Distinct (%) | 96.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 97169.18 |
Minimum | 1086 |
---|---|
Maximum | 439762 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1086 |
---|---|
5-th percentile | 1148.85 |
Q1 | 1322 |
median | 64380 |
Q3 | 134062 |
95-th percentile | 398236 |
Maximum | 439762 |
Range | 438676 |
Interquartile range (IQR) | 132740 |
Descriptive statistics
Standard deviation | 112115.65 |
---|---|
Coefficient of variation (CV) | 1.153819 |
Kurtosis | 3.0558211 |
Mean | 97169.18 |
Median Absolute Deviation (MAD) | 63197.5 |
Skewness | 1.8377523 |
Sum | 9716918 |
Variance | 1.2569919 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1149 | 2 | 2.0% |
1249 | 2 | 2.0% |
419871 | 2 | 2.0% |
397743 | 2 | 2.0% |
60100 | 1 | 1.0% |
63880 | 1 | 1.0% |
61830 | 1 | 1.0% |
65580 | 1 | 1.0% |
68060 | 1 | 1.0% |
63230 | 1 | 1.0% |
Other values (86) | 86 |
Value | Count | Frequency (%) |
1086 | 1 | |
1118 | 1 | |
1137 | 1 | |
1142 | 1 | |
1146 | 1 | |
1149 | 2 | |
1158 | 1 | |
1176 | 1 | |
1179 | 1 | |
1186 | 1 |
Value | Count | Frequency (%) |
439762 | 1 | |
427151 | 1 | |
419871 | 2 | |
407603 | 1 | |
397743 | 2 | |
395453 | 1 | |
385936 | 1 | |
143912 | 1 | |
141336 | 1 | |
140352 | 1 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
시설명 | 1.000 | 0.000 | 1.000 |
공급날짜 | 0.000 | 1.000 | 0.000 |
공급량 | 1.000 | 0.000 | 1.000 |
공급날짜 | 공급량 | 시설명 | |
---|---|---|---|
공급날짜 | 1.000 | -0.194 | 0.000 |
공급량 | -0.194 | 1.000 | 0.990 |
시설명 | 0.000 | 0.990 | 1.000 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
0 | 갑천가압장 | 20210329 | 1149 |
1 | 갑천가압장 | 20210318 | 1142 |
2 | 갑천가압장 | 20210306 | 1249 |
3 | 갑천가압장 | 20210323 | 1200 |
4 | 갑천가압장 | 20210313 | 1352 |
5 | 갑천가압장 | 20210304 | 1334 |
6 | 갑천가압장 | 20210330 | 1118 |
7 | 갑천가압장 | 20210312 | 1227 |
8 | 갑천가압장 | 20210324 | 1146 |
9 | 갑천가압장 | 20210301 | 1348 |
시설명 | 공급날짜 | 공급량 | |
---|---|---|---|
90 | 계룡가압장 | 20210321 | 60910 |
91 | 광명가압장 | 20210310 | 407603 |
92 | 광명가압장 | 20210311 | 395453 |
93 | 광명가압장 | 20210309 | 397743 |
94 | 광명가압장 | 20210309 | 397743 |
95 | 광명가압장 | 20210303 | 439762 |
96 | 광명가압장 | 20210307 | 427151 |
97 | 광명가압장 | 20210314 | 419871 |
98 | 광명가압장 | 20210314 | 419871 |
99 | 광명가압장 | 20210320 | 385936 |
Most frequently occurring
시설명 | 공급날짜 | 공급량 | # duplicates | |
---|---|---|---|---|
0 | 광명가압장 | 20210309 | 397743 | 2 |
1 | 광명가압장 | 20210314 | 419871 | 2 |