Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 4 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국수자원공사 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=cd5119d0-40d8-11eb-bdda-25af7f339cc7 |
시군명 has constant value "" | Constant |
금년(이번년) is highly overall correlated with 평균값 | High correlation |
평균값 is highly overall correlated with 금년(이번년) and 1 other fields | High correlation |
최고값 is highly overall correlated with 평균값 and 1 other fields | High correlation |
시도명 is highly overall correlated with 최고값 | High correlation |
최소값 is highly imbalanced (71.4%) | Imbalance |
금년(이번년) has unique values | Unique |
평균값 has unique values | Unique |
최고값 has unique values | Unique |
Reproduction
Analysis started | 2024-04-21 09:31:02.637033 |
---|---|
Analysis finished | 2024-04-21 09:31:07.969479 |
Duration | 5.33 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도명
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 928.0 B |
경기도 | |
---|---|
경상북도 | |
강원도 | |
경상남도 | |
전라남도 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.51 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 강원도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
경기도 | 31 | |
경상북도 | 23 | |
강원도 | 18 | |
경상남도 | 18 | |
전라남도 | 10 | 10.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
경기도 | 31 | |
경상북도 | 23 | |
강원도 | 18 | |
경상남도 | 18 | |
전라남도 | 10 | 10.0% |
시군명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 928.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
코드
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 928.0 B |
Value | Count | Frequency (%) |
고성군 | 2 | 2.0% |
경산시 | 1 | 1.0% |
김해시 | 1 | 1.0% |
영천시 | 1 | 1.0% |
영주시 | 1 | 1.0% |
구미시 | 1 | 1.0% |
안동시 | 1 | 1.0% |
경주시 | 1 | 1.0% |
포항시 | 1 | 1.0% |
김천시 | 1 | 1.0% |
Other values (89) | 89 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 54 | |
군 | 49 | 16.2% |
천 | 14 | 4.6% |
양 | 12 | 4.0% |
주 | 11 | 3.6% |
성 | 9 | 3.0% |
영 | 8 | 2.6% |
안 | 6 | 2.0% |
산 | 5 | 1.7% |
남 | 5 | 1.7% |
Other values (75) | 130 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 303 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 54 | |
군 | 49 | 16.2% |
천 | 14 | 4.6% |
양 | 12 | 4.0% |
주 | 11 | 3.6% |
성 | 9 | 3.0% |
영 | 8 | 2.6% |
안 | 6 | 2.0% |
산 | 5 | 1.7% |
남 | 5 | 1.7% |
Other values (75) | 130 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 303 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 54 | |
군 | 49 | 16.2% |
천 | 14 | 4.6% |
양 | 12 | 4.0% |
주 | 11 | 3.6% |
성 | 9 | 3.0% |
영 | 8 | 2.6% |
안 | 6 | 2.0% |
산 | 5 | 1.7% |
남 | 5 | 1.7% |
Other values (75) | 130 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 303 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 54 | |
군 | 49 | 16.2% |
천 | 14 | 4.6% |
양 | 12 | 4.0% |
주 | 11 | 3.6% |
성 | 9 | 3.0% |
영 | 8 | 2.6% |
안 | 6 | 2.0% |
산 | 5 | 1.7% |
남 | 5 | 1.7% |
Other values (75) | 130 |
금년(이번년)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 567.75896 |
Minimum | 394.16753 |
---|---|
Maximum | 1057.5334 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 394.16753 |
---|---|
5-th percentile | 447.92013 |
Q1 | 490.56751 |
median | 538.48592 |
Q3 | 612.15522 |
95-th percentile | 753.904 |
Maximum | 1057.5334 |
Range | 663.36588 |
Interquartile range (IQR) | 121.5877 |
Descriptive statistics
Standard deviation | 114.8673 |
---|---|
Coefficient of variation (CV) | 0.20231701 |
Kurtosis | 3.9725842 |
Mean | 567.75896 |
Median Absolute Deviation (MAD) | 60.036674 |
Skewness | 1.6864931 |
Sum | 56775.896 |
Variance | 13194.496 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
436.909544501153 | 1 | 1.0% |
738.402991104716 | 1 | 1.0% |
561.185647034184 | 1 | 1.0% |
526.656718381288 | 1 | 1.0% |
496.080554954141 | 1 | 1.0% |
451.89410276433 | 1 | 1.0% |
473.052962901171 | 1 | 1.0% |
611.388216499358 | 1 | 1.0% |
542.545990005092 | 1 | 1.0% |
549.580369736142 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
394.167529031218 | 1 | |
399.455213451457 | 1 | |
424.000876936962 | 1 | |
436.909544501153 | 1 | |
442.602683254561 | 1 | |
448.2 | 1 | |
450.8 | 1 | |
451.89410276433 | 1 | |
453.429372303462 | 1 | |
454.516240996681 | 1 |
Value | Count | Frequency (%) |
1057.53341258812 | 1 | |
974.1 | 1 | |
908.553244981868 | 1 | |
794.355994883617 | 1 | |
778.314634533799 | 1 | |
752.619231158211 | 1 | |
747.1 | 1 | |
738.402991104716 | 1 | |
731.72032836931 | 1 | |
723.118526530247 | 1 |
평균값
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 249.36763 |
Minimum | 184.30518 |
---|---|
Maximum | 462.53973 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 184.30518 |
---|---|
5-th percentile | 200.14504 |
Q1 | 210.69976 |
median | 228.44762 |
Q3 | 270.0229 |
95-th percentile | 372.39134 |
Maximum | 462.53973 |
Range | 278.23455 |
Interquartile range (IQR) | 59.323139 |
Descriptive statistics
Standard deviation | 57.166014 |
---|---|
Coefficient of variation (CV) | 0.22924392 |
Kurtosis | 3.0430164 |
Mean | 249.36763 |
Median Absolute Deviation (MAD) | 21.716372 |
Skewness | 1.7657391 |
Sum | 24936.763 |
Variance | 3267.9531 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
213.158688832852 | 1 | 1.0% |
361.448027686256 | 1 | 1.0% |
244.09909261962 | 1 | 1.0% |
229.046361521852 | 1 | 1.0% |
235.584891469085 | 1 | 1.0% |
204.250464409322 | 1 | 1.0% |
200.199183579118 | 1 | 1.0% |
254.81011457411 | 1 | 1.0% |
250.786064621752 | 1 | 1.0% |
224.703860291813 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
184.305181400191 | 1 | |
193.946299656482 | 1 | |
198.278125 | 1 | |
198.928512682242 | 1 | |
199.116380181653 | 1 | |
200.199183579118 | 1 | |
200.643856219311 | 1 | |
201.921154216219 | 1 | |
203.751479286353 | 1 | |
204.250464409322 | 1 |
Value | Count | Frequency (%) |
462.5397314445 | 1 | |
460.682541737958 | 1 | |
394.119373846713 | 1 | |
381.6796875 | 1 | |
376.724169801442 | 1 | |
372.163297268256 | 1 | |
366.553724983478 | 1 | |
361.448027686256 | 1 | |
344.97767309272 | 1 | |
342.06215431531 | 1 |
최고값
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44861.7 |
Minimum | 41110 |
---|---|
Maximum | 48890 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 41110 |
---|---|
5-th percentile | 41209 |
Q1 | 41625 |
median | 46795 |
Q3 | 47822.5 |
95-th percentile | 48840.5 |
Maximum | 48890 |
Range | 7780 |
Interquartile range (IQR) | 6197.5 |
Descriptive statistics
Standard deviation | 3052.4053 |
---|---|
Coefficient of variation (CV) | 0.068040339 |
Kurtosis | -1.8616898 |
Mean | 44861.7 |
Median Absolute Deviation (MAD) | 2090 |
Skewness | -0.012394933 |
Sum | 4486170 |
Variance | 9317177.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
42770 | 1 | 1.0% |
48240 | 1 | 1.0% |
47250 | 1 | 1.0% |
47230 | 1 | 1.0% |
47210 | 1 | 1.0% |
47190 | 1 | 1.0% |
47170 | 1 | 1.0% |
47130 | 1 | 1.0% |
47110 | 1 | 1.0% |
47150 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
41110 | 1 | |
41130 | 1 | |
41150 | 1 | |
41170 | 1 | |
41190 | 1 | |
41210 | 1 | |
41220 | 1 | |
41250 | 1 | |
41270 | 1 | |
41280 | 1 |
Value | Count | Frequency (%) |
48890 | 1 | |
48880 | 1 | |
48870 | 1 | |
48860 | 1 | |
48850 | 1 | |
48840 | 1 | |
48820 | 1 | |
48740 | 1 | |
48730 | 1 | |
48720 | 1 |
최소값
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 928.0 B |
- | |
---|---|
5 | 5 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 95 | |
5 | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
95 | ||
5 | 5 | 5.0% |
재현기간
Real number (ℝ)
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 364.833 |
Minimum | 264.7 |
---|---|
Maximum | 519.3 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 264.7 |
---|---|
5-th percentile | 279.645 |
Q1 | 328.4 |
median | 371.6 |
Q3 | 400.65 |
95-th percentile | 444.33 |
Maximum | 519.3 |
Range | 254.6 |
Interquartile range (IQR) | 72.25 |
Descriptive statistics
Standard deviation | 53.570588 |
---|---|
Coefficient of variation (CV) | 0.14683592 |
Kurtosis | -0.092912765 |
Mean | 364.833 |
Median Absolute Deviation (MAD) | 36.65 |
Skewness | 0.19623719 |
Sum | 36483.3 |
Variance | 2869.8079 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
401.1 | 2 | 2.0% |
389.5 | 1 | 1.0% |
352.3 | 1 | 1.0% |
333.4 | 1 | 1.0% |
306.2 | 1 | 1.0% |
359.9 | 1 | 1.0% |
276.7 | 1 | 1.0% |
345.1 | 1 | 1.0% |
308.4 | 1 | 1.0% |
319.4 | 1 | 1.0% |
Other values (89) | 89 |
Value | Count | Frequency (%) |
264.7 | 1 | |
271.0 | 1 | |
272.1 | 1 | |
275.4 | 1 | |
276.7 | 1 | |
279.8 | 1 | |
280.2 | 1 | |
282.6 | 1 | |
287.4 | 1 | |
289.1 | 1 |
Value | Count | Frequency (%) |
519.3 | 1 | |
506.7 | 1 | |
472.8 | 1 | |
470.9 | 1 | |
446.8 | 1 | |
444.2 | 1 | |
441.1 | 1 | |
440.7 | 1 | |
432.2 | 1 | |
419.1 | 1 |
시도명 | 코드 | 금년(이번년) | 평균값 | 최고값 | 최소값 | 재현기간 | |
---|---|---|---|---|---|---|---|
시도명 | 1.000 | 0.837 | 0.644 | 0.670 | 0.932 | 0.260 | 0.569 |
코드 | 0.837 | 1.000 | 0.951 | 0.745 | 0.832 | 1.000 | 0.824 |
금년(이번년) | 0.644 | 0.951 | 1.000 | 0.806 | 0.498 | 0.319 | 0.832 |
평균값 | 0.670 | 0.745 | 0.806 | 1.000 | 0.593 | 0.418 | 0.817 |
최고값 | 0.932 | 0.832 | 0.498 | 0.593 | 1.000 | 0.298 | 0.433 |
최소값 | 0.260 | 1.000 | 0.319 | 0.418 | 0.298 | 1.000 | 0.251 |
재현기간 | 0.569 | 0.824 | 0.832 | 0.817 | 0.433 | 0.251 | 1.000 |
최소값 | 시도명 | |
---|---|---|
최소값 | 1.000 | 0.312 |
시도명 | 0.312 | 1.000 |
금년(이번년) | 평균값 | 최고값 | 재현기간 | 시도명 | 최소값 | |
---|---|---|---|---|---|---|
금년(이번년) | 1.000 | 0.616 | 0.233 | 0.410 | 0.316 | 0.236 |
평균값 | 0.616 | 1.000 | 0.653 | 0.123 | 0.455 | 0.404 |
최고값 | 0.233 | 0.653 | 1.000 | -0.171 | 0.896 | 0.211 |
재현기간 | 0.410 | 0.123 | -0.171 | 1.000 | 0.271 | 0.180 |
시도명 | 0.316 | 0.455 | 0.896 | 0.271 | 1.000 | 0.312 |
최소값 | 0.236 | 0.404 | 0.211 | 0.180 | 0.312 | 1.000 |
시도명 | 시군명 | 코드 | 금년(이번년) | 평균값 | 최고값 | 최소값 | 재현기간 | |
---|---|---|---|---|---|---|---|---|
0 | 강원도 | 0 | 정선군 | 436.909545 | 213.158689 | 42770 | - | 389.5 |
1 | 강원도 | 0 | 평창군 | 466.7 | 235.945977 | 42760 | - | 379.7 |
2 | 강원도 | 0 | 영월군 | 394.167529 | 208.261957 | 42750 | - | 380.6 |
3 | 강원도 | 0 | 횡성군 | 493.410556 | 222.22278 | 42730 | - | 380.8 |
4 | 강원도 | 0 | 홍천군 | 468.1 | 215.591779 | 42720 | - | 401.1 |
5 | 강원도 | 0 | 삼척시 | 478.358313 | 234.470437 | 42230 | - | 351.6 |
6 | 강원도 | 0 | 양양군 | 584.637449 | 263.675 | 42830 | - | 407.7 |
7 | 강원도 | 0 | 고성군 | 589.7 | 239.829696 | 42820 | - | 441.1 |
8 | 강원도 | 0 | 인제군 | 453.429372 | 207.014355 | 42810 | - | 416.1 |
9 | 강원도 | 0 | 양구군 | 399.455213 | 184.305181 | 42800 | - | 380.9 |
시도명 | 시군명 | 코드 | 금년(이번년) | 평균값 | 최고값 | 최소값 | 재현기간 | |
---|---|---|---|---|---|---|---|---|
90 | 전라남도 | 0 | 화순군 | 539.022193 | 279.68412 | 46790 | - | 332.6 |
91 | 전라남도 | 0 | 장흥군 | 634.916861 | 344.977673 | 46800 | - | 444.2 |
92 | 전라남도 | 0 | 강진군 | 635.61817 | 334.842196 | 46810 | - | 410.0 |
93 | 전라남도 | 0 | 해남군 | 571.899521 | 301.500734 | 46820 | - | 369.5 |
94 | 전라남도 | 0 | 영암군 | 552.665697 | 280.81436 | 46830 | - | 349.3 |
95 | 전라남도 | 0 | 무안군 | 587.634945 | 265.77621 | 46840 | - | 308.2 |
96 | 전라남도 | 0 | 함평군 | 602.504068 | 275.302202 | 46860 | - | 323.9 |
97 | 전라남도 | 0 | 영광군 | 644.967011 | 278.188162 | 46870 | 5 | 298.5 |
98 | 전라남도 | 0 | 장성군 | 614.456214 | 283.966811 | 46880 | 5 | 303.0 |
99 | 전라남도 | 0 | 완도군 | 715.900488 | 376.72417 | 46890 | - | 432.2 |