Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 10000
Missing cells (%): 20.0%
Duplicate rows: 282
Duplicate rows (%): 2.8%
Total size in memory: 488.3 KiB
Average record size in memory: 50.0 B
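
As a quick sanity check, the percentages and the average record size can be recomputed from the raw counts. The sketch below is only illustrative arithmetic: the variable names are made up for this example, and the formulas (missing cells over total cells, total bytes over rows) are the natural reading of the labels rather than something the report states.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 5
n_observations = 10_000
missing_cells = 10_000
duplicate_rows = 282
total_size_kib = 488.3

# Assumed definitions: share of all cells, share of all rows, bytes per row.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

All 10000 missing cells come from a single column (우량1), which is why one fully empty variable out of five yields exactly 20.0%.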

Variable types

Categorical: 1
Text: 1
DateTime: 1
Unsupported: 1
Numeric: 1

Dataset

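Description: Rainfall data observed and collected per sub-basin by the radar observation stations of the Han River Flood Control Office (한강홍수통제소), which divide major basins such as the Han River into standard basins. Rainfall data is provided for each standard basin at year-month-day-hour-minute resolution.
URL: https://www.data.go.kr/data/15117576/fileData.do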

Alerts

Constant: 대권역명 has constant value "한강"
Duplicates: Dataset has 282 (2.8%) duplicate rows
Missing: 우량1 has 10000 (100.0%) missing values
Skewed: 우량2 is highly skewed (γ1 = 65.30713022)
Unsupported: 우량1 is an unsupported type; check whether it needs cleaning or further analysis
Zeros: 우량2 has 9979 (99.8%) zeros

Reproduction

Analysis started: 2023-12-12 16:37:53.107841
Analysis finished: 2023-12-12 16:37:53.589124
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대권역명 (major basin name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
한강: 10000

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한강
2nd row: 한강
3rd row: 한강
4th row: 한강
5th row: 한강

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 한강 | 10000 | 100.0% |

Length

Histogram of lengths of the category


표준유역명 (standard basin name)
Text

Distinct: 242
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 7
Mean length: 4.1411
Min length: 2

Characters and Unicode

Total characters: 41411
Distinct characters: 136
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
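
To illustrate what such character properties look like, here is a minimal sketch using Python's standard `unicodedata` module on the five sample values listed for this variable. It is not how the report itself was produced. The standard library has no direct script lookup, so the script is approximated from the first word of each character name, which is an assumption made for this example only.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable in the report.
names = ["서천상류", "내성천상류", "광산천합류점", "화매천", "사천천"]

chars = "".join(names)
char_counts = Counter(chars)

# General category of each code point; "Lo" means "Letter, other".
categories = {unicodedata.category(ch) for ch in chars}

# Assumption: approximate the script by the first word of the Unicode
# character name (e.g. "HANGUL SYLLABLE ..." -> "HANGUL").
scripts = {unicodedata.name(ch).split()[0] for ch in chars}

lengths = [len(name) for name in names]

print("Total characters:", len(chars))
print("Distinct characters:", len(char_counts))
print("Most common character:", char_counts.most_common(1)[0])
print("Categories:", categories)
print("Scripts (approximate):", scripts)
print("Mean length:", sum(lengths) / len(lengths))
```

On this small sample every character falls in the "Lo" category and the Hangul script, in line with the single category and single script reported for the full column.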

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서천상류
2nd row: 내성천상류
3rd row: 광산천합류점
4th row: 화매천
5th row: 사천천

| Value | Count | Frequency (%) |
|---|---|---|
| 남천 | 109 | 1.1% |
| 동천 | 106 | 1.1% |
| 경천댐 | 89 | 0.9% |
| 북천 | 73 | 0.7% |
| 운곡천합류점 | 56 | 0.6% |
| 낙화암천 | 55 | 0.5% |
| 동천합류점 | 55 | 0.5% |
| 광산천 | 54 | 0.5% |
| 안강수위표 | 54 | 0.5% |
| 남강댐 | 54 | 0.5% |
| Other values (232) | 9295 | 93.0% |

Most occurring characters

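| Value | Count | Frequency (%) |
|---|---|---|
| | 6679 | 16.1% |
| | 3926 | 9.5% |
| | 1831 | 4.4% |
| | 1705 | 4.1% |
| | 1595 | 3.9% |
| | 1593 | 3.8% |
| | 1483 | 3.6% |
| | 1371 | 3.3% |
| | 1116 | 2.7% |
| | 1115 | 2.7% |
| Other values (126) | 18997 | 45.9% |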

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Other Letter | 41411 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 41411 | 100.0% |

Most occurring blocks

| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 41411 | 100.0% |

Most frequent character per category, script, and block

All 41411 characters belong to a single category (Other Letter), a single script (Hangul), and a single block (Hangul), so each of these breakdowns repeats the counts listed under "Most occurring characters".

년월일시분 (timestamp: year, month, day, hour, minute)
DateTime

Distinct: 291
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2012-06-04 04:10:00
Maximum: 2012-06-13 12:50:00

Histogram with fixed size bins (bins=50)

우량1 (rainfall 1)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

우량2 (rainfall 2)
Real number (ℝ)

SKEWED  ZEROS

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.000295
Minimum: 0
Maximum: 1.01
Zeros: 9979
Zeros (%): 99.8%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 0
95th percentile: 0
Maximum: 1.01
Range: 1.01
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.012177759
Coefficient of variation (CV): 41.280538
Kurtosis: 4972.3087
Mean: 0.000295
Median Absolute Deviation (MAD): 0
Skewness: 65.30713
Sum: 2.95
Variance: 0.0001482978
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=14)
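
Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 9979 | 99.8% |
| 0.01 | 6 | 0.1% |
| 0.04 | 2 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.42 | 1 | < 0.1% |
| 1.01 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.26 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |

Minimum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 9979 | 99.8% |
| 0.01 | 6 | 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 2 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |

Maximum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 1.01 | 1 | < 0.1% |
| 0.42 | 1 | < 0.1% |
| 0.36 | 1 | < 0.1% |
| 0.26 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.05 | 2 | < 0.1% |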
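
Because 우량2 has only 14 distinct values, its full distribution can be rebuilt from the value counts listed for this variable, and the descriptive statistics can be cross-checked by hand. The sketch below does this with the standard library only. It assumes the sample (n - 1) variance and the bias-adjusted sample skewness, which are conventions commonly used by pandas-based tooling and not something this report states explicitly.

```python
import math

# Value -> count for 우량2, combined from the common-value and
# extreme-value tables of this report (14 distinct values).
value_counts = {
    0.0: 9979, 0.01: 6, 0.03: 2, 0.04: 2, 0.05: 2,
    0.06: 1, 0.11: 1, 0.12: 1, 0.13: 1, 0.18: 1,
    0.26: 1, 0.36: 1, 0.42: 1, 1.01: 1,
}

n = sum(value_counts.values())
total = sum(v * c for v, c in value_counts.items())
mean = total / n

# Second and third central moments (divided by n).
m2 = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in value_counts.items()) / n

# Assumption: sample variance (n - 1) and bias-adjusted skewness.
variance = m2 * n / (n - 1)
std = math.sqrt(variance)
cv = std / mean
skewness = (m3 / m2 ** 1.5) * math.sqrt(n * (n - 1)) / (n - 2)
zeros_pct = 100 * value_counts[0.0] / n

print(f"n={n}, sum={total:.2f}, mean={mean:.6f}")
print(f"variance={variance:.10f}, std={std:.9f}, cv={cv:.4f}")
print(f"skewness={skewness:.4f}, zeros={zeros_pct:.1f}%")
```

Under these assumptions the recomputed figures agree with the reported mean, variance, coefficient of variation, and skewness to within rounding. The extreme skewness follows directly from 21 non-zero readings sitting against 9979 zeros.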

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

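First rows

| | 대권역명 | 표준유역명 | 년월일시분 | 우량1 | 우량2 |
|---|---|---|---|---|---|
| 77166 | 한강 | 서천상류 | 2012-06-07 06:30 | <NA> | 0.0 |
| 23844 | 한강 | 내성천상류 | 2012-06-05 06:40 | <NA> | 0.0 |
| 37976 | 한강 | 광산천합류점 | 2012-06-06 04:10 | <NA> | 0.0 |
| 6465 | 한강 | 화매천 | 2012-06-04 08:30 | <NA> | 0.0 |
| 84202 | 한강 | 사천천 | 2012-06-07 11:10 | <NA> | 0.0 |
| 77036 | 한강 | 영천강 | 2012-06-07 06:20 | <NA> | 0.0 |
| 30252 | 한강 | 탐진댐 | 2012-06-05 10:50 | <NA> | 0.0 |
| 2304 | 한강 | 용곡수위표 | 2012-06-04 05:40 | <NA> | 0.0 |
| 14714 | 한강 | 감천하류 | 2012-06-05 02:00 | <NA> | 0.0 |
| 16724 | 한강 | 이언천 | 2012-06-05 03:20 | <NA> | 0.0 |

Last rows

| | 대권역명 | 표준유역명 | 년월일시분 | 우량1 | 우량2 |
|---|---|---|---|---|---|
| 57985 | 한강 | 대암댐 | 2012-06-06 05:30 | <NA> | 0.0 |
| 45292 | 한강 | 영천강 | 2012-06-06 09:00 | <NA> | 0.0 |
| 9905 | 한강 | 동지산수위표 | 2012-06-04 10:40 | <NA> | 0.0 |
| 78270 | 한강 | 남강댐상류 | 2012-06-07 07:10 | <NA> | 0.0 |
| 42414 | 한강 | 운곡천 | 2012-06-06 07:10 | <NA> | 0.0 |
| 59497 | 한강 | 소양천 | 2012-06-06 06:30 | <NA> | 0.0 |
| 6151 | 한강 | 형산강하류 | 2012-06-04 08:10 | <NA> | 0.0 |
| 22927 | 한강 | 대가천 | 2012-06-05 06:00 | <NA> | 0.0 |
| 89579 | 한강 | 영강상류 | 2012-06-13 10:40 | <NA> | 0.0 |
| 86970 | 한강 | 청도천 | 2012-06-07 01:00 | <NA> | 0.0 |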

Duplicate rows

Most frequently occurring

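| | 대권역명 | 표준유역명 | 년월일시분 | 우량2 | # duplicates |
|---|---|---|---|---|---|
| 15 | 한강 | 경천댐 | 2012-06-06 11:30 | 0.0 | 3 |
| 69 | 한강 | 남천 | 2012-06-06 04:20 | 0.0 | 3 |
| 110 | 한강 | 동천 | 2012-06-06 09:50 | 0.0 | 3 |
| 156 | 한강 | 북천 | 2012-06-06 10:00 | 0.0 | 3 |
| 0 | 한강 | 가야천 | 2012-06-06 09:00 | 0.0 | 2 |
| 1 | 한강 | 가야천 | 2012-06-07 12:30 | 0.0 | 2 |
| 2 | 한강 | 감천상류 | 2012-06-06 04:10 | 0.0 | 2 |
| 3 | 한강 | 감천하류 | 2012-06-06 10:40 | 0.0 | 2 |
| 4 | 한강 | 강청수위표 | 2012-06-06 06:40 | 0.0 | 2 |
| 5 | 한강 | 강청수위표 | 2012-06-06 11:30 | 0.0 | 2 |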
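
A "# duplicates" column like this one can be produced by counting identical rows. The sketch below shows the idea with `collections.Counter` on a hypothetical six-row miniature of the dataset. The first two keys reuse values from the table, while the third row and the list itself are made up for illustration, and the all-missing 우량1 column is left out here, as it is in the table.

```python
from collections import Counter

# Hypothetical miniature of the dataset, as tuples of
# (대권역명, 표준유역명, 년월일시분, 우량2). Not the real data.
rows = [
    ("한강", "경천댐", "2012-06-06 11:30", 0.0),
    ("한강", "경천댐", "2012-06-06 11:30", 0.0),
    ("한강", "경천댐", "2012-06-06 11:30", 0.0),
    ("한강", "가야천", "2012-06-06 09:00", 0.0),
    ("한강", "가야천", "2012-06-06 09:00", 0.0),
    ("한강", "서천상류", "2012-06-07 06:30", 0.0),
]

# Count identical rows and keep only those occurring more than once.
counts = Counter(rows)
duplicated = {row: c for row, c in counts.items() if c > 1}

# Most frequently occurring first.
for row, c in sorted(duplicated.items(), key=lambda kv: -kv[1]):
    print(row, c)
```

Whether repeated readings for the same basin and timestamp are genuine re-transmissions or an artifact of how the file was assembled cannot be determined from the report alone, so dropping them should be a deliberate choice.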