Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory75.3 B

Variable types

Categorical8
Numeric1

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=00eb74d0-2e00-11ea-9713-eb3e5186fb38

Alerts

상위유역명 has constant value ""Constant
급수등급명 has constant value ""Constant
성별코드 has constant value ""Constant
유역명 is highly overall correlated with 유역코드 and 2 other fieldsHigh correlation
유역위치명 is highly overall correlated with 유역코드 and 2 other fieldsHigh correlation
상세유역명 is highly overall correlated with 유역코드 and 2 other fieldsHigh correlation
유역코드 is highly overall correlated with 유역명 and 2 other fieldsHigh correlation
인구수 has 11 (11.0%) zerosZeros

Reproduction

Analysis started2023-12-10 13:10:09.675985
Analysis finished2023-12-10 13:10:10.803806
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

유역코드
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
100101
22 
100102
22 
100103
22 
100105
22 
100106
12 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100101
2nd row100101
3rd row100101
4th row100101
5th row100101

Common Values

ValueCountFrequency (%)
100101 22
22.0%
100102 22
22.0%
100103 22
22.0%
100105 22
22.0%
100106 12
12.0%

Length

2023-12-10T22:10:10.909404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:11.073604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100101 22
22.0%
100102 22
22.0%
100103 22
22.0%
100105 22
22.0%
100106 12
12.0%

유역명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
광동댐
22 
광동댐하류
22 
임계천
22 
도암댐
22 
송천
12 

Length

Max length5
Median length3
Mean length3.32
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광동댐
2nd row광동댐
3rd row광동댐
4th row광동댐
5th row광동댐

Common Values

ValueCountFrequency (%)
광동댐 22
22.0%
광동댐하류 22
22.0%
임계천 22
22.0%
도암댐 22
22.0%
송천 12
12.0%

Length

2023-12-10T22:10:11.253451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:11.435335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광동댐 22
22.0%
광동댐하류 22
22.0%
임계천 22
22.0%
도암댐 22
22.0%
송천 12
12.0%

상위유역명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
남한강상류
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남한강상류
2nd row남한강상류
3rd row남한강상류
4th row남한강상류
5th row남한강상류

Common Values

ValueCountFrequency (%)
남한강상류 100
100.0%

Length

2023-12-10T22:10:11.614540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:11.787838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남한강상류 100
100.0%

유역위치명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
골지천-광동댐
22 
광동댐-임계천하구
22 
임계천-임계천하구
22 
송천-도암(강릉)댐
22 
도암(강릉)댐-송천하구
12 

Length

Max length12
Median length10
Mean length9.14
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row골지천-광동댐
2nd row골지천-광동댐
3rd row골지천-광동댐
4th row골지천-광동댐
5th row골지천-광동댐

Common Values

ValueCountFrequency (%)
골지천-광동댐 22
22.0%
광동댐-임계천하구 22
22.0%
임계천-임계천하구 22
22.0%
송천-도암(강릉)댐 22
22.0%
도암(강릉)댐-송천하구 12
12.0%

Length

2023-12-10T22:10:12.059727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:12.305412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
골지천-광동댐 22
22.0%
광동댐-임계천하구 22
22.0%
임계천-임계천하구 22
22.0%
송천-도암(강릉)댐 22
22.0%
도암(강릉)댐-송천하구 12
12.0%

상세유역명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
골지천
44 
송천
34 
임계천
22 

Length

Max length3
Median length3
Mean length2.66
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row골지천
2nd row골지천
3rd row골지천
4th row골지천
5th row골지천

Common Values

ValueCountFrequency (%)
골지천 44
44.0%
송천 34
34.0%
임계천 22
22.0%

Length

2023-12-10T22:10:12.620932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:12.779255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
골지천 44
44.0%
송천 34
34.0%
임계천 22
22.0%

급수등급명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
지방2급
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방2급
2nd row지방2급
3rd row지방2급
4th row지방2급
5th row지방2급

Common Values

ValueCountFrequency (%)
지방2급 100
100.0%

Length

2023-12-10T22:10:13.007094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:13.229567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방2급 100
100.0%
Distinct22
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0~4세
 
5
30~34세
 
5
35~39세
 
5
15~19세
 
5
20~24세
 
5
Other values (17)
75 

Length

Max length6
Median length6
Mean length5.78
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0~4세
2nd row100세이상
3rd row10~14세
4th row15~19세
5th row20~24세

Common Values

ValueCountFrequency (%)
0~4세 5
 
5.0%
30~34세 5
 
5.0%
35~39세 5
 
5.0%
15~19세 5
 
5.0%
20~24세 5
 
5.0%
25~29세 5
 
5.0%
10~14세 5
 
5.0%
55~59세 5
 
5.0%
40~44세 5
 
5.0%
45~49세 5
 
5.0%
Other values (12) 50
50.0%

Length

2023-12-10T22:10:13.402376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
0~4세 5
 
5.0%
55~59세 5
 
5.0%
30~34세 5
 
5.0%
50~54세 5
 
5.0%
45~49세 5
 
5.0%
40~44세 5
 
5.0%
100세이상 5
 
5.0%
10~14세 5
 
5.0%
25~29세 5
 
5.0%
20~24세 5
 
5.0%
Other values (12) 50
50.0%

성별코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
M
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 100
100.0%

Length

2023-12-10T22:10:13.636354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:13.785321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 100
100.0%

인구수
Real number (ℝ)

ZEROS 

Distinct73
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90.96
Minimum0
Maximum289
Zeros11
Zeros (%)11.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:10:13.939112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q121.25
median64
Q3135
95-th percentile268.05
Maximum289
Range289
Interquartile range (IQR)113.75

Descriptive statistics

Standard deviation84.244104
Coefficient of variation (CV)0.92616649
Kurtosis-0.27131134
Mean90.96
Median Absolute Deviation (MAD)50.5
Skewness0.89397602
Sum9096
Variance7097.0691
MonotonicityNot monotonic
2023-12-10T22:10:14.171391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 11
 
11.0%
32 3
 
3.0%
16 3
 
3.0%
95 2
 
2.0%
23 2
 
2.0%
64 2
 
2.0%
19 2
 
2.0%
46 2
 
2.0%
12 2
 
2.0%
6 2
 
2.0%
Other values (63) 69
69.0%
ValueCountFrequency (%)
0 11
11.0%
6 2
 
2.0%
8 2
 
2.0%
12 2
 
2.0%
14 1
 
1.0%
16 3
 
3.0%
18 2
 
2.0%
19 2
 
2.0%
22 1
 
1.0%
23 2
 
2.0%
ValueCountFrequency (%)
289 1
1.0%
285 2
2.0%
284 1
1.0%
269 1
1.0%
268 1
1.0%
256 1
1.0%
253 1
1.0%
245 1
1.0%
241 1
1.0%
220 1
1.0%

Interactions

2023-12-10T22:10:10.201405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:10:14.318743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유역코드유역명유역위치명상세유역명연령대구분명인구수
유역코드1.0001.0001.0001.0000.0000.485
유역명1.0001.0001.0001.0000.0000.485
유역위치명1.0001.0001.0001.0000.0000.485
상세유역명1.0001.0001.0001.0000.0000.221
연령대구분명0.0000.0000.0000.0001.0000.564
인구수0.4850.4850.4850.2210.5641.000
2023-12-10T22:10:14.565789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유역명유역위치명상세유역명유역코드연령대구분명
유역명1.0001.0000.9901.0000.000
유역위치명1.0001.0000.9901.0000.000
상세유역명0.9900.9901.0000.9900.000
유역코드1.0001.0000.9901.0000.000
연령대구분명0.0000.0000.0000.0001.000
2023-12-10T22:10:14.753024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구수유역코드유역명유역위치명상세유역명연령대구분명
인구수1.0000.2140.2140.2140.1260.225
유역코드0.2141.0001.0001.0000.9900.000
유역명0.2141.0001.0001.0000.9900.000
유역위치명0.2141.0001.0001.0000.9900.000
상세유역명0.1260.9900.9900.9901.0000.000
연령대구분명0.2250.0000.0000.0000.0001.000

Missing values

2023-12-10T22:10:10.461062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:10:10.699927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

유역코드유역명상위유역명유역위치명상세유역명급수등급명연령대구분명성별코드인구수
0100101광동댐남한강상류골지천-광동댐골지천지방2급0~4세M95
1100101광동댐남한강상류골지천-광동댐골지천지방2급100세이상M0
2100101광동댐남한강상류골지천-광동댐골지천지방2급10~14세M129
3100101광동댐남한강상류골지천-광동댐골지천지방2급15~19세M100
4100101광동댐남한강상류골지천-광동댐골지천지방2급20~24세M123
5100101광동댐남한강상류골지천-광동댐골지천지방2급25~29세M92
6100101광동댐남한강상류골지천-광동댐골지천지방2급30~34세M100
7100101광동댐남한강상류골지천-광동댐골지천지방2급35~39세M157
8100101광동댐남한강상류골지천-광동댐골지천지방2급40~44세M245
9100101광동댐남한강상류골지천-광동댐골지천지방2급45~49세M285
유역코드유역명상위유역명유역위치명상세유역명급수등급명연령대구분명성별코드인구수
90100106송천남한강상류도암(강릉)댐-송천하구송천지방2급10~14세M26
91100106송천남한강상류도암(강릉)댐-송천하구송천지방2급15~19세M23
92100106송천남한강상류도암(강릉)댐-송천하구송천지방2급20~24세M64
93100106송천남한강상류도암(강릉)댐-송천하구송천지방2급25~29세M51
94100106송천남한강상류도암(강릉)댐-송천하구송천지방2급30~34세M45
95100106송천남한강상류도암(강릉)댐-송천하구송천지방2급35~39세M63
96100106송천남한강상류도암(강릉)댐-송천하구송천지방2급40~44세M69
97100106송천남한강상류도암(강릉)댐-송천하구송천지방2급45~49세M116
98100106송천남한강상류도암(강릉)댐-송천하구송천지방2급50~54세M163
99100106송천남한강상류도암(강릉)댐-송천하구송천지방2급55~59세M179