Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory75.3 B

Variable types

Categorical8
Numeric1

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=00eb74d0-2e00-11ea-9713-eb3e5186fb38

Alerts

상위 유역 명 has constant value ""Constant
급수 등급 명 has constant value ""Constant
유역명 is highly overall correlated with 인구수 and 3 other fieldsHigh correlation
상세유역명 is highly overall correlated with 인구수 and 3 other fieldsHigh correlation
유역코드 is highly overall correlated with 인구수 and 3 other fieldsHigh correlation
위치 명 is highly overall correlated with 인구수 and 3 other fieldsHigh correlation
인구수 is highly overall correlated with 유역코드 and 3 other fieldsHigh correlation
인구수 has 12 (12.0%) zerosZeros

Reproduction

Analysis started2023-12-10 13:10:03.329399
Analysis finished2023-12-10 13:10:04.495354
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

유역코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
100101
42 
100102
42 
100103
16 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100101
2nd row100101
3rd row100101
4th row100101
5th row100101

Common Values

ValueCountFrequency (%)
100101 42
42.0%
100102 42
42.0%
100103 16
 
16.0%

Length

2023-12-10T22:10:04.595396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:04.759125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100101 42
42.0%
100102 42
42.0%
100103 16
 
16.0%

유역명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
광동댐
42 
광동댐하류
42 
임계천
16 

Length

Max length5
Median length3
Mean length3.84
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광동댐
2nd row광동댐
3rd row광동댐
4th row광동댐
5th row광동댐

Common Values

ValueCountFrequency (%)
광동댐 42
42.0%
광동댐하류 42
42.0%
임계천 16
 
16.0%

Length

2023-12-10T22:10:04.933617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:05.106407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광동댐 42
42.0%
광동댐하류 42
42.0%
임계천 16
 
16.0%

상위 유역 명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
남한강상류
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남한강상류
2nd row남한강상류
3rd row남한강상류
4th row남한강상류
5th row남한강상류

Common Values

ValueCountFrequency (%)
남한강상류 100
100.0%

Length

2023-12-10T22:10:05.305937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:05.457774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남한강상류 100
100.0%

위치 명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
골지천-광동댐
42 
광동댐-임계천하구
42 
임계천-임계천하구
16 

Length

Max length9
Median length9
Mean length8.16
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row골지천-광동댐
2nd row골지천-광동댐
3rd row골지천-광동댐
4th row골지천-광동댐
5th row골지천-광동댐

Common Values

ValueCountFrequency (%)
골지천-광동댐 42
42.0%
광동댐-임계천하구 42
42.0%
임계천-임계천하구 16
 
16.0%

Length

2023-12-10T22:10:05.634886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:05.808050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
골지천-광동댐 42
42.0%
광동댐-임계천하구 42
42.0%
임계천-임계천하구 16
 
16.0%

상세유역명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
골지천
84 
임계천
16 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row골지천
2nd row골지천
3rd row골지천
4th row골지천
5th row골지천

Common Values

ValueCountFrequency (%)
골지천 84
84.0%
임계천 16
 
16.0%

Length

2023-12-10T22:10:06.026380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:06.225973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
골지천 84
84.0%
임계천 16
 
16.0%

급수 등급 명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
지방2급
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방2급
2nd row지방2급
3rd row지방2급
4th row지방2급
5th row지방2급

Common Values

ValueCountFrequency (%)
지방2급 100
100.0%

Length

2023-12-10T22:10:06.450454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:06.600679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방2급 100
100.0%

연령대 명
Categorical

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20~24세
 
6
5~9세
 
6
0~4세
 
6
15~19세
 
6
30~34세
 
6
Other values (16)
70 

Length

Max length6
Median length6
Mean length5.76
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20~24세
2nd row10~14세
3rd row100세이상
4th row55~59세
5th row40~44세

Common Values

ValueCountFrequency (%)
20~24세 6
 
6.0%
5~9세 6
 
6.0%
0~4세 6
 
6.0%
15~19세 6
 
6.0%
30~34세 6
 
6.0%
25~29세 6
 
6.0%
10~14세 6
 
6.0%
35~39세 6
 
6.0%
75~79세 4
 
4.0%
100세이상 4
 
4.0%
Other values (11) 44
44.0%

Length

2023-12-10T22:10:06.787883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20~24세 6
 
6.0%
0~4세 6
 
6.0%
15~19세 6
 
6.0%
30~34세 6
 
6.0%
25~29세 6
 
6.0%
10~14세 6
 
6.0%
35~39세 6
 
6.0%
5~9세 6
 
6.0%
80~84세 4
 
4.0%
90~94세 4
 
4.0%
Other values (11) 44
44.0%

성별
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
M
50 
F
50 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowF
3rd rowF
4th rowF
5th rowM

Common Values

ValueCountFrequency (%)
M 50
50.0%
F 50
50.0%

Length

2023-12-10T22:10:06.976785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:10:07.119337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 50
50.0%
f 50
50.0%

인구수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct74
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.61
Minimum0
Maximum323
Zeros12
Zeros (%)12.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:10:07.305382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q118
median49
Q3131.25
95-th percentile270.3
Maximum323
Range323
Interquartile range (IQR)113.25

Descriptive statistics

Standard deviation85.867986
Coefficient of variation (CV)1.0270062
Kurtosis0.28241092
Mean83.61
Median Absolute Deviation (MAD)40
Skewness1.1588582
Sum8361
Variance7373.311
MonotonicityNot monotonic
2023-12-10T22:10:07.546808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 12
 
12.0%
9 4
 
4.0%
48 4
 
4.0%
36 3
 
3.0%
38 2
 
2.0%
49 2
 
2.0%
7 2
 
2.0%
18 2
 
2.0%
54 2
 
2.0%
42 2
 
2.0%
Other values (64) 65
65.0%
ValueCountFrequency (%)
0 12
12.0%
6 1
 
1.0%
7 2
 
2.0%
9 4
 
4.0%
11 1
 
1.0%
12 1
 
1.0%
15 1
 
1.0%
16 1
 
1.0%
17 1
 
1.0%
18 2
 
2.0%
ValueCountFrequency (%)
323 1
1.0%
305 1
1.0%
288 1
1.0%
285 1
1.0%
276 1
1.0%
270 1
1.0%
245 1
1.0%
243 1
1.0%
237 1
1.0%
234 1
1.0%

Interactions

2023-12-10T22:10:03.997277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:10:07.707068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유역코드유역명위치 명상세유역명연령대 명성별인구수
유역코드1.0001.0001.0001.0000.0000.0000.756
유역명1.0001.0001.0001.0000.0000.0000.756
위치 명1.0001.0001.0001.0000.0000.0000.756
상세유역명1.0001.0001.0001.0000.1020.0000.702
연령대 명0.0000.0000.0000.1021.0000.0000.651
성별0.0000.0000.0000.0000.0001.0000.000
인구수0.7560.7560.7560.7020.6510.0001.000
2023-12-10T22:10:07.999013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유역명성별연령대 명상세유역명유역코드위치 명
유역명1.0000.0000.0000.9951.0001.000
성별0.0001.0000.0000.0000.0000.000
연령대 명0.0000.0001.0000.0660.0000.000
상세유역명0.9950.0000.0661.0000.9950.995
유역코드1.0000.0000.0000.9951.0001.000
위치 명1.0000.0000.0000.9951.0001.000
2023-12-10T22:10:08.245115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구수유역코드유역명위치 명상세유역명연령대 명성별
인구수1.0000.6060.6060.6060.5250.2820.000
유역코드0.6061.0001.0001.0000.9950.0000.000
유역명0.6061.0001.0001.0000.9950.0000.000
위치 명0.6061.0001.0001.0000.9950.0000.000
상세유역명0.5250.9950.9950.9951.0000.0660.000
연령대 명0.2820.0000.0000.0000.0661.0000.000
성별0.0000.0000.0000.0000.0000.0001.000

Missing values

2023-12-10T22:10:04.203938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:10:04.416686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

유역코드유역명상위 유역 명위치 명상세유역명급수 등급 명연령대 명성별인구수
0100101광동댐남한강상류골지천-광동댐골지천지방2급20~24세M130
1100101광동댐남한강상류골지천-광동댐골지천지방2급10~14세F144
2100101광동댐남한강상류골지천-광동댐골지천지방2급100세이상F0
3100101광동댐남한강상류골지천-광동댐골지천지방2급55~59세F285
4100101광동댐남한강상류골지천-광동댐골지천지방2급40~44세M323
5100101광동댐남한강상류골지천-광동댐골지천지방2급65~69세M208
6100101광동댐남한강상류골지천-광동댐골지천지방2급85~89세M6
7100101광동댐남한강상류골지천-광동댐골지천지방2급75~79세M116
8100101광동댐남한강상류골지천-광동댐골지천지방2급25~29세M108
9100101광동댐남한강상류골지천-광동댐골지천지방2급60~64세M245
유역코드유역명상위 유역 명위치 명상세유역명급수 등급 명연령대 명성별인구수
90100103임계천남한강상류임계천-임계천하구임계천지방2급25~29세F62
91100103임계천남한강상류임계천-임계천하구임계천지방2급10~14세M48
92100103임계천남한강상류임계천-임계천하구임계천지방2급5~9세F36
93100103임계천남한강상류임계천-임계천하구임계천지방2급5~9세M40
94100103임계천남한강상류임계천-임계천하구임계천지방2급0~4세F36
95100103임계천남한강상류임계천-임계천하구임계천지방2급0~4세M36
96100103임계천남한강상류임계천-임계천하구임계천지방2급30~34세M65
97100103임계천남한강상류임계천-임계천하구임계천지방2급30~34세F49
98100103임계천남한강상류임계천-임계천하구임계천지방2급35~39세M60
99100103임계천남한강상류임계천-임계천하구임계천지방2급35~39세F54