Overview

Dataset statistics

Number of variables9
Number of observations114
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.5 KiB
Average record size in memory76.2 B

Variable types

Categorical7
Numeric2

Dataset

Description재해위험지구 정비사업 현황
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2718

Alerts

읍 면 is highly overall correlated with 총 사업비 and 5 other fieldsHigh correlation
수식 설명 is highly overall correlated with 총 사업비 and 7 other fieldsHigh correlation
시군 is highly overall correlated with 읍 면 and 4 other fieldsHigh correlation
지구 명 is highly overall correlated with 총 사업비 and 5 other fieldsHigh correlation
연도 is highly overall correlated with 수식 설명High correlation
비고 is highly overall correlated with 시군 and 4 other fieldsHigh correlation
사업 개요 is highly overall correlated with 총 사업비 and 5 other fieldsHigh correlation
총 사업비 is highly overall correlated with 읍 면 and 3 other fieldsHigh correlation
연도 별 사업비 is highly overall correlated with 수식 설명High correlation
연도 별 사업비 has 16 (14.0%) zerosZeros

Reproduction

Analysis started2024-01-09 23:03:13.400685
Analysis finished2024-01-09 23:03:14.684378
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
옥천
21 
영동
21 
충주
15 
음성
15 
괴산
12 
Other values (6)
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청주
2nd row청주
3rd row청주
4th row충주
5th row충주

Common Values

ValueCountFrequency (%)
옥천 21
18.4%
영동 21
18.4%
충주 15
13.2%
음성 15
13.2%
괴산 12
10.5%
제천 6
 
5.3%
보은 6
 
5.3%
진천 6
 
5.3%
단양 6
 
5.3%
청주 3
 
2.6%

Length

2024-01-10T08:03:14.738860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
옥천 21
18.4%
영동 21
18.4%
충주 15
13.2%
음성 15
13.2%
괴산 12
10.5%
제천 6
 
5.3%
보은 6
 
5.3%
진천 6
 
5.3%
단양 6
 
5.3%
청주 3
 
2.6%

읍 면
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)26.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
음성
12 
감물
 
6
청산
 
6
백곡
 
6
청성
 
6
Other values (25)
78 

Length

Max length3
Median length2
Mean length2.0263158
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row옥산
2nd row옥산
3rd row옥산
4th row중앙탑
5th row중앙탑

Common Values

ValueCountFrequency (%)
음성 12
 
10.5%
감물 6
 
5.3%
청산 6
 
5.3%
백곡 6
 
5.3%
청성 6
 
5.3%
신니 6
 
5.3%
수한 3
 
2.6%
동이 3
 
2.6%
군북 3
 
2.6%
군서 3
 
2.6%
Other values (20) 60
52.6%

Length

2024-01-10T08:03:14.841327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
음성 12
 
10.5%
청산 6
 
5.3%
백곡 6
 
5.3%
청성 6
 
5.3%
신니 6
 
5.3%
감물 6
 
5.3%
청천 3
 
2.6%
상촌 3
 
2.6%
용산 3
 
2.6%
증평 3
 
2.6%
Other values (20) 60
52.6%

지구 명
Categorical

HIGH CORRELATION 

Distinct38
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
환희
 
3
회인
 
3
마산
 
3
단암
 
3
견학
 
3
Other values (33)
99 

Length

Max length3
Median length2
Mean length2.1315789
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환희
2nd row환희
3rd row환희
4th row소일
5th row소일

Common Values

ValueCountFrequency (%)
환희 3
 
2.6%
회인 3
 
2.6%
마산 3
 
2.6%
단암 3
 
2.6%
견학 3
 
2.6%
단월 3
 
2.6%
용원 3
 
2.6%
봉양 3
 
2.6%
안간 3
 
2.6%
율산 3
 
2.6%
Other values (28) 84
73.7%

Length

2024-01-10T08:03:14.946433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
환희 3
 
2.6%
구룡 3
 
2.6%
상시 3
 
2.6%
두평 3
 
2.6%
둔전2 3
 
2.6%
한석 3
 
2.6%
질벌뜰 3
 
2.6%
양백1 3
 
2.6%
양백2 3
 
2.6%
구월 3
 
2.6%
Other values (28) 84
73.7%

사업 개요
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)27.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
다목적가뭄방재시설 1식
25 
배수펌프장 1식
 
6
하천 0.31㎞,교량 6개소 등
 
3
도로,하천 1.2km
 
3
펌프장, 유수지, 우수관 거 개량
 
3
Other values (26)
74 

Length

Max length27
Median length25
Mean length17.763158
Min length8

Unique

Unique2 ?
Unique (%)1.8%

Sample

1st row펌프장, 유수지 1개소 등
2nd row펌프장, 유수지 1개소 등
3rd row펌프장, 유수지 1개소 등
4th row다목적가뭄방재시설 1식
5th row다목적가뭄방재시설 1식

Common Values

ValueCountFrequency (%)
다목적가뭄방재시설 1식 25
21.9%
배수펌프장 1식 6
 
5.3%
하천 0.31㎞,교량 6개소 등 3
 
2.6%
도로,하천 1.2km 3
 
2.6%
펌프장, 유수지, 우수관 거 개량 3
 
2.6%
하천정비 3.10㎞,교량 5개소 3
 
2.6%
우수관로 개량 및 신설 1.06km 3
 
2.6%
펌프장 1식,하천정비 1.6㎞ 3
 
2.6%
교량2개소,하천정비 0.4km 3
 
2.6%
하천 1.2km, 교량 1개소 3
 
2.6%
Other values (21) 59
51.8%

Length

2024-01-10T08:03:15.063644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1식 34
 
9.9%
다목적가뭄방재시설 27
 
7.9%
하천정비 27
 
7.9%
하천 21
 
6.1%
15
 
4.4%
교량 15
 
4.4%
배수펌프장 9
 
2.6%
2개소 9
 
2.6%
6개소 9
 
2.6%
1.2km&#44 9
 
2.6%
Other values (48) 167
48.8%

총 사업비
Real number (ℝ)

HIGH CORRELATION 

Distinct31
Distinct (%)27.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10696.711
Minimum2000
Maximum48527
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-01-10T08:03:15.187565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile3027.5
Q16000
median8433.5
Q314000
95-th percentile21296.5
Maximum48527
Range46527
Interquartile range (IQR)8000

Descriptive statistics

Standard deviation8235.6134
Coefficient of variation (CV)0.76992019
Kurtosis9.8258122
Mean10696.711
Median Absolute Deviation (MAD)4383.5
Skewness2.6473498
Sum1219425
Variance67825328
MonotonicityNot monotonic
2024-01-10T08:03:15.298965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
7000 12
 
10.5%
4000 9
 
7.9%
6000 6
 
5.3%
14000 6
 
5.3%
4250 3
 
2.6%
22200 3
 
2.6%
14302 3
 
2.6%
10796 3
 
2.6%
8867 3
 
2.6%
20810 3
 
2.6%
Other values (21) 63
55.3%
ValueCountFrequency (%)
2000 3
 
2.6%
2150 3
 
2.6%
3500 3
 
2.6%
3900 3
 
2.6%
4000 9
7.9%
4100 3
 
2.6%
4250 3
 
2.6%
6000 6
5.3%
6500 3
 
2.6%
7000 12
10.5%
ValueCountFrequency (%)
48527 3
2.6%
22200 3
2.6%
20810 3
2.6%
20804 3
2.6%
19000 3
2.6%
16000 3
2.6%
15000 3
2.6%
14302 3
2.6%
14280 3
2.6%
14000 6
5.3%

연도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2020
38 
2021
38 
2022
38 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2021
3rd row2022
4th row2020
5th row2021

Common Values

ValueCountFrequency (%)
2020 38
33.3%
2021 38
33.3%
2022 38
33.3%

Length

2024-01-10T08:03:15.438988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:03:15.554635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 38
33.3%
2021 38
33.3%
2022 38
33.3%

연도 별 사업비
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct72
Distinct (%)63.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3565.5702
Minimum0
Maximum33300
Zeros16
Zeros (%)14.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-01-10T08:03:15.679862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1485
median1920
Q34560
95-th percentile12817
Maximum33300
Range33300
Interquartile range (IQR)4075

Descriptive statistics

Standard deviation4959.3391
Coefficient of variation (CV)1.3908965
Kurtosis12.153599
Mean3565.5702
Median Absolute Deviation (MAD)1720
Skewness2.9471756
Sum406475
Variance24595044
MonotonicityNot monotonic
2024-01-10T08:03:15.808932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 16
 
14.0%
1600 5
 
4.4%
200 5
 
4.4%
500 4
 
3.5%
400 3
 
2.6%
1000 3
 
2.6%
4000 3
 
2.6%
2000 3
 
2.6%
1920 2
 
1.8%
2900 2
 
1.8%
Other values (62) 68
59.6%
ValueCountFrequency (%)
0 16
14.0%
200 5
 
4.4%
320 2
 
1.8%
360 1
 
0.9%
400 3
 
2.6%
444 1
 
0.9%
480 1
 
0.9%
500 4
 
3.5%
540 1
 
0.9%
640 1
 
0.9%
ValueCountFrequency (%)
33300 1
0.9%
19940 1
0.9%
18160 1
0.9%
18010 1
0.9%
14300 1
0.9%
13662 1
0.9%
12362 1
0.9%
10520 1
0.9%
10040 1
0.9%
9300 1
0.9%

비고
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
계속(계속)
66 
신규(계속)
33 
계속(완료)
15 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규(계속)
2nd row신규(계속)
3rd row신규(계속)
4th row계속(계속)
5th row계속(계속)

Common Values

ValueCountFrequency (%)
계속(계속) 66
57.9%
신규(계속) 33
28.9%
계속(완료) 15
 
13.2%

Length

2024-01-10T08:03:15.926039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:03:16.016781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
계속(계속 66
57.9%
신규(계속 33
28.9%
계속(완료 15
 
13.2%

수식 설명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
<NA>
76 
2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
38 

Length

Max length46
Median length4
Mean length18
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
3rd row<NA>
4th row<NA>
5th row2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)

Common Values

ValueCountFrequency (%)
<NA> 76
66.7%
2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%) 38
33.3%

Length

2024-01-10T08:03:16.118588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:03:16.206439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 76
28.6%
2021년 38
14.3%
사업비 38
14.3%
비중 38
14.3%
국비(50%)&#44 38
14.3%
도비(15%)&#44;시군비(35 38
14.3%

Interactions

2024-01-10T08:03:14.326514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:03:13.915745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:03:14.405317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:03:13.994165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T08:03:16.274504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군읍 면지구 명사업 개요총 사업비연도연도 별 사업비비고
시군1.0001.0001.0000.9790.6790.0000.0000.710
읍 면1.0001.0001.0000.9930.9590.0000.0000.991
지구 명1.0001.0001.0000.9991.0000.0000.0001.000
사업 개요0.9790.9930.9991.0000.9910.0000.4920.929
총 사업비0.6790.9591.0000.9911.0000.0000.4880.532
연도0.0000.0000.0000.0000.0001.0000.4410.000
연도 별 사업비0.0000.0000.0000.4920.4880.4411.0000.000
비고0.7100.9911.0000.9290.5320.0000.0001.000
2024-01-10T08:03:16.390287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍 면수식 설명시군지구 명연도비고사업 개요
읍 면1.0001.0000.9030.9510.0000.7790.851
수식 설명1.0001.0001.0001.0001.0001.0001.000
시군0.9031.0001.0000.8590.0000.5350.769
지구 명0.9511.0000.8591.0000.0000.8270.924
연도0.0001.0000.0000.0001.0000.0000.000
비고0.7791.0000.5350.8270.0001.0000.691
사업 개요0.8511.0000.7690.9240.0000.6911.000
2024-01-10T08:03:16.506236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
총 사업비연도 별 사업비시군읍 면지구 명사업 개요연도비고수식 설명
총 사업비1.0000.3820.4160.7240.8390.8260.0000.1841.000
연도 별 사업비0.3821.0000.0000.0000.0000.1960.3250.0001.000
시군0.4160.0001.0000.9030.8590.7690.0000.5351.000
읍 면0.7240.0000.9031.0000.9510.8510.0000.7791.000
지구 명0.8390.0000.8590.9511.0000.9240.0000.8271.000
사업 개요0.8260.1960.7690.8510.9241.0000.0000.6911.000
연도0.0000.3250.0000.0000.0000.0001.0000.0001.000
비고0.1840.0000.5350.7790.8270.6910.0001.0001.000
수식 설명1.0001.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2024-01-10T08:03:14.514372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T08:03:14.638035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군읍 면지구 명사업 개요총 사업비연도연도 별 사업비비고수식 설명
0청주옥산환희펌프장&#44; 유수지 1개소 등425020200신규(계속)<NA>
1청주옥산환희펌프장&#44; 유수지 1개소 등42502021200신규(계속)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
2청주옥산환희펌프장&#44; 유수지 1개소 등425020224050신규(계속)<NA>
3충주중앙탑소일다목적가뭄방재시설 1식39002020500계속(계속)<NA>
4충주중앙탑소일다목적가뭄방재시설 1식390020211600계속(계속)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
5충주중앙탑소일다목적가뭄방재시설 1식390020221800계속(계속)<NA>
6충주앙성단암배수펌프장 1식994620207748계속(완료)<NA>
7충주앙성단암배수펌프장 1식994620212198계속(완료)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
8충주앙성단암배수펌프장 1식994620220계속(완료)<NA>
9충주신니견학배수펌프장 1식410020201920계속(계속)<NA>
시군읍 면지구 명사업 개요총 사업비연도연도 별 사업비비고수식 설명
104음성음성음성다목적가뭄방재시설 1식700020224900계속(계속)<NA>
105음성음성목골소하천2.0km&#44; 교량7개소 등1900020200신규(계속)<NA>
106음성음성목골소하천2.0km&#44; 교량7개소 등190002021840신규(계속)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
107음성음성목골소하천2.0km&#44; 교량7개소 등19000202218160신규(계속)<NA>
108단양매포상시하천정비 3.1㎞&#44;교량 3개소950020202920계속(계속)<NA>
109단양매포상시하천정비 3.1㎞&#44;교량 3개소950020212000계속(계속)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
110단양매포상시하천정비 3.1㎞&#44;교량 3개소950020224580계속(계속)<NA>
111단양단성북상하천 0.31㎞&#44;교량 6개소 등350020200신규(계속)<NA>
112단양단성북상하천 0.31㎞&#44;교량 6개소 등35002021200신규(계속)2021년 사업비 비중 국비(50%)&#44; 도비(15%)&#44;시군비(35%)
113단양단성북상하천 0.31㎞&#44;교량 6개소 등350020223300신규(계속)<NA>