Overview

Dataset statistics

Number of variables14
Number of observations182
Missing cells986
Missing cells (%)38.7%
Duplicate rows18
Duplicate rows (%)9.9%
Total size in memory20.0 KiB
Average record size in memory112.7 B

Variable types

Text1
Categorical1
Unsupported12

Dataset

Description숲가꾸기정책,공공추진실적2019
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202857

Alerts

Dataset has 18 (9.9%) duplicate rowsDuplicates
Unnamed: 1 is highly imbalanced (74.4%)Imbalance
2018년 숲가꾸기(정책, 공공) 추진실적(전라북도) has 64 (35.2%) missing valuesMissing
Unnamed: 2 has 129 (70.9%) missing valuesMissing
Unnamed: 3 has 65 (35.7%) missing valuesMissing
Unnamed: 4 has 60 (33.0%) missing valuesMissing
Unnamed: 5 has 70 (38.5%) missing valuesMissing
Unnamed: 6 has 71 (39.0%) missing valuesMissing
Unnamed: 7 has 80 (44.0%) missing valuesMissing
Unnamed: 8 has 91 (50.0%) missing valuesMissing
Unnamed: 9 has 88 (48.4%) missing valuesMissing
Unnamed: 10 has 67 (36.8%) missing valuesMissing
Unnamed: 11 has 71 (39.0%) missing valuesMissing
Unnamed: 12 has 59 (32.4%) missing valuesMissing
Unnamed: 13 has 71 (39.0%) missing valuesMissing
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 00:25:06.334440
Analysis finished2024-03-14 00:25:07.124779
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct65
Distinct (%)55.1%
Missing64
Missing (%)35.2%
Memory size1.6 KiB
2024-03-14T09:25:07.244653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length25
Mean length7.3220339
Min length2

Characters and Unicode

Total characters864
Distinct characters147
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)44.1%

Sample

1st row1. 예산 집행 현황(누계)
2nd row구분
3rd row누계
4th row전월까지
5th row금월
ValueCountFrequency (%)
37
 
16.7%
구분 14
 
6.3%
o 14
 
6.3%
누계 12
 
5.4%
전월까지 9
 
4.1%
기타 9
 
4.1%
금월 8
 
3.6%
합계 4
 
1.8%
산림조합 3
 
1.4%
사업량 3
 
1.4%
Other values (88) 108
48.9%
2024-03-14T09:25:07.557013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
 
18.5%
- 43
 
5.0%
24
 
2.8%
24
 
2.8%
22
 
2.5%
22
 
2.5%
) 19
 
2.2%
18
 
2.1%
17
 
2.0%
17
 
2.0%
Other values (137) 498
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 565
65.4%
Space Separator 160
 
18.5%
Dash Punctuation 43
 
5.0%
Decimal Number 30
 
3.5%
Close Punctuation 19
 
2.2%
Other Punctuation 19
 
2.2%
Lowercase Letter 14
 
1.6%
Open Punctuation 13
 
1.5%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.2%
24
 
4.2%
22
 
3.9%
22
 
3.9%
18
 
3.2%
17
 
3.0%
17
 
3.0%
15
 
2.7%
14
 
2.5%
12
 
2.1%
Other values (119) 380
67.3%
Decimal Number
ValueCountFrequency (%)
1 9
30.0%
2 8
26.7%
5 3
 
10.0%
3 2
 
6.7%
4 2
 
6.7%
6 2
 
6.7%
9 1
 
3.3%
8 1
 
3.3%
7 1
 
3.3%
0 1
 
3.3%
Other Punctuation
ValueCountFrequency (%)
. 16
84.2%
, 3
 
15.8%
Space Separator
ValueCountFrequency (%)
160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Lowercase Letter
ValueCountFrequency (%)
o 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 565
65.4%
Common 285
33.0%
Latin 14
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.2%
24
 
4.2%
22
 
3.9%
22
 
3.9%
18
 
3.2%
17
 
3.0%
17
 
3.0%
15
 
2.7%
14
 
2.5%
12
 
2.1%
Other values (119) 380
67.3%
Common
ValueCountFrequency (%)
160
56.1%
- 43
 
15.1%
) 19
 
6.7%
. 16
 
5.6%
( 13
 
4.6%
1 9
 
3.2%
2 8
 
2.8%
5 3
 
1.1%
, 3
 
1.1%
3 2
 
0.7%
Other values (7) 9
 
3.2%
Latin
ValueCountFrequency (%)
o 14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 565
65.4%
ASCII 299
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
53.5%
- 43
 
14.4%
) 19
 
6.4%
. 16
 
5.4%
o 14
 
4.7%
( 13
 
4.3%
1 9
 
3.0%
2 8
 
2.7%
5 3
 
1.0%
, 3
 
1.0%
Other values (8) 11
 
3.7%
Hangul
ValueCountFrequency (%)
24
 
4.2%
24
 
4.2%
22
 
3.9%
22
 
3.9%
18
 
3.2%
17
 
3.0%
17
 
3.0%
15
 
2.7%
14
 
2.5%
12
 
2.1%
Other values (119) 380
67.3%

Unnamed: 1
Categorical

IMBALANCE 

Distinct7
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
<NA>
164 
 
4
경제림육성단지 내
 
3
경제림육성단지 외
 
3
정책
 
3
Other values (2)
 
5

Length

Max length9
Median length4
Mean length4.010989
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 164
90.1%
4
 
2.2%
경제림육성단지 내 3
 
1.6%
경제림육성단지 외 3
 
1.6%
정책 3
 
1.6%
공공 3
 
1.6%
소계 2
 
1.1%

Length

2024-03-14T09:25:07.676912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:25:07.779917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 164
87.2%
경제림육성단지 6
 
3.2%
4
 
2.1%
3
 
1.6%
3
 
1.6%
정책 3
 
1.6%
공공 3
 
1.6%
소계 2
 
1.1%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing129
Missing (%)70.9%
Memory size1.6 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)35.7%
Memory size1.6 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing60
Missing (%)33.0%
Memory size1.6 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing70
Missing (%)38.5%
Memory size1.6 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)39.0%
Memory size1.6 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing80
Missing (%)44.0%
Memory size1.6 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing91
Missing (%)50.0%
Memory size1.6 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing88
Missing (%)48.4%
Memory size1.6 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing67
Missing (%)36.8%
Memory size1.6 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)39.0%
Memory size1.6 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing59
Missing (%)32.4%
Memory size1.6 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)39.0%
Memory size1.6 KiB

Correlations

2024-03-14T09:25:07.849473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2018년 숲가꾸기(정책, 공공) 추진실적(전라북도)Unnamed: 1
2018년 숲가꾸기(정책, 공공) 추진실적(전라북도)1.0001.000
Unnamed: 11.0001.000

Missing values

2024-03-14T09:25:06.707200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:25:06.841592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T09:25:06.996046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2018년 숲가꾸기(정책, 공공) 추진실적(전라북도)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
0<NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
11. 예산 집행 현황(누계)<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
2<NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 천원, %)
3구분<NA>계획NaNNaN집행NaNNaNNaNNaNNaN집행률NaNNaN
4<NA><NA>국고지방비NaN국고NaN지방비NaN국고지방비
5누계<NA>21625410108297051079570519921614.6NaN9978408.3NaN9943206.3NaN0.9212130.9213920.921034
6전월까지<NA>00017433743NaN8732777.5NaN8700965.5NaN0.8061690.8063730.805965
7금월<NA>0002487871.6NaN1245630.8NaN1242240.8NaN0.1150440.115020.115068
8<NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
91-1. 예산집행현황(정책숲가꾸기)<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
2018년 숲가꾸기(정책, 공공) 추진실적(전라북도)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
172금월<NA>170NaN92NaN4NaN30NaN4NaN40NaN
173<NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
17410. 숲가꾸기 패트롤 운영실적<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
175<NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
176구분<NA>운영실적NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
177<NA><NA>덩굴제거(ha)NaNNaNNaN지장목 제거NaN가옥 등 건축물 지장목 제거NaN농경지 지장목 제거NaN기타 위험목 제거NaN
178<NA><NA>도로변조림지일반산림건수제거본수건수제거본수건수제거본수건수제거본수
179누계<NA>321116518310871196802915735250
180전월까지<NA>42201838941195802915735157
181금회<NA>2891450193010000093

Duplicate rows

Most frequently occurring

2018년 숲가꾸기(정책, 공공) 추진실적(전라북도)Unnamed: 1# duplicates
17<NA><NA>52
8구분<NA>14
1- 기타<NA>8
10누계<NA>8
11전월까지<NA>8
9금월<NA>7
12합계<NA>3
13<NA>경제림육성단지 내3
14<NA>경제림육성단지 외3
15<NA>공공3