Overview

Dataset statistics

Number of variables15
Number of observations49
Missing cells153
Missing cells (%)20.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory123.7 B

Variable types

Unsupported9
Categorical3
Text3

Dataset

Description2019년도토지수용재결현황
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=203182

Alerts

Unnamed: 5 has constant value ""Constant
Unnamed: 14 has constant value ""Constant
Unnamed: 2 is highly overall correlated with Unnamed: 4 and 1 other fieldsHigh correlation
Unnamed: 4 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 6 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 2 is highly imbalanced (78.5%)Imbalance
Unnamed: 0 has 49 (100.0%) missing valuesMissing
Unnamed: 1 has 1 (2.0%) missing valuesMissing
Unnamed: 3 has 2 (4.1%) missing valuesMissing
Unnamed: 5 has 48 (98.0%) missing valuesMissing
Unnamed: 8 has 1 (2.0%) missing valuesMissing
Unnamed: 9 has 1 (2.0%) missing valuesMissing
Unnamed: 11 has 1 (2.0%) missing valuesMissing
Unnamed: 12 has 1 (2.0%) missing valuesMissing
(단위 : 제곱미터, 원) has 1 (2.0%) missing valuesMissing
Unnamed: 14 has 48 (98.0%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
(단위 : 제곱미터, 원) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 00:20:28.008539
Analysis finished2024-03-14 00:20:28.683138
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing49
Missing (%)100.0%
Memory size573.0 B

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

Unnamed: 2
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
전북지토위
46 
위원회 명
 
1
<NA>
 
1
합계
 
1

Length

Max length5
Median length5
Mean length4.9183673
Min length2

Unique

Unique3 ?
Unique (%)6.1%

Sample

1st row위원회 명
2nd row<NA>
3rd row합계
4th row전북지토위
5th row전북지토위

Common Values

ValueCountFrequency (%)
전북지토위 46
93.9%
위원회 명 1
 
2.0%
<NA> 1
 
2.0%
합계 1
 
2.0%

Length

2024-03-14T09:20:28.736092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:20:28.816078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북지토위 46
92.0%
위원회 1
 
2.0%
1
 
2.0%
na 1
 
2.0%
합계 1
 
2.0%

Unnamed: 3
Text

MISSING 

Distinct47
Distinct (%)100.0%
Missing2
Missing (%)4.1%
Memory size524.0 B
2024-03-14T09:20:28.982815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length25
Mean length19.021277
Min length10

Characters and Unicode

Total characters894
Distinct characters182
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)100.0%

Sample

1st row사업명(수용재결명)
2nd row옥석지구 다목적농촌용수개발사업
3rd row태평1구역 주택재개발정비사업(1차)
4th row효자구역 주택재개발정비사업(2차)
5th row노암 산업단지 연결도로 개설사업
ValueCountFrequency (%)
조성사업 4
 
2.9%
감곡지구 3
 
2.2%
도로개설공사 3
 
2.2%
개설공사 3
 
2.2%
진입도로 3
 
2.2%
농촌중심지활성화사업 2
 
1.4%
정비사업 2
 
1.4%
주거환경개선 2
 
1.4%
개설사업 2
 
1.4%
사업 2
 
1.4%
Other values (109) 113
81.3%
2024-03-14T09:20:29.314919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
 
10.3%
47
 
5.3%
37
 
4.1%
28
 
3.1%
( 27
 
3.0%
) 27
 
3.0%
25
 
2.8%
24
 
2.7%
20
 
2.2%
19
 
2.1%
Other values (172) 548
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 691
77.3%
Space Separator 92
 
10.3%
Decimal Number 42
 
4.7%
Open Punctuation 27
 
3.0%
Close Punctuation 27
 
3.0%
Other Punctuation 8
 
0.9%
Dash Punctuation 4
 
0.4%
Math Symbol 2
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
6.8%
37
 
5.4%
28
 
4.1%
25
 
3.6%
24
 
3.5%
20
 
2.9%
19
 
2.7%
17
 
2.5%
17
 
2.5%
15
 
2.2%
Other values (156) 442
64.0%
Decimal Number
ValueCountFrequency (%)
2 15
35.7%
1 9
21.4%
3 5
 
11.9%
0 4
 
9.5%
5 3
 
7.1%
7 2
 
4.8%
6 2
 
4.8%
4 1
 
2.4%
8 1
 
2.4%
Space Separator
ValueCountFrequency (%)
92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 692
77.4%
Common 202
 
22.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
6.8%
37
 
5.3%
28
 
4.0%
25
 
3.6%
24
 
3.5%
20
 
2.9%
19
 
2.7%
17
 
2.5%
17
 
2.5%
15
 
2.2%
Other values (157) 443
64.0%
Common
ValueCountFrequency (%)
92
45.5%
( 27
 
13.4%
) 27
 
13.4%
2 15
 
7.4%
1 9
 
4.5%
, 8
 
4.0%
3 5
 
2.5%
- 4
 
2.0%
0 4
 
2.0%
5 3
 
1.5%
Other values (5) 8
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 691
77.3%
ASCII 202
 
22.6%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92
45.5%
( 27
 
13.4%
) 27
 
13.4%
2 15
 
7.4%
1 9
 
4.5%
, 8
 
4.0%
3 5
 
2.5%
- 4
 
2.0%
0 4
 
2.0%
5 3
 
1.5%
Other values (5) 8
 
4.0%
Hangul
ValueCountFrequency (%)
47
 
6.8%
37
 
5.4%
28
 
4.1%
25
 
3.6%
24
 
3.5%
20
 
2.9%
19
 
2.7%
17
 
2.5%
17
 
2.5%
15
 
2.2%
Other values (156) 442
64.0%
None
ValueCountFrequency (%)
1
100.0%

Unnamed: 4
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)46.9%
Missing0
Missing (%)0.0%
Memory size524.0 B
한국농어촌공사
완주군수
전주시장
김제시장
군산시장
Other values (18)
24 

Length

Max length17
Median length4
Mean length6.0816327
Min length4

Unique

Unique12 ?
Unique (%)24.5%

Sample

1st row사업시행자
2nd row사업시행자명
3rd row<NA>
4th row한국농어촌공사
5th row태평1구역 주택재개발정비사업조합

Common Values

ValueCountFrequency (%)
한국농어촌공사 8
16.3%
완주군수 5
 
10.2%
전주시장 4
 
8.2%
김제시장 4
 
8.2%
군산시장 4
 
8.2%
익산시장 2
 
4.1%
전북개발공사 2
 
4.1%
페이퍼코리아㈜, ㈜디오션시티쓰리 2
 
4.1%
진안군수 2
 
4.1%
정읍시장 2
 
4.1%
Other values (13) 14
28.6%

Length

2024-03-14T09:20:29.473868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국농어촌공사 8
15.1%
완주군수 5
 
9.4%
전주시장 4
 
7.5%
김제시장 4
 
7.5%
군산시장 4
 
7.5%
정읍시장 2
 
3.8%
주택재개발정비사업조합 2
 
3.8%
부안군수 2
 
3.8%
진안군수 2
 
3.8%
㈜디오션시티쓰리 2
 
3.8%
Other values (15) 18
34.0%

Unnamed: 5
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing48
Missing (%)98.0%
Memory size524.0 B
2024-03-14T09:20:29.578279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row종류
ValueCountFrequency (%)
종류 1
100.0%
2024-03-14T09:20:29.742959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 6
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)38.8%
Missing0
Missing (%)0.0%
Memory size524.0 B
국토의 계획 및 이용에 관한 법률 제88조제2항
17 
농어촌정비법 제9조제6항
농어촌정비법 제59조제2항
도시 및 주거환경정비법 제28조제1항
자연재해대책법 제14조의2
Other values (14)
16 

Length

Max length26
Median length24
Mean length18.387755
Min length4

Unique

Unique12 ?
Unique (%)24.5%

Sample

1st row사업인정 근거법률
2nd row<NA>
3rd row<NA>
4th row농어촌정비법 제9조제6항
5th row도시 및 주거환경정비법 제28조제1항

Common Values

ValueCountFrequency (%)
국토의 계획 및 이용에 관한 법률 제88조제2항 17
34.7%
농어촌정비법 제9조제6항 8
16.3%
농어촌정비법 제59조제2항 4
 
8.2%
도시 및 주거환경정비법 제28조제1항 2
 
4.1%
자연재해대책법 제14조의2 2
 
4.1%
<NA> 2
 
4.1%
도시 및 주거환경정비법 제28조 2
 
4.1%
산업입지 및 개발에 관한 법률 제7조 1
 
2.0%
농어촌정비법 제9조 1
 
2.0%
국방,군사시설 사업에 관한 법률 제4조제1항 1
 
2.0%
Other values (9) 9
18.4%

Length

2024-03-14T09:20:29.845505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
25
12.0%
관한 21
10.1%
법률 21
10.1%
국토의 17
 
8.2%
이용에 17
 
8.2%
제88조제2항 17
 
8.2%
계획 17
 
8.2%
농어촌정비법 13
 
6.2%
제9조제6항 8
 
3.8%
도시 5
 
2.4%
Other values (31) 47
22.6%

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size524.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

Unnamed: 10
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size524.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

(단위 : 제곱미터, 원)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.0%
Memory size524.0 B

Unnamed: 14
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing48
Missing (%)98.0%
Memory size524.0 B
2024-03-14T09:20:29.915345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row비고
ValueCountFrequency (%)
비고 1
100.0%
2024-03-14T09:20:30.104005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Correlations

2024-03-14T09:20:30.171880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 6
Unnamed: 21.0001.0001.0001.000
Unnamed: 31.0001.0001.0001.000
Unnamed: 41.0001.0001.0000.659
Unnamed: 61.0001.0000.6591.000
2024-03-14T09:20:30.249141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 4Unnamed: 6Unnamed: 2
Unnamed: 41.0000.2090.760
Unnamed: 60.2091.0000.803
Unnamed: 20.7600.8031.000
2024-03-14T09:20:30.319619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 4Unnamed: 6
Unnamed: 21.0000.7600.803
Unnamed: 40.7601.0000.209
Unnamed: 60.8030.2091.000

Missing values

2024-03-14T09:20:28.302510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:20:28.452098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T09:20:28.590002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 0Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12(단위 : 제곱미터, 원)Unnamed: 14
0<NA>연도위원회 명사업명(수용재결명)사업시행자<NA>사업인정 근거법률사업 정보NaNNaN수용 정보NaNNaN수용재결액비고
1<NA>NaN<NA><NA>사업시행자명종류<NA>필지수사업면적토지\n소유자수필지수수용면적토지\n소유자수NaN<NA>
2<NA>2019합계<NA><NA><NA><NA>68102853942.245203682269181.3665535189116600<NA>
3<NA>2019전북지토위옥석지구 다목적농촌용수개발사업한국농어촌공사<NA>농어촌정비법 제9조제6항2671781522704865173483095220<NA>
4<NA>2019전북지토위태평1구역 주택재개발정비사업(1차)태평1구역 주택재개발정비사업조합<NA>도시 및 주거환경정비법 제28조제1항409620353996595477410562283700<NA>
5<NA>2019전북지토위효자구역 주택재개발정비사업(2차)효자구역 주택재개발정비사업조합<NA>도시 및 주거환경정비법 제28조제1항33667848.543592116.75483111610<NA>
6<NA>2019전북지토위노암 산업단지 연결도로 개설사업남원시장<NA>국토의 계획 및 이용에 관한 법률 제88조제2항5463914012367915208150<NA>
7<NA>2019전북지토위순창지구 농촌용수이용체계 재편사업(2차)한국농어촌공사<NA>농어촌정비법 제9조제6항454395431195535553122310000<NA>
8<NA>2019전북지토위익산 도시계획시설(수도산 근린공원) 사업㈜한국토지신탁<NA>국토의 계획 및 이용에 관한 법률 제88조제2항2355638718755273370950<NA>
9<NA>2019전북지토위성수면 농촌중심지활성화사업진안군수<NA>농어촌정비법 제59조제2항1411751102183.426658610<NA>
Unnamed: 0Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12(단위 : 제곱미터, 원)Unnamed: 14
39<NA>2019전북지토위장수 북동로(소로3-26호) 개설사업장수군수<NA>국토의 계획 및 이용에 관한 법률 제88조제2항1115717149131428600<NA>
40<NA>2019전북지토위은파관광지 조성사업군산시장<NA>관광진흥법 제52조제1항22715741240134490000<NA>
41<NA>2019전북지토위팔덕지구 다목적농촌용수개발사업(1,2차)한국농어촌공사<NA>농어촌정비법 제9조제6항698416344.55657415137.785324303660<NA>
42<NA>2019전북지토위감곡지구 다목적농촌굥수개발사업(2, 3, 4, 5차)한국농어촌공사<NA>농어촌정비법 제9조제6항889179130.76421239198.4128151681040<NA>
43<NA>2019전북지토위감곡지구 다목적농촌굥수개발사업(6차)한국농어촌공사<NA>농어촌정비법 제9조제6항889179130.7642460599348730<NA>
44<NA>2019전북지토위계화면 농촌중심지 활성화사업부안군수<NA>농어촌정비법 제59조제2항1061111821222288382550<NA>
45<NA>2019전북지토위숭모공원~석치마을 도로확포장공사익산시장<NA>국토의 계획 및 이용에 관한 법률 제88조제2항4564231916244216107850460<NA>
46<NA>2019전북지토위마동 테니스공원 조성사업익산시장<NA>국토의 계획 및 이용에 관한 법률 제88조제2항84360652314876.58114099520<NA>
47<NA>2019전북지토위전주시 거점확산형 주거환경개선 시범사업(동산구역)(2차)전주시장<NA>도시 및 주거환경정비법 제28조71628443193530.4101855008590<NA>
48<NA>2019전북지토위전북 스마트팜 혁신밸리 조성사업김제시장<NA>농어촌정비법 제9조제6항65104972491941826262336875600<NA>