Overview

Dataset statistics

Number of variables6
Number of observations92
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory52.4 B

Variable types

Numeric3
Categorical2
Text1

Dataset

Description공공용지 수용재결취득 토지면적 및 보상금액 현황을 아래와 같이 제공합니다. 제공현황 - 사업종류,사업명,사업시행자,토지_면적(㎡),토지_보상금액(원) 등
URLhttps://www.data.go.kr/data/15049034/fileData.do

Alerts

토지_면적(제곱미터) is highly overall correlated with 토지_보상금액(원)High correlation
토지_보상금액(원) is highly overall correlated with 토지_면적(제곱미터)High correlation
사업시행자 is highly imbalanced (62.1%)Imbalance
순번 has unique valuesUnique
토지_면적(제곱미터) has unique valuesUnique
토지_보상금액(원) has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:08:23.647909
Analysis finished2023-12-12 23:08:25.274755
Duration1.63 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.5
Minimum1
Maximum92
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2023-12-13T08:08:25.341599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.55
Q123.75
median46.5
Q369.25
95-th percentile87.45
Maximum92
Range91
Interquartile range (IQR)45.5

Descriptive statistics

Standard deviation26.70206
Coefficient of variation (CV)0.57423785
Kurtosis-1.2
Mean46.5
Median Absolute Deviation (MAD)23
Skewness0
Sum4278
Variance713
MonotonicityStrictly increasing
2023-12-13T08:08:25.463091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
60 1
 
1.1%
69 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
Other values (82) 82
89.1%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
92 1
1.1%
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%

사업종류
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
수도
40 
37 
단지
15 

Length

Max length2
Median length2
Mean length1.5978261
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row수도
3rd row수도
4th row
5th row

Common Values

ValueCountFrequency (%)
수도 40
43.5%
37
40.2%
단지 15
 
16.3%

Length

2023-12-13T08:08:25.602660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:25.721805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수도 40
43.5%
37
40.2%
단지 15
 
16.3%
Distinct91
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size868.0 B
2023-12-13T08:08:25.909628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length16.695652
Min length8

Characters and Unicode

Total characters1536
Distinct characters185
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)97.8%

Sample

1st row부항다목적댐건설사업
2nd row금강북부급수체계조정(청양계통)
3rd row낙동강중부권급수체계구축사업
4th row용담댐 직하류 하천정비공사
5th row경인 아라뱃길사업
ValueCountFrequency (%)
직하류 10
 
4.6%
건설사업 8
 
3.7%
하천정비공사 6
 
2.7%
급수체계조정사업 5
 
2.3%
시화2단계(송산그린시티 4
 
1.8%
용수공급사업 4
 
1.8%
운문댐 4
 
1.8%
용수공급시설 3
 
1.4%
친수구역 3
 
1.4%
사업 3
 
1.4%
Other values (144) 169
77.2%
2023-12-13T08:08:26.255380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
127
 
8.3%
93
 
6.1%
93
 
6.1%
62
 
4.0%
) 37
 
2.4%
( 37
 
2.4%
36
 
2.3%
34
 
2.2%
34
 
2.2%
33
 
2.1%
Other values (175) 950
61.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1276
83.1%
Space Separator 127
 
8.3%
Close Punctuation 38
 
2.5%
Open Punctuation 38
 
2.5%
Uppercase Letter 26
 
1.7%
Decimal Number 25
 
1.6%
Letter Number 4
 
0.3%
Connector Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
7.3%
93
 
7.3%
62
 
4.9%
36
 
2.8%
34
 
2.7%
34
 
2.7%
33
 
2.6%
32
 
2.5%
32
 
2.5%
28
 
2.2%
Other values (156) 799
62.6%
Uppercase Letter
ValueCountFrequency (%)
M 6
23.1%
V 6
23.1%
T 6
23.1%
I 4
15.4%
S 2
 
7.7%
K 2
 
7.7%
Decimal Number
ValueCountFrequency (%)
2 13
52.0%
1 6
24.0%
3 3
 
12.0%
7 2
 
8.0%
4 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 37
97.4%
] 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 37
97.4%
[ 1
 
2.6%
Letter Number
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
127
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1276
83.1%
Common 230
 
15.0%
Latin 30
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
7.3%
93
 
7.3%
62
 
4.9%
36
 
2.8%
34
 
2.7%
34
 
2.7%
33
 
2.6%
32
 
2.5%
32
 
2.5%
28
 
2.2%
Other values (156) 799
62.6%
Common
ValueCountFrequency (%)
127
55.2%
) 37
 
16.1%
( 37
 
16.1%
2 13
 
5.7%
1 6
 
2.6%
3 3
 
1.3%
7 2
 
0.9%
_ 2
 
0.9%
[ 1
 
0.4%
] 1
 
0.4%
Latin
ValueCountFrequency (%)
M 6
20.0%
V 6
20.0%
T 6
20.0%
I 4
13.3%
2
 
6.7%
S 2
 
6.7%
K 2
 
6.7%
2
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1276
83.1%
ASCII 256
 
16.7%
Number Forms 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
127
49.6%
) 37
 
14.5%
( 37
 
14.5%
2 13
 
5.1%
M 6
 
2.3%
1 6
 
2.3%
V 6
 
2.3%
T 6
 
2.3%
I 4
 
1.6%
3 3
 
1.2%
Other values (7) 11
 
4.3%
Hangul
ValueCountFrequency (%)
93
 
7.3%
93
 
7.3%
62
 
4.9%
36
 
2.8%
34
 
2.7%
34
 
2.7%
33
 
2.6%
32
 
2.5%
32
 
2.5%
28
 
2.2%
Other values (156) 799
62.6%
Number Forms
ValueCountFrequency (%)
2
50.0%
2
50.0%

사업시행자
Categorical

IMBALANCE 

Distinct6
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size868.0 B
국토교통부
76 
한국수자원공사
 
7
환경부
 
6
강원도
 
1
세종특별자치시
 
1

Length

Max length7
Median length5
Mean length5.0434783
Min length3

Unique

Unique3 ?
Unique (%)3.3%

Sample

1st row국토교통부
2nd row한국수자원공사
3rd row국토교통부
4th row국토교통부
5th row국토교통부

Common Values

ValueCountFrequency (%)
국토교통부 76
82.6%
한국수자원공사 7
 
7.6%
환경부 6
 
6.5%
강원도 1
 
1.1%
세종특별자치시 1
 
1.1%
금강유역환경청 1
 
1.1%

Length

2023-12-13T08:08:26.382278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:26.482632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국토교통부 76
82.6%
한국수자원공사 7
 
7.6%
환경부 6
 
6.5%
강원도 1
 
1.1%
세종특별자치시 1
 
1.1%
금강유역환경청 1
 
1.1%

토지_면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57811.2
Minimum15
Maximum1285399
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2023-12-13T08:08:26.594339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile152.55
Q11886
median5152.5
Q333147.65
95-th percentile339319.05
Maximum1285399
Range1285384
Interquartile range (IQR)31261.65

Descriptive statistics

Standard deviation175896.55
Coefficient of variation (CV)3.0426034
Kurtosis28.561429
Mean57811.2
Median Absolute Deviation (MAD)4590
Skewness4.9943275
Sum5318630.4
Variance3.0939598 × 1010
MonotonicityNot monotonic
2023-12-13T08:08:26.729686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
219507.0 1
 
1.1%
42427.0 1
 
1.1%
46867.0 1
 
1.1%
1122.0 1
 
1.1%
847.0 1
 
1.1%
331.0 1
 
1.1%
33677.0 1
 
1.1%
992.0 1
 
1.1%
54481.0 1
 
1.1%
2950.0 1
 
1.1%
Other values (82) 82
89.1%
ValueCountFrequency (%)
15.0 1
1.1%
41.0 1
1.1%
43.0 1
1.1%
69.0 1
1.1%
141.0 1
1.1%
162.0 1
1.1%
264.0 1
1.1%
331.0 1
1.1%
380.0 1
1.1%
524.0 1
1.1%
ValueCountFrequency (%)
1285399.0 1
1.1%
655502.0 1
1.1%
611069.2 1
1.1%
542271.349 1
1.1%
485756.0 1
1.1%
219507.0 1
1.1%
121892.0 1
1.1%
120263.0 1
1.1%
96476.3 1
1.1%
87238.0 1
1.1%

토지_보상금액(원)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.165295 × 109
Minimum516750
Maximum6.998292 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2023-12-13T08:08:26.848290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum516750
5-th percentile3662891.5
Q176228332
median3.1029791 × 108
Q37.4661286 × 108
95-th percentile8.0679603 × 109
Maximum6.998292 × 1010
Range6.9982403 × 1010
Interquartile range (IQR)6.7038452 × 108

Descriptive statistics

Standard deviation8.008396 × 109
Coefficient of variation (CV)3.6985242
Kurtosis58.216778
Mean2.165295 × 109
Median Absolute Deviation (MAD)2.5967415 × 108
Skewness7.1823457
Sum1.9920714 × 1011
Variance6.4134406 × 1019
MonotonicityNot monotonic
2023-12-13T08:08:26.986517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1525424240 1
 
1.1%
3175547179 1
 
1.1%
878150700 1
 
1.1%
27853000 1
 
1.1%
21810250 1
 
1.1%
38194500 1
 
1.1%
1174357950 1
 
1.1%
34635800 1
 
1.1%
3751787330 1
 
1.1%
322911400 1
 
1.1%
Other values (82) 82
89.1%
ValueCountFrequency (%)
516750 1
1.1%
1790040 1
1.1%
1902160 1
1.1%
1917730 1
1.1%
2647080 1
1.1%
4494010 1
1.1%
13792000 1
1.1%
14964900 1
1.1%
18167000 1
1.1%
18210040 1
1.1%
ValueCountFrequency (%)
69982919860 1
1.1%
24311657210 1
1.1%
19159290380 1
1.1%
10519289990 1
1.1%
9734916200 1
1.1%
6704087280 1
1.1%
6084316968 1
1.1%
5937202450 1
1.1%
4230778880 1
1.1%
4025912680 1
1.1%

Interactions

2023-12-13T08:08:24.831836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.241783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.539623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.908539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.318213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.623080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:25.012615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.440850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:24.724610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:08:27.072726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업종류사업명사업시행자토지_면적(제곱미터)토지_보상금액(원)
순번1.0000.1260.9430.2520.0270.089
사업종류0.1261.0001.0000.0700.5100.246
사업명0.9431.0001.0000.8371.0001.000
사업시행자0.2520.0700.8371.0000.0000.000
토지_면적(제곱미터)0.0270.5101.0000.0001.0000.929
토지_보상금액(원)0.0890.2461.0000.0000.9291.000
2023-12-13T08:08:27.160436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업시행자사업종류
사업시행자1.0000.014
사업종류0.0141.000
2023-12-13T08:08:27.236720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번토지_면적(제곱미터)토지_보상금액(원)사업종류사업시행자
순번1.000-0.132-0.1480.0640.128
토지_면적(제곱미터)-0.1321.0000.8500.2380.000
토지_보상금액(원)-0.1480.8501.0000.1890.000
사업종류0.0640.2380.1891.0000.014
사업시행자0.1280.0000.0000.0141.000

Missing values

2023-12-13T08:08:25.128923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:08:25.227454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업종류사업명사업시행자토지_면적(제곱미터)토지_보상금액(원)
01부항다목적댐건설사업국토교통부219507.01525424240
12수도금강북부급수체계조정(청양계통)한국수자원공사3766.887700910
23수도낙동강중부권급수체계구축사업국토교통부3706.076938980
34용담댐 직하류 하천정비공사국토교통부65891.1808980350
45경인 아라뱃길사업국토교통부21396.06704087280
56수도포천복합화력 용수공급사업국토교통부69.01790040
67굴포천방수로건설사업국토교통부902.0200437330
78충주댐 치수능력증대사업국토교통부7852.0567959800
89수도고덕산업단지 용수공급시설 설치사업국토교통부24460.71783936330
910단지부여 규암지구 친수구역 조성사업국토교통부41.04494010
순번사업종류사업명사업시행자토지_면적(제곱미터)토지_보상금액(원)
8283수도금강남부권급수체계구축사업(익산계통)국토교통부4069.688522240
8384성덕댐 건설사업한국수자원공사55817.0420990790
8485안동댐 치수능력증대사업국토교통부66501.0437405000
8586단지시화MTV 광역교통시설 해안로 확장사업국토교통부2469.0399895290
8687수도대청댐계통(Ⅲ)광역상수도사업(1차)국토교통부6012.0457940560
8788운문댐 직하류 하천정비공사(1공구)국토교통부7147.9205567690
8889군남홍수조절지건설사업국토교통부1821.053303790
8990남강댐 소문제 및 속사제 하천개수공사국토교통부2448.0102765150
9091군위댐 직하류 하천정비공사국토교통부9329.0430714600
9192수도진안계통 급수체계조정사업국토교통부601.014964900