Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 566.4 KiB
Average record size in memory: 58.0 B
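
The size and missing-cell figures above are related by simple arithmetic. The sketch below re-derives them, assuming that 1 KiB is 1024 bytes and that the missing-cell percentage is taken over all rows × variables cells (both are assumptions, not statements from the report).

```python
# Values copied from the dataset statistics above.
n_variables = 6
n_observations = 10_000
total_size_kib = 566.4
missing_cells = 0

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024

# Average record size is the total memory divided by the number of rows.
avg_record_bytes = total_size_bytes / n_observations

# Assumption: the percentage is computed over every cell (rows x variables).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"Average record size: {avg_record_bytes:.1f} B")
print(f"Missing cells: {missing_pct:.1f}%")
```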

Variable types

Numeric: 2
Categorical: 3
Text: 1

Dataset

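Description: Public property (공유지재산) status data for Gimpo-si, Gyeonggi-do. The fields provided are 순번 (serial number), 재산구분 (property classification), 재산관리관 (property manager), 소재지주소 (location address), 토지지목 (land category), 면적 (area), and 데이터기준일자 (data reference date).
Author: Gimpo-si, Gyeonggi-do (경기도 김포시)
URL: https://www.data.go.kr/data/15034890/fileData.do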

Alerts

데이터기준일자 has constant value "2024-03-20" (Constant)
재산구분 is highly imbalanced (90.4%) (Imbalance)
면적(제곱미터) is highly skewed (γ1 = 57.50339836) (Skewed)
순번 has unique values (Unique)
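
The 90.4% imbalance figure is consistent with an entropy-based score, 1 minus the Shannon entropy of the class frequencies divided by the maximum possible entropy. The sketch below applies that formula to the 재산구분 counts from this report (9877 and 123). Whether the profiling tool uses exactly this formula is an assumption, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    # Shannon entropy in bits, skipping empty classes.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    # Maximum entropy is reached when all classes are equally frequent.
    return 1 - entropy / math.log2(n_classes)

# Counts for 재산구분 taken from this report: 행정재산 = 9877, 일반재산 = 123.
score = imbalance_score([9877, 123])
print(f"{score:.1%}")
```

A perfectly balanced two-class column scores 0 under this definition, and a column dominated by one class approaches 1.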

Reproduction

Analysis started: 2024-04-06 08:37:48.665837
Analysis finished: 2024-04-06 08:37:54.147740
Duration: 5.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
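
The reported duration follows from the two timestamps. A minimal check using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-04-06 08:37:48.665837")
finished = datetime.fromisoformat("2024-04-06 08:37:54.147740")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```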

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7914.6486
Minimum: 2
Maximum: 15815
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 816.95
Q1: 3931.75
Median: 7906
Q3: 11864.25
95-th percentile: 15025.05
Maximum: 15815
Range: 15813
Interquartile range (IQR): 7932.5

Descriptive statistics

Standard deviation: 4571.0261
Coefficient of variation (CV): 0.57753999
Kurtosis: -1.208742
Mean: 7914.6486
Median Absolute Deviation (MAD): 3967
Skewness: -0.004014426
Sum: 79146486
Variance: 20894280
Monotonicity: Not monotonic
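
Several of the 순번 statistics are derived from others in the same tables. The sketch below recomputes the range, IQR, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation. It works from the rounded values printed in this report, so the last digits may differ slightly from the tool's own output.

```python
# Values copied from the quantile and descriptive statistics for 순번.
minimum, maximum = 2, 15815
q1, q3 = 3931.75, 11864.25
mean, std = 7914.6486, 4571.0261

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values
cv = std / mean                  # standard deviation relative to the mean
variance = std ** 2              # squared standard deviation

print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.4f}")
print(f"Variance: {variance:.0f}")
```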
Histogram with fixed size bins (bins=50)

Common values

Value                 Count  Frequency (%)
1416                  1      < 0.1%
13227                 1      < 0.1%
9894                  1      < 0.1%
1960                  1      < 0.1%
4026                  1      < 0.1%
1092                  1      < 0.1%
6475                  1      < 0.1%
8137                  1      < 0.1%
15275                 1      < 0.1%
4457                  1      < 0.1%
Other values (9990)   9990   99.9%

Minimum 10 values

Value   Count  Frequency (%)
2       1      < 0.1%
3       1      < 0.1%
4       1      < 0.1%
5       1      < 0.1%
6       1      < 0.1%
7       1      < 0.1%
8       1      < 0.1%
12      1      < 0.1%
13      1      < 0.1%
14      1      < 0.1%

Maximum 10 values

Value   Count  Frequency (%)
15815   1      < 0.1%
15813   1      < 0.1%
15812   1      < 0.1%
15811   1      < 0.1%
15809   1      < 0.1%
15807   1      < 0.1%
15805   1      < 0.1%
15801   1      < 0.1%
15796   1      < 0.1%
15795   1      < 0.1%

재산구분
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

행정재산: 9877
일반재산: 123

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 행정재산
2nd row: 행정재산
3rd row: 행정재산
4th row: 행정재산
5th row: 행정재산

Common Values

Value                               Count  Frequency (%)
행정재산 (administrative property)   9877   98.8%
일반재산 (general property)          123    1.2%

Length

Histogram of lengths of the category

Common Values (Plot)


소재지 주소
Text

Distinct: 9998
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 29
Median length: 28
Mean length: 21.2274
Min length: 15

Characters and Unicode

Total characters: 212274
Distinct characters: 101
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
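
As an illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count the general category of each character in one sample address from this report. The two-letter codes map to the category names used in the tables: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, and `Pd` is Dash Punctuation. `category_counts` is a hypothetical helper, and this is not necessarily how the profiling tool computes its counts.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over the characters of text."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample addresses shown in this report.
sample = "경기도 김포시 운양동 1330-2"
counts = category_counts(sample)

# Print categories from most to least frequent.
for code, n in counts.most_common():
    print(code, n)
```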

Unique

Unique: 9997
Unique (%): > 99.9%

Sample

1st row: 경기도 김포시 운양동 1330-2
2nd row: 경기도 김포시 양촌읍 구래리 425-4
3rd row: 경기도 김포시 고촌읍 신곡리 489-45
4th row: 경기도 김포시 월곶면 개곡리 187-3
5th row: 경기도 김포시 고촌읍 풍곡리 657-30

Value                 Count  Frequency (%)
경기도                 10000  21.0%
김포시                 10000  21.0%
대곶면                 1558   3.3%
고촌읍                 1432   3.0%
통진읍                 1427   3.0%
하성면                 1334   2.8%
양촌읍                 1076   2.3%
월곶면                 696    1.5%
신곡리                 474    1.0%
장기동                 444    0.9%
Other values (8290)   19220  40.3%

Most occurring characters

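Value                        Count  Frequency (%)
(space)                      47801  22.5%
(character not preserved)    10444  4.9%
(character not preserved)    10389  4.9%
(character not preserved)    10091  4.8%
(character not preserved)    10088  4.8%
(character not preserved)    10000  4.7%
(character not preserved)    10000  4.7%
-                            8988   4.2%
1                            8104   3.8%
(character not preserved)    7523   3.5%
Other values (91)            78846  37.1%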

Most occurring categories

Value              Count   Frequency (%)
Other Letter       112770  53.1%
Space Separator    47801   22.5%
Decimal Number     42715   20.1%
Dash Punctuation   8988    4.2%

Most frequent character per category

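Other Letter

Value                        Count  Frequency (%)
(character not preserved)    10444  9.3%
(character not preserved)    10389  9.2%
(character not preserved)    10091  8.9%
(character not preserved)    10088  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    7523   6.7%
(character not preserved)    3935   3.5%
(character not preserved)    3588   3.2%
(character not preserved)    2557   2.3%
Other values (79)            34155  30.3%

Decimal Number

Value   Count  Frequency (%)
1       8104   19.0%
2       5809   13.6%
3       4962   11.6%
4       4562   10.7%
5       3913   9.2%
6       3710   8.7%
8       3021   7.1%
0       2906   6.8%
7       2900   6.8%
9       2828   6.6%

Space Separator

Value     Count  Frequency (%)
(space)   47801  100.0%

Dash Punctuation

Value   Count  Frequency (%)
-       8988   100.0%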

Most occurring scripts

Value    Count   Frequency (%)
Hangul   112770  53.1%
Common   99504   46.9%

Most frequent character per script

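Hangul

Value                        Count  Frequency (%)
(character not preserved)    10444  9.3%
(character not preserved)    10389  9.2%
(character not preserved)    10091  8.9%
(character not preserved)    10088  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    7523   6.7%
(character not preserved)    3935   3.5%
(character not preserved)    3588   3.2%
(character not preserved)    2557   2.3%
Other values (79)            34155  30.3%

Common

Value              Count  Frequency (%)
(space)            47801  48.0%
-                  8988   9.0%
1                  8104   8.1%
2                  5809   5.8%
3                  4962   5.0%
4                  4562   4.6%
5                  3913   3.9%
6                  3710   3.7%
8                  3021   3.0%
0                  2906   2.9%
Other values (2)   5728   5.8%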

Most occurring blocks

Value    Count   Frequency (%)
Hangul   112770  53.1%
ASCII    99504   46.9%

Most frequent character per block

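ASCII

Value              Count  Frequency (%)
(space)            47801  48.0%
-                  8988   9.0%
1                  8104   8.1%
2                  5809   5.8%
3                  4962   5.0%
4                  4562   4.6%
5                  3913   3.9%
6                  3710   3.7%
8                  3021   3.0%
0                  2906   2.9%
Other values (2)   5728   5.8%

Hangul

Value                        Count  Frequency (%)
(character not preserved)    10444  9.3%
(character not preserved)    10389  9.2%
(character not preserved)    10091  8.9%
(character not preserved)    10088  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    10000  8.9%
(character not preserved)    7523   6.7%
(character not preserved)    3935   3.5%
(character not preserved)    3588   3.2%
(character not preserved)    2557   2.3%
Other values (79)            34155  30.3%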

토지지목
Categorical

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

도로: 4673
(value not preserved): 1269
(value not preserved): 1094
(value not preserved): 872
공원: 664
Other values (19): 1428

Length

Max length: 6
Median length: 2
Mean length: 1.7443
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 도로
2nd row: 도로
3rd row: 잡종지
4th row: 도로
5th row: 도로

Common Values

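Value                         Count  Frequency (%)
도로 (road)                    4673   46.7%
(value not preserved)         1269   12.7%
(value not preserved)         1094   10.9%
(value not preserved)         872    8.7%
공원 (park)                    664    6.6%
임야 (forest land)             573    5.7%
구거 (ditch)                   198    2.0%
하천 (river)                   137    1.4%
잡종지 (miscellaneous land)     137    1.4%
공장용지 (factory site)         132    1.3%
Other values (14)             251    2.5%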

Length

Histogram of lengths of the category

면적(제곱미터)
Real number (ℝ)

SKEWED 

Distinct: 2997
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1167.0625
Minimum: -25.46
Maximum: 1022427
Zeros: 1
Zeros (%): < 0.1%
Negative: 1
Negative (%): < 0.1%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -25.46
5-th percentile: 6
Q1: 38
Median: 130
Q3: 479
95-th percentile: 3698.515
Maximum: 1022427
Range: 1022452.5
Interquartile range (IQR): 441

Descriptive statistics

Standard deviation: 13460.286
Coefficient of variation (CV): 11.533475
Kurtosis: 3862.5218
Mean: 1167.0625
Median Absolute Deviation (MAD): 114
Skewness: 57.503398
Sum: 11670625
Variance: 1.8117929 × 10^8
Monotonicity: Not monotonic
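
A skewness of about 57.5 means a long right tail: the mean (1167.0625) sits far above the median (130) because a few very large parcels dominate. The sketch below computes the adjusted Fisher-Pearson sample skewness on a small illustrative list. The list is made up for demonstration and is not taken from the dataset, `sample_skewness` is a hypothetical helper, and whether the report uses exactly this estimator is an assumption.

```python
import math

def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness: sqrt(n(n-1))/(n-2) * m3 / m2**1.5."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    # Second and third central moments (population form).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        return 0.0  # all values identical, no asymmetry to measure
    g1 = m3 / m2 ** 1.5
    # Small-sample bias correction.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1

# Illustrative values only: mostly small areas plus one very large one.
areas = [6.0, 38.0, 130.0, 479.0, 3698.5, 100000.0]
print(sample_skewness(areas) > 0)   # right-tailed data gives positive skewness
print(sample_skewness([1, 2, 3, 4, 5]))  # symmetric data gives zero
```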
Histogram with fixed size bins (bins=50)

Common values

Value                 Count  Frequency (%)
10.0                  127    1.3%
17.0                  115    1.1%
7.0                   115    1.1%
1.0                   107    1.1%
3.0                   106    1.1%
13.0                  105    1.1%
20.0                  104    1.0%
30.0                  82     0.8%
23.0                  76     0.8%
4.0                   76     0.8%
Other values (2987)   8987   89.9%

Minimum 10 values

Value    Count  Frequency (%)
-25.46   1      < 0.1%
0.0      1      < 0.1%
0.22     1      < 0.1%
0.67     1      < 0.1%
0.7      1      < 0.1%
0.8      1      < 0.1%
0.86     1      < 0.1%
1.0      107    1.1%
1.2      1      < 0.1%
2.0      76     0.8%

Maximum 10 values

Value       Count  Frequency (%)
1022427.0   1      < 0.1%
569696.0    1      < 0.1%
524308.9    1      < 0.1%
128448.4    1      < 0.1%
109659.1    1      < 0.1%
107020.2    1      < 0.1%
86751.0     1      < 0.1%
82402.0     1      < 0.1%
77868.0     1      < 0.1%
77129.0     1      < 0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

2024-03-20: 10000

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-03-20
2nd row: 2024-03-20
3rd row: 2024-03-20
4th row: 2024-03-20
5th row: 2024-03-20

Common Values

Value        Count  Frequency (%)
2024-03-20   10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


Interactions


Correlations

                  순번     재산구분   토지지목   면적(제곱미터)
순번               1.000   0.099     0.432     0.035
재산구분            0.099   1.000     0.179     0.000
토지지목            0.432   0.179     1.000     0.051
면적(제곱미터)       0.035   0.000     0.051     1.000

                  토지지목   재산구분
토지지목            1.000     0.141
재산구분            0.141     1.000

                  순번     면적(제곱미터)   재산구분   토지지목
순번               1.000   -0.167         0.076     0.173
면적(제곱미터)      -0.167    1.000         0.000     0.024
재산구분            0.076    0.000         1.000     0.141
토지지목            0.173    0.024         0.141     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  순번     재산구분   소재지 주소                          토지지목   면적(제곱미터)   데이터기준일자
1415     1416    행정재산   경기도 김포시 운양동 1330-2            도로      193.0          2024-03-20
9120     9121    행정재산   경기도 김포시 양촌읍 구래리 425-4       도로      50.0           2024-03-20
7847     7848    행정재산   경기도 김포시 고촌읍 신곡리 489-45      잡종지     3.0            2024-03-20
13548    13549   행정재산   경기도 김포시 월곶면 개곡리 187-3       도로      38.0           2024-03-20
7174     7175    행정재산   경기도 김포시 고촌읍 풍곡리 657-30      도로      1154.0         2024-03-20
12337    12338   행정재산   경기도 김포시 대곶면 석정리 1035-1109.0   2024-03-20
155      156     행정재산   경기도 김포시 북변동 202-663.0   2024-03-20
11574    11575   행정재산   경기도 김포시 대곶면 초원지리 402-17513.0   2024-03-20
4199     4200    행정재산   경기도 김포시 통진읍 마송리 226-2       도로      26.0           2024-03-20
8681     8682    행정재산   경기도 김포시 양촌읍 양곡리 1287        도로      98.2           2024-03-20

(index)  순번     재산구분   소재지 주소                          토지지목   면적(제곱미터)   데이터기준일자
9033     9034    행정재산   경기도 김포시 양촌읍 구래리 56-1        도로      7.0            2024-03-20
2981     2982    행정재산   경기도 김포시 사우동 903               도로      3646.1         2024-03-20
13828    13829   행정재산   경기도 김포시 하성면 마곡리 448-2023.0   2024-03-20
10287    10288   행정재산   경기도 김포시 대곶면 대능리 125889.0   2024-03-20
2085     2086    행정재산   경기도 김포시 장기동 1928-5            도로      824.6          2024-03-20
3241     3242    행정재산   경기도 김포시 풍무동 622-3274.0   2024-03-20
10824    10825   행정재산   경기도 김포시 대곶면 약암리 111-4       도로      203.0          2024-03-20
8249     8250    행정재산   경기도 김포시 고촌읍 신곡리 산 15-4     도로      214.0          2024-03-20
11640    11641   행정재산   경기도 김포시 대곶면 초원지리 426-41.0   2024-03-20
13392    13393   행정재산   경기도 김포시 월곶면 용강리 299-19160.0   2024-03-20