Overview

Dataset statistics

Number of variables4
Number of observations3810
Missing cells0
Missing cells (%)0.0%
Duplicate rows15
Duplicate rows (%)0.4%
Total size in memory126.6 KiB
Average record size in memory34.0 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description지자체별 토지이동내역에 대한 통계자료입니다.제공항목은 통계 기준년도, 행정구역, 토지 이동 종목, 정리 지번 수로 구성되어 있습니다.
Author국토교통부
URLhttps://www.data.go.kr/data/15063998/fileData.do

Alerts

통계 기준년도 has constant value ""Constant
Dataset has 15 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 20:43:21.216945
Analysis finished2023-12-12 20:43:21.723676
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계 기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
2022
3810 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 3810
100.0%

Length

2023-12-13T05:43:21.792811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:43:21.904593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 3810
100.0%
Distinct250
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
2023-12-13T05:43:22.263873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8
Mean length8.480315
Min length7

Characters and Unicode

Total characters32310
Distinct characters149
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 종로구
2nd row서울특별시 종로구
3rd row서울특별시 종로구
4th row서울특별시 종로구
5th row서울특별시 종로구
ValueCountFrequency (%)
경기도 624
 
7.7%
경상북도 396
 
4.9%
전라남도 372
 
4.6%
경상남도 351
 
4.4%
강원도 296
 
3.7%
서울특별시 268
 
3.3%
충청남도 262
 
3.2%
전라북도 247
 
3.1%
충청북도 234
 
2.9%
부산광역시 229
 
2.8%
Other values (243) 4784
59.3%
2023-12-13T05:43:22.816397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4253
 
13.2%
2896
 
9.0%
2564
 
7.9%
1531
 
4.7%
1422
 
4.4%
1392
 
4.3%
1218
 
3.8%
998
 
3.1%
871
 
2.7%
807
 
2.5%
Other values (139) 14358
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28057
86.8%
Space Separator 4253
 
13.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2896
 
10.3%
2564
 
9.1%
1531
 
5.5%
1422
 
5.1%
1392
 
5.0%
1218
 
4.3%
998
 
3.6%
871
 
3.1%
807
 
2.9%
726
 
2.6%
Other values (138) 13632
48.6%
Space Separator
ValueCountFrequency (%)
4253
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28057
86.8%
Common 4253
 
13.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2896
 
10.3%
2564
 
9.1%
1531
 
5.5%
1422
 
5.1%
1392
 
5.0%
1218
 
4.3%
998
 
3.6%
871
 
3.1%
807
 
2.9%
726
 
2.6%
Other values (138) 13632
48.6%
Common
ValueCountFrequency (%)
4253
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28057
86.8%
ASCII 4253
 
13.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4253
100.0%
Hangul
ValueCountFrequency (%)
2896
 
10.3%
2564
 
9.1%
1531
 
5.5%
1422
 
5.1%
1392
 
5.0%
1218
 
4.3%
998
 
3.6%
871
 
3.1%
807
 
2.9%
726
 
2.6%
Other values (138) 13632
48.6%
Distinct26
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
기타
660 
분할되어 본번에 을 부함
231 
번과 합병
231 
지목변경
231 
등록사항 정정대상 토지
231 
Other values (21)
2226 

Length

Max length13
Median length11
Mean length7.7259843
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row분할되어 본번에 을 부함
2nd row번에서 분할
3rd row번과 합병
4th row지목변경
5th row축척변경 시행

Common Values

ValueCountFrequency (%)
기타 660
17.3%
분할되어 본번에 을 부함 231
 
6.1%
번과 합병 231
 
6.1%
지목변경 231
 
6.1%
등록사항 정정대상 토지 231
 
6.1%
등록사항 말소 ( ) 230
 
6.0%
축척변경 시행 221
 
5.8%
등록사항 정정 ( ) 220
 
5.8%
번에서 분할 215
 
5.6%
산 번에서 등록전환 205
 
5.4%
Other values (16) 1135
29.8%

Length

2023-12-13T05:43:22.973068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1304
 
14.9%
등록사항 883
 
10.1%
기타 660
 
7.6%
번에서 428
 
4.9%
축척변경 408
 
4.7%
번과 405
 
4.6%
말소 404
 
4.6%
정정대상 231
 
2.6%
분할되어 231
 
2.6%
토지 231
 
2.6%
Other values (30) 3539
40.6%

정리_지번_수
Real number (ℝ)

Distinct1476
Distinct (%)38.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean940.03438
Minimum0
Maximum25821
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size33.6 KiB
2023-12-13T05:43:23.162904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q120
median131.5
Q3796.75
95-th percentile4876.3
Maximum25821
Range25821
Interquartile range (IQR)776.75

Descriptive statistics

Standard deviation2059.2727
Coefficient of variation (CV)2.1906355
Kurtosis27.266046
Mean940.03438
Median Absolute Deviation (MAD)128.5
Skewness4.3576326
Sum3581531
Variance4240603.9
MonotonicityNot monotonic
2023-12-13T05:43:23.369224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 164
 
4.3%
2 145
 
3.8%
3 91
 
2.4%
4 64
 
1.7%
6 53
 
1.4%
8 49
 
1.3%
5 39
 
1.0%
7 38
 
1.0%
12 35
 
0.9%
15 33
 
0.9%
Other values (1466) 3099
81.3%
ValueCountFrequency (%)
0 2
 
0.1%
1 164
4.3%
2 145
3.8%
3 91
2.4%
4 64
 
1.7%
5 39
 
1.0%
6 53
 
1.4%
7 38
 
1.0%
8 49
 
1.3%
9 26
 
0.7%
ValueCountFrequency (%)
25821 1
< 0.1%
21440 1
< 0.1%
21296 1
< 0.1%
20961 1
< 0.1%
20823 1
< 0.1%
18326 1
< 0.1%
18176 1
< 0.1%
16418 1
< 0.1%
15797 1
< 0.1%
15681 1
< 0.1%

Interactions

2023-12-13T05:43:21.402347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:43:23.512777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지_이동_종목정리_지번_수
토지_이동_종목1.0000.463
정리_지번_수0.4631.000
2023-12-13T05:43:23.612260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정리_지번_수토지_이동_종목
정리_지번_수1.0000.186
토지_이동_종목0.1861.000

Missing values

2023-12-13T05:43:21.544658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:43:21.673936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계 기준년도행정구역토지_이동_종목정리_지번_수
02022서울특별시 종로구분할되어 본번에 을 부함227
12022서울특별시 종로구번에서 분할2
22022서울특별시 종로구번과 합병193
32022서울특별시 종로구지목변경24
42022서울특별시 종로구축척변경 시행2
52022서울특별시 종로구등록사항 정정대상 토지453
62022서울특별시 종로구등록사항 정정 ( )2
72022서울특별시 종로구등록사항 말소 ( )25
82022서울특별시 중구분할되어 본번에 을 부함89
92022서울특별시 중구번과 합병72
통계 기준년도행정구역토지_이동_종목정리_지번_수
38002022제주특별자치도 서귀포시에서 행정구역명칭변경2
38012022제주특별자치도 서귀포시구획정리완료7
38022022제주특별자치도 서귀포시기타2
38032022제주특별자치도 서귀포시경지정리 시행신고폐지3079
38042022제주특별자치도 서귀포시축척변경 시행39
38052022제주특별자치도 서귀포시축척변경 시행폐지1
38062022제주특별자치도 서귀포시등록사항 정정대상 토지6466
38072022제주특별자치도 서귀포시등록사항 정정 ( )157
38082022제주특별자치도 서귀포시등록사항 말소 ( )839
38092022제주특별자치도 서귀포시등록사항 회복 ( )4

Duplicate rows

Most frequently occurring

통계 기준년도행정구역토지_이동_종목정리_지번_수# duplicates
22022강원도 삼척시기타24
42022강원도 속초시기타23
02022강원도 강릉시기타22
12022강원도 고성군기타42
32022강원도 삼척시기타32
52022강원도 속초시기타502
62022강원도 인제군기타22
72022강원도 정선군기타12
82022강원도 철원군기타22
92022강원도 철원군기타32