Overview

Dataset statistics

Number of variables3
Number of observations5988
Missing cells0
Missing cells (%)0.0%
Duplicate rows13
Duplicate rows (%)0.2%
Total size in memory146.3 KiB
Average record size in memory25.0 B

Variable types

Numeric1
Text1
DateTime1

Dataset

Description한강유역환경청 토지매수정보에 대한 데이터로 (토지고유코드,소재지,데이터기준일)에 대한 정보를 제공합니다.
Author환경부 한강유역환경청
URLhttps://www.data.go.kr/data/15069911/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 13 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 05:46:39.810513
Analysis finished2024-03-23 05:46:40.613154
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

토지고유코드
Real number (ℝ)

Distinct5975
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.174374 × 1018
Minimum4.1360256 × 1018
Maximum5.11104 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size52.8 KiB
2024-03-23T14:46:41.198236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.1360256 × 1018
5-th percentile4.136036 × 1018
Q14.1461253 × 1018
median4.182031 × 1018
Q34.183033 × 1018
95-th percentile4.21104 × 1018
Maximum5.11104 × 1018
Range9.750144 × 1017
Interquartile range (IQR)3.69077 × 1016

Descriptive statistics

Standard deviation4.771467 × 1016
Coefficient of variation (CV)0.011430377
Kurtosis270.92445
Mean4.174374 × 1018
Median Absolute Deviation (MAD)2.0995997 × 1016
Skewness14.117733
Sum8.1315628 × 1017
Variance2.2766898 × 1033
MonotonicityNot monotonic
2024-03-23T14:46:41.468334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4183041024104670005 2
 
< 0.1%
4183041021101420000 2
 
< 0.1%
4313037035102190001 2
 
< 0.1%
4183033021105520006 2
 
< 0.1%
4183033021105230003 2
 
< 0.1%
4161035027102940002 2
 
< 0.1%
4183033027101230020 2
 
< 0.1%
4183041024200040000 2
 
< 0.1%
4146110800107010012 2
 
< 0.1%
4183033021105790013 2
 
< 0.1%
Other values (5965) 5968
99.7%
ValueCountFrequency (%)
4136025624100060036 1
< 0.1%
4136025624100060037 1
< 0.1%
4136025624100060039 1
< 0.1%
4136025624100120002 1
< 0.1%
4136025624100130002 1
< 0.1%
4136025624100140002 1
< 0.1%
4136025624100150002 1
< 0.1%
4136025624100400000 1
< 0.1%
4136025624100420000 1
< 0.1%
4136025624100420004 1
< 0.1%
ValueCountFrequency (%)
5111040028106530007 1
< 0.1%
5111040028105800008 1
< 0.1%
5111040028105800003 1
< 0.1%
5111040028103200008 1
< 0.1%
5111040028103200005 1
< 0.1%
5111040028103190000 1
< 0.1%
5111035030106290000 1
< 0.1%
5111035029103750000 1
< 0.1%
5111035029103720003 1
< 0.1%
5111035029103710004 1
< 0.1%
Distinct5975
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size46.9 KiB
2024-03-23T14:46:42.067364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length20.885438
Min length14

Characters and Unicode

Total characters125062
Distinct characters155
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5962 ?
Unique (%)99.6%

Sample

1st row경기도 남양주시 화도읍 금남리 332-7
2nd row경기도 남양주시 화도읍 금남리 449
3rd row경기도 남양주시 화도읍 금남리 457
4th row경기도 남양주시 화도읍 금남리 457-1
5th row경기도 남양주시 화도읍 금남리 449-3
ValueCountFrequency (%)
경기도 5263
 
17.4%
양평군 1433
 
4.7%
용인시 1241
 
4.1%
처인구 1241
 
4.1%
가평군 1101
 
3.6%
청평면 824
 
2.7%
강원도 665
 
2.2%
춘천시 654
 
2.2%
광주시 620
 
2.0%
삼회리 558
 
1.8%
Other values (4846) 16682
55.1%
2024-03-23T14:46:42.838255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24296
19.4%
6256
 
5.0%
5263
 
4.2%
5263
 
4.2%
5080
 
4.1%
- 4762
 
3.8%
1 4195
 
3.4%
4158
 
3.3%
3900
 
3.1%
3456
 
2.8%
Other values (145) 58433
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73086
58.4%
Space Separator 24296
 
19.4%
Decimal Number 22918
 
18.3%
Dash Punctuation 4762
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6256
 
8.6%
5263
 
7.2%
5263
 
7.2%
5080
 
7.0%
4158
 
5.7%
3900
 
5.3%
3456
 
4.7%
2717
 
3.7%
2663
 
3.6%
2482
 
3.4%
Other values (133) 31848
43.6%
Decimal Number
ValueCountFrequency (%)
1 4195
18.3%
2 3119
13.6%
3 2644
11.5%
5 2402
10.5%
4 2290
10.0%
7 1790
7.8%
6 1764
7.7%
8 1598
 
7.0%
9 1565
 
6.8%
0 1551
 
6.8%
Space Separator
ValueCountFrequency (%)
24296
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4762
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73086
58.4%
Common 51976
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6256
 
8.6%
5263
 
7.2%
5263
 
7.2%
5080
 
7.0%
4158
 
5.7%
3900
 
5.3%
3456
 
4.7%
2717
 
3.7%
2663
 
3.6%
2482
 
3.4%
Other values (133) 31848
43.6%
Common
ValueCountFrequency (%)
24296
46.7%
- 4762
 
9.2%
1 4195
 
8.1%
2 3119
 
6.0%
3 2644
 
5.1%
5 2402
 
4.6%
4 2290
 
4.4%
7 1790
 
3.4%
6 1764
 
3.4%
8 1598
 
3.1%
Other values (2) 3116
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73086
58.4%
ASCII 51976
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24296
46.7%
- 4762
 
9.2%
1 4195
 
8.1%
2 3119
 
6.0%
3 2644
 
5.1%
5 2402
 
4.6%
4 2290
 
4.4%
7 1790
 
3.4%
6 1764
 
3.4%
8 1598
 
3.1%
Other values (2) 3116
 
6.0%
Hangul
ValueCountFrequency (%)
6256
 
8.6%
5263
 
7.2%
5263
 
7.2%
5080
 
7.0%
4158
 
5.7%
3900
 
5.3%
3456
 
4.7%
2717
 
3.7%
2663
 
3.6%
2482
 
3.4%
Other values (133) 31848
43.6%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size46.9 KiB
Minimum2024-03-14 00:00:00
Maximum2024-03-14 00:00:00
2024-03-23T14:46:43.022585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:46:43.213011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-23T14:46:40.197149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-23T14:46:40.409469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T14:46:40.557065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

토지고유코드소재지데이터기준일
04136025625103320007경기도 남양주시 화도읍 금남리 332-72024-03-14
14136025625104490000경기도 남양주시 화도읍 금남리 4492024-03-14
24136025625104570000경기도 남양주시 화도읍 금남리 4572024-03-14
34136025625104570001경기도 남양주시 화도읍 금남리 457-12024-03-14
44136025625104490003경기도 남양주시 화도읍 금남리 449-32024-03-14
54136025624104570001경기도 남양주시 화도읍 구암리 457-12024-03-14
64136025624100150002경기도 남양주시 화도읍 구암리 15-22024-03-14
74136025624104560002경기도 남양주시 화도읍 구암리 456-22024-03-14
84136025624104560003경기도 남양주시 화도읍 구암리 456-32024-03-14
94136025624104560004경기도 남양주시 화도읍 구암리 456-42024-03-14
토지고유코드소재지데이터기준일
59784167032022102070002경기도 여주시 흥천면 계신리 207-22024-03-14
59794167034523100720011경기도 여주시 세종대왕면 왕대리 72-112024-03-14
59804167034523100720019경기도 여주시 세종대왕면 왕대리 72-192024-03-14
59814167034523100730010경기도 여주시 세종대왕면 왕대리 73-102024-03-14
59824146110900100300002경기도 용인시 처인구 호동 30-22024-03-14
59834146110900100300001경기도 용인시 처인구 호동 30-12024-03-14
59844146110900100010006경기도 용인시 처인구 호동 1-62024-03-14
59854146110900100010004경기도 용인시 처인구 호동 1-42024-03-14
59864146125024101080005경기도 용인시 처인구 포곡읍 영문리 108-52024-03-14
59874146110900102430000경기도 용인시 처인구 호동 2432024-03-14

Duplicate rows

Most frequently occurring

토지고유코드소재지데이터기준일# duplicates
04146110800107010012경기도 용인시 처인구 운학동 701-122024-03-142
14146125322106510000경기도 용인시 처인구 모현면 갈담리 6512024-03-142
24161035027102940002경기도 광주시 남종면 수청리 294-22024-03-142
34183033021105230003경기도 양평군 양서면 양수리 523-32024-03-142
44183033021105520006경기도 양평군 양서면 양수리 552-62024-03-142
54183033021105790013경기도 양평군 양서면 양수리 579-132024-03-142
64183033027101230020경기도 양평군 양서면 대심리 123-202024-03-142
74183033027101280007경기도 양평군 양서면 대심리 128-72024-03-142
84183041021101420000경기도 양평군 개군면 하자포리 1422024-03-142
94183041024104670005경기도 양평군 개군면 석장리 467-52024-03-142