Overview

Dataset statistics

Number of variables3
Number of observations3855
Missing cells0
Missing cells (%)0.0%
Duplicate rows123
Duplicate rows (%)3.2%
Total size in memory94.2 KiB
Average record size in memory25.0 B

Variable types

Numeric1
Text1
DateTime1

Dataset

Description금강유역환경청 토지매수정보시스템 조성정보에 대한 데이터로 (토지고유코드,소재지,데이터기준일)에 대한 항목정보를 제공합니다.
Author환경부 금강유역환경청
URLhttps://www.data.go.kr/data/15069238/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 123 (3.2%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 04:28:36.612845
Analysis finished2024-03-23 04:28:38.123015
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

토지고유코드
Real number (ℝ)

Distinct3722
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2531512 × 1018
Minimum3.0110121 × 1018
Maximum4.574036 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.0 KiB
2024-03-23T04:28:38.482035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0110121 × 1018
5-th percentile3.0110127 × 1018
Q14.3720385 × 1018
median4.373038 × 1018
Q34.572025 × 1018
95-th percentile4.574025 × 1018
Maximum4.574036 × 1018
Range1.5630239 × 1018
Interquartile range (IQR)1.9998651 × 1017

Descriptive statistics

Standard deviation4.8939386 × 1017
Coefficient of variation (CV)0.11506618
Kurtosis2.4436624
Mean4.2531512 × 1018
Median Absolute Deviation (MAD)6.1903005 × 1016
Skewness-2.037228
Sum-3.2576106 × 1018
Variance2.3950635 × 1035
MonotonicityNot monotonic
2024-03-23T04:28:39.143011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4311135034103060003 5
 
0.1%
4372037034100890001 4
 
0.1%
4373037024101430006 3
 
0.1%
4373037024101430001 3
 
0.1%
4372037037102530002 3
 
0.1%
4372037037102530001 3
 
0.1%
4373033027100980000 3
 
0.1%
3023012500100790005 2
 
0.1%
4372037040200390001 2
 
0.1%
4372037027100060001 2
 
0.1%
Other values (3712) 3825
99.2%
ValueCountFrequency (%)
3011012100101800001 1
< 0.1%
3011012100101800002 1
< 0.1%
3011012100101850000 1
< 0.1%
3011012100101870000 1
< 0.1%
3011012100101870001 1
< 0.1%
3011012100101960000 1
< 0.1%
3011012100102000007 1
< 0.1%
3011012100102010001 1
< 0.1%
3011012100102040000 1
< 0.1%
3011012100102050000 1
< 0.1%
ValueCountFrequency (%)
4574036022103370000 1
< 0.1%
4574036022103350005 1
< 0.1%
4574036022103350002 1
< 0.1%
4574036022103350001 1
< 0.1%
4574035021113520001 1
< 0.1%
4574034030201400000 1
< 0.1%
4574034030121480002 1
< 0.1%
4574034030121420002 1
< 0.1%
4574034030121230018 1
< 0.1%
4574034030121230004 1
< 0.1%
Distinct3722
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size30.2 KiB
2024-03-23T04:28:40.196342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length20.753307
Min length13

Characters and Unicode

Total characters80004
Distinct characters168
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3599 ?
Unique (%)93.4%

Sample

1st row전라북도 진안군 진안읍 운산리 203-1
2nd row대전광역시 동구 세천동 37-14
3rd row대전광역시 동구 직동 142-9
4th row대전광역시 동구 신촌동 411
5th row대전광역시 동구 신촌동 413
ValueCountFrequency (%)
충청북도 2016
 
10.6%
전라북도 1066
 
5.6%
옥천군 949
 
5.0%
진안군 647
 
3.4%
영동군 560
 
2.9%
대전광역시 505
 
2.7%
동구 371
 
2.0%
심천면 357
 
1.9%
금산군 268
 
1.4%
충청남도 268
 
1.4%
Other values (2828) 12010
63.2%
2024-03-23T04:28:42.127468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15162
 
19.0%
3476
 
4.3%
3451
 
4.3%
3328
 
4.2%
3221
 
4.0%
2974
 
3.7%
1 2602
 
3.3%
2596
 
3.2%
2284
 
2.9%
2088
 
2.6%
Other values (158) 38822
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49441
61.8%
Space Separator 15162
 
19.0%
Decimal Number 13390
 
16.7%
Dash Punctuation 2011
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3476
 
7.0%
3451
 
7.0%
3328
 
6.7%
3221
 
6.5%
2974
 
6.0%
2596
 
5.3%
2284
 
4.6%
2088
 
4.2%
1860
 
3.8%
1768
 
3.6%
Other values (146) 22395
45.3%
Decimal Number
ValueCountFrequency (%)
1 2602
19.4%
2 2028
15.1%
3 1570
11.7%
4 1294
9.7%
5 1163
8.7%
6 1071
8.0%
8 953
 
7.1%
7 946
 
7.1%
9 907
 
6.8%
0 856
 
6.4%
Space Separator
ValueCountFrequency (%)
15162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2011
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49441
61.8%
Common 30563
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3476
 
7.0%
3451
 
7.0%
3328
 
6.7%
3221
 
6.5%
2974
 
6.0%
2596
 
5.3%
2284
 
4.6%
2088
 
4.2%
1860
 
3.8%
1768
 
3.6%
Other values (146) 22395
45.3%
Common
ValueCountFrequency (%)
15162
49.6%
1 2602
 
8.5%
2 2028
 
6.6%
- 2011
 
6.6%
3 1570
 
5.1%
4 1294
 
4.2%
5 1163
 
3.8%
6 1071
 
3.5%
8 953
 
3.1%
7 946
 
3.1%
Other values (2) 1763
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49441
61.8%
ASCII 30563
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15162
49.6%
1 2602
 
8.5%
2 2028
 
6.6%
- 2011
 
6.6%
3 1570
 
5.1%
4 1294
 
4.2%
5 1163
 
3.8%
6 1071
 
3.5%
8 953
 
3.1%
7 946
 
3.1%
Other values (2) 1763
 
5.8%
Hangul
ValueCountFrequency (%)
3476
 
7.0%
3451
 
7.0%
3328
 
6.7%
3221
 
6.5%
2974
 
6.0%
2596
 
5.3%
2284
 
4.6%
2088
 
4.2%
1860
 
3.8%
1768
 
3.6%
Other values (146) 22395
45.3%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.2 KiB
Minimum2024-03-14 00:00:00
Maximum2024-03-14 00:00:00
2024-03-23T04:28:42.555875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T04:28:42.933998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-23T04:28:37.082921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-23T04:28:37.658157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T04:28:38.006384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

토지고유코드소재지데이터기준일
04572025033102030001전라북도 진안군 진안읍 운산리 203-12024-03-14
13011012800100370014대전광역시 동구 세천동 37-142024-03-14
23011012700101420009대전광역시 동구 직동 142-92024-03-14
33011013100104110000대전광역시 동구 신촌동 4112024-03-14
43011013100104130000대전광역시 동구 신촌동 4132024-03-14
53011012800100070007대전광역시 동구 세천동 7-72024-03-14
63011013100104130001대전광역시 동구 신촌동 413-12024-03-14
73011013100104140000대전광역시 동구 신촌동 4142024-03-14
83011012100104130000대전광역시 동구 추동 4132024-03-14
93011012800100080007대전광역시 동구 세천동 8-72024-03-14
토지고유코드소재지데이터기준일
38454574025027107280023전라북도 장수군 장수읍 두산리 728-232024-03-14
38464574025027107280024전라북도 장수군 장수읍 두산리 728-242024-03-14
38474574025027107280039전라북도 장수군 장수읍 두산리 728-392024-03-14
38484574025027107500000전라북도 장수군 장수읍 두산리 7502024-03-14
38494574034027115610000전라북도 장수군 천천면 춘송리 15612024-03-14
38504574034027115620000전라북도 장수군 천천면 춘송리 15622024-03-14
38514574034030103300001전라북도 장수군 천천면 연평리 330-12024-03-14
38524574034030103350001전라북도 장수군 천천면 연평리 335-12024-03-14
38534574034030103780002전라북도 장수군 천천면 연평리 378-22024-03-14
38544372037023101610000충청북도 보은군 회남면 조곡리 1612024-03-14

Duplicate rows

Most frequently occurring

토지고유코드소재지데이터기준일# duplicates
204311135034103060003충청북도 청주시 상당구 문의면 품곡리 306-32024-03-145
394372037034100890001충청북도 보은군 회남면 매산리 89-12024-03-144
404372037037102530001충청북도 보은군 회남면 사음리 253-12024-03-143
414372037037102530002충청북도 보은군 회남면 사음리 253-22024-03-143
574373033027100980000충청북도 옥천군 안내면 인포리 982024-03-143
794373037024101430001충청북도 옥천군 군서면 상지리 143-12024-03-143
804373037024101430006충청북도 옥천군 군서면 상지리 143-62024-03-143
03011012600101410002대전광역시 동구 효평동 141-22024-03-142
13011012600101590001대전광역시 동구 효평동 159-12024-03-142
23011012600101620000대전광역시 동구 효평동 1622024-03-142