Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Numeric2
Text2

Dataset

Description경기부동산포털_토지_피합병지
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=7TGEDV444EDLR97HAZ0P34214032&infSeq=1

Reproduction

Analysis started2023-12-10 22:03:09.582651
Analysis finished2023-12-10 22:03:10.556647
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

토지고유번호
Real number (ℝ)

Distinct8420
Distinct (%)84.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1111529 × 1018
Minimum4.1111129 × 1018
Maximum4.1113126 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:03:10.629243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.1111129 × 1018
5-th percentile4.1111129 × 1018
Q14.1111131 × 1018
median4.1111134 × 1018
Q34.1111137 × 1018
95-th percentile4.1113126 × 1018
Maximum4.1113126 × 1018
Range1.9970001 × 1014
Interquartile range (IQR)6.0069601 × 1011

Descriptive statistics

Standard deviation7.9521978 × 1013
Coefficient of variation (CV)1.9342987 × 10-5
Kurtosis0.28137186
Mean4.1111529 × 1018
Median Absolute Deviation (MAD)3.0000027 × 1011
Skewness1.5103806
Sum-6.2633808 × 1018
Variance6.323745 × 1027
MonotonicityNot monotonic
2023-12-11T07:03:10.779589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4111113700101930045 49
 
0.5%
4111312600103340025 16
 
0.2%
4111113300103000000 14
 
0.1%
4111312600103340006 14
 
0.1%
4111113600109060000 13
 
0.1%
4111113600108990000 13
 
0.1%
4111113700100610001 12
 
0.1%
4111113700100610002 12
 
0.1%
4111113400101240002 11
 
0.1%
4111113500102720046 11
 
0.1%
Other values (8410) 9835
98.4%
ValueCountFrequency (%)
4111112900100100001 1
 
< 0.1%
4111112900100110000 1
 
< 0.1%
4111112900100150005 1
 
< 0.1%
4111112900100150006 1
 
< 0.1%
4111112900100160000 5
0.1%
4111112900100160005 1
 
< 0.1%
4111112900100160007 1
 
< 0.1%
4111112900100160008 1
 
< 0.1%
4111112900100190003 1
 
< 0.1%
4111112900100190010 1
 
< 0.1%
ValueCountFrequency (%)
4111312600109830014 1
 
< 0.1%
4111312600109830012 1
 
< 0.1%
4111312600109830005 1
 
< 0.1%
4111312600109810007 1
 
< 0.1%
4111312600109810006 1
 
< 0.1%
4111312600109810005 1
 
< 0.1%
4111312600109800118 5
0.1%
4111312600109800099 1
 
< 0.1%
4111312600109800096 1
 
< 0.1%
4111312600109800095 1
 
< 0.1%
Distinct8418
Distinct (%)84.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:03:10.951085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length17.2611
Min length13

Characters and Unicode

Total characters172611
Distinct characters43
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7428 ?
Unique (%)74.3%

Sample

1st row수원시장안구 영화동 411-7
2nd row수원시장안구 상광교동 231-2
3rd row수원시장안구 파장동 212-5
4th row수원시장안구 정자동 69-6
5th row수원시장안구 율전동 314-2
ValueCountFrequency (%)
수원시장안구 8013
26.5%
수원시권선구 1987
 
6.6%
세류동 1987
 
6.6%
정자동 1173
 
3.9%
율전동 1113
 
3.7%
조원동 1030
 
3.4%
영화동 997
 
3.3%
파장동 806
 
2.7%
이목동 771
 
2.5%
연무동 764
 
2.5%
Other values (7099) 11628
38.4%
2023-12-11T07:03:11.318186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30538
17.7%
11030
 
6.4%
10000
 
5.8%
10000
 
5.8%
10000
 
5.8%
10000
 
5.8%
- 9100
 
5.3%
8819
 
5.1%
8013
 
4.6%
1 7482
 
4.3%
Other values (33) 57629
33.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90716
52.6%
Decimal Number 42257
24.5%
Space Separator 30538
 
17.7%
Dash Punctuation 9100
 
5.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%
Decimal Number
ValueCountFrequency (%)
1 7482
17.7%
2 5761
13.6%
3 5381
12.7%
4 4906
11.6%
5 3852
9.1%
6 3291
7.8%
7 3170
7.5%
8 3152
7.5%
9 2780
 
6.6%
0 2482
 
5.9%
Space Separator
ValueCountFrequency (%)
30538
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90716
52.6%
Common 81895
47.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%
Common
ValueCountFrequency (%)
30538
37.3%
- 9100
 
11.1%
1 7482
 
9.1%
2 5761
 
7.0%
3 5381
 
6.6%
4 4906
 
6.0%
5 3852
 
4.7%
6 3291
 
4.0%
7 3170
 
3.9%
8 3152
 
3.8%
Other values (2) 5262
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90716
52.6%
ASCII 81895
47.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30538
37.3%
- 9100
 
11.1%
1 7482
 
9.1%
2 5761
 
7.0%
3 5381
 
6.6%
4 4906
 
6.0%
5 3852
 
4.7%
6 3291
 
4.0%
7 3170
 
3.9%
8 3152
 
3.8%
Other values (2) 5262
 
6.4%
Hangul
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%

토지연혁순번
Real number (ℝ)

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.334
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:03:11.446915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile5
Maximum36
Range35
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6908962
Coefficient of variation (CV)0.7244628
Kurtosis29.264446
Mean2.334
Median Absolute Deviation (MAD)1
Skewness3.111742
Sum23340
Variance2.8591299
MonotonicityNot monotonic
2023-12-11T07:03:11.567137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 4188
41.9%
3 2000
20.0%
2 1993
19.9%
4 980
 
9.8%
5 422
 
4.2%
6 197
 
2.0%
7 84
 
0.8%
8 59
 
0.6%
9 23
 
0.2%
10 19
 
0.2%
Other values (9) 35
 
0.4%
ValueCountFrequency (%)
1 4188
41.9%
2 1993
19.9%
3 2000
20.0%
4 980
 
9.8%
5 422
 
4.2%
6 197
 
2.0%
7 84
 
0.8%
8 59
 
0.6%
9 23
 
0.2%
10 19
 
0.2%
ValueCountFrequency (%)
36 1
 
< 0.1%
29 1
 
< 0.1%
20 1
 
< 0.1%
17 2
 
< 0.1%
15 4
 
< 0.1%
14 2
 
< 0.1%
13 9
0.1%
12 8
0.1%
11 7
 
0.1%
10 19
0.2%
Distinct8737
Distinct (%)87.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:03:11.983069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length17.2675
Min length13

Characters and Unicode

Total characters172675
Distinct characters43
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7846 ?
Unique (%)78.5%

Sample

1st row수원시장안구 영화동 411-3
2nd row수원시장안구 상광교동 231
3rd row수원시장안구 파장동 212-8
4th row수원시장안구 정자동 69-40
5th row수원시장안구 율전동 314-8
ValueCountFrequency (%)
수원시장안구 8013
26.5%
수원시권선구 1987
 
6.6%
세류동 1987
 
6.6%
정자동 1173
 
3.9%
율전동 1113
 
3.7%
조원동 1030
 
3.4%
영화동 997
 
3.3%
파장동 806
 
2.7%
이목동 771
 
2.5%
연무동 764
 
2.5%
Other values (7265) 11628
38.4%
2023-12-11T07:03:12.536302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30538
17.7%
11030
 
6.4%
10000
 
5.8%
10000
 
5.8%
10000
 
5.8%
10000
 
5.8%
- 9119
 
5.3%
8819
 
5.1%
8013
 
4.6%
1 7559
 
4.4%
Other values (33) 57597
33.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90716
52.5%
Decimal Number 42302
24.5%
Space Separator 30538
 
17.7%
Dash Punctuation 9119
 
5.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%
Decimal Number
ValueCountFrequency (%)
1 7559
17.9%
2 5859
13.9%
3 5310
12.6%
4 4929
11.7%
5 3835
9.1%
6 3239
7.7%
8 3202
7.6%
7 3201
7.6%
9 2695
 
6.4%
0 2473
 
5.8%
Space Separator
ValueCountFrequency (%)
30538
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9119
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90716
52.5%
Common 81959
47.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%
Common
ValueCountFrequency (%)
30538
37.3%
- 9119
 
11.1%
1 7559
 
9.2%
2 5859
 
7.1%
3 5310
 
6.5%
4 4929
 
6.0%
5 3835
 
4.7%
6 3239
 
4.0%
8 3202
 
3.9%
7 3201
 
3.9%
Other values (2) 5168
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90716
52.5%
ASCII 81959
47.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30538
37.3%
- 9119
 
11.1%
1 7559
 
9.2%
2 5859
 
7.1%
3 5310
 
6.5%
4 4929
 
6.0%
5 3835
 
4.7%
6 3239
 
4.0%
8 3202
 
3.9%
7 3201
 
3.9%
Other values (2) 5168
 
6.3%
Hangul
ValueCountFrequency (%)
11030
12.2%
10000
11.0%
10000
11.0%
10000
11.0%
10000
11.0%
8819
9.7%
8013
8.8%
1987
 
2.2%
1987
 
2.2%
1987
 
2.2%
Other values (21) 16893
18.6%

Interactions

2023-12-11T07:03:10.207788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:09.994559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:10.302825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:03:10.108829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:03:12.654356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지고유번호토지연혁순번
토지고유번호1.0000.049
토지연혁순번0.0491.000
2023-12-11T07:03:12.981225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지고유번호토지연혁순번
토지고유번호1.000-0.079
토지연혁순번-0.0791.000

Missing values

2023-12-11T07:03:10.422307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:03:10.511106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

토지고유번호토지소재지토지연혁순번피합병지
449594111113400104110007수원시장안구 영화동 411-71수원시장안구 영화동 411-3
749484111113800102310002수원시장안구 상광교동 231-21수원시장안구 상광교동 231
19514111112900102120005수원시장안구 파장동 212-51수원시장안구 파장동 212-8
146034111113000100690006수원시장안구 정자동 69-62수원시장안구 정자동 69-40
306644111113200103140002수원시장안구 율전동 314-24수원시장안구 율전동 314-8
762964111113700101930416수원시장안구 연무동 193-4164수원시장안구 연무동 160-51
85854111113000105320035수원시장안구 정자동 532-351수원시장안구 정자동 532-4
73134111112900201380009수원시장안구 파장동 산 138-91수원시장안구 파장동 산 138-1
400814111113300103850002수원시장안구 천천동 385-22수원시장안구 천천동 385-17
244734111113000108860008수원시장안구 정자동 886-86수원시장안구 정자동 92-4
토지고유번호토지소재지토지연혁순번피합병지
629064111113700101720008수원시장안구 연무동 172-83수원시장안구 연무동 171-1
38334111112900103750004수원시장안구 파장동 375-43수원시장안구 파장동 375-32
567624111113600100650005수원시장안구 조원동 65-51수원시장안구 조원동 65-2
278444111113200103340019수원시장안구 율전동 334-193수원시장안구 율전동 334-67
450024111113400104160007수원시장안구 영화동 416-71수원시장안구 영화동 416-4
212654111113100104030011수원시장안구 이목동 403-119수원시장안구 이목동 403-28
615784111113500103740004수원시장안구 송죽동 374-44수원시장안구 송죽동 374-32
172344111113000100790048수원시장안구 정자동 79-482수원시장안구 정자동 79-27
419194111113300102730002수원시장안구 천천동 273-24수원시장안구 천천동 273-6
215044111113100104030021수원시장안구 이목동 403-211수원시장안구 이목동 403-3