Overview

Dataset statistics

Number of variables5
Number of observations2927
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory123.0 KiB
Average record size in memory43.0 B

Variable types

Categorical2
Text1
Numeric2

Dataset

Description외국인 부동산 보유 집계 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=3OFLQ7M8ZNQWJTVSN66824453844&infSeq=1

Alerts

지번수(개) is highly overall correlated with 토지면적(㎡)High correlation
토지면적(㎡) is highly overall correlated with 지번수(개)High correlation
지번수(개) has 309 (10.6%) zerosZeros
토지면적(㎡) has 309 (10.6%) zerosZeros

Reproduction

Analysis started2023-12-10 21:22:31.260556
Analysis finished2023-12-10 21:22:32.104743
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년월
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
201512
1490 
201412
1437 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row201512
2nd row201512
3rd row201512
4th row201512
5th row201512

Common Values

ValueCountFrequency (%)
201512 1490
50.9%
201412 1437
49.1%

Length

2023-12-11T06:22:32.162835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:22:32.267221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
201512 1490
50.9%
201412 1437
49.1%

시군명
Categorical

Distinct31
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
화성시
244 
평택시
209 
파주시
194 
양평군
 
187
여주시
 
181
Other values (26)
1912 

Length

Max length4
Median length3
Mean length3.0563717
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
화성시 244
 
8.3%
평택시 209
 
7.1%
파주시 194
 
6.6%
양평군 187
 
6.4%
여주시 181
 
6.2%
용인시 174
 
5.9%
이천시 172
 
5.9%
안성시 166
 
5.7%
광주시 127
 
4.3%
가평군 118
 
4.0%
Other values (21) 1155
39.5%

Length

2023-12-11T06:22:32.370076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화성시 244
 
8.3%
평택시 209
 
7.1%
파주시 194
 
6.6%
양평군 187
 
6.4%
여주시 181
 
6.2%
용인시 174
 
5.9%
이천시 172
 
5.9%
안성시 166
 
5.7%
광주시 127
 
4.3%
가평군 118
 
4.0%
Other values (21) 1155
39.5%
Distinct1513
Distinct (%)51.7%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
2023-12-11T06:22:32.695480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length14.407243
Min length10

Characters and Unicode

Total characters42170
Distinct characters296
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)3.4%

Sample

1st row경기도 가평군 설악면 엄소리
2nd row경기도 가평군 가평읍 대곡리
3rd row경기도 가평군 설악면 묵안리
4th row경기도 가평군 설악면 가일리
5th row경기도 가평군 설악면 방일리
ValueCountFrequency (%)
경기도 2927
26.1%
화성시 244
 
2.2%
평택시 209
 
1.9%
파주시 194
 
1.7%
양평군 187
 
1.7%
여주시 181
 
1.6%
용인시 174
 
1.6%
이천시 172
 
1.5%
안성시 166
 
1.5%
광주시 127
 
1.1%
Other values (1544) 6640
59.2%
2023-12-11T06:22:33.182087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8294
19.7%
3041
 
7.2%
2994
 
7.1%
2935
 
7.0%
2598
 
6.2%
1917
 
4.5%
1346
 
3.2%
1275
 
3.0%
775
 
1.8%
750
 
1.8%
Other values (286) 16245
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33864
80.3%
Space Separator 8294
 
19.7%
Decimal Number 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3041
 
9.0%
2994
 
8.8%
2935
 
8.7%
2598
 
7.7%
1917
 
5.7%
1346
 
4.0%
1275
 
3.8%
775
 
2.3%
750
 
2.2%
681
 
2.0%
Other values (282) 15552
45.9%
Decimal Number
ValueCountFrequency (%)
3 4
33.3%
2 4
33.3%
1 4
33.3%
Space Separator
ValueCountFrequency (%)
8294
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33864
80.3%
Common 8306
 
19.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3041
 
9.0%
2994
 
8.8%
2935
 
8.7%
2598
 
7.7%
1917
 
5.7%
1346
 
4.0%
1275
 
3.8%
775
 
2.3%
750
 
2.2%
681
 
2.0%
Other values (282) 15552
45.9%
Common
ValueCountFrequency (%)
8294
99.9%
3 4
 
< 0.1%
2 4
 
< 0.1%
1 4
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33864
80.3%
ASCII 8306
 
19.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8294
99.9%
3 4
 
< 0.1%
2 4
 
< 0.1%
1 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
3041
 
9.0%
2994
 
8.8%
2935
 
8.7%
2598
 
7.7%
1917
 
5.7%
1346
 
4.0%
1275
 
3.8%
775
 
2.3%
750
 
2.2%
681
 
2.0%
Other values (282) 15552
45.9%

지번수(개)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct48
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.0949778
Minimum0
Maximum69
Zeros309
Zeros (%)10.6%
Negative0
Negative (%)0.0%
Memory size25.9 KiB
2023-12-11T06:22:33.332409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q36
95-th percentile17
Maximum69
Range69
Interquartile range (IQR)5

Descriptive statistics

Standard deviation6.5216291
Coefficient of variation (CV)1.2800113
Kurtosis20.710531
Mean5.0949778
Median Absolute Deviation (MAD)2
Skewness3.6658197
Sum14913
Variance42.531646
MonotonicityNot monotonic
2023-12-11T06:22:33.455573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
1 541
18.5%
2 382
13.1%
3 334
11.4%
0 309
10.6%
4 275
9.4%
5 198
 
6.8%
6 179
 
6.1%
7 127
 
4.3%
8 102
 
3.5%
9 74
 
2.5%
Other values (38) 406
13.9%
ValueCountFrequency (%)
0 309
10.6%
1 541
18.5%
2 382
13.1%
3 334
11.4%
4 275
9.4%
5 198
 
6.8%
6 179
 
6.1%
7 127
 
4.3%
8 102
 
3.5%
9 74
 
2.5%
ValueCountFrequency (%)
69 1
< 0.1%
66 1
< 0.1%
62 2
0.1%
56 2
0.1%
54 1
< 0.1%
52 2
0.1%
49 1
< 0.1%
47 2
0.1%
46 2
0.1%
43 2
0.1%

토지면적(㎡)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1719
Distinct (%)58.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21247.931
Minimum0
Maximum1794212
Zeros309
Zeros (%)10.6%
Negative0
Negative (%)0.0%
Memory size25.9 KiB
2023-12-11T06:22:33.625882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1837.65
median3504
Q312135.5
95-th percentile71992.1
Maximum1794212
Range1794212
Interquartile range (IQR)11297.85

Descriptive statistics

Standard deviation92089.259
Coefficient of variation (CV)4.3340342
Kurtosis204.93863
Mean21247.931
Median Absolute Deviation (MAD)3219
Skewness12.805054
Sum62192693
Variance8.4804316 × 109
MonotonicityNot monotonic
2023-12-11T06:22:34.045819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 309
 
10.6%
992.0 7
 
0.2%
350.0 6
 
0.2%
1653.0 5
 
0.2%
215.0 5
 
0.2%
331.0 4
 
0.1%
1154.0 4
 
0.1%
337.0 4
 
0.1%
1028.0 4
 
0.1%
609.0 4
 
0.1%
Other values (1709) 2575
88.0%
ValueCountFrequency (%)
0.0 309
10.6%
3.0 2
 
0.1%
7.0 1
 
< 0.1%
12.0 2
 
0.1%
20.0 1
 
< 0.1%
27.0 2
 
0.1%
30.0 2
 
0.1%
40.0 2
 
0.1%
43.0 2
 
0.1%
50.0 2
 
0.1%
ValueCountFrequency (%)
1794212.0 1
< 0.1%
1793406.0 1
< 0.1%
1727668.0 1
< 0.1%
1708821.0 1
< 0.1%
1134573.0 1
< 0.1%
1070881.0 2
0.1%
1021563.0 2
0.1%
701670.0 2
0.1%
695512.0 1
< 0.1%
695358.0 1
< 0.1%

Interactions

2023-12-11T06:22:31.731515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:22:31.551027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:22:31.827619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:22:31.640256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:22:34.140278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년월시군명지번수(개)토지면적(㎡)
기준년월1.0000.0000.0000.000
시군명0.0001.0000.4000.180
지번수(개)0.0000.4001.0000.224
토지면적(㎡)0.0000.1800.2241.000
2023-12-11T06:22:34.252356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명기준년월
시군명1.0000.000
기준년월0.0001.000
2023-12-11T06:22:34.330172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지번수(개)토지면적(㎡)기준년월시군명
지번수(개)1.0000.6960.0000.152
토지면적(㎡)0.6961.0000.0000.075
기준년월0.0000.0001.0000.000
시군명0.1520.0750.0001.000

Missing values

2023-12-11T06:22:31.955653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:22:32.067646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년월시군명법정동명지번수(개)토지면적(㎡)
0201512가평군경기도 가평군 설악면 엄소리858937.0
1201512가평군경기도 가평군 가평읍 대곡리97023.0
2201512가평군경기도 가평군 설악면 묵안리41015.0
3201512가평군경기도 가평군 설악면 가일리2449414.0
4201512가평군경기도 가평군 설악면 방일리1520125.0
5201512가평군경기도 가평군 설악면 천안리196000.0
6201512가평군경기도 가평군 설악면 이천리00.0
7201512가평군경기도 가평군 청평면 청평리3343878.0
8201512가평군경기도 가평군 청평면 상천리1412649.0
9201512가평군경기도 가평군 청평면 하천리2612844.0
기준년월시군명법정동명지번수(개)토지면적(㎡)
2917201412화성시경기도 화성시 송산면 신천리27439.0
2918201412화성시경기도 화성시 송산면 독지리1992.0
2919201412화성시경기도 화성시 송산면 마산리15455.0
2920201412화성시경기도 화성시 송산면 고포리1011022.3
2921201412화성시경기도 화성시 송산면 지화리34370.0
2922201412화성시경기도 화성시 송산면 육일리00.0
2923201412화성시경기도 화성시 서신면 전곡리00.0
2924201412화성시경기도 화성시 서신면 상안리317541.0
2925201412화성시경기도 화성시 서신면 광평리1355.0
2926201412화성시경기도 화성시 서신면 송교리79093.1