Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1967 |
Missing cells | 84 |
Missing cells (%) | 0.7% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.1% |
Total size in memory | 101.9 KiB |
Average record size in memory | 53.1 B |
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Dataset
Description | 한국부동산원(구.한국감정원)에서 제공하는 부동산거래현황 중 아파트매매 거래현황의 연도별 매입자연령대별(동(호)수) 데이터입니다.-(단위 : 동(호)수)- 공표시기 : 익월 말일경 |
---|---|
Author | 한국부동산원 |
URL | https://www.data.go.kr/data/15068658/fileData.do |
Dataset has 1 (0.1%) duplicate rows | Duplicates |
2019 is highly overall correlated with 2020 and 3 other fields | High correlation |
2020 is highly overall correlated with 2019 and 3 other fields | High correlation |
2021 is highly overall correlated with 2019 and 3 other fields | High correlation |
2022 is highly overall correlated with 2019 and 3 other fields | High correlation |
2023 is highly overall correlated with 2019 and 3 other fields | High correlation |
2019 has 21 (1.1%) missing values | Missing |
2020 has 21 (1.1%) missing values | Missing |
2019 has 22 (1.1%) zeros | Zeros |
2021 has 29 (1.5%) zeros | Zeros |
2022 has 35 (1.8%) zeros | Zeros |
2023 has 38 (1.9%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-23 06:32:16.656756 |
---|---|
Analysis finished | 2024-03-23 06:32:29.027291 |
Duration | 12.37 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
지역 및 연령
Text
Distinct | 1960 |
---|---|
Distinct (%) | 100.0% |
Missing | 7 |
Missing (%) | 0.4% |
Memory size | 15.5 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 10.539286 |
Min length | 6 |
Characters and Unicode
Total characters | 20657 |
---|---|
Distinct characters | 153 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1960 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 전국 /20대이하 |
---|---|
2nd row | 전국 /30대 |
3rd row | 전국 /40대 |
4th row | 전국 /50대 |
5th row | 전국 /60대 |
Value | Count | Frequency (%) |
경기 | 343 | 8.8% |
서울 | 182 | 4.6% |
경북 | 182 | 4.6% |
경남 | 168 | 4.3% |
전남 | 161 | 4.1% |
강원 | 133 | 3.4% |
충남 | 126 | 3.2% |
전북 | 119 | 3.0% |
부산 | 119 | 3.0% |
충북 | 112 | 2.9% |
Other values (1688) | 2275 |
Most occurring characters
Value | Count | Frequency (%) |
1960 | 9.5% | |
/ | 1960 | 9.5% |
대 | 1820 | 8.8% |
0 | 1680 | 8.1% |
구 | 812 | 3.9% |
시 | 777 | 3.8% |
경 | 714 | 3.5% |
기 | 637 | 3.1% |
군 | 609 | 2.9% |
남 | 574 | 2.8% |
Other values (143) | 9114 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 13377 | |
Decimal Number | 3360 | 16.3% |
Space Separator | 1960 | 9.5% |
Other Punctuation | 1960 | 9.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 1820 | 13.6% |
구 | 812 | 6.1% |
시 | 777 | 5.8% |
경 | 714 | 5.3% |
기 | 637 | 4.8% |
군 | 609 | 4.6% |
남 | 574 | 4.3% |
이 | 567 | 4.2% |
북 | 469 | 3.5% |
전 | 350 | 2.6% |
Other values (134) | 6048 |
Decimal Number
Value | Count | Frequency (%) |
0 | 1680 | |
3 | 280 | 8.3% |
5 | 280 | 8.3% |
6 | 280 | 8.3% |
2 | 280 | 8.3% |
7 | 280 | 8.3% |
4 | 280 | 8.3% |
Space Separator
Value | Count | Frequency (%) |
1960 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1960 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 13377 | |
Common | 7280 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 1820 | 13.6% |
구 | 812 | 6.1% |
시 | 777 | 5.8% |
경 | 714 | 5.3% |
기 | 637 | 4.8% |
군 | 609 | 4.6% |
남 | 574 | 4.3% |
이 | 567 | 4.2% |
북 | 469 | 3.5% |
전 | 350 | 2.6% |
Other values (134) | 6048 |
Common
Value | Count | Frequency (%) |
1960 | ||
/ | 1960 | |
0 | 1680 | |
3 | 280 | 3.8% |
5 | 280 | 3.8% |
6 | 280 | 3.8% |
2 | 280 | 3.8% |
7 | 280 | 3.8% |
4 | 280 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 13377 | |
ASCII | 7280 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1960 | ||
/ | 1960 | |
0 | 1680 | |
3 | 280 | 3.8% |
5 | 280 | 3.8% |
6 | 280 | 3.8% |
2 | 280 | 3.8% |
7 | 280 | 3.8% |
4 | 280 | 3.8% |
Hangul
Value | Count | Frequency (%) |
대 | 1820 | 13.6% |
구 | 812 | 6.1% |
시 | 777 | 5.8% |
경 | 714 | 5.3% |
기 | 637 | 4.8% |
군 | 609 | 4.6% |
남 | 574 | 4.3% |
이 | 567 | 4.2% |
북 | 469 | 3.5% |
전 | 350 | 2.6% |
Other values (134) | 6048 |
2019
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 872 |
---|---|
Distinct (%) | 44.8% |
Missing | 21 |
Missing (%) | 1.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 897.90339 |
Minimum | 0 |
---|---|
Maximum | 156664 |
Zeros | 22 |
Zeros (%) | 1.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 40 |
median | 173 |
Q3 | 577.75 |
95-th percentile | 2268.75 |
Maximum | 156664 |
Range | 156664 |
Interquartile range (IQR) | 537.75 |
Descriptive statistics
Standard deviation | 5886.3392 |
---|---|
Coefficient of variation (CV) | 6.5556487 |
Kurtosis | 455.13628 |
Mean | 897.90339 |
Median Absolute Deviation (MAD) | 159 |
Skewness | 19.99382 |
Sum | 1747320 |
Variance | 34648989 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 27 | 1.4% |
7 | 26 | 1.3% |
6 | 23 | 1.2% |
1 | 22 | 1.1% |
0 | 22 | 1.1% |
8 | 20 | 1.0% |
3 | 18 | 0.9% |
13 | 18 | 0.9% |
2 | 18 | 0.9% |
21 | 17 | 0.9% |
Other values (862) | 1735 | |
(Missing) | 21 | 1.1% |
Value | Count | Frequency (%) |
0 | 22 | |
1 | 22 | |
2 | 18 | |
3 | 18 | |
4 | 27 | |
5 | 15 | |
6 | 23 | |
7 | 26 | |
8 | 20 | |
9 | 13 |
Value | Count | Frequency (%) |
156664 | 1 | |
130914 | 1 | |
115110 | 1 | |
63429 | 1 | |
42185 | 1 | |
34386 | 1 | |
29688 | 1 | |
28737 | 1 | |
26809 | 1 | |
23398 | 1 |
2020
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 1055 |
---|---|
Distinct (%) | 54.2% |
Missing | 21 |
Missing (%) | 1.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1544.7297 |
Minimum | 0 |
---|---|
Maximum | 257112 |
Zeros | 18 |
Zeros (%) | 0.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5 |
Q1 | 55 |
median | 292 |
Q3 | 991.25 |
95-th percentile | 3825.75 |
Maximum | 257112 |
Range | 257112 |
Interquartile range (IQR) | 936.25 |
Descriptive statistics
Standard deviation | 10011.34 |
---|---|
Coefficient of variation (CV) | 6.4809657 |
Kurtosis | 425.64331 |
Mean | 1544.7297 |
Median Absolute Deviation (MAD) | 273 |
Skewness | 19.308694 |
Sum | 3006044 |
Variance | 1.0022693 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 21 | 1.1% |
5 | 18 | 0.9% |
0 | 18 | 0.9% |
7 | 17 | 0.9% |
9 | 17 | 0.9% |
2 | 17 | 0.9% |
4 | 17 | 0.9% |
1 | 16 | 0.8% |
14 | 15 | 0.8% |
6 | 14 | 0.7% |
Other values (1045) | 1776 | |
(Missing) | 21 | 1.1% |
Value | Count | Frequency (%) |
0 | 18 | |
1 | 16 | |
2 | 17 | |
3 | 21 | |
4 | 17 | |
5 | 18 | |
6 | 14 | |
7 | 17 | |
8 | 11 | |
9 | 17 |
Value | Count | Frequency (%) |
257112 | 1 | |
227768 | 1 | |
188046 | 1 | |
115249 | 1 | |
78637 | 1 | |
72071 | 1 | |
56163 | 1 | |
53088 | 1 | |
47945 | 1 | |
44870 | 1 |
2021
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 959 |
---|---|
Distinct (%) | 49.1% |
Missing | 14 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1099.6749 |
Minimum | 0 |
---|---|
Maximum | 169838 |
Zeros | 29 |
Zeros (%) | 1.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 48 |
median | 227 |
Q3 | 714 |
95-th percentile | 2948.8 |
Maximum | 169838 |
Range | 169838 |
Interquartile range (IQR) | 666 |
Descriptive statistics
Standard deviation | 6937.9512 |
---|---|
Coefficient of variation (CV) | 6.3090932 |
Kurtosis | 414.47209 |
Mean | 1099.6749 |
Median Absolute Deviation (MAD) | 211 |
Skewness | 19.069752 |
Sum | 2147665 |
Variance | 48135167 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 29 | 1.5% |
7 | 26 | 1.3% |
2 | 23 | 1.2% |
6 | 19 | 1.0% |
5 | 17 | 0.9% |
1 | 17 | 0.9% |
3 | 16 | 0.8% |
10 | 16 | 0.8% |
11 | 16 | 0.8% |
9 | 15 | 0.8% |
Other values (949) | 1759 |
Value | Count | Frequency (%) |
0 | 29 | |
1 | 17 | |
2 | 23 | |
3 | 16 | |
4 | 14 | |
5 | 17 | |
6 | 19 | |
7 | 26 | |
8 | 13 | |
9 | 15 |
Value | Count | Frequency (%) |
169838 | 1 | |
166281 | 1 | |
127330 | 1 | |
86820 | 1 | |
51711 | 1 | |
46295 | 1 | |
44441 | 1 | |
41111 | 1 | |
33361 | 1 | |
32148 | 1 |
2022
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 677 |
---|---|
Distinct (%) | 34.7% |
Missing | 14 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 488.99334 |
Minimum | 0 |
---|---|
Maximum | 71861 |
Zeros | 35 |
Zeros (%) | 1.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3 |
Q1 | 28 |
median | 103 |
Q3 | 309 |
95-th percentile | 1329.2 |
Maximum | 71861 |
Range | 71861 |
Interquartile range (IQR) | 281 |
Descriptive statistics
Standard deviation | 2998.6797 |
---|---|
Coefficient of variation (CV) | 6.1323528 |
Kurtosis | 401.05252 |
Mean | 488.99334 |
Median Absolute Deviation (MAD) | 91 |
Skewness | 18.899892 |
Sum | 955004 |
Variance | 8992080 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 35 | 1.8% |
1 | 31 | 1.6% |
4 | 31 | 1.6% |
3 | 25 | 1.3% |
2 | 22 | 1.1% |
5 | 22 | 1.1% |
7 | 22 | 1.1% |
8 | 22 | 1.1% |
9 | 21 | 1.1% |
6 | 20 | 1.0% |
Other values (667) | 1702 |
Value | Count | Frequency (%) |
0 | 35 | |
1 | 31 | |
2 | 22 | |
3 | 25 | |
4 | 31 | |
5 | 22 | |
6 | 20 | |
7 | 22 | |
8 | 22 | |
9 | 21 |
Value | Count | Frequency (%) |
71861 | 1 | |
66790 | 1 | |
62704 | 1 | |
41675 | 1 | |
20654 | 1 | |
18045 | 1 | |
16852 | 1 | |
15830 | 1 | |
13827 | 1 | |
10967 | 1 |
2023
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 776 |
---|---|
Distinct (%) | 39.6% |
Missing | 7 |
Missing (%) | 0.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 673.80765 |
Minimum | 0 |
---|---|
Maximum | 109529 |
Zeros | 38 |
Zeros (%) | 1.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3 |
Q1 | 26.75 |
median | 123 |
Q3 | 412 |
95-th percentile | 1689.85 |
Maximum | 109529 |
Range | 109529 |
Interquartile range (IQR) | 385.25 |
Descriptive statistics
Standard deviation | 4445.5488 |
---|---|
Coefficient of variation (CV) | 6.5976526 |
Kurtosis | 437.22859 |
Mean | 673.80765 |
Median Absolute Deviation (MAD) | 113 |
Skewness | 19.771198 |
Sum | 1320663 |
Variance | 19762904 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 38 | 1.9% |
1 | 35 | 1.8% |
5 | 34 | 1.7% |
6 | 28 | 1.4% |
8 | 27 | 1.4% |
11 | 22 | 1.1% |
7 | 22 | 1.1% |
3 | 22 | 1.1% |
4 | 21 | 1.1% |
16 | 20 | 1.0% |
Other values (766) | 1691 |
Value | Count | Frequency (%) |
0 | 38 | |
1 | 35 | |
2 | 20 | |
3 | 22 | |
4 | 21 | |
5 | 34 | |
6 | 28 | |
7 | 22 | |
8 | 27 | |
9 | 15 | 0.8% |
Value | Count | Frequency (%) |
109529 | 1 | |
106272 | 1 | |
88516 | 1 | |
56233 | 1 | |
30935 | 1 | |
27118 | 1 | |
23825 | 1 | |
21363 | 1 | |
18772 | 1 | |
12949 | 1 |
2019 | 2020 | 2021 | 2022 | 2023 | |
---|---|---|---|---|---|
2019 | 1.000 | 0.982 | 0.992 | 0.929 | 0.960 |
2020 | 0.982 | 1.000 | 0.942 | 0.957 | 0.974 |
2021 | 0.992 | 0.942 | 1.000 | 0.939 | 0.966 |
2022 | 0.929 | 0.957 | 0.939 | 1.000 | 0.990 |
2023 | 0.960 | 0.974 | 0.966 | 0.990 | 1.000 |
2019 | 2020 | 2021 | 2022 | 2023 | |
---|---|---|---|---|---|
2019 | 1.000 | 0.970 | 0.929 | 0.881 | 0.934 |
2020 | 0.970 | 1.000 | 0.946 | 0.894 | 0.938 |
2021 | 0.929 | 0.946 | 1.000 | 0.952 | 0.947 |
2022 | 0.881 | 0.894 | 0.952 | 1.000 | 0.928 |
2023 | 0.934 | 0.938 | 0.947 | 0.928 | 1.000 |
지역 및 연령 | 2019 | 2020 | 2021 | 2022 | 2023 | |
---|---|---|---|---|---|---|
0 | 전국 /20대이하 | 23398 | 44870 | 41111 | 18045 | 18772 |
1 | 전국 /30대 | 130914 | 227768 | 166281 | 66790 | 109529 |
2 | 전국 /40대 | 156664 | 257112 | 169838 | 71861 | 106272 |
3 | 전국 /50대 | 115110 | 188046 | 127330 | 62704 | 88516 |
4 | 전국 /60대 | 63429 | 115249 | 86820 | 41675 | 56233 |
5 | 전국 /70대이상 | 28737 | 53088 | 44441 | 20654 | 23825 |
6 | 전국 /기타 | 26809 | 47945 | 33361 | 16852 | 8665 |
7 | 서울 /20대이하 | 2155 | 3622 | 2614 | 862 | 1257 |
8 | 서울 /30대 | 20691 | 31372 | 18116 | 4344 | 12048 |
9 | 서울 /40대 | 20562 | 25804 | 13146 | 3632 | 10425 |
지역 및 연령 | 2019 | 2020 | 2021 | 2022 | 2023 | |
---|---|---|---|---|---|---|
1957 | 제주 서귀포시/60대 | 94 | 128 | 164 | 120 | 101 |
1958 | 제주 서귀포시/70대이상 | 25 | 58 | 86 | 40 | 35 |
1959 | 제주 서귀포시/기타 | 55 | 167 | 30 | 53 | 33 |
1960 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1961 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1962 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1963 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1964 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1965 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1966 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
지역 및 연령 | 2019 | 2020 | 2021 | 2022 | 2023 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 7 |