Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 51 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.3 KiB |
Average record size in memory | 46.6 B |
Variable types
DateTime | 1 |
---|---|
Numeric | 4 |
Dataset
Description | 한국부동산원(구.한국감정원)의 청약홈에서 제공하는 연령별 청약 신청자 수 현황입니다.※ 매월 25일, 전월까지의 데이터를 제공하며 전월 데이터는 향후 변동될 수 있습니다. |
---|---|
Author | 한국부동산원 |
URL | https://www.data.go.kr/data/15110978/fileData.do |
30대 이하 is highly overall correlated with 40대 and 2 other fields | High correlation |
40대 is highly overall correlated with 30대 이하 and 2 other fields | High correlation |
50대 is highly overall correlated with 30대 이하 and 2 other fields | High correlation |
60대 이상 is highly overall correlated with 30대 이하 and 2 other fields | High correlation |
연월 has unique values | Unique |
30대 이하 has unique values | Unique |
40대 has unique values | Unique |
50대 has unique values | Unique |
60대 이상 has unique values | Unique |
Reproduction
Analysis started | 2024-05-25 18:50:18.804144 |
---|---|
Analysis finished | 2024-05-25 18:50:25.611349 |
Duration | 6.81 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연월
Date
UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 540.0 B |
Minimum | 2020-02-01 00:00:00 |
---|---|
Maximum | 2024-04-01 00:00:00 |
Histogram with fixed size bins (bins=50)
30대 이하
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 133491.94 |
Minimum | 254 |
---|---|
Maximum | 553967 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 254 |
---|---|
5-th percentile | 27461.5 |
Q1 | 44945.5 |
median | 99891 |
Q3 | 184365.5 |
95-th percentile | 348614 |
Maximum | 553967 |
Range | 553713 |
Interquartile range (IQR) | 139420 |
Descriptive statistics
Standard deviation | 113074.24 |
---|---|
Coefficient of variation (CV) | 0.84704922 |
Kurtosis | 2.6814324 |
Mean | 133491.94 |
Median Absolute Deviation (MAD) | 62370 |
Skewness | 1.4853623 |
Sum | 6808089 |
Variance | 1.2785785 × 1010 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
231940 | 1 | 2.0% |
137617 | 1 | 2.0% |
79663 | 1 | 2.0% |
44803 | 1 | 2.0% |
27940 | 1 | 2.0% |
37521 | 1 | 2.0% |
29500 | 1 | 2.0% |
70461 | 1 | 2.0% |
28565 | 1 | 2.0% |
254 | 1 | 2.0% |
Other values (41) | 41 |
Value | Count | Frequency (%) |
254 | 1 | |
9478 | 1 | |
26983 | 1 | |
27940 | 1 | |
28189 | 1 | |
28565 | 1 | |
29500 | 1 | |
29717 | 1 | |
37521 | 1 | |
38719 | 1 |
Value | Count | Frequency (%) |
553967 | 1 | |
369265 | 1 | |
366595 | 1 | |
330633 | 1 | |
293464 | 1 | |
269271 | 1 | |
260887 | 1 | |
257831 | 1 | |
238533 | 1 | |
231940 | 1 |
40대
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 57204.922 |
Minimum | 124 |
---|---|
Maximum | 279858 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 124 |
---|---|
5-th percentile | 10390 |
Q1 | 24469.5 |
median | 39338 |
Q3 | 71362.5 |
95-th percentile | 141721.5 |
Maximum | 279858 |
Range | 279734 |
Interquartile range (IQR) | 46893 |
Descriptive statistics
Standard deviation | 51024.572 |
---|---|
Coefficient of variation (CV) | 0.89196123 |
Kurtosis | 6.1799368 |
Mean | 57204.922 |
Median Absolute Deviation (MAD) | 22463 |
Skewness | 2.0815251 |
Sum | 2917451 |
Variance | 2.603507 × 109 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
96922 | 1 | 2.0% |
60373 | 1 | 2.0% |
30787 | 1 | 2.0% |
20298 | 1 | 2.0% |
11459 | 1 | 2.0% |
15692 | 1 | 2.0% |
13487 | 1 | 2.0% |
39338 | 1 | 2.0% |
13918 | 1 | 2.0% |
124 | 1 | 2.0% |
Other values (41) | 41 |
Value | Count | Frequency (%) |
124 | 1 | |
7541 | 1 | |
9990 | 1 | |
10790 | 1 | |
11459 | 1 | |
13487 | 1 | |
13918 | 1 | |
15692 | 1 | |
17385 | 1 | |
20298 | 1 |
Value | Count | Frequency (%) |
279858 | 1 | |
162492 | 1 | |
153960 | 1 | |
129483 | 1 | |
127011 | 1 | |
117328 | 1 | |
111086 | 1 | |
107472 | 1 | |
106707 | 1 | |
101020 | 1 |
50대
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28342.627 |
Minimum | 71 |
---|---|
Maximum | 142728 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 71 |
---|---|
5-th percentile | 4335 |
Q1 | 10994 |
median | 21139 |
Q3 | 34910 |
95-th percentile | 70225.5 |
Maximum | 142728 |
Range | 142657 |
Interquartile range (IQR) | 23916 |
Descriptive statistics
Standard deviation | 26236.516 |
---|---|
Coefficient of variation (CV) | 0.92569103 |
Kurtosis | 6.2765866 |
Mean | 28342.627 |
Median Absolute Deviation (MAD) | 11369 |
Skewness | 2.1196733 |
Sum | 1445474 |
Variance | 6.8835477 × 108 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
52356 | 1 | 2.0% |
32508 | 1 | 2.0% |
14036 | 1 | 2.0% |
10303 | 1 | 2.0% |
5259 | 1 | 2.0% |
7549 | 1 | 2.0% |
6948 | 1 | 2.0% |
21205 | 1 | 2.0% |
9100 | 1 | 2.0% |
71 | 1 | 2.0% |
Other values (41) | 41 |
Value | Count | Frequency (%) |
71 | 1 | |
2036 | 1 | |
3850 | 1 | |
4820 | 1 | |
5259 | 1 | |
6948 | 1 | |
7549 | 1 | |
8813 | 1 | |
9100 | 1 | |
9439 | 1 |
Value | Count | Frequency (%) |
142728 | 1 | |
87312 | 1 | |
79364 | 1 | |
61087 | 1 | |
60399 | 1 | |
59230 | 1 | |
55866 | 1 | |
55502 | 1 | |
52356 | 1 | |
51310 | 1 |
60대 이상
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16987.608 |
Minimum | 29 |
---|---|
Maximum | 79778 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 29 |
---|---|
5-th percentile | 2434 |
Q1 | 5724 |
median | 12255 |
Q3 | 21086.5 |
95-th percentile | 44838 |
Maximum | 79778 |
Range | 79749 |
Interquartile range (IQR) | 15362.5 |
Descriptive statistics
Standard deviation | 15840.36 |
---|---|
Coefficient of variation (CV) | 0.93246558 |
Kurtosis | 3.8806958 |
Mean | 16987.608 |
Median Absolute Deviation (MAD) | 6999 |
Skewness | 1.766522 |
Sum | 866368 |
Variance | 2.5091699 × 108 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
30373 | 1 | 2.0% |
19756 | 1 | 2.0% |
7740 | 1 | 2.0% |
5748 | 1 | 2.0% |
2735 | 1 | 2.0% |
5256 | 1 | 2.0% |
3965 | 1 | 2.0% |
12255 | 1 | 2.0% |
4855 | 1 | 2.0% |
29 | 1 | 2.0% |
Other values (41) | 41 |
Value | Count | Frequency (%) |
29 | 1 | |
971 | 1 | |
2393 | 1 | |
2475 | 1 | |
2735 | 1 | |
3965 | 1 | |
4271 | 1 | |
4855 | 1 | |
4959 | 1 | |
5256 | 1 |
Value | Count | Frequency (%) |
79778 | 1 | |
52352 | 1 | |
49014 | 1 | |
40662 | 1 | |
38029 | 1 | |
35490 | 1 | |
35460 | 1 | |
34772 | 1 | |
33329 | 1 | |
32611 | 1 |
연월 | 30대 이하 | 40대 | 50대 | 60대 이상 | |
---|---|---|---|---|---|
연월 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
30대 이하 | 1.000 | 1.000 | 0.950 | 0.981 | 0.969 |
40대 | 1.000 | 0.950 | 1.000 | 0.959 | 0.956 |
50대 | 1.000 | 0.981 | 0.959 | 1.000 | 0.990 |
60대 이상 | 1.000 | 0.969 | 0.956 | 0.990 | 1.000 |
30대 이하 | 40대 | 50대 | 60대 이상 | |
---|---|---|---|---|
30대 이하 | 1.000 | 0.982 | 0.976 | 0.973 |
40대 | 0.982 | 1.000 | 0.993 | 0.985 |
50대 | 0.976 | 0.993 | 1.000 | 0.992 |
60대 이상 | 0.973 | 0.985 | 0.992 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
연월 | 30대 이하 | 40대 | 50대 | 60대 이상 | |
---|---|---|---|---|---|
0 | 2020-02 | 231940 | 96922 | 52356 | 30373 |
1 | 2020-03 | 137617 | 60373 | 32508 | 19756 |
2 | 2020-04 | 142168 | 49114 | 22404 | 14035 |
3 | 2020-05 | 366595 | 162492 | 87312 | 52352 |
4 | 2020-06 | 162583 | 61801 | 33401 | 21404 |
5 | 2020-07 | 369265 | 153960 | 79364 | 49014 |
6 | 2020-08 | 187372 | 69224 | 30470 | 17262 |
7 | 2020-09 | 181359 | 79660 | 42146 | 35460 |
8 | 2020-10 | 553967 | 279858 | 142728 | 79778 |
9 | 2020-11 | 238533 | 101020 | 50417 | 34772 |
연월 | 30대 이하 | 40대 | 50대 | 60대 이상 | |
---|---|---|---|---|---|
41 | 2023-07 | 62586 | 31733 | 15716 | 8698 |
42 | 2023-08 | 81344 | 36257 | 15692 | 8711 |
43 | 2023-09 | 49534 | 25790 | 10717 | 5700 |
44 | 2023-10 | 146397 | 68838 | 31737 | 17836 |
45 | 2023-11 | 113132 | 62331 | 31199 | 17188 |
46 | 2023-12 | 63715 | 36923 | 17412 | 8501 |
47 | 2024-01 | 38719 | 21677 | 9439 | 4959 |
48 | 2024-02 | 44508 | 24964 | 13200 | 7718 |
49 | 2024-03 | 9478 | 7541 | 2036 | 971 |
50 | 2024-04 | 26983 | 17385 | 8813 | 4271 |