Overview

Dataset statistics

Number of variables5
Number of observations51
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory46.6 B

Variable types

DateTime1
Numeric4

Dataset

Description한국부동산원(구.한국감정원)의 청약홈에서 제공하는 연령별 청약 신청자 수 현황입니다.※ 매월 25일, 전월까지의 데이터를 제공하며 전월 데이터는 향후 변동될 수 있습니다.
Author한국부동산원
URLhttps://www.data.go.kr/data/15110978/fileData.do

Alerts

30대 이하 is highly overall correlated with 40대 and 2 other fieldsHigh correlation
40대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
50대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
60대 이상 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
연월 has unique valuesUnique
30대 이하 has unique valuesUnique
40대 has unique valuesUnique
50대 has unique valuesUnique
60대 이상 has unique valuesUnique

Reproduction

Analysis started2024-05-25 18:50:18.804144
Analysis finished2024-05-25 18:50:25.611349
Duration6.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Date

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
Minimum2020-02-01 00:00:00
Maximum2024-04-01 00:00:00
2024-05-26T03:50:25.891360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:26.565233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

30대 이하
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean133491.94
Minimum254
Maximum553967
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2024-05-26T03:50:27.151499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum254
5-th percentile27461.5
Q144945.5
median99891
Q3184365.5
95-th percentile348614
Maximum553967
Range553713
Interquartile range (IQR)139420

Descriptive statistics

Standard deviation113074.24
Coefficient of variation (CV)0.84704922
Kurtosis2.6814324
Mean133491.94
Median Absolute Deviation (MAD)62370
Skewness1.4853623
Sum6808089
Variance1.2785785 × 1010
MonotonicityNot monotonic
2024-05-26T03:50:27.818771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
231940 1
 
2.0%
137617 1
 
2.0%
79663 1
 
2.0%
44803 1
 
2.0%
27940 1
 
2.0%
37521 1
 
2.0%
29500 1
 
2.0%
70461 1
 
2.0%
28565 1
 
2.0%
254 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
254 1
2.0%
9478 1
2.0%
26983 1
2.0%
27940 1
2.0%
28189 1
2.0%
28565 1
2.0%
29500 1
2.0%
29717 1
2.0%
37521 1
2.0%
38719 1
2.0%
ValueCountFrequency (%)
553967 1
2.0%
369265 1
2.0%
366595 1
2.0%
330633 1
2.0%
293464 1
2.0%
269271 1
2.0%
260887 1
2.0%
257831 1
2.0%
238533 1
2.0%
231940 1
2.0%

40대
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57204.922
Minimum124
Maximum279858
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2024-05-26T03:50:28.300873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum124
5-th percentile10390
Q124469.5
median39338
Q371362.5
95-th percentile141721.5
Maximum279858
Range279734
Interquartile range (IQR)46893

Descriptive statistics

Standard deviation51024.572
Coefficient of variation (CV)0.89196123
Kurtosis6.1799368
Mean57204.922
Median Absolute Deviation (MAD)22463
Skewness2.0815251
Sum2917451
Variance2.603507 × 109
MonotonicityNot monotonic
2024-05-26T03:50:28.878385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
96922 1
 
2.0%
60373 1
 
2.0%
30787 1
 
2.0%
20298 1
 
2.0%
11459 1
 
2.0%
15692 1
 
2.0%
13487 1
 
2.0%
39338 1
 
2.0%
13918 1
 
2.0%
124 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
124 1
2.0%
7541 1
2.0%
9990 1
2.0%
10790 1
2.0%
11459 1
2.0%
13487 1
2.0%
13918 1
2.0%
15692 1
2.0%
17385 1
2.0%
20298 1
2.0%
ValueCountFrequency (%)
279858 1
2.0%
162492 1
2.0%
153960 1
2.0%
129483 1
2.0%
127011 1
2.0%
117328 1
2.0%
111086 1
2.0%
107472 1
2.0%
106707 1
2.0%
101020 1
2.0%

50대
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28342.627
Minimum71
Maximum142728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2024-05-26T03:50:29.445204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum71
5-th percentile4335
Q110994
median21139
Q334910
95-th percentile70225.5
Maximum142728
Range142657
Interquartile range (IQR)23916

Descriptive statistics

Standard deviation26236.516
Coefficient of variation (CV)0.92569103
Kurtosis6.2765866
Mean28342.627
Median Absolute Deviation (MAD)11369
Skewness2.1196733
Sum1445474
Variance6.8835477 × 108
MonotonicityNot monotonic
2024-05-26T03:50:29.960548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
52356 1
 
2.0%
32508 1
 
2.0%
14036 1
 
2.0%
10303 1
 
2.0%
5259 1
 
2.0%
7549 1
 
2.0%
6948 1
 
2.0%
21205 1
 
2.0%
9100 1
 
2.0%
71 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
71 1
2.0%
2036 1
2.0%
3850 1
2.0%
4820 1
2.0%
5259 1
2.0%
6948 1
2.0%
7549 1
2.0%
8813 1
2.0%
9100 1
2.0%
9439 1
2.0%
ValueCountFrequency (%)
142728 1
2.0%
87312 1
2.0%
79364 1
2.0%
61087 1
2.0%
60399 1
2.0%
59230 1
2.0%
55866 1
2.0%
55502 1
2.0%
52356 1
2.0%
51310 1
2.0%

60대 이상
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16987.608
Minimum29
Maximum79778
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2024-05-26T03:50:30.488101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum29
5-th percentile2434
Q15724
median12255
Q321086.5
95-th percentile44838
Maximum79778
Range79749
Interquartile range (IQR)15362.5

Descriptive statistics

Standard deviation15840.36
Coefficient of variation (CV)0.93246558
Kurtosis3.8806958
Mean16987.608
Median Absolute Deviation (MAD)6999
Skewness1.766522
Sum866368
Variance2.5091699 × 108
MonotonicityNot monotonic
2024-05-26T03:50:30.988454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30373 1
 
2.0%
19756 1
 
2.0%
7740 1
 
2.0%
5748 1
 
2.0%
2735 1
 
2.0%
5256 1
 
2.0%
3965 1
 
2.0%
12255 1
 
2.0%
4855 1
 
2.0%
29 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
29 1
2.0%
971 1
2.0%
2393 1
2.0%
2475 1
2.0%
2735 1
2.0%
3965 1
2.0%
4271 1
2.0%
4855 1
2.0%
4959 1
2.0%
5256 1
2.0%
ValueCountFrequency (%)
79778 1
2.0%
52352 1
2.0%
49014 1
2.0%
40662 1
2.0%
38029 1
2.0%
35490 1
2.0%
35460 1
2.0%
34772 1
2.0%
33329 1
2.0%
32611 1
2.0%

Interactions

2024-05-26T03:50:23.704375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:19.089583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:20.687680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:22.078515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:23.970634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:19.327413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:21.025980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:22.497593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:24.336579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:19.692272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:21.277001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:22.832422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:24.674576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:20.310357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:21.696215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-26T03:50:23.249722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-26T03:50:31.350229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월30대 이하40대50대60대 이상
연월1.0001.0001.0001.0001.000
30대 이하1.0001.0000.9500.9810.969
40대1.0000.9501.0000.9590.956
50대1.0000.9810.9591.0000.990
60대 이상1.0000.9690.9560.9901.000
2024-05-26T03:50:31.621018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
30대 이하40대50대60대 이상
30대 이하1.0000.9820.9760.973
40대0.9821.0000.9930.985
50대0.9760.9931.0000.992
60대 이상0.9730.9850.9921.000

Missing values

2024-05-26T03:50:25.108075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-26T03:50:25.436872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월30대 이하40대50대60대 이상
02020-02231940969225235630373
12020-03137617603733250819756
22020-04142168491142240414035
32020-053665951624928731252352
42020-06162583618013340121404
52020-073692651539607936449014
62020-08187372692243047017262
72020-09181359796604214635460
82020-1055396727985814272879778
92020-112385331010205041734772
연월30대 이하40대50대60대 이상
412023-076258631733157168698
422023-088134436257156928711
432023-094953425790107175700
442023-10146397688383173717836
452023-11113132623313119917188
462023-126371536923174128501
472024-01387192167794394959
482024-024450824964132007718
492024-03947875412036971
502024-04269831738588134271