Overview

Dataset statistics

Number of variables5
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory46.6 B

Variable types

DateTime1
Numeric4

Dataset

Description한국부동산원(구.한국감정원)의 청약홈에서 제공하는 연령별 청약 당첨자 수 현황입니다.※ 매월 25일, 전월까지의 데이터를 제공하며 전월 데이터는 향후 변동될 수 있습니다.
Author한국부동산원
URLhttps://www.data.go.kr/data/15110981/fileData.do

Alerts

30대 이하 is highly overall correlated with 40대 and 2 other fieldsHigh correlation
40대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
50대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
60대 이상 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
연월 has unique valuesUnique
30대 이하 has unique valuesUnique
40대 has unique valuesUnique
60대 이상 has unique valuesUnique

Reproduction

Analysis started2024-04-29 23:03:02.587009
Analysis finished2024-04-29 23:03:05.716381
Duration3.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Date

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2020-02-01 00:00:00
Maximum2024-03-01 00:00:00
2024-04-30T08:03:05.789210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.961162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

30대 이하
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7597.14
Minimum240
Maximum18568
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:03:06.096941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum240
5-th percentile2200.35
Q15353.75
median7325
Q39211.75
95-th percentile14524.55
Maximum18568
Range18328
Interquartile range (IQR)3858

Descriptive statistics

Standard deviation3818.3511
Coefficient of variation (CV)0.50260376
Kurtosis1.089795
Mean7597.14
Median Absolute Deviation (MAD)1917.5
Skewness0.72482455
Sum379857
Variance14579805
MonotonicityNot monotonic
2024-04-30T08:03:06.237159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4986 1
 
2.0%
5407 1
 
2.0%
6114 1
 
2.0%
7664 1
 
2.0%
6542 1
 
2.0%
4324 1
 
2.0%
8670 1
 
2.0%
8788 1
 
2.0%
4475 1
 
2.0%
240 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
240 1
2.0%
435 1
2.0%
2019 1
2.0%
2422 1
2.0%
2766 1
2.0%
3724 1
2.0%
3893 1
2.0%
4037 1
2.0%
4324 1
2.0%
4475 1
2.0%
ValueCountFrequency (%)
18568 1
2.0%
17465 1
2.0%
14534 1
2.0%
14513 1
2.0%
13136 1
2.0%
12782 1
2.0%
11454 1
2.0%
10888 1
2.0%
10264 1
2.0%
9623 1
2.0%

40대
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3814.36
Minimum120
Maximum9722
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:03:06.386573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum120
5-th percentile1077.75
Q12529.5
median3568
Q34790
95-th percentile7062.7
Maximum9722
Range9602
Interquartile range (IQR)2260.5

Descriptive statistics

Standard deviation1937.4952
Coefficient of variation (CV)0.50794764
Kurtosis1.1090933
Mean3814.36
Median Absolute Deviation (MAD)1083.5
Skewness0.75103444
Sum190718
Variance3753887.5
MonotonicityNot monotonic
2024-04-30T08:03:06.521727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2738 1
 
2.0%
2561 1
 
2.0%
2718 1
 
2.0%
3110 1
 
2.0%
3040 1
 
2.0%
2518 1
 
2.0%
4554 1
 
2.0%
5924 1
 
2.0%
2343 1
 
2.0%
120 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
120 1
2.0%
147 1
2.0%
972 1
2.0%
1207 1
2.0%
1726 1
2.0%
1870 1
2.0%
1879 1
2.0%
2285 1
2.0%
2343 1
2.0%
2366 1
2.0%
ValueCountFrequency (%)
9722 1
2.0%
8473 1
2.0%
7177 1
2.0%
6923 1
2.0%
6842 1
2.0%
6024 1
2.0%
5924 1
2.0%
5605 1
2.0%
5557 1
2.0%
5373 1
2.0%

50대
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1892.02
Minimum43
Maximum5283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:03:06.658847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum43
5-th percentile487.95
Q11361.25
median1775
Q32508.5
95-th percentile3455.9
Maximum5283
Range5240
Interquartile range (IQR)1147.25

Descriptive statistics

Standard deviation974.41358
Coefficient of variation (CV)0.5150123
Kurtosis2.0401558
Mean1892.02
Median Absolute Deviation (MAD)543.5
Skewness0.90706717
Sum94601
Variance949481.82
MonotonicityNot monotonic
2024-04-30T08:03:06.802874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
2009 2
 
4.0%
1454 2
 
4.0%
1418 1
 
2.0%
912 1
 
2.0%
1299 1
 
2.0%
1477 1
 
2.0%
1402 1
 
2.0%
1439 1
 
2.0%
2477 1
 
2.0%
2690 1
 
2.0%
Other values (38) 38
76.0%
ValueCountFrequency (%)
43 1
2.0%
68 1
2.0%
438 1
2.0%
549 1
2.0%
824 1
2.0%
912 1
2.0%
1043 1
2.0%
1141 1
2.0%
1151 1
2.0%
1158 1
2.0%
ValueCountFrequency (%)
5283 1
2.0%
3930 1
2.0%
3545 1
2.0%
3347 1
2.0%
3319 1
2.0%
2945 1
2.0%
2876 1
2.0%
2690 1
2.0%
2676 1
2.0%
2652 1
2.0%

60대 이상
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean981.02
Minimum17
Maximum2538
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:03:07.157301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17
5-th percentile270.75
Q1651.75
median924.5
Q31299.25
95-th percentile1806
Maximum2538
Range2521
Interquartile range (IQR)647.5

Descriptive statistics

Standard deviation507.3679
Coefficient of variation (CV)0.51718405
Kurtosis0.97340622
Mean981.02
Median Absolute Deviation (MAD)293
Skewness0.68729412
Sum49051
Variance257422.18
MonotonicityNot monotonic
2024-04-30T08:03:07.305577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
776 1
 
2.0%
503 1
 
2.0%
712 1
 
2.0%
757 1
 
2.0%
671 1
 
2.0%
645 1
 
2.0%
1343 1
 
2.0%
1333 1
 
2.0%
651 1
 
2.0%
29 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
17 1
2.0%
29 1
2.0%
246 1
2.0%
301 1
2.0%
362 1
2.0%
473 1
2.0%
483 1
2.0%
503 1
2.0%
565 1
2.0%
632 1
2.0%
ValueCountFrequency (%)
2538 1
2.0%
2155 1
2.0%
1815 1
2.0%
1795 1
2.0%
1711 1
2.0%
1516 1
2.0%
1510 1
2.0%
1483 1
2.0%
1453 1
2.0%
1343 1
2.0%

Interactions

2024-04-30T08:03:05.248564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.104187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.631658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.947299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.333145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.245158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.713443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.024484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.405636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.461081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.791715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.106404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.478820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.548610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:04.872190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:05.177471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T08:03:07.392708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월30대 이하40대50대60대 이상
연월1.0001.0001.0001.0001.000
30대 이하1.0001.0000.8830.9500.818
40대1.0000.8831.0000.9570.982
50대1.0000.9500.9571.0000.948
60대 이상1.0000.8180.9820.9481.000
2024-04-30T08:03:07.483426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
30대 이하40대50대60대 이상
30대 이하1.0000.9640.9350.952
40대0.9641.0000.9720.978
50대0.9350.9721.0000.974
60대 이상0.9520.9780.9741.000

Missing values

2024-04-30T08:03:05.587702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:03:05.677148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월30대 이하40대50대60대 이상
02020-02498627381418776
12020-03464322851320632
22020-04693232011397807
32020-0514534717735451795
42020-069181482526761516
52020-0718568972252832538
62020-08743933551805787
72020-098464440918501061
82020-109242517826521218
92020-1110888537325191483
연월30대 이하40대50대60대 이상
402023-06664635011940917
412023-07389323791389650
422023-0837241879824362
432023-09533625191151565
442023-1010264560528761510
452023-11630637601782994
462023-12701339372009892
472024-01276617261043473
482024-027162372920091070
492024-034351474317