Overview

Dataset statistics

Number of variables5
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory46.6 B

Variable types

DateTime1
Numeric4

Dataset

Description한국부동산원(구.한국감정원)의 청약홈에서 제공하는 연령별 청약 신청자 수 현황입니다.※ 매월 25일, 전월까지의 데이터를 제공하며 전월 데이터는 향후 변동될 수 있습니다.
Author한국부동산원
URLhttps://www.data.go.kr/data/15110978/fileData.do

Alerts

30대 이하 is highly overall correlated with 40대 and 2 other fieldsHigh correlation
40대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
50대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
60대 이상 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
연월 has unique valuesUnique
30대 이하 has unique valuesUnique
40대 has unique valuesUnique
50대 has unique valuesUnique
60대 이상 has unique valuesUnique

Reproduction

Analysis started2024-04-29 23:02:54.050277
Analysis finished2024-04-29 23:02:57.346145
Duration3.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Date

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2020-02-01 00:00:00
Maximum2024-03-01 00:00:00
2024-04-30T08:02:57.423928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:57.569644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

30대 이하
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135622.12
Minimum254
Maximum553967
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:02:57.698435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum254
5-th percentile28052.05
Q146199.5
median100501
Q3185868.75
95-th percentile350412.1
Maximum553967
Range553713
Interquartile range (IQR)139669.25

Descriptive statistics

Standard deviation113183.81
Coefficient of variation (CV)0.83455275
Kurtosis2.6479873
Mean135622.12
Median Absolute Deviation (MAD)61932
Skewness1.4714066
Sum6781106
Variance1.2810575 × 1010
MonotonicityNot monotonic
2024-04-30T08:02:57.815105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
231940 1
 
2.0%
45088 1
 
2.0%
79663 1
 
2.0%
44803 1
 
2.0%
27940 1
 
2.0%
37521 1
 
2.0%
29500 1
 
2.0%
70461 1
 
2.0%
28565 1
 
2.0%
254 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
254 1
2.0%
9478 1
2.0%
27940 1
2.0%
28189 1
2.0%
28565 1
2.0%
29500 1
2.0%
29717 1
2.0%
37521 1
2.0%
38719 1
2.0%
40242 1
2.0%
ValueCountFrequency (%)
553967 1
2.0%
369265 1
2.0%
366595 1
2.0%
330633 1
2.0%
293464 1
2.0%
269271 1
2.0%
260887 1
2.0%
257831 1
2.0%
238533 1
2.0%
231940 1
2.0%

40대
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58001.32
Minimum124
Maximum279858
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:02:57.934418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum124
5-th percentile10350
Q125170.5
median42031.5
Q371459.25
95-th percentile142945.35
Maximum279858
Range279734
Interquartile range (IQR)46288.75

Descriptive statistics

Standard deviation51221.41
Coefficient of variation (CV)0.88310766
Kurtosis6.0948273
Mean58001.32
Median Absolute Deviation (MAD)21044
Skewness2.0627237
Sum2900066
Variance2.6236328 × 109
MonotonicityNot monotonic
2024-04-30T08:02:58.059324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
96922 1
 
2.0%
23975 1
 
2.0%
30787 1
 
2.0%
20298 1
 
2.0%
11459 1
 
2.0%
15692 1
 
2.0%
13487 1
 
2.0%
39338 1
 
2.0%
13918 1
 
2.0%
124 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
124 1
2.0%
7541 1
2.0%
9990 1
2.0%
10790 1
2.0%
11459 1
2.0%
13487 1
2.0%
13918 1
2.0%
15692 1
2.0%
20298 1
2.0%
21677 1
2.0%
ValueCountFrequency (%)
279858 1
2.0%
162492 1
2.0%
153960 1
2.0%
129483 1
2.0%
127011 1
2.0%
117328 1
2.0%
111086 1
2.0%
107472 1
2.0%
106707 1
2.0%
101020 1
2.0%

50대
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28733.22
Minimum71
Maximum142728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:02:58.216597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum71
5-th percentile4286.5
Q111289.25
median21172
Q335083
95-th percentile71139.35
Maximum142728
Range142657
Interquartile range (IQR)23793.75

Descriptive statistics

Standard deviation26352.672
Coefficient of variation (CV)0.91714999
Kurtosis6.1763901
Mean28733.22
Median Absolute Deviation (MAD)11170
Skewness2.0985316
Sum1436661
Variance6.9446335 × 108
MonotonicityNot monotonic
2024-04-30T08:02:58.364270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
52356 1
 
2.0%
11271 1
 
2.0%
14036 1
 
2.0%
10303 1
 
2.0%
5259 1
 
2.0%
7549 1
 
2.0%
6948 1
 
2.0%
21205 1
 
2.0%
9100 1
 
2.0%
71 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
71 1
2.0%
2036 1
2.0%
3850 1
2.0%
4820 1
2.0%
5259 1
2.0%
6948 1
2.0%
7549 1
2.0%
9100 1
2.0%
9439 1
2.0%
10168 1
2.0%
ValueCountFrequency (%)
142728 1
2.0%
87312 1
2.0%
79364 1
2.0%
61087 1
2.0%
60399 1
2.0%
59230 1
2.0%
55866 1
2.0%
55502 1
2.0%
52356 1
2.0%
51310 1
2.0%

60대 이상
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17241.94
Minimum29
Maximum79778
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2024-04-30T08:02:58.522243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum29
5-th percentile2429.9
Q15939
median12817
Q321245.25
95-th percentile45255.6
Maximum79778
Range79749
Interquartile range (IQR)15306.25

Descriptive statistics

Standard deviation15895.644
Coefficient of variation (CV)0.92191736
Kurtosis3.8061059
Mean17241.94
Median Absolute Deviation (MAD)7375.5
Skewness1.7461683
Sum862097
Variance2.5267149 × 108
MonotonicityNot monotonic
2024-04-30T08:02:58.847377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30373 1
 
2.0%
5443 1
 
2.0%
7740 1
 
2.0%
5748 1
 
2.0%
2735 1
 
2.0%
5256 1
 
2.0%
3965 1
 
2.0%
12255 1
 
2.0%
4855 1
 
2.0%
29 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
29 1
2.0%
971 1
2.0%
2393 1
2.0%
2475 1
2.0%
2735 1
2.0%
3965 1
2.0%
4855 1
2.0%
4959 1
2.0%
5256 1
2.0%
5262 1
2.0%
ValueCountFrequency (%)
79778 1
2.0%
52352 1
2.0%
49014 1
2.0%
40662 1
2.0%
38029 1
2.0%
35490 1
2.0%
35460 1
2.0%
34772 1
2.0%
33329 1
2.0%
32611 1
2.0%

Interactions

2024-04-30T08:02:56.791469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:55.519193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.062018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.422817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.879277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:55.651767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.150584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.508942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.986217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:55.872028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.230947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.602652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:57.089366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:55.961399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.319458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:56.701558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T08:02:58.955398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월30대 이하40대50대60대 이상
연월1.0001.0001.0001.0001.000
30대 이하1.0001.0000.9490.9810.969
40대1.0000.9491.0000.9590.955
50대1.0000.9810.9591.0000.990
60대 이상1.0000.9690.9550.9901.000
2024-04-30T08:02:59.050706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
30대 이하40대50대60대 이상
30대 이하1.0000.9830.9770.972
40대0.9831.0000.9930.984
50대0.9770.9931.0000.992
60대 이상0.9720.9840.9921.000

Missing values

2024-04-30T08:02:57.219895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:02:57.305678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월30대 이하40대50대60대 이상
02020-02231940969225235630373
12020-03137617603733250819756
22020-04142168491142240414035
32020-053665951624928731252352
42020-06162583618013340121404
52020-073692651539607936449014
62020-08187372692243047017262
72020-09181359796604214635460
82020-1055396727985814272879778
92020-112385331010205041734772
연월30대 이하40대50대60대 이상
402023-065688725828113446512
412023-076258631733157168698
422023-088134436257156928711
432023-094953425790107175700
442023-10146397688383173717836
452023-11113132623313119917188
462023-126371536923174128501
472024-01387192167794394959
482024-024450824964132007718
492024-03947875412036971