Overview

Dataset statistics

Number of variables6
Number of observations592
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory30.2 KiB
Average record size in memory52.2 B

Variable types

Categorical2
Numeric4

Dataset

Description한국부동산원(구.한국감정원)의 청약홈에서 제공하는 지역별 청약 당첨자 수 현황입니다.※ 매월 25일, 전월까지의 데이터를 제공하며 전월 데이터는 향후 변동될 수 있습니다.
Author한국부동산원
URLhttps://www.data.go.kr/data/15110976/fileData.do

Alerts

30대 이하 is highly overall correlated with 40대 and 2 other fieldsHigh correlation
40대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
50대 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
60대 이상 is highly overall correlated with 30대 이하 and 2 other fieldsHigh correlation
40대 has 15 (2.5%) zerosZeros
50대 has 26 (4.4%) zerosZeros
60대 이상 has 49 (8.3%) zerosZeros

Reproduction

Analysis started2024-04-29 23:02:44.428301
Analysis finished2024-04-29 23:02:48.206672
Duration3.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Categorical

Distinct50
Distinct (%)8.4%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2021-12
 
16
2021-07
 
16
2022-03
 
15
2022-10
 
15
2022-07
 
15
Other values (45)
515 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-02
2nd row2020-02
3rd row2020-02
4th row2020-02
5th row2020-02

Common Values

ValueCountFrequency (%)
2021-12 16
 
2.7%
2021-07 16
 
2.7%
2022-03 15
 
2.5%
2022-10 15
 
2.5%
2022-07 15
 
2.5%
2022-06 15
 
2.5%
2022-12 15
 
2.5%
2021-11 15
 
2.5%
2022-08 15
 
2.5%
2020-05 15
 
2.5%
Other values (40) 440
74.3%

Length

2024-04-30T08:02:48.283756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-12 16
 
2.7%
2021-07 16
 
2.7%
2022-03 15
 
2.5%
2022-10 15
 
2.5%
2022-07 15
 
2.5%
2022-06 15
 
2.5%
2022-12 15
 
2.5%
2021-11 15
 
2.5%
2022-08 15
 
2.5%
2020-05 15
 
2.5%
Other values (40) 440
74.3%

시도
Categorical

Distinct17
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
경기
49 
부산
43 
인천
43 
서울
42 
충남
42 
Other values (12)
373 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row부산
3rd row대구
4th row인천
5th row울산

Common Values

ValueCountFrequency (%)
경기 49
 
8.3%
부산 43
 
7.3%
인천 43
 
7.3%
서울 42
 
7.1%
충남 42
 
7.1%
대구 38
 
6.4%
경남 38
 
6.4%
경북 37
 
6.2%
강원 35
 
5.9%
전북 35
 
5.9%
Other values (7) 190
32.1%

Length

2024-04-30T08:02:48.392235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 49
 
8.3%
인천 43
 
7.3%
부산 43
 
7.3%
서울 42
 
7.1%
충남 42
 
7.1%
대구 38
 
6.4%
경남 38
 
6.4%
경북 37
 
6.2%
전남 35
 
5.9%
강원 35
 
5.9%
Other values (7) 190
32.1%

30대 이하
Real number (ℝ)

HIGH CORRELATION 

Distinct460
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean641.65034
Minimum0
Maximum6103
Zeros4
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-04-30T08:02:48.514035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8
Q192
median322
Q3795.25
95-th percentile2389.6
Maximum6103
Range6103
Interquartile range (IQR)703.25

Descriptive statistics

Standard deviation875.0391
Coefficient of variation (CV)1.363732
Kurtosis8.751079
Mean641.65034
Median Absolute Deviation (MAD)269.5
Skewness2.6569884
Sum379857
Variance765693.43
MonotonicityNot monotonic
2024-04-30T08:02:48.643434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8 5
 
0.8%
83 4
 
0.7%
58 4
 
0.7%
0 4
 
0.7%
7 4
 
0.7%
1 4
 
0.7%
97 4
 
0.7%
31 4
 
0.7%
19 3
 
0.5%
6 3
 
0.5%
Other values (450) 553
93.4%
ValueCountFrequency (%)
0 4
0.7%
1 4
0.7%
2 3
0.5%
3 2
 
0.3%
4 3
0.5%
5 3
0.5%
6 3
0.5%
7 4
0.7%
8 5
0.8%
9 3
0.5%
ValueCountFrequency (%)
6103 1
0.2%
5312 1
0.2%
5248 1
0.2%
4900 1
0.2%
4562 1
0.2%
4547 1
0.2%
4541 1
0.2%
4052 1
0.2%
3990 1
0.2%
3810 1
0.2%

40대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct364
Distinct (%)61.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean322.15878
Minimum0
Maximum2763
Zeros15
Zeros (%)2.5%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-04-30T08:02:48.774588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q145.75
median178
Q3401.25
95-th percentile1232.7
Maximum2763
Range2763
Interquartile range (IQR)355.5

Descriptive statistics

Standard deviation432.11411
Coefficient of variation (CV)1.3413079
Kurtosis7.9708873
Mean322.15878
Median Absolute Deviation (MAD)151
Skewness2.5542486
Sum190718
Variance186722.6
MonotonicityNot monotonic
2024-04-30T08:02:48.906781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 15
 
2.5%
1 12
 
2.0%
2 10
 
1.7%
7 6
 
1.0%
15 5
 
0.8%
48 5
 
0.8%
4 5
 
0.8%
3 5
 
0.8%
188 4
 
0.7%
25 4
 
0.7%
Other values (354) 521
88.0%
ValueCountFrequency (%)
0 15
2.5%
1 12
2.0%
2 10
1.7%
3 5
 
0.8%
4 5
 
0.8%
5 4
 
0.7%
6 3
 
0.5%
7 6
 
1.0%
8 3
 
0.5%
9 3
 
0.5%
ValueCountFrequency (%)
2763 1
0.2%
2744 1
0.2%
2581 1
0.2%
2452 1
0.2%
2441 1
0.2%
2288 1
0.2%
1898 1
0.2%
1872 1
0.2%
1870 1
0.2%
1729 1
0.2%

50대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct291
Distinct (%)49.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean159.79899
Minimum0
Maximum1429
Zeros26
Zeros (%)4.4%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-04-30T08:02:49.051133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q119
median82.5
Q3201.5
95-th percentile593.15
Maximum1429
Range1429
Interquartile range (IQR)182.5

Descriptive statistics

Standard deviation213.74712
Coefficient of variation (CV)1.3376
Kurtosis7.399817
Mean159.79899
Median Absolute Deviation (MAD)72.5
Skewness2.4736261
Sum94601
Variance45687.833
MonotonicityNot monotonic
2024-04-30T08:02:49.187089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 26
 
4.4%
1 17
 
2.9%
3 12
 
2.0%
2 12
 
2.0%
54 10
 
1.7%
12 9
 
1.5%
10 9
 
1.5%
16 8
 
1.4%
19 8
 
1.4%
5 7
 
1.2%
Other values (281) 474
80.1%
ValueCountFrequency (%)
0 26
4.4%
1 17
2.9%
2 12
2.0%
3 12
2.0%
4 6
 
1.0%
5 7
 
1.2%
6 4
 
0.7%
7 4
 
0.7%
8 3
 
0.5%
9 3
 
0.5%
ValueCountFrequency (%)
1429 1
0.2%
1292 1
0.2%
1193 1
0.2%
1131 1
0.2%
1077 1
0.2%
1056 1
0.2%
1026 1
0.2%
1009 1
0.2%
942 1
0.2%
915 1
0.2%

60대 이상
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct217
Distinct (%)36.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean82.856419
Minimum0
Maximum709
Zeros49
Zeros (%)8.3%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-04-30T08:02:49.499671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q110
median42
Q3105
95-th percentile316.35
Maximum709
Range709
Interquartile range (IQR)95

Descriptive statistics

Standard deviation112.92669
Coefficient of variation (CV)1.3629203
Kurtosis7.8057608
Mean82.856419
Median Absolute Deviation (MAD)38
Skewness2.5196719
Sum49051
Variance12752.438
MonotonicityNot monotonic
2024-04-30T08:02:49.643997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 49
 
8.3%
1 20
 
3.4%
2 15
 
2.5%
12 14
 
2.4%
5 12
 
2.0%
16 12
 
2.0%
3 11
 
1.9%
4 9
 
1.5%
10 8
 
1.4%
34 8
 
1.4%
Other values (207) 434
73.3%
ValueCountFrequency (%)
0 49
8.3%
1 20
3.4%
2 15
 
2.5%
3 11
 
1.9%
4 9
 
1.5%
5 12
 
2.0%
6 8
 
1.4%
7 7
 
1.2%
8 7
 
1.2%
9 7
 
1.2%
ValueCountFrequency (%)
709 1
0.2%
675 1
0.2%
674 1
0.2%
669 1
0.2%
641 1
0.2%
610 1
0.2%
503 1
0.2%
501 1
0.2%
495 1
0.2%
485 1
0.2%

Interactions

2024-04-30T08:02:47.654698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:46.365293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:46.930945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:47.297294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:47.748758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:46.671596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:47.025506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:47.382647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:47.844228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:02:46.749733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/