Overview

Dataset statistics

Number of variables5
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory49.3 B

Variable types

Text1
Numeric4

Dataset

Description서울주택도시공사의 전세임대주택(신혼부부 1 유형)의 공급계획입니다 신혼부부 Ⅰ : 전년도 도시근로자 가구당 월평균 소득의 70%이하인 자 (배우자가 소득이 있는 경우 90% 이하) * 공급계획은 언제나 변경 될 수 있습니다. ※ 지원한도액을 초과하는 전세주택은 전세금액을 입주자가 부담할 경우 지원가능. 단, 전세금은 호당 대출한도액의 250%이내로 제한하되 세대원수가 5인 이상일 경우 예외 가능
URLhttps://www.data.go.kr/data/15088459/fileData.do

Alerts

2023년 호수 is highly overall correlated with 2022년 호수 and 2 other fieldsHigh correlation
2022년 호수 is highly overall correlated with 2023년 호수 and 2 other fieldsHigh correlation
2021년 호수 is highly overall correlated with 2023년 호수 and 2 other fieldsHigh correlation
2020년 호수 is highly overall correlated with 2023년 호수 and 2 other fieldsHigh correlation
지역 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:39:45.721728
Analysis finished2023-12-12 08:39:47.904608
Duration2.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T17:39:48.061091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.08
Min length2

Characters and Unicode

Total characters77
Distinct characters36
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row강남구
2nd row강동구
3rd row강북구
4th row강서구
5th row관악구
ValueCountFrequency (%)
강남구 1
 
4.0%
서대문구 1
 
4.0%
중구 1
 
4.0%
종로구 1
 
4.0%
은평구 1
 
4.0%
용산구 1
 
4.0%
영등포구 1
 
4.0%
양천구 1
 
4.0%
송파구 1
 
4.0%
성북구 1
 
4.0%
Other values (15) 15
60.0%
2023-12-12T17:39:48.765680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
33.8%
4
 
5.2%
4
 
5.2%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 28
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 77
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
33.8%
4
 
5.2%
4
 
5.2%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 28
36.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
33.8%
4
 
5.2%
4
 
5.2%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 28
36.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
33.8%
4
 
5.2%
4
 
5.2%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 28
36.4%

2023년 호수
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T17:39:48.924952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median4
Q35
95-th percentile6.8
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.7559423
Coefficient of variation (CV)0.43898557
Kurtosis-0.56622502
Mean4
Median Absolute Deviation (MAD)1
Skewness-0.20076197
Sum100
Variance3.0833333
MonotonicityNot monotonic
2023-12-12T17:39:49.077117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
4 9
36.0%
5 4
16.0%
6 3
 
12.0%
2 3
 
12.0%
1 3
 
12.0%
7 2
 
8.0%
3 1
 
4.0%
ValueCountFrequency (%)
1 3
 
12.0%
2 3
 
12.0%
3 1
 
4.0%
4 9
36.0%
5 4
16.0%
6 3
 
12.0%
7 2
 
8.0%
ValueCountFrequency (%)
7 2
 
8.0%
6 3
 
12.0%
5 4
16.0%
4 9
36.0%
3 1
 
4.0%
2 3
 
12.0%
1 3
 
12.0%

2022년 호수
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8
Minimum1
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T17:39:49.202537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.4
Q16
median8
Q311
95-th percentile13.6
Maximum17
Range16
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.8405729
Coefficient of variation (CV)0.48007161
Kurtosis0.14683437
Mean8
Median Absolute Deviation (MAD)2
Skewness0.17748672
Sum200
Variance14.75
MonotonicityNot monotonic
2023-12-12T17:39:49.345655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
6 5
20.0%
9 4
16.0%
11 3
12.0%
12 2
 
8.0%
7 2
 
8.0%
1 2
 
8.0%
4 1
 
4.0%
10 1
 
4.0%
17 1
 
4.0%
8 1
 
4.0%
Other values (3) 3
12.0%
ValueCountFrequency (%)
1 2
 
8.0%
3 1
 
4.0%
4 1
 
4.0%
5 1
 
4.0%
6 5
20.0%
7 2
 
8.0%
8 1
 
4.0%
9 4
16.0%
10 1
 
4.0%
11 3
12.0%
ValueCountFrequency (%)
17 1
 
4.0%
14 1
 
4.0%
12 2
 
8.0%
11 3
12.0%
10 1
 
4.0%
9 4
16.0%
8 1
 
4.0%
7 2
 
8.0%
6 5
20.0%
5 1
 
4.0%

2021년 호수
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T17:39:49.495255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q15
median8
Q310
95-th percentile14.8
Maximum18
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.0722639
Coefficient of variation (CV)0.50903299
Kurtosis0.14708336
Mean8
Median Absolute Deviation (MAD)3
Skewness0.63577218
Sum200
Variance16.583333
MonotonicityNot monotonic
2023-12-12T17:39:49.808488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
5 6
24.0%
9 4
16.0%
10 3
12.0%
12 2
 
8.0%
4 2
 
8.0%
1 1
 
4.0%
15 1
 
4.0%
18 1
 
4.0%
7 1
 
4.0%
6 1
 
4.0%
Other values (3) 3
12.0%
ValueCountFrequency (%)
1 1
 
4.0%
3 1
 
4.0%
4 2
 
8.0%
5 6
24.0%
6 1
 
4.0%
7 1
 
4.0%
8 1
 
4.0%
9 4
16.0%
10 3
12.0%
12 2
 
8.0%
ValueCountFrequency (%)
18 1
 
4.0%
15 1
 
4.0%
14 1
 
4.0%
12 2
 
8.0%
10 3
12.0%
9 4
16.0%
8 1
 
4.0%
7 1
 
4.0%
6 1
 
4.0%
5 6
24.0%

2020년 호수
Real number (ℝ)

HIGH CORRELATION 

Distinct14
Distinct (%)56.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T17:39:49.956980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.2
Q15
median8
Q310
95-th percentile14.8
Maximum16
Range15
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.0722639
Coefficient of variation (CV)0.50903299
Kurtosis-0.55124611
Mean8
Median Absolute Deviation (MAD)3
Skewness0.27764735
Sum200
Variance16.583333
MonotonicityNot monotonic
2023-12-12T17:39:50.082536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
8 4
16.0%
10 3
12.0%
6 3
12.0%
5 2
8.0%
14 2
8.0%
11 2
8.0%
3 2
8.0%
16 1
 
4.0%
7 1
 
4.0%
4 1
 
4.0%
Other values (4) 4
16.0%
ValueCountFrequency (%)
1 1
 
4.0%
2 1
 
4.0%
3 2
8.0%
4 1
 
4.0%
5 2
8.0%
6 3
12.0%
7 1
 
4.0%
8 4
16.0%
9 1
 
4.0%
10 3
12.0%
ValueCountFrequency (%)
16 1
 
4.0%
15 1
 
4.0%
14 2
8.0%
11 2
8.0%
10 3
12.0%
9 1
 
4.0%
8 4
16.0%
7 1
 
4.0%
6 3
12.0%
5 2
8.0%

Interactions

2023-12-12T17:39:47.167499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:45.888269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.261433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.752070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:47.338748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:45.989283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.424105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.854496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:47.436316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.086448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.552279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.965259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:47.585194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.167993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:46.658695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:39:47.066067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:39:50.185854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역2023년 호수2022년 호수2021년 호수2020년 호수
지역1.0001.0001.0001.0001.000
2023년 호수1.0001.0000.7530.6990.610
2022년 호수1.0000.7531.0000.5700.909
2021년 호수1.0000.6990.5701.0000.291
2020년 호수1.0000.6100.9090.2911.000
2023-12-12T17:39:50.301380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023년 호수2022년 호수2021년 호수2020년 호수
2023년 호수1.0000.5810.6710.746
2022년 호수0.5811.0000.7650.684
2021년 호수0.6710.7651.0000.702
2020년 호수0.7460.6840.7021.000

Missing values

2023-12-12T17:39:47.723853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:39:47.848320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역2023년 호수2022년 호수2021년 호수2020년 호수
0강남구3415
1강동구612916
2강북구791510
3강서구5101010
4관악구6171214
5광진구6111811
6구로구49108
7금천구41198
8노원구46711
9도봉구4658
지역2023년 호수2022년 호수2021년 호수2020년 호수
15성동구1653
16성북구512106
17송파구591414
18양천구214129
19영등포구4733
20용산구47510
21은평구41198
22종로구1142
23중구1151
24중랑구79915