Overview

Dataset statistics

Number of variables34
Number of observations2844
Missing cells40
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory825.0 KiB
Average record size in memory297.0 B

Variable types

Numeric10
Categorical22
Text2

Dataset

Description경기주택도시공사 GH주택청약센터의 공급정보로써 계획연도, 전용면적, 공급호수, 공급연월, 입주예정연월 등의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15119391/fileData.do

Alerts

공급일련번호 has constant value ""Constant
전환공급호수 has constant value ""Constant
특별공급호수 has constant value ""Constant
사전청약공급호수 has constant value ""Constant
주거약자추첨여부 has constant value ""Constant
기타 유의사항 has constant value ""Constant
주거약자호수 is highly imbalanced (99.4%)Imbalance
우선추첨여부 is highly imbalanced (86.6%)Imbalance
일반추첨여부 is highly imbalanced (81.3%)Imbalance
자격요건내용 is highly imbalanced (92.0%)Imbalance
저층우선신청여부 is highly imbalanced (96.9%)Imbalance
공동주택여부 is highly imbalanced (99.5%)Imbalance
일반거주서열기준여부 is highly imbalanced (98.8%)Imbalance
우선거주서열기준여부 is highly imbalanced (95.3%)Imbalance
자격요건내용.1 is highly imbalanced (95.5%)Imbalance
모집횟수 has 37 (1.3%) missing valuesMissing
예비자수 is highly skewed (γ1 = 30.83541447)Skewed
선정인원수 has 2785 (97.9%) zerosZeros
일반공급호수 has 1468 (51.6%) zerosZeros
우선공급호수 has 2395 (84.2%) zerosZeros
예비자선정비율 has 2221 (78.1%) zerosZeros
예비자수 has 2254 (79.3%) zerosZeros
서류심사대상비율 has 2672 (94.0%) zerosZeros
서류심사대상번호 has 2287 (80.4%) zerosZeros

Reproduction

Analysis started2023-12-12 16:28:23.744061
Analysis finished2023-12-12 16:28:24.682494
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업코드
Real number (ℝ)

Distinct39
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2190262.4
Minimum211029
Maximum9999022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:24.751312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum211029
5-th percentile221109
Q12016016
median2017004
Q32017012
95-th percentile2017030
Maximum9999022
Range9787993
Interquartile range (IQR)996

Descriptive statistics

Standard deviation1515439.5
Coefficient of variation (CV)0.69189857
Kurtosis20.306299
Mean2190262.4
Median Absolute Deviation (MAD)12
Skewness4.3623068
Sum6.2291064 × 109
Variance2.2965567 × 1012
MonotonicityIncreasing
2023-12-13T01:28:24.922211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
2017004 428
15.0%
2017012 295
 
10.4%
2015023 257
 
9.0%
2017011 230
 
8.1%
2017003 205
 
7.2%
2017030 131
 
4.6%
2017016 116
 
4.1%
2016016 115
 
4.0%
2016018 99
 
3.5%
2017013 97
 
3.4%
Other values (29) 871
30.6%
ValueCountFrequency (%)
211029 57
2.0%
211109 32
1.1%
221029 50
1.8%
221109 34
1.2%
2009100 5
 
0.2%
2009101 8
 
0.3%
2009102 5
 
0.2%
2009103 7
 
0.2%
2009104 19
 
0.7%
2009105 5
 
0.2%
ValueCountFrequency (%)
9999022 90
3.2%
9999001 3
 
0.1%
4017006 32
 
1.1%
2017030 131
4.6%
2017029 1
 
< 0.1%
2017020 88
3.1%
2017018 1
 
< 0.1%
2017016 116
4.1%
2017014 48
 
1.7%
2017013 97
3.4%

계획연도
Categorical

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
2020
842 
2018
818 
2019
631 
2021
455 
2022
98 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2020 842
29.6%
2018 818
28.8%
2019 631
22.2%
2021 455
16.0%
2022 98
 
3.4%

Length

2023-12-13T01:28:25.060505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:25.168643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 842
29.6%
2018 818
28.8%
2019 631
22.2%
2021 455
16.0%
2022 98
 
3.4%
Distinct26
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
36
955 
21
327 
26
227 
18,26,,36,44
199 
18,26,36,44
138 
Other values (21)
998 

Length

Max length12
Median length2
Mean length4.0755977
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row72,84
2nd row72,84
3rd row72,84
4th row72,84
5th row72,84

Common Values

ValueCountFrequency (%)
36 955
33.6%
21 327
 
11.5%
26 227
 
8.0%
18,26,,36,44 199
 
7.0%
18,26,36,44 138
 
4.9%
26, 35 115
 
4.0%
34, 44 99
 
3.5%
29 93
 
3.3%
26,35 90
 
3.2%
72,84 89
 
3.1%
Other values (16) 512
18.0%

Length

2023-12-13T01:28:25.309667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
36 955
31.2%
26 342
 
11.2%
21 327
 
10.7%
18,26,,36,44 199
 
6.5%
34 149
 
4.9%
18,26,36,44 138
 
4.5%
44 124
 
4.1%
35 115
 
3.8%
29 93
 
3.0%
26,35 90
 
2.9%
Other values (15) 526
17.2%

공급호수
Real number (ℝ)

Distinct33
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean583.97328
Minimum5
Maximum2078
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:25.440839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile40
Q1130
median330
Q3865
95-th percentile2078
Maximum2078
Range2073
Interquartile range (IQR)735

Descriptive statistics

Standard deviation556.75112
Coefficient of variation (CV)0.95338459
Kurtosis1.2462554
Mean583.97328
Median Absolute Deviation (MAD)289
Skewness1.2994009
Sum1660820
Variance309971.81
MonotonicityNot monotonic
2023-12-13T01:28:25.591101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
865 403
14.2%
50 305
10.7%
330 295
10.4%
300 293
10.3%
970 257
9.0%
232 230
8.1%
2078 205
 
7.2%
800 131
 
4.6%
42 99
 
3.5%
651 89
 
3.1%
Other values (23) 537
18.9%
ValueCountFrequency (%)
5 5
 
0.2%
8 5
 
0.2%
10 5
 
0.2%
14 88
3.1%
20 3
 
0.1%
21 7
 
0.2%
40 50
1.8%
42 99
3.5%
48 8
 
0.3%
49 1
 
< 0.1%
ValueCountFrequency (%)
2078 205
7.2%
1650 47
 
1.7%
970 257
9.0%
961 84
 
3.0%
865 403
14.2%
800 131
 
4.6%
659 43
 
1.5%
651 89
 
3.1%
502 5
 
0.2%
500 1
 
< 0.1%

공급연월
Categorical

Distinct17
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
2020-06
525 
2019-12
403 
2019-07
354 
2018-08
352 
2019-09
228 
Other values (12)
982 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-10
2nd row2021-10
3rd row2021-10
4th row2021-10
5th row2021-10

Common Values

ValueCountFrequency (%)
2020-06 525
18.5%
2019-12 403
14.2%
2019-07 354
12.4%
2018-08 352
12.4%
2019-09 228
8.0%
2021-09 221
7.8%
2020-12 207
 
7.3%
2021-10 197
 
6.9%
2022-02 98
 
3.4%
2018-12 93
 
3.3%
Other values (7) 166
 
5.8%

Length

2023-12-13T01:28:25.736595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-06 525
18.5%
2019-12 403
14.2%
2019-07 354
12.4%
2018-08 352
12.4%
2019-09 228
8.0%
2021-09 221
7.8%
2020-12 207
 
7.3%
2021-10 197
 
6.9%
2022-02 98
 
3.4%
2018-12 93
 
3.3%
Other values (7) 166
 
5.8%
Distinct24
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
2021-05
525 
2020-12
428 
2019-11
257 
2022-04
205 
2020-02
190 
Other values (19)
1239 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row2022-09
2nd row2022-09
3rd row2022-09
4th row2022-09
5th row2022-09

Common Values

ValueCountFrequency (%)
2021-05 525
18.5%
2020-12 428
15.0%
2019-11 257
9.0%
2022-04 205
 
7.2%
2020-02 190
 
6.7%
2022-08 181
 
6.4%
2022-09 174
 
6.1%
2020-10 116
 
4.1%
2019-08 115
 
4.0%
2022-03 114
 
4.0%
Other values (14) 539
19.0%

Length

2023-12-13T01:28:25.870358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-05 525
18.5%
2020-12 428
15.0%
2019-11 257
9.0%
2022-04 205
 
7.2%
2020-02 190
 
6.7%
2022-08 181
 
6.4%
2022-09 174
 
6.1%
2020-10 116
 
4.1%
2019-08 115
 
4.0%
2022-03 114
 
4.0%
Other values (14) 539
19.0%

비고내용
Categorical

Distinct21
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
신규
499 
신규(전용 18~36㎡형)
403 
신규(전용 18~44㎡형)
336 
신규(전용 16~36㎡형)
295 
추가모집(전용 24~36㎡형)
257 
Other values (16)
1054 

Length

Max length16
Median length14
Mean length11.314698
Min length2

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row신규(전용 72~84㎡형)
2nd row신규(전용 72~84㎡형)
3rd row신규(전용 72~84㎡형)
4th row신규(전용 72~84㎡형)
5th row신규(전용 72~84㎡형)

Common Values

ValueCountFrequency (%)
신규 499
17.5%
신규(전용 18~36㎡형) 403
14.2%
신규(전용 18~44㎡형) 336
11.8%
신규(전용 16~36㎡형) 295
10.4%
추가모집(전용 24~36㎡형) 257
9.0%
신규(전용 16~21㎡형) 230
8.1%
신규(전용 16~26㎡형) 203
7.1%
해당 없음 122
 
4.3%
추가모집 97
 
3.4%
신규(전용 26~35㎡형) 90
 
3.2%
Other values (11) 312
11.0%

Length

2023-12-13T01:28:25.994865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
신규(전용 1849
36.3%
신규 499
 
9.8%
18~36㎡형 403
 
7.9%
18~44㎡형 336
 
6.6%
16~36㎡형 295
 
5.8%
추가모집(전용 257
 
5.0%
24~36㎡형 257
 
5.0%
16~21㎡형 230
 
4.5%
16~26㎡형 203
 
4.0%
해당 122
 
2.4%
Other values (15) 646
 
12.7%

모집횟수
Real number (ℝ)

MISSING 

Distinct19
Distinct (%)0.7%
Missing37
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean5.9821874
Minimum0
Maximum19
Zeros3
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:26.113809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median5
Q39
95-th percentile14
Maximum19
Range19
Interquartile range (IQR)7

Descriptive statistics

Standard deviation4.6682955
Coefficient of variation (CV)0.78036597
Kurtosis-0.33460734
Mean5.9821874
Median Absolute Deviation (MAD)3
Skewness0.78373235
Sum16792
Variance21.792983
MonotonicityNot monotonic
2023-12-13T01:28:26.220656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 561
19.7%
2 362
12.7%
3 221
 
7.8%
4 208
 
7.3%
5 202
 
7.1%
7 154
 
5.4%
8 148
 
5.2%
6 138
 
4.9%
9 133
 
4.7%
10 131
 
4.6%
Other values (9) 549
19.3%
ValueCountFrequency (%)
0 3
 
0.1%
1 561
19.7%
2 362
12.7%
3 221
 
7.8%
4 208
 
7.3%
5 202
 
7.1%
6 138
 
4.9%
7 154
 
5.4%
8 148
 
5.2%
9 133
 
4.7%
ValueCountFrequency (%)
19 20
 
0.7%
18 39
 
1.4%
17 37
 
1.3%
15 44
 
1.5%
14 89
3.1%
13 98
3.4%
12 105
3.7%
11 114
4.0%
10 131
4.6%
9 133
4.7%

순위
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
1804 
1
585 
2
258 
3
197 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 1804
63.4%
1 585
 
20.6%
2 258
 
9.1%
3 197
 
6.9%

Length

2023-12-13T01:28:26.342876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:26.425428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1804
63.4%
1 585
 
20.6%
2 258
 
9.1%
3 197
 
6.9%
Distinct185
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
2023-12-13T01:28:26.649597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length472
Median length1
Mean length23.083333
Min length1

Characters and Unicode

Total characters65649
Distinct characters277
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)2.3%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 1804
 
12.6%
또는 605
 
4.2%
사람 590
 
4.1%
소득 408
 
2.9%
거주지이거나 390
 
2.7%
근거지인 298
 
2.1%
않는 258
 
1.8%
해당되지 244
 
1.7%
현재 243
 
1.7%
해당 231
 
1.6%
Other values (528) 9227
64.5%
2023-12-13T01:28:27.039029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11543
 
17.6%
2438
 
3.7%
2201
 
3.4%
, 2026
 
3.1%
1905
 
2.9%
0 1808
 
2.8%
1593
 
2.4%
1390
 
2.1%
1344
 
2.0%
1160
 
1.8%
Other values (267) 38241
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45795
69.8%
Space Separator 11543
 
17.6%
Decimal Number 3016
 
4.6%
Other Punctuation 2503
 
3.8%
Close Punctuation 1329
 
2.0%
Open Punctuation 1326
 
2.0%
Math Symbol 48
 
0.1%
Other Number 31
 
< 0.1%
Other Symbol 30
 
< 0.1%
Dash Punctuation 14
 
< 0.1%
Other values (3) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2438
 
5.3%
2201
 
4.8%
1905
 
4.2%
1593
 
3.5%
1390
 
3.0%
1344
 
2.9%
1160
 
2.5%
1075
 
2.3%
901
 
2.0%
887
 
1.9%
Other values (230) 30901
67.5%
Decimal Number
ValueCountFrequency (%)
0 1808
59.9%
1 555
 
18.4%
2 253
 
8.4%
5 138
 
4.6%
6 100
 
3.3%
7 80
 
2.7%
3 38
 
1.3%
4 27
 
0.9%
8 17
 
0.6%
Other Symbol
ValueCountFrequency (%)
10
33.3%
8
26.7%
4
 
13.3%
4
 
13.3%
2
 
6.7%
2
 
6.7%
Other Number
ValueCountFrequency (%)
14
45.2%
7
22.6%
7
22.6%
2
 
6.5%
1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 2026
80.9%
. 285
 
11.4%
? 186
 
7.4%
: 6
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1127
84.8%
] 190
 
14.3%
12
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 1124
84.8%
[ 190
 
14.3%
12
 
0.9%
Math Symbol
ValueCountFrequency (%)
~ 45
93.8%
3
 
6.2%
Space Separator
ValueCountFrequency (%)
11543
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 12
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45823
69.8%
Common 19814
30.2%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2438
 
5.3%
2201
 
4.8%
1905
 
4.2%
1593
 
3.5%
1390
 
3.0%
1344
 
2.9%
1160
 
2.5%
1075
 
2.3%
901
 
2.0%
887
 
1.9%
Other values (235) 30929
67.5%
Common
ValueCountFrequency (%)
11543
58.3%
, 2026
 
10.2%
0 1808
 
9.1%
) 1127
 
5.7%
( 1124
 
5.7%
1 555
 
2.8%
. 285
 
1.4%
2 253
 
1.3%
[ 190
 
1.0%
] 190
 
1.0%
Other values (21) 713
 
3.6%
Latin
ValueCountFrequency (%)
A 12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45795
69.8%
ASCII 19764
30.1%
None 55
 
0.1%
Enclosed Alphanum 31
 
< 0.1%
CJK Compat 2
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11543
58.4%
, 2026
 
10.3%
0 1808
 
9.1%
) 1127
 
5.7%
( 1124
 
5.7%
1 555
 
2.8%
. 285
 
1.4%
2 253
 
1.3%
[ 190
 
1.0%
] 190
 
1.0%
Other values (11) 663
 
3.4%
Hangul
ValueCountFrequency (%)
2438
 
5.3%
2201
 
4.8%
1905
 
4.2%
1593
 
3.5%
1390
 
3.0%
1344
 
2.9%
1160
 
2.5%
1075
 
2.3%
901
 
2.0%
887
 
1.9%
Other values (230) 30901
67.5%
Enclosed Alphanum
ValueCountFrequency (%)
14
45.2%
7
22.6%
7
22.6%
2
 
6.5%
1
 
3.2%
None
ValueCountFrequency (%)
12
21.8%
12
21.8%
10
18.2%
8
14.5%
4
 
7.3%
4
 
7.3%
3
 
5.5%
2
 
3.6%
CJK Compat
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

선정인원수
Real number (ℝ)

ZEROS 

Distinct14
Distinct (%)0.5%
Missing3
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean0.53924674
Minimum0
Maximum120
Zeros2785
Zeros (%)97.9%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:27.194021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum120
Range120
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5.850902
Coefficient of variation (CV)10.850139
Kurtosis200.03982
Mean0.53924674
Median Absolute Deviation (MAD)0
Skewness13.561332
Sum1532
Variance34.233054
MonotonicityNot monotonic
2023-12-13T01:28:27.320809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
0 2785
97.9%
12 12
 
0.4%
65 10
 
0.4%
7 8
 
0.3%
2 8
 
0.3%
4 4
 
0.1%
5 2
 
0.1%
83 2
 
0.1%
47 2
 
0.1%
100 2
 
0.1%
Other values (4) 6
 
0.2%
(Missing) 3
 
0.1%
ValueCountFrequency (%)
0 2785
97.9%
2 8
 
0.3%
4 4
 
0.1%
5 2
 
0.1%
6 2
 
0.1%
7 8
 
0.3%
8 1
 
< 0.1%
12 12
 
0.4%
20 2
 
0.1%
47 2
 
0.1%
ValueCountFrequency (%)
120 1
 
< 0.1%
100 2
 
0.1%
83 2
 
0.1%
65 10
0.4%
47 2
 
0.1%
20 2
 
0.1%
12 12
0.4%
8 1
 
< 0.1%
7 8
0.3%
6 2
 
0.1%
Distinct166
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
2023-12-13T01:28:27.540852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length472
Median length1
Mean length20.769339
Min length1

Characters and Unicode

Total characters59068
Distinct characters260
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)1.7%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 1826
 
14.1%
또는 603
 
4.7%
사람 591
 
4.6%
소득 408
 
3.1%
거주지이거나 377
 
2.9%
근거지인 287
 
2.2%
않는 258
 
2.0%
해당되지 244
 
1.9%
214
 
1.7%
해당 209
 
1.6%
Other values (443) 7949
61.3%
2023-12-13T01:28:27.964681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10205
 
17.3%
2378
 
4.0%
2177
 
3.7%
, 1980
 
3.4%
0 1830
 
3.1%
1561
 
2.6%
1500
 
2.5%
1336
 
2.3%
1291
 
2.2%
) 1109
 
1.9%
Other values (250) 33701
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40924
69.3%
Space Separator 10205
 
17.3%
Decimal Number 2861
 
4.8%
Other Punctuation 2361
 
4.0%
Close Punctuation 1309
 
2.2%
Open Punctuation 1306
 
2.2%
Other Number 31
 
0.1%
Other Symbol 30
 
0.1%
Math Symbol 15
 
< 0.1%
Dash Punctuation 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2378
 
5.8%
2177
 
5.3%
1561
 
3.8%
1500
 
3.7%
1336
 
3.3%
1291
 
3.2%
1051
 
2.6%
865
 
2.1%
808
 
2.0%
747
 
1.8%
Other values (215) 27210
66.5%
Decimal Number
ValueCountFrequency (%)
0 1830
64.0%
1 487
 
17.0%
2 236
 
8.2%
5 114
 
4.0%
6 98
 
3.4%
7 58
 
2.0%
3 27
 
0.9%
8 6
 
0.2%
4 5
 
0.2%
Other Symbol
ValueCountFrequency (%)
10
33.3%
8
26.7%
4
 
13.3%
4
 
13.3%
2
 
6.7%
2
 
6.7%
Other Number
ValueCountFrequency (%)
14
45.2%
7
22.6%
7
22.6%
2
 
6.5%
1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 1980
83.9%
. 189
 
8.0%
? 186
 
7.9%
: 6
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 1109
84.7%
] 190
 
14.5%
10
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 1106
84.7%
[ 190
 
14.5%
10
 
0.8%
Math Symbol
ValueCountFrequency (%)
~ 12
80.0%
3
 
20.0%
Space Separator
ValueCountFrequency (%)
10205
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40952
69.3%
Common 18104
30.6%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2378
 
5.8%
2177
 
5.3%
1561
 
3.8%
1500
 
3.7%
1336
 
3.3%
1291
 
3.2%
1051
 
2.6%
865
 
2.1%
808
 
2.0%
747
 
1.8%
Other values (220) 27238
66.5%
Common
ValueCountFrequency (%)
10205
56.4%
, 1980
 
10.9%
0 1830
 
10.1%
) 1109
 
6.1%
( 1106
 
6.1%
1 487
 
2.7%
2 236
 
1.3%
[ 190
 
1.0%
] 190
 
1.0%
. 189
 
1.0%
Other values (19) 582
 
3.2%
Latin
ValueCountFrequency (%)
A 12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40924
69.3%
ASCII 18060
30.6%
None 51
 
0.1%
Enclosed Alphanum 31
 
0.1%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10205
56.5%
, 1980
 
11.0%
0 1830
 
10.1%
) 1109
 
6.1%
( 1106
 
6.1%
1 487
 
2.7%
2 236
 
1.3%
[ 190
 
1.1%
] 190
 
1.1%
. 189
 
1.0%
Other values (11) 538
 
3.0%
Hangul
ValueCountFrequency (%)
2378
 
5.8%
2177
 
5.3%
1561
 
3.8%
1500
 
3.7%
1336
 
3.3%
1291
 
3.2%
1051
 
2.6%
865
 
2.1%
808
 
2.0%
747
 
1.8%
Other values (215) 27210
66.5%
Enclosed Alphanum
ValueCountFrequency (%)
14
45.2%
7
22.6%
7
22.6%
2
 
6.5%
1
 
3.2%
None
ValueCountFrequency (%)
10
19.6%
10
19.6%
10
19.6%
8
15.7%
4
 
7.8%
4
 
7.8%
3
 
5.9%
2
 
3.9%
CJK Compat
ValueCountFrequency (%)
2
100.0%

공급일련번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2844 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2844
100.0%

Length

2023-12-13T01:28:28.088444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:28.191825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2844
100.0%

전환공급호수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2844 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2844
100.0%

Length

2023-12-13T01:28:28.295265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:28.384826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2844
100.0%

일반공급호수
Real number (ℝ)

ZEROS 

Distinct117
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.699719
Minimum0
Maximum628
Zeros1468
Zeros (%)51.6%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:28.520859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q38
95-th percentile90
Maximum628
Range628
Interquartile range (IQR)8

Descriptive statistics

Standard deviation45.031374
Coefficient of variation (CV)3.0634174
Kurtosis63.478955
Mean14.699719
Median Absolute Deviation (MAD)0
Skewness6.7976655
Sum41806
Variance2027.8247
MonotonicityNot monotonic
2023-12-13T01:28:28.695417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1468
51.6%
1 143
 
5.0%
2 136
 
4.8%
3 113
 
4.0%
5 68
 
2.4%
4 68
 
2.4%
8 53
 
1.9%
10 51
 
1.8%
6 50
 
1.8%
7 48
 
1.7%
Other values (107) 646
22.7%
ValueCountFrequency (%)
0 1468
51.6%
1 143
 
5.0%
2 136
 
4.8%
3 113
 
4.0%
4 68
 
2.4%
5 68
 
2.4%
6 50
 
1.8%
7 48
 
1.7%
8 53
 
1.9%
9 44
 
1.5%
ValueCountFrequency (%)
628 2
0.1%
553 2
0.1%
505 2
0.1%
428 2
0.1%
354 2
0.1%
349 2
0.1%
304 2
0.1%
297 2
0.1%
293 1
 
< 0.1%
291 3
0.1%

우선공급호수
Real number (ℝ)

ZEROS 

Distinct71
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3924051
Minimum0
Maximum621
Zeros2395
Zeros (%)84.2%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:28.836684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile19.85
Maximum621
Range621
Interquartile range (IQR)0

Descriptive statistics

Standard deviation25.044723
Coefficient of variation (CV)5.7018246
Kurtosis203.78859
Mean4.3924051
Median Absolute Deviation (MAD)0
Skewness12.022528
Sum12492
Variance627.23816
MonotonicityNot monotonic
2023-12-13T01:28:29.000562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2395
84.2%
3 51
 
1.8%
2 44
 
1.5%
1 42
 
1.5%
4 41
 
1.4%
20 20
 
0.7%
5 20
 
0.7%
6 15
 
0.5%
10 15
 
0.5%
8 14
 
0.5%
Other values (61) 187
 
6.6%
ValueCountFrequency (%)
0 2395
84.2%
1 42
 
1.5%
2 44
 
1.5%
3 51
 
1.8%
4 41
 
1.4%
5 20
 
0.7%
6 15
 
0.5%
7 13
 
0.5%
8 14
 
0.5%
9 12
 
0.4%
ValueCountFrequency (%)
621 1
 
< 0.1%
416 1
 
< 0.1%
339 2
0.1%
330 1
 
< 0.1%
238 1
 
< 0.1%
233 2
0.1%
223 1
 
< 0.1%
197 2
0.1%
180 3
0.1%
156 1
 
< 0.1%

예비자선정비율
Real number (ℝ)

ZEROS 

Distinct95
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean236.55028
Minimum0
Maximum9999
Zeros2221
Zeros (%)78.1%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:29.159630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile300
Maximum9999
Range9999
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1349.3327
Coefficient of variation (CV)5.704211
Kurtosis41.942261
Mean236.55028
Median Absolute Deviation (MAD)0
Skewness6.5762425
Sum672749
Variance1820698.8
MonotonicityNot monotonic
2023-12-13T01:28:29.303094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2221
78.1%
100 337
 
11.8%
40 36
 
1.3%
9000 35
 
1.2%
9999 23
 
0.8%
300 17
 
0.6%
400 9
 
0.3%
200 8
 
0.3%
50 7
 
0.2%
150 6
 
0.2%
Other values (85) 145
 
5.1%
ValueCountFrequency (%)
0 2221
78.1%
10 2
 
0.1%
25 2
 
0.1%
30 4
 
0.1%
34 1
 
< 0.1%
35 4
 
0.1%
39 1
 
< 0.1%
40 36
 
1.3%
44 1
 
< 0.1%
45 1
 
< 0.1%
ValueCountFrequency (%)
9999 23
0.8%
9100 1
 
< 0.1%
9000 35
1.2%
5100 1
 
< 0.1%
5000 1
 
< 0.1%
2450 1
 
< 0.1%
2400 1
 
< 0.1%
2200 1
 
< 0.1%
2000 2
 
0.1%
1800 1
 
< 0.1%

예비자수
Real number (ℝ)

SKEWED  ZEROS 

Distinct139
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.028833
Minimum0
Maximum27360
Zeros2254
Zeros (%)79.3%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:29.430723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile71
Maximum27360
Range27360
Interquartile range (IQR)0

Descriptive statistics

Standard deviation656.09591
Coefficient of variation (CV)13.660459
Kurtosis1148.2117
Mean48.028833
Median Absolute Deviation (MAD)0
Skewness30.835414
Sum136594
Variance430461.85
MonotonicityNot monotonic
2023-12-13T01:28:29.579357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2254
79.3%
2 44
 
1.5%
4 36
 
1.3%
3 30
 
1.1%
5 24
 
0.8%
8 22
 
0.8%
1 22
 
0.8%
10 22
 
0.8%
20 20
 
0.7%
9 20
 
0.7%
Other values (129) 350
 
12.3%
ValueCountFrequency (%)
0 2254
79.3%
1 22
 
0.8%
2 44
 
1.5%
3 30
 
1.1%
4 36
 
1.3%
5 24
 
0.8%
6 17
 
0.6%
7 14
 
0.5%
8 22
 
0.8%
9 20
 
0.7%
ValueCountFrequency (%)
27360 1
< 0.1%
12870 1
< 0.1%
11700 1
< 0.1%
6400 1
< 0.1%
5580 1
< 0.1%
3780 1
< 0.1%
3600 1
< 0.1%
3240 1
< 0.1%
2700 1
< 0.1%
2520 2
0.1%

서류심사대상비율
Real number (ℝ)

ZEROS 

Distinct19
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1914.7845
Minimum0
Maximum99999
Zeros2672
Zeros (%)94.0%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:29.709186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile368
Maximum99999
Range99999
Interquartile range (IQR)0

Descriptive statistics

Standard deviation12838.756
Coefficient of variation (CV)6.7050661
Kurtosis53.504469
Mean1914.7845
Median Absolute Deviation (MAD)0
Skewness7.3975868
Sum5445647
Variance1.6483366 × 108
MonotonicityNot monotonic
2023-12-13T01:28:29.833093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 2672
94.0%
99999 47
 
1.7%
9999 36
 
1.3%
300 14
 
0.5%
9990 14
 
0.5%
1000 13
 
0.5%
200 13
 
0.5%
9000 12
 
0.4%
500 9
 
0.3%
8000 5
 
0.2%
Other values (9) 9
 
0.3%
ValueCountFrequency (%)
0 2672
94.0%
141 1
 
< 0.1%
150 1
 
< 0.1%
200 13
 
0.5%
300 14
 
0.5%
380 1
 
< 0.1%
500 9
 
0.3%
600 1
 
< 0.1%
999 1
 
< 0.1%
1000 13
 
0.5%
ValueCountFrequency (%)
99999 47
1.7%
50000 1
 
< 0.1%
15000 1
 
< 0.1%
9999 36
1.3%
9990 14
 
0.5%
9000 12
 
0.4%
8000 5
 
0.2%
5000 1
 
< 0.1%
1300 1
 
< 0.1%
1000 13
 
0.5%

서류심사대상번호
Real number (ℝ)

ZEROS 

Distinct179
Distinct (%)6.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean343.47609
Minimum0
Maximum56520
Zeros2287
Zeros (%)80.4%
Negative0
Negative (%)0.0%
Memory size25.1 KiB
2023-12-13T01:28:29.961395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1000
Maximum56520
Range56520
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2416.5718
Coefficient of variation (CV)7.0356333
Kurtosis239.52204
Mean343.47609
Median Absolute Deviation (MAD)0
Skewness13.989186
Sum976846
Variance5839819.4
MonotonicityNot monotonic
2023-12-13T01:28:30.119357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2287
80.4%
180 21
 
0.7%
90 20
 
0.7%
24 17
 
0.6%
300 17
 
0.6%
100 14
 
0.5%
6 14
 
0.5%
30 13
 
0.5%
200 13
 
0.5%
1000 12
 
0.4%
Other values (169) 416
 
14.6%
ValueCountFrequency (%)
0 2287
80.4%
1 3
 
0.1%
3 4
 
0.1%
4 1
 
< 0.1%
5 1
 
< 0.1%
6 14
 
0.5%
9 1
 
< 0.1%
10 3
 
0.1%
12 10
 
0.4%
15 11
 
0.4%
ValueCountFrequency (%)
56520 1
< 0.1%
44000 1
< 0.1%
43000 1
< 0.1%
38000 2
0.1%
27360 1
< 0.1%
26190 1
< 0.1%
26100 1
< 0.1%
26000 1
< 0.1%
25000 1
< 0.1%
15000 1
< 0.1%

주거약자호수
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2842 
36
 
1
110
 
1

Length

Max length3
Median length1
Mean length1.0010549
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2842
99.9%
36 1
 
< 0.1%
110 1
 
< 0.1%

Length

2023-12-13T01:28:30.296967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:30.434776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2842
99.9%
36 1
 
< 0.1%
110 1
 
< 0.1%

특별공급호수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2844 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2844
100.0%

Length

2023-12-13T01:28:30.554106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:30.659340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2844
100.0%

사전청약공급호수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2844 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2844
100.0%

Length

2023-12-13T01:28:30.774824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:31.244963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2844
100.0%

우선추첨여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2791 
1
 
53

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2791
98.1%
1 53
 
1.9%

Length

2023-12-13T01:28:31.365122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:31.488292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2791
98.1%
1 53
 
1.9%

일반추첨여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2763 
1
 
81

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2763
97.2%
1 81
 
2.8%

Length

2023-12-13T01:28:31.593547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:31.685116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2763
97.2%
1 81
 
2.8%

자격요건내용
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2816 
무주택세대구성원으로서 월평균 소득이 전년도 도시근로자 가구당 월평균소득의 70%이하인 자
 
28

Length

Max length49
Median length1
Mean length1.4725738
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2816
99.0%
무주택세대구성원으로서 월평균 소득이 전년도 도시근로자 가구당 월평균소득의 70%이하인 자 28
 
1.0%

Length

2023-12-13T01:28:31.788719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:31.895368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2816
91.8%
무주택세대구성원으로서 28
 
0.9%
월평균 28
 
0.9%
소득이 28
 
0.9%
전년도 28
 
0.9%
도시근로자 28
 
0.9%
가구당 28
 
0.9%
월평균소득의 28
 
0.9%
70%이하인 28
 
0.9%
28
 
0.9%

저층우선신청여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2835 
1
 
9

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2835
99.7%
1 9
 
0.3%

Length

2023-12-13T01:28:32.011688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:32.112954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2835
99.7%
1 9
 
0.3%

사용여부
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
1956 
1
888 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 1956
68.8%
1 888
31.2%

Length

2023-12-13T01:28:32.199100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:32.285445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1956
68.8%
1 888
31.2%

주거약자추첨여부
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2844 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2844
100.0%

Length

2023-12-13T01:28:32.383604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:32.472820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2844
100.0%

공동주택여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2843 
1
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2843
> 99.9%
1 1
 
< 0.1%

Length

2023-12-13T01:28:32.559229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:32.660561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2843
> 99.9%
1 1
 
< 0.1%

일반거주서열기준여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2841 
1
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2841
99.9%
1 3
 
0.1%

Length

2023-12-13T01:28:32.759473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:32.853230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2841
99.9%
1 3
 
0.1%

우선거주서열기준여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2829 
1
 
15

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2829
99.5%
1 15
 
0.5%

Length

2023-12-13T01:28:32.958337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:33.047287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2829
99.5%
1 15
 
0.5%

자격요건내용.1
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
0
2805 
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일 것
 
13
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하이고 청년계층 본인의 월평균소득은 전년도 도시근로자 가구당 월평균소득의 80퍼센트 이하 일 것
 
13
신청자 부모와 본인의 월평균소득 합계가 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일 것
 
5
무주택세대구성원으로서 공고일 현재 주거급여수급자일 것
 
3
Other values (3)
 
5

Length

Max length96
Median length1
Mean length1.8808017
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2805
98.6%
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일 것 13
 
0.5%
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하이고 청년계층 본인의 월평균소득은 전년도 도시근로자 가구당 월평균소득의 80퍼센트 이하 일 것 13
 
0.5%
신청자 부모와 본인의 월평균소득 합계가 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일 것 5
 
0.2%
무주택세대구성원으로서 공고일 현재 주거급여수급자일 것 3
 
0.1%
해당 세대(예비신혼부부는 혼인으로 구성될 세대)의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일 것 2
 
0.1%
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하일것 2
 
0.1%
해당 세대의 월평균소득이 전년도 도시근로자 가구당 월평균소득의 100퍼센트 이하이고 청년계층 본인의 월평균소득은 전년도 도시근로자 가구당 월평균소득의 80퍼센트 이하 일것 1
 
< 0.1%

Length

2023-12-13T01:28:33.149054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:33.257784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2805
84.2%
전년도 50
 
1.5%
도시근로자 50
 
1.5%
가구당 50
 
1.5%
월평균소득의 50
 
1.5%
100퍼센트 36
 
1.1%
36
 
1.1%
월평균소득이 31
 
0.9%
해당 31
 
0.9%
세대의 29
 
0.9%
Other values (22) 165
 
5.0%

기타 유의사항
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.3 KiB
공란은 데이터 미존재
2844 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공란은 데이터 미존재
2nd row공란은 데이터 미존재
3rd row공란은 데이터 미존재
4th row공란은 데이터 미존재
5th row공란은 데이터 미존재

Common Values

ValueCountFrequency (%)
공란은 데이터 미존재 2844
100.0%

Length

2023-12-13T01:28:33.387198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:28:33.474106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공란은 2844
33.3%
데이터 2844
33.3%
미존재 2844
33.3%

Sample

사업코드계획연도전용면적내용공급호수공급연월입주예정년월비고내용모집횟수순위순위기준내용선정인원수표시내용공급일련번호전환공급호수일반공급호수우선공급호수예비자선정비율예비자수서류심사대상비율서류심사대상번호주거약자호수특별공급호수사전청약공급호수우선추첨여부일반추첨여부자격요건내용저층우선신청여부사용여부주거약자추첨여부공동주택여부일반거주서열기준여부우선거주서열기준여부자격요건내용.1기타 유의사항
0211029202172,846512021-102022-09신규(전용 72~84㎡형)10000008723300000000000000000공란은 데이터 미존재
1211029202172,846512021-102022-09신규(전용 72~84㎡형)1000000308000000000000000000공란은 데이터 미존재
2211029202172,846512021-102022-09신규(전용 72~84㎡형)100000072000000000000000000공란은 데이터 미존재
3211029202172,846512021-102022-09신규(전용 72~84㎡형)100000072200000000000000000공란은 데이터 미존재
4211029202172,846512021-102022-09신규(전용 72~84㎡형)1000000143900000000000000000공란은 데이터 미존재
5211029202172,846512021-102022-09신규(전용 72~84㎡형)1000000328000000000000000000공란은 데이터 미존재
6211029202172,846512021-102022-09신규(전용 72~84㎡형)10000000500000000000100000공란은 데이터 미존재
7211029202172,846512021-102022-09신규(전용 72~84㎡형)100000001600000001000100000공란은 데이터 미존재
8211029202172,846512021-102022-09신규(전용 72~84㎡형)100000032000000000100100000공란은 데이터 미존재
9211029202172,846512021-102022-09신규(전용 72~84㎡형)100000001000000000000100000공란은 데이터 미존재
사업코드계획연도전용면적내용공급호수공급연월입주예정년월비고내용모집횟수순위순위기준내용선정인원수표시내용공급일련번호전환공급호수일반공급호수우선공급호수예비자선정비율예비자수서류심사대상비율서류심사대상번호주거약자호수특별공급호수사전청약공급호수우선추첨여부일반추첨여부자격요건내용저층우선신청여부사용여부주거약자추첨여부공동주택여부일반거주서열기준여부우선거주서열기준여부자격요건내용.1기타 유의사항
28349999022202126,353002021-092022-03신규(전용 26~35㎡형)400000030360119999930000000000000000공란은 데이터 미존재
28359999022202126,353002021-092022-03신규(전용 26~35㎡형)4000000101000109999910000000000000000공란은 데이터 미존재
28369999022202126,353002021-092022-03신규(전용 26~35㎡형)400000012000000000000100000공란은 데이터 미존재
28379999022202126,353002021-092022-03신규(전용 26~35㎡형)400000037100000000000100000공란은 데이터 미존재
28389999022202126,353002021-092022-03신규(전용 26~35㎡형)40000005000000000000100000공란은 데이터 미존재
28399999022202126,353002021-092022-03신규(전용 26~35㎡형)40000003200000000000100000공란은 데이터 미존재
28409999022202126,353002021-092022-03신규(전용 26~35㎡형)40000003000000000000100000공란은 데이터 미존재
28419999022202126,353002021-092022-03신규(전용 26~35㎡형)40000000000000000000000000공란은 데이터 미존재
28429999022202126,353002021-092022-03신규(전용 26~35㎡형)40000001000000000000100000공란은 데이터 미존재
28439999022202126,353002021-092022-03신규(전용 26~35㎡형)<NA>0000000000000000000000000공란은 데이터 미존재