Overview

Dataset statistics

Number of variables21
Number of observations10000
Missing cells29751
Missing cells (%)14.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 MiB
Average record size in memory182.0 B

Variable types

Categorical4
Text9
Numeric6
DateTime2

Dataset

Description한국토지주택공사의 주택별현황정보(지역본부, 단지지방자치단체코드, 단지명, 시군구명, 주소, 우편번호, 면적, 호수 등)를 제공합니다.
Author한국토지주택공사
URLhttps://www.data.go.kr/data/15127116/fileData.do

Alerts

(법정동)지번-산 is highly imbalanced (59.0%)Imbalance
(법정동)법정동일련번호 has 1540 (15.4%) missing valuesMissing
(법정동)주소 has 1530 (15.3%) missing valuesMissing
(법정동)본번 has 2042 (20.4%) missing valuesMissing
(법정동)부번 has 7518 (75.2%) missing valuesMissing
(법정동)상세주소 has 9764 (97.6%) missing valuesMissing
(도로명)우편번호 has 627 (6.3%) missing valuesMissing
(도로명)주소 has 571 (5.7%) missing valuesMissing
(도로명)상세주소 has 4887 (48.9%) missing valuesMissing
입주지정시작일자 has 574 (5.7%) missing valuesMissing
입주지정종료일자 has 587 (5.9%) missing valuesMissing
전용면적 is highly skewed (γ1 = 22.9798619)Skewed
주거공용면적 is highly skewed (γ1 = 30.50980914)Skewed
공급면적 is highly skewed (γ1 = 28.03640123)Skewed
전용면적 has 114 (1.1%) zerosZeros
주거공용면적 has 1631 (16.3%) zerosZeros
공급면적 has 101 (1.0%) zerosZeros

Reproduction

Analysis started2024-04-22 00:13:18.740733
Analysis finished2024-04-22 00:13:20.212172
Duration1.47 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역본부
Categorical

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기남부지역본부
1859 
경기북부지역본부
1011 
대전충남지역본부
779 
대구경북지역본부
760 
광주전남지역본부
757 
Other values (22)
4834 

Length

Max length8
Median length8
Mean length6.93
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원지역본부
2nd row부산울산지역본부
3rd row경기북부지역본부
4th row광주전남지역본부
5th row경기북부지역본부

Common Values

ValueCountFrequency (%)
경기남부지역본부 1859
18.6%
경기북부지역본부 1011
 
10.1%
대전충남지역본부 779
 
7.8%
대구경북지역본부 760
 
7.6%
광주전남지역본부 757
 
7.6%
경남지역본부 547
 
5.5%
인천지역본부 420
 
4.2%
전북지역본부 413
 
4.1%
충북지역본부 411
 
4.1%
강원지역본부 383
 
3.8%
Other values (17) 2660
26.6%

Length

2024-04-22T09:13:20.283818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기남부지역본부 1859
18.6%
경기북부지역본부 1011
 
10.1%
대전충남지역본부 779
 
7.8%
대구경북지역본부 760
 
7.6%
광주전남지역본부 757
 
7.6%
경남지역본부 547
 
5.5%
인천지역본부 420
 
4.2%
전북지역본부 413
 
4.1%
충북지역본부 411
 
4.1%
강원지역본부 383
 
3.8%
Other values (17) 2660
26.6%
Distinct3121
Distinct (%)31.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T09:13:20.615601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters60000
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique766 ?
Unique (%)7.7%

Sample

1st rowC01220
2nd rowC00856
3rd rowC01262
4th rowD00423
5th rowC00785
ValueCountFrequency (%)
c02773 49
 
0.5%
c00321 41
 
0.4%
c02421 37
 
0.4%
c00888 37
 
0.4%
c00855 31
 
0.3%
c00051 30
 
0.3%
c02262 27
 
0.3%
c00218 26
 
0.3%
c01991 22
 
0.2%
c00180 22
 
0.2%
Other values (3111) 9678
96.8%
2024-04-22T09:13:21.109037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 17681
29.5%
C 8869
14.8%
1 6254
 
10.4%
2 5588
 
9.3%
7 3074
 
5.1%
5 3009
 
5.0%
3 2993
 
5.0%
6 2962
 
4.9%
4 2910
 
4.9%
8 2893
 
4.8%
Other values (2) 3767
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50000
83.3%
Uppercase Letter 10000
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 17681
35.4%
1 6254
 
12.5%
2 5588
 
11.2%
7 3074
 
6.1%
5 3009
 
6.0%
3 2993
 
6.0%
6 2962
 
5.9%
4 2910
 
5.8%
8 2893
 
5.8%
9 2636
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
C 8869
88.7%
D 1131
 
11.3%

Most occurring scripts

ValueCountFrequency (%)
Common 50000
83.3%
Latin 10000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 17681
35.4%
1 6254
 
12.5%
2 5588
 
11.2%
7 3074
 
6.1%
5 3009
 
6.0%
3 2993
 
6.0%
6 2962
 
5.9%
4 2910
 
5.8%
8 2893
 
5.8%
9 2636
 
5.3%
Latin
ValueCountFrequency (%)
C 8869
88.7%
D 1131
 
11.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 60000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17681
29.5%
C 8869
14.8%
1 6254
 
10.4%
2 5588
 
9.3%
7 3074
 
5.1%
5 3009
 
5.0%
3 2993
 
5.0%
6 2962
 
4.9%
4 2910
 
4.9%
8 2893
 
4.8%
Other values (2) 3767
 
6.3%

관리기관
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공사
8141 
공단
1859 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 8141
81.4%
공단 1859
 
18.6%

Length

2024-04-22T09:13:21.258955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:13:21.349037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 8141
81.4%
공단 1859
 
18.6%
Distinct3072
Distinct (%)30.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T09:13:21.578452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length9.6489
Min length2

Characters and Unicode

Total characters96489
Distinct characters485
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique745 ?
Unique (%)7.4%

Sample

1st row원주개운2
2nd row부산망미
3rd row의정부녹양 A1블럭(의정부녹양1)
4th row광주광역시 북구 양산동 93-18(오션빌)
5th row동두천송내1
ValueCountFrequency (%)
행복주택 460
 
3.0%
1단지 171
 
1.1%
화성동탄2 166
 
1.1%
광주광역시 141
 
0.9%
2단지 138
 
0.9%
북구 122
 
0.8%
신혼희망타운 96
 
0.6%
국민임대 95
 
0.6%
아름들이(경북 83
 
0.5%
성남판교 76
 
0.5%
Other values (3413) 14016
90.1%
2024-04-22T09:13:21.987166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5564
 
5.8%
1 3360
 
3.5%
2 3062
 
3.2%
( 2972
 
3.1%
) 2971
 
3.1%
2657
 
2.8%
1992
 
2.1%
1916
 
2.0%
1702
 
1.8%
3 1656
 
1.7%
Other values (475) 68637
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66004
68.4%
Decimal Number 12909
 
13.4%
Space Separator 5564
 
5.8%
Uppercase Letter 4232
 
4.4%
Open Punctuation 2977
 
3.1%
Close Punctuation 2976
 
3.1%
Dash Punctuation 1613
 
1.7%
Other Punctuation 119
 
0.1%
Lowercase Letter 74
 
0.1%
Connector Punctuation 10
 
< 0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2657
 
4.0%
1992
 
3.0%
1916
 
2.9%
1702
 
2.6%
1524
 
2.3%
1524
 
2.3%
1444
 
2.2%
1114
 
1.7%
1084
 
1.6%
1079
 
1.6%
Other values (432) 49968
75.7%
Uppercase Letter
ValueCountFrequency (%)
A 1369
32.3%
B 1111
26.3%
L 950
22.4%
H 287
 
6.8%
S 131
 
3.1%
F 98
 
2.3%
N 98
 
2.3%
M 70
 
1.7%
C 41
 
1.0%
G 30
 
0.7%
Other values (5) 47
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 3360
26.0%
2 3062
23.7%
3 1656
12.8%
4 977
 
7.6%
5 912
 
7.1%
0 847
 
6.6%
6 728
 
5.6%
7 595
 
4.6%
9 390
 
3.0%
8 382
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
b 28
37.8%
a 20
27.0%
e 12
16.2%
c 8
 
10.8%
l 6
 
8.1%
Other Punctuation
ValueCountFrequency (%)
· 70
58.8%
/ 25
 
21.0%
. 24
 
20.2%
Open Punctuation
ValueCountFrequency (%)
( 2972
99.8%
[ 5
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 2971
99.8%
] 5
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 9
90.0%
~ 1
 
10.0%
Space Separator
ValueCountFrequency (%)
5564
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1613
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66004
68.4%
Common 26179
 
27.1%
Latin 4306
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2657
 
4.0%
1992
 
3.0%
1916
 
2.9%
1702
 
2.6%
1524
 
2.3%
1524
 
2.3%
1444
 
2.2%
1114
 
1.7%
1084
 
1.6%
1079
 
1.6%
Other values (432) 49968
75.7%
Common
ValueCountFrequency (%)
5564
21.3%
1 3360
12.8%
2 3062
11.7%
( 2972
11.4%
) 2971
11.3%
3 1656
 
6.3%
- 1613
 
6.2%
4 977
 
3.7%
5 912
 
3.5%
0 847
 
3.2%
Other values (13) 2245
8.6%
Latin
ValueCountFrequency (%)
A 1369
31.8%
B 1111
25.8%
L 950
22.1%
H 287
 
6.7%
S 131
 
3.0%
F 98
 
2.3%
N 98
 
2.3%
M 70
 
1.6%
C 41
 
1.0%
G 30
 
0.7%
Other values (10) 121
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66004
68.4%
ASCII 30414
31.5%
None 70
 
0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5564
18.3%
1 3360
11.0%
2 3062
10.1%
( 2972
9.8%
) 2971
9.8%
3 1656
 
5.4%
- 1613
 
5.3%
A 1369
 
4.5%
B 1111
 
3.7%
4 977
 
3.2%
Other values (31) 5759
18.9%
Hangul
ValueCountFrequency (%)
2657
 
4.0%
1992
 
3.0%
1916
 
2.9%
1702
 
2.6%
1524
 
2.3%
1524
 
2.3%
1444
 
2.2%
1114
 
1.7%
1084
 
1.6%
1079
 
1.6%
Other values (432) 49968
75.7%
None
ValueCountFrequency (%)
· 70
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct231
Distinct (%)2.3%
Missing99
Missing (%)1.0%
Memory size156.2 KiB
2024-04-22T09:13:22.288261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length2.9461671
Min length2

Characters and Unicode

Total characters29170
Distinct characters149
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row원주시
2nd row연제구
3rd row의정부시
4th row북구
5th row부천시
ValueCountFrequency (%)
북구 436
 
4.4%
화성시 382
 
3.9%
동구 363
 
3.7%
성남시 322
 
3.3%
서구 277
 
2.8%
수원시 233
 
2.4%
청주시 188
 
1.9%
파주시 185
 
1.9%
의정부시 183
 
1.8%
전주시 174
 
1.8%
Other values (221) 7158
72.3%
2024-04-22T09:13:23.073569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6287
21.6%
2972
 
10.2%
1406
 
4.8%
1088
 
3.7%
1080
 
3.7%
1050
 
3.6%
919
 
3.2%
795
 
2.7%
764
 
2.6%
695
 
2.4%
Other values (139) 12114
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29170
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6287
21.6%
2972
 
10.2%
1406
 
4.8%
1088
 
3.7%
1080
 
3.7%
1050
 
3.6%
919
 
3.2%
795
 
2.7%
764
 
2.6%
695
 
2.4%
Other values (139) 12114
41.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29170
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6287
21.6%
2972
 
10.2%
1406
 
4.8%
1088
 
3.7%
1080
 
3.7%
1050
 
3.6%
919
 
3.2%
795
 
2.7%
764
 
2.6%
695
 
2.4%
Other values (139) 12114
41.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29170
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6287
21.6%
2972
 
10.2%
1406
 
4.8%
1088
 
3.7%
1080
 
3.7%
1050
 
3.6%
919
 
3.2%
795
 
2.7%
764
 
2.6%
695
 
2.4%
Other values (139) 12114
41.5%

(법정동)법정동일련번호
Real number (ℝ)

MISSING 

Distinct1223
Distinct (%)14.5%
Missing1540
Missing (%)15.4%
Infinite0
Infinite (%)0.0%
Mean3.8894994 × 109
Minimum1.1140157 × 109
Maximum5.280025 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:23.244686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1140157 × 109
5-th percentile2.6230102 × 109
Q13.0110109 × 109
median4.1410104 × 109
Q34.6110164 × 109
95-th percentile5.1770259 × 109
Maximum5.280025 × 109
Range4.1660093 × 109
Interquartile range (IQR)1.6000055 × 109

Descriptive statistics

Standard deviation9.8042531 × 108
Coefficient of variation (CV)0.25206979
Kurtosis0.62053834
Mean3.8894994 × 109
Median Absolute Deviation (MAD)5.7201493 × 108
Skewness-0.98789904
Sum3.2905165 × 1013
Variance9.6123379 × 1017
MonotonicityNot monotonic
2024-04-22T09:13:23.384825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4159013100 116
 
1.2%
2820010300 71
 
0.7%
1130510200 63
 
0.6%
4136011100 57
 
0.6%
1168011200 54
 
0.5%
2917012800 53
 
0.5%
4145010900 53
 
0.5%
4163011400 51
 
0.5%
2623010900 49
 
0.5%
4148011700 47
 
0.5%
Other values (1213) 7846
78.5%
(Missing) 1540
 
15.4%
ValueCountFrequency (%)
1114015700 9
 
0.1%
1117012500 7
 
0.1%
1121510300 2
 
< 0.1%
1123010900 3
 
< 0.1%
1126010100 1
 
< 0.1%
1126010500 11
 
0.1%
1130510200 63
0.6%
1132010700 7
 
0.1%
1132010800 2
 
< 0.1%
1135010200 14
 
0.1%
ValueCountFrequency (%)
5280025024 3
< 0.1%
5280025022 1
 
< 0.1%
5279033021 3
< 0.1%
5279031022 2
< 0.1%
5279025036 1
 
< 0.1%
5279025023 1
 
< 0.1%
5279025021 4
< 0.1%
5277033028 1
 
< 0.1%
5277025026 1
 
< 0.1%
5274025021 3
< 0.1%

(법정동)주소
Text

MISSING 

Distinct1467
Distinct (%)17.3%
Missing1530
Missing (%)15.3%
Memory size156.2 KiB
2024-04-22T09:13:23.687034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length12.564109
Min length5

Characters and Unicode

Total characters106418
Distinct characters293
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)2.8%

Sample

1st row강원특별자치도 원주시 단구동
2nd row부산 연제구 연산동
3rd row경기 의정부시 녹양동
4th row광주광역시 북구 양산동
5th row경기 부천시 소사구 송내동
ValueCountFrequency (%)
경기도 1952
 
7.1%
경기 780
 
2.8%
북구 450
 
1.6%
강원특별자치도 445
 
1.6%
전북특별자치도 389
 
1.4%
화성시 371
 
1.4%
광주광역시 324
 
1.2%
동구 320
 
1.2%
충청남도 314
 
1.1%
경상남도 304
 
1.1%
Other values (1468) 21797
79.4%
2024-04-22T09:13:24.161189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18976
 
17.8%
7986
 
7.5%
7001
 
6.6%
4513
 
4.2%
4306
 
4.0%
3736
 
3.5%
2910
 
2.7%
2494
 
2.3%
2108
 
2.0%
2022
 
1.9%
Other values (283) 50366
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87240
82.0%
Space Separator 18976
 
17.8%
Decimal Number 199
 
0.2%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7986
 
9.2%
7001
 
8.0%
4513
 
5.2%
4306
 
4.9%
3736
 
4.3%
2910
 
3.3%
2494
 
2.9%
2108
 
2.4%
2022
 
2.3%
1914
 
2.2%
Other values (274) 48250
55.3%
Decimal Number
ValueCountFrequency (%)
1 84
42.2%
2 63
31.7%
3 29
 
14.6%
9 10
 
5.0%
6 7
 
3.5%
5 5
 
2.5%
7 1
 
0.5%
Space Separator
ValueCountFrequency (%)
18976
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87240
82.0%
Common 19178
 
18.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7986
 
9.2%
7001
 
8.0%
4513
 
5.2%
4306
 
4.9%
3736
 
4.3%
2910
 
3.3%
2494
 
2.9%
2108
 
2.4%
2022
 
2.3%
1914
 
2.2%
Other values (274) 48250
55.3%
Common
ValueCountFrequency (%)
18976
98.9%
1 84
 
0.4%
2 63
 
0.3%
3 29
 
0.2%
9 10
 
0.1%
6 7
 
< 0.1%
5 5
 
< 0.1%
- 3
 
< 0.1%
7 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87240
82.0%
ASCII 19178
 
18.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18976
98.9%
1 84
 
0.4%
2 63
 
0.3%
3 29
 
0.2%
9 10
 
0.1%
6 7
 
< 0.1%
5 5
 
< 0.1%
- 3
 
< 0.1%
7 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
7986
 
9.2%
7001
 
8.0%
4513
 
5.2%
4306
 
4.9%
3736
 
4.3%
2910
 
3.3%
2494
 
2.9%
2108
 
2.4%
2022
 
2.3%
1914
 
2.2%
Other values (274) 48250
55.3%

(법정동)지번-산
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반번지
6721 
<NA>
1569 
산번지
1382 
블럭
 
197
해당없음
 
70
Other values (6)
 
61

Length

Max length21
Median length4
Mean length3.8344
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row산번지
2nd row산번지
3rd row일반번지
4th row일반번지
5th row산번지

Common Values

ValueCountFrequency (%)
일반번지 6721
67.2%
<NA> 1569
 
15.7%
산번지 1382
 
13.8%
블럭 197
 
2.0%
해당없음 70
 
0.7%
무번지 28
 
0.3%
단지 16
 
0.2%
구획정리1(롯트가 세분화되지않은 경우) 9
 
0.1%
3
 
< 0.1%
구획정리2(롯트가 세분화된 경우) 3
 
< 0.1%

Length

2024-04-22T09:13:24.321390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반번지 6721
67.0%
na 1569
 
15.7%
산번지 1382
 
13.8%
블럭 197
 
2.0%
해당없음 70
 
0.7%
무번지 28
 
0.3%
단지 16
 
0.2%
경우 12
 
0.1%
구획정리1(롯트가 9
 
0.1%
세분화되지않은 9
 
0.1%
Other values (4) 11
 
0.1%

(법정동)본번
Text

MISSING 

Distinct1553
Distinct (%)19.5%
Missing2042
Missing (%)20.4%
Memory size156.2 KiB
2024-04-22T09:13:24.716187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.6517969
Min length1

Characters and Unicode

Total characters29061
Distinct characters15
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique216 ?
Unique (%)2.7%

Sample

1st row1461
2nd row2220
3rd row0409
4th row0093
5th row0717
ValueCountFrequency (%)
197 52
 
0.7%
1005 47
 
0.6%
0200 40
 
0.5%
1 39
 
0.5%
0794 38
 
0.5%
864 37
 
0.5%
0797 34
 
0.4%
0237 32
 
0.4%
0707 31
 
0.4%
0321 30
 
0.4%
Other values (1543) 7578
95.2%
2024-04-22T09:13:25.290447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6449
22.2%
1 4463
15.4%
2 2774
9.5%
6 2415
 
8.3%
4 2321
 
8.0%
7 2288
 
7.9%
5 2275
 
7.8%
3 2177
 
7.5%
8 1972
 
6.8%
9 1901
 
6.5%
Other values (5) 26
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29035
99.9%
Uppercase Letter 13
 
< 0.1%
Other Letter 7
 
< 0.1%
Dash Punctuation 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6449
22.2%
1 4463
15.4%
2 2774
9.6%
6 2415
 
8.3%
4 2321
 
8.0%
7 2288
 
7.9%
5 2275
 
7.8%
3 2177
 
7.5%
8 1972
 
6.8%
9 1901
 
6.5%
Uppercase Letter
ValueCountFrequency (%)
A 7
53.8%
B 6
46.2%
Other Letter
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29041
99.9%
Latin 13
 
< 0.1%
Hangul 7
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6449
22.2%
1 4463
15.4%
2 2774
9.6%
6 2415
 
8.3%
4 2321
 
8.0%
7 2288
 
7.9%
5 2275
 
7.8%
3 2177
 
7.5%
8 1972
 
6.8%
9 1901
 
6.5%
Latin
ValueCountFrequency (%)
A 7
53.8%
B 6
46.2%
Hangul
ValueCountFrequency (%)
6
85.7%
1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29054
> 99.9%
Hangul 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6449
22.2%
1 4463
15.4%
2 2774
9.5%
6 2415
 
8.3%
4 2321
 
8.0%
7 2288
 
7.9%
5 2275
 
7.8%
3 2177
 
7.5%
8 1972
 
6.8%
9 1901
 
6.5%
Other values (3) 19
 
0.1%
Hangul
ValueCountFrequency (%)
6
85.7%
1
 
14.3%

(법정동)부번
Text

MISSING 

Distinct127
Distinct (%)5.1%
Missing7518
Missing (%)75.2%
Memory size156.2 KiB
2024-04-22T09:13:25.554082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length2.7207897
Min length1

Characters and Unicode

Total characters6753
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.8%

Sample

1st row0018
2nd row0002
3rd row0106
4th row3
5th row3
ValueCountFrequency (%)
0001 513
20.7%
1 305
 
12.3%
2 215
 
8.7%
0002 137
 
5.5%
3 130
 
5.2%
0003 80
 
3.2%
0006 80
 
3.2%
4 56
 
2.3%
0004 53
 
2.1%
0005 52
 
2.1%
Other values (116) 862
34.7%
2024-04-22T09:13:25.931855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3760
55.7%
1 1174
 
17.4%
2 602
 
8.9%
3 305
 
4.5%
6 210
 
3.1%
4 187
 
2.8%
7 185
 
2.7%
5 154
 
2.3%
9 89
 
1.3%
8 84
 
1.2%
Other values (3) 3
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6750
> 99.9%
Space Separator 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3760
55.7%
1 1174
 
17.4%
2 602
 
8.9%
3 305
 
4.5%
6 210
 
3.1%
4 187
 
2.8%
7 185
 
2.7%
5 154
 
2.3%
9 89
 
1.3%
8 84
 
1.2%
Space Separator
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6752
> 99.9%
Latin 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3760
55.7%
1 1174
 
17.4%
2 602
 
8.9%
3 305
 
4.5%
6 210
 
3.1%
4 187
 
2.8%
7 185
 
2.7%
5 154
 
2.3%
9 89
 
1.3%
8 84
 
1.2%
Other values (2) 2
 
< 0.1%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6753
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3760
55.7%
1 1174
 
17.4%
2 602
 
8.9%
3 305
 
4.5%
6 210
 
3.1%
4 187
 
2.8%
7 185
 
2.7%
5 154
 
2.3%
9 89
 
1.3%
8 84
 
1.2%
Other values (3) 3
 
< 0.1%
Distinct68
Distinct (%)28.8%
Missing9764
Missing (%)97.6%
Memory size156.2 KiB
2024-04-22T09:13:26.173124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length10.423729
Min length1

Characters and Unicode

Total characters2460
Distinct characters104
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)3.4%

Sample

1st row동탄2택지개발지구 A-66블록
2nd row수원당수 A-1BL
3rd row화성향남2 A-22블록 행복주택
4th row감일동 감이동 일원 하남감일지구
5th row평택고덕 A-57-2
ValueCountFrequency (%)
일원 38
 
8.1%
a-1bl 15
 
3.2%
행복주택 13
 
2.8%
송리 12
 
2.5%
광주선운2 12
 
2.5%
681-60 12
 
2.5%
공공주택지구내 12
 
2.5%
화성향남2 10
 
2.1%
a3 10
 
2.1%
a4블록 10
 
2.1%
Other values (90) 328
69.5%
2024-04-22T09:13:26.558537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
249
 
10.1%
A 117
 
4.8%
- 116
 
4.7%
2 93
 
3.8%
1 84
 
3.4%
81
 
3.3%
80
 
3.3%
71
 
2.9%
69
 
2.8%
65
 
2.6%
Other values (94) 1435
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1316
53.5%
Decimal Number 491
 
20.0%
Uppercase Letter 250
 
10.2%
Space Separator 249
 
10.1%
Dash Punctuation 116
 
4.7%
Open Punctuation 17
 
0.7%
Close Punctuation 17
 
0.7%
Math Symbol 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
6.2%
80
 
6.1%
71
 
5.4%
69
 
5.2%
65
 
4.9%
61
 
4.6%
49
 
3.7%
44
 
3.3%
43
 
3.3%
42
 
3.2%
Other values (71) 711
54.0%
Decimal Number
ValueCountFrequency (%)
2 93
18.9%
1 84
17.1%
6 63
12.8%
5 52
10.6%
3 51
10.4%
8 43
8.8%
4 36
 
7.3%
9 35
 
7.1%
7 17
 
3.5%
0 17
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
A 117
46.8%
B 53
21.2%
L 45
 
18.0%
S 12
 
4.8%
H 8
 
3.2%
M 5
 
2.0%
U 5
 
2.0%
R 5
 
2.0%
Space Separator
ValueCountFrequency (%)
249
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1316
53.5%
Common 894
36.3%
Latin 250
 
10.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
6.2%
80
 
6.1%
71
 
5.4%
69
 
5.2%
65
 
4.9%
61
 
4.6%
49
 
3.7%
44
 
3.3%
43
 
3.3%
42
 
3.2%
Other values (71) 711
54.0%
Common
ValueCountFrequency (%)
249
27.9%
- 116
13.0%
2 93
 
10.4%
1 84
 
9.4%
6 63
 
7.0%
5 52
 
5.8%
3 51
 
5.7%
8 43
 
4.8%
4 36
 
4.0%
9 35
 
3.9%
Other values (5) 72
 
8.1%
Latin
ValueCountFrequency (%)
A 117
46.8%
B 53
21.2%
L 45
 
18.0%
S 12
 
4.8%
H 8
 
3.2%
M 5
 
2.0%
U 5
 
2.0%
R 5
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1316
53.5%
ASCII 1144
46.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
249
21.8%
A 117
10.2%
- 116
10.1%
2 93
 
8.1%
1 84
 
7.3%
6 63
 
5.5%
B 53
 
4.6%
5 52
 
4.5%
3 51
 
4.5%
L 45
 
3.9%
Other values (13) 221
19.3%
Hangul
ValueCountFrequency (%)
81
 
6.2%
80
 
6.1%
71
 
5.4%
69
 
5.2%
65
 
4.9%
61
 
4.6%
49
 
3.7%
44
 
3.3%
43
 
3.3%
42
 
3.2%
Other values (71) 711
54.0%

(도로명)우편번호
Real number (ℝ)

MISSING 

Distinct2198
Distinct (%)23.5%
Missing627
Missing (%)6.3%
Infinite0
Infinite (%)0.0%
Mean107477.4
Minimum1137
Maximum791759
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:26.711345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1137
5-th percentile10088
Q117782
median34333
Q358566
95-th percentile570090
Maximum791759
Range790622
Interquartile range (IQR)40784

Descriptive statistics

Standard deviation181199.87
Coefficient of variation (CV)1.6859346
Kurtosis3.2744005
Mean107477.4
Median Absolute Deviation (MAD)19337
Skewness2.1506917
Sum1.0073857 × 109
Variance3.2833392 × 1010
MonotonicityNot monotonic
2024-04-22T09:13:26.865338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18469 65
 
0.7%
18463 46
 
0.5%
614104 44
 
0.4%
21541 41
 
0.4%
15822 37
 
0.4%
31473 35
 
0.4%
14307 30
 
0.3%
13449 30
 
0.3%
28692 30
 
0.3%
57945 27
 
0.3%
Other values (2188) 8988
89.9%
(Missing) 627
 
6.3%
ValueCountFrequency (%)
1137 15
0.1%
1224 3
 
< 0.1%
1225 19
0.2%
1229 26
0.3%
1319 2
 
< 0.1%
1404 7
 
0.1%
1489 4
 
< 0.1%
1623 1
 
< 0.1%
1702 3
 
< 0.1%
1711 7
 
0.1%
ValueCountFrequency (%)
791759 2
 
< 0.1%
791280 11
0.1%
790250 3
 
< 0.1%
790110 1
 
< 0.1%
770756 5
0.1%
770110 8
0.1%
770100 2
 
< 0.1%
760751 1
 
< 0.1%
760010 2
 
< 0.1%
750902 3
 
< 0.1%

(도로명)주소
Text

MISSING 

Distinct2658
Distinct (%)28.2%
Missing571
Missing (%)5.7%
Memory size156.2 KiB
2024-04-22T09:13:27.213512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length41
Mean length22.892459
Min length3

Characters and Unicode

Total characters215853
Distinct characters505
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique596 ?
Unique (%)6.3%

Sample

1st row강원특별자치도 원주시 서원대로 427(단구동 개운2차한신휴플러스아파트 201)
2nd row부산광역시 연제구 토현로
3rd row경기도 의정부시 체육로 278(녹양동 휴먼시아아파트 101동)
4th row광주광역시 북구 양지마을길 8
5th row경기도 동두천시 지행로
ValueCountFrequency (%)
경기도 2647
 
6.5%
광주광역시 486
 
1.2%
북구 485
 
1.2%
강원특별자치도 481
 
1.2%
전북특별자치도 478
 
1.2%
충청북도 453
 
1.1%
충청남도 453
 
1.1%
경상남도 430
 
1.1%
서울특별시 427
 
1.0%
대전광역시 423
 
1.0%
Other values (5142) 34124
83.5%
2024-04-22T09:13:27.818564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31458
 
14.6%
9315
 
4.3%
7113
 
3.3%
6996
 
3.2%
6291
 
2.9%
1 5672
 
2.6%
5222
 
2.4%
4235
 
2.0%
) 4179
 
1.9%
( 4179
 
1.9%
Other values (495) 131193
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 150729
69.8%
Space Separator 31458
 
14.6%
Decimal Number 23841
 
11.0%
Close Punctuation 4179
 
1.9%
Open Punctuation 4179
 
1.9%
Dash Punctuation 938
 
0.4%
Uppercase Letter 499
 
0.2%
Other Punctuation 19
 
< 0.1%
Math Symbol 5
 
< 0.1%
Letter Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9315
 
6.2%
7113
 
4.7%
6996
 
4.6%
6291
 
4.2%
5222
 
3.5%
4235
 
2.8%
3279
 
2.2%
3196
 
2.1%
3156
 
2.1%
3096
 
2.1%
Other values (457) 98830
65.6%
Uppercase Letter
ValueCountFrequency (%)
L 191
38.3%
H 182
36.5%
B 36
 
7.2%
A 36
 
7.2%
I 10
 
2.0%
F 10
 
2.0%
N 10
 
2.0%
T 6
 
1.2%
P 6
 
1.2%
C 3
 
0.6%
Other values (7) 9
 
1.8%
Decimal Number
ValueCountFrequency (%)
1 5672
23.8%
2 3133
13.1%
3 2485
10.4%
0 2149
 
9.0%
5 2145
 
9.0%
4 2051
 
8.6%
6 1795
 
7.5%
7 1586
 
6.7%
8 1481
 
6.2%
9 1344
 
5.6%
Other Punctuation
ValueCountFrequency (%)
/ 14
73.7%
. 5
 
26.3%
Math Symbol
ValueCountFrequency (%)
~ 3
60.0%
2
40.0%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
31458
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4179
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4179
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 938
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 150729
69.8%
Common 64619
29.9%
Latin 505
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9315
 
6.2%
7113
 
4.7%
6996
 
4.6%
6291
 
4.2%
5222
 
3.5%
4235
 
2.8%
3279
 
2.2%
3196
 
2.1%
3156
 
2.1%
3096
 
2.1%
Other values (457) 98830
65.6%
Latin
ValueCountFrequency (%)
L 191
37.8%
H 182
36.0%
B 36
 
7.1%
A 36
 
7.1%
I 10
 
2.0%
F 10
 
2.0%
N 10
 
2.0%
T 6
 
1.2%
P 6
 
1.2%
C 3
 
0.6%
Other values (10) 15
 
3.0%
Common
ValueCountFrequency (%)
31458
48.7%
1 5672
 
8.8%
) 4179
 
6.5%
( 4179
 
6.5%
2 3133
 
4.8%
3 2485
 
3.8%
0 2149
 
3.3%
5 2145
 
3.3%
4 2051
 
3.2%
6 1795
 
2.8%
Other values (8) 5373
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 150729
69.8%
ASCII 65118
30.2%
Number Forms 4
 
< 0.1%
Math Operators 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31458
48.3%
1 5672
 
8.7%
) 4179
 
6.4%
( 4179
 
6.4%
2 3133
 
4.8%
3 2485
 
3.8%
0 2149
 
3.3%
5 2145
 
3.3%
4 2051
 
3.1%
6 1795
 
2.8%
Other values (25) 5872
 
9.0%
Hangul
ValueCountFrequency (%)
9315
 
6.2%
7113
 
4.7%
6996
 
4.6%
6291
 
4.2%
5222
 
3.5%
4235
 
2.8%
3279
 
2.2%
3196
 
2.1%
3156
 
2.1%
3096
 
2.1%
Other values (457) 98830
65.6%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Distinct1458
Distinct (%)28.5%
Missing4887
Missing (%)48.9%
Memory size156.2 KiB
2024-04-22T09:13:28.140218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length31
Mean length11.10796
Min length1

Characters and Unicode

Total characters56795
Distinct characters422
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique341 ?
Unique (%)6.7%

Sample

1st row10 (연산동)
2nd row278 휴먼시아(녹양1단지)
3rd row16 (송내 주공1단지 아파트)
4th row별가람1-4
5th row150 (민락동)
ValueCountFrequency (%)
주공아파트 235
 
2.3%
아파트 121
 
1.2%
11 99
 
0.9%
휴먼시아 93
 
0.9%
행복주택 81
 
0.8%
10 78
 
0.7%
15 68
 
0.7%
25 68
 
0.7%
12 64
 
0.6%
105 62
 
0.6%
Other values (1863) 9465
90.7%
2024-04-22T09:13:28.627282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5355
 
9.4%
( 3375
 
5.9%
) 3373
 
5.9%
1 2674
 
4.7%
2159
 
3.8%
2047
 
3.6%
2 2033
 
3.6%
1595
 
2.8%
1563
 
2.8%
1529
 
2.7%
Other values (412) 31092
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30762
54.2%
Decimal Number 12344
21.7%
Space Separator 5355
 
9.4%
Open Punctuation 3416
 
6.0%
Close Punctuation 3414
 
6.0%
Uppercase Letter 923
 
1.6%
Dash Punctuation 533
 
0.9%
Lowercase Letter 26
 
< 0.1%
Other Punctuation 16
 
< 0.1%
Other Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2159
 
7.0%
2047
 
6.7%
1595
 
5.2%
1563
 
5.1%
1529
 
5.0%
1326
 
4.3%
1273
 
4.1%
1050
 
3.4%
624
 
2.0%
599
 
1.9%
Other values (375) 16997
55.3%
Uppercase Letter
ValueCountFrequency (%)
L 339
36.7%
H 311
33.7%
A 126
 
13.7%
B 91
 
9.9%
C 23
 
2.5%
F 10
 
1.1%
N 10
 
1.1%
R 5
 
0.5%
M 3
 
0.3%
E 2
 
0.2%
Other values (2) 3
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 2674
21.7%
2 2033
16.5%
3 1348
10.9%
5 1108
9.0%
4 1074
8.7%
0 1017
 
8.2%
6 928
 
7.5%
7 853
 
6.9%
9 731
 
5.9%
8 578
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 12
46.2%
l 4
 
15.4%
o 4
 
15.4%
h 4
 
15.4%
m 2
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 12
75.0%
: 3
 
18.8%
/ 1
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 3375
98.8%
[ 41
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 3373
98.8%
] 41
 
1.2%
Space Separator
ValueCountFrequency (%)
5355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 533
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30762
54.2%
Common 25084
44.2%
Latin 949
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2159
 
7.0%
2047
 
6.7%
1595
 
5.2%
1563
 
5.1%
1529
 
5.0%
1326
 
4.3%
1273
 
4.1%
1050
 
3.4%
624
 
2.0%
599
 
1.9%
Other values (375) 16997
55.3%
Common
ValueCountFrequency (%)
5355
21.3%
( 3375
13.5%
) 3373
13.4%
1 2674
10.7%
2 2033
 
8.1%
3 1348
 
5.4%
5 1108
 
4.4%
4 1074
 
4.3%
0 1017
 
4.1%
6 928
 
3.7%
Other values (10) 2799
11.2%
Latin
ValueCountFrequency (%)
L 339
35.7%
H 311
32.8%
A 126
 
13.3%
B 91
 
9.6%
C 23
 
2.4%
e 12
 
1.3%
F 10
 
1.1%
N 10
 
1.1%
R 5
 
0.5%
l 4
 
0.4%
Other values (7) 18
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30762
54.2%
ASCII 26027
45.8%
Enclosed Alphanum 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5355
20.6%
( 3375
13.0%
) 3373
13.0%
1 2674
10.3%
2 2033
 
7.8%
3 1348
 
5.2%
5 1108
 
4.3%
4 1074
 
4.1%
0 1017
 
3.9%
6 928
 
3.6%
Other values (26) 3742
14.4%
Hangul
ValueCountFrequency (%)
2159
 
7.0%
2047
 
6.7%
1595
 
5.2%
1563
 
5.1%
1529
 
5.0%
1326
 
4.3%
1273
 
4.1%
1050
 
3.4%
624
 
2.0%
599
 
1.9%
Other values (375) 16997
55.3%
Enclosed Alphanum
ValueCountFrequency (%)
6
100.0%

공급유형명
Categorical

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공공분양
3374 
국민임대
2213 
행복주택
927 
임대상가
829 
공공임대(10년)
584 
Other values (19)
2073 

Length

Max length10
Median length4
Mean length4.8825
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row공공분양
2nd row장기임대
3rd row국민임대
4th row신축다세대매입임대
5th row공공분양

Common Values

ValueCountFrequency (%)
공공분양 3374
33.7%
국민임대 2213
22.1%
행복주택 927
 
9.3%
임대상가 829
 
8.3%
공공임대(10년) 584
 
5.8%
신축다세대매입임대 540
 
5.4%
공공임대(5년) 494
 
4.9%
영구임대 381
 
3.8%
근로복지 166
 
1.7%
장기임대 128
 
1.3%
Other values (14) 364
 
3.6%

Length

2024-04-22T09:13:28.808952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
공공분양 3374
33.7%
국민임대 2213
22.1%
행복주택 927
 
9.3%
임대상가 829
 
8.3%
공공임대(10년 584
 
5.8%
신축다세대매입임대 540
 
5.4%
공공임대(5년 494
 
4.9%
영구임대 381
 
3.8%
근로복지 166
 
1.7%
장기임대 128
 
1.3%
Other values (14) 364
 
3.6%

전용면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct2620
Distinct (%)26.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55.613778
Minimum0
Maximum2712.38
Zeros114
Zeros (%)1.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:28.989434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21.94
Q137.49
median51.4091
Q359.95
95-th percentile84.9616
Maximum2712.38
Range2712.38
Interquartile range (IQR)22.46

Descriptive statistics

Standard deviation58.351233
Coefficient of variation (CV)1.0492226
Kurtosis805.67509
Mean55.613778
Median Absolute Deviation (MAD)11.6891
Skewness22.979862
Sum556137.78
Variance3404.8663
MonotonicityNot monotonic
2024-04-22T09:13:29.167453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
59.99 144
 
1.4%
46.9 117
 
1.2%
0.0 114
 
1.1%
51.93 110
 
1.1%
26.37 105
 
1.1%
59.98 76
 
0.8%
31.32 71
 
0.7%
59.89 69
 
0.7%
59.92 64
 
0.6%
84.98 63
 
0.6%
Other values (2610) 9067
90.7%
ValueCountFrequency (%)
0.0 114
1.1%
5.75 1
 
< 0.1%
6.38 1
 
< 0.1%
7.25 1
 
< 0.1%
7.41 1
 
< 0.1%
10.44 1
 
< 0.1%
11.83 1
 
< 0.1%
12.0 1
 
< 0.1%
12.02 1
 
< 0.1%
12.36 3
 
< 0.1%
ValueCountFrequency (%)
2712.38 1
< 0.1%
2298.89 1
< 0.1%
1820.22 1
< 0.1%
1270.79 1
< 0.1%
1233.04 1
< 0.1%
1188.88 1
< 0.1%
1033.95 1
< 0.1%
800.48 1
< 0.1%
609.23 1
< 0.1%
583.44 1
< 0.1%

주거공용면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct6934
Distinct (%)69.4%
Missing6
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean17.875147
Minimum0
Maximum1316.8868
Zeros1631
Zeros (%)16.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:29.334249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q112.89235
median18.911
Q323.4116
95-th percentile28.762185
Maximum1316.8868
Range1316.8868
Interquartile range (IQR)10.51925

Descriptive statistics

Standard deviation25.746745
Coefficient of variation (CV)1.4403655
Kurtosis1234.3869
Mean17.875147
Median Absolute Deviation (MAD)5.0806
Skewness30.509809
Sum178644.21
Variance662.89487
MonotonicityNot monotonic
2024-04-22T09:13:29.531907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 1631
 
16.3%
5.85 28
 
0.3%
13.48 18
 
0.2%
13.38 15
 
0.1%
17.1238 14
 
0.1%
13.56 12
 
0.1%
13.5 12
 
0.1%
13.85 12
 
0.1%
14.02 12
 
0.1%
12.6 11
 
0.1%
Other values (6924) 8229
82.3%
ValueCountFrequency (%)
0.0 1631
16.3%
0.02 1
 
< 0.1%
1.49 1
 
< 0.1%
1.59 1
 
< 0.1%
1.82 2
 
< 0.1%
2.0 1
 
< 0.1%
2.15 3
 
< 0.1%
2.21 1
 
< 0.1%
2.55 2
 
< 0.1%
2.71 1
 
< 0.1%
ValueCountFrequency (%)
1316.8868 1
< 0.1%
1116.1334 1
< 0.1%
883.7344 1
< 0.1%
618.5313 1
< 0.1%
616.9807 1
< 0.1%
598.6528 1
< 0.1%
577.2126 1
< 0.1%
501.9927 1
< 0.1%
429.3665 1
< 0.1%
290.84 1
< 0.1%

공급면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct7550
Distinct (%)75.5%
Missing6
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean73.419837
Minimum0
Maximum4029.2668
Zeros101
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:29.700152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile27.286775
Q152.5
median68.98065
Q382.57255
95-th percentile113.52406
Maximum4029.2668
Range4029.2668
Interquartile range (IQR)30.07255

Descriptive statistics

Standard deviation80.134542
Coefficient of variation (CV)1.0914563
Kurtosis1106.013
Mean73.419837
Median Absolute Deviation (MAD)15.56995
Skewness28.036401
Sum733757.85
Variance6421.5449
MonotonicityNot monotonic
2024-04-22T09:13:29.862927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 101
 
1.0%
42.98 50
 
0.5%
49.59 37
 
0.4%
56.2 30
 
0.3%
57.55 27
 
0.3%
49.49 25
 
0.2%
57.92 25
 
0.2%
45.55 24
 
0.2%
33.06 22
 
0.2%
52.89 22
 
0.2%
Other values (7540) 9631
96.3%
ValueCountFrequency (%)
0.0 101
1.0%
7.41 1
 
< 0.1%
11.83 1
 
< 0.1%
12.0 1
 
< 0.1%
12.02 1
 
< 0.1%
12.36 1
 
< 0.1%
12.62 1
 
< 0.1%
12.67 1
 
< 0.1%
12.72 3
 
< 0.1%
13.02 1
 
< 0.1%
ValueCountFrequency (%)
4029.2668 1
< 0.1%
3415.0234 1
< 0.1%
2703.9544 1
< 0.1%
1887.7707 1
< 0.1%
1831.6928 1
< 0.1%
1766.0926 1
< 0.1%
1535.9427 1
< 0.1%
1419.0113 1
< 0.1%
985.0365 1
< 0.1%
828.81 1
< 0.1%

호수
Real number (ℝ)

Distinct702
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean115.5533
Minimum1
Maximum3320
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:13:30.044737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q18
median56.5
Q3154
95-th percentile439
Maximum3320
Range3319
Interquartile range (IQR)146

Descriptive statistics

Standard deviation169.71724
Coefficient of variation (CV)1.4687356
Kurtosis34.82306
Mean115.5533
Median Absolute Deviation (MAD)53.5
Skewness3.9818499
Sum1155533
Variance28803.942
MonotonicityNot monotonic
2024-04-22T09:13:30.202759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1071
 
10.7%
2 477
 
4.8%
3 317
 
3.2%
4 260
 
2.6%
8 173
 
1.7%
6 164
 
1.6%
60 127
 
1.3%
120 126
 
1.3%
30 123
 
1.2%
5 111
 
1.1%
Other values (692) 7051
70.5%
ValueCountFrequency (%)
1 1071
10.7%
2 477
4.8%
3 317
 
3.2%
4 260
 
2.6%
5 111
 
1.1%
6 164
 
1.6%
7 91
 
0.9%
8 173
 
1.7%
9 69
 
0.7%
10 96
 
1.0%
ValueCountFrequency (%)
3320 1
< 0.1%
3000 1
< 0.1%
2023 1
< 0.1%
1825 1
< 0.1%
1780 1
< 0.1%
1768 1
< 0.1%
1752 1
< 0.1%
1746 1
< 0.1%
1580 1
< 0.1%
1563 1
< 0.1%
Distinct1757
Distinct (%)18.6%
Missing574
Missing (%)5.7%
Memory size156.2 KiB
Minimum1976-11-01 00:00:00
Maximum2027-09-30 00:00:00
2024-04-22T09:13:30.342015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:13:30.541788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1902
Distinct (%)20.2%
Missing587
Missing (%)5.9%
Memory size156.2 KiB
Minimum1976-11-30 00:00:00
Maximum2027-11-30 00:00:00
2024-04-22T09:13:30.717311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:13:30.880920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Sample

지역본부단지코드관리기관단지명시군구명(법정동)법정동일련번호(법정동)주소(법정동)지번-산(법정동)본번(법정동)부번(법정동)상세주소(도로명)우편번호(도로명)주소(도로명)상세주소공급유형명전용면적주거공용면적공급면적호수입주지정시작일자입주지정종료일자
4683강원지역본부C01220공사원주개운2원주시5113010700강원특별자치도 원주시 단구동산번지1461<NA><NA>26449강원특별자치도 원주시 서원대로 427(단구동 개운2차한신휴플러스아파트 201)<NA>공공분양65.190.065.1921988-01-011988-01-31
620부산울산지역본부C00856공사부산망미연제구2647010200부산 연제구 연산동산번지2220<NA><NA>47573부산광역시 연제구 토현로10 (연산동)장기임대92.560.092.56461999-01-011999-01-01
11554경기북부지역본부C01262공사의정부녹양 A1블럭(의정부녹양1)의정부시4115011100경기 의정부시 녹양동일반번지0409<NA><NA>11609경기도 의정부시 체육로 278(녹양동 휴먼시아아파트 101동)278 휴먼시아(녹양1단지)국민임대36.6817.176153.85614372008-05-102008-06-18
8690광주전남지역본부D00423공사광주광역시 북구 양산동 93-18(오션빌)북구2917012800광주광역시 북구 양산동일반번지00930018<NA>61071광주광역시 북구 양지마을길 8<NA>신축다세대매입임대46.5315.261.7332013-05-202013-06-20
11466경기북부지역본부C00785공사동두천송내1부천시4119710500경기 부천시 소사구 송내동산번지07170002<NA>11349경기도 동두천시 지행로16 (송내 주공1단지 아파트)공공분양84.7623.0892107.84922162004-08-262004-09-24
11822경기북부지역본부C01725공사남양주별가람1-4단지남양주시4136011100경기도 남양주시 별내동일반번지0808<NA><NA>12097경기도 남양주시 덕송3로 7(별내동 별가람마을아파트)별가람1-4공공임대(10년)84.6426.108110.74822013-12-232014-01-21
14230대전충남지사C00357공단조치원신흥2<NA><NA><NA><NA><NA><NA><NA>339800393번지<NA>공공분양84.8417.7827102.62272401999-05-141999-06-30
11566경기북부지역본부C01264공사의정부민락의정부시<NA><NA><NA><NA><NA><NA>11806경기도 의정부시 오목로150 (민락동)근로복지59.3922.880482.2704901999-08-181999-10-15
10120경남지역본부C00660공사김해진영김해시4825025027경상남도 김해시 여래리산번지0981<NA><NA>50864경상남도 김해시 진영읍 김해대로 462(진영휴먼시아아파트)462 (진영휴먼시아)국민임대39.7218.75458.4743452010-04-102010-05-09
14632광주전남지사C00094공단나주용산2<NA><NA><NA><NA><NA><NA><NA>520754344번지<NA>공공분양39.7411.894851.6348891999-02-271999-03-18
지역본부단지코드관리기관단지명시군구명(법정동)법정동일련번호(법정동)주소(법정동)지번-산(법정동)본번(법정동)부번(법정동)상세주소(도로명)우편번호(도로명)주소(도로명)상세주소공급유형명전용면적주거공용면적공급면적호수입주지정시작일자입주지정종료일자
14911대구경북지사C00117공단대구월성2달서구2729012000대구광역시 달서구 월성동일반번지86<NA><NA>42732대구광역시 달서구 월성로 77(월성동 월성주공아파트2단지)<NA>임대상가25.20.025.231991-09-301991-10-29
10733경남지역본부C02821공사진주가좌(행복주택)진주시4817013100경상남도 진주시 가좌동일반번지1072<NA><NA><NA><NA><NA>행복주택14.829.901424.72141482024-06-302024-07-31
1194인천지역본부C01565공사화도진분전동구2814010200인천 동구 화수동산번지0321<NA><NA>401020인천 동구 화수동 321 화도진그린빌<NA>공공분양59.7925.692585.4825162001-12-012001-12-31
2955경기남부지역본부C01704공사오산세교 9단지오산시4137010700경기 오산시 금암동블럭0B-3<NA><NA>447280경기 오산시 금암동 B-3BL 죽미마을 휴먼시아 휴튼9단지<NA>공공분양84.5325.6999110.229962011-07-292011-09-27
4537강원지역본부C00462공사강릉노암2강릉시<NA><NA><NA><NA><NA><NA>25591강원특별자치도 강릉시 노가니남길25 주공아파트 (노암동)공공분양62.810.062.812<NA><NA>
3954경기남부지역본부C02712공사수원당수1(행복·영구)수원시4111314100경기도 수원시 권선구 당수동블럭<NA><NA>수원당수 A-1BL16373경기도 수원시 권선구 당수로130번길 13(당수동)<NA>행복주택26.8814.442941.32292302023-05-152023-07-13
5761대전충남지역본부C00019공사아산배방11블럭아산시<NA><NA>일반번지<NA><NA><NA>336857충남 아산시 배방면 장재리 159번지 아산배방11블럭<NA>공공분양74.3223.862798.18273302010-09-302010-10-29
5404충북지역본부C01596공사음성감곡음성군4377037021충북 음성군 오향리일반번지1181<NA><NA>27611충청북도 음성군 감곡면 오향로 31(음성감곡휴먼시아)<NA>국민임대39.5718.353557.92353442010-11-092010-12-08
4773강원지역본부C01486공사춘천근화춘천시5111011700강원특별자치도 춘천시 근화동산번지02790008<NA>24361강원특별자치도 춘천시 근화길15번길26 (근화동)공공분양56.20.056.221988-01-011988-01-31
13644경기남부지사C00399공단평택송화평택시4122025000경기도 평택시 팽성읍일반번지0163<NA><NA>17993경기도 평택시 팽성읍 팽성송화로163국민임대46.2220.843867.06382532004-06-182004-07-17