Overview

Dataset statistics

Number of variables27
Number of observations500
Missing cells1799
Missing cells (%)13.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory113.4 KiB
Average record size in memory232.3 B

Variable types

Text8
Categorical8
Numeric10
Unsupported1

Dataset

Description샘플 데이터
Author국토교통부(open.eais.go.kr)
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=28

Alerts

대장_구분_코드 has constant value ""Constant
대장_구분_코드_명 has constant value ""Constant
대장_종류_코드 has constant value ""Constant
대장_종류_코드_명 has constant value ""Constant
새주소_지상지하_코드 has constant value ""Constant
대지_구분_코드 is highly imbalanced (91.2%)Imbalance
층_구분_코드 is highly imbalanced (81.6%)Imbalance
층_구분_코드_명 is highly imbalanced (85.9%)Imbalance
도로명_대지_위치 has 22 (4.4%) missing valuesMissing
건물_명 has 60 (12.0%) missing valuesMissing
특수지_명 has 486 (97.2%) missing valuesMissing
블록 has 496 (99.2%) missing valuesMissing
로트 has 500 (100.0%) missing valuesMissing
새주소_도로_코드 has 35 (7.0%) missing valuesMissing
새주소_법정동_코드 has 36 (7.2%) missing valuesMissing
새주소_본_번 has 33 (6.6%) missing valuesMissing
새주소_부_번 has 25 (5.0%) missing valuesMissing
동_명칭 has 105 (21.0%) missing valuesMissing
관리_건축물대장_PK has unique valuesUnique
로트 is an unsupported type, check if it needs cleaning or further analysisUnsupported
has 229 (45.8%) zerosZeros
새주소_부_번 has 410 (82.0%) zerosZeros
층_번호 has 107 (21.4%) zerosZeros

Reproduction

Analysis started2023-12-10 15:02:44.847576
Analysis finished2023-12-10 15:02:46.284675
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct500
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-11T00:02:46.490805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length12.502
Min length11

Characters and Unicode

Total characters6251
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique500 ?
Unique (%)100.0%

Sample

1st row41150-104927
2nd row11290-111317
3rd row27290-100219599
4th row26350-100230945
5th row48123-158735
ValueCountFrequency (%)
41150-104927 1
 
0.2%
42150-86637 1
 
0.2%
41463-171738 1
 
0.2%
41500-65358 1
 
0.2%
36110-5000060971 1
 
0.2%
45130-100225048 1
 
0.2%
47190-80469 1
 
0.2%
11650-61841 1
 
0.2%
26290-100186491 1
 
0.2%
41480-72374 1
 
0.2%
Other values (490) 490
98.0%
2023-12-11T00:02:47.159839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1171
18.7%
0 993
15.9%
2 681
10.9%
4 625
10.0%
- 500
8.0%
3 444
 
7.1%
5 409
 
6.5%
7 396
 
6.3%
6 363
 
5.8%
8 356
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5751
92.0%
Dash Punctuation 500
 
8.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1171
20.4%
0 993
17.3%
2 681
11.8%
4 625
10.9%
3 444
 
7.7%
5 409
 
7.1%
7 396
 
6.9%
6 363
 
6.3%
8 356
 
6.2%
9 313
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 500
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6251
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1171
18.7%
0 993
15.9%
2 681
10.9%
4 625
10.0%
- 500
8.0%
3 444
 
7.1%
5 409
 
6.5%
7 396
 
6.3%
6 363
 
5.8%
8 356
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6251
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1171
18.7%
0 993
15.9%
2 681
10.9%
4 625
10.0%
- 500
8.0%
3 444
 
7.1%
5 409
 
6.5%
7 396
 
6.3%
6 363
 
5.8%
8 356
 
5.7%

대장_구분_코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 500
100.0%

Length

2023-12-11T00:02:47.543998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:02:48.043634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 500
100.0%

대장_구분_코드_명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
집합
500 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집합
2nd row집합
3rd row집합
4th row집합
5th row집합

Common Values

ValueCountFrequency (%)
집합 500
100.0%

Length

2023-12-11T00:02:48.497241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:02:49.227553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
집합 500
100.0%

대장_종류_코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
4
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row4
3rd row4
4th row4
5th row4

Common Values

ValueCountFrequency (%)
4 500
100.0%

Length

2023-12-11T00:02:49.675275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:02:49.895924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 500
100.0%

대장_종류_코드_명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
전유부
500 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전유부
2nd row전유부
3rd row전유부
4th row전유부
5th row전유부

Common Values

ValueCountFrequency (%)
전유부 500
100.0%

Length

2023-12-11T00:02:50.084577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:02:50.285770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전유부 500
100.0%
Distinct495
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-11T00:02:50.860568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length20.686
Min length15

Characters and Unicode

Total characters10343
Distinct characters254
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique490 ?
Unique (%)98.0%

Sample

1st row서울특별시 송파구 장지동 847번지
2nd row경기도 성남시 분당구 야탑동 219-1번지
3rd row경상남도 창원시 마산합포구 진동면 요장리 393-2번지
4th row대전광역시 서구 관저동 984번지
5th row경기도 이천시 부발읍 응암리 109번지
ValueCountFrequency (%)
경기도 128
 
5.9%
서울특별시 110
 
5.1%
부산광역시 46
 
2.1%
인천광역시 35
 
1.6%
경상남도 28
 
1.3%
강원도 21
 
1.0%
대구광역시 21
 
1.0%
수원시 20
 
0.9%
경상북도 20
 
0.9%
남구 16
 
0.7%
Other values (1067) 1708
79.3%
2023-12-11T00:02:51.797759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1655
 
16.0%
521
 
5.0%
510
 
4.9%
499
 
4.8%
494
 
4.8%
1 414
 
4.0%
392
 
3.8%
272
 
2.6%
- 235
 
2.3%
3 209
 
2.0%
Other values (244) 5142
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6581
63.6%
Decimal Number 1872
 
18.1%
Space Separator 1655
 
16.0%
Dash Punctuation 235
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
521
 
7.9%
510
 
7.7%
499
 
7.6%
494
 
7.5%
392
 
6.0%
272
 
4.1%
180
 
2.7%
169
 
2.6%
166
 
2.5%
142
 
2.2%
Other values (232) 3236
49.2%
Decimal Number
ValueCountFrequency (%)
1 414
22.1%
3 209
11.2%
2 200
10.7%
4 166
8.9%
5 163
 
8.7%
8 156
 
8.3%
6 147
 
7.9%
0 145
 
7.7%
7 144
 
7.7%
9 128
 
6.8%
Space Separator
ValueCountFrequency (%)
1655
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 235
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6581
63.6%
Common 3762
36.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
521
 
7.9%
510
 
7.7%
499
 
7.6%
494
 
7.5%
392
 
6.0%
272
 
4.1%
180
 
2.7%
169
 
2.6%
166
 
2.5%
142
 
2.2%
Other values (232) 3236
49.2%
Common
ValueCountFrequency (%)
1655
44.0%
1 414
 
11.0%
- 235
 
6.2%
3 209
 
5.6%
2 200
 
5.3%
4 166
 
4.4%
5 163
 
4.3%
8 156
 
4.1%
6 147
 
3.9%
0 145
 
3.9%
Other values (2) 272
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6581
63.6%
ASCII 3762
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1655
44.0%
1 414
 
11.0%
- 235
 
6.2%
3 209
 
5.6%
2 200
 
5.3%
4 166
 
4.4%
5 163
 
4.3%
8 156
 
4.1%
6 147
 
3.9%
0 145
 
3.9%
Other values (2) 272
 
7.2%
Hangul
ValueCountFrequency (%)
521
 
7.9%
510
 
7.7%
499
 
7.6%
494
 
7.5%
392
 
6.0%
272
 
4.1%
180
 
2.7%
169
 
2.6%
166
 
2.5%
142
 
2.2%
Other values (232) 3236
49.2%
Distinct472
Distinct (%)98.7%
Missing22
Missing (%)4.4%
Memory size4.0 KiB
2023-12-11T00:02:52.477893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length18.518828
Min length11

Characters and Unicode

Total characters8852
Distinct characters286
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique466 ?
Unique (%)97.5%

Sample

1st row인천광역시 서구 비지니스로 41
2nd row충청남도 태안군 갈음이길 77
3rd row서울특별시 노원구 중계로 230
4th row서울특별시 마포구 포은로 147
5th row전라북도 익산시 배산로 34-1
ValueCountFrequency (%)
경기도 116
 
5.8%
서울특별시 98
 
4.9%
대구광역시 33
 
1.6%
부산광역시 31
 
1.5%
인천광역시 28
 
1.4%
경상남도 28
 
1.4%
충청북도 22
 
1.1%
강원도 22
 
1.1%
충청남도 20
 
1.0%
경상북도 18
 
0.9%
Other values (852) 1594
79.3%
2023-12-11T00:02:53.498338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1534
 
17.3%
472
 
5.3%
429
 
4.8%
372
 
4.2%
1 337
 
3.8%
262
 
3.0%
241
 
2.7%
2 216
 
2.4%
3 198
 
2.2%
175
 
2.0%
Other values (276) 4616
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5649
63.8%
Decimal Number 1610
 
18.2%
Space Separator 1534
 
17.3%
Dash Punctuation 59
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
472
 
8.4%
429
 
7.6%
372
 
6.6%
262
 
4.6%
241
 
4.3%
175
 
3.1%
158
 
2.8%
149
 
2.6%
131
 
2.3%
122
 
2.2%
Other values (264) 3138
55.5%
Decimal Number
ValueCountFrequency (%)
1 337
20.9%
2 216
13.4%
3 198
12.3%
5 155
9.6%
4 137
8.5%
7 129
 
8.0%
6 121
 
7.5%
0 111
 
6.9%
8 105
 
6.5%
9 101
 
6.3%
Space Separator
ValueCountFrequency (%)
1534
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5649
63.8%
Common 3203
36.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
472
 
8.4%
429
 
7.6%
372
 
6.6%
262
 
4.6%
241
 
4.3%
175
 
3.1%
158
 
2.8%
149
 
2.6%
131
 
2.3%
122
 
2.2%
Other values (264) 3138
55.5%
Common
ValueCountFrequency (%)
1534
47.9%
1 337
 
10.5%
2 216
 
6.7%
3 198
 
6.2%
5 155
 
4.8%
4 137
 
4.3%
7 129
 
4.0%
6 121
 
3.8%
0 111
 
3.5%
8 105
 
3.3%
Other values (2) 160
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5649
63.8%
ASCII 3203
36.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1534
47.9%
1 337
 
10.5%
2 216
 
6.7%
3 198
 
6.2%
5 155
 
4.8%
4 137
 
4.3%
7 129
 
4.0%
6 121
 
3.8%
0 111
 
3.5%
8 105
 
3.3%
Other values (2) 160
 
5.0%
Hangul
ValueCountFrequency (%)
472
 
8.4%
429
 
7.6%
372
 
6.6%
262
 
4.6%
241
 
4.3%
175
 
3.1%
158
 
2.8%
149
 
2.6%
131
 
2.3%
122
 
2.2%
Other values (264) 3138
55.5%

건물_명
Text

MISSING 

Distinct407
Distinct (%)92.5%
Missing60
Missing (%)12.0%
Memory size4.0 KiB
2023-12-11T00:02:53.943834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.5568182
Min length2

Characters and Unicode

Total characters3325
Distinct characters360
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique386 ?
Unique (%)87.7%

Sample

1st row연지자이2차
2nd row광교마을40단지
3rd row박촌 풍림아이원
4th row우미아파트
5th row광양써니밸리아파트
ValueCountFrequency (%)
주공아파트 11
 
1.9%
아파트 7
 
1.2%
현대아파트 6
 
1.1%
우성아파트 5
 
0.9%
삼성아파트 3
 
0.5%
세아청솔아파트 3
 
0.5%
302동 3
 
0.5%
진흥아파트 3
 
0.5%
푸르지오 3
 
0.5%
더샵 2
 
0.4%
Other values (487) 520
91.9%
2023-12-11T00:02:54.940804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
254
 
7.6%
240
 
7.2%
233
 
7.0%
126
 
3.8%
64
 
1.9%
56
 
1.7%
55
 
1.7%
50
 
1.5%
48
 
1.4%
47
 
1.4%
Other values (350) 2152
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2999
90.2%
Space Separator 126
 
3.8%
Decimal Number 118
 
3.5%
Uppercase Letter 51
 
1.5%
Dash Punctuation 11
 
0.3%
Lowercase Letter 6
 
0.2%
Open Punctuation 5
 
0.2%
Close Punctuation 5
 
0.2%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
254
 
8.5%
240
 
8.0%
233
 
7.8%
64
 
2.1%
56
 
1.9%
55
 
1.8%
50
 
1.7%
48
 
1.6%
47
 
1.6%
46
 
1.5%
Other values (311) 1906
63.6%
Uppercase Letter
ValueCountFrequency (%)
S 7
13.7%
K 6
11.8%
D 4
 
7.8%
L 4
 
7.8%
H 3
 
5.9%
C 3
 
5.9%
E 3
 
5.9%
T 3
 
5.9%
V 3
 
5.9%
W 3
 
5.9%
Other values (9) 12
23.5%
Decimal Number
ValueCountFrequency (%)
2 38
32.2%
1 34
28.8%
3 13
 
11.0%
0 10
 
8.5%
6 6
 
5.1%
4 6
 
5.1%
5 5
 
4.2%
8 3
 
2.5%
7 2
 
1.7%
9 1
 
0.8%
Lowercase Letter
ValueCountFrequency (%)
e 4
66.7%
n 1
 
16.7%
a 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
· 2
50.0%
. 1
25.0%
& 1
25.0%
Space Separator
ValueCountFrequency (%)
126
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2999
90.2%
Common 269
 
8.1%
Latin 57
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
254
 
8.5%
240
 
8.0%
233
 
7.8%
64
 
2.1%
56
 
1.9%
55
 
1.8%
50
 
1.7%
48
 
1.6%
47
 
1.6%
46
 
1.5%
Other values (311) 1906
63.6%
Latin
ValueCountFrequency (%)
S 7
12.3%
K 6
 
10.5%
e 4
 
7.0%
D 4
 
7.0%
L 4
 
7.0%
H 3
 
5.3%
C 3
 
5.3%
E 3
 
5.3%
T 3
 
5.3%
V 3
 
5.3%
Other values (12) 17
29.8%
Common
ValueCountFrequency (%)
126
46.8%
2 38
 
14.1%
1 34
 
12.6%
3 13
 
4.8%
- 11
 
4.1%
0 10
 
3.7%
6 6
 
2.2%
4 6
 
2.2%
5 5
 
1.9%
( 5
 
1.9%
Other values (7) 15
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2999
90.2%
ASCII 324
 
9.7%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
254
 
8.5%
240
 
8.0%
233
 
7.8%
64
 
2.1%
56
 
1.9%
55
 
1.8%
50
 
1.7%
48
 
1.6%
47
 
1.6%
46
 
1.5%
Other values (311) 1906
63.6%
ASCII
ValueCountFrequency (%)
126
38.9%
2 38
 
11.7%
1 34
 
10.5%
3 13
 
4.0%
- 11
 
3.4%
0 10
 
3.1%
S 7
 
2.2%
K 6
 
1.9%
6 6
 
1.9%
4 6
 
1.9%
Other values (28) 67
20.7%
None
ValueCountFrequency (%)
· 2
100.0%

시군구_코드
Real number (ℝ)

Distinct165
Distinct (%)33.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33572.024
Minimum11110
Maximum50130
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:02:55.595400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11110
5-th percentile11350
Q126665
median41195
Q343114
95-th percentile48250
Maximum50130
Range39020
Interquartile range (IQR)16449

Descriptive statistics

Standard deviation12722.403
Coefficient of variation (CV)0.37895849
Kurtosis-0.92014214
Mean33572.024
Median Absolute Deviation (MAD)7055
Skewness-0.66260747
Sum16786012
Variance1.6185955 × 108
MonotonicityNot monotonic
2023-12-11T00:02:56.394883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11680 10
 
2.0%
48250 9
 
1.8%
41195 9
 
1.8%
41281 8
 
1.6%
11530 8
 
1.6%
11500 8
 
1.6%
27290 7
 
1.4%
45140 7
 
1.4%
48310 7
 
1.4%
41360 7
 
1.4%
Other values (155) 420
84.0%
ValueCountFrequency (%)
11110 1
 
0.2%
11140 1
 
0.2%
11170 1
 
0.2%
11200 4
0.8%
11215 2
0.4%
11230 4
0.8%
11260 1
 
0.2%
11290 4
0.8%
11305 2
0.4%
11320 2
0.4%
ValueCountFrequency (%)
50130 3
 
0.6%
50110 5
1.0%
48870 1
 
0.2%
48820 1
 
0.2%
48740 1
 
0.2%
48330 3
 
0.6%
48310 7
1.4%
48250 9
1.8%
48220 1
 
0.2%
48170 6
1.2%

법정동_코드
Real number (ℝ)

Distinct92
Distinct (%)18.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13180.738
Minimum10100
Maximum42031
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:02:56.853333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10100
5-th percentile10100
Q110400
median10900
Q312300
95-th percentile25622
Maximum42031
Range31931
Interquartile range (IQR)1900

Descriptive statistics

Standard deviation5897.4877
Coefficient of variation (CV)0.44743228
Kurtosis6.5370096
Mean13180.738
Median Absolute Deviation (MAD)600
Skewness2.6767539
Sum6590369
Variance34780361
MonotonicityNot monotonic
2023-12-11T00:02:57.349084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10100 44
 
8.8%
10300 43
 
8.6%
10200 33
 
6.6%
10800 30
 
6.0%
10500 26
 
5.2%
10600 23
 
4.6%
10900 23
 
4.6%
10400 22
 
4.4%
10700 19
 
3.8%
11100 16
 
3.2%
Other values (82) 221
44.2%
ValueCountFrequency (%)
10100 44
8.8%
10200 33
6.6%
10300 43
8.6%
10400 22
4.4%
10500 26
5.2%
10600 23
4.6%
10700 19
3.8%
10800 30
6.0%
10900 23
4.6%
11000 9
 
1.8%
ValueCountFrequency (%)
42031 1
 
0.2%
39021 1
 
0.2%
38023 1
 
0.2%
37023 1
 
0.2%
37021 3
0.6%
36026 1
 
0.2%
36021 1
 
0.2%
35028 1
 
0.2%
35022 1
 
0.2%
34022 1
 
0.2%

대지_구분_코드
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
491 
2
 
8
1
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 491
98.2%
2 8
 
1.6%
1 1
 
0.2%

Length

2023-12-11T00:02:57.797439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:02:58.004941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 491
98.2%
2 8
 
1.6%
1 1
 
0.2%


Real number (ℝ)

Distinct419
Distinct (%)83.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean634.832
Minimum0
Maximum3747
Zeros3
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:02:58.222317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile23
Q1224.25
median550.5
Q3881
95-th percentile1682.2
Maximum3747
Range3747
Interquartile range (IQR)656.75

Descriptive statistics

Standard deviation519.65795
Coefficient of variation (CV)0.81857555
Kurtosis3.1884361
Mean634.832
Median Absolute Deviation (MAD)329
Skewness1.3620665
Sum317416
Variance270044.39
MonotonicityNot monotonic
2023-12-11T00:02:58.564845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23 3
 
0.6%
58 3
 
0.6%
3 3
 
0.6%
0 3
 
0.6%
150 3
 
0.6%
715 3
 
0.6%
546 3
 
0.6%
694 3
 
0.6%
76 3
 
0.6%
580 3
 
0.6%
Other values (409) 470
94.0%
ValueCountFrequency (%)
0 3
0.6%
1 2
0.4%
2 1
 
0.2%
3 3
0.6%
4 1
 
0.2%
5 2
0.4%
8 1
 
0.2%
9 1
 
0.2%
10 1
 
0.2%
11 2
0.4%
ValueCountFrequency (%)
3747 1
0.2%
2708 1
0.2%
2545 1
0.2%
2393 1
0.2%
2380 1
0.2%
2174 1
0.2%
2100 1
0.2%
2014 1
0.2%
1989 1
0.2%
1978 1
0.2%


Real number (ℝ)

ZEROS 

Distinct57
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.53
Minimum0
Maximum652
Zeros229
Zeros (%)45.8%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:02:58.887202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q34.25
95-th percentile39.05
Maximum652
Range652
Interquartile range (IQR)4.25

Descriptive statistics

Standard deviation49.415594
Coefficient of variation (CV)4.2858277
Kurtosis84.027123
Mean11.53
Median Absolute Deviation (MAD)1
Skewness8.4653465
Sum5765
Variance2441.9009
MonotonicityNot monotonic
2023-12-11T00:02:59.324218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 229
45.8%
1 70
 
14.0%
3 34
 
6.8%
2 27
 
5.4%
4 15
 
3.0%
5 12
 
2.4%
7 9
 
1.8%
6 8
 
1.6%
10 7
 
1.4%
12 7
 
1.4%
Other values (47) 82
 
16.4%
ValueCountFrequency (%)
0 229
45.8%
1 70
 
14.0%
2 27
 
5.4%
3 34
 
6.8%
4 15
 
3.0%
5 12
 
2.4%
6 8
 
1.6%
7 9
 
1.8%
8 5
 
1.0%
9 4
 
0.8%
ValueCountFrequency (%)
652 1
0.2%
421 1
0.2%
381 1
0.2%
377 2
0.4%
234 1
0.2%
218 1
0.2%
142 1
0.2%
134 1
0.2%
127 1
0.2%
115 1
0.2%

특수지_명
Text

MISSING 

Distinct14
Distinct (%)100.0%
Missing486
Missing (%)97.2%
Memory size4.0 KiB
2023-12-11T00:02:59.760557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length12
Mean length9.2142857
Min length4

Characters and Unicode

Total characters129
Distinct characters57
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)100.0%

Sample

1st row1B2-2L
2nd row선암1지구
3rd row배곧신도시
4th row구미국가산업단지제4단지
5th row택지개발지구 20블럭
ValueCountFrequency (%)
1b2-2l 1
 
4.3%
35b 1
 
4.3%
강동산하지구 1
 
4.3%
지방산업단지내 1
 
4.3%
센텀 1
 
4.3%
삼송지구 1
 
4.3%
12블럭 1
 
4.3%
죽전택지개발지구 1
 
4.3%
2-1l 1
 
4.3%
66b 1
 
4.3%
Other values (13) 13
56.5%
2023-12-11T00:03:00.450989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
11.6%
10
 
7.8%
9
 
7.0%
2 6
 
4.7%
1 4
 
3.1%
B 4
 
3.1%
L 4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
Other values (47) 67
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88
68.2%
Decimal Number 22
 
17.1%
Space Separator 9
 
7.0%
Uppercase Letter 8
 
6.2%
Dash Punctuation 2
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
17.0%
10
 
11.4%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (34) 39
44.3%
Decimal Number
ValueCountFrequency (%)
2 6
27.3%
1 4
18.2%
4 3
13.6%
9 2
 
9.1%
5 2
 
9.1%
6 2
 
9.1%
3 1
 
4.5%
8 1
 
4.5%
0 1
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
B 4
50.0%
L 4
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88
68.2%
Common 33
 
25.6%
Latin 8
 
6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
17.0%
10
 
11.4%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (34) 39
44.3%
Common
ValueCountFrequency (%)
9
27.3%
2 6
18.2%
1 4
12.1%
4 3
 
9.1%
9 2
 
6.1%
5 2
 
6.1%
6 2
 
6.1%
- 2
 
6.1%
3 1
 
3.0%
8 1
 
3.0%
Latin
ValueCountFrequency (%)
B 4
50.0%
L 4
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88
68.2%
ASCII 41
31.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
17.0%
10
 
11.4%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (34) 39
44.3%
ASCII
ValueCountFrequency (%)
9
22.0%
2 6
14.6%
1 4
9.8%
B 4
9.8%
L 4
9.8%
4 3
 
7.3%
9 2
 
4.9%
5 2
 
4.9%
6 2
 
4.9%
- 2
 
4.9%
Other values (3) 3
 
7.3%

블록
Text

MISSING 

Distinct4
Distinct (%)100.0%
Missing496
Missing (%)99.2%
Memory size4.0 KiB
2023-12-11T00:03:00.791747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4.5
Mean length4.5
Min length3

Characters and Unicode

Total characters18
Distinct characters14
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)100.0%

Sample

1st row74B
2nd rowRM10블록
3rd row6-1블록
4th rowA2BL
ValueCountFrequency (%)
74b 1
25.0%
rm10블록 1
25.0%
6-1블록 1
25.0%
a2bl 1
25.0%
2023-12-11T00:03:01.454186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
B 2
11.1%
1 2
11.1%
2
11.1%
2
11.1%
7 1
 
5.6%
4 1
 
5.6%
R 1
 
5.6%
M 1
 
5.6%
0 1
 
5.6%
6 1
 
5.6%
Other values (4) 4
22.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7
38.9%
Uppercase Letter 6
33.3%
Other Letter 4
22.2%
Dash Punctuation 1
 
5.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2
28.6%
7 1
14.3%
4 1
14.3%
0 1
14.3%
6 1
14.3%
2 1
14.3%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
R 1
16.7%
M 1
16.7%
A 1
16.7%
L 1
16.7%
Other Letter
ValueCountFrequency (%)
2
50.0%
2
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8
44.4%
Latin 6
33.3%
Hangul 4
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2
25.0%
7 1
12.5%
4 1
12.5%
0 1
12.5%
6 1
12.5%
- 1
12.5%
2 1
12.5%
Latin
ValueCountFrequency (%)
B 2
33.3%
R 1
16.7%
M 1
16.7%
A 1
16.7%
L 1
16.7%
Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14
77.8%
Hangul 4
 
22.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 2
14.3%
1 2
14.3%
7 1
7.1%
4 1
7.1%
R 1
7.1%
M 1
7.1%
0 1
7.1%
6 1
7.1%
- 1
7.1%
A 1
7.1%
Other values (2) 2
14.3%
Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

로트
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing500
Missing (%)100.0%
Memory size4.5 KiB

새주소_도로_코드
Real number (ℝ)

MISSING 

Distinct439
Distinct (%)94.4%
Missing35
Missing (%)7.0%
Infinite0
Infinite (%)0.0%
Mean3.1521535 × 1011
Minimum1.111041 × 1011
Maximum5.0130335 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:01.743393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.111041 × 1011
5-th percentile1.1320311 × 1011
Q12.6260419 × 1011
median3.1140317 × 1011
Q34.1480442 × 1011
95-th percentile4.8122882 × 1011
Maximum5.0130335 × 1011
Range3.9019925 × 1011
Interquartile range (IQR)1.5220023 × 1011

Descriptive statistics

Standard deviation1.2845829 × 1011
Coefficient of variation (CV)0.40752549
Kurtosis-1.1796396
Mean3.1521535 × 1011
Median Absolute Deviation (MAD)1.0325003 × 1011
Skewness-0.46827382
Sum1.4657514 × 1014
Variance1.6501532 × 1022
MonotonicityNot monotonic
2023-12-11T00:03:02.083372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
282373154006 3
 
0.6%
114404139262 3
 
0.6%
272304235418 3
 
0.6%
431123237104 2
 
0.4%
112603005028 2
 
0.4%
431133237020 2
 
0.4%
114704142181 2
 
0.4%
431143240063 2
 
0.4%
414804418712 2
 
0.4%
265302006008 2
 
0.4%
Other values (429) 442
88.4%
(Missing) 35
 
7.0%
ValueCountFrequency (%)
111104100482 1
0.2%
111403005009 1
0.2%
111404103057 1
0.2%
111703102008 1
0.2%
111704106026 1
0.2%
111704106548 1
0.2%
112004109073 1
0.2%
112004109192 1
0.2%
112004109201 1
0.2%
112154112031 1
0.2%
ValueCountFrequency (%)
501303350237 1
0.2%
501104848960 1
0.2%
501103349227 1
0.2%
501103349109 1
0.2%
501103349051 1
0.2%
488804841195 1
0.2%
488504832133 1
0.2%
488403343013 1
0.2%
488204826149 1
0.2%
487303340025 1
0.2%

새주소_법정동_코드
Real number (ℝ)

MISSING 

Distinct115
Distinct (%)24.8%
Missing36
Missing (%)7.2%
Infinite0
Infinite (%)0.0%
Mean13373.966
Minimum10101
Maximum44001
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:02.405697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10101
5-th percentile10101
Q110301
median10801
Q312432
95-th percentile25557.5
Maximum44001
Range33900
Interquartile range (IQR)2131

Descriptive statistics

Standard deviation6221.308
Coefficient of variation (CV)0.4651805
Kurtosis5.789293
Mean13373.966
Median Absolute Deviation (MAD)600
Skewness2.5201083
Sum6205520
Variance38704673
MonotonicityNot monotonic
2023-12-11T00:03:02.744771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10101 49
 
9.8%
10201 41
 
8.2%
25001 28
 
5.6%
10301 26
 
5.2%
10701 20
 
4.0%
10801 18
 
3.6%
10501 17
 
3.4%
10601 15
 
3.0%
10401 12
 
2.4%
10901 11
 
2.2%
Other values (105) 227
45.4%
(Missing) 36
 
7.2%
ValueCountFrequency (%)
10101 49
9.8%
10201 41
8.2%
10202 10
 
2.0%
10301 26
5.2%
10302 5
 
1.0%
10303 1
 
0.2%
10401 12
 
2.4%
10402 6
 
1.2%
10403 3
 
0.6%
10501 17
 
3.4%
ValueCountFrequency (%)
44001 1
 
0.2%
40001 2
0.4%
39006 1
 
0.2%
38001 1
 
0.2%
37002 1
 
0.2%
36003 1
 
0.2%
36001 3
0.6%
34001 2
0.4%
32002 1
 
0.2%
32001 4
0.8%

새주소_지상지하_코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 500
100.0%

Length

2023-12-11T00:03:03.084950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:03:03.271540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 500
100.0%

새주소_본_번
Real number (ℝ)

MISSING 

Distinct214
Distinct (%)45.8%
Missing33
Missing (%)6.6%
Infinite0
Infinite (%)0.0%
Mean172.80514
Minimum0
Maximum8694
Zeros5
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:03.927307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile7
Q121
median51
Q3137
95-th percentile754.6
Maximum8694
Range8694
Interquartile range (IQR)116

Descriptive statistics

Standard deviation500.43034
Coefficient of variation (CV)2.8959228
Kurtosis183.7858
Mean172.80514
Median Absolute Deviation (MAD)39
Skewness11.649685
Sum80700
Variance250430.53
MonotonicityNot monotonic
2023-12-11T00:03:04.378335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12 11
 
2.2%
10 10
 
2.0%
25 10
 
2.0%
33 10
 
2.0%
15 9
 
1.8%
11 9
 
1.8%
26 9
 
1.8%
9 9
 
1.8%
7 8
 
1.6%
13 8
 
1.6%
Other values (204) 374
74.8%
(Missing) 33
 
6.6%
ValueCountFrequency (%)
0 5
1.0%
2 2
 
0.4%
3 3
 
0.6%
4 3
 
0.6%
5 3
 
0.6%
6 5
1.0%
7 8
1.6%
8 5
1.0%
9 9
1.8%
10 10
2.0%
ValueCountFrequency (%)
8694 1
0.2%
2742 1
0.2%
2692 1
0.2%
1842 1
0.2%
1791 1
0.2%
1640 1
0.2%
1388 1
0.2%
1322 1
0.2%
1321 1
0.2%
1254 2
0.4%

새주소_부_번
Real number (ℝ)

MISSING  ZEROS 

Distinct26
Distinct (%)5.5%
Missing25
Missing (%)5.0%
Infinite0
Infinite (%)0.0%
Mean1.6484211
Minimum0
Maximum71
Zeros410
Zeros (%)82.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:04.659741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile12.3
Maximum71
Range71
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.0413984
Coefficient of variation (CV)3.6649607
Kurtosis48.823525
Mean1.6484211
Median Absolute Deviation (MAD)0
Skewness6.008242
Sum783
Variance36.498494
MonotonicityNot monotonic
2023-12-11T00:03:04.996854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0 410
82.0%
1 9
 
1.8%
7 5
 
1.0%
6 5
 
1.0%
10 5
 
1.0%
14 5
 
1.0%
16 4
 
0.8%
12 4
 
0.8%
5 3
 
0.6%
4 3
 
0.6%
Other values (16) 22
 
4.4%
(Missing) 25
 
5.0%
ValueCountFrequency (%)
0 410
82.0%
1 9
 
1.8%
2 1
 
0.2%
3 3
 
0.6%
4 3
 
0.6%
5 3
 
0.6%
6 5
 
1.0%
7 5
 
1.0%
8 2
 
0.4%
10 5
 
1.0%
ValueCountFrequency (%)
71 1
0.2%
44 1
0.2%
39 1
0.2%
36 1
0.2%
30 1
0.2%
29 2
0.4%
21 1
0.2%
20 1
0.2%
18 1
0.2%
17 1
0.2%

동_명칭
Text

MISSING 

Distinct164
Distinct (%)41.5%
Missing105
Missing (%)21.0%
Memory size4.0 KiB
2023-12-11T00:03:05.624753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length4
Mean length4.1088608
Min length1

Characters and Unicode

Total characters1623
Distinct characters131
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique116 ?
Unique (%)29.4%

Sample

1st row윈저스빌
2nd row616동
3rd row에이동
4th row106동
5th row107동
ValueCountFrequency (%)
101동 44
 
10.8%
102동 26
 
6.4%
103동 23
 
5.6%
104동 18
 
4.4%
105동 16
 
3.9%
106동 16
 
3.9%
107동 14
 
3.4%
1동 9
 
2.2%
108동 7
 
1.7%
301동 6
 
1.5%
Other values (163) 229
56.1%
2023-12-11T00:03:06.510109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
357
22.0%
1 342
21.1%
0 270
16.6%
2 102
 
6.3%
3 77
 
4.7%
4 54
 
3.3%
6 47
 
2.9%
5 37
 
2.3%
7 31
 
1.9%
8 27
 
1.7%
Other values (121) 279
17.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 999
61.6%
Other Letter 597
36.8%
Space Separator 13
 
0.8%
Uppercase Letter 12
 
0.7%
Math Symbol 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
357
59.8%
9
 
1.5%
8
 
1.3%
8
 
1.3%
8
 
1.3%
8
 
1.3%
7
 
1.2%
7
 
1.2%
6
 
1.0%
5
 
0.8%
Other values (101) 174
29.1%
Decimal Number
ValueCountFrequency (%)
1 342
34.2%
0 270
27.0%
2 102
 
10.2%
3 77
 
7.7%
4 54
 
5.4%
6 47
 
4.7%
5 37
 
3.7%
7 31
 
3.1%
8 27
 
2.7%
9 12
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
B 5
41.7%
A 2
 
16.7%
N 1
 
8.3%
D 1
 
8.3%
S 1
 
8.3%
H 1
 
8.3%
C 1
 
8.3%
Space Separator
ValueCountFrequency (%)
13
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1014
62.5%
Hangul 597
36.8%
Latin 12
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
357
59.8%
9
 
1.5%
8
 
1.3%
8
 
1.3%
8
 
1.3%
8
 
1.3%
7
 
1.2%
7
 
1.2%
6
 
1.0%
5
 
0.8%
Other values (101) 174
29.1%
Common
ValueCountFrequency (%)
1 342
33.7%
0 270
26.6%
2 102
 
10.1%
3 77
 
7.6%
4 54
 
5.3%
6 47
 
4.6%
5 37
 
3.6%
7 31
 
3.1%
8 27
 
2.7%
13
 
1.3%
Other values (3) 14
 
1.4%
Latin
ValueCountFrequency (%)
B 5
41.7%
A 2
 
16.7%
N 1
 
8.3%
D 1
 
8.3%
S 1
 
8.3%
H 1
 
8.3%
C 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1025
63.2%
Hangul 597
36.8%
Math Operators 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
357
59.8%
9
 
1.5%
8
 
1.3%
8
 
1.3%
8
 
1.3%
8
 
1.3%
7
 
1.2%
7
 
1.2%
6
 
1.0%
5
 
0.8%
Other values (101) 174
29.1%
ASCII
ValueCountFrequency (%)
1 342
33.4%
0 270
26.3%
2 102
 
10.0%
3 77
 
7.5%
4 54
 
5.3%
6 47
 
4.6%
5 37
 
3.6%
7 31
 
3.0%
8 27
 
2.6%
13
 
1.3%
Other values (9) 25
 
2.4%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct283
Distinct (%)56.7%
Missing1
Missing (%)0.2%
Memory size4.0 KiB
2023-12-11T00:03:07.118363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length4
Mean length4.0641283
Min length2

Characters and Unicode

Total characters2028
Distinct characters39
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique179 ?
Unique (%)35.9%

Sample

1st row703
2nd row13층 1304호
3rd row102호
4th row611호
5th row201호
ValueCountFrequency (%)
302 8
 
1.6%
101호 8
 
1.6%
302호 8
 
1.6%
202 8
 
1.6%
202호 7
 
1.4%
402 7
 
1.4%
301호 7
 
1.4%
501호 6
 
1.2%
102호 6
 
1.2%
201호 6
 
1.2%
Other values (273) 444
86.2%
2023-12-11T00:03:07.972275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 461
22.7%
1 363
17.9%
298
14.7%
2 210
10.4%
3 162
 
8.0%
4 127
 
6.3%
5 99
 
4.9%
6 79
 
3.9%
7 49
 
2.4%
9 46
 
2.3%
Other values (29) 134
 
6.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1637
80.7%
Other Letter 357
 
17.6%
Space Separator 16
 
0.8%
Dash Punctuation 9
 
0.4%
Uppercase Letter 7
 
0.3%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
298
83.5%
28
 
7.8%
4
 
1.1%
4
 
1.1%
3
 
0.8%
2
 
0.6%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Other values (13) 13
 
3.6%
Decimal Number
ValueCountFrequency (%)
0 461
28.2%
1 363
22.2%
2 210
12.8%
3 162
 
9.9%
4 127
 
7.8%
5 99
 
6.0%
6 79
 
4.8%
7 49
 
3.0%
9 46
 
2.8%
8 41
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
B 5
71.4%
A 2
 
28.6%
Space Separator
ValueCountFrequency (%)
16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1664
82.1%
Hangul 357
 
17.6%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
298
83.5%
28
 
7.8%
4
 
1.1%
4
 
1.1%
3
 
0.8%
2
 
0.6%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Other values (13) 13
 
3.6%
Common
ValueCountFrequency (%)
0 461
27.7%
1 363
21.8%
2 210
12.6%
3 162
 
9.7%
4 127
 
7.6%
5 99
 
5.9%
6 79
 
4.7%
7 49
 
2.9%
9 46
 
2.8%
8 41
 
2.5%
Other values (4) 27
 
1.6%
Latin
ValueCountFrequency (%)
B 5
71.4%
A 2
 
28.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1671
82.4%
Hangul 357
 
17.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 461
27.6%
1 363
21.7%
2 210
12.6%
3 162
 
9.7%
4 127
 
7.6%
5 99
 
5.9%
6 79
 
4.7%
7 49
 
2.9%
9 46
 
2.8%
8 41
 
2.5%
Other values (6) 34
 
2.0%
Hangul
ValueCountFrequency (%)
298
83.5%
28
 
7.8%
4
 
1.1%
4
 
1.1%
3
 
0.8%
2
 
0.6%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Other values (13) 13
 
3.6%

층_구분_코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
20
486 
10
 
14

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 486
97.2%
10 14
 
2.8%

Length

2023-12-11T00:03:08.253113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:03:08.440477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 486
97.2%
10 14
 
2.8%

층_구분_코드_명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
지상
490 
지하
 
10

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지상
2nd row지상
3rd row지상
4th row지상
5th row지상

Common Values

ValueCountFrequency (%)
지상 490
98.0%
지하 10
 
2.0%

Length

2023-12-11T00:03:08.649632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:03:08.834326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지상 490
98.0%
지하 10
 
2.0%

층_번호
Real number (ℝ)

ZEROS 

Distinct28
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.802
Minimum0
Maximum41
Zeros107
Zeros (%)21.4%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:09.042853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median4
Q39
95-th percentile18
Maximum41
Range41
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.9387991
Coefficient of variation (CV)1.0235779
Kurtosis2.5452236
Mean5.802
Median Absolute Deviation (MAD)4
Skewness1.3697528
Sum2901
Variance35.269335
MonotonicityNot monotonic
2023-12-11T00:03:09.281730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
0 107
21.4%
3 44
 
8.8%
2 43
 
8.6%
4 40
 
8.0%
1 36
 
7.2%
5 29
 
5.8%
7 26
 
5.2%
6 25
 
5.0%
13 20
 
4.0%
9 18
 
3.6%
Other values (18) 112
22.4%
ValueCountFrequency (%)
0 107
21.4%
1 36
 
7.2%
2 43
8.6%
3 44
8.8%
4 40
 
8.0%
5 29
 
5.8%
6 25
 
5.0%
7 26
 
5.2%
8 15
 
3.0%
9 18
 
3.6%
ValueCountFrequency (%)
41 1
 
0.2%
27 1
 
0.2%
26 1
 
0.2%
24 2
 
0.4%
23 1
 
0.2%
22 1
 
0.2%
21 1
 
0.2%
20 3
 
0.6%
19 8
1.6%
18 9
1.8%

생성_일자
Real number (ℝ)

Distinct326
Distinct (%)65.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20127536
Minimum20090317
Maximum20160531
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:09.818984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20090317
5-th percentile20090320
Q120110420
median20130722
Q320150247
95-th percentile20160316
Maximum20160531
Range70214
Interquartile range (IQR)39827.25

Descriptive statistics

Standard deviation22686.12
Coefficient of variation (CV)0.0011271186
Kurtosis-1.166629
Mean20127536
Median Absolute Deviation (MAD)19996
Skewness-0.31400165
Sum1.0063768 × 1010
Variance5.1466003 × 108
MonotonicityNot monotonic
2023-12-11T00:03:10.152123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20090320 30
 
6.0%
20110420 14
 
2.8%
20090319 11
 
2.2%
20090323 6
 
1.2%
20150725 6
 
1.2%
20140902 6
 
1.2%
20110417 5
 
1.0%
20090321 5
 
1.0%
20150912 5
 
1.0%
20130208 4
 
0.8%
Other values (316) 408
81.6%
ValueCountFrequency (%)
20090317 4
 
0.8%
20090318 2
 
0.4%
20090319 11
 
2.2%
20090320 30
6.0%
20090321 5
 
1.0%
20090323 6
 
1.2%
20090325 4
 
0.8%
20090421 2
 
0.4%
20090430 1
 
0.2%
20090507 1
 
0.2%
ValueCountFrequency (%)
20160531 1
 
0.2%
20160526 2
0.4%
20160524 3
0.6%
20160518 1
 
0.2%
20160514 2
0.4%
20160512 1
 
0.2%
20160507 1
 
0.2%
20160506 2
0.4%
20160430 2
0.4%
20160429 1
 
0.2%

Sample

관리_건축물대장_PK대장_구분_코드대장_구분_코드_명대장_종류_코드대장_종류_코드_명대지_위치도로명_대지_위치건물_명시군구_코드법정동_코드대지_구분_코드특수지_명블록로트새주소_도로_코드새주소_법정동_코드새주소_지상지하_코드새주소_본_번새주소_부_번동_명칭호_명칭층_구분_코드층_구분_코드_명층_번호생성_일자
041150-1049272집합4전유부서울특별시 송파구 장지동 847번지인천광역시 서구 비지니스로 41연지자이2차114701090006310<NA><NA><NA>117102005011112010<NA>10윈저스빌70320지상320160127
111290-1113172집합4전유부경기도 성남시 분당구 야탑동 219-1번지충청남도 태안군 갈음이길 77광교마을40단지29140114000541<NA><NA><NA>114704142183103010130616동13층 1304호20지상520130320
227290-1002195992집합4전유부경상남도 창원시 마산합포구 진동면 요장리 393-2번지서울특별시 노원구 중계로 230박촌 풍림아이원1141010100112946<NA><NA><NA>411134331214101010150<NA>102호20지상1020141021
326350-1002309452집합4전유부대전광역시 서구 관저동 984번지<NA>우미아파트412712532504040<NA><NA><NA>311404310575102020160에이동611호20지상020150212
448123-1587352집합4전유부경기도 이천시 부발읍 응암리 109번지서울특별시 마포구 포은로 147광양써니밸리아파트4825010400032543<NA><NA><NA>117104169508<NA>0850106동201호20지상020090319
511680-1002536352집합4전유부부산광역시 부산진구 양정동 458-5번지전라북도 익산시 배산로 34-1파크빌2차아파트412811260006773<NA><NA><NA>2917042862981110103730107동1003호20지상420120808
647290-628032집합4전유부전라북도 익산시 금강동 364-1번지충청남도 공주시 번영1로 194-7다우훼밀리아파트412811220005110<NA><NA><NA>272304235418108010240<NA>50620지상120160109
741250-208422집합4전유부부산광역시 부산진구 부전동 450번지경기도 안산시 단원구 광덕대로 194연산 자이26290110000782<NA><NA><NA>11200410907313202011880103동3302호20지상420100630
843150-442642집합4전유부경기도 수원시 팔달구 인계동 939번지서울특별시 마포구 백범로 202신일해피트리아파트265301010008687<NA><NA><NA>411353180044130090130101동102호20지상420150124
911560-1002089002집합4전유부대구광역시 달서구 이곡동 1304-1번지강원도 원주시 시청로 40<NA>264701080005020<NA><NA><NA>482503335032101010930<NA>100620지상220131101
관리_건축물대장_PK대장_구분_코드대장_구분_코드_명대장_종류_코드대장_종류_코드_명대지_위치도로명_대지_위치건물_명시군구_코드법정동_코드대지_구분_코드특수지_명블록로트새주소_도로_코드새주소_법정동_코드새주소_지상지하_코드새주소_본_번새주소_부_번동_명칭호_명칭층_구분_코드층_구분_코드_명층_번호생성_일자
49048330-1002200992집합4전유부경기도 수원시 권선구 금곡동 520번지경기도 광명시 광명로888번길 18-5<NA>263201150004351<NA><NA><NA>411733012016116050300203동612호20지상1620120818
49129155-510572집합4전유부서울특별시 노원구 월계동 776번지서울특별시 양천구 신월로 99은빛마을 휴먼시아265001150007383<NA><NA><NA>116803122007137010120107동405호20지상720130129
49241150-1098812집합4전유부경상북도 안동시 용상동 600번지충청남도 천안시 동남구 탑골길 17중앙자유시장412811440003390<NA><NA><NA>28237315403510401012701동70620지상020140110
49328185-189052집합4전유부서울특별시 동대문구 휘경동 43-175번지부산광역시 사하구 윤공단로 123호계 e-편한세상아파트2811011400059510<NA><NA><NA>11260300502813201066<NA>4동605호20지상820100524
49411140-1002066932집합4전유부인천광역시 계양구 작전동 899-1번지서울특별시 강서구 허준로 209돈암2-1 삼부아파트50130115000191<NA><NA><NA>262304187799<NA>0750102동110620지상020141022
49528245-1001757192집합4전유부경상남도 창원시 진해구 석동 90-1번지인천광역시 계양구 봉오대로691번길 4<NA>412871140004000<NA><NA><NA>2814031490631030101520104동190320지상1420150207
49611440-889832집합4전유부충청남도 천안시 동남구 청당동 436번지경상남도 창원시 의창구 창원대로397번길 6행당한신아파트4121011400016940<NA><NA><NA>4418032520323700201110116동302호20지상020090323
49743750-1001866452집합4전유부경상남도 거제시 옥포동 556-11번지전라남도 광양시 광장로 112-20이동414651280005080<NA><NA><NA>116504163277106010150<NA>지하102호20지상420110420
49828245-219562집합4전유부대구광역시 수성구 지산동 1269-1번지경상남도 양산시 양주로 154현대파크맨션267101130003570<NA><NA><NA>2626041901591300105611321동616호20지상320100105
49941117-2750202집합4전유부서울특별시 서초구 반포동 885번지부산광역시 사하구 승학로281번길 6세아청솔아파트 302동2632010200010828<NA><NA><NA>282373154006106010330제주대유대림아파트 301동10520지상1020121231