Overview

Dataset statistics

Number of variables25
Number of observations10000
Missing cells48924
Missing cells (%)19.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 MiB
Average record size in memory217.0 B

Variable types

Categorical9
Numeric8
Text6
Unsupported1
DateTime1

Dataset

Description광주광역시 동구 도로명주소현황 데이터입니다.데이터는 시군구, 읍면동, 건물일련번호, 건축물대장건물명 등으로 구성되어 있습니다.
Author광주광역시 동구
URLhttps://www.data.go.kr/data/15124801/fileData.do

Alerts

시군구 has constant value ""Constant
고시여부 has constant value ""Constant
데이터기준일자 has constant value ""Constant
건물종속구분 is highly imbalanced (73.5%)Imbalance
산여부 is highly imbalanced (93.2%)Imbalance
지하여부 is highly imbalanced (98.7%)Imbalance
도로관리기관 is highly imbalanced (97.1%)Imbalance
건축물대장건물명 has 9320 (93.2%) missing valuesMissing
시군구관리건물명 has 9123 (91.2%) missing valuesMissing
상세건물명 has 9623 (96.2%) missing valuesMissing
리명 has 10000 (100.0%) missing valuesMissing
지번(부번) has 1946 (19.5%) missing valuesMissing
건물군명 has 8912 (89.1%) missing valuesMissing
건물일련번호 has unique valuesUnique
리명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
건물군일련번호 has 6054 (60.5%) zerosZeros
건물부번 has 4056 (40.6%) zerosZeros

Reproduction

Analysis started2023-12-11 23:36:08.535492
Analysis finished2023-12-11 23:36:09.618100
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동구
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동구
2nd row동구
3rd row동구
4th row동구
5th row동구

Common Values

ValueCountFrequency (%)
동구 10000
100.0%

Length

2023-12-12T08:36:09.676454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:09.765674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동구 10000
100.0%

읍면동
Categorical

Distinct34
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
산수동
1640 
계림동
1407 
지산동
1140 
동명동
975 
소태동
925 
Other values (29)
3913 

Length

Max length5
Median length3
Mean length2.9445
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동명동
2nd row학동
3rd row동명동
4th row동명동
5th row서석동

Common Values

ValueCountFrequency (%)
산수동 1640
16.4%
계림동 1407
14.1%
지산동 1140
11.4%
동명동 975
9.8%
소태동 925
9.2%
학동 857
8.6%
서석동 349
 
3.5%
운림동 283
 
2.8%
대인동 254
 
2.5%
용산동 231
 
2.3%
Other values (24) 1939
19.4%

Length

2023-12-12T08:36:09.883904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
산수동 1640
16.4%
계림동 1407
14.1%
지산동 1140
11.4%
동명동 975
9.8%
소태동 925
9.2%
학동 857
8.6%
서석동 349
 
3.5%
운림동 283
 
2.8%
대인동 254
 
2.5%
용산동 231
 
2.3%
Other values (24) 1939
19.4%

건물일련번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15724.602
Minimum9
Maximum37334
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:10.007347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile1402.95
Q17633.5
median15192.5
Q322623.25
95-th percentile34049
Maximum37334
Range37325
Interquartile range (IQR)14989.75

Descriptive statistics

Standard deviation9752.7042
Coefficient of variation (CV)0.62021947
Kurtosis-0.74423523
Mean15724.602
Median Absolute Deviation (MAD)7489.5
Skewness0.31563382
Sum1.5724602 × 108
Variance95115239
MonotonicityNot monotonic
2023-12-12T08:36:10.167291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24287 1
 
< 0.1%
22523 1
 
< 0.1%
10868 1
 
< 0.1%
7582 1
 
< 0.1%
8863 1
 
< 0.1%
5829 1
 
< 0.1%
36743 1
 
< 0.1%
11653 1
 
< 0.1%
30825 1
 
< 0.1%
2944 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
9 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
16 1
< 0.1%
19 1
< 0.1%
21 1
< 0.1%
27 1
< 0.1%
28 1
< 0.1%
ValueCountFrequency (%)
37334 1
< 0.1%
37332 1
< 0.1%
37330 1
< 0.1%
37328 1
< 0.1%
37314 1
< 0.1%
37313 1
< 0.1%
37310 1
< 0.1%
37309 1
< 0.1%
37307 1
< 0.1%
37306 1
< 0.1%
Distinct350
Distinct (%)51.5%
Missing9320
Missing (%)93.2%
Memory size156.2 KiB
2023-12-12T08:36:10.433980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length21
Mean length6.8558824
Min length2

Characters and Unicode

Total characters4662
Distinct characters320
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)33.8%

Sample

1st row남룡주택
2nd row마을회관
3rd row산수2동주민센터
4th row조소과 소성실
5th row전남의대부속병원
ValueCountFrequency (%)
조선대학교 21
 
2.5%
무등산 21
 
2.5%
금호타운 13
 
1.5%
두산위브 12
 
1.4%
조선대그린빌리지 11
 
1.3%
용산지구 11
 
1.3%
두암타운 11
 
1.3%
증심사 10
 
1.2%
그린웰로제비앙 10
 
1.2%
월남호반베르디움1차 9
 
1.1%
Other values (391) 724
84.9%
2023-12-12T08:36:10.852486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
173
 
3.7%
156
 
3.3%
138
 
3.0%
118
 
2.5%
114
 
2.4%
109
 
2.3%
107
 
2.3%
104
 
2.2%
98
 
2.1%
98
 
2.1%
Other values (310) 3447
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4296
92.1%
Space Separator 173
 
3.7%
Decimal Number 113
 
2.4%
Uppercase Letter 36
 
0.8%
Open Punctuation 15
 
0.3%
Close Punctuation 15
 
0.3%
Dash Punctuation 10
 
0.2%
Other Punctuation 3
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
156
 
3.6%
138
 
3.2%
118
 
2.7%
114
 
2.7%
109
 
2.5%
107
 
2.5%
104
 
2.4%
98
 
2.3%
98
 
2.3%
96
 
2.2%
Other values (280) 3158
73.5%
Uppercase Letter
ValueCountFrequency (%)
L 10
27.8%
B 9
25.0%
A 4
 
11.1%
C 2
 
5.6%
I 2
 
5.6%
P 2
 
5.6%
X 1
 
2.8%
E 1
 
2.8%
T 1
 
2.8%
N 1
 
2.8%
Other values (3) 3
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 37
32.7%
2 36
31.9%
3 7
 
6.2%
4 7
 
6.2%
7 6
 
5.3%
6 6
 
5.3%
0 5
 
4.4%
5 4
 
3.5%
8 3
 
2.7%
9 2
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
. 1
33.3%
Space Separator
ValueCountFrequency (%)
173
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4296
92.1%
Common 329
 
7.1%
Latin 37
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
156
 
3.6%
138
 
3.2%
118
 
2.7%
114
 
2.7%
109
 
2.5%
107
 
2.5%
104
 
2.4%
98
 
2.3%
98
 
2.3%
96
 
2.2%
Other values (280) 3158
73.5%
Common
ValueCountFrequency (%)
173
52.6%
1 37
 
11.2%
2 36
 
10.9%
( 15
 
4.6%
) 15
 
4.6%
- 10
 
3.0%
3 7
 
2.1%
4 7
 
2.1%
7 6
 
1.8%
6 6
 
1.8%
Other values (6) 17
 
5.2%
Latin
ValueCountFrequency (%)
L 10
27.0%
B 9
24.3%
A 4
 
10.8%
C 2
 
5.4%
I 2
 
5.4%
P 2
 
5.4%
X 1
 
2.7%
E 1
 
2.7%
T 1
 
2.7%
N 1
 
2.7%
Other values (4) 4
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4296
92.1%
ASCII 365
 
7.8%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
173
47.4%
1 37
 
10.1%
2 36
 
9.9%
( 15
 
4.1%
) 15
 
4.1%
- 10
 
2.7%
L 10
 
2.7%
B 9
 
2.5%
3 7
 
1.9%
4 7
 
1.9%
Other values (19) 46
 
12.6%
Hangul
ValueCountFrequency (%)
156
 
3.6%
138
 
3.2%
118
 
2.7%
114
 
2.7%
109
 
2.5%
107
 
2.5%
104
 
2.4%
98
 
2.3%
98
 
2.3%
96
 
2.2%
Other values (280) 3158
73.5%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct507
Distinct (%)57.8%
Missing9123
Missing (%)91.2%
Memory size156.2 KiB
2023-12-12T08:36:11.136064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length6.4515393
Min length2

Characters and Unicode

Total characters5658
Distinct characters390
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique372 ?
Unique (%)42.4%

Sample

1st row남룡주택
2nd row시티즌
3rd row마을회관
4th row산수2동주민센터
5th row조선대학교 미술대학
ValueCountFrequency (%)
무등산 21
 
2.0%
금호타운 13
 
1.3%
두산위브 12
 
1.2%
조선대그린빌리지 11
 
1.1%
두암타운 11
 
1.1%
증심사 10
 
1.0%
그린웰로제비앙 10
 
1.0%
월남호반베르디움1차 9
 
0.9%
용연정수장 9
 
0.9%
무등산골드클래스2차 9
 
0.9%
Other values (545) 923
88.9%
2023-12-12T08:36:11.735149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
176
 
3.1%
162
 
2.9%
160
 
2.8%
135
 
2.4%
127
 
2.2%
126
 
2.2%
124
 
2.2%
118
 
2.1%
107
 
1.9%
104
 
1.8%
Other values (380) 4319
76.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5305
93.8%
Space Separator 162
 
2.9%
Decimal Number 113
 
2.0%
Uppercase Letter 48
 
0.8%
Lowercase Letter 9
 
0.2%
Dash Punctuation 5
 
0.1%
Other Punctuation 5
 
0.1%
Close Punctuation 5
 
0.1%
Open Punctuation 5
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
176
 
3.3%
160
 
3.0%
135
 
2.5%
127
 
2.4%
126
 
2.4%
124
 
2.3%
118
 
2.2%
107
 
2.0%
104
 
2.0%
102
 
1.9%
Other values (339) 4026
75.9%
Uppercase Letter
ValueCountFrequency (%)
S 9
18.8%
K 8
16.7%
A 5
10.4%
I 3
 
6.2%
B 3
 
6.2%
L 3
 
6.2%
C 2
 
4.2%
P 2
 
4.2%
O 2
 
4.2%
T 2
 
4.2%
Other values (9) 9
18.8%
Decimal Number
ValueCountFrequency (%)
2 48
42.5%
1 38
33.6%
4 11
 
9.7%
3 7
 
6.2%
8 3
 
2.7%
9 2
 
1.8%
7 2
 
1.8%
6 1
 
0.9%
5 1
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
l 3
33.3%
s 1
 
11.1%
p 1
 
11.1%
a 1
 
11.1%
i 1
 
11.1%
k 1
 
11.1%
e 1
 
11.1%
Space Separator
ValueCountFrequency (%)
162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5305
93.8%
Common 295
 
5.2%
Latin 58
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
176
 
3.3%
160
 
3.0%
135
 
2.5%
127
 
2.4%
126
 
2.4%
124
 
2.3%
118
 
2.2%
107
 
2.0%
104
 
2.0%
102
 
1.9%
Other values (339) 4026
75.9%
Latin
ValueCountFrequency (%)
S 9
15.5%
K 8
13.8%
A 5
 
8.6%
I 3
 
5.2%
B 3
 
5.2%
L 3
 
5.2%
l 3
 
5.2%
C 2
 
3.4%
P 2
 
3.4%
O 2
 
3.4%
Other values (17) 18
31.0%
Common
ValueCountFrequency (%)
162
54.9%
2 48
 
16.3%
1 38
 
12.9%
4 11
 
3.7%
3 7
 
2.4%
- 5
 
1.7%
, 5
 
1.7%
) 5
 
1.7%
( 5
 
1.7%
8 3
 
1.0%
Other values (4) 6
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5305
93.8%
ASCII 352
 
6.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
176
 
3.3%
160
 
3.0%
135
 
2.5%
127
 
2.4%
126
 
2.4%
124
 
2.3%
118
 
2.2%
107
 
2.0%
104
 
2.0%
102
 
1.9%
Other values (339) 4026
75.9%
ASCII
ValueCountFrequency (%)
162
46.0%
2 48
 
13.6%
1 38
 
10.8%
4 11
 
3.1%
S 9
 
2.6%
K 8
 
2.3%
3 7
 
2.0%
- 5
 
1.4%
, 5
 
1.4%
A 5
 
1.4%
Other values (30) 54
 
15.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

상세건물명
Text

MISSING 

Distinct150
Distinct (%)39.8%
Missing9623
Missing (%)96.2%
Memory size156.2 KiB
2023-12-12T08:36:12.124360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length4.2625995
Min length2

Characters and Unicode

Total characters1607
Distinct characters181
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)26.8%

Sample

1st row나동
2nd row조소과 소성실
3rd row117동
4th row101동
5th row102동
ValueCountFrequency (%)
101동 24
 
6.2%
102동 20
 
5.1%
103동 18
 
4.6%
주건축물제1동 17
 
4.4%
상가동 14
 
3.6%
104동 13
 
3.3%
107동 12
 
3.1%
105동 11
 
2.8%
a동 11
 
2.8%
가동 9
 
2.3%
Other values (151) 240
61.7%
2023-12-12T08:36:12.686995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
290
18.0%
1 251
15.6%
0 202
 
12.6%
2 87
 
5.4%
3 39
 
2.4%
33
 
2.1%
5 28
 
1.7%
25
 
1.6%
24
 
1.5%
4 23
 
1.4%
Other values (171) 605
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 847
52.7%
Decimal Number 676
42.1%
Uppercase Letter 39
 
2.4%
Dash Punctuation 13
 
0.8%
Space Separator 12
 
0.7%
Lowercase Letter 12
 
0.7%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%
Letter Number 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
290
34.2%
33
 
3.9%
25
 
3.0%
24
 
2.8%
21
 
2.5%
20
 
2.4%
20
 
2.4%
19
 
2.2%
18
 
2.1%
18
 
2.1%
Other values (135) 359
42.4%
Uppercase Letter
ValueCountFrequency (%)
A 14
35.9%
C 7
17.9%
B 7
17.9%
F 2
 
5.1%
E 2
 
5.1%
G 2
 
5.1%
L 1
 
2.6%
D 1
 
2.6%
H 1
 
2.6%
T 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 251
37.1%
0 202
29.9%
2 87
 
12.9%
3 39
 
5.8%
5 28
 
4.1%
4 23
 
3.4%
7 18
 
2.7%
6 11
 
1.6%
9 9
 
1.3%
8 8
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
o 3
25.0%
n 2
16.7%
t 1
 
8.3%
l 1
 
8.3%
c 1
 
8.3%
y 1
 
8.3%
r 1
 
8.3%
k 1
 
8.3%
e 1
 
8.3%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 847
52.7%
Common 708
44.1%
Latin 52
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
290
34.2%
33
 
3.9%
25
 
3.0%
24
 
2.8%
21
 
2.5%
20
 
2.4%
20
 
2.4%
19
 
2.2%
18
 
2.1%
18
 
2.1%
Other values (135) 359
42.4%
Latin
ValueCountFrequency (%)
A 14
26.9%
C 7
13.5%
B 7
13.5%
o 3
 
5.8%
n 2
 
3.8%
F 2
 
3.8%
E 2
 
3.8%
G 2
 
3.8%
t 1
 
1.9%
l 1
 
1.9%
Other values (11) 11
21.2%
Common
ValueCountFrequency (%)
1 251
35.5%
0 202
28.5%
2 87
 
12.3%
3 39
 
5.5%
5 28
 
4.0%
4 23
 
3.2%
7 18
 
2.5%
- 13
 
1.8%
12
 
1.7%
6 11
 
1.6%
Other values (5) 24
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 847
52.7%
ASCII 759
47.2%
Number Forms 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
290
34.2%
33
 
3.9%
25
 
3.0%
24
 
2.8%
21
 
2.5%
20
 
2.4%
20
 
2.4%
19
 
2.2%
18
 
2.1%
18
 
2.1%
Other values (135) 359
42.4%
ASCII
ValueCountFrequency (%)
1 251
33.1%
0 202
26.6%
2 87
 
11.5%
3 39
 
5.1%
5 28
 
3.7%
4 23
 
3.0%
7 18
 
2.4%
A 14
 
1.8%
- 13
 
1.7%
12
 
1.6%
Other values (25) 72
 
9.5%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단독주택
5686 
제2종근린생활시설
1409 
제1종근린생활시설
1152 
공동주택
 
383
판매 및 영업시설
 
367
Other values (15)
1003 

Length

Max length11
Median length4
Mean length5.6693
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단독주택
2nd row단독주택
3rd row단독주택
4th row단독주택
5th row제1종근린생활시설

Common Values

ValueCountFrequency (%)
단독주택 5686
56.9%
제2종근린생활시설 1409
 
14.1%
제1종근린생활시설 1152
 
11.5%
공동주택 383
 
3.8%
판매 및 영업시설 367
 
3.7%
창고시설 273
 
2.7%
교육연구 및 복지시설 211
 
2.1%
숙박시설 140
 
1.4%
업무시설 71
 
0.7%
의료시설 64
 
0.6%
Other values (10) 244
 
2.4%

Length

2023-12-12T08:36:12.846483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
단독주택 5686
50.7%
제2종근린생활시설 1409
 
12.6%
제1종근린생활시설 1152
 
10.3%
610
 
5.4%
공동주택 383
 
3.4%
판매 367
 
3.3%
영업시설 367
 
3.3%
창고시설 273
 
2.4%
교육연구 211
 
1.9%
복지시설 211
 
1.9%
Other values (14) 551
 
4.9%
Distinct155
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:36:13.154029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length4
Mean length3.9689
Min length2

Characters and Unicode

Total characters39689
Distinct characters189
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)0.3%

Sample

1st row단독주택
2nd row단독주택
3rd row단독주택
4th row단독주택
5th row소매점
ValueCountFrequency (%)
단독주택 5214
52.1%
소매점 877
 
8.8%
일반음식점 562
 
5.6%
다가구주택 445
 
4.4%
사무소 353
 
3.5%
상점 337
 
3.4%
아파트 244
 
2.4%
기타창고시설 149
 
1.5%
기타제2종근생 121
 
1.2%
창고 121
 
1.2%
Other values (147) 1581
 
15.8%
2023-12-12T08:36:13.723549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5888
14.8%
5807
14.6%
5240
13.2%
5216
13.1%
1902
 
4.8%
1396
 
3.5%
924
 
2.3%
636
 
1.6%
636
 
1.6%
629
 
1.6%
Other values (179) 11415
28.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39384
99.2%
Decimal Number 185
 
0.5%
Close Punctuation 58
 
0.1%
Open Punctuation 58
 
0.1%
Space Separator 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5888
15.0%
5807
14.7%
5240
13.3%
5216
13.2%
1902
 
4.8%
1396
 
3.5%
924
 
2.3%
636
 
1.6%
636
 
1.6%
629
 
1.6%
Other values (174) 11110
28.2%
Decimal Number
ValueCountFrequency (%)
2 121
65.4%
1 64
34.6%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39384
99.2%
Common 305
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5888
15.0%
5807
14.7%
5240
13.3%
5216
13.2%
1902
 
4.8%
1396
 
3.5%
924
 
2.3%
636
 
1.6%
636
 
1.6%
629
 
1.6%
Other values (174) 11110
28.2%
Common
ValueCountFrequency (%)
2 121
39.7%
1 64
21.0%
) 58
19.0%
( 58
19.0%
4
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39384
99.2%
ASCII 305
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5888
15.0%
5807
14.7%
5240
13.3%
5216
13.2%
1902
 
4.8%
1396
 
3.5%
924
 
2.3%
636
 
1.6%
636
 
1.6%
629
 
1.6%
Other values (174) 11110
28.2%
ASCII
ValueCountFrequency (%)
2 121
39.7%
1 64
21.0%
) 58
19.0%
( 58
19.0%
4
 
1.3%

건물종속구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
주건물
9550 
종속건물
 
450

Length

Max length4
Median length3
Mean length3.045
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주건물
2nd row주건물
3rd row주건물
4th row주건물
5th row주건물

Common Values

ValueCountFrequency (%)
주건물 9550
95.5%
종속건물 450
 
4.5%

Length

2023-12-12T08:36:13.882352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:14.005297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주건물 9550
95.5%
종속건물 450
 
4.5%

우편번호
Real number (ℝ)

Distinct115
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61453.048
Minimum61400
Maximum61514
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:14.138093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum61400
5-th percentile61405
Q161428
median61447
Q361484
95-th percentile61507.05
Maximum61514
Range114
Interquartile range (IQR)56

Descriptive statistics

Standard deviation33.036603
Coefficient of variation (CV)0.00053759095
Kurtosis-1.1900724
Mean61453.048
Median Absolute Deviation (MAD)27
Skewness0.23373585
Sum6.1453048 × 108
Variance1091.4171
MonotonicityNot monotonic
2023-12-12T08:36:14.285561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
61430 297
 
3.0%
61417 277
 
2.8%
61496 259
 
2.6%
61411 228
 
2.3%
61428 219
 
2.2%
61437 208
 
2.1%
61405 207
 
2.1%
61488 204
 
2.0%
61434 195
 
1.9%
61512 192
 
1.9%
Other values (105) 7714
77.1%
ValueCountFrequency (%)
61400 119
1.2%
61401 130
1.3%
61402 36
 
0.4%
61403 14
 
0.1%
61404 21
 
0.2%
61405 207
2.1%
61406 182
1.8%
61407 128
1.3%
61408 31
 
0.3%
61409 11
 
0.1%
ValueCountFrequency (%)
61514 55
 
0.5%
61513 85
0.9%
61512 192
1.9%
61511 83
0.8%
61510 42
 
0.4%
61509 18
 
0.2%
61508 25
 
0.2%
61507 70
 
0.7%
61506 90
0.9%
61505 11
 
0.1%

리명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

산여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9919 
 
81

Length

Max length4
Median length4
Mean length3.9757
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9919
99.2%
81
 
0.8%

Length

2023-12-12T08:36:14.463814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:14.569614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9919
99.2%
81
 
0.8%

지번(본번)
Real number (ℝ)

Distinct1058
Distinct (%)10.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean388.9167
Minimum0
Maximum1869
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:14.695714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile20
Q1129.75
median390
Q3554
95-th percentile824.05
Maximum1869
Range1869
Interquartile range (IQR)424.25

Descriptive statistics

Standard deviation301.47697
Coefficient of variation (CV)0.77517106
Kurtosis3.730483
Mean388.9167
Median Absolute Deviation (MAD)212
Skewness1.2826185
Sum3889167
Variance90888.364
MonotonicityNot monotonic
2023-12-12T08:36:14.857898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200 152
 
1.5%
532 81
 
0.8%
154 79
 
0.8%
553 63
 
0.6%
209 59
 
0.6%
1 58
 
0.6%
540 57
 
0.6%
55 52
 
0.5%
207 51
 
0.5%
539 49
 
0.5%
Other values (1048) 9299
93.0%
ValueCountFrequency (%)
0 1
 
< 0.1%
1 58
0.6%
2 40
0.4%
3 21
 
0.2%
4 13
 
0.1%
5 34
0.3%
6 23
 
0.2%
7 31
0.3%
8 36
0.4%
9 26
0.3%
ValueCountFrequency (%)
1869 7
0.1%
1845 5
0.1%
1827 8
0.1%
1816 1
 
< 0.1%
1815 4
< 0.1%
1814 1
 
< 0.1%
1813 1
 
< 0.1%
1811 1
 
< 0.1%
1810 1
 
< 0.1%
1809 1
 
< 0.1%

지번(부번)
Real number (ℝ)

MISSING 

Distinct210
Distinct (%)2.6%
Missing1946
Missing (%)19.5%
Infinite0
Infinite (%)0.0%
Mean21.750807
Minimum1
Maximum360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:15.035884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median10
Q326
95-th percentile76
Maximum360
Range359
Interquartile range (IQR)22

Descriptive statistics

Standard deviation33.092022
Coefficient of variation (CV)1.5214158
Kurtosis20.024461
Mean21.750807
Median Absolute Deviation (MAD)8
Skewness3.8281016
Sum175181
Variance1095.0819
MonotonicityNot monotonic
2023-12-12T08:36:15.213369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 858
 
8.6%
2 586
 
5.9%
3 452
 
4.5%
4 404
 
4.0%
5 361
 
3.6%
6 319
 
3.2%
7 315
 
3.1%
8 294
 
2.9%
9 235
 
2.4%
10 229
 
2.3%
Other values (200) 4001
40.0%
(Missing) 1946
19.5%
ValueCountFrequency (%)
1 858
8.6%
2 586
5.9%
3 452
4.5%
4 404
4.0%
5 361
3.6%
6 319
 
3.2%
7 315
 
3.1%
8 294
 
2.9%
9 235
 
2.4%
10 229
 
2.3%
ValueCountFrequency (%)
360 1
 
< 0.1%
324 1
 
< 0.1%
318 1
 
< 0.1%
314 1
 
< 0.1%
305 1
 
< 0.1%
304 1
 
< 0.1%
303 6
0.1%
295 1
 
< 0.1%
292 1
 
< 0.1%
261 1
 
< 0.1%
Distinct576
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2011-07-29 00:00:00
Maximum2023-10-05 00:00:00
2023-12-12T08:36:15.396500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:36:15.573194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

건물군일련번호
Real number (ℝ)

ZEROS 

Distinct2490
Distinct (%)24.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2065.3586
Minimum0
Maximum11446
Zeros6054
Zeros (%)60.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:15.770706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33535.25
95-th percentile10967.05
Maximum11446
Range11446
Interquartile range (IQR)3535.25

Descriptive statistics

Standard deviation3356.9298
Coefficient of variation (CV)1.6253496
Kurtosis1.1656865
Mean2065.3586
Median Absolute Deviation (MAD)0
Skewness1.5387615
Sum20653586
Variance11268978
MonotonicityNot monotonic
2023-12-12T08:36:16.316170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6054
60.5%
7653 13
 
0.1%
8129 11
 
0.1%
10644 10
 
0.1%
10177 10
 
0.1%
4675 10
 
0.1%
9210 9
 
0.1%
10657 9
 
0.1%
66 9
 
0.1%
5255 8
 
0.1%
Other values (2480) 3857
38.6%
ValueCountFrequency (%)
0 6054
60.5%
5 2
 
< 0.1%
6 2
 
< 0.1%
7 1
 
< 0.1%
8 1
 
< 0.1%
17 1
 
< 0.1%
18 1
 
< 0.1%
35 1
 
< 0.1%
39 4
 
< 0.1%
42 1
 
< 0.1%
ValueCountFrequency (%)
11446 1
< 0.1%
11445 1
< 0.1%
11443 1
< 0.1%
11442 2
< 0.1%
11441 1
< 0.1%
11440 1
< 0.1%
11439 1
< 0.1%
11438 2
< 0.1%
11425 2
< 0.1%
11424 2
< 0.1%

건물군명
Text

MISSING 

Distinct538
Distinct (%)49.4%
Missing8912
Missing (%)89.1%
Memory size156.2 KiB
2023-12-12T08:36:16.668172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length5.9448529
Min length2

Characters and Unicode

Total characters6468
Distinct characters405
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)23.4%

Sample

1st row합동에너지
2nd row중심각모텔
3rd row남룡주택
4th row산수2동주민센터
5th row조선대학교 미술대학
ValueCountFrequency (%)
금호타운 13
 
1.1%
두산위브 12
 
1.0%
두암타운 11
 
0.9%
증심사 10
 
0.8%
조선대그린빌리지 10
 
0.8%
무등산그린웰로제비앙 10
 
0.8%
용연정수장 9
 
0.8%
무등산골드클래스2차 9
 
0.8%
광주남초등학교 8
 
0.7%
그랜드센트럴 8
 
0.7%
Other values (557) 1090
91.6%
2023-12-12T08:36:17.247662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
 
2.2%
138
 
2.1%
138
 
2.1%
136
 
2.1%
136
 
2.1%
134
 
2.1%
131
 
2.0%
116
 
1.8%
109
 
1.7%
106
 
1.6%
Other values (395) 5179
80.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6172
95.4%
Space Separator 104
 
1.6%
Decimal Number 72
 
1.1%
Uppercase Letter 55
 
0.9%
Open Punctuation 17
 
0.3%
Close Punctuation 17
 
0.3%
Other Punctuation 15
 
0.2%
Lowercase Letter 13
 
0.2%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
145
 
2.3%
138
 
2.2%
138
 
2.2%
136
 
2.2%
136
 
2.2%
134
 
2.2%
131
 
2.1%
116
 
1.9%
109
 
1.8%
106
 
1.7%
Other values (359) 4883
79.1%
Uppercase Letter
ValueCountFrequency (%)
K 10
18.2%
S 10
18.2%
L 8
14.5%
O 6
10.9%
A 4
 
7.3%
P 3
 
5.5%
G 3
 
5.5%
T 2
 
3.6%
C 2
 
3.6%
H 1
 
1.8%
Other values (6) 6
10.9%
Decimal Number
ValueCountFrequency (%)
2 41
56.9%
1 20
27.8%
0 4
 
5.6%
3 3
 
4.2%
6 2
 
2.8%
5 2
 
2.8%
Lowercase Letter
ValueCountFrequency (%)
a 4
30.8%
i 3
23.1%
p 2
15.4%
m 2
15.4%
s 1
 
7.7%
e 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 7
46.7%
· 4
26.7%
, 3
20.0%
& 1
 
6.7%
Space Separator
ValueCountFrequency (%)
104
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6172
95.4%
Common 228
 
3.5%
Latin 68
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
145
 
2.3%
138
 
2.2%
138
 
2.2%
136
 
2.2%
136
 
2.2%
134
 
2.2%
131
 
2.1%
116
 
1.9%
109
 
1.8%
106
 
1.7%
Other values (359) 4883
79.1%
Latin
ValueCountFrequency (%)
K 10
14.7%
S 10
14.7%
L 8
11.8%
O 6
 
8.8%
A 4
 
5.9%
a 4
 
5.9%
P 3
 
4.4%
G 3
 
4.4%
i 3
 
4.4%
T 2
 
2.9%
Other values (12) 15
22.1%
Common
ValueCountFrequency (%)
104
45.6%
2 41
 
18.0%
1 20
 
8.8%
( 17
 
7.5%
) 17
 
7.5%
. 7
 
3.1%
0 4
 
1.8%
· 4
 
1.8%
3 3
 
1.3%
, 3
 
1.3%
Other values (4) 8
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6172
95.4%
ASCII 292
 
4.5%
None 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
145
 
2.3%
138
 
2.2%
138
 
2.2%
136
 
2.2%
136
 
2.2%
134
 
2.2%
131
 
2.1%
116
 
1.9%
109
 
1.8%
106
 
1.7%
Other values (359) 4883
79.1%
ASCII
ValueCountFrequency (%)
104
35.6%
2 41
 
14.0%
1 20
 
6.8%
( 17
 
5.8%
) 17
 
5.8%
K 10
 
3.4%
S 10
 
3.4%
L 8
 
2.7%
. 7
 
2.4%
O 6
 
2.1%
Other values (25) 52
17.8%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct320
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:36:17.593200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.917
Min length3

Characters and Unicode

Total characters59170
Distinct characters88
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row동명로25번길
2nd row학소로
3rd row동계로9번길
4th row동계로1번길
5th row서남로
ValueCountFrequency (%)
남문로 227
 
2.3%
중앙로 224
 
2.2%
필문대로 213
 
2.1%
무등로 208
 
2.1%
동명로 189
 
1.9%
동계천로 185
 
1.8%
충장로 167
 
1.7%
제봉로 154
 
1.5%
경양로 152
 
1.5%
지호로 143
 
1.4%
Other values (310) 8138
81.4%
2023-12-12T08:36:18.080409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8882
 
15.0%
7034
 
11.9%
5893
 
10.0%
2 2453
 
4.1%
1 2393
 
4.0%
3 2147
 
3.6%
1873
 
3.2%
5 1463
 
2.5%
1334
 
2.3%
1284
 
2.2%
Other values (78) 24414
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44371
75.0%
Decimal Number 14799
 
25.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8882
20.0%
7034
15.9%
5893
13.3%
1873
 
4.2%
1334
 
3.0%
1284
 
2.9%
1225
 
2.8%
731
 
1.6%
699
 
1.6%
698
 
1.6%
Other values (68) 14718
33.2%
Decimal Number
ValueCountFrequency (%)
2 2453
16.6%
1 2393
16.2%
3 2147
14.5%
5 1463
9.9%
6 1277
8.6%
7 1232
8.3%
4 1121
7.6%
0 935
 
6.3%
9 928
 
6.3%
8 850
 
5.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44371
75.0%
Common 14799
 
25.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8882
20.0%
7034
15.9%
5893
13.3%
1873
 
4.2%
1334
 
3.0%
1284
 
2.9%
1225
 
2.8%
731
 
1.6%
699
 
1.6%
698
 
1.6%
Other values (68) 14718
33.2%
Common
ValueCountFrequency (%)
2 2453
16.6%
1 2393
16.2%
3 2147
14.5%
5 1463
9.9%
6 1277
8.6%
7 1232
8.3%
4 1121
7.6%
0 935
 
6.3%
9 928
 
6.3%
8 850
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44371
75.0%
ASCII 14799
 
25.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8882
20.0%
7034
15.9%
5893
13.3%
1873
 
4.2%
1334
 
3.0%
1284
 
2.9%
1225
 
2.8%
731
 
1.6%
699
 
1.6%
698
 
1.6%
Other values (68) 14718
33.2%
ASCII
ValueCountFrequency (%)
2 2453
16.6%
1 2393
16.2%
3 2147
14.5%
5 1463
9.9%
6 1277
8.6%
7 1232
8.3%
4 1121
7.6%
0 935
 
6.3%
9 928
 
6.3%
8 850
 
5.7%

지하여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9988 
지하
 
12

Length

Max length4
Median length4
Mean length3.9976
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9988
99.9%
지하 12
 
0.1%

Length

2023-12-12T08:36:18.216154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:18.308865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9988
99.9%
지하 12
 
0.1%

건물본번
Real number (ℝ)

Distinct594
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81.2279
Minimum1
Maximum784
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:18.403941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q110
median21
Q376
95-th percentile381
Maximum784
Range783
Interquartile range (IQR)66

Descriptive statistics

Standard deviation137.78566
Coefficient of variation (CV)1.696285
Kurtosis7.1242721
Mean81.2279
Median Absolute Deviation (MAD)15
Skewness2.6230704
Sum812279
Variance18984.889
MonotonicityNot monotonic
2023-12-12T08:36:18.556599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6 371
 
3.7%
5 361
 
3.6%
8 310
 
3.1%
10 294
 
2.9%
11 281
 
2.8%
7 280
 
2.8%
12 274
 
2.7%
14 269
 
2.7%
15 247
 
2.5%
9 247
 
2.5%
Other values (584) 7066
70.7%
ValueCountFrequency (%)
1 194
1.9%
2 175
1.8%
3 214
2.1%
4 225
2.2%
5 361
3.6%
6 371
3.7%
7 280
2.8%
8 310
3.1%
9 247
2.5%
10 294
2.9%
ValueCountFrequency (%)
784 2
 
< 0.1%
782 1
 
< 0.1%
778 2
 
< 0.1%
776 1
 
< 0.1%
774 1
 
< 0.1%
772 6
0.1%
770 1
 
< 0.1%
768 3
< 0.1%
766 1
 
< 0.1%
764 1
 
< 0.1%

건물부번
Real number (ℝ)

ZEROS 

Distinct62
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.041
Minimum0
Maximum320
Zeros4056
Zeros (%)40.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:18.698612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q37
95-th percentile22
Maximum320
Range320
Interquartile range (IQR)7

Descriptive statistics

Standard deviation8.5876594
Coefficient of variation (CV)1.7035627
Kurtosis188.01107
Mean5.041
Median Absolute Deviation (MAD)1
Skewness7.061367
Sum50410
Variance73.747894
MonotonicityNot monotonic
2023-12-12T08:36:18.814968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4056
40.6%
1 1228
 
12.3%
3 487
 
4.9%
2 452
 
4.5%
4 380
 
3.8%
5 374
 
3.7%
7 345
 
3.5%
6 323
 
3.2%
8 268
 
2.7%
9 242
 
2.4%
Other values (52) 1845
18.4%
ValueCountFrequency (%)
0 4056
40.6%
1 1228
 
12.3%
2 452
 
4.5%
3 487
 
4.9%
4 380
 
3.8%
5 374
 
3.7%
6 323
 
3.2%
7 345
 
3.5%
8 268
 
2.7%
9 242
 
2.4%
ValueCountFrequency (%)
320 1
 
< 0.1%
106 1
 
< 0.1%
80 1
 
< 0.1%
61 1
 
< 0.1%
59 2
< 0.1%
57 4
< 0.1%
56 2
< 0.1%
55 2
< 0.1%
54 1
 
< 0.1%
53 1
 
< 0.1%

기초구역번호
Real number (ℝ)

Distinct115
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61453.048
Minimum61400
Maximum61514
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:36:18.945620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum61400
5-th percentile61405
Q161428
median61447
Q361484
95-th percentile61507.05
Maximum61514
Range114
Interquartile range (IQR)56

Descriptive statistics

Standard deviation33.036603
Coefficient of variation (CV)0.00053759095
Kurtosis-1.1900724
Mean61453.048
Median Absolute Deviation (MAD)27
Skewness0.23373585
Sum6.1453048 × 108
Variance1091.4171
MonotonicityNot monotonic
2023-12-12T08:36:19.071630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
61430 297
 
3.0%
61417 277
 
2.8%
61496 259
 
2.6%
61411 228
 
2.3%
61428 219
 
2.2%
61437 208
 
2.1%
61405 207
 
2.1%
61488 204
 
2.0%
61434 195
 
1.9%
61512 192
 
1.9%
Other values (105) 7714
77.1%
ValueCountFrequency (%)
61400 119
1.2%
61401 130
1.3%
61402 36
 
0.4%
61403 14
 
0.1%
61404 21
 
0.2%
61405 207
2.1%
61406 182
1.8%
61407 128
1.3%
61408 31
 
0.3%
61409 11
 
0.1%
ValueCountFrequency (%)
61514 55
 
0.5%
61513 85
0.9%
61512 192
1.9%
61511 83
0.8%
61510 42
 
0.4%
61509 18
 
0.2%
61508 25
 
0.2%
61507 70
 
0.7%
61506 90
0.9%
61505 11
 
0.1%

도로관리기관
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동구
9952 
북구
 
45
남구
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동구
2nd row동구
3rd row동구
4th row동구
5th row동구

Common Values

ValueCountFrequency (%)
동구 9952
99.5%
북구 45
 
0.4%
남구 3
 
< 0.1%

Length

2023-12-12T08:36:19.205132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:19.303042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동구 9952
99.5%
북구 45
 
0.4%
남구 3
 
< 0.1%

고시여부
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고시
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고시
2nd row고시
3rd row고시
4th row고시
5th row고시

Common Values

ValueCountFrequency (%)
고시 10000
100.0%

Length

2023-12-12T08:36:19.393006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:19.470826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고시 10000
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-11-03
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-11-03
2nd row2023-11-03
3rd row2023-11-03
4th row2023-11-03
5th row2023-11-03

Common Values

ValueCountFrequency (%)
2023-11-03 10000
100.0%

Length

2023-12-12T08:36:19.570954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:19.648849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-11-03 10000
100.0%

Sample

시군구읍면동건물일련번호건축물대장건물명시군구관리건물명상세건물명건물 용도(분류)건물 용도(상세)건물종속구분우편번호리명산여부지번(본번)지번(부번)고시일자건물군일련번호건물군명도로명지하여부건물본번건물부번기초구역번호도로관리기관고시여부데이터기준일자
3413동구동명동24287<NA><NA><NA>단독주택단독주택주건물61430<NA><NA>2001222011-07-2911253<NA>동명로25번길<NA>16161430동구고시2023-11-03
18020동구학동11964<NA><NA><NA>단독주택단독주택주건물61462<NA><NA>781132011-07-290<NA>학소로<NA>132061462동구고시2023-11-03
2962동구동명동4319<NA><NA><NA>단독주택단독주택주건물61430<NA><NA>2071092011-07-2911316<NA>동계로9번길<NA>4161430동구고시2023-11-03
2932동구동명동24354<NA><NA><NA>단독주택단독주택주건물61430<NA><NA>200452011-07-2911217<NA>동계로1번길<NA>141261430동구고시2023-11-03
14108동구서석동5172<NA><NA><NA>제1종근린생활시설소매점주건물61468<NA><NA>3392011-07-290<NA>서남로<NA>4061468동구고시2023-11-03
1907동구동명동13840<NA><NA><NA>단독주택단독주택주건물61448<NA><NA>10212011-07-2911021<NA>필문대로<NA>249161448동구고시2023-11-03
163동구대인동23432<NA><NA><NA>제1종근린생활시설이(미)용원주건물61470<NA><NA>2362011-07-297282<NA>제봉로<NA>2212561470동구고시2023-11-03
10900동구지산동1978<NA><NA><NA>판매 및 영업시설상점주건물61441<NA><NA>706542011-07-290<NA>밤실로<NA>35061441동구고시2023-11-03
3854동구동명동28341<NA><NA><NA>단독주택다중주택주건물61448<NA><NA>113982013-02-250<NA>필문대로253번길<NA>10561448동구고시2023-11-03
11158동구지산동7844<NA><NA><NA>제1종근린생활시설소매점주건물61441<NA><NA>708302011-07-290<NA>밤실로23번길<NA>9061441동구고시2023-11-03
시군구읍면동건물일련번호건축물대장건물명시군구관리건물명상세건물명건물 용도(분류)건물 용도(상세)건물종속구분우편번호리명산여부지번(본번)지번(부번)고시일자건물군일련번호건물군명도로명지하여부건물본번건물부번기초구역번호도로관리기관고시여부데이터기준일자
10034동구산수동16326<NA><NA><NA>제1종근린생활시설소매점주건물61433<NA><NA>55762011-07-290<NA>필문대로192번길<NA>8061433동구고시2023-11-03
17250동구운림동11781비지터 센터비지터 센터<NA>제1종근린생활시설기타공공시설주건물61493<NA><NA>10412011-07-290<NA>증심사길<NA>71061493동구고시2023-11-03
703동구금남로5가15127<NA><NA><NA>단독주택단독주택주건물61471<NA><NA>532011-07-296057<NA>독립로<NA>260561471동구고시2023-11-03
4791동구계림동2209<NA><NA><NA>단독주택단독주택종속건물61413<NA><NA>25092011-07-29793<NA>중앙로<NA>278161413동구고시2023-11-03
20465동구충장로3가20417<NA><NA><NA>제2종근린생활시설일반음식점주건물61483<NA><NA>36<NA>2011-07-290<NA>충장로안길<NA>5361483동구고시2023-11-03
9650동구산수동16267<NA><NA><NA>단독주택단독주택주건물61411<NA><NA>509512011-07-290<NA>필문대로171번길<NA>24361411동구고시2023-11-03
14339동구서석동19077<NA><NA><NA>자동차관련시설주차장주건물61468<NA><NA>9282011-07-290<NA>백서로175번길<NA>15161468동구고시2023-11-03
533동구대인동18381<NA><NA><NA>숙박시설여관주건물61426<NA><NA>322102011-07-290<NA>제봉로222번길<NA>101461426동구고시2023-11-03
19106동구학동10326조선대그린빌리지조선대그린빌리지F동단독주택다가구주택주건물61457<NA><NA>62122022-02-2810644조선대그린빌리지조선대2길<NA>60061457동구고시2023-11-03
9460동구산수동21545<NA><NA><NA>제1종근린생활시설소매점주건물61411<NA><NA>530422011-07-290<NA>필문대로165번길<NA>1261411동구고시2023-11-03