Overview

Dataset statistics

Number of variables6
Number of observations187
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.9 KiB
Average record size in memory48.7 B

Variable types

Text6

Dataset

Description각 기초자치단체에서 승인된 아파트 등 주택 건설 사업장에 대한 사업명, 대지위치, 세대수, 사업주체, 시공자, 준공(예정)일 등을 제공함
URLhttps://www.data.go.kr/data/15116841/fileData.do

Alerts

사업명 has unique valuesUnique
대지위치(외 필지수) has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:10:36.484266
Analysis finished2023-12-12 07:10:37.422173
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업명
Text

UNIQUE 

Distinct187
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:37.594732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length22
Mean length13.31016
Min length5

Characters and Unicode

Total characters2489
Distinct characters233
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)100.0%

Sample

1st row대봉1-2주택재건축사업
2nd row대봉1-3주택재건축사업
3rd row태평아파트소규모재건축사업
4th row남산4-5주택재건축사업
5th row달성지구주택재개발사업
ValueCountFrequency (%)
주거복합 43
 
10.3%
공동주택 25
 
6.0%
주상복합 14
 
3.3%
신축공사 12
 
2.9%
힐스테이트 8
 
1.9%
대명동 7
 
1.7%
본리동 6
 
1.4%
감삼동 5
 
1.2%
일원 5
 
1.2%
주택재건축정비사업 5
 
1.2%
Other values (246) 289
69.0%
2023-12-12T16:10:37.979434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
241
 
9.7%
143
 
5.7%
132
 
5.3%
80
 
3.2%
77
 
3.1%
72
 
2.9%
69
 
2.8%
65
 
2.6%
62
 
2.5%
55
 
2.2%
Other values (223) 1493
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1966
79.0%
Space Separator 241
 
9.7%
Decimal Number 201
 
8.1%
Uppercase Letter 38
 
1.5%
Dash Punctuation 35
 
1.4%
Close Punctuation 4
 
0.2%
Open Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
 
7.3%
132
 
6.7%
80
 
4.1%
77
 
3.9%
72
 
3.7%
69
 
3.5%
65
 
3.3%
62
 
3.2%
55
 
2.8%
55
 
2.8%
Other values (195) 1156
58.8%
Uppercase Letter
ValueCountFrequency (%)
B 8
21.1%
A 5
13.2%
L 5
13.2%
W 4
10.5%
S 3
 
7.9%
I 3
 
7.9%
C 2
 
5.3%
H 2
 
5.3%
R 1
 
2.6%
K 1
 
2.6%
Other values (4) 4
10.5%
Decimal Number
ValueCountFrequency (%)
1 43
21.4%
2 38
18.9%
3 29
14.4%
4 18
9.0%
5 17
 
8.5%
9 13
 
6.5%
8 12
 
6.0%
7 11
 
5.5%
6 11
 
5.5%
0 9
 
4.5%
Space Separator
ValueCountFrequency (%)
241
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1966
79.0%
Common 485
 
19.5%
Latin 38
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
 
7.3%
132
 
6.7%
80
 
4.1%
77
 
3.9%
72
 
3.7%
69
 
3.5%
65
 
3.3%
62
 
3.2%
55
 
2.8%
55
 
2.8%
Other values (195) 1156
58.8%
Common
ValueCountFrequency (%)
241
49.7%
1 43
 
8.9%
2 38
 
7.8%
- 35
 
7.2%
3 29
 
6.0%
4 18
 
3.7%
5 17
 
3.5%
9 13
 
2.7%
8 12
 
2.5%
7 11
 
2.3%
Other values (4) 28
 
5.8%
Latin
ValueCountFrequency (%)
B 8
21.1%
A 5
13.2%
L 5
13.2%
W 4
10.5%
S 3
 
7.9%
I 3
 
7.9%
C 2
 
5.3%
H 2
 
5.3%
R 1
 
2.6%
K 1
 
2.6%
Other values (4) 4
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1966
79.0%
ASCII 523
 
21.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
241
46.1%
1 43
 
8.2%
2 38
 
7.3%
- 35
 
6.7%
3 29
 
5.5%
4 18
 
3.4%
5 17
 
3.3%
9 13
 
2.5%
8 12
 
2.3%
7 11
 
2.1%
Other values (18) 66
 
12.6%
Hangul
ValueCountFrequency (%)
143
 
7.3%
132
 
6.7%
80
 
4.1%
77
 
3.9%
72
 
3.7%
69
 
3.5%
65
 
3.3%
62
 
3.2%
55
 
2.8%
55
 
2.8%
Other values (195) 1156
58.8%
Distinct187
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:38.371060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length30
Mean length22.534759
Min length16

Characters and Unicode

Total characters4214
Distinct characters129
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)100.0%

Sample

1st row대구광역시 중구 대봉동 55-3(31)
2nd row대구광역시 중구 대봉동 55-68(17)
3rd row대구광역시 중구 태평로1가 23-1
4th row대구광역시 중구 남산동 2478(357)
5th row대구광역시 중구 달성동 12-11(636)
ValueCountFrequency (%)
대구광역시 187
21.2%
일원 104
 
11.8%
중구 32
 
3.6%
수성구 29
 
3.3%
달서구 29
 
3.3%
동구 26
 
2.9%
북구 25
 
2.8%
서구 19
 
2.1%
남구 17
 
1.9%
달성군 10
 
1.1%
Other values (293) 406
45.9%
2023-12-12T16:10:38.878865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
705
16.7%
373
 
8.9%
204
 
4.8%
202
 
4.8%
1 194
 
4.6%
191
 
4.5%
190
 
4.5%
187
 
4.4%
- 133
 
3.2%
126
 
3.0%
Other values (119) 1709
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2497
59.3%
Decimal Number 818
 
19.4%
Space Separator 705
 
16.7%
Dash Punctuation 133
 
3.2%
Close Punctuation 24
 
0.6%
Open Punctuation 24
 
0.6%
Uppercase Letter 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
373
14.9%
204
 
8.2%
202
 
8.1%
191
 
7.6%
190
 
7.6%
187
 
7.5%
126
 
5.0%
118
 
4.7%
100
 
4.0%
84
 
3.4%
Other values (101) 722
28.9%
Decimal Number
ValueCountFrequency (%)
1 194
23.7%
2 112
13.7%
3 96
11.7%
5 72
 
8.8%
4 70
 
8.6%
6 64
 
7.8%
8 60
 
7.3%
0 59
 
7.2%
7 46
 
5.6%
9 45
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 6
46.2%
L 3
23.1%
A 3
23.1%
D 1
 
7.7%
Space Separator
ValueCountFrequency (%)
705
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 133
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2497
59.3%
Common 1704
40.4%
Latin 13
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
373
14.9%
204
 
8.2%
202
 
8.1%
191
 
7.6%
190
 
7.6%
187
 
7.5%
126
 
5.0%
118
 
4.7%
100
 
4.0%
84
 
3.4%
Other values (101) 722
28.9%
Common
ValueCountFrequency (%)
705
41.4%
1 194
 
11.4%
- 133
 
7.8%
2 112
 
6.6%
3 96
 
5.6%
5 72
 
4.2%
4 70
 
4.1%
6 64
 
3.8%
8 60
 
3.5%
0 59
 
3.5%
Other values (4) 139
 
8.2%
Latin
ValueCountFrequency (%)
B 6
46.2%
L 3
23.1%
A 3
23.1%
D 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2497
59.3%
ASCII 1717
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
705
41.1%
1 194
 
11.3%
- 133
 
7.7%
2 112
 
6.5%
3 96
 
5.6%
5 72
 
4.2%
4 70
 
4.1%
6 64
 
3.7%
8 60
 
3.5%
0 59
 
3.4%
Other values (8) 152
 
8.9%
Hangul
ValueCountFrequency (%)
373
14.9%
204
 
8.2%
202
 
8.1%
191
 
7.6%
190
 
7.6%
187
 
7.5%
126
 
5.0%
118
 
4.7%
100
 
4.0%
84
 
3.4%
Other values (101) 722
28.9%
Distinct166
Distinct (%)88.8%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:39.342032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length3.2673797
Min length1

Characters and Unicode

Total characters611
Distinct characters21
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique146 ?
Unique (%)78.1%

Sample

1st row487
2nd row469
3rd row419
4th row947
5th row1501
ValueCountFrequency (%)
627 3
 
1.6%
446 2
 
1.0%
아파트 2
 
1.0%
12 2
 
1.0%
320 2
 
1.0%
418 2
 
1.0%
532 2
 
1.0%
300 2
 
1.0%
499 2
 
1.0%
433 2
 
1.0%
Other values (161) 172
89.1%
2023-12-12T16:10:39.872357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 85
13.9%
4 74
12.1%
3 64
10.5%
9 61
10.0%
0 60
9.8%
2 56
9.2%
6 52
8.5%
5 50
8.2%
8 48
7.9%
7 37
6.1%
Other values (11) 24
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 587
96.1%
Other Letter 12
 
2.0%
Space Separator 6
 
1.0%
Other Punctuation 4
 
0.7%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 85
14.5%
4 74
12.6%
3 64
10.9%
9 61
10.4%
0 60
10.2%
2 56
9.5%
6 52
8.9%
5 50
8.5%
8 48
8.2%
7 37
6.3%
Other Letter
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
1
8.3%
1
8.3%
Space Separator
ValueCountFrequency (%)
6
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 599
98.0%
Hangul 12
 
2.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 85
14.2%
4 74
12.4%
3 64
10.7%
9 61
10.2%
0 60
10.0%
2 56
9.3%
6 52
8.7%
5 50
8.3%
8 48
8.0%
7 37
6.2%
Other values (4) 12
 
2.0%
Hangul
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
1
8.3%
1
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 599
98.0%
Hangul 12
 
2.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 85
14.2%
4 74
12.4%
3 64
10.7%
9 61
10.2%
0 60
10.0%
2 56
9.3%
6 52
8.7%
5 50
8.3%
8 48
8.0%
7 37
6.2%
Other values (4) 12
 
2.0%
Hangul
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
1
8.3%
1
8.3%
Distinct151
Distinct (%)80.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:40.153431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length11.438503
Min length2

Characters and Unicode

Total characters2139
Distinct characters215
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique139 ?
Unique (%)74.3%

Sample

1st row대봉1-2지구주택재건축정비사업조합
2nd row대봉1-3지구주택재건축정비사업조합
3rd row77태평아파트주택재건축정비사업조합
4th row남산4-5지구주택재건축정비사업조합
5th row달성지구주택재개발정비사업조합
ValueCountFrequency (%)
주택재개발정비사업조합 17
 
6.0%
주택재건축정비사업조합 16
 
5.6%
아시아신탁㈜ 11
 
3.9%
촉진구역 9
 
3.2%
㈜하나자산신탁 6
 
2.1%
lh 6
 
2.1%
대구도시공사 5
 
1.8%
㈜무궁화신탁 4
 
1.4%
가로주택정비사업조합 4
 
1.4%
한국자산신탁㈜ 3
 
1.1%
Other values (185) 203
71.5%
2023-12-12T16:10:40.611280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
97
 
4.5%
97
 
4.5%
87
 
4.1%
73
 
3.4%
73
 
3.4%
71
 
3.3%
69
 
3.2%
68
 
3.2%
66
 
3.1%
66
 
3.1%
Other values (205) 1372
64.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1830
85.6%
Other Symbol 97
 
4.5%
Space Separator 97
 
4.5%
Decimal Number 53
 
2.5%
Open Punctuation 17
 
0.8%
Close Punctuation 17
 
0.8%
Uppercase Letter 16
 
0.7%
Other Punctuation 8
 
0.4%
Dash Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
4.8%
73
 
4.0%
73
 
4.0%
71
 
3.9%
69
 
3.8%
68
 
3.7%
66
 
3.6%
66
 
3.6%
65
 
3.6%
56
 
3.1%
Other values (186) 1136
62.1%
Decimal Number
ValueCountFrequency (%)
2 15
28.3%
3 13
24.5%
1 11
20.8%
7 3
 
5.7%
5 3
 
5.7%
0 3
 
5.7%
6 2
 
3.8%
4 2
 
3.8%
8 1
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
L 7
43.8%
H 7
43.8%
S 1
 
6.2%
G 1
 
6.2%
Other Symbol
ValueCountFrequency (%)
97
100.0%
Space Separator
ValueCountFrequency (%)
97
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1927
90.1%
Common 196
 
9.2%
Latin 16
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
97
 
5.0%
87
 
4.5%
73
 
3.8%
73
 
3.8%
71
 
3.7%
69
 
3.6%
68
 
3.5%
66
 
3.4%
66
 
3.4%
65
 
3.4%
Other values (187) 1192
61.9%
Common
ValueCountFrequency (%)
97
49.5%
( 17
 
8.7%
) 17
 
8.7%
2 15
 
7.7%
3 13
 
6.6%
1 11
 
5.6%
, 8
 
4.1%
- 4
 
2.0%
7 3
 
1.5%
5 3
 
1.5%
Other values (4) 8
 
4.1%
Latin
ValueCountFrequency (%)
L 7
43.8%
H 7
43.8%
S 1
 
6.2%
G 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1830
85.6%
ASCII 212
 
9.9%
None 97
 
4.5%

Most frequent character per block

None
ValueCountFrequency (%)
97
100.0%
ASCII
ValueCountFrequency (%)
97
45.8%
( 17
 
8.0%
) 17
 
8.0%
2 15
 
7.1%
3 13
 
6.1%
1 11
 
5.2%
, 8
 
3.8%
L 7
 
3.3%
H 7
 
3.3%
- 4
 
1.9%
Other values (8) 16
 
7.5%
Hangul
ValueCountFrequency (%)
87
 
4.8%
73
 
4.0%
73
 
4.0%
71
 
3.9%
69
 
3.8%
68
 
3.7%
66
 
3.6%
66
 
3.6%
65
 
3.6%
56
 
3.1%
Other values (186) 1136
62.1%
Distinct109
Distinct (%)58.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:40.886723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length15
Mean length5.5026738
Min length2

Characters and Unicode

Total characters1029
Distinct characters120
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)47.1%

Sample

1st row현대엔지니어링
2nd row㈜서한
3rd row㈜삼호
4th rowGS건설
5th row㈜대우건설/현대엔지니어링㈜
ValueCountFrequency (%)
미정 36
 
17.7%
㈜서한 9
 
4.4%
㈜대우건설 9
 
4.4%
현대엔지니어링㈜ 5
 
2.5%
㈜포스코건설 5
 
2.5%
현대건설(주 4
 
2.0%
gs건설 4
 
2.0%
현대건설㈜ 4
 
2.0%
화성산업㈜ 4
 
2.0%
동부건설㈜ 3
 
1.5%
Other values (104) 120
59.1%
2023-12-12T16:10:41.314681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
112
 
10.9%
91
 
8.8%
88
 
8.6%
38
 
3.7%
37
 
3.6%
37
 
3.6%
28
 
2.7%
22
 
2.1%
21
 
2.0%
( 21
 
2.0%
Other values (110) 534
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 809
78.6%
Other Symbol 112
 
10.9%
Uppercase Letter 32
 
3.1%
Open Punctuation 21
 
2.0%
Close Punctuation 21
 
2.0%
Space Separator 16
 
1.6%
Other Punctuation 15
 
1.5%
Lowercase Letter 2
 
0.2%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
11.2%
88
 
10.9%
38
 
4.7%
37
 
4.6%
37
 
4.6%
28
 
3.5%
22
 
2.7%
21
 
2.6%
19
 
2.3%
18
 
2.2%
Other values (92) 410
50.7%
Uppercase Letter
ValueCountFrequency (%)
S 8
25.0%
C 7
21.9%
G 6
18.8%
D 4
12.5%
K 3
 
9.4%
H 2
 
6.2%
E 1
 
3.1%
L 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 11
73.3%
& 2
 
13.3%
/ 2
 
13.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
g 1
50.0%
Other Symbol
ValueCountFrequency (%)
112
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 921
89.5%
Common 74
 
7.2%
Latin 34
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
112
 
12.2%
91
 
9.9%
88
 
9.6%
38
 
4.1%
37
 
4.0%
37
 
4.0%
28
 
3.0%
22
 
2.4%
21
 
2.3%
19
 
2.1%
Other values (93) 428
46.5%
Latin
ValueCountFrequency (%)
S 8
23.5%
C 7
20.6%
G 6
17.6%
D 4
11.8%
K 3
 
8.8%
H 2
 
5.9%
s 1
 
2.9%
g 1
 
2.9%
E 1
 
2.9%
L 1
 
2.9%
Common
ValueCountFrequency (%)
( 21
28.4%
) 21
28.4%
16
21.6%
, 11
14.9%
& 2
 
2.7%
/ 2
 
2.7%
1 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 809
78.6%
None 112
 
10.9%
ASCII 108
 
10.5%

Most frequent character per block

None
ValueCountFrequency (%)
112
100.0%
Hangul
ValueCountFrequency (%)
91
 
11.2%
88
 
10.9%
38
 
4.7%
37
 
4.6%
37
 
4.6%
28
 
3.5%
22
 
2.7%
21
 
2.6%
19
 
2.3%
18
 
2.2%
Other values (92) 410
50.7%
ASCII
ValueCountFrequency (%)
( 21
19.4%
) 21
19.4%
16
14.8%
, 11
10.2%
S 8
 
7.4%
C 7
 
6.5%
G 6
 
5.6%
D 4
 
3.7%
K 3
 
2.8%
& 2
 
1.9%
Other values (7) 9
8.3%
Distinct114
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T16:10:41.645868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length7.6684492
Min length2

Characters and Unicode

Total characters1434
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)54.0%

Sample

1st row미착공
2nd row2019-12-03
3rd row미착공
4th row2020-02-14
5th row2020-07-02
ValueCountFrequency (%)
미착공 60
32.1%
2021-08-13 3
 
1.6%
2020-05-14 3
 
1.6%
2020-11-06 2
 
1.1%
착공 2
 
1.1%
2021-10-22 2
 
1.1%
2021-07-01 2
 
1.1%
2020-07-02 2
 
1.1%
2019-12-31 2
 
1.1%
2021-03-19 2
 
1.1%
Other values (104) 107
57.2%
2023-12-12T16:10:42.113944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 336
23.4%
0 312
21.8%
- 250
17.4%
1 169
11.8%
62
 
4.3%
62
 
4.3%
60
 
4.2%
9 35
 
2.4%
3 30
 
2.1%
7 30
 
2.1%
Other values (4) 88
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
69.7%
Dash Punctuation 250
 
17.4%
Other Letter 184
 
12.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 336
33.6%
0 312
31.2%
1 169
16.9%
9 35
 
3.5%
3 30
 
3.0%
7 30
 
3.0%
8 23
 
2.3%
4 22
 
2.2%
6 22
 
2.2%
5 21
 
2.1%
Other Letter
ValueCountFrequency (%)
62
33.7%
62
33.7%
60
32.6%
Dash Punctuation
ValueCountFrequency (%)
- 250
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1250
87.2%
Hangul 184
 
12.8%

Most frequent character per script

Common
ValueCountFrequency (%)
2 336
26.9%
0 312
25.0%
- 250
20.0%
1 169
13.5%
9 35
 
2.8%
3 30
 
2.4%
7 30
 
2.4%
8 23
 
1.8%
4 22
 
1.8%
6 22
 
1.8%
Hangul
ValueCountFrequency (%)
62
33.7%
62
33.7%
60
32.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1250
87.2%
Hangul 184
 
12.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 336
26.9%
0 312
25.0%
- 250
20.0%
1 169
13.5%
9 35
 
2.8%
3 30
 
2.4%
7 30
 
2.4%
8 23
 
1.8%
4 22
 
1.8%
6 22
 
1.8%
Hangul
ValueCountFrequency (%)
62
33.7%
62
33.7%
60
32.6%

Missing values

2023-12-12T16:10:37.270230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:10:37.379583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업명대지위치(외 필지수)세대수사업주체시공자착공일
0대봉1-2주택재건축사업대구광역시 중구 대봉동 55-3(31)487대봉1-2지구주택재건축정비사업조합현대엔지니어링미착공
1대봉1-3주택재건축사업대구광역시 중구 대봉동 55-68(17)469대봉1-3지구주택재건축정비사업조합㈜서한2019-12-03
2태평아파트소규모재건축사업대구광역시 중구 태평로1가 23-141977태평아파트주택재건축정비사업조합㈜삼호미착공
3남산4-5주택재건축사업대구광역시 중구 남산동 2478(357)947남산4-5지구주택재건축정비사업조합GS건설2020-02-14
4달성지구주택재개발사업대구광역시 중구 달성동 12-11(636)1501달성지구주택재개발정비사업조합㈜대우건설/현대엔지니어링㈜2020-07-02
5동인3-1주택재개발사업대구광역시 중구 동인동3가 88(169)630동인3-1지구주택재개발정비사업조합대우산업개발㈜2020-07-02
6힐스테이트 도원센트럴대구광역시 중구 도원동 3-11(147)894㈜하나자산신탁현대건설㈜2020-03-18
7동산동 청라힐 주상복합대구광역시 중구 동산동 531(108)302㈜하나자산신탁㈜서한2019-08-22
8힐스테이트 대구역 주상복합대구광역시 중구 태평로2가 7-1(155)803국제자산신탁㈜현대건설㈜2019-11-20
9대원 칸타빌대구광역시 중구 동인동1가 77(64)410㈜대원㈜대원2020-08-07
사업명대지위치(외 필지수)세대수사업주체시공자착공일
177대구국가산단 A7-1BL대구광역시 달성군 구지면 국가산단 내(2단계) A7-1BL500LH미정미착공
178금포지구 우신미가뷰 1단지 공동주택대구광역시 달성군 논공읍 금포리 금포지구 토지구획정리사업 64BL 1L695우신종합건설㈜우신종합건설㈜2019-08-23
179화원 파크뷰 우방아이유쉘대구광역시 달성군 화원읍 천내리 690-1번지538아시아신탁㈜우방산업㈜2019-09-01
180다사역 공동주택 신축공사대구광역시 달성군 다사읍 매곡리 521-2 외 53필지869다사도시개발주식회사금호산업㈜2020-05-14
181설화리 공동주택 신축공사대구광역시 달성군 화원읍 설화리 778-1 외 8필지320아시아신탁㈜에스엠상선㈜2020-03-26
182대구테크노폴리스 RC블럭 주상복합 신축공사대구광역시 달성군 유가읍 봉리 660번지894㈜금성백조주택 정성욱 외 2인(주)금성백조주택2020-10-21
183화원 동화아이위시 주거복합 신축공사대구광역시 달성군 화원읍 명곡리 230-8 외 29필지568㈜동화건설㈜동화건설2021-02-03
184가창 테라스하우스 공동주택 신축공사대구광역시 달성군 가창면 용계리 산10-4 외 13필지118㈜다성디앤씨미정미착공
185테크노폴리스 타운하우스 신축공사대구광역시 달성군 유가읍 초곡리 426외 20필지69(주)하우스탑디앤씨㈜태왕이앤씨 외 12022-02-01
186다사읍 매곡리 748일원주상복합 신축공사대구광역시 달성군 다사읍 매곡리 748외 63필지471㈜에스앤씨컨설팅미정미착공