Overview

Dataset statistics

Number of variables: 5
Number of observations: 277
Missing cells: 89
Missing cells (%): 6.4%
Duplicate rows: 2
Duplicate rows (%): 0.7%
Total size in memory: 10.9 KiB
Average record size in memory: 40.5 B

Variable types

Text: 5

Dataset

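Description: Status of real estate brokerage offices located in Jung-gu, Daegu Metropolitan City, including their locations. Fields include registration number (등록번호), office name (사무소명), broker name (중개업자명), office telephone number (사무소전화번호), and office address (사무소주소). The same information can be viewed at https://www.jung.daegu.kr/new/pages/information/page.html?mc=1884.
Author: Jung-gu, Daegu Metropolitan City (대구광역시 중구)
URL: https://www.data.go.kr/data/3072640/fileData.do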

Alerts

Dataset has 2 (0.7%) duplicate rows (Duplicates)
사무소전화번호 has 89 (32.1%) missing values (Missing)
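The percentages in the statistics and alerts follow from the raw counts. The sketch below reproduces that arithmetic in Python. It assumes, based on the figures rather than on anything the report states, that the cell-level percentage divides by observations times variables; the helper name `pct` is hypothetical.

```python
# Counts copied from the report.
n_rows = 277
n_cols = 5
missing_cells = 89      # the alert attributes all 89 to 사무소전화번호
duplicate_rows = 2


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


missing_cells_pct = pct(missing_cells, n_rows * n_cols)  # share of all cells
missing_column_pct = pct(missing_cells, n_rows)          # share of one column
duplicate_rows_pct = pct(duplicate_rows, n_rows)

print(missing_cells_pct, missing_column_pct, duplicate_rows_pct)
```

The same 89 missing values account for 6.4% of all cells but 32.1% of the one column that contains them, which is why only that column raises a Missing alert.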

Reproduction

Analysis started: 2023-12-12 09:52:27.445237
Analysis finished: 2023-12-12 09:52:28.236069
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

등록번호
Text

Distinct: 273
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 16
Median length: 16
Mean length: 13.66787
Min length: 8
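As a rough illustration of how these length statistics arise, the sketch below computes them for the five sample values listed for this variable, then checks the reported mean against the report's totals (3786 characters over 277 observations). It uses only the Python standard library and is not the profiler's own code.

```python
from statistics import mean, median

# The five sample values listed for this variable in the report.
sample = ["가-11-0613", "가-11-0636", "가-11-0677", "가-11-0710", "가-11-755"]

# len() counts Unicode code points, so the Hangul prefix counts as one.
lengths = [len(value) for value in sample]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(stats)

# Over the full column, mean length = total characters / observations.
full_column_mean = round(3786 / 277, 5)
print(full_column_mean)
```

The shortest sample value, 가-11-755, has 8 characters, which matches the reported minimum length.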

Characters and Unicode

Total characters: 3786
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
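A minimal sketch of the category counts, using Python's standard `unicodedata` module on one sample value. The mapping `CATEGORY_NAMES` and the function `category_counts` are hypothetical names introduced here. The standard library exposes the general category but not the script or block properties, so this sketch covers categories only.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes that occur here.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Lo": "Other Letter",
}


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category."""
    return Counter(
        CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
        for ch in text
    )


counts = category_counts("가-11-0613")
print(counts)
```

For this value the Hangul syllable falls under Other Letter, the two hyphens under Dash Punctuation, and the six digits under Decimal Number, the same three categories the report lists for the column.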

Unique

Unique: 270
Unique (%): 97.5%

Sample

1st row: 가-11-0613
2nd row: 가-11-0636
3rd row: 가-11-0677
4th row: 가-11-0710
5th row: 가-11-755

Value | Count | Frequency (%)
27110-2019-00013 | 3 | 1.1%
27110-2019-00025 | 2 | 0.7%
47840-2018-00010 | 2 | 0.7%
27110-2020-00075 | 1 | 0.4%
27110-2019-00007 | 1 | 0.4%
27110-2019-00024 | 1 | 0.4%
27110-2019-00009 | 1 | 0.4%
27110-2019-00020 | 1 | 0.4%
27110-2019-00018 | 1 | 0.4%
27110-2019-00014 | 1 | 0.4%
Other values (263) | 263 | 94.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 1093 | 28.9%
1 | 808 | 21.3%
- | 554 | 14.6%
2 | 513 | 13.5%
7 | 250 | 6.6%
4 | 95 | 2.5%
3 | 92 | 2.4%
9 | 78 | 2.1%
5 | 78 | 2.1%
 | 74 | 2.0%
Other values (3) | 151 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3140 | 82.9%
Dash Punctuation | 554 | 14.6%
Other Letter | 92 | 2.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1093 | 34.8%
1 | 808 | 25.7%
2 | 513 | 16.3%
7 | 250 | 8.0%
4 | 95 | 3.0%
3 | 92 | 2.9%
9 | 78 | 2.5%
5 | 78 | 2.5%
6 | 70 | 2.2%
8 | 63 | 2.0%

Other Letter
Value | Count | Frequency (%)
 | 74 | 80.4%
 | 18 | 19.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 554 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3694 | 97.6%
Hangul | 92 | 2.4%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 1093 | 29.6%
1 | 808 | 21.9%
- | 554 | 15.0%
2 | 513 | 13.9%
7 | 250 | 6.8%
4 | 95 | 2.6%
3 | 92 | 2.5%
9 | 78 | 2.1%
5 | 78 | 2.1%
6 | 70 | 1.9%

Hangul
Value | Count | Frequency (%)
 | 74 | 80.4%
 | 18 | 19.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3694 | 97.6%
Hangul | 92 | 2.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1093 | 29.6%
1 | 808 | 21.9%
- | 554 | 15.0%
2 | 513 | 13.9%
7 | 250 | 6.8%
4 | 95 | 2.6%
3 | 92 | 2.5%
9 | 78 | 2.1%
5 | 78 | 2.1%
6 | 70 | 1.9%

Hangul
Value | Count | Frequency (%)
 | 74 | 80.4%
 | 18 | 19.6%

사무소명
Text

Distinct: 273
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 17
Median length: 16
Mean length: 11.054152
Min length: 5

Characters and Unicode

Total characters: 3062
Distinct characters: 231
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 270
Unique (%): 97.5%

Sample

1st row: 새로운공인중개사사무소
2nd row: 정우공인중개사사무소
3rd row: 신삼성공인중개사사무소
4th row: 태백공인중개사사무소
5th row: 보성공인중개사사무소

Value | Count | Frequency (%)
에이블부동산중개법인주식회사 | 3 | 1.1%
큰손공인중개사사무소 | 2 | 0.7%
비안공인중개사사무소 | 2 | 0.7%
신대박공인중개사사무소 | 1 | 0.4%
공인중개사사무소백호부동산 | 1 | 0.4%
반월당제네스공인중개사사무소 | 1 | 0.4%
행복1번지공인중개사사무소 | 1 | 0.4%
동인탑공인중개사사무소 | 1 | 0.4%
라이프공인중개사사무소 | 1 | 0.4%
제네스타워부동산중개주식회사 | 1 | 0.4%
Other values (263) | 263 | 94.9%

Most occurring characters

Value | Count | Frequency (%)
 | 467 | 15.3%
 | 279 | 9.1%
 | 276 | 9.0%
 | 247 | 8.1%
 | 246 | 8.0%
 | 228 | 7.4%
 | 215 | 7.0%
 | 106 | 3.5%
 | 105 | 3.4%
 | 92 | 3.0%
Other values (221) | 801 | 26.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3028 | 98.9%
Uppercase Letter | 21 | 0.7%
Decimal Number | 12 | 0.4%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 467 | 15.4%
 | 279 | 9.2%
 | 276 | 9.1%
 | 247 | 8.2%
 | 246 | 8.1%
 | 228 | 7.5%
 | 215 | 7.1%
 | 106 | 3.5%
 | 105 | 3.5%
 | 92 | 3.0%
Other values (204) | 767 | 25.3%

Uppercase Letter
Value | Count | Frequency (%)
K | 4 | 19.0%
I | 3 | 14.3%
A | 2 | 9.5%
B | 2 | 9.5%
E | 2 | 9.5%
S | 1 | 4.8%
N | 1 | 4.8%
C | 1 | 4.8%
O | 1 | 4.8%
U | 1 | 4.8%
Other values (3) | 3 | 14.3%

Decimal Number
Value | Count | Frequency (%)
1 | 9 | 75.0%
4 | 2 | 16.7%
3 | 1 | 8.3%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3028 | 98.9%
Latin | 21 | 0.7%
Common | 13 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 467 | 15.4%
 | 279 | 9.2%
 | 276 | 9.1%
 | 247 | 8.2%
 | 246 | 8.1%
 | 228 | 7.5%
 | 215 | 7.1%
 | 106 | 3.5%
 | 105 | 3.5%
 | 92 | 3.0%
Other values (204) | 767 | 25.3%

Latin
Value | Count | Frequency (%)
K | 4 | 19.0%
I | 3 | 14.3%
A | 2 | 9.5%
B | 2 | 9.5%
E | 2 | 9.5%
S | 1 | 4.8%
N | 1 | 4.8%
C | 1 | 4.8%
O | 1 | 4.8%
U | 1 | 4.8%
Other values (3) | 3 | 14.3%

Common
Value | Count | Frequency (%)
1 | 9 | 69.2%
4 | 2 | 15.4%
. | 1 | 7.7%
3 | 1 | 7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3028 | 98.9%
ASCII | 34 | 1.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 467 | 15.4%
 | 279 | 9.2%
 | 276 | 9.1%
 | 247 | 8.2%
 | 246 | 8.1%
 | 228 | 7.5%
 | 215 | 7.1%
 | 106 | 3.5%
 | 105 | 3.5%
 | 92 | 3.0%
Other values (204) | 767 | 25.3%

ASCII
Value | Count | Frequency (%)
1 | 9 | 26.5%
K | 4 | 11.8%
I | 3 | 8.8%
A | 2 | 5.9%
B | 2 | 5.9%
E | 2 | 5.9%
4 | 2 | 5.9%
S | 1 | 2.9%
N | 1 | 2.9%
C | 1 | 2.9%
Other values (7) | 7 | 20.6%

중개업자명
Text

Distinct: 269
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9963899
Min length: 2

Characters and Unicode

Total characters: 830
Distinct characters: 141
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 262
Unique (%): 94.6%

Sample

1st row: 김동훈
2nd row: 방석곤
3rd row: 박재목
4th row: 김규만
5th row: 노용현

Value | Count | Frequency (%)
김명희 | 3 | 1.1%
우대용 | 2 | 0.7%
이영숙 | 2 | 0.7%
김창래 | 2 | 0.7%
이동욱 | 2 | 0.7%
이재홍 | 2 | 0.7%
김순옥 | 2 | 0.7%
최재혁 | 1 | 0.4%
진성희 | 1 | 0.4%
박명희 | 1 | 0.4%
Other values (259) | 259 | 93.5%

Most occurring characters

Value | Count | Frequency (%)
 | 58 | 7.0%
 | 48 | 5.8%
 | 34 | 4.1%
 | 30 | 3.6%
 | 19 | 2.3%
 | 17 | 2.0%
 | 17 | 2.0%
 | 16 | 1.9%
 | 16 | 1.9%
 | 16 | 1.9%
Other values (131) | 559 | 67.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 830 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 58 | 7.0%
 | 48 | 5.8%
 | 34 | 4.1%
 | 30 | 3.6%
 | 19 | 2.3%
 | 17 | 2.0%
 | 17 | 2.0%
 | 16 | 1.9%
 | 16 | 1.9%
 | 16 | 1.9%
Other values (131) | 559 | 67.3%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 830 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 58 | 7.0%
 | 48 | 5.8%
 | 34 | 4.1%
 | 30 | 3.6%
 | 19 | 2.3%
 | 17 | 2.0%
 | 17 | 2.0%
 | 16 | 1.9%
 | 16 | 1.9%
 | 16 | 1.9%
Other values (131) | 559 | 67.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 830 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 58 | 7.0%
 | 48 | 5.8%
 | 34 | 4.1%
 | 30 | 3.6%
 | 19 | 2.3%
 | 17 | 2.0%
 | 17 | 2.0%
 | 16 | 1.9%
 | 16 | 1.9%
 | 16 | 1.9%
Other values (131) | 559 | 67.3%

사무소전화번호
Text

MISSING 

Distinct: 185
Distinct (%): 98.4%
Missing: 89
Missing (%): 32.1%
Memory size: 2.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.015957
Min length: 12

Characters and Unicode

Total characters: 2259
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 97.3%

Sample

1st row: 053-425-8658
2nd row: 053-423-0079
3rd row: 053-421-4581
4th row: 053-261-3030
5th row: 053-428-4949

Value | Count | Frequency (%)
053-213-0700 | 3 | 1.6%
053-255-9993 | 2 | 1.1%
053-568-2018 | 1 | 0.5%
053-255-3740 | 1 | 0.5%
053-421-4980 | 1 | 0.5%
053-428-4545 | 1 | 0.5%
053-425-8658 | 1 | 0.5%
053-427-3577 | 1 | 0.5%
053-424-0125 | 1 | 0.5%
053-421-8775 | 1 | 0.5%
Other values (175) | 175 | 93.1%

Most occurring characters

Value | Count | Frequency (%)
- | 376 | 16.6%
0 | 362 | 16.0%
5 | 360 | 15.9%
3 | 284 | 12.6%
2 | 242 | 10.7%
4 | 190 | 8.4%
8 | 118 | 5.2%
7 | 101 | 4.5%
9 | 78 | 3.5%
1 | 77 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1883 | 83.4%
Dash Punctuation | 376 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 362 | 19.2%
5 | 360 | 19.1%
3 | 284 | 15.1%
2 | 242 | 12.9%
4 | 190 | 10.1%
8 | 118 | 6.3%
7 | 101 | 5.4%
9 | 78 | 4.1%
1 | 77 | 4.1%
6 | 71 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 376 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2259 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 376 | 16.6%
0 | 362 | 16.0%
5 | 360 | 15.9%
3 | 284 | 12.6%
2 | 242 | 10.7%
4 | 190 | 8.4%
8 | 118 | 5.2%
7 | 101 | 4.5%
9 | 78 | 3.5%
1 | 77 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2259 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 376 | 16.6%
0 | 362 | 16.0%
5 | 360 | 15.9%
3 | 284 | 12.6%
2 | 242 | 10.7%
4 | 190 | 8.4%
8 | 118 | 5.2%
7 | 101 | 4.5%
9 | 78 | 3.5%
1 | 77 | 3.4%

사무소주소
Text

Distinct: 262
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 51
Median length: 41
Mean length: 26.523466
Min length: 5

Characters and Unicode

Total characters: 7347
Distinct characters: 165
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 250
Unique (%): 90.3%

Sample

1st row: 대구광역시 중구 중앙대로 336 2층(남산동)
2nd row: 대구광역시 중구 동성로2길 1(봉산동)
3rd row: 대구광역시 중구 중앙대로 407-3(동일동)
4th row: 대구광역시 중구 동성로6길 25(공평동)
5th row: 대구광역시 중구 남산로13길 17 상가116동 102호(남산동 보성황실타운)

Value | Count | Frequency (%)
대구광역시 | 272 | 20.1%
중구 | 272 | 20.1%
달구벌대로 | 32 | 2.4%
1층 | 18 | 1.3%
국채보상로 | 16 | 1.2%
서성로 | 14 | 1.0%
중앙대로 | 14 | 1.0%
대봉로 | 12 | 0.9%
99 | 11 | 0.8%
2층 | 11 | 0.8%
Other values (416) | 681 | 50.3%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1242 | 16.9%
 | 600 | 8.2%
 | 393 | 5.3%
1 | 387 | 5.3%
 | 296 | 4.0%
 | 287 | 3.9%
 | 286 | 3.9%
 | 272 | 3.7%
 | 271 | 3.7%
 | 244 | 3.3%
Other values (155) | 3069 | 41.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4328 | 58.9%
Decimal Number | 1393 | 19.0%
Space Separator | 1242 | 16.9%
Open Punctuation | 153 | 2.1%
Close Punctuation | 153 | 2.1%
Dash Punctuation | 68 | 0.9%
Uppercase Letter | 6 | 0.1%
Lowercase Letter | 3 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 600 | 13.9%
 | 393 | 9.1%
 | 296 | 6.8%
 | 287 | 6.6%
 | 286 | 6.6%
 | 272 | 6.3%
 | 271 | 6.3%
 | 244 | 5.6%
 | 123 | 2.8%
 | 111 | 2.6%
Other values (136) | 1445 | 33.4%

Decimal Number
Value | Count | Frequency (%)
1 | 387 | 27.8%
2 | 178 | 12.8%
0 | 153 | 11.0%
3 | 151 | 10.8%
4 | 128 | 9.2%
7 | 92 | 6.6%
9 | 92 | 6.6%
5 | 88 | 6.3%
6 | 88 | 6.3%
8 | 36 | 2.6%

Uppercase Letter
Value | Count | Frequency (%)
E | 4 | 66.7%
W | 1 | 16.7%
B | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1242 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 153 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 153 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 68 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 3 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
· | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4328 | 58.9%
Common | 3010 | 41.0%
Latin | 9 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 600 | 13.9%
 | 393 | 9.1%
 | 296 | 6.8%
 | 287 | 6.6%
 | 286 | 6.6%
 | 272 | 6.3%
 | 271 | 6.3%
 | 244 | 5.6%
 | 123 | 2.8%
 | 111 | 2.6%
Other values (136) | 1445 | 33.4%

Common
Value | Count | Frequency (%)
(space) | 1242 | 41.3%
1 | 387 | 12.9%
2 | 178 | 5.9%
( | 153 | 5.1%
) | 153 | 5.1%
0 | 153 | 5.1%
3 | 151 | 5.0%
4 | 128 | 4.3%
7 | 92 | 3.1%
9 | 92 | 3.1%
Other values (5) | 281 | 9.3%

Latin
Value | Count | Frequency (%)
E | 4 | 44.4%
e | 3 | 33.3%
W | 1 | 11.1%
B | 1 | 11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4328 | 58.9%
ASCII | 3018 | 41.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1242 | 41.2%
1 | 387 | 12.8%
2 | 178 | 5.9%
( | 153 | 5.1%
) | 153 | 5.1%
0 | 153 | 5.1%
3 | 151 | 5.0%
4 | 128 | 4.2%
7 | 92 | 3.0%
9 | 92 | 3.0%
Other values (8) | 289 | 9.6%

Hangul
Value | Count | Frequency (%)
 | 600 | 13.9%
 | 393 | 9.1%
 | 296 | 6.8%
 | 287 | 6.6%
 | 286 | 6.6%
 | 272 | 6.3%
 | 271 | 6.3%
 | 244 | 5.6%
 | 123 | 2.8%
 | 111 | 2.6%
Other values (136) | 1445 | 33.4%

None
Value | Count | Frequency (%)
· | 1 | 100.0%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

 | 등록번호 | 사무소명 | 중개업자명 | 사무소전화번호 | 사무소주소
0 | 가-11-0613 | 새로운공인중개사사무소 | 김동훈 | 053-425-8658 | 대구광역시 중구 중앙대로 336 2층(남산동)
1 | 가-11-0636 | 정우공인중개사사무소 | 방석곤 | 053-423-0079 | 대구광역시 중구 동성로2길 1(봉산동)
2 | 가-11-0677 | 신삼성공인중개사사무소 | 박재목 | <NA> | 대구광역시 중구 중앙대로 407-3(동일동)
3 | 가-11-0710 | 태백공인중개사사무소 | 김규만 | 053-421-4581 | 대구광역시 중구 동성로6길 25(공평동)
4 | 가-11-755 | 보성공인중개사사무소 | 노용현 | <NA> | 대구광역시 중구 남산로13길 17 상가116동 102호(남산동 보성황실타운)
5 | 가-11-0848 | 태왕공인중개사사무소 | 백정애 | 053-261-3030 | 대구광역시 중구 동덕로 55 상가111동105호(대봉동 대봉태왕아너스)
6 | 가-11-0925 | 한미공인중개사사무소 | 손종오 | 053-428-4949 | 대구광역시 중구 중앙대로 459 1층(북성로1가)
7 | 가-11-0940 | 청운공인중개사사무소 | 박성현 | 053-427-4343 | 대구광역시 중구 동덕로 33 상가2호(대봉동 청운아파트)
8 | 가-11-0954 | 성심공인중개사사무소 | 김동욱 | 053-422-9833 | 대구광역시 중구 명덕로 71-7(남산동)
9 | 가-11-0989 | 리치공인중개사사무소 | 양성만 | 053-421-4150 | 대구광역시 중구 동성로2길 71 지하1층

 | 등록번호 | 사무소명 | 중개업자명 | 사무소전화번호 | 사무소주소
267 | 27110-2020-00067 | 다온부동산공인중개사사무소 | 김기욱 | 070-8098-1772 | 대구광역시 중구 종로 46-2 4층 (종로1가)
268 | 27110-2020-00068 | 대신솔로몬공인중개사사무소 | 신현보 | 053-253-2989 | 대구광역시 중구 달구벌대로 1943 102동102호 (대신동 대산e편한세상)
269 | 27110-2020-00069 | 로얄공인중개사사무소 | 홍승진 | <NA> | 대구광역시 중구 달성로21길 69 1층 (달성동)
270 | 27110-2020-00070 | 밸류플러스부동산중개사무소 | 권영진 | 053-568-2018 | 대구광역시 중구 국채보상로150길 7 2층
271 | 27110-2020-00071 | 하이공인중개사사무소 | 김승미 | <NA> | 대구광역시 중구 달성로21길 69(달성동)
272 | 27110-2020-00072 | 자이봄봄공인중개사사무소 | 임순매 | <NA> | 대구광역시 중구 서성로 99 상가103동119호 (대구역센트럴자이)
273 | 27110-2020-00073 | 풍경채봄봄공인중개사사무소 | 소영주 | 053-427-5955 | 대구광역시 중구 서성로 99 상가103동119호 (대구역센트럴자이)
274 | 27110-2020-00074 | 남산롯데스카이부동산중개사무소 | 문정순 | <NA> | 대구광역시 중구 달구벌대로 2020 401동102호(남산동 남산이편한세상)
275 | 27110-2020-00075 | 신대박공인중개사사무소 | 박병숙 | 053-255-1888 | 대구광역시 중구 남산로6안길 30
276 | 27110-2020-00076 | 동네한바퀴공인중개사사무소 | 박근영 | <NA> | 대구광역시 중구 남성로 60-1 2층 (동성로3가)

Duplicate rows

Most frequently occurring

 | 등록번호 | 사무소명 | 중개업자명 | 사무소전화번호 | 사무소주소 | # duplicates
0 | 27110-2019-00025 | 큰손공인중개사사무소 | 우대용 | <NA> | 대구광역시 중구 재마루길 104 | 2
1 | 47840-2018-00010 | 비안공인중개사사무소 | 김창래 | <NA> | 대구광역시 중구 국채보상로150길 7 | 2
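
A minimal sketch of how fully identical rows can be counted, using only the Python standard library rather than the profiler's own implementation. The rows are transcribed from this report, with `None` standing in for `<NA>`. Reading the flattened duplicate row as the address 재마루길 104 followed by a duplicate count of 2 is an assumption.

```python
from collections import Counter

# Each row is a tuple of the five columns. The third row comes from the
# sample table and is included only as a non-duplicated contrast.
rows = [
    ("27110-2019-00025", "큰손공인중개사사무소", "우대용", None,
     "대구광역시 중구 재마루길 104"),
    ("27110-2019-00025", "큰손공인중개사사무소", "우대용", None,
     "대구광역시 중구 재마루길 104"),
    ("27110-2020-00075", "신대박공인중개사사무소", "박병숙", "053-255-1888",
     "대구광역시 중구 남산로6안길 30"),
]

# A row counts as duplicated only when every column matches, including
# the missing phone number.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicates.items():
    print(row[0], row[1], n)
```

Both duplicated records in the report lack a phone number, so a comparison that treated missing values as never equal to each other would not flag them.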