Overview

Dataset statistics

Number of variables3
Number of observations927
Missing cells171
Missing cells (%)6.1%
Duplicate rows4
Duplicate rows (%)0.4%
Total size in memory21.9 KiB
Average record size in memory24.1 B

Variable types

Text3

Dataset

Description서울특별시 용산구 부동산중개업소 현황(부동산 중개업 상호명, 부동산 중개업소 주소, 전화번호)에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15090472/fileData.do

Alerts

Dataset has 4 (0.4%) duplicate rowsDuplicates
사무소전화번호 has 171 (18.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:29:52.687414
Analysis finished2023-12-12 22:29:53.275444
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct770
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-13T07:29:53.471481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length11.77562
Min length6

Characters and Unicode

Total characters10916
Distinct characters390
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique674 ?
Unique (%)72.7%

Sample

1st row한남정다운공인중개사사무소
2nd row성심공인중개사사무소
3rd row리치부동산공인중개사사무소
4th rowOK써밋부동산중개사무소
5th row온누리공인중개사사무소
ValueCountFrequency (%)
주식회사 13
 
1.4%
우리공인중개사사무소 11
 
1.1%
공인중개사사무소 7
 
0.7%
중개법인스타빌 7
 
0.7%
미래공인중개사사무소 5
 
0.5%
베스트공인중개사사무소 5
 
0.5%
행운공인중개사사무소 5
 
0.5%
삼성공인중개사사무소 5
 
0.5%
한강공인중개사사무소 5
 
0.5%
조은공인중개사사무소 4
 
0.4%
Other values (771) 895
93.0%
2023-12-13T07:29:53.910752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1719
15.7%
933
 
8.5%
933
 
8.5%
893
 
8.2%
872
 
8.0%
867
 
7.9%
817
 
7.5%
338
 
3.1%
276
 
2.5%
261
 
2.4%
Other values (380) 3007
27.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10615
97.2%
Decimal Number 84
 
0.8%
Uppercase Letter 73
 
0.7%
Lowercase Letter 38
 
0.3%
Space Separator 35
 
0.3%
Close Punctuation 28
 
0.3%
Open Punctuation 28
 
0.3%
Dash Punctuation 8
 
0.1%
Other Punctuation 4
 
< 0.1%
Letter Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1719
16.2%
933
 
8.8%
933
 
8.8%
893
 
8.4%
872
 
8.2%
867
 
8.2%
817
 
7.7%
338
 
3.2%
276
 
2.6%
261
 
2.5%
Other values (328) 2706
25.5%
Uppercase Letter
ValueCountFrequency (%)
K 13
17.8%
A 10
13.7%
C 8
11.0%
J 7
9.6%
O 6
8.2%
S 5
 
6.8%
B 4
 
5.5%
E 3
 
4.1%
R 3
 
4.1%
L 3
 
4.1%
Other values (8) 11
15.1%
Lowercase Letter
ValueCountFrequency (%)
e 8
21.1%
i 5
13.2%
a 5
13.2%
o 3
 
7.9%
c 3
 
7.9%
l 3
 
7.9%
b 2
 
5.3%
h 1
 
2.6%
s 1
 
2.6%
g 1
 
2.6%
Other values (6) 6
15.8%
Decimal Number
ValueCountFrequency (%)
1 31
36.9%
4 13
15.5%
7 9
 
10.7%
0 9
 
10.7%
2 8
 
9.5%
6 4
 
4.8%
5 3
 
3.6%
3 3
 
3.6%
8 2
 
2.4%
9 2
 
2.4%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
. 1
 
25.0%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10612
97.2%
Common 187
 
1.7%
Latin 114
 
1.0%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1719
16.2%
933
 
8.8%
933
 
8.8%
893
 
8.4%
872
 
8.2%
867
 
8.2%
817
 
7.7%
338
 
3.2%
276
 
2.6%
261
 
2.5%
Other values (325) 2703
25.5%
Latin
ValueCountFrequency (%)
K 13
 
11.4%
A 10
 
8.8%
C 8
 
7.0%
e 8
 
7.0%
J 7
 
6.1%
O 6
 
5.3%
S 5
 
4.4%
i 5
 
4.4%
a 5
 
4.4%
B 4
 
3.5%
Other values (26) 43
37.7%
Common
ValueCountFrequency (%)
35
18.7%
1 31
16.6%
) 28
15.0%
( 28
15.0%
4 13
 
7.0%
7 9
 
4.8%
0 9
 
4.8%
- 8
 
4.3%
2 8
 
4.3%
6 4
 
2.1%
Other values (6) 14
 
7.5%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10612
97.2%
ASCII 298
 
2.7%
Number Forms 3
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1719
16.2%
933
 
8.8%
933
 
8.8%
893
 
8.4%
872
 
8.2%
867
 
8.2%
817
 
7.7%
338
 
3.2%
276
 
2.6%
261
 
2.5%
Other values (325) 2703
25.5%
ASCII
ValueCountFrequency (%)
35
 
11.7%
1 31
 
10.4%
) 28
 
9.4%
( 28
 
9.4%
K 13
 
4.4%
4 13
 
4.4%
A 10
 
3.4%
7 9
 
3.0%
0 9
 
3.0%
C 8
 
2.7%
Other values (40) 114
38.3%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

주소
Text

Distinct882
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-13T07:29:54.300151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length46
Mean length28.774542
Min length17

Characters and Unicode

Total characters26674
Distinct characters234
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique843 ?
Unique (%)90.9%

Sample

1st row서울특별시 용산구 장문로 87-1 102호(보광동)
2nd row서울특별시 용산구 원효로19길 61 101호(금강프라임빌, 원효로3가)
3rd row서울특별시 용산구 후암로28길 38 1층 101호
4th row서울특별시 용산구 한강대로 69 상가동 102호(한강로2가, 용산푸르지오써밋)
5th row서울특별시 용산구 후암로28길 12 ,1층1호 (후암동)
ValueCountFrequency (%)
서울특별시 927
 
17.9%
용산구 923
 
17.9%
1층 296
 
5.7%
한강대로 96
 
1.9%
서빙고로 67
 
1.3%
이촌로 57
 
1.1%
보광로 40
 
0.8%
한남동 39
 
0.8%
17 37
 
0.7%
효창원로 36
 
0.7%
Other values (997) 2647
51.2%
2023-12-13T07:29:54.901657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4335
 
16.3%
1 1546
 
5.8%
1075
 
4.0%
1053
 
3.9%
1053
 
3.9%
1043
 
3.9%
978
 
3.7%
932
 
3.5%
930
 
3.5%
927
 
3.5%
Other values (224) 12802
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16052
60.2%
Decimal Number 4710
 
17.7%
Space Separator 4335
 
16.3%
Open Punctuation 572
 
2.1%
Close Punctuation 570
 
2.1%
Other Punctuation 223
 
0.8%
Dash Punctuation 155
 
0.6%
Uppercase Letter 53
 
0.2%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1075
 
6.7%
1053
 
6.6%
1053
 
6.6%
1043
 
6.5%
978
 
6.1%
932
 
5.8%
930
 
5.8%
927
 
5.8%
927
 
5.8%
516
 
3.2%
Other values (196) 6618
41.2%
Decimal Number
ValueCountFrequency (%)
1 1546
32.8%
2 662
14.1%
0 480
 
10.2%
3 397
 
8.4%
4 352
 
7.5%
5 288
 
6.1%
6 260
 
5.5%
7 259
 
5.5%
9 246
 
5.2%
8 220
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 24
45.3%
A 9
 
17.0%
C 7
 
13.2%
D 3
 
5.7%
K 3
 
5.7%
L 2
 
3.8%
G 2
 
3.8%
S 1
 
1.9%
E 1
 
1.9%
J 1
 
1.9%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
b 1
25.0%
k 1
25.0%
Space Separator
ValueCountFrequency (%)
4335
100.0%
Open Punctuation
ValueCountFrequency (%)
( 572
100.0%
Close Punctuation
ValueCountFrequency (%)
) 570
100.0%
Other Punctuation
ValueCountFrequency (%)
, 223
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 155
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16052
60.2%
Common 10565
39.6%
Latin 57
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1075
 
6.7%
1053
 
6.6%
1053
 
6.6%
1043
 
6.5%
978
 
6.1%
932
 
5.8%
930
 
5.8%
927
 
5.8%
927
 
5.8%
516
 
3.2%
Other values (196) 6618
41.2%
Common
ValueCountFrequency (%)
4335
41.0%
1 1546
 
14.6%
2 662
 
6.3%
( 572
 
5.4%
) 570
 
5.4%
0 480
 
4.5%
3 397
 
3.8%
4 352
 
3.3%
5 288
 
2.7%
6 260
 
2.5%
Other values (5) 1103
 
10.4%
Latin
ValueCountFrequency (%)
B 24
42.1%
A 9
 
15.8%
C 7
 
12.3%
D 3
 
5.3%
K 3
 
5.3%
c 2
 
3.5%
L 2
 
3.5%
G 2
 
3.5%
b 1
 
1.8%
S 1
 
1.8%
Other values (3) 3
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16052
60.2%
ASCII 10622
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4335
40.8%
1 1546
 
14.6%
2 662
 
6.2%
( 572
 
5.4%
) 570
 
5.4%
0 480
 
4.5%
3 397
 
3.7%
4 352
 
3.3%
5 288
 
2.7%
6 260
 
2.4%
Other values (18) 1160
 
10.9%
Hangul
ValueCountFrequency (%)
1075
 
6.7%
1053
 
6.6%
1053
 
6.6%
1043
 
6.5%
978
 
6.1%
932
 
5.8%
930
 
5.8%
927
 
5.8%
927
 
5.8%
516
 
3.2%
Other values (196) 6618
41.2%

사무소전화번호
Text

MISSING 

Distinct732
Distinct (%)96.8%
Missing171
Missing (%)18.4%
Memory size7.4 KiB
2023-12-13T07:29:55.218288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length180
Median length11
Mean length12.878307
Min length9

Characters and Unicode

Total characters9736
Distinct characters15
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique714 ?
Unique (%)94.4%

Sample

1st row02-794-2226
2nd row02-718-9787
3rd row02-757-7701
4th row02-771-4969
5th row02-6232-7220, 02-3447-1012, 02-3447-1013, 02-3447-1016
ValueCountFrequency (%)
02-798-8990 7
 
0.8%
02-6235-1133~1136 3
 
0.4%
02-711-4969 3
 
0.4%
02-6952-8885 2
 
0.2%
02-3447-1016 2
 
0.2%
02-3447-1013 2
 
0.2%
02-3447-1012 2
 
0.2%
02-6232-7220 2
 
0.2%
02-749-7377 2
 
0.2%
02-795-7768 2
 
0.2%
Other values (805) 825
96.8%
2023-12-13T07:29:55.719614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1712
17.6%
- 1711
17.6%
7 1292
13.3%
2 1252
12.9%
9 883
9.1%
1 533
 
5.5%
8 491
 
5.0%
4 488
 
5.0%
5 442
 
4.5%
3 423
 
4.3%
Other values (5) 509
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7824
80.4%
Dash Punctuation 1711
 
17.6%
Other Punctuation 101
 
1.0%
Space Separator 97
 
1.0%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1712
21.9%
7 1292
16.5%
2 1252
16.0%
9 883
11.3%
1 533
 
6.8%
8 491
 
6.3%
4 488
 
6.2%
5 442
 
5.6%
3 423
 
5.4%
6 308
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 100
99.0%
/ 1
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 1711
100.0%
Space Separator
ValueCountFrequency (%)
97
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9736
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1712
17.6%
- 1711
17.6%
7 1292
13.3%
2 1252
12.9%
9 883
9.1%
1 533
 
5.5%
8 491
 
5.0%
4 488
 
5.0%
5 442
 
4.5%
3 423
 
4.3%
Other values (5) 509
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9736
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1712
17.6%
- 1711
17.6%
7 1292
13.3%
2 1252
12.9%
9 883
9.1%
1 533
 
5.5%
8 491
 
5.0%
4 488
 
5.0%
5 442
 
4.5%
3 423
 
4.3%
Other values (5) 509
 
5.2%

Missing values

2023-12-13T07:29:53.160538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:29:53.235737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명주소사무소전화번호
0한남정다운공인중개사사무소서울특별시 용산구 장문로 87-1 102호(보광동)02-794-2226
1성심공인중개사사무소서울특별시 용산구 원효로19길 61 101호(금강프라임빌, 원효로3가)02-718-9787
2리치부동산공인중개사사무소서울특별시 용산구 후암로28길 38 1층 101호02-757-7701
3OK써밋부동산중개사무소서울특별시 용산구 한강대로 69 상가동 102호(한강로2가, 용산푸르지오써밋)<NA>
4온누리공인중개사사무소서울특별시 용산구 후암로28길 12 ,1층1호 (후암동)02-771-4969
5레몬트리부동산중개(주) 마포점서울특별시 마포구 동교로 209-4 5층(동교동, 프로빌딩)02-6232-7220, 02-3447-1012, 02-3447-1013, 02-3447-1016
6경남공인중개사사무소서울특별시 용산구 이촌로 290 1층 점포5호(점보아파트상가)02-790-3423
7힐튼공인중개사사무소서울특별시 용산구 후암로 6402-2332-0123
8한남114공인중개사사무소서울특별시 용산구 대사관로30가길 9 , 1층(한남동)02-790-282802-790-8400
9수공인중개사사무소서울특별시 용산구 서빙고로 67 상가동 지하1층 비8호02-795-8700
상호명주소사무소전화번호
917에버그린부동산공인중개사사무소서울특별시 용산구 후암로34가길 4 1층(후암동)02-2088-8253
918삼오부동산중개인사무소서울특별시 용산구 이촌로 264 114호 (이촌1동, 삼익상가)02-793-0013
919나이스공인중개사사무소서울특별시 용산구 한강대로 69 상가동 110-2호(한강로2가, 용산푸르지오써밋)02-749-2200, 02-792-3303
920리치공인중개사사무소서울특별시 용산구 한강대로 14502-798-8999
921더 파크사이드 서울 공인중개사사무소서울특별시 용산구 한강대로104길 50 1층 (동자동)02-754-1050
922비타민공인중개사사무소서울특별시 용산구 효창원로 101-3 1층02-714-8882
923ACE공인중개사사무소서울특별시 용산구 신흥로 128 1층02-790-7700
924베니스공인중개사사무소서울특별시 용산구 원효로89길 13 1층02-313-6101, 02-6358-8585, 02-3272-8324
925부동산키움공인중개사사무소서울특별시 용산구 대사관로 69 1층 일부 (한남동)02-795-6777
926원부동산컨설팅공인중개사사무소서울특별시 용산구 녹사평대로26가길 5 1층(이태원동)<NA>

Duplicate rows

Most frequently occurring

상호명주소사무소전화번호# duplicates
3중개법인스타빌 주식회사서울특별시 용산구 대사관로 48 비동 지층 (한남동)02-798-89907
0대승부동산공인중개사사무소서울특별시 용산구 이태원로 211 917호(한남동)02-6235-1133~1136, 6235-11392
1부동산중개법인마네주식회사서울특별시 용산구 이태원로27길 101 지하1층 일부 (한남동)02-6953-6100, 02-6952-6100, 02-6953-6400, 02-501-8770, 02-6952-8885, 02-6953-61032
2주식회사맨해튼부동산중개법인서울특별시 용산구 장문로 24 지1층 특실(동빙고동,라이온스아파트)02-794-27772