Overview

Dataset statistics

Number of variables3
Number of observations95
Missing cells33
Missing cells (%)11.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory25.4 B

Variable types

Text3

Dataset

Description서울특별시 서대문구 관내 위치한 건축사사무소에 대한 현황( 사무소명, 주소, 전화번호)를 데이터로 제공합니다.
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15039582/fileData.do

Alerts

전화번호 has 33 (34.7%) missing valuesMissing
사무소명 has unique valuesUnique

Reproduction

Analysis started2024-04-17 20:29:57.606990
Analysis finished2024-04-17 20:29:58.167374
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사무소명
Text

UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size892.0 B
2024-04-18T05:29:58.316600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length11.421053
Min length7

Characters and Unicode

Total characters1085
Distinct characters163
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row유전건축사사무소
2nd row(주)건축사사무소서은
3rd row박양춘건축사사무소
4th row신세계건축사사무소
5th row인우 건축사사무소
ValueCountFrequency (%)
건축사사무소 25
 
17.4%
주식회사 13
 
9.0%
주)종합건축사사무소 2
 
1.4%
종합건축사사무소 2
 
1.4%
건축사 1
 
0.7%
라온건축사사무소 1
 
0.7%
연온재건축사사무소 1
 
0.7%
건담건축사사무소 1
 
0.7%
다온건축사사무소 1
 
0.7%
서현종합건축사사무소 1
 
0.7%
Other values (96) 96
66.7%
2024-04-18T05:29:58.634496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
205
18.9%
103
 
9.5%
102
 
9.4%
97
 
8.9%
96
 
8.8%
49
 
4.5%
31
 
2.9%
22
 
2.0%
( 20
 
1.8%
) 20
 
1.8%
Other values (153) 340
31.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 973
89.7%
Space Separator 49
 
4.5%
Open Punctuation 20
 
1.8%
Close Punctuation 20
 
1.8%
Uppercase Letter 12
 
1.1%
Lowercase Letter 6
 
0.6%
Decimal Number 3
 
0.3%
Dash Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
205
21.1%
103
 
10.6%
102
 
10.5%
97
 
10.0%
96
 
9.9%
31
 
3.2%
22
 
2.3%
15
 
1.5%
13
 
1.3%
13
 
1.3%
Other values (130) 276
28.4%
Uppercase Letter
ValueCountFrequency (%)
A 2
16.7%
D 2
16.7%
F 1
8.3%
S 1
8.3%
N 1
8.3%
P 1
8.3%
C 1
8.3%
M 1
8.3%
I 1
8.3%
L 1
8.3%
Lowercase Letter
ValueCountFrequency (%)
l 2
33.3%
a 1
16.7%
w 1
16.7%
s 1
16.7%
e 1
16.7%
Decimal Number
ValueCountFrequency (%)
4 1
33.3%
1 1
33.3%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 973
89.7%
Common 94
 
8.7%
Latin 18
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
205
21.1%
103
 
10.6%
102
 
10.5%
97
 
10.0%
96
 
9.9%
31
 
3.2%
22
 
2.3%
15
 
1.5%
13
 
1.3%
13
 
1.3%
Other values (130) 276
28.4%
Latin
ValueCountFrequency (%)
l 2
 
11.1%
A 2
 
11.1%
D 2
 
11.1%
F 1
 
5.6%
S 1
 
5.6%
a 1
 
5.6%
w 1
 
5.6%
N 1
 
5.6%
s 1
 
5.6%
e 1
 
5.6%
Other values (5) 5
27.8%
Common
ValueCountFrequency (%)
49
52.1%
( 20
21.3%
) 20
21.3%
4 1
 
1.1%
- 1
 
1.1%
. 1
 
1.1%
1 1
 
1.1%
2 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 973
89.7%
ASCII 112
 
10.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
205
21.1%
103
 
10.6%
102
 
10.5%
97
 
10.0%
96
 
9.9%
31
 
3.2%
22
 
2.3%
15
 
1.5%
13
 
1.3%
13
 
1.3%
Other values (130) 276
28.4%
ASCII
ValueCountFrequency (%)
49
43.8%
( 20
17.9%
) 20
17.9%
l 2
 
1.8%
A 2
 
1.8%
D 2
 
1.8%
F 1
 
0.9%
S 1
 
0.9%
a 1
 
0.9%
w 1
 
0.9%
Other values (13) 13
 
11.6%

주소
Text

Distinct92
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Memory size892.0 B
2024-04-18T05:29:58.878273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length35
Mean length26.715789
Min length17

Characters and Unicode

Total characters2538
Distinct characters129
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)93.7%

Sample

1st row서울특별시 서대문구 연희로 245
2nd row서울특별시 서대문구 연희로 263
3rd row서울특별시 서대문구 연희로36길 10
4th row서울특별시 서대문구 연희로11가길 2
5th row서울특별시 서대문구 응암로 115, 4층
ValueCountFrequency (%)
서울특별시 95
 
18.6%
서대문구 95
 
18.6%
연희로 20
 
3.9%
3층 13
 
2.5%
대지 7
 
1.4%
2층 7
 
1.4%
4층 6
 
1.2%
충정로 6
 
1.2%
1층 6
 
1.2%
연희동 5
 
1.0%
Other values (185) 251
49.1%
2024-04-18T05:29:59.214648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
16.4%
192
 
7.6%
1 117
 
4.6%
110
 
4.3%
99
 
3.9%
97
 
3.8%
97
 
3.8%
96
 
3.8%
95
 
3.7%
95
 
3.7%
Other values (119) 1123
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1496
58.9%
Decimal Number 483
 
19.0%
Space Separator 417
 
16.4%
Other Punctuation 79
 
3.1%
Dash Punctuation 26
 
1.0%
Open Punctuation 14
 
0.6%
Close Punctuation 14
 
0.6%
Uppercase Letter 8
 
0.3%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
12.8%
110
 
7.4%
99
 
6.6%
97
 
6.5%
97
 
6.5%
96
 
6.4%
95
 
6.4%
95
 
6.4%
89
 
5.9%
49
 
3.3%
Other values (100) 477
31.9%
Decimal Number
ValueCountFrequency (%)
1 117
24.2%
2 90
18.6%
3 64
13.3%
0 47
9.7%
5 36
 
7.5%
4 31
 
6.4%
9 29
 
6.0%
7 26
 
5.4%
6 22
 
4.6%
8 21
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
B 4
50.0%
A 3
37.5%
F 1
 
12.5%
Space Separator
ValueCountFrequency (%)
417
100.0%
Other Punctuation
ValueCountFrequency (%)
, 79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1496
58.9%
Common 1033
40.7%
Latin 9
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
12.8%
110
 
7.4%
99
 
6.6%
97
 
6.5%
97
 
6.5%
96
 
6.4%
95
 
6.4%
95
 
6.4%
89
 
5.9%
49
 
3.3%
Other values (100) 477
31.9%
Common
ValueCountFrequency (%)
417
40.4%
1 117
 
11.3%
2 90
 
8.7%
, 79
 
7.6%
3 64
 
6.2%
0 47
 
4.5%
5 36
 
3.5%
4 31
 
3.0%
9 29
 
2.8%
7 26
 
2.5%
Other values (5) 97
 
9.4%
Latin
ValueCountFrequency (%)
B 4
44.4%
A 3
33.3%
F 1
 
11.1%
b 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1496
58.9%
ASCII 1042
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
417
40.0%
1 117
 
11.2%
2 90
 
8.6%
, 79
 
7.6%
3 64
 
6.1%
0 47
 
4.5%
5 36
 
3.5%
4 31
 
3.0%
9 29
 
2.8%
7 26
 
2.5%
Other values (9) 106
 
10.2%
Hangul
ValueCountFrequency (%)
192
12.8%
110
 
7.4%
99
 
6.6%
97
 
6.5%
97
 
6.5%
96
 
6.4%
95
 
6.4%
95
 
6.4%
89
 
5.9%
49
 
3.3%
Other values (100) 477
31.9%

전화번호
Text

MISSING 

Distinct61
Distinct (%)98.4%
Missing33
Missing (%)34.7%
Memory size892.0 B
2024-04-18T05:29:59.403655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.370968
Min length11

Characters and Unicode

Total characters705
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)96.8%

Sample

1st row02-324-3810
2nd row02-375-3301
3rd row02-338-2651
4th row02-3143-0583
5th row02-322-4852
ValueCountFrequency (%)
02 4
 
6.1%
02-393-1136 2
 
3.0%
02-714-3433 1
 
1.5%
02-324-3810 1
 
1.5%
02-391-7100 1
 
1.5%
02-3445-3866 1
 
1.5%
070-8095-5195 1
 
1.5%
02-322-5460 1
 
1.5%
02-336-6869 1
 
1.5%
02-598-8114 1
 
1.5%
Other values (52) 52
78.8%
2024-04-18T05:29:59.698671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 124
17.6%
2 116
16.5%
0 94
13.3%
3 93
13.2%
5 49
 
7.0%
4 47
 
6.7%
1 46
 
6.5%
8 37
 
5.2%
6 33
 
4.7%
9 32
 
4.5%
Other values (2) 34
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 577
81.8%
Dash Punctuation 124
 
17.6%
Space Separator 4
 
0.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 116
20.1%
0 94
16.3%
3 93
16.1%
5 49
8.5%
4 47
8.1%
1 46
 
8.0%
8 37
 
6.4%
6 33
 
5.7%
9 32
 
5.5%
7 30
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 705
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 124
17.6%
2 116
16.5%
0 94
13.3%
3 93
13.2%
5 49
 
7.0%
4 47
 
6.7%
1 46
 
6.5%
8 37
 
5.2%
6 33
 
4.7%
9 32
 
4.5%
Other values (2) 34
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 705
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 124
17.6%
2 116
16.5%
0 94
13.3%
3 93
13.2%
5 49
 
7.0%
4 47
 
6.7%
1 46
 
6.5%
8 37
 
5.2%
6 33
 
4.7%
9 32
 
4.5%
Other values (2) 34
 
4.8%

Correlations

2024-04-18T05:29:59.786034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사무소명주소전화번호
사무소명1.0001.0001.000
주소1.0001.0000.998
전화번호1.0000.9981.000

Missing values

2024-04-18T05:29:58.143508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사무소명주소전화번호
0유전건축사사무소서울특별시 서대문구 연희로 24502-324-3810
1(주)건축사사무소서은서울특별시 서대문구 연희로 26302-375-3301
2박양춘건축사사무소서울특별시 서대문구 연희로36길 1002-338-2651
3신세계건축사사무소서울특별시 서대문구 연희로11가길 202-3143-0583
4인우 건축사사무소서울특별시 서대문구 응암로 115, 4층<NA>
5국제건축사사무소서울특별시 서대문구 연희로 245<NA>
6대상건축사사무소서울특별시 서대문구 연희동 대지 169-1402-322-4852
7(주)아키프라자건축사사무소서울특별시 서대문구 연희로 261-28, 4층 402호02-3143-2994
8건축사사무소 늘푸른서울특별시 서대문구 연희로36길 10, 201호02-334-1252
9(주)씨앤에이건축사사무소서울특별시 서대문구 증가로 9, 범우빌딩 3층02-324-2203
사무소명주소전화번호
85마인드맵건축사사무소서울특별시 서대문구 성산로 527, 3층 1호 (하늬솔 A동)<NA>
86어느새건축사사무소서울특별시 서대문구 연희맛로 17-13, 2층 202호02-332-6412
87에이에이아키그룹건축사사무소(주)서울특별시 서대문구 충정로 23, 풍산빌딩 6층02-2187-0500
88건축사사무소 대안공간서울특별시 서대문구 신촌로 63, 901호02-6481-1934
89피엔피디 건축사사무소서울특별시 서대문구 증가로10길 96-9, 101호<NA>
90건축사사무소 라운디드서울특별시 서대문구 연대동문길 71, B1<NA>
91건축사사무소 만화기획서울특별시 서대문구 가좌로 36, 202호<NA>
92건축사사무소 오구사서울특별시 서대문구 홍은중앙로9길 51, 건축사사무소 오구사<NA>
93건축사사무소 태태서울특별시 서대문구 연희로 63, 연희체스트빌 216호<NA>
94이준구건축사사무소서울특별시 서대문구 연희로 263, 3층<NA>