Overview

Dataset statistics

Number of variables4
Number of observations194
Missing cells68
Missing cells (%)8.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.4 KiB
Average record size in memory33.7 B

Variable types

Numeric1
Text3

Dataset

Description서울특별시 강서구 건축사사무소 현황정보입니다.연번, 건축사사무소명, 도로명주소, 전화번호 정보를 제공합니다.
Author서울특별시 강서구
URLhttps://www.data.go.kr/data/15126209/fileData.do

Alerts

전화번호 has 68 (35.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 14:46:07.185981
Analysis finished2024-03-14 14:46:08.112894
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct194
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean97.5
Minimum1
Maximum194
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2024-03-14T23:46:08.248508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.65
Q149.25
median97.5
Q3145.75
95-th percentile184.35
Maximum194
Range193
Interquartile range (IQR)96.5

Descriptive statistics

Standard deviation56.147128
Coefficient of variation (CV)0.57586798
Kurtosis-1.2
Mean97.5
Median Absolute Deviation (MAD)48.5
Skewness0
Sum18915
Variance3152.5
MonotonicityStrictly increasing
2024-03-14T23:46:08.512125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
147 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
127 1
 
0.5%
128 1
 
0.5%
129 1
 
0.5%
130 1
 
0.5%
131 1
 
0.5%
132 1
 
0.5%
Other values (184) 184
94.8%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
194 1
0.5%
193 1
0.5%
192 1
0.5%
191 1
0.5%
190 1
0.5%
189 1
0.5%
188 1
0.5%
187 1
0.5%
186 1
0.5%
185 1
0.5%
Distinct186
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-03-14T23:46:09.374065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length11.226804
Min length7

Characters and Unicode

Total characters2178
Distinct characters212
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)91.8%

Sample

1st row주)세현건축사사무소
2nd row김명선건축사사무소
3rd row홍건축사사무소
4th row천지건축사사무소
5th row(주)유용준건축사사무소
ValueCountFrequency (%)
건축사사무소 60
 
20.8%
주식회사 19
 
6.6%
주)건축사사무소 3
 
1.0%
종합건축사사무소 3
 
1.0%
주)종합건축사사무소 2
 
0.7%
주)건축사사무소도시에스제이 2
 
0.7%
고유 2
 
0.7%
우리건축 2
 
0.7%
주)우인엔지니어링건축사사무소 2
 
0.7%
주)솔빛엔지니어링건축사사무소 2
 
0.7%
Other values (188) 191
66.3%
2024-03-14T23:46:10.686236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
19.1%
206
 
9.5%
201
 
9.2%
196
 
9.0%
195
 
9.0%
97
 
4.5%
72
 
3.3%
) 47
 
2.2%
( 46
 
2.1%
43
 
2.0%
Other values (202) 658
30.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1941
89.1%
Space Separator 97
 
4.5%
Close Punctuation 47
 
2.2%
Open Punctuation 46
 
2.1%
Uppercase Letter 23
 
1.1%
Decimal Number 11
 
0.5%
Lowercase Letter 9
 
0.4%
Other Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
417
21.5%
206
 
10.6%
201
 
10.4%
196
 
10.1%
195
 
10.0%
72
 
3.7%
43
 
2.2%
30
 
1.5%
28
 
1.4%
27
 
1.4%
Other values (170) 526
27.1%
Uppercase Letter
ValueCountFrequency (%)
A 4
17.4%
S 3
13.0%
H 2
8.7%
I 2
8.7%
P 2
8.7%
Y 2
8.7%
M 2
8.7%
Z 1
 
4.3%
O 1
 
4.3%
G 1
 
4.3%
Other values (3) 3
13.0%
Lowercase Letter
ValueCountFrequency (%)
u 2
22.2%
x 1
11.1%
o 1
11.1%
i 1
11.1%
d 1
11.1%
t 1
11.1%
s 1
11.1%
l 1
11.1%
Decimal Number
ValueCountFrequency (%)
2 3
27.3%
0 3
27.3%
1 2
18.2%
6 1
 
9.1%
5 1
 
9.1%
9 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
& 1
 
25.0%
Space Separator
ValueCountFrequency (%)
97
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1941
89.1%
Common 205
 
9.4%
Latin 32
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
417
21.5%
206
 
10.6%
201
 
10.4%
196
 
10.1%
195
 
10.0%
72
 
3.7%
43
 
2.2%
30
 
1.5%
28
 
1.4%
27
 
1.4%
Other values (170) 526
27.1%
Latin
ValueCountFrequency (%)
A 4
 
12.5%
S 3
 
9.4%
H 2
 
6.2%
I 2
 
6.2%
u 2
 
6.2%
P 2
 
6.2%
Y 2
 
6.2%
M 2
 
6.2%
x 1
 
3.1%
Z 1
 
3.1%
Other values (11) 11
34.4%
Common
ValueCountFrequency (%)
97
47.3%
) 47
22.9%
( 46
22.4%
. 3
 
1.5%
2 3
 
1.5%
0 3
 
1.5%
1 2
 
1.0%
6 1
 
0.5%
5 1
 
0.5%
9 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1941
89.1%
ASCII 237
 
10.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
417
21.5%
206
 
10.6%
201
 
10.4%
196
 
10.1%
195
 
10.0%
72
 
3.7%
43
 
2.2%
30
 
1.5%
28
 
1.4%
27
 
1.4%
Other values (170) 526
27.1%
ASCII
ValueCountFrequency (%)
97
40.9%
) 47
19.8%
( 46
19.4%
A 4
 
1.7%
. 3
 
1.3%
2 3
 
1.3%
0 3
 
1.3%
S 3
 
1.3%
H 2
 
0.8%
I 2
 
0.8%
Other values (22) 27
 
11.4%
Distinct184
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-03-14T23:46:11.951808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length40
Mean length32.793814
Min length17

Characters and Unicode

Total characters6362
Distinct characters188
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique175 ?
Unique (%)90.2%

Sample

1st row서울특별시 강서구 화곡로 297, 파크뷰에버 오피스텔705호
2nd row서울특별시 강서구 화곡로 313, (화곡동)401호
3rd row서울특별시 강서구 화곡로 313
4th row서울특별시 강서구 화곡로 296, 강서I'PARK 311호
5th row서울특별시 강서구 공항대로 46
ValueCountFrequency (%)
서울특별시 194
 
16.5%
강서구 194
 
16.5%
공항대로 47
 
4.0%
화곡로 21
 
1.8%
마곡중앙로 17
 
1.4%
마곡중앙6로 15
 
1.3%
b동 11
 
0.9%
3층 10
 
0.8%
5층 10
 
0.8%
161-8 10
 
0.8%
Other values (382) 648
55.1%
2024-03-14T23:46:13.397929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
987
 
15.5%
421
 
6.6%
1 322
 
5.1%
, 233
 
3.7%
219
 
3.4%
2 208
 
3.3%
197
 
3.1%
195
 
3.1%
194
 
3.0%
194
 
3.0%
Other values (178) 3192
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3633
57.1%
Decimal Number 1324
 
20.8%
Space Separator 987
 
15.5%
Other Punctuation 234
 
3.7%
Close Punctuation 55
 
0.9%
Open Punctuation 55
 
0.9%
Uppercase Letter 45
 
0.7%
Dash Punctuation 24
 
0.4%
Lowercase Letter 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
421
 
11.6%
219
 
6.0%
197
 
5.4%
195
 
5.4%
194
 
5.3%
194
 
5.3%
194
 
5.3%
194
 
5.3%
170
 
4.7%
143
 
3.9%
Other values (147) 1512
41.6%
Decimal Number
ValueCountFrequency (%)
1 322
24.3%
2 208
15.7%
0 158
11.9%
6 130
9.8%
4 108
 
8.2%
5 98
 
7.4%
3 94
 
7.1%
8 91
 
6.9%
7 58
 
4.4%
9 57
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
B 16
35.6%
A 12
26.7%
C 5
 
11.1%
I 3
 
6.7%
P 3
 
6.7%
V 2
 
4.4%
M 1
 
2.2%
N 1
 
2.2%
K 1
 
2.2%
R 1
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
p 1
33.3%
i 1
33.3%
v 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 233
99.6%
' 1
 
0.4%
Space Separator
ValueCountFrequency (%)
987
100.0%
Close Punctuation
ValueCountFrequency (%)
) 55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3633
57.1%
Common 2681
42.1%
Latin 48
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
421
 
11.6%
219
 
6.0%
197
 
5.4%
195
 
5.4%
194
 
5.3%
194
 
5.3%
194
 
5.3%
194
 
5.3%
170
 
4.7%
143
 
3.9%
Other values (147) 1512
41.6%
Common
ValueCountFrequency (%)
987
36.8%
1 322
 
12.0%
, 233
 
8.7%
2 208
 
7.8%
0 158
 
5.9%
6 130
 
4.8%
4 108
 
4.0%
5 98
 
3.7%
3 94
 
3.5%
8 91
 
3.4%
Other values (8) 252
 
9.4%
Latin
ValueCountFrequency (%)
B 16
33.3%
A 12
25.0%
C 5
 
10.4%
I 3
 
6.2%
P 3
 
6.2%
V 2
 
4.2%
M 1
 
2.1%
p 1
 
2.1%
i 1
 
2.1%
v 1
 
2.1%
Other values (3) 3
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3633
57.1%
ASCII 2729
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
987
36.2%
1 322
 
11.8%
, 233
 
8.5%
2 208
 
7.6%
0 158
 
5.8%
6 130
 
4.8%
4 108
 
4.0%
5 98
 
3.6%
3 94
 
3.4%
8 91
 
3.3%
Other values (21) 300
 
11.0%
Hangul
ValueCountFrequency (%)
421
 
11.6%
219
 
6.0%
197
 
5.4%
195
 
5.4%
194
 
5.3%
194
 
5.3%
194
 
5.3%
194
 
5.3%
170
 
4.7%
143
 
3.9%
Other values (147) 1512
41.6%

전화번호
Text

MISSING 

Distinct113
Distinct (%)89.7%
Missing68
Missing (%)35.1%
Memory size1.6 KiB
2024-03-14T23:46:14.338140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.849206
Min length9

Characters and Unicode

Total characters1493
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)80.2%

Sample

1st row02-2696-6260
2nd row02-2602-0588
3rd row02-2602-2660
4th row02-2601-5406
5th row02-2661-6998
ValueCountFrequency (%)
02-3663-3571 3
 
2.4%
02-769-1353 2
 
1.6%
02-2658-5226 2
 
1.6%
02-2602-4944 2
 
1.6%
02-3664-6655 2
 
1.6%
02-2658-0725 2
 
1.6%
02-6358-4964 2
 
1.6%
02-2662-0816 2
 
1.6%
02-6053-8524 2
 
1.6%
02-2661-6998 2
 
1.6%
Other values (103) 105
83.3%
2024-03-14T23:46:15.538658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 262
17.5%
- 251
16.8%
0 213
14.3%
6 193
12.9%
3 104
 
7.0%
5 102
 
6.8%
9 86
 
5.8%
1 82
 
5.5%
4 79
 
5.3%
7 65
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1242
83.2%
Dash Punctuation 251
 
16.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 262
21.1%
0 213
17.1%
6 193
15.5%
3 104
 
8.4%
5 102
 
8.2%
9 86
 
6.9%
1 82
 
6.6%
4 79
 
6.4%
7 65
 
5.2%
8 56
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 251
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1493
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 262
17.5%
- 251
16.8%
0 213
14.3%
6 193
12.9%
3 104
 
7.0%
5 102
 
6.8%
9 86
 
5.8%
1 82
 
5.5%
4 79
 
5.3%
7 65
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1493
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 262
17.5%
- 251
16.8%
0 213
14.3%
6 193
12.9%
3 104
 
7.0%
5 102
 
6.8%
9 86
 
5.8%
1 82
 
5.5%
4 79
 
5.3%
7 65
 
4.4%

Interactions

2024-03-14T23:46:07.588482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-14T23:46:07.905394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:46:08.053213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사무소명도로명주소전화번호
01주)세현건축사사무소서울특별시 강서구 화곡로 297, 파크뷰에버 오피스텔705호02-2696-6260
12김명선건축사사무소서울특별시 강서구 화곡로 313, (화곡동)401호02-2602-0588
23홍건축사사무소서울특별시 강서구 화곡로 31302-2602-2660
34천지건축사사무소서울특별시 강서구 화곡로 296, 강서I'PARK 311호02-2601-5406
45(주)유용준건축사사무소서울특별시 강서구 공항대로 4602-2661-6998
56(주)유용준건축사사무소서울특별시 강서구 공항대로 4602-2661-6998
67(주)비사벌건축사사무소서울특별시 강서구 강서로 466, 우리벤처타운 402호02-2692-1919
78건축사사무소미래건축서울특별시 강서구 화곡로53길 14, 301호02-2696-5404
89(주)건축사사무소한맥서울특별시 강서구 화곡로68길 82, 제13층 1303호 (등촌동, 강서아이티밸리)02-3663-1672
910(주)건축사사무소 신화두보서울특별시 강서구 공항대로 212, A동 1029호 (마곡동, 문영퀸즈파크11차)02-2696-5600
연번사무소명도로명주소전화번호
184185종합건축사사무소 우리건축서울특별시 강서구 강서로 447, 2층 204호02-3663-3571
185186이승연 건축사사무소서울특별시 강서구 공항대로 525, 비원오피스텔 1501호<NA>
186187건축사사무소단단서울특별시 강서구 마곡중앙6로 10, 203호 (마곡역센트럴푸르지오)<NA>
187188건축사사무소 히어서울특별시 강서구 마곡중앙1로 10, 801~810호<NA>
188189아키봉 건축사사무소서울특별시 강서구 개화동로27가길 4, 404호<NA>
189190(주)고건건축사사무소서울특별시 강서구 공항대로59길 8, 404호 (등촌동, 우현빌딩)<NA>
190191건축사사무소 신화서울특별시 강서구 화곡로 296, 212호<NA>
191192지산종합건축사사무소서울특별시 강서구 양천로 738, 10층 1008호<NA>
192193더이레건축사사무소서울특별시 강서구 마곡중앙6로 63, A651호(마곡동, 마곡테크노타워)<NA>
193194케이앤건축사사무소서울특별시 강서구 마곡중앙로 161-8, 13층 A동 1316호<NA>