Overview

Dataset statistics

Number of variables5
Number of observations119
Missing cells47
Missing cells (%)7.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory42.1 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description서울특별시 용산구 건축사사무소개설신고 현황(건축사사무소개설 신고구분, 사무소명, 도로명주소, 전화번호)에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15090463/fileData.do

Alerts

전화번호 has 47 (39.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:14:29.567579
Analysis finished2023-12-12 04:14:30.006232
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct119
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60
Minimum1
Maximum119
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T13:14:30.091190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.9
Q130.5
median60
Q389.5
95-th percentile113.1
Maximum119
Range118
Interquartile range (IQR)59

Descriptive statistics

Standard deviation34.496377
Coefficient of variation (CV)0.57493961
Kurtosis-1.2
Mean60
Median Absolute Deviation (MAD)30
Skewness0
Sum7140
Variance1190
MonotonicityStrictly increasing
2023-12-12T13:14:30.307852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
2 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
85 1
 
0.8%
84 1
 
0.8%
83 1
 
0.8%
82 1
 
0.8%
Other values (109) 109
91.6%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
119 1
0.8%
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%
114 1
0.8%
113 1
0.8%
112 1
0.8%
111 1
0.8%
110 1
0.8%

신고구분
Categorical

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
개인
64 
법인
55 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row법인
3rd row개인
4th row개인
5th row법인

Common Values

ValueCountFrequency (%)
개인 64
53.8%
법인 55
46.2%

Length

2023-12-12T13:14:30.545136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:14:30.656394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 64
53.8%
법인 55
46.2%
Distinct114
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T13:14:30.896328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length17
Mean length12.210084
Min length7

Characters and Unicode

Total characters1453
Distinct characters186
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)92.4%

Sample

1st row루멘건축사사무소
2nd row(주)지우탑건축사사무소
3rd row동원건축사사무소
4th row허건축사사무소
5th row종합건축사사무소
ValueCountFrequency (%)
건축사사무소 41
 
20.7%
주식회사 22
 
11.1%
종합건축사사무소 6
 
3.0%
다른 3
 
1.5%
주)건축사사무소 3
 
1.5%
주)탄허건축사사무소 2
 
1.0%
커튼홀 2
 
1.0%
co 1
 
0.5%
리파트너스종합건축사사무소 1
 
0.5%
lee 1
 
0.5%
Other values (116) 116
58.6%
2023-12-12T13:14:31.383397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
264
18.2%
125
 
8.6%
124
 
8.5%
124
 
8.5%
120
 
8.3%
80
 
5.5%
51
 
3.5%
31
 
2.1%
( 29
 
2.0%
) 29
 
2.0%
Other values (176) 476
32.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1262
86.9%
Space Separator 80
 
5.5%
Lowercase Letter 38
 
2.6%
Open Punctuation 29
 
2.0%
Close Punctuation 29
 
2.0%
Uppercase Letter 11
 
0.8%
Other Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
264
20.9%
125
 
9.9%
124
 
9.8%
124
 
9.8%
120
 
9.5%
51
 
4.0%
31
 
2.5%
24
 
1.9%
23
 
1.8%
14
 
1.1%
Other values (149) 362
28.7%
Lowercase Letter
ValueCountFrequency (%)
t 6
15.8%
r 5
13.2%
c 4
10.5%
e 4
10.5%
n 3
7.9%
u 3
7.9%
d 2
 
5.3%
o 2
 
5.3%
h 2
 
5.3%
i 2
 
5.3%
Other values (4) 5
13.2%
Uppercase Letter
ValueCountFrequency (%)
E 2
18.2%
L 2
18.2%
A 2
18.2%
C 1
9.1%
S 1
9.1%
I 1
9.1%
H 1
9.1%
N 1
9.1%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
, 1
 
25.0%
Space Separator
ValueCountFrequency (%)
80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1262
86.9%
Common 142
 
9.8%
Latin 49
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
264
20.9%
125
 
9.9%
124
 
9.8%
124
 
9.8%
120
 
9.5%
51
 
4.0%
31
 
2.5%
24
 
1.9%
23
 
1.8%
14
 
1.1%
Other values (149) 362
28.7%
Latin
ValueCountFrequency (%)
t 6
 
12.2%
r 5
 
10.2%
c 4
 
8.2%
e 4
 
8.2%
n 3
 
6.1%
u 3
 
6.1%
E 2
 
4.1%
L 2
 
4.1%
d 2
 
4.1%
o 2
 
4.1%
Other values (12) 16
32.7%
Common
ValueCountFrequency (%)
80
56.3%
( 29
 
20.4%
) 29
 
20.4%
. 3
 
2.1%
, 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1262
86.9%
ASCII 191
 
13.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
264
20.9%
125
 
9.9%
124
 
9.8%
124
 
9.8%
120
 
9.5%
51
 
4.0%
31
 
2.5%
24
 
1.9%
23
 
1.8%
14
 
1.1%
Other values (149) 362
28.7%
ASCII
ValueCountFrequency (%)
80
41.9%
( 29
 
15.2%
) 29
 
15.2%
t 6
 
3.1%
r 5
 
2.6%
c 4
 
2.1%
e 4
 
2.1%
. 3
 
1.6%
n 3
 
1.6%
u 3
 
1.6%
Other values (17) 25
 
13.1%
Distinct107
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T13:14:31.785743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length26.428571
Min length1

Characters and Unicode

Total characters3145
Distinct characters142
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)82.4%

Sample

1st row서울특별시 용산구 후암로28길 27 2층 (후암동)
2nd row서울특별시 용산구 원효로 214-2 청운빌딩8층 (원효로2가)
3rd row서울특별시 용산구 한강대로 125-2, 2층
4th row서울특별시 용산구 백범로 340-1
5th row서울특별시 용산구 청파로73길 3
ValueCountFrequency (%)
서울특별시 115
 
18.4%
용산구 115
 
18.4%
2층 22
 
3.5%
20 14
 
2.2%
3층 11
 
1.8%
1층 11
 
1.8%
25 7
 
1.1%
7 7
 
1.1%
한강대로 7
 
1.1%
효창원로69길 7
 
1.1%
Other values (225) 310
49.5%
2023-12-12T13:14:32.401125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
520
 
16.5%
2 136
 
4.3%
122
 
3.9%
120
 
3.8%
119
 
3.8%
119
 
3.8%
118
 
3.8%
116
 
3.7%
115
 
3.7%
115
 
3.7%
Other values (132) 1545
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1794
57.0%
Decimal Number 650
 
20.7%
Space Separator 520
 
16.5%
Other Punctuation 108
 
3.4%
Dash Punctuation 28
 
0.9%
Close Punctuation 15
 
0.5%
Open Punctuation 15
 
0.5%
Uppercase Letter 15
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
6.8%
120
 
6.7%
119
 
6.6%
119
 
6.6%
118
 
6.6%
116
 
6.5%
115
 
6.4%
115
 
6.4%
115
 
6.4%
79
 
4.4%
Other values (112) 656
36.6%
Decimal Number
ValueCountFrequency (%)
2 136
20.9%
1 113
17.4%
4 82
12.6%
3 71
10.9%
0 70
10.8%
6 45
 
6.9%
5 40
 
6.2%
7 33
 
5.1%
9 31
 
4.8%
8 29
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
B 6
40.0%
D 5
33.3%
F 2
 
13.3%
A 1
 
6.7%
N 1
 
6.7%
Space Separator
ValueCountFrequency (%)
520
100.0%
Other Punctuation
ValueCountFrequency (%)
, 108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1794
57.0%
Common 1336
42.5%
Latin 15
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
6.8%
120
 
6.7%
119
 
6.6%
119
 
6.6%
118
 
6.6%
116
 
6.5%
115
 
6.4%
115
 
6.4%
115
 
6.4%
79
 
4.4%
Other values (112) 656
36.6%
Common
ValueCountFrequency (%)
520
38.9%
2 136
 
10.2%
1 113
 
8.5%
, 108
 
8.1%
4 82
 
6.1%
3 71
 
5.3%
0 70
 
5.2%
6 45
 
3.4%
5 40
 
3.0%
7 33
 
2.5%
Other values (5) 118
 
8.8%
Latin
ValueCountFrequency (%)
B 6
40.0%
D 5
33.3%
F 2
 
13.3%
A 1
 
6.7%
N 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1794
57.0%
ASCII 1351
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
520
38.5%
2 136
 
10.1%
1 113
 
8.4%
, 108
 
8.0%
4 82
 
6.1%
3 71
 
5.3%
0 70
 
5.2%
6 45
 
3.3%
5 40
 
3.0%
7 33
 
2.4%
Other values (10) 133
 
9.8%
Hangul
ValueCountFrequency (%)
122
 
6.8%
120
 
6.7%
119
 
6.6%
119
 
6.6%
118
 
6.6%
116
 
6.5%
115
 
6.4%
115
 
6.4%
115
 
6.4%
79
 
4.4%
Other values (112) 656
36.6%

전화번호
Text

MISSING 

Distinct68
Distinct (%)94.4%
Missing47
Missing (%)39.5%
Memory size1.1 KiB
2023-12-12T13:14:32.756665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.402778
Min length10

Characters and Unicode

Total characters821
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)90.3%

Sample

1st row02-716-1797
2nd row02-373-8049
3rd row02-719-5766
4th row02-712-1804
5th row02-3272-1255
ValueCountFrequency (%)
02-394-4990 3
 
4.1%
02-790-1708 2
 
2.7%
02-796-0401 2
 
2.7%
02-778-6989 1
 
1.4%
02-797-7016 1
 
1.4%
02-6339-3003 1
 
1.4%
02-313-0887 1
 
1.4%
02-591-3913 1
 
1.4%
02-2635-2633 1
 
1.4%
02-872-0237 1
 
1.4%
Other values (59) 59
80.8%
2023-12-12T13:14:33.276736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 144
17.5%
2 134
16.3%
0 126
15.3%
7 75
9.1%
9 59
7.2%
3 55
 
6.7%
1 51
 
6.2%
4 47
 
5.7%
8 47
 
5.7%
6 42
 
5.1%
Other values (2) 41
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 676
82.3%
Dash Punctuation 144
 
17.5%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 134
19.8%
0 126
18.6%
7 75
11.1%
9 59
8.7%
3 55
8.1%
1 51
 
7.5%
4 47
 
7.0%
8 47
 
7.0%
6 42
 
6.2%
5 40
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 821
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 144
17.5%
2 134
16.3%
0 126
15.3%
7 75
9.1%
9 59
7.2%
3 55
 
6.7%
1 51
 
6.2%
4 47
 
5.7%
8 47
 
5.7%
6 42
 
5.1%
Other values (2) 41
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 821
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 144
17.5%
2 134
16.3%
0 126
15.3%
7 75
9.1%
9 59
7.2%
3 55
 
6.7%
1 51
 
6.2%
4 47
 
5.7%
8 47
 
5.7%
6 42
 
5.1%
Other values (2) 41
 
5.0%

Interactions

2023-12-12T13:14:29.786807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:14:33.398727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번신고구분전화번호
연번1.0000.3470.869
신고구분0.3471.0000.898
전화번호0.8690.8981.000
2023-12-12T13:14:33.518043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번신고구분
연번1.0000.263
신고구분0.2631.000

Missing values

2023-12-12T13:14:29.885908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:14:29.969559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번신고구분사무소명도로명주소전화번호
01개인루멘건축사사무소서울특별시 용산구 후암로28길 27 2층 (후암동)<NA>
12법인(주)지우탑건축사사무소서울특별시 용산구 원효로 214-2 청운빌딩8층 (원효로2가)02-716-1797
23개인동원건축사사무소서울특별시 용산구 한강대로 125-2, 2층02-373-8049
34개인허건축사사무소서울특별시 용산구 백범로 340-102-719-5766
45법인종합건축사사무소서울특별시 용산구 청파로73길 3<NA>
56법인종합건축사사무소서울특별시 용산구 청파로73길 3<NA>
67개인종합건축사사무소 신라서울특별시 용산구 백범로 328-102-712-1804
78개인지영종합건축사사무소서울특별시 용산구 백범로 276, 강산빌딩 1층 101호02-3272-1255
89법인주식회사 두일건축사사무소서울특별시 용산구 원효로89길 3-4, 302호(원효로1가,석선빌딩)<NA>
910법인(주)도시환경종합건축사사무소서울특별시 용산구 유엔빌리지3길 54-1602- 794-7852
연번신고구분사무소명도로명주소전화번호
109110법인주식회사 제이에이치씨건축사사무소서울특별시 용산구 신흥로22길 4, 2층<NA>
110111개인건축사사무소 무이원서울특별시 용산구 한강대로102길 11-3, 4층02-2272-0709
111112개인건축사사무소아키텍토닉스서울특별시 용산구 한강대로 366, 트윈시티 남산 6층 648호<NA>
112113개인지여건축사사무소서울특별시 용산구 원효로 186-1, 2층<NA>
113114개인필드워크 건축사사무소서울특별시 용산구 청파로49길 37-3, 디테일씨빌딩 1층 88호<NA>
114115법인(주)엘앤엘건축사사무소서울특별시 용산구 원효로90길 11, 16층 1621호<NA>
115116개인미가 건축사사무소서울특별시 용산구 청파로49길 37-3, 디테일씨빌딩 1층, 136호<NA>
116117개인건축사사무소 리가서울특별시 용산구 이촌로 5, 703호<NA>
117118법인주식회사 소와요건축사사무소서울특별시 용산구 원효로 210-5, 2층<NA>
118119개인삼삼 건축사사무소서울특별시 용산구 후암로 43, 2층<NA>