Overview

Dataset statistics

Number of variables: 7
Number of observations: 189
Missing cells: 75
Missing cells (%): 5.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.6 KiB
Average record size in memory: 57.7 B
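
The two missing-value percentages in this report use different denominators: the dataset-level figure divides by all cells (variables × observations), while the 전화번호 alert divides by the number of observations. The following is a minimal sketch of that arithmetic using only the counts printed in the report; the helper name `pct` is an illustrative assumption, not part of ydata-profiling.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Counts copied from the "Dataset statistics" block of the report.
n_variables = 7
n_observations = 189
missing_cells = 75  # all of them fall in the 전화번호 column

total_cells = n_variables * n_observations

# Dataset-level share: missing cells over every cell in the table.
dataset_missing_pct = pct(missing_cells, total_cells)

# Column-level share: missing cells over the rows of that single column.
column_missing_pct = pct(missing_cells, n_observations)

print(f"total cells: {total_cells}")
print(f"missing cells (dataset): {dataset_missing_pct}%")
print(f"missing values (전화번호): {column_missing_pct}%")
```

Both results agree with the values shown in the overview and in the alert, which suggests that all 75 missing cells belong to the single 전화번호 column.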

Variable types

Numeric: 1
DateTime: 1
Categorical: 1
Text: 4

Dataset

Description: Architectural office registration date, registration type (corporation or individual), office name, office address, and registered architect
Author: 서울특별시 광진구 (Gwangjin-gu, Seoul)
URL: https://www.data.go.kr/data/15034781/fileData.do

Alerts

Missing: 전화번호 (phone number) has 75 (39.7%) missing values
Unique: 연번 (serial number) has unique values

Reproduction

Analysis started: 2024-03-14 15:17:25.378729
Analysis finished: 2024-03-14 15:17:26.821969
Duration: 1.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 189
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 95
Minimum: 1
Maximum: 189
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.4
Q1: 48
Median: 95
Q3: 142
95-th percentile: 179.6
Maximum: 189
Range: 188
Interquartile range (IQR): 94

Descriptive statistics

Standard deviation: 54.703748
Coefficient of variation (CV): 0.57582892
Kurtosis: -1.2
Mean: 95
Median Absolute Deviation (MAD): 47
Skewness: 0
Sum: 17955
Variance: 2992.5
Monotonicity: Strictly increasing
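
The statistics above are what one would expect if 연번 is simply the consecutive sequence 1 to 189. The sketch below recomputes them under that assumption (which the report implies through its minimum, maximum, distinct count, and monotonicity flag, but does not state outright). The `percentile` helper is hypothetical and uses plain linear interpolation; it is not claimed to be the exact routine ydata-profiling calls.

```python
import math

# Assumption: 연번 takes every integer value from 1 to 189 exactly once.
values = list(range(1, 190))
n = len(values)


def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 1]) of pre-sorted data."""
    pos = (len(sorted_values) - 1) * q
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


total = sum(values)
mean = total / n
# Sample variance (divides by n - 1), which matches the reported 2992.5.
variance = sum((x - mean) ** 2 for x in values) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

median = percentile(values, 0.5)
# Median absolute deviation: median of the distances from the median.
mad = percentile(sorted(abs(x - median) for x in values), 0.5)

q1, q3 = percentile(values, 0.25), percentile(values, 0.75)
p5, p95 = percentile(values, 0.05), percentile(values, 0.95)

print(f"sum={total} mean={mean} variance={variance}")
print(f"std={std:.6f} cv={cv:.8f} mad={mad}")
print(f"p5={p5:.1f} q1={q1} median={median} q3={q3} p95={p95:.1f} iqr={q3 - q1}")
```

Every recomputed figure lines up with the report, so the column carries no information beyond row order, which is worth keeping in mind when reading its correlation with other variables.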
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.5%
131 | 1 | 0.5%
122 | 1 | 0.5%
123 | 1 | 0.5%
124 | 1 | 0.5%
125 | 1 | 0.5%
126 | 1 | 0.5%
127 | 1 | 0.5%
128 | 1 | 0.5%
129 | 1 | 0.5%
Other values (179) | 179 | 94.7%

Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Value | Count | Frequency (%)
189 | 1 | 0.5%
188 | 1 | 0.5%
187 | 1 | 0.5%
186 | 1 | 0.5%
185 | 1 | 0.5%
184 | 1 | 0.5%
183 | 1 | 0.5%
182 | 1 | 0.5%
181 | 1 | 0.5%
180 | 1 | 0.5%

신고일
DateTime

Distinct: 177
Distinct (%): 93.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
Minimum: 1985-06-29 00:00:00
Maximum: 2023-12-06 00:00:00
Histogram with fixed size bins (bins=50)

신고구분
Categorical

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
개인: 119
법인: 70

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 법인
2nd row: 법인
3rd row: 개인
4th row: 법인
5th row: 법인

Common Values

Value | Count | Frequency (%)
개인 | 119 | 63.0%
법인 | 70 | 37.0%
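
As a rough illustration of how a value/count/frequency table like this one can be derived, the sketch below tallies the two categories with `collections.Counter`. The list is rebuilt from the counts printed in the report rather than read from the original file, so it is an assumption-laden stand-in for the real column.

```python
from collections import Counter

# Stand-in for the 신고구분 column, rebuilt from the reported counts
# (개인 = individual, 법인 = corporation); row order is not preserved.
categories = ["개인"] * 119 + ["법인"] * 70

counts = Counter(categories)
total = sum(counts.values())

# Share of each category as a percentage with one decimal place.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

# Share of distinct values among all rows ("Distinct (%)" in the report).
distinct_pct = round(100 * len(counts) / total, 1)

for value, count in counts.most_common():
    print(f"{value} | {count} | {shares[value]}%")
print(f"distinct: {len(counts)} ({distinct_pct}%)")
```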

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
개인 | 119 | 63.0%
법인 | 70 | 37.0%

사무소명
Text

Distinct: 180
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 20
Median length: 17
Mean length: 11.253968
Min length: 7

Characters and Unicode

Total characters: 2127
Distinct characters: 208
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
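
To make the category breakdowns in this report easier to read, here is a minimal sketch of how per-character general categories can be counted with Python's standard `unicodedata` module. It runs on the five sample office names listed in this section only, so the totals are far smaller than the report's; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative assumptions, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows shown for 사무소명 in the report.
sample_names = [
    "(주)대양환경건축사사무소",
    "(주)태건축설계건축사사무소",
    "정화건축사사무소",
    "(주)예향도인종합건축사사무소",
    "(주)종합건축사사무소 무예",
]

counts = category_counts(sample_names)
for name, count in counts.most_common():
    print(f"{name} | {count}")
```

Hangul syllables fall under "Other Letter" because the script has no upper or lower case, which is why that category dominates the tables for the Korean text columns.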

Unique

Unique: 174
Unique (%): 92.1%

Sample

1st row: (주)대양환경건축사사무소
2nd row: (주)태건축설계건축사사무소
3rd row: 정화건축사사무소
4th row: (주)예향도인종합건축사사무소
5th row: (주)종합건축사사무소 무예
Value | Count | Frequency (%)
건축사사무소 | 68 | 23.4%
주식회사 | 25 | 8.6%
주)유타건축사사무소 | 4 | 1.4%
주)종합건축사사무소담 | 3 | 1.0%
플레이스 | 2 | 0.7%
 | 2 | 0.7%
주)종합건축사사무소 | 2 | 0.7%
종합건축사사무소 | 2 | 0.7%
아음건축사사무소 | 2 | 0.7%
그리드 | 2 | 0.7%
Other values (177) | 178 | 61.4%

Most occurring characters

ValueCountFrequency (%)
406
19.1%
199
 
9.4%
197
 
9.3%
193
 
9.1%
192
 
9.0%
102
 
4.8%
72
 
3.4%
43
 
2.0%
( 43
 
2.0%
) 43
 
2.0%
Other values (198) 637
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1908
89.7%
Space Separator 102
 
4.8%
Open Punctuation 43
 
2.0%
Close Punctuation 43
 
2.0%
Uppercase Letter 15
 
0.7%
Lowercase Letter 14
 
0.7%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
406
21.3%
199
 
10.4%
197
 
10.3%
193
 
10.1%
192
 
10.1%
72
 
3.8%
43
 
2.3%
28
 
1.5%
28
 
1.5%
24
 
1.3%
Other values (170) 526
27.6%
Lowercase Letter
ValueCountFrequency (%)
l 2
14.3%
g 1
 
7.1%
o 1
 
7.1%
m 1
 
7.1%
u 1
 
7.1%
e 1
 
7.1%
t 1
 
7.1%
r 1
 
7.1%
i 1
 
7.1%
a 1
 
7.1%
Other values (3) 3
21.4%
Uppercase Letter
ValueCountFrequency (%)
D 2
13.3%
I 2
13.3%
E 2
13.3%
A 2
13.3%
C 2
13.3%
V 1
6.7%
T 1
6.7%
R 1
6.7%
J 1
6.7%
Y 1
6.7%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Space Separator
ValueCountFrequency (%)
102
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1908
89.7%
Common 190
 
8.9%
Latin 29
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
406
21.3%
199
 
10.4%
197
 
10.3%
193
 
10.1%
192
 
10.1%
72
 
3.8%
43
 
2.3%
28
 
1.5%
28
 
1.5%
24
 
1.3%
Other values (170) 526
27.6%
Latin
ValueCountFrequency (%)
D 2
 
6.9%
I 2
 
6.9%
E 2
 
6.9%
A 2
 
6.9%
C 2
 
6.9%
l 2
 
6.9%
g 1
 
3.4%
o 1
 
3.4%
m 1
 
3.4%
u 1
 
3.4%
Other values (13) 13
44.8%
Common
ValueCountFrequency (%)
102
53.7%
( 43
22.6%
) 43
22.6%
& 1
 
0.5%
, 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1908
89.7%
ASCII 219
 
10.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
406
21.3%
199
 
10.4%
197
 
10.3%
193
 
10.1%
192
 
10.1%
72
 
3.8%
43
 
2.3%
28
 
1.5%
28
 
1.5%
24
 
1.3%
Other values (170) 526
27.6%
ASCII
ValueCountFrequency (%)
102
46.6%
( 43
19.6%
) 43
19.6%
D 2
 
0.9%
I 2
 
0.9%
E 2
 
0.9%
A 2
 
0.9%
C 2
 
0.9%
l 2
 
0.9%
g 1
 
0.5%
Other values (18) 18
 
8.2%

도로명주소
Text

Distinct: 171
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 45
Median length: 37
Mean length: 27.222222
Min length: 1

Characters and Unicode

Total characters: 5145
Distinct characters: 148
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 157
Unique (%): 83.1%

Sample

1st row: 서울특별시 광진구 자양로 149
2nd row: 서울특별시 광진구 광나루로52길 92
3rd row: 서울특별시 광진구 아차산로 375, 크레신타워 404호
4th row: 서울특별시 광진구 광나루로 586, 7층
5th row: 서울특별시 광진구 강변역로4길 68, 5층,501호(구의동,리젠트 오피스텔)
Value | Count | Frequency (%)
서울특별시 | 187 | 18.5%
광진구 | 187 | 18.5%
3층 | 21 | 2.1%
자양로 | 20 | 2.0%
능동로 | 15 | 1.5%
아차산로 | 15 | 1.5%
아차산로78길 | 15 | 1.5%
2층 | 14 | 1.4%
1층 | 11 | 1.1%
광나루로 | 11 | 1.1%
Other values (314) | 517 | 51.0%

Most occurring characters

ValueCountFrequency (%)
830
 
16.1%
215
 
4.2%
1 203
 
3.9%
202
 
3.9%
, 194
 
3.8%
189
 
3.7%
188
 
3.7%
188
 
3.7%
187
 
3.6%
187
 
3.6%
Other values (138) 2562
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2917
56.7%
Decimal Number 1089
 
21.2%
Space Separator 830
 
16.1%
Other Punctuation 196
 
3.8%
Close Punctuation 37
 
0.7%
Open Punctuation 37
 
0.7%
Dash Punctuation 24
 
0.5%
Uppercase Letter 15
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
215
 
7.4%
202
 
6.9%
189
 
6.5%
188
 
6.4%
188
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
118
 
4.0%
Other values (113) 1069
36.6%
Decimal Number
ValueCountFrequency (%)
1 203
18.6%
2 144
13.2%
3 138
12.7%
0 135
12.4%
4 103
9.5%
5 99
9.1%
6 89
8.2%
8 70
 
6.4%
7 70
 
6.4%
9 38
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
B 7
46.7%
F 1
 
6.7%
E 1
 
6.7%
M 1
 
6.7%
I 1
 
6.7%
D 1
 
6.7%
S 1
 
6.7%
N 1
 
6.7%
U 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
, 194
99.0%
. 2
 
1.0%
Space Separator
ValueCountFrequency (%)
830
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2917
56.7%
Common 2213
43.0%
Latin 15
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
215
 
7.4%
202
 
6.9%
189
 
6.5%
188
 
6.4%
188
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
118
 
4.0%
Other values (113) 1069
36.6%
Common
ValueCountFrequency (%)
830
37.5%
1 203
 
9.2%
, 194
 
8.8%
2 144
 
6.5%
3 138
 
6.2%
0 135
 
6.1%
4 103
 
4.7%
5 99
 
4.5%
6 89
 
4.0%
8 70
 
3.2%
Other values (6) 208
 
9.4%
Latin
ValueCountFrequency (%)
B 7
46.7%
F 1
 
6.7%
E 1
 
6.7%
M 1
 
6.7%
I 1
 
6.7%
D 1
 
6.7%
S 1
 
6.7%
N 1
 
6.7%
U 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2917
56.7%
ASCII 2228
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
830
37.3%
1 203
 
9.1%
, 194
 
8.7%
2 144
 
6.5%
3 138
 
6.2%
0 135
 
6.1%
4 103
 
4.6%
5 99
 
4.4%
6 89
 
4.0%
8 70
 
3.1%
Other values (15) 223
 
10.0%
Hangul
ValueCountFrequency (%)
215
 
7.4%
202
 
6.9%
189
 
6.5%
188
 
6.4%
188
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
187
 
6.4%
118
 
4.0%
Other values (113) 1069
36.6%

신고건축사
Text

Distinct: 186
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.989418
Min length: 2

Characters and Unicode

Total characters: 565
Distinct characters: 129
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 183
Unique (%): 96.8%

Sample

1st row: 이상훈
2nd row: 석정훈
3rd row: 유원동
4th row: 최효숙
5th row: 이종태
Value | Count | Frequency (%)
김주연 | 2 | 1.1%
김영식 | 2 | 1.1%
박현우 | 2 | 1.1%
박효경 | 1 | 0.5%
김행중 | 1 | 0.5%
이호경 | 1 | 0.5%
권이철 | 1 | 0.5%
이상훈 | 1 | 0.5%
이재현 | 1 | 0.5%
이민재 | 1 | 0.5%
Other values (176) | 176 | 93.1%

Most occurring characters

ValueCountFrequency (%)
41
 
7.3%
27
 
4.8%
17
 
3.0%
16
 
2.8%
15
 
2.7%
15
 
2.7%
12
 
2.1%
11
 
1.9%
10
 
1.8%
10
 
1.8%
Other values (119) 391
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 565
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
7.3%
27
 
4.8%
17
 
3.0%
16
 
2.8%
15
 
2.7%
15
 
2.7%
12
 
2.1%
11
 
1.9%
10
 
1.8%
10
 
1.8%
Other values (119) 391
69.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 565
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
7.3%
27
 
4.8%
17
 
3.0%
16
 
2.8%
15
 
2.7%
15
 
2.7%
12
 
2.1%
11
 
1.9%
10
 
1.8%
10
 
1.8%
Other values (119) 391
69.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 565
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
7.3%
27
 
4.8%
17
 
3.0%
16
 
2.8%
15
 
2.7%
15
 
2.7%
12
 
2.1%
11
 
1.9%
10
 
1.8%
10
 
1.8%
Other values (119) 391
69.2%

전화번호
Text

MISSING 

Distinct: 104
Distinct (%): 91.2%
Missing: 75
Missing (%): 39.7%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 11
Mean length: 11.385965
Min length: 11

Characters and Unicode

Total characters: 1298
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 98
Unique (%): 86.0%

Sample

1st row: 02-457-0331
2nd row: 02-456-6416
3rd row: 02-446-5635
4th row: 02-458-4181
5th row: 02-3437-6606
Value | Count | Frequency (%)
02-556-6903 | 4 | 3.5%
02-454-9050 | 3 | 2.6%
02-462-8770 | 3 | 2.6%
02-3447-7888 | 2 | 1.7%
02-540-6100 | 2 | 1.7%
02-516-9583 | 2 | 1.7%
02-3448-9911 | 1 | 0.9%
02-578-2025 | 1 | 0.9%
02-406-2555 | 1 | 0.9%
02-3012-2122 | 1 | 0.9%
Other values (95) | 95 | 82.6%

Most occurring characters

ValueCountFrequency (%)
- 228
17.6%
0 205
15.8%
2 188
14.5%
4 135
10.4%
5 108
8.3%
6 94
7.2%
7 81
 
6.2%
3 74
 
5.7%
1 74
 
5.7%
8 56
 
4.3%
Other values (3) 55
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1068
82.3%
Dash Punctuation 228
 
17.6%
Math Symbol 1
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 205
19.2%
2 188
17.6%
4 135
12.6%
5 108
10.1%
6 94
8.8%
7 81
 
7.6%
3 74
 
6.9%
1 74
 
6.9%
8 56
 
5.2%
9 53
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 228
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1298
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 228
17.6%
0 205
15.8%
2 188
14.5%
4 135
10.4%
5 108
8.3%
6 94
7.2%
7 81
 
6.2%
3 74
 
5.7%
1 74
 
5.7%
8 56
 
4.3%
Other values (3) 55
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1298
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 228
17.6%
0 205
15.8%
2 188
14.5%
4 135
10.4%
5 108
8.3%
6 94
7.2%
7 81
 
6.2%
3 74
 
5.7%
1 74
 
5.7%
8 56
 
4.3%
Other values (3) 55
 
4.2%

Interactions


Correlations

 | 연번 | 신고구분
연번 | 1.000 | 0.248
신고구분 | 0.248 | 1.000

 | 연번 | 신고구분
연번 | 1.000 | 0.195
신고구분 | 0.195 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 신고일 | 신고구분 | 사무소명 | 도로명주소 | 신고건축사 | 전화번호
0 | 1 | 1985-06-29 | 법인 | (주)대양환경건축사사무소 | 서울특별시 광진구 자양로 149 | 이상훈 | 02-457-0331
1 | 2 | 1989-01-23 | 법인 | (주)태건축설계건축사사무소 | 서울특별시 광진구 광나루로52길 92 | 석정훈 | 02-456-6416
2 | 3 | 1990-03-05 | 개인 | 정화건축사사무소 | 서울특별시 광진구 아차산로 375, 크레신타워 404호 | 유원동 | 02-446-5635
3 | 4 | 1992-02-10 | 법인 | (주)예향도인종합건축사사무소 | 서울특별시 광진구 광나루로 586, 7층 | 최효숙 | 02-458-4181
4 | 5 | 1994-01-27 | 법인 | (주)종합건축사사무소 무예 | 서울특별시 광진구 강변역로4길 68, 5층,501호(구의동,리젠트 오피스텔) | 이종태 | 02-3437-6606
5 | 6 | 1994-06-22 | 개인 | 건축사사무소 비상 | 서울특별시 광진구 뚝섬로 742, 광성빌딩505호 | 조진호 | 02-456-2276
6 | 7 | 1994-07-15 | 개인 | 선인건축사사무소 | 서울특별시 광진구 군자로 41-4, 601호 | 김동환 | 02-3409-2979
7 | 8 | 1994-10-20 | 개인 | 건축사사무소 기틀 | 서울특별시 광진구 능동로48길 23, 1층 | 나병기 | 02-454-6478
8 | 9 | 1995-03-17 | 개인 | 건축사사무소 태성 | 서울특별시 광진구 천호대로 596, 삼진빌딩 | 임용상 | 02-454-8255
9 | 10 | 1995-04-04 | 개인 | 무암건축사사무소 | 서울특별시 광진구 자양로 126 | 김승범 | 02-446-0276

(index) | 연번 | 신고일 | 신고구분 | 사무소명 | 도로명주소 | 신고건축사 | 전화번호
179 | 180 | 2020-07-17 | 법인 | 주식회사 호원건축사사무소 | 서울특별시 광진구 자양로13길 8, 9층, 901-5호 | 박경원 | <NA>
180 | 181 | 2020-10-21 | 법인 | 주식회사 주희성건축사사무소 | 서울특별시 광진구 군자로 156, 비1층 E-17호 | 주희성 | <NA>
181 | 182 | 2022-12-30 | 개인 | 에스티건축사사무소 | 서울특별시 광진구 능동로 266, 광정빌딩 2층 216-58 | 박경선 | <NA>
182 | 183 | 2023-09-18 | 법인 | (주)티비디건축사사무소 | 서울특별시 광진구 군자로17길 8, 지하 1층 | 민병기 | 02-453-2311
183 | 184 | 2023-10-31 | 개인 | Allometric 건축사사무소 | 서울특별시 광진구 아차산로78길 75, 706호 | 김은영 | <NA>
184 | 185 | 2023-11-03 | 개인 | 이음디자인건축사사무소 | 서울특별시 광진구 광나루로 436, 5층 | 기대영 | <NA>
185 | 186 | 2023-11-15 | 개인 | 앤앤에이 건축사사무소 | 서울특별시 광진구 능동로 247, 6층, 602호 | 정희태 | 02-464-1300
186 | 187 | 2023-12-06 | 개인 | 온리트 건축사사무소 | 서울특별시 광진구 광나루로 382, 3층 303-13호 | 박효경 | <NA>
187 | 188 | 2021-03-16 | 법인 | 주식회사 삼민종합건축사사무소 | 서울특별시 광진구 광나루로 436, 에듀킨빌딩 5층 | 이지원 | <NA>
188 | 189 | 2006-05-04 | 개인 | 건축사사무소가온건축 | 서울특별시 광진구 천호대로138길 28, 1층 | 임형남 | 02-512-6313