Overview

Dataset statistics

Number of variables6
Number of observations230
Missing cells72
Missing cells (%)5.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.1 KiB
Average record size in memory49.6 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description서울특별시 종로구 건축사사무소개설신고 현황(건축사사무소개설 신고구분, 사무소명, 도로명주소, 전화번호)에 대한 데이터를 제공합니다.
Author서울특별시 종로구
URLhttps://www.data.go.kr/data/15126256/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 72 (31.3%) missing valuesMissing
연번 has unique valuesUnique
사무소명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 17:16:01.793879
Analysis finished2024-03-14 17:16:03.521094
Duration1.73 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct230
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean115.5
Minimum1
Maximum230
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2024-03-15T02:16:03.784938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.45
Q158.25
median115.5
Q3172.75
95-th percentile218.55
Maximum230
Range229
Interquartile range (IQR)114.5

Descriptive statistics

Standard deviation66.539462
Coefficient of variation (CV)0.57609924
Kurtosis-1.2
Mean115.5
Median Absolute Deviation (MAD)57.5
Skewness0
Sum26565
Variance4427.5
MonotonicityStrictly increasing
2024-03-15T02:16:04.222351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
146 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
Other values (220) 220
95.7%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
230 1
0.4%
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%

신고구분
Categorical

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
개인
141 
법인
89 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row법인
2nd row법인
3rd row법인
4th row법인
5th row개인

Common Values

ValueCountFrequency (%)
개인 141
61.3%
법인 89
38.7%

Length

2024-03-15T02:16:04.451155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:16:04.663730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 141
61.3%
법인 89
38.7%

사무소명
Text

UNIQUE 

Distinct230
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-03-15T02:16:05.708241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length11.834783
Min length7

Characters and Unicode

Total characters2722
Distinct characters262
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)100.0%

Sample

1st row(주)종합건축사사무소 그룹예성
2nd row주식회사 디자인그룹금성건축사사무소
3rd row(주)신진엔지니어링건축사사무소
4th row(주)종합건축사사무소 이로재
5th row아보 건축사사무소
ValueCountFrequency (%)
건축사사무소 97
25.3%
주식회사 27
 
7.0%
주)건축사사무소 5
 
1.3%
주)종합건축사사무소 4
 
1.0%
공감 2
 
0.5%
아키텍츠 2
 
0.5%
스튜디오 2
 
0.5%
건축사 2
 
0.5%
종합건축사사무소 2
 
0.5%
사무소 2
 
0.5%
Other values (237) 238
62.1%
2024-03-15T02:16:07.249157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
497
18.3%
242
 
8.9%
241
 
8.9%
235
 
8.6%
233
 
8.6%
156
 
5.7%
86
 
3.2%
63
 
2.3%
( 53
 
1.9%
) 53
 
1.9%
Other values (252) 863
31.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2398
88.1%
Space Separator 156
 
5.7%
Close Punctuation 54
 
2.0%
Open Punctuation 53
 
1.9%
Uppercase Letter 27
 
1.0%
Lowercase Letter 26
 
1.0%
Decimal Number 8
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
497
20.7%
242
 
10.1%
241
 
10.1%
235
 
9.8%
233
 
9.7%
86
 
3.6%
63
 
2.6%
35
 
1.5%
35
 
1.5%
33
 
1.4%
Other values (213) 698
29.1%
Lowercase Letter
ValueCountFrequency (%)
o 5
19.2%
t 3
11.5%
i 2
 
7.7%
c 2
 
7.7%
a 2
 
7.7%
z 2
 
7.7%
l 1
 
3.8%
b 1
 
3.8%
n 1
 
3.8%
m 1
 
3.8%
Other values (6) 6
23.1%
Uppercase Letter
ValueCountFrequency (%)
E 4
14.8%
I 3
11.1%
R 3
11.1%
A 3
11.1%
P 3
11.1%
N 2
7.4%
S 2
7.4%
M 2
7.4%
Z 2
7.4%
O 1
 
3.7%
Other values (2) 2
7.4%
Decimal Number
ValueCountFrequency (%)
8 2
25.0%
9 1
12.5%
3 1
12.5%
0 1
12.5%
6 1
12.5%
2 1
12.5%
1 1
12.5%
Close Punctuation
ValueCountFrequency (%)
) 53
98.1%
] 1
 
1.9%
Space Separator
ValueCountFrequency (%)
156
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2398
88.1%
Common 271
 
10.0%
Latin 53
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
497
20.7%
242
 
10.1%
241
 
10.1%
235
 
9.8%
233
 
9.7%
86
 
3.6%
63
 
2.6%
35
 
1.5%
35
 
1.5%
33
 
1.4%
Other values (213) 698
29.1%
Latin
ValueCountFrequency (%)
o 5
 
9.4%
E 4
 
7.5%
t 3
 
5.7%
I 3
 
5.7%
R 3
 
5.7%
A 3
 
5.7%
P 3
 
5.7%
N 2
 
3.8%
S 2
 
3.8%
i 2
 
3.8%
Other values (18) 23
43.4%
Common
ValueCountFrequency (%)
156
57.6%
( 53
 
19.6%
) 53
 
19.6%
8 2
 
0.7%
9 1
 
0.4%
3 1
 
0.4%
0 1
 
0.4%
6 1
 
0.4%
2 1
 
0.4%
1 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2398
88.1%
ASCII 324
 
11.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
497
20.7%
242
 
10.1%
241
 
10.1%
235
 
9.8%
233
 
9.7%
86
 
3.6%
63
 
2.6%
35
 
1.5%
35
 
1.5%
33
 
1.4%
Other values (213) 698
29.1%
ASCII
ValueCountFrequency (%)
156
48.1%
( 53
 
16.4%
) 53
 
16.4%
o 5
 
1.5%
E 4
 
1.2%
t 3
 
0.9%
I 3
 
0.9%
R 3
 
0.9%
A 3
 
0.9%
P 3
 
0.9%
Other values (29) 38
 
11.7%
Distinct225
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-03-15T02:16:08.525758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length37
Mean length26.8
Min length16

Characters and Unicode

Total characters6164
Distinct characters210
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)95.7%

Sample

1st row서울특별시 종로구 새문안로 42, 901호
2nd row서울특별시 종로구 새문안로 42, 피어선빌딩 5층
3rd row서울특별시 종로구 관훈동 198-16 남도빌딩 201호
4th row서울특별시 종로구 동숭4가길 20
5th row서울특별시 종로구 종로 335-5
ValueCountFrequency (%)
서울특별시 230
 
18.0%
종로구 230
 
18.0%
1층 24
 
1.9%
3층 23
 
1.8%
자하문로 18
 
1.4%
2층 18
 
1.4%
4층 15
 
1.2%
율곡로 11
 
0.9%
201호 9
 
0.7%
새문안로 9
 
0.7%
Other values (424) 693
54.1%
2024-03-15T02:16:10.211166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1056
 
17.1%
424
 
6.9%
1 281
 
4.6%
260
 
4.2%
234
 
3.8%
231
 
3.7%
231
 
3.7%
230
 
3.7%
230
 
3.7%
230
 
3.7%
Other values (200) 2757
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3537
57.4%
Decimal Number 1220
 
19.8%
Space Separator 1056
 
17.1%
Other Punctuation 216
 
3.5%
Dash Punctuation 66
 
1.1%
Open Punctuation 21
 
0.3%
Close Punctuation 21
 
0.3%
Lowercase Letter 15
 
0.2%
Uppercase Letter 12
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
424
 
12.0%
260
 
7.4%
234
 
6.6%
231
 
6.5%
231
 
6.5%
230
 
6.5%
230
 
6.5%
230
 
6.5%
123
 
3.5%
112
 
3.2%
Other values (169) 1232
34.8%
Decimal Number
ValueCountFrequency (%)
1 281
23.0%
3 166
13.6%
2 162
13.3%
0 149
12.2%
4 99
 
8.1%
5 84
 
6.9%
8 78
 
6.4%
6 71
 
5.8%
7 65
 
5.3%
9 65
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
e 2
13.3%
r 2
13.3%
c 2
13.3%
t 2
13.3%
b 2
13.3%
a 1
6.7%
h 1
6.7%
i 1
6.7%
s 1
6.7%
f 1
6.7%
Uppercase Letter
ValueCountFrequency (%)
B 4
33.3%
A 3
25.0%
F 2
16.7%
S 2
16.7%
K 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 215
99.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1056
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3537
57.4%
Common 2600
42.2%
Latin 27
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
424
 
12.0%
260
 
7.4%
234
 
6.6%
231
 
6.5%
231
 
6.5%
230
 
6.5%
230
 
6.5%
230
 
6.5%
123
 
3.5%
112
 
3.2%
Other values (169) 1232
34.8%
Common
ValueCountFrequency (%)
1056
40.6%
1 281
 
10.8%
, 215
 
8.3%
3 166
 
6.4%
2 162
 
6.2%
0 149
 
5.7%
4 99
 
3.8%
5 84
 
3.2%
8 78
 
3.0%
6 71
 
2.7%
Other values (6) 239
 
9.2%
Latin
ValueCountFrequency (%)
B 4
14.8%
A 3
11.1%
F 2
 
7.4%
e 2
 
7.4%
r 2
 
7.4%
c 2
 
7.4%
t 2
 
7.4%
b 2
 
7.4%
S 2
 
7.4%
a 1
 
3.7%
Other values (5) 5
18.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3537
57.4%
ASCII 2627
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1056
40.2%
1 281
 
10.7%
, 215
 
8.2%
3 166
 
6.3%
2 162
 
6.2%
0 149
 
5.7%
4 99
 
3.8%
5 84
 
3.2%
8 78
 
3.0%
6 71
 
2.7%
Other values (21) 266
 
10.1%
Hangul
ValueCountFrequency (%)
424
 
12.0%
260
 
7.4%
234
 
6.6%
231
 
6.5%
231
 
6.5%
230
 
6.5%
230
 
6.5%
230
 
6.5%
123
 
3.5%
112
 
3.2%
Other values (169) 1232
34.8%

전화번호
Text

MISSING 

Distinct155
Distinct (%)98.1%
Missing72
Missing (%)31.3%
Memory size1.9 KiB
2024-03-15T02:16:11.063274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length11.797468
Min length3

Characters and Unicode

Total characters1864
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)96.2%

Sample

1st row02-737-1130
2nd row02-720-2646
3rd row02-742-3351-2
4th row02-763-2010
5th row02-744-2170
ValueCountFrequency (%)
02-720-8686 2
 
1.3%
02-739-9958 2
 
1.3%
02-070-8837-8385 2
 
1.3%
02-928-2240 1
 
0.6%
02-6925-6667 1
 
0.6%
02-3453-3163 1
 
0.6%
02-356-4005 1
 
0.6%
02-323-9360 1
 
0.6%
02-542-8981 1
 
0.6%
02-2200-0500 1
 
0.6%
Other values (145) 145
91.8%
2024-03-15T02:16:12.486381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 331
17.8%
0 321
17.2%
2 280
15.0%
7 174
9.3%
5 126
 
6.8%
3 124
 
6.7%
1 111
 
6.0%
6 109
 
5.8%
8 104
 
5.6%
4 97
 
5.2%
Other values (2) 87
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1529
82.0%
Dash Punctuation 331
 
17.8%
Math Symbol 4
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 321
21.0%
2 280
18.3%
7 174
11.4%
5 126
 
8.2%
3 124
 
8.1%
1 111
 
7.3%
6 109
 
7.1%
8 104
 
6.8%
4 97
 
6.3%
9 83
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 331
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1864
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 331
17.8%
0 321
17.2%
2 280
15.0%
7 174
9.3%
5 126
 
6.8%
3 124
 
6.7%
1 111
 
6.0%
6 109
 
5.8%
8 104
 
5.6%
4 97
 
5.2%
Other values (2) 87
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1864
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 331
17.8%
0 321
17.2%
2 280
15.0%
7 174
9.3%
5 126
 
6.8%
3 124
 
6.7%
1 111
 
6.0%
6 109
 
5.8%
8 104
 
5.6%
4 97
 
5.2%
Other values (2) 87
 
4.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum2024-01-03 00:00:00
Maximum2024-01-03 00:00:00
2024-03-15T02:16:12.883572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:16:13.198623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T02:16:02.512692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:16:13.490914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번신고구분
연번1.0000.205
신고구분0.2051.000
2024-03-15T02:16:13.634366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번신고구분
연번1.0000.154
신고구분0.1541.000

Missing values

2024-03-15T02:16:02.999830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:16:03.370021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번신고구분사무소명도로명주소전화번호데이터기준일자
01법인(주)종합건축사사무소 그룹예성서울특별시 종로구 새문안로 42, 901호02-737-11302024-01-03
12법인주식회사 디자인그룹금성건축사사무소서울특별시 종로구 새문안로 42, 피어선빌딩 5층02-720-26462024-01-03
23법인(주)신진엔지니어링건축사사무소서울특별시 종로구 관훈동 198-16 남도빌딩 201호02-742-3351-22024-01-03
34법인(주)종합건축사사무소 이로재서울특별시 종로구 동숭4가길 2002-763-20102024-01-03
45개인아보 건축사사무소서울특별시 종로구 종로 335-502-744-21702024-01-03
56개인세승건축사사무소서울특별시 종로구 삼일대로 457, 12층 1213호실02-3391-35952024-01-03
67법인(주)기용건축 건축사사무소서울특별시 종로구 평창문화로 139, 서호빌딩 3층02-3675-08662024-01-03
78개인상헌건축사사무소서울특별시 종로구 자하문로 13102-738-78372024-01-03
89법인(주)반도종합 건축사사무소서울특별시 종로구 평창동 66-402-395-11012024-01-03
910개인류성건축사사무소서울특별시 종로구 율곡로 237, 동신빌딩 5층02-765-7835~62024-01-03
연번신고구분사무소명도로명주소전화번호데이터기준일자
220221개인이도lab 건축사사무소서울특별시 종로구 김상옥로 17, 신관 202-1호02-6229-19052024-01-03
221222개인프로인건축사사무소서울특별시 종로구 지봉로 62-1 (1층)<NA>2024-01-03
222223개인시제건축사사무소서울특별시 종로구 평창문화로 139, 서호빌딩 3층<NA>2024-01-03
223224개인건축사사무소 해연서울특별시 종로구 세종대로23길 47, 미도파광화문빌딩 6층 601-85호<NA>2024-01-03
224225법인(주)종합건축사사무소온서울특별시 종로구 율곡로14길 49, 동우빌딩 301호<NA>2024-01-03
225226법인(주)다주종합건축사사무소서울특별시 종로구 사직로 지하 96, 파크뷰타워 지하1층 비01호02-928-22402024-01-03
226227개인태율 건축사사무소서울특별시 종로구 새문안로 91, 고려빌딩 511호02-722-22902024-01-03
227228개인오챠드 건축사사무소서울특별시 종로구 효자로 15, 2층, 9<NA>2024-01-03
228229개인건축사사무소 다라산서울특별시 종로구 계동길 128, 402호02-743-21412024-01-03
229230개인마일드플랜아키텍츠 건축사사무소서울특별시 종로구 혜화로9길 13, 1층<NA>2024-01-03