Overview

Dataset statistics

Number of variables: 4
Number of observations: 91
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 33.4 B

Variable types

DateTime: 1
Text: 3

Dataset

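Description: Status data on architectural offices that operate as architects in Seongbuk-gu and are managed by the Architecture Division of the Seongbuk-gu Office. It provides each office's name, road-name address, and representative's name.
Author: Seongbuk-gu, Seoul Metropolitan City (서울특별시 성북구)
URL: https://www.data.go.kr/data/15126283/fileData.do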

Alerts

데이터기준일 (data reference date) has constant value "2024-01-08": Constant
대표자 (representative) has unique values: Unique

Reproduction

Analysis started: 2024-03-14 17:38:00.338823
Analysis finished: 2024-03-14 17:38:01.335455
Duration: 1 second
Software version: ydata-profiling v4.5.1
Download configuration: config.json
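A report of this kind is typically produced by loading the source file into a pandas DataFrame and passing it to `ProfileReport`. The sketch below is an assumption about how that might look, not the configuration actually used for this run: the function name `build_profile`, the `cp949` default encoding, and the report title are hypothetical. The third-party imports are deferred into the function so that it can be defined even where the packages are not installed.

```python
def build_profile(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the report to an HTML file."""
    # Deferred imports: pandas and ydata-profiling are third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the reference-date column as a datetime so it is typed as DateTime.
    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["데이터기준일"])

    # The title is a hypothetical label, not taken from the original report.
    report = ProfileReport(df, title="Seongbuk-gu architectural offices")
    report.to_file(html_path)
```

Calling `build_profile("seongbuk_architects.csv", "report.html")` with a locally downloaded copy of the dataset (the file name here is a placeholder) would write a comparable report.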

Variables

데이터기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 856.0 B
Minimum: 2024-01-08 00:00:00
Maximum: 2024-01-08 00:00:00
Histogram with fixed size bins (bins=1)

건축사사무소명
Text

Distinct: 89
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 856.0 B

Length

Max length: 20
Median length: 18
Mean length: 10.923077
Min length: 4

Characters and Unicode

Total characters: 994
Distinct characters: 136
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
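
As an illustration of the character properties mentioned above, the following minimal sketch counts characters by Unicode general category using Python's standard `unicodedata` module. It covers categories only, because that module does not expose script or block names. The helper name `category_counts` is hypothetical, and the two input strings are office names that appear in this report.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Lo = Other Letter, Zs = Space Separator
counts = category_counts("칠성 건축사사무소")
print(counts)

# Ps = Open Punctuation, Pe = Close Punctuation
paren_counts = category_counts("(주)엑토종합건축사사무소")
print(paren_counts)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for every text variable in this dataset.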

Unique

Unique: 88
Unique (%): 96.7%
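
"Distinct" and "Unique" measure different things, as these figures are usually defined: distinct is the number of different values, while unique is the number of values that occur exactly once. With 91 observations, 89 distinct values, and 88 unique values, the numbers are consistent with one office name occurring three times (88 + 3 = 91). The sketch below shows the distinction on toy data; the function name and the data are illustrative assumptions.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy data: "A" appears three times, "B" and "C" once each.
toy = ["A", "B", "A", "C", "A"]
print(distinct_and_unique(toy))  # (3, 2)
```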

Sample

1st row: 칠성 건축사사무소
2nd row: 한성건축사사무소
3rd row: 피아건축종합건축사사무소
4th row: 아리건축사사무소
5th row: 송백건축사사무소
Value | Count | Frequency (%)
건축사사무소 | 29 | 21.5%
주식회사 | 8 | 5.9%
주)건축사사무소메타 | 3 | 2.2%
종합건축사사무소 | 2 | 1.5%
건축사사무소건축공작소반 | 1 | 0.7%
에이브릭건축사사무소 | 1 | 0.7%
이지피종합건축사사무소 | 1 | 0.7%
자소원 | 1 | 0.7%
시가건축 | 1 | 0.7%
담우건축사사무소 | 1 | 0.7%
Other values (87) | 87 | 64.4%

Most occurring characters

ValueCountFrequency (%)
188
18.9%
97
 
9.8%
97
 
9.8%
97
 
9.8%
90
 
9.1%
45
 
4.5%
27
 
2.7%
21
 
2.1%
( 19
 
1.9%
) 19
 
1.9%
Other values (126) 294
29.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 908 | 91.3%
Space Separator | 45 | 4.5%
Open Punctuation | 19 | 1.9%
Close Punctuation | 19 | 1.9%
Uppercase Letter | 3 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
188
20.7%
97
 
10.7%
97
 
10.7%
97
 
10.7%
90
 
9.9%
27
 
3.0%
21
 
2.3%
10
 
1.1%
10
 
1.1%
10
 
1.1%
Other values (120) 261
28.7%
Uppercase Letter
ValueCountFrequency (%)
J 1
33.3%
H 1
33.3%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 908 | 91.3%
Common | 83 | 8.4%
Latin | 3 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
188
20.7%
97
 
10.7%
97
 
10.7%
97
 
10.7%
90
 
9.9%
27
 
3.0%
21
 
2.3%
10
 
1.1%
10
 
1.1%
10
 
1.1%
Other values (120) 261
28.7%
Common
ValueCountFrequency (%)
45
54.2%
( 19
22.9%
) 19
22.9%
Latin
ValueCountFrequency (%)
J 1
33.3%
H 1
33.3%
B 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 908 | 91.3%
ASCII | 86 | 8.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
188
20.7%
97
 
10.7%
97
 
10.7%
97
 
10.7%
90
 
9.9%
27
 
3.0%
21
 
2.3%
10
 
1.1%
10
 
1.1%
10
 
1.1%
Other values (120) 261
28.7%
ASCII
ValueCountFrequency (%)
45
52.3%
( 19
22.1%
) 19
22.1%
J 1
 
1.2%
H 1
 
1.2%
B 1
 
1.2%

주소
Text

Distinct: 86
Distinct (%): 94.5%
Missing: 0
Missing (%): 0.0%
Memory size: 856.0 B

Length

Max length: 49
Median length: 34
Mean length: 25.681319
Min length: 17

Characters and Unicode

Total characters: 2337
Distinct characters: 117
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
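
The reported mean length can be cross-checked against the character totals, since the mean is the total number of characters divided by the number of observations. The short sketch below reproduces that arithmetic from the figures in this report; the function name `mean_length` is an illustrative assumption.

```python
def mean_length(total_characters: int, observations: int) -> float:
    """Mean string length, given the total character count and the row count."""
    return total_characters / observations


# Figures reported above: 2337 characters across 91 observations.
print(round(mean_length(2337, 91), 6))  # 25.681319
```

The same check holds for the other text variables: 994 / 91 rounds to 10.923077, and 273 / 91 is exactly 3.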

Unique

Unique: 82
Unique (%): 90.1%

Sample

1st row: 서울특별시 성북구 보문로 175
2nd row: 서울특별시 성북구 보문로 164, 3층
3rd row: 서울특별시 성북구 동소문로28길 44
4th row: 서울특별시 성북구 동소문로15길 8, 2층
5th row: 서울특별시 성북구 보문로 163, 12층(삼선동5가)
Value | Count | Frequency (%)
서울특별시 | 91 | 18.9%
성북구 | 91 | 18.9%
2층 | 18 | 3.7%
3층 | 17 | 3.5%
보문로 | 11 | 2.3%
1층 | 8 | 1.7%
성북로 | 7 | 1.5%
선잠로 | 4 | 0.8%
6층 | 4 | 0.8%
21 | 4 | 0.8%
Other values (164) | 226 | 47.0%
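
This table counts tokens within the addresses rather than whole address strings, which is why 서울특별시 and 성북구 each appear 91 times, once per row. The sketch below shows one way such token counts could be derived. It assumes tokens are split on whitespace and stripped of surrounding commas; the profiler's own tokenization may differ. The input is the five sample addresses listed in this report.

```python
from collections import Counter


def token_frequencies(values):
    """Count whitespace-separated tokens, dropping leading/trailing commas."""
    counts = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(",")
            if token:
                counts[token] += 1
    return counts


addresses = [
    "서울특별시 성북구 보문로 175",
    "서울특별시 성북구 보문로 164, 3층",
    "서울특별시 성북구 동소문로28길 44",
    "서울특별시 성북구 동소문로15길 8, 2층",
    "서울특별시 성북구 보문로 163, 12층(삼선동5가)",
]
freq = token_frequencies(addresses)
total = sum(freq.values())

# Print the three most common tokens with their share of all tokens.
for token, count in freq.most_common(3):
    print(token, count, f"{100 * count / total:.1f}%")
```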

Most occurring characters

ValueCountFrequency (%)
392
 
16.8%
111
 
4.7%
110
 
4.7%
1 100
 
4.3%
93
 
4.0%
92
 
3.9%
91
 
3.9%
91
 
3.9%
91
 
3.9%
91
 
3.9%
Other values (107) 1075
46.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1341 | 57.4%
Decimal Number | 474 | 20.3%
Space Separator | 392 | 16.8%
Other Punctuation | 89 | 3.8%
Dash Punctuation | 21 | 0.9%
Close Punctuation | 8 | 0.3%
Open Punctuation | 8 | 0.3%
Uppercase Letter | 3 | 0.1%
Letter Number | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
8.3%
110
 
8.2%
93
 
6.9%
92
 
6.9%
91
 
6.8%
91
 
6.8%
91
 
6.8%
91
 
6.8%
89
 
6.6%
60
 
4.5%
Other values (89) 422
31.5%
Decimal Number
Value | Count | Frequency (%)
1 | 100 | 21.1%
2 | 85 | 17.9%
3 | 65 | 13.7%
0 | 44 | 9.3%
4 | 44 | 9.3%
5 | 35 | 7.4%
8 | 33 | 7.0%
6 | 31 | 6.5%
7 | 19 | 4.0%
9 | 18 | 3.8%
Uppercase Letter
ValueCountFrequency (%)
B 2
66.7%
F 1
33.3%
Space Separator
ValueCountFrequency (%)
392
100.0%
Other Punctuation
ValueCountFrequency (%)
, 89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1341 | 57.4%
Common | 992 | 42.4%
Latin | 4 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
8.3%
110
 
8.2%
93
 
6.9%
92
 
6.9%
91
 
6.8%
91
 
6.8%
91
 
6.8%
91
 
6.8%
89
 
6.6%
60
 
4.5%
Other values (89) 422
31.5%
Common
ValueCountFrequency (%)
392
39.5%
1 100
 
10.1%
, 89
 
9.0%
2 85
 
8.6%
3 65
 
6.6%
0 44
 
4.4%
4 44
 
4.4%
5 35
 
3.5%
8 33
 
3.3%
6 31
 
3.1%
Other values (5) 74
 
7.5%
Latin
ValueCountFrequency (%)
B 2
50.0%
F 1
25.0%
1
25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1341 | 57.4%
ASCII | 995 | 42.6%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
392
39.4%
1 100
 
10.1%
, 89
 
8.9%
2 85
 
8.5%
3 65
 
6.5%
0 44
 
4.4%
4 44
 
4.4%
5 35
 
3.5%
8 33
 
3.3%
6 31
 
3.1%
Other values (7) 77
 
7.7%
Hangul
ValueCountFrequency (%)
111
 
8.3%
110
 
8.2%
93
 
6.9%
92
 
6.9%
91
 
6.8%
91
 
6.8%
91
 
6.8%
91
 
6.8%
89
 
6.6%
60
 
4.5%
Other values (89) 422
31.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

대표자
Text

UNIQUE 

Distinct: 91
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 856.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 273
Distinct characters: 105
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 91
Unique (%): 100.0%

Sample

1st row: 문창수
2nd row: 강문식
3rd row: 김후석
4th row: 윤종수
5th row: 김제경
Value | Count | Frequency (%)
문창수 | 1 | 1.1%
김진섭 | 1 | 1.1%
김민호 | 1 | 1.1%
박승완 | 1 | 1.1%
이충원 | 1 | 1.1%
최재혁 | 1 | 1.1%
이언정 | 1 | 1.1%
이성섭 | 1 | 1.1%
정정원 | 1 | 1.1%
안수범 | 1 | 1.1%
Other values (81) | 81 | 89.0%

Most occurring characters

ValueCountFrequency (%)
17
 
6.2%
16
 
5.9%
14
 
5.1%
9
 
3.3%
8
 
2.9%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (95) 180
65.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 273 | 100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
6.2%
16
 
5.9%
14
 
5.1%
9
 
3.3%
8
 
2.9%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (95) 180
65.9%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 273 | 100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.9%
14
 
5.1%
9
 
3.3%
8
 
2.9%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (95) 180
65.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 273 | 100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.9%
14
 
5.1%
9
 
3.3%
8
 
2.9%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (95) 180
65.9%

Correlations

Variable | 건축사사무소명 | 주소 | 대표자
건축사사무소명 | 1.000 | 1.000 | 1.000
주소 | 1.000 | 1.000 | 1.000
대표자 | 1.000 | 1.000 | 1.000
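
Every pair in this matrix scores 1.000. For text columns whose values are almost all different, a categorical association measure saturates at 1 whether or not the columns are meaningfully related, so these scores should be read with caution. The sketch below illustrates the effect with uncorrected Cramér's V. This is an assumption made for illustration: the report does not state which measure was applied. The inputs are the first five office names and representatives from the sample rows.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row = Counter(xs)
    col = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row.items():
        for y, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


names = ["칠성 건축사사무소", "한성건축사사무소", "피아건축종합건축사사무소",
         "아리건축사사무소", "송백건축사사무소"]
reps = ["문창수", "강문식", "김후석", "윤종수", "김제경"]

# All values are different in both columns, so V reaches its maximum.
v_all_unique = cramers_v(names, reps)
print(round(v_all_unique, 3))
```

By contrast, two columns whose values repeat and are independent of each other, such as `["a", "a", "b", "b"]` and `["x", "y", "x", "y"]`, give a value of 0 with the same function.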

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 데이터기준일 | 건축사사무소명 | 주소 | 대표자
0 | 2024-01-08 | 칠성 건축사사무소 | 서울특별시 성북구 보문로 175 | 문창수
1 | 2024-01-08 | 한성건축사사무소 | 서울특별시 성북구 보문로 164, 3층 | 강문식
2 | 2024-01-08 | 피아건축종합건축사사무소 | 서울특별시 성북구 동소문로28길 44 | 김후석
3 | 2024-01-08 | 아리건축사사무소 | 서울특별시 성북구 동소문로15길 8, 2층 | 윤종수
4 | 2024-01-08 | 송백건축사사무소 | 서울특별시 성북구 보문로 163, 12층(삼선동5가) | 김제경
5 | 2024-01-08 | 생활미건축사사무소 | 서울특별시 성북구 삼선교로14길 74, 2층 | 신광식
6 | 2024-01-08 | 새빛건축사사무소 | 서울특별시 성북구 보문로 148 | 공준진
7 | 2024-01-08 | 예준건축사사무소 | 서울특별시 성북구 화랑로 299, 3층 338호(장위동 외7필지 석계역 한일노벨리아시티) | 강철준
8 | 2024-01-08 | 소요종합건축사사무소 | 서울특별시 성북구 삼선동4가 대지 346- 3층 306호 | 신명길
9 | 2024-01-08 | (주)엑토종합건축사사무소 | 서울특별시 성북구 보문로 185, 305호(고산빌딩) | 홍성천

Index | 데이터기준일 | 건축사사무소명 | 주소 | 대표자
81 | 2024-01-08 | 주식회사 건축사사무소이노랩 | 서울특별시 성북구 동소문로13길 39-2, 백호빌딩 301호 | 유승환
82 | 2024-01-08 | 유리건축사사무소 | 서울특별시 성북구 보국문로 95, 3층 | 김정율
83 | 2024-01-08 | 윤현필건축사사무소 | 서울특별시 성북구 고려대로2길 49, 1층 | 윤현필
84 | 2024-01-08 | 건축사사무소 양양 | 서울특별시 성북구 지봉로20길 65, 1층 | 정지영
85 | 2024-01-08 | 소단건축사사무소 | 서울특별시 성북구 삼선교로 8, 5층, 501 | 어혜령
86 | 2024-01-08 | 반아크건축사사무소 | 서울특별시 성북구 장위로13길 63, 2층 | 심수웅
87 | 2024-01-08 | 리하 건축사사무소 | 서울특별시 성북구 안암로 19, 303호 | 안효선
88 | 2024-01-08 | 와이아키텍트 건축사사무소 | 서울특별시 성북구 한천로80길 64, 2층 | 윤태호
89 | 2024-01-08 | 지요건축사사무소 | 서울특별시 성북구 성북로 76, 3층 | 김세진
90 | 2024-01-08 | 헤젤리흐건축사사무소 | 서울특별시 성북구 아리랑로 87, 헤젤리흐건축사사무소 | 이성엽