Overview

Dataset statistics

Number of variables4
Number of observations794
Missing cells109
Missing cells (%)3.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.9 KiB
Average record size in memory32.2 B

Variable types

Text3
DateTime1

Dataset

Description파일 다운로드
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-14997/S/1/datasetView.do

Alerts

데이터기준일 has constant value ""Constant
전화번호 has 109 (13.7%) missing valuesMissing

Reproduction

Analysis started2023-12-11 05:37:15.766450
Analysis finished2023-12-11 05:37:16.521825
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct793
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-11T14:37:16.786110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length22
Mean length12.392947
Min length7

Characters and Unicode

Total characters9840
Distinct characters359
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique792 ?
Unique (%)99.7%

Sample

1st row(주)에이엔디파트너스건축사사무소
2nd row(주)CMR건축사사무소
3rd row(주)가나안건축사사무소
4th row(주)가당건축사사무소
5th row(주)가람종합건축사사무소
ValueCountFrequency (%)
건축사사무소 73
 
7.7%
주식회사 48
 
5.0%
종합건축사사무소 10
 
1.1%
주)건축사사무소 5
 
0.5%
주)종합건축사사무소 4
 
0.4%
partners 2
 
0.2%
구도건축사사무소 2
 
0.2%
2
 
0.2%
시안건축사사무소 2
 
0.2%
주)홍익엔지니어링건축사사무소 1
 
0.1%
Other values (803) 803
84.3%
2023-12-11T14:37:17.267345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1650
16.8%
882
 
9.0%
864
 
8.8%
804
 
8.2%
801
 
8.1%
490
 
5.0%
) 436
 
4.4%
( 436
 
4.4%
229
 
2.3%
226
 
2.3%
Other values (349) 3022
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8622
87.6%
Close Punctuation 439
 
4.5%
Open Punctuation 439
 
4.5%
Space Separator 162
 
1.6%
Uppercase Letter 89
 
0.9%
Lowercase Letter 50
 
0.5%
Decimal Number 20
 
0.2%
Other Punctuation 18
 
0.2%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1650
19.1%
882
 
10.2%
864
 
10.0%
804
 
9.3%
801
 
9.3%
490
 
5.7%
229
 
2.7%
226
 
2.6%
215
 
2.5%
138
 
1.6%
Other values (292) 2323
26.9%
Uppercase Letter
ValueCountFrequency (%)
A 14
15.7%
S 14
15.7%
E 6
 
6.7%
I 6
 
6.7%
D 6
 
6.7%
C 6
 
6.7%
H 5
 
5.6%
L 5
 
5.6%
N 4
 
4.5%
O 3
 
3.4%
Other values (13) 20
22.5%
Lowercase Letter
ValueCountFrequency (%)
r 8
16.0%
t 7
14.0%
e 6
12.0%
c 5
10.0%
n 4
8.0%
a 3
 
6.0%
i 3
 
6.0%
s 3
 
6.0%
o 2
 
4.0%
h 2
 
4.0%
Other values (7) 7
14.0%
Decimal Number
ValueCountFrequency (%)
1 5
25.0%
2 5
25.0%
3 3
15.0%
5 3
15.0%
4 2
 
10.0%
0 1
 
5.0%
7 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 13
72.2%
· 2
 
11.1%
, 2
 
11.1%
1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 436
99.3%
] 3
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 436
99.3%
[ 3
 
0.7%
Space Separator
ValueCountFrequency (%)
162
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8622
87.6%
Common 1078
 
11.0%
Latin 139
 
1.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1650
19.1%
882
 
10.2%
864
 
10.0%
804
 
9.3%
801
 
9.3%
490
 
5.7%
229
 
2.7%
226
 
2.6%
215
 
2.5%
138
 
1.6%
Other values (292) 2323
26.9%
Latin
ValueCountFrequency (%)
A 14
 
10.1%
S 14
 
10.1%
r 8
 
5.8%
t 7
 
5.0%
e 6
 
4.3%
E 6
 
4.3%
I 6
 
4.3%
D 6
 
4.3%
C 6
 
4.3%
c 5
 
3.6%
Other values (30) 61
43.9%
Common
ValueCountFrequency (%)
) 436
40.4%
( 436
40.4%
162
 
15.0%
. 13
 
1.2%
1 5
 
0.5%
2 5
 
0.5%
[ 3
 
0.3%
3 3
 
0.3%
5 3
 
0.3%
] 3
 
0.3%
Other values (6) 9
 
0.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8621
87.6%
ASCII 1214
 
12.3%
None 4
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1650
19.1%
882
 
10.2%
864
 
10.0%
804
 
9.3%
801
 
9.3%
490
 
5.7%
229
 
2.7%
226
 
2.6%
215
 
2.5%
138
 
1.6%
Other values (291) 2322
26.9%
ASCII
ValueCountFrequency (%)
) 436
35.9%
( 436
35.9%
162
 
13.3%
A 14
 
1.2%
S 14
 
1.2%
. 13
 
1.1%
r 8
 
0.7%
t 7
 
0.6%
e 6
 
0.5%
E 6
 
0.5%
Other values (44) 112
 
9.2%
None
ValueCountFrequency (%)
· 2
50.0%
1
25.0%
1
25.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct755
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-11T14:37:17.554088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length31.586902
Min length1

Characters and Unicode

Total characters25080
Distinct characters284
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique734 ?
Unique (%)92.4%

Sample

1st row서울특별시 강남구 역삼로 166-0, 세현빌딩
2nd row서울특별시 강남구 영동대로129길 10-0
3rd row서울특별시 강남구 역삼로37길 11-0
4th row서울특별시 강남구 삼성로 635-0, 이모빌딩 2층
5th row서울특별시 강남구 도산대로 165-0, (신사동)
ValueCountFrequency (%)
서울특별시 776
 
18.0%
강남구 776
 
18.0%
3층 53
 
1.2%
4층 41
 
0.9%
2층 35
 
0.8%
논현로 32
 
0.7%
선릉로 28
 
0.6%
봉은사로 27
 
0.6%
5층 26
 
0.6%
6-0 24
 
0.6%
Other values (1286) 2501
57.9%
2023-12-11T14:37:18.049911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3573
 
14.2%
0 1164
 
4.6%
1 954
 
3.8%
, 953
 
3.8%
856
 
3.4%
832
 
3.3%
796
 
3.2%
794
 
3.2%
787
 
3.1%
781
 
3.1%
Other values (274) 13590
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13838
55.2%
Decimal Number 5099
 
20.3%
Space Separator 3573
 
14.2%
Other Punctuation 956
 
3.8%
Dash Punctuation 766
 
3.1%
Close Punctuation 414
 
1.7%
Open Punctuation 413
 
1.6%
Uppercase Letter 21
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
856
 
6.2%
832
 
6.0%
796
 
5.8%
794
 
5.7%
787
 
5.7%
781
 
5.6%
780
 
5.6%
779
 
5.6%
776
 
5.6%
516
 
3.7%
Other values (249) 6141
44.4%
Decimal Number
ValueCountFrequency (%)
0 1164
22.8%
1 954
18.7%
2 682
13.4%
3 541
10.6%
4 434
 
8.5%
5 376
 
7.4%
6 320
 
6.3%
7 243
 
4.8%
8 227
 
4.5%
9 158
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
B 6
28.6%
A 4
19.0%
S 4
19.0%
J 2
 
9.5%
M 1
 
4.8%
Y 1
 
4.8%
K 1
 
4.8%
C 1
 
4.8%
D 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 953
99.7%
. 3
 
0.3%
Space Separator
ValueCountFrequency (%)
3573
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 766
100.0%
Close Punctuation
ValueCountFrequency (%)
) 414
100.0%
Open Punctuation
ValueCountFrequency (%)
( 413
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13838
55.2%
Common 11221
44.7%
Latin 21
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
856
 
6.2%
832
 
6.0%
796
 
5.8%
794
 
5.7%
787
 
5.7%
781
 
5.6%
780
 
5.6%
779
 
5.6%
776
 
5.6%
516
 
3.7%
Other values (249) 6141
44.4%
Common
ValueCountFrequency (%)
3573
31.8%
0 1164
 
10.4%
1 954
 
8.5%
, 953
 
8.5%
- 766
 
6.8%
2 682
 
6.1%
3 541
 
4.8%
4 434
 
3.9%
) 414
 
3.7%
( 413
 
3.7%
Other values (6) 1327
 
11.8%
Latin
ValueCountFrequency (%)
B 6
28.6%
A 4
19.0%
S 4
19.0%
J 2
 
9.5%
M 1
 
4.8%
Y 1
 
4.8%
K 1
 
4.8%
C 1
 
4.8%
D 1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13838
55.2%
ASCII 11242
44.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3573
31.8%
0 1164
 
10.4%
1 954
 
8.5%
, 953
 
8.5%
- 766
 
6.8%
2 682
 
6.1%
3 541
 
4.8%
4 434
 
3.9%
) 414
 
3.7%
( 413
 
3.7%
Other values (15) 1348
 
12.0%
Hangul
ValueCountFrequency (%)
856
 
6.2%
832
 
6.0%
796
 
5.8%
794
 
5.7%
787
 
5.7%
781
 
5.6%
780
 
5.6%
779
 
5.6%
776
 
5.6%
516
 
3.7%
Other values (249) 6141
44.4%

전화번호
Text

MISSING 

Distinct652
Distinct (%)95.2%
Missing109
Missing (%)13.7%
Memory size6.3 KiB
2023-12-11T14:37:18.353116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length11.407299
Min length9

Characters and Unicode

Total characters7814
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique621 ?
Unique (%)90.7%

Sample

1st row070-8708-5110
2nd row02-544-2275
3rd row02-569-5595
4th row02-516-6828
5th row02-569-0901~5
ValueCountFrequency (%)
02-565-2278 3
 
0.4%
02-542-8937 3
 
0.4%
02-569-8833 2
 
0.3%
02-529-7207 2
 
0.3%
02-575-4137 2
 
0.3%
02-529-7558 2
 
0.3%
02-544-5608 2
 
0.3%
02-544-7495 2
 
0.3%
02-518-7542 2
 
0.3%
02-547-4596 2
 
0.3%
Other values (646) 667
96.8%
2023-12-11T14:37:18.933259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1369
17.5%
0 1169
15.0%
2 1096
14.0%
5 965
12.3%
4 624
8.0%
1 528
 
6.8%
3 474
 
6.1%
7 441
 
5.6%
6 432
 
5.5%
8 362
 
4.6%
Other values (4) 354
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6414
82.1%
Dash Punctuation 1369
 
17.5%
Math Symbol 26
 
0.3%
Space Separator 4
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1169
18.2%
2 1096
17.1%
5 965
15.0%
4 624
9.7%
1 528
8.2%
3 474
7.4%
7 441
 
6.9%
6 432
 
6.7%
8 362
 
5.6%
9 323
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 1369
100.0%
Math Symbol
ValueCountFrequency (%)
~ 26
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7814
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1369
17.5%
0 1169
15.0%
2 1096
14.0%
5 965
12.3%
4 624
8.0%
1 528
 
6.8%
3 474
 
6.1%
7 441
 
5.6%
6 432
 
5.5%
8 362
 
4.6%
Other values (4) 354
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7814
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1369
17.5%
0 1169
15.0%
2 1096
14.0%
5 965
12.3%
4 624
8.0%
1 528
 
6.8%
3 474
 
6.1%
7 441
 
5.6%
6 432
 
5.5%
8 362
 
4.6%
Other values (4) 354
 
4.5%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
Minimum2019-04-12 00:00:00
Maximum2019-04-12 00:00:00
2023-12-11T14:37:19.189785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T14:37:19.367414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-11T14:37:16.352455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T14:37:16.475516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사무소명도로명주소전화번호데이터기준일
0(주)에이엔디파트너스건축사사무소서울특별시 강남구 역삼로 166-0, 세현빌딩070-8708-51102019-04-12
1(주)CMR건축사사무소서울특별시 강남구 영동대로129길 10-002-544-22752019-04-12
2(주)가나안건축사사무소서울특별시 강남구 역삼로37길 11-002-569-55952019-04-12
3(주)가당건축사사무소서울특별시 강남구 삼성로 635-0, 이모빌딩 2층02-516-68282019-04-12
4(주)가람종합건축사사무소서울특별시 강남구 도산대로 165-0, (신사동)02-569-0901~52019-04-12
5(주)가온건축사사무소서울특별시 강남구 역삼로 455-0, 2층(대치동, 하나빌딩)<NA>2019-04-12
6(주)가운건축사사무소서울특별시 강남구 언주로141길 6-0, 501호(논현동 ,백향빌딩)02-3443-77992019-04-12
7(주)가풍종합건축사사무소02-529-7272~42019-04-12
8(주)강남종합건축사사무소서울특별시 강남구 테헤란로8길 40-0, 3층(역삼동, 알티넷빌딩)02-511-74242019-04-12
9(주)거목건축사사무소서울특별시 강남구 학동로20길 21-002-3442-35412019-04-12
사무소명도로명주소전화번호데이터기준일
784한인종합건축사사무소서울특별시 강남구 학동로 342-0, SK허브블루1120호02-3443-34612019-04-12
785해람종합건축사사무소서울특별시 강남구 영동대로 324-0, 타워크리스탈빌딩 805호(대치동)<NA>2019-04-12
786현우재건축사사무소서울특별시 강남구 언주로 118-0, 우성캐릭터199오피스텔2001호<NA>2019-04-12
787협연건축사사무소서울특별시 강남구 강남대로110길 36-002-557-35552019-04-12
788형공건축사사무소서울특별시 강남구 학동로101길 26-0, 4층 401-1호(청담동, 청담삼익상가)02-543-88752019-04-12
789홍건축사사무소 주식회사서울특별시 강남구 남부순환로 2645-0, 에이 451(도곡동, 한독빌딩)<NA>2019-04-12
790홍인건축사사무소서울특별시 강남구 삼성로 706-0, 추탄회관 4층 501호02-6409-69772019-04-12
791환경동인건축사사무소서울특별시 강남구 논현로 641-0, 대우아이빌323호02-548-11782019-04-12
792환경포럼건축사사무소서울특별시 강남구 도곡로3길 25-0, 삼성애니텔 714호02-2051-50202019-04-12
793희산건축사사무소서울특별시 강남구 학동로 423-0, 401호(청담동,청우빌딩)02-518-27272019-04-12