Overview

Dataset statistics

Number of variables4
Number of observations773
Missing cells112
Missing cells (%)3.6%
Duplicate rows8
Duplicate rows (%)1.0%
Total size in memory24.3 KiB
Average record size in memory32.2 B

Variable types

Text3
DateTime1

Dataset

Description파일 다운로드
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-14997/S/1/datasetView.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 8 (1.0%) duplicate rowsDuplicates
전화번호 has 112 (14.5%) missing valuesMissing

Reproduction

Analysis started2023-12-11 05:37:11.161625
Analysis finished2023-12-11 05:37:11.688181
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct762
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-11T14:37:11.888917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length31
Mean length12.558862
Min length7

Characters and Unicode

Total characters9708
Distinct characters351
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique752 ?
Unique (%)97.3%

Sample

1st row(주) 에이엔디파트너스건축사사무소
2nd row(주)CMR건축사사무소
3rd row(주)가나안건축사사무소
4th row(주)가당건축사사무소
5th row(주)가람종합건축사사무소
ValueCountFrequency (%)
건축사사무소 85
 
8.8%
주식회사 64
 
6.7%
종합건축사사무소 10
 
1.0%
주)종합건축사사무소 4
 
0.4%
주)건축사사무소 4
 
0.4%
주)종합건축사사무소가람건축 3
 
0.3%
architects 3
 
0.3%
아이씨디건축사사무소 2
 
0.2%
2
 
0.2%
와이건축사사무소 2
 
0.2%
Other values (775) 783
81.4%
2023-12-11T14:37:12.338301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1629
16.8%
852
 
8.8%
837
 
8.6%
782
 
8.1%
778
 
8.0%
482
 
5.0%
( 408
 
4.2%
) 408
 
4.2%
228
 
2.3%
215
 
2.2%
Other values (341) 3089
31.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8507
87.6%
Open Punctuation 410
 
4.2%
Close Punctuation 410
 
4.2%
Space Separator 194
 
2.0%
Uppercase Letter 117
 
1.2%
Lowercase Letter 37
 
0.4%
Other Punctuation 16
 
0.2%
Decimal Number 16
 
0.2%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1629
19.1%
852
 
10.0%
837
 
9.8%
782
 
9.2%
778
 
9.1%
482
 
5.7%
228
 
2.7%
215
 
2.5%
212
 
2.5%
139
 
1.6%
Other values (285) 2353
27.7%
Uppercase Letter
ValueCountFrequency (%)
S 19
16.2%
A 19
16.2%
I 9
 
7.7%
C 9
 
7.7%
D 6
 
5.1%
E 6
 
5.1%
H 6
 
5.1%
N 5
 
4.3%
T 5
 
4.3%
U 4
 
3.4%
Other values (14) 29
24.8%
Lowercase Letter
ValueCountFrequency (%)
t 5
13.5%
c 5
13.5%
r 5
13.5%
e 4
10.8%
n 3
8.1%
s 3
8.1%
i 3
8.1%
h 2
 
5.4%
a 2
 
5.4%
m 1
 
2.7%
Other values (4) 4
10.8%
Decimal Number
ValueCountFrequency (%)
1 4
25.0%
5 3
18.8%
2 3
18.8%
4 2
12.5%
3 2
12.5%
7 1
 
6.2%
0 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 12
75.0%
& 1
 
6.2%
* 1
 
6.2%
1
 
6.2%
· 1
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 408
99.5%
[ 2
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 408
99.5%
] 2
 
0.5%
Space Separator
ValueCountFrequency (%)
194
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8508
87.6%
Common 1046
 
10.8%
Latin 154
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1629
19.1%
852
 
10.0%
837
 
9.8%
782
 
9.2%
778
 
9.1%
482
 
5.7%
228
 
2.7%
215
 
2.5%
212
 
2.5%
139
 
1.6%
Other values (286) 2354
27.7%
Latin
ValueCountFrequency (%)
S 19
 
12.3%
A 19
 
12.3%
I 9
 
5.8%
C 9
 
5.8%
D 6
 
3.9%
E 6
 
3.9%
H 6
 
3.9%
t 5
 
3.2%
N 5
 
3.2%
T 5
 
3.2%
Other values (28) 65
42.2%
Common
ValueCountFrequency (%)
( 408
39.0%
) 408
39.0%
194
18.5%
. 12
 
1.1%
1 4
 
0.4%
5 3
 
0.3%
2 3
 
0.3%
] 2
 
0.2%
[ 2
 
0.2%
4 2
 
0.2%
Other values (7) 8
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8507
87.6%
ASCII 1198
 
12.3%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1629
19.1%
852
 
10.0%
837
 
9.8%
782
 
9.2%
778
 
9.1%
482
 
5.7%
228
 
2.7%
215
 
2.5%
212
 
2.5%
139
 
1.6%
Other values (285) 2353
27.7%
ASCII
ValueCountFrequency (%)
( 408
34.1%
) 408
34.1%
194
16.2%
S 19
 
1.6%
A 19
 
1.6%
. 12
 
1.0%
I 9
 
0.8%
C 9
 
0.8%
D 6
 
0.5%
E 6
 
0.5%
Other values (43) 108
 
9.0%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
· 1
33.3%
Distinct745
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-11T14:37:12.673104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length42
Mean length32.702458
Min length19

Characters and Unicode

Total characters25279
Distinct characters287
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique721 ?
Unique (%)93.3%

Sample

1st row서울특별시 강남구 역삼로 166-0, 세현빌딩
2nd row서울특별시 강남구 영동대로129길 10-0
3rd row서울특별시 강남구 역삼로37길 11-0
4th row서울특별시 강남구 삼성로 635-0, 이모빌딩 2층
5th row서울특별시 강남구 도산대로 165-0, (신사동)
ValueCountFrequency (%)
서울특별시 773
 
17.8%
강남구 773
 
17.8%
3층 50
 
1.2%
4층 44
 
1.0%
2층 35
 
0.8%
논현로 32
 
0.7%
5층 32
 
0.7%
봉은사로 30
 
0.7%
선릉로 27
 
0.6%
테헤란로 26
 
0.6%
Other values (1289) 2510
57.9%
2023-12-11T14:37:13.231016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3576
 
14.1%
0 1160
 
4.6%
, 980
 
3.9%
1 964
 
3.8%
854
 
3.4%
828
 
3.3%
793
 
3.1%
790
 
3.1%
784
 
3.1%
779
 
3.1%
Other values (277) 13771
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13873
54.9%
Decimal Number 5182
 
20.5%
Space Separator 3576
 
14.1%
Other Punctuation 986
 
3.9%
Dash Punctuation 779
 
3.1%
Open Punctuation 430
 
1.7%
Close Punctuation 430
 
1.7%
Uppercase Letter 23
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
854
 
6.2%
828
 
6.0%
793
 
5.7%
790
 
5.7%
784
 
5.7%
779
 
5.6%
777
 
5.6%
776
 
5.6%
773
 
5.6%
522
 
3.8%
Other values (249) 6197
44.7%
Uppercase Letter
ValueCountFrequency (%)
B 6
26.1%
S 4
17.4%
M 2
 
8.7%
K 2
 
8.7%
J 2
 
8.7%
A 2
 
8.7%
Y 1
 
4.3%
D 1
 
4.3%
C 1
 
4.3%
G 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
0 1160
22.4%
1 964
18.6%
2 698
13.5%
3 550
10.6%
4 448
 
8.6%
5 379
 
7.3%
6 331
 
6.4%
7 258
 
5.0%
8 217
 
4.2%
9 177
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 980
99.4%
: 5
 
0.5%
. 1
 
0.1%
Space Separator
ValueCountFrequency (%)
3576
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 779
100.0%
Open Punctuation
ValueCountFrequency (%)
( 430
100.0%
Close Punctuation
ValueCountFrequency (%)
) 430
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13873
54.9%
Common 11383
45.0%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
854
 
6.2%
828
 
6.0%
793
 
5.7%
790
 
5.7%
784
 
5.7%
779
 
5.6%
777
 
5.6%
776
 
5.6%
773
 
5.6%
522
 
3.8%
Other values (249) 6197
44.7%
Common
ValueCountFrequency (%)
3576
31.4%
0 1160
 
10.2%
, 980
 
8.6%
1 964
 
8.5%
- 779
 
6.8%
2 698
 
6.1%
3 550
 
4.8%
4 448
 
3.9%
( 430
 
3.8%
) 430
 
3.8%
Other values (7) 1368
 
12.0%
Latin
ValueCountFrequency (%)
B 6
26.1%
S 4
17.4%
M 2
 
8.7%
K 2
 
8.7%
J 2
 
8.7%
A 2
 
8.7%
Y 1
 
4.3%
D 1
 
4.3%
C 1
 
4.3%
G 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13873
54.9%
ASCII 11406
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3576
31.4%
0 1160
 
10.2%
, 980
 
8.6%
1 964
 
8.5%
- 779
 
6.8%
2 698
 
6.1%
3 550
 
4.8%
4 448
 
3.9%
( 430
 
3.8%
) 430
 
3.8%
Other values (18) 1391
 
12.2%
Hangul
ValueCountFrequency (%)
854
 
6.2%
828
 
6.0%
793
 
5.7%
790
 
5.7%
784
 
5.7%
779
 
5.6%
777
 
5.6%
776
 
5.6%
773
 
5.6%
522
 
3.8%
Other values (249) 6197
44.7%

전화번호
Text

MISSING 

Distinct626
Distinct (%)94.7%
Missing112
Missing (%)14.5%
Memory size6.2 KiB
2023-12-11T14:37:13.644016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length8.526475
Min length7

Characters and Unicode

Total characters5636
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique595 ?
Unique (%)90.0%

Sample

1st row070-8708-5110
2nd row544-2275
3rd row569-5595
4th row516-6828
5th row569-0901~5
ValueCountFrequency (%)
565-2278 3
 
0.4%
542-8937 3
 
0.4%
511-0361~5 3
 
0.4%
529-7207 3
 
0.4%
2051-9330 2
 
0.3%
575-4137 2
 
0.3%
448-3975 2
 
0.3%
567-2012 2
 
0.3%
544-5608 2
 
0.3%
578-2921 2
 
0.3%
Other values (623) 644
96.4%
2023-12-11T14:37:14.202241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 931
16.5%
- 681
12.1%
4 603
10.7%
1 518
9.2%
0 507
9.0%
3 455
8.1%
6 427
7.6%
7 425
7.5%
2 422
7.5%
8 338
 
6.0%
Other values (4) 329
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4923
87.3%
Dash Punctuation 681
 
12.1%
Math Symbol 24
 
0.4%
Space Separator 7
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 931
18.9%
4 603
12.2%
1 518
10.5%
0 507
10.3%
3 455
9.2%
6 427
8.7%
7 425
8.6%
2 422
8.6%
8 338
 
6.9%
9 297
 
6.0%
Dash Punctuation
ValueCountFrequency (%)
- 681
100.0%
Math Symbol
ValueCountFrequency (%)
~ 24
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5636
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 931
16.5%
- 681
12.1%
4 603
10.7%
1 518
9.2%
0 507
9.0%
3 455
8.1%
6 427
7.6%
7 425
7.5%
2 422
7.5%
8 338
 
6.0%
Other values (4) 329
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5636
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 931
16.5%
- 681
12.1%
4 603
10.7%
1 518
9.2%
0 507
9.0%
3 455
8.1%
6 427
7.6%
7 425
7.5%
2 422
7.5%
8 338
 
6.0%
Other values (4) 329
 
5.8%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
Minimum2020-05-12 00:00:00
Maximum2020-05-12 00:00:00
2023-12-11T14:37:14.325762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T14:37:14.416105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-11T14:37:11.542774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T14:37:11.648522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사무소명도로명주소전화번호데이터기준일
0(주) 에이엔디파트너스건축사사무소서울특별시 강남구 역삼로 166-0, 세현빌딩070-8708-51102020-05-12
1(주)CMR건축사사무소서울특별시 강남구 영동대로129길 10-0544-22752020-05-12
2(주)가나안건축사사무소서울특별시 강남구 역삼로37길 11-0569-55952020-05-12
3(주)가당건축사사무소서울특별시 강남구 삼성로 635-0, 이모빌딩 2층516-68282020-05-12
4(주)가람종합건축사사무소서울특별시 강남구 도산대로 165-0, (신사동)569-0901~52020-05-12
5(주)가운건축사사무소서울특별시 강남구 언주로141길 6-0, 501호(논현동 ,백향빌딩)3443-77992020-05-12
6(주)강남종합건축사사무소서울특별시 강남구 테헤란로8길 40-0, 3층(역삼동, 알티넷빌딩)511-74242020-05-12
7(주)거목건축사사무소서울특별시 강남구 학동로20길 21-03442-35412020-05-12
8(주)건우사종합건축사사무소서울특별시 강남구 역삼로 444-0562-06772020-05-12
9(주)건정종합건축사사무소서울특별시 강남구 선릉로112길 37-0554-00252020-05-12
사무소명도로명주소전화번호데이터기준일
763허서구건축사사무소서울특별시 강남구 영동대로112길 32-0, 302호(삼성동,한국문학번역원)<NA>2020-05-12
764현우재건축사사무소서울특별시 강남구 언주로 118-0, 우성캐릭터199오피스텔2001호<NA>2020-05-12
765협연건축사사무소서울특별시 강남구 강남대로110길 36-0557-35552020-05-12
766형공건축사사무소서울특별시 강남구 학동로101길 26-0, 4층 415-1호(청담동, 청담삼익상가)543-40122020-05-12
767홍건축사사무소 주식회사서울특별시 강남구 남부순환로 2645-0, 에이 451(도곡동, 한독빌딩)<NA>2020-05-12
768홍인건축사사무소서울특별시 강남구 삼성로 706-0, 추탄회관 4층 501호6409-69772020-05-12
769환경동인건축사사무소서울특별시 강남구 논현로 641-0, 대우아이빌323호548-11782020-05-12
770환경포럼건축사사무소서울특별시 강남구 도곡로3길 25-0, 삼성애니텔 714호2051-50202020-05-12
771황 어쏘시에이츠 건축사사무소서울특별시 강남구 역삼로 209-0, (역삼동)지하1층567-20122020-05-12
772희산건축사사무소서울특별시 강남구 학동로 423-0, 401호(청담동,청우빌딩)518-27272020-05-12

Duplicate rows

Most frequently occurring

사무소명도로명주소전화번호데이터기준일# duplicates
2(주)종합건축사사무소가람건축서울특별시 강남구 도산대로 165-0, (신사동)511-0361~52020-05-123
0(주)솔지원건축사사무소서울특별시 강남구 밤고개로24길 80-5514-56972020-05-122
1(주)에이스엔지니어링종합건축사사무소서울특별시 강남구 테헤란로25길 46-0, 서울빌딩 2층3142-56292020-05-122
3(주)지오.맥 건축사사무소서울특별시 강남구 언주로 546-0, 가영빌딩 401호556-80142020-05-122
4(주)천일건축엔지니어링종합건축사사무소서울특별시 강남구 개포로32길 7-0, (개포동)578-29212020-05-122
5(주)초이건축사사무소서울특별시 강남구 선릉로152길 11-0, 세진빌딩5층511-83422020-05-122
6주식회사 아이씨디건축사사무소서울특별시 강남구 개포로25길 3-4, 3층(개포동)529-72072020-05-122
7주식회사 청건축사사무소서울특별시 강남구 개포로22길 6-0, 4층(개포동,광혜빌딩)518-73012020-05-122