Overview

Dataset statistics

Number of variables4
Number of observations1576
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory49.4 KiB
Average record size in memory32.1 B

Variable types

Text4

Dataset

Description인천광역시 서구 부동산중개업소에 관한 데이터입니다. 등록번호, 업소명, 대표명, 사무소소재지 항목을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15121439/fileData.do

Alerts

Dataset has 2 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:08:47.398516
Analysis finished2023-12-12 22:08:48.185570
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1574
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
2023-12-13T07:08:48.374182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length14.302665
Min length8

Characters and Unicode

Total characters22541
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1572 ?
Unique (%)99.7%

Sample

1st row3380-2349
2nd row3380-2340
3rd row3380-2329
4th row3380-2326
5th row3380-2308
ValueCountFrequency (%)
28260-2023-00139 2
 
0.1%
28260-2020-00017 2
 
0.1%
28260-2020-00048 1
 
0.1%
28260-2021-00259 1
 
0.1%
28260-2021-00274 1
 
0.1%
28260-2021-00289 1
 
0.1%
28260-2021-00288 1
 
0.1%
28260-2021-00287 1
 
0.1%
28260-2021-00284 1
 
0.1%
28260-2021-00282 1
 
0.1%
Other values (1564) 1564
99.2%
2023-12-13T07:08:48.741338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6092
27.0%
2 5354
23.8%
- 2770
12.3%
8 2007
 
8.9%
6 1574
 
7.0%
3 1517
 
6.7%
1 1378
 
6.1%
4 521
 
2.3%
5 453
 
2.0%
7 448
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 19771
87.7%
Dash Punctuation 2770
 
12.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6092
30.8%
2 5354
27.1%
8 2007
 
10.2%
6 1574
 
8.0%
3 1517
 
7.7%
1 1378
 
7.0%
4 521
 
2.6%
5 453
 
2.3%
7 448
 
2.3%
9 427
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 2770
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22541
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6092
27.0%
2 5354
23.8%
- 2770
12.3%
8 2007
 
8.9%
6 1574
 
7.0%
3 1517
 
6.7%
1 1378
 
6.1%
4 521
 
2.3%
5 453
 
2.0%
7 448
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22541
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6092
27.0%
2 5354
23.8%
- 2770
12.3%
8 2007
 
8.9%
6 1574
 
7.0%
3 1517
 
6.7%
1 1378
 
6.1%
4 521
 
2.3%
5 453
 
2.0%
7 448
 
2.0%
Distinct1369
Distinct (%)86.9%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
2023-12-13T07:08:48.999877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length12.279188
Min length6

Characters and Unicode

Total characters19352
Distinct characters438
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1252 ?
Unique (%)79.4%

Sample

1st rowLG부동산공인중개사사무소
2nd row조은 공인중개사사무소
3rd row뉴신명럭키공인중개사사무소
4th row라인부동산공인중개사사무소
5th row행복공인중개사사무소
ValueCountFrequency (%)
공인중개사사무소 35
 
2.1%
삼성공인중개사사무소 9
 
0.6%
미래공인중개사사무소 7
 
0.4%
굿모닝공인중개사사무소 7
 
0.4%
에이스공인중개사사무소 6
 
0.4%
행운공인중개사사무소 5
 
0.3%
하나공인중개사사무소 5
 
0.3%
우리공인중개사사무소 5
 
0.3%
현대공인중개사사무소 5
 
0.3%
한솔공인중개사사무소 5
 
0.3%
Other values (1367) 1539
94.5%
2023-12-13T07:08:49.367906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3106
16.1%
1595
 
8.2%
1583
 
8.2%
1582
 
8.2%
1562
 
8.1%
1560
 
8.1%
1556
 
8.0%
523
 
2.7%
506
 
2.6%
501
 
2.6%
Other values (428) 5278
27.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19012
98.2%
Uppercase Letter 119
 
0.6%
Decimal Number 100
 
0.5%
Space Separator 52
 
0.3%
Open Punctuation 21
 
0.1%
Close Punctuation 21
 
0.1%
Lowercase Letter 20
 
0.1%
Dash Punctuation 3
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3106
16.3%
1595
 
8.4%
1583
 
8.3%
1582
 
8.3%
1562
 
8.2%
1560
 
8.2%
1556
 
8.2%
523
 
2.8%
506
 
2.7%
501
 
2.6%
Other values (385) 4938
26.0%
Uppercase Letter
ValueCountFrequency (%)
K 43
36.1%
S 34
28.6%
O 7
 
5.9%
H 5
 
4.2%
L 5
 
4.2%
G 4
 
3.4%
C 3
 
2.5%
E 3
 
2.5%
T 3
 
2.5%
A 2
 
1.7%
Other values (9) 10
 
8.4%
Lowercase Letter
ValueCountFrequency (%)
e 8
40.0%
c 2
 
10.0%
g 2
 
10.0%
n 2
 
10.0%
h 2
 
10.0%
u 1
 
5.0%
k 1
 
5.0%
o 1
 
5.0%
d 1
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 53
53.0%
2 15
 
15.0%
4 14
 
14.0%
0 6
 
6.0%
3 4
 
4.0%
7 4
 
4.0%
6 2
 
2.0%
5 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
52
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19010
98.2%
Common 200
 
1.0%
Latin 140
 
0.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3106
16.3%
1595
 
8.4%
1583
 
8.3%
1582
 
8.3%
1562
 
8.2%
1560
 
8.2%
1556
 
8.2%
523
 
2.8%
506
 
2.7%
501
 
2.6%
Other values (383) 4936
26.0%
Latin
ValueCountFrequency (%)
K 43
30.7%
S 34
24.3%
e 8
 
5.7%
O 7
 
5.0%
H 5
 
3.6%
L 5
 
3.6%
G 4
 
2.9%
C 3
 
2.1%
E 3
 
2.1%
T 3
 
2.1%
Other values (19) 25
17.9%
Common
ValueCountFrequency (%)
1 53
26.5%
52
26.0%
( 21
 
10.5%
) 21
 
10.5%
2 15
 
7.5%
4 14
 
7.0%
0 6
 
3.0%
3 4
 
2.0%
7 4
 
2.0%
- 3
 
1.5%
Other values (4) 7
 
3.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19010
98.2%
ASCII 339
 
1.8%
CJK 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3106
16.3%
1595
 
8.4%
1583
 
8.3%
1582
 
8.3%
1562
 
8.2%
1560
 
8.2%
1556
 
8.2%
523
 
2.8%
506
 
2.7%
501
 
2.6%
Other values (383) 4936
26.0%
ASCII
ValueCountFrequency (%)
1 53
15.6%
52
15.3%
K 43
12.7%
S 34
10.0%
( 21
 
6.2%
) 21
 
6.2%
2 15
 
4.4%
4 14
 
4.1%
e 8
 
2.4%
O 7
 
2.1%
Other values (32) 71
20.9%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct1494
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
2023-12-13T07:08:49.694301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9898477
Min length2

Characters and Unicode

Total characters4712
Distinct characters226
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1428 ?
Unique (%)90.6%

Sample

1st row이우현
2nd row차선희
3rd row전숙경
4th row김진아
5th row이윤자
ValueCountFrequency (%)
이미숙 7
 
0.4%
김영숙 5
 
0.3%
정명숙 4
 
0.3%
이현정 3
 
0.2%
정미경 3
 
0.2%
김성은 3
 
0.2%
김종숙 3
 
0.2%
김미숙 3
 
0.2%
김기연 3
 
0.2%
이영심 2
 
0.1%
Other values (1484) 1540
97.7%
2023-12-13T07:08:50.128190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
319
 
6.8%
250
 
5.3%
215
 
4.6%
167
 
3.5%
123
 
2.6%
120
 
2.5%
119
 
2.5%
111
 
2.4%
107
 
2.3%
105
 
2.2%
Other values (216) 3076
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4712
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
319
 
6.8%
250
 
5.3%
215
 
4.6%
167
 
3.5%
123
 
2.6%
120
 
2.5%
119
 
2.5%
111
 
2.4%
107
 
2.3%
105
 
2.2%
Other values (216) 3076
65.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4712
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
319
 
6.8%
250
 
5.3%
215
 
4.6%
167
 
3.5%
123
 
2.6%
120
 
2.5%
119
 
2.5%
111
 
2.4%
107
 
2.3%
105
 
2.2%
Other values (216) 3076
65.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4712
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
319
 
6.8%
250
 
5.3%
215
 
4.6%
167
 
3.5%
123
 
2.6%
120
 
2.5%
119
 
2.5%
111
 
2.4%
107
 
2.3%
105
 
2.2%
Other values (216) 3076
65.3%
Distinct1506
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
2023-12-13T07:08:50.409223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length49
Mean length35.125
Min length16

Characters and Unicode

Total characters55357
Distinct characters385
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1440 ?
Unique (%)91.4%

Sample

1st row인천광역시 서구 가정로394번길 14 (가정동)
2nd row인천광역시 서구 고래울로 6-1 (가좌동)
3rd row인천광역시 서구 승학로 447 상가101호(검암동, 신명아파트)
4th row인천광역시 서구 가정로 387 나동 102호(신현동)
5th row인천광역시 서구 검단로 489 (마전동)
ValueCountFrequency (%)
인천광역시 1577
 
16.5%
서구 1577
 
16.5%
상가동 109
 
1.1%
이음5로 97
 
1.0%
석남동 70
 
0.7%
상가 69
 
0.7%
단지내상가 68
 
0.7%
가정로 67
 
0.7%
1층 64
 
0.7%
청라커낼로 57
 
0.6%
Other values (2029) 5821
60.8%
2023-12-13T07:08:50.802342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8021
 
14.5%
1 2880
 
5.2%
2082
 
3.8%
1732
 
3.1%
1723
 
3.1%
1656
 
3.0%
1598
 
2.9%
1592
 
2.9%
1591
 
2.9%
1585
 
2.9%
Other values (375) 30897
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32698
59.1%
Decimal Number 10005
 
18.1%
Space Separator 8021
 
14.5%
Close Punctuation 1528
 
2.8%
Open Punctuation 1526
 
2.8%
Other Punctuation 1013
 
1.8%
Uppercase Letter 323
 
0.6%
Dash Punctuation 184
 
0.3%
Lowercase Letter 59
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2082
 
6.4%
1732
 
5.3%
1723
 
5.3%
1656
 
5.1%
1598
 
4.9%
1592
 
4.9%
1591
 
4.9%
1585
 
4.8%
1577
 
4.8%
1282
 
3.9%
Other values (321) 16280
49.8%
Uppercase Letter
ValueCountFrequency (%)
B 128
39.6%
S 40
 
12.4%
A 35
 
10.8%
K 32
 
9.9%
C 12
 
3.7%
J 9
 
2.8%
E 9
 
2.8%
D 9
 
2.8%
I 8
 
2.5%
V 8
 
2.5%
Other values (12) 33
 
10.2%
Lowercase Letter
ValueCountFrequency (%)
e 20
33.9%
s 10
16.9%
a 8
 
13.6%
d 7
 
11.9%
r 6
 
10.2%
p 2
 
3.4%
o 2
 
3.4%
h 1
 
1.7%
w 1
 
1.7%
i 1
 
1.7%
Decimal Number
ValueCountFrequency (%)
1 2880
28.8%
0 1492
14.9%
2 1281
12.8%
3 969
 
9.7%
4 721
 
7.2%
5 693
 
6.9%
6 580
 
5.8%
7 528
 
5.3%
8 466
 
4.7%
9 395
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 987
97.4%
@ 16
 
1.6%
. 4
 
0.4%
; 2
 
0.2%
& 2
 
0.2%
' 1
 
0.1%
/ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
8021
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1528
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1526
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 184
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32698
59.1%
Common 22277
40.2%
Latin 382
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2082
 
6.4%
1732
 
5.3%
1723
 
5.3%
1656
 
5.1%
1598
 
4.9%
1592
 
4.9%
1591
 
4.9%
1585
 
4.8%
1577
 
4.8%
1282
 
3.9%
Other values (321) 16280
49.8%
Latin
ValueCountFrequency (%)
B 128
33.5%
S 40
 
10.5%
A 35
 
9.2%
K 32
 
8.4%
e 20
 
5.2%
C 12
 
3.1%
s 10
 
2.6%
J 9
 
2.4%
E 9
 
2.4%
D 9
 
2.4%
Other values (23) 78
20.4%
Common
ValueCountFrequency (%)
8021
36.0%
1 2880
 
12.9%
) 1528
 
6.9%
( 1526
 
6.9%
0 1492
 
6.7%
2 1281
 
5.8%
, 987
 
4.4%
3 969
 
4.3%
4 721
 
3.2%
5 693
 
3.1%
Other values (11) 2179
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32698
59.1%
ASCII 22659
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8021
35.4%
1 2880
 
12.7%
) 1528
 
6.7%
( 1526
 
6.7%
0 1492
 
6.6%
2 1281
 
5.7%
, 987
 
4.4%
3 969
 
4.3%
4 721
 
3.2%
5 693
 
3.1%
Other values (44) 2561
 
11.3%
Hangul
ValueCountFrequency (%)
2082
 
6.4%
1732
 
5.3%
1723
 
5.3%
1656
 
5.1%
1598
 
4.9%
1592
 
4.9%
1591
 
4.9%
1585
 
4.8%
1577
 
4.8%
1282
 
3.9%
Other values (321) 16280
49.8%

Missing values

2023-12-13T07:08:48.049767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:08:48.142918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호업소명대표명사무소소재지
03380-2349LG부동산공인중개사사무소이우현인천광역시 서구 가정로394번길 14 (가정동)
13380-2340조은 공인중개사사무소차선희인천광역시 서구 고래울로 6-1 (가좌동)
23380-2329뉴신명럭키공인중개사사무소전숙경인천광역시 서구 승학로 447 상가101호(검암동, 신명아파트)
33380-2326라인부동산공인중개사사무소김진아인천광역시 서구 가정로 387 나동 102호(신현동)
43380-2308행복공인중개사사무소이윤자인천광역시 서구 검단로 489 (마전동)
53380-2262하나공인중개사사무소신향자인천광역시 서구 가경주로40번길 9 (가정동)
63380-2255행복가득공인중개사사무소조관식인천광역시 서구 승학로 278 오륜프라자 107호(심곡동)
73380-2259신동아공인중개사사무소전순옥인천광역시 서구 서달로123번길 12-4 ,상가101호(석남동,신동아아파트)
83380-2258청라레이크파크공인중개사사무소이유희인천광역시 서구 크리스탈로74번길 26 상가동104호(청라동,청라더샵레이크파크)
93380-2224삼성공인중개사사무소박인순인천광역시 서구 거북로109번길 7 (석남동)
등록번호업소명대표명사무소소재지
156628260-2023-00157반올림공인중개사사무소권윤정인천광역시 서구 가정로336번길 1-4 1층(가정동)
156728260-2023-00158나루부동산공인중개사사무소허동하인천광역시 서구 서곶로 50 판매8동 F114호(가정동, 루원시티대성베르힐더센트로)
156828260-2023-00159현공인중개사사무소이현규인천광역시 서구 열우물로240번길 27 1층(가좌동)
156928260-2023-00160가좌럭키부동산공인중개사사무소이수관인천광역시 서구 가석로156번길 50 (가좌동)
157028260-2023-00161베스트공인중개사사무소박원철인천광역시 서구 가남로 295 , 2층
157128260-2023-0016221세기부동산공인중개사사무소남성이인천광역시 서구 이음2로 30 단지내상가 1-B107호(624동 B107호)
157228260-2023-00163검단SK부동산공인중개사사무소이진선인천광역시 서구 독정로 17 제115동 제110호
157328260-2023-00164샘공인중개사사무소차정한인천광역시 서구 가정로351번길 6
157428260-2023-00165청라창조공인중개사사무소심하연인천광역시 서구 청라커낼로 300 , 판매2동 112호(청라동)
157528260-2023-00166경남공인중개사사무소김태성인천광역시 서구 경명대로694번길 10 상가동, 101호(공촌동, 경남아파트)

Duplicate rows

Most frequently occurring

등록번호업소명대표명사무소소재지# duplicates
028260-2020-00017청라한양에이스부동산공인중개사사무소장미현인천광역시 서구 청라한내로 132 상가1동112호(청라동, 청라한양수자인레이크블루)2
128260-2023-00139(주)미래부동산중개법인정명교인천광역시 서구 봉수대로 295 , 1층 106호(석남동)2