Overview

Dataset statistics

Number of variables5
Number of observations547
Missing cells41
Missing cells (%)1.5%
Duplicate rows14
Duplicate rows (%)2.6%
Total size in memory21.5 KiB
Average record size in memory40.2 B

Variable types

DateTime1
Text3
Categorical1

Dataset

Description부동산중개업 등록 및 폐업 현황관내에 개설등록 된 개업공인중개사의 상호, 주소 등 정보를 제공하며 폐업한 공인중개사사무소이 폐업일을 등록하여관내에 부동산중개업소를 방문하는 고객들이 참고하시도록 정보를 제공하고자 합니다.
Author경상북도 영천시
URLhttps://www.data.go.kr/data/15085149/fileData.do

Alerts

Dataset has 14 (2.6%) duplicate rowsDuplicates
사무소주소 has 41 (7.5%) missing valuesMissing

Reproduction

Analysis started2024-03-14 15:57:51.535821
Analysis finished2024-03-14 15:57:52.350735
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct488
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
Minimum1986-04-17 00:00:00
Maximum2023-12-20 00:00:00
2024-03-15T00:57:52.483260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:57:52.929949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct349
Distinct (%)63.8%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T00:57:53.782899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length10
Mean length10.700183
Min length9

Characters and Unicode

Total characters5853
Distinct characters251
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique236 ?
Unique (%)43.1%

Sample

1st row대한공인중개사사무소
2nd row행복을주는공인중개사사무소
3rd row조양부동산중개사무소
4th row보금자리공인중개사사무소
5th row이편한부동산중개사무소
ValueCountFrequency (%)
사무소 9
 
1.6%
행운공인중개사사무소 6
 
1.1%
신화공인중개사사무소 6
 
1.1%
25시공인중개사사무소 6
 
1.1%
한솔공인중개사사무소 6
 
1.1%
도림공인중개사사무소 5
 
0.9%
공인중개사사무소 5
 
0.9%
대한공인중개사사무소 5
 
0.9%
드림공인중개사사무소 5
 
0.9%
태영공인중개사사무소 5
 
0.9%
Other values (342) 504
89.7%
2024-03-15T00:57:54.829648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1050
17.9%
557
9.5%
547
9.3%
547
9.3%
545
9.3%
516
 
8.8%
503
 
8.6%
101
 
1.7%
97
 
1.7%
95
 
1.6%
Other values (241) 1295
22.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5742
98.1%
Decimal Number 72
 
1.2%
Uppercase Letter 20
 
0.3%
Space Separator 15
 
0.3%
Other Punctuation 2
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1050
18.3%
557
9.7%
547
9.5%
547
9.5%
545
9.5%
516
9.0%
503
8.8%
101
 
1.8%
97
 
1.7%
95
 
1.7%
Other values (218) 1184
20.6%
Uppercase Letter
ValueCountFrequency (%)
C 4
20.0%
I 3
15.0%
O 3
15.0%
K 3
15.0%
E 2
10.0%
S 1
 
5.0%
W 1
 
5.0%
N 1
 
5.0%
T 1
 
5.0%
V 1
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 33
45.8%
5 9
 
12.5%
4 9
 
12.5%
2 9
 
12.5%
6 3
 
4.2%
3 3
 
4.2%
0 3
 
4.2%
8 2
 
2.8%
9 1
 
1.4%
Space Separator
ValueCountFrequency (%)
15
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5742
98.1%
Common 91
 
1.6%
Latin 20
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1050
18.3%
557
9.7%
547
9.5%
547
9.5%
545
9.5%
516
9.0%
503
8.8%
101
 
1.8%
97
 
1.7%
95
 
1.7%
Other values (218) 1184
20.6%
Common
ValueCountFrequency (%)
1 33
36.3%
15
16.5%
5 9
 
9.9%
4 9
 
9.9%
2 9
 
9.9%
6 3
 
3.3%
3 3
 
3.3%
0 3
 
3.3%
8 2
 
2.2%
/ 2
 
2.2%
Other values (3) 3
 
3.3%
Latin
ValueCountFrequency (%)
C 4
20.0%
I 3
15.0%
O 3
15.0%
K 3
15.0%
E 2
10.0%
S 1
 
5.0%
W 1
 
5.0%
N 1
 
5.0%
T 1
 
5.0%
V 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5742
98.1%
ASCII 111
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1050
18.3%
557
9.7%
547
9.5%
547
9.5%
545
9.5%
516
9.0%
503
8.8%
101
 
1.8%
97
 
1.7%
95
 
1.7%
Other values (218) 1184
20.6%
ASCII
ValueCountFrequency (%)
1 33
29.7%
15
13.5%
5 9
 
8.1%
4 9
 
8.1%
2 9
 
8.1%
C 4
 
3.6%
6 3
 
2.7%
3 3
 
2.7%
0 3
 
2.7%
I 3
 
2.7%
Other values (13) 20
18.0%
Distinct531
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T00:57:56.206179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length9
Mean length11.846435
Min length7

Characters and Unicode

Total characters6480
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique515 ?
Unique (%)94.1%

Sample

1st row47230-2023-00009
2nd row47230-2023-00007
3rd row47230-2023-00006
4th row47230-2023-00005
5th row47230-2023-00003
ValueCountFrequency (%)
가4214-158 2
 
0.4%
가4214-69 2
 
0.4%
가4214-67 2
 
0.4%
47230-2016-00023 2
 
0.4%
가4214-161 2
 
0.4%
가4214-94 2
 
0.4%
가4214-203 2
 
0.4%
가4214-213 2
 
0.4%
가4214-133 2
 
0.4%
가4214-101 2
 
0.4%
Other values (521) 527
96.3%
2024-03-15T00:57:57.826910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1314
20.3%
2 1047
16.2%
4 1039
16.0%
- 776
12.0%
1 676
10.4%
3 445
 
6.9%
7 332
 
5.1%
314
 
4.8%
5 182
 
2.8%
6 130
 
2.0%
Other values (3) 225
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5384
83.1%
Dash Punctuation 776
 
12.0%
Other Letter 320
 
4.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1314
24.4%
2 1047
19.4%
4 1039
19.3%
1 676
12.6%
3 445
 
8.3%
7 332
 
6.2%
5 182
 
3.4%
6 130
 
2.4%
8 111
 
2.1%
9 108
 
2.0%
Other Letter
ValueCountFrequency (%)
314
98.1%
6
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 776
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6160
95.1%
Hangul 320
 
4.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1314
21.3%
2 1047
17.0%
4 1039
16.9%
- 776
12.6%
1 676
11.0%
3 445
 
7.2%
7 332
 
5.4%
5 182
 
3.0%
6 130
 
2.1%
8 111
 
1.8%
Hangul
ValueCountFrequency (%)
314
98.1%
6
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6160
95.1%
Hangul 320
 
4.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1314
21.3%
2 1047
17.0%
4 1039
16.9%
- 776
12.6%
1 676
11.0%
3 445
 
7.2%
7 332
 
5.4%
5 182
 
3.0%
6 130
 
2.1%
8 111
 
1.8%
Hangul
ValueCountFrequency (%)
314
98.1%
6
 
1.9%

업소상태
Categorical

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
폐업
382 
영업중
163 
휴업
 
2

Length

Max length3
Median length2
Mean length2.297989
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
폐업 382
69.8%
영업중 163
29.8%
휴업 2
 
0.4%

Length

2024-03-15T00:57:58.261826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:57:58.593603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 382
69.8%
영업중 163
29.8%
휴업 2
 
0.4%

사무소주소
Text

MISSING 

Distinct336
Distinct (%)66.4%
Missing41
Missing (%)7.5%
Memory size4.4 KiB
2024-03-15T00:57:59.863390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length47
Mean length23.23913
Min length16

Characters and Unicode

Total characters11759
Distinct characters152
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique239 ?
Unique (%)47.2%

Sample

1st row경상북도 영천시 호국로 124 (망정동)
2nd row경상북도 영천시 영천아이시로 79 (봉동)
3rd row경상북도 영천시 호국로 93-1 (야사동)
4th row경상북도 영천시 창신1길 5 , 상가동 110호 (망정동, 창신타운)
5th row경상북도 영천시 시장로 120 107호 (완산동)
ValueCountFrequency (%)
경상북도 506
18.9%
영천시 506
18.9%
금호읍 91
 
3.4%
완산동 66
 
2.5%
천문로 56
 
2.1%
한방로 52
 
1.9%
호국로 50
 
1.9%
금노동 46
 
1.7%
금호로 41
 
1.5%
도동 40
 
1.5%
Other values (376) 1218
45.6%
2024-03-15T00:58:02.000460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2275
19.3%
594
 
5.1%
555
 
4.7%
553
 
4.7%
542
 
4.6%
531
 
4.5%
519
 
4.4%
518
 
4.4%
419
 
3.6%
1 414
 
3.5%
Other values (142) 4839
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6979
59.4%
Space Separator 2275
 
19.3%
Decimal Number 1580
 
13.4%
Close Punctuation 389
 
3.3%
Open Punctuation 389
 
3.3%
Dash Punctuation 95
 
0.8%
Other Punctuation 47
 
0.4%
Uppercase Letter 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
594
 
8.5%
555
 
8.0%
553
 
7.9%
542
 
7.8%
531
 
7.6%
519
 
7.4%
518
 
7.4%
419
 
6.0%
378
 
5.4%
223
 
3.2%
Other values (123) 2147
30.8%
Decimal Number
ValueCountFrequency (%)
1 414
26.2%
2 205
13.0%
3 152
 
9.6%
6 139
 
8.8%
0 135
 
8.5%
4 127
 
8.0%
5 118
 
7.5%
9 110
 
7.0%
7 94
 
5.9%
8 86
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
H 1
33.3%
T 1
33.3%
Space Separator
ValueCountFrequency (%)
2275
100.0%
Close Punctuation
ValueCountFrequency (%)
) 389
100.0%
Open Punctuation
ValueCountFrequency (%)
( 389
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 95
100.0%
Other Punctuation
ValueCountFrequency (%)
, 47
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6979
59.4%
Common 4775
40.6%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
594
 
8.5%
555
 
8.0%
553
 
7.9%
542
 
7.8%
531
 
7.6%
519
 
7.4%
518
 
7.4%
419
 
6.0%
378
 
5.4%
223
 
3.2%
Other values (123) 2147
30.8%
Common
ValueCountFrequency (%)
2275
47.6%
1 414
 
8.7%
) 389
 
8.1%
( 389
 
8.1%
2 205
 
4.3%
3 152
 
3.2%
6 139
 
2.9%
0 135
 
2.8%
4 127
 
2.7%
5 118
 
2.5%
Other values (5) 432
 
9.0%
Latin
ValueCountFrequency (%)
e 2
40.0%
E 1
20.0%
H 1
20.0%
T 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6979
59.4%
ASCII 4780
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2275
47.6%
1 414
 
8.7%
) 389
 
8.1%
( 389
 
8.1%
2 205
 
4.3%
3 152
 
3.2%
6 139
 
2.9%
0 135
 
2.8%
4 127
 
2.7%
5 118
 
2.5%
Other values (9) 437
 
9.1%
Hangul
ValueCountFrequency (%)
594
 
8.5%
555
 
8.0%
553
 
7.9%
542
 
7.8%
531
 
7.6%
519
 
7.4%
518
 
7.4%
419
 
6.0%
378
 
5.4%
223
 
3.2%
Other values (123) 2147
30.8%

Missing values

2024-03-15T00:57:52.063843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:57:52.283566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록일중개업소명등록번호업소상태사무소주소
02023-12-20대한공인중개사사무소47230-2023-00009영업중경상북도 영천시 호국로 124 (망정동)
12023-04-21행복을주는공인중개사사무소47230-2023-00007영업중경상북도 영천시 영천아이시로 79 (봉동)
22023-03-22조양부동산중개사무소47230-2023-00006영업중경상북도 영천시 호국로 93-1 (야사동)
32023-03-16보금자리공인중개사사무소47230-2023-00005영업중경상북도 영천시 창신1길 5 , 상가동 110호 (망정동, 창신타운)
42023-02-22이편한부동산중개사무소47230-2023-00003영업중경상북도 영천시 시장로 120 107호 (완산동)
52023-01-31천지공인중개사사무소47230-2023-00002영업중경상북도 영천시 완산로 14 (완산동)
62023-01-12미소지움2차부동산중개사무소47230-2023-00001영업중경상북도 영천시 완산5길 111 상가동 109호 (완산동, 영천완산미소지움2차)
72022-12-28영천남부공인중개사사무소47230-2022-00022영업중경상북도 영천시 한방로 61 (도동)
82022-12-27대길공인중개사사무소47230-2022-00020영업중경상북도 영천시 금호읍 금호로 51
92022-12-27태건공인중개사사무소47230-2022-00021영업중경상북도 영천시 약전길 33 (완산동)
등록일중개업소명등록번호업소상태사무소주소
5371989-07-13경북공인중개사사무소가4214-76폐업경상북도 영천시 동문길 139 (문외동)
5381989-04-15새주남부동산중개인사무소나4214-17폐업경상북도 영천시 강변로 62 (금노동)
5391989-04-11대성부동산중개인사무소나4214-209폐업경상북도 영천시 금호읍 금호로 149-1 (금호읍)
5401989-04-11대성부동산중개인사무소나4214-39폐업경상북도 영천시 호국로 228 (조교동)
5411989-04-11조양부동산중개인사무소나4214-34폐업경상북도 영천시 호국로 95 (야사동)
5421989-03-30신녕부동산중개사무소가4238-13폐업경상북도 영천시 신녕면 장수로 1681
5431989-03-23조은부동산중개 사무소나4214-10폐업경상북도 영천시 완산로 23 (완산동)
5441989-02-21대창부동산중개인사무소가4238-12폐업<NA>
5451989-02-04조양부동산중개인사무소가4238-11폐업경상북도 영천시 호국로 93-1 (야사동)
5461988-12-09화산부동산중개인사무소가4238-10폐업경상북도 영천시 화산면 장수로 922 (화산면)

Duplicate rows

Most frequently occurring

등록일중개업소명등록번호업소상태사무소주소# duplicates
01986-04-17영천공인중개사사무소가4214-53영업중경상북도 영천시 망정로 27-4 (망정동)2
11986-11-03대창1번지부동산공인중개사사무소47230-2022-00016영업중경상북도 영천시 대창면 금창로 6872
21988-02-04금호공인중개사사무소가4238-9영업중경상북도 영천시 금호읍 금호로 142-22
31988-04-04대신부동산중개사무소가4214-67영업중경상북도 영천시 동문길 131 (문외동)2
41988-04-04한일공인중개사사무소가4214-69영업중경상북도 영천시 강변로 49 (금노동)2
51994-06-04동은공인중개사사무소가4214-94영업중경상북도 영천시 금호읍 금호로 1062
61994-06-04창성공인중개사사무소가4214-93영업중경상북도 영천시 호국로 174 (조교동)2
71995-04-20태창공인중개사사무소가4214-101영업중경상북도 영천시 언하공단로 95 (망정동)2
82000-05-25영화공인중개사사무소가4214-133영업중경상북도 영천시 호국로 89 (야사동)2
92003-07-03청통공인중개사사무소가4214-161영업중경상북도 영천시 청통면 금송로 9462