Overview

Dataset statistics

Number of variables5
Number of observations915
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory35.9 KiB
Average record size in memory40.1 B

Variable types

Text3
Categorical2

Dataset

Description광주광역시 서구 관내 부동산중개업소 현황입니다. 등록번호 행정처분상태 사무소명 중개업자구분 도로명주소의 정보를 제공합니다.
Author광주광역시 서구
URLhttps://www.data.go.kr/data/15011810/fileData.do

Alerts

행정처분상태 is highly imbalanced (98.8%)Imbalance
중개업자구분 is highly imbalanced (85.2%)Imbalance
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:25:02.031348
Analysis finished2023-12-12 08:25:03.107900
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct915
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
2023-12-12T17:25:03.306207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length14.314754
Min length9

Characters and Unicode

Total characters13098
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique915 ?
Unique (%)100.0%

Sample

1st row가4104-389
2nd row29140-2016-00130
3rd row가4104-430
4th row가4104-450
5th row가4104-134
ValueCountFrequency (%)
가4104-389 1
 
0.1%
29140-2018-00134 1
 
0.1%
29140-2018-00182 1
 
0.1%
29140-2018-00142 1
 
0.1%
29140-2018-00152 1
 
0.1%
29140-2018-00153 1
 
0.1%
29140-2018-00116 1
 
0.1%
29140-2018-00158 1
 
0.1%
29140-2018-00159 1
 
0.1%
29140-2018-00118 1
 
0.1%
Other values (905) 905
98.9%
2023-12-12T17:25:03.796381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3633
27.7%
1 1988
15.2%
2 1960
15.0%
- 1579
12.1%
4 1375
 
10.5%
9 977
 
7.5%
8 296
 
2.3%
5 292
 
2.2%
7 266
 
2.0%
6 261
 
2.0%
Other values (3) 471
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11269
86.0%
Dash Punctuation 1579
 
12.1%
Other Letter 250
 
1.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3633
32.2%
1 1988
17.6%
2 1960
17.4%
4 1375
 
12.2%
9 977
 
8.7%
8 296
 
2.6%
5 292
 
2.6%
7 266
 
2.4%
6 261
 
2.3%
3 221
 
2.0%
Other Letter
ValueCountFrequency (%)
247
98.8%
3
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 1579
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12848
98.1%
Hangul 250
 
1.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3633
28.3%
1 1988
15.5%
2 1960
15.3%
- 1579
12.3%
4 1375
 
10.7%
9 977
 
7.6%
8 296
 
2.3%
5 292
 
2.3%
7 266
 
2.1%
6 261
 
2.0%
Hangul
ValueCountFrequency (%)
247
98.8%
3
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12848
98.1%
Hangul 250
 
1.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3633
28.3%
1 1988
15.5%
2 1960
15.3%
- 1579
12.3%
4 1375
 
10.7%
9 977
 
7.6%
8 296
 
2.3%
5 292
 
2.3%
7 266
 
2.1%
6 261
 
2.0%
Hangul
ValueCountFrequency (%)
247
98.8%
3
 
1.2%

행정처분상태
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
영업중
914 
휴업
 
1

Length

Max length3
Median length3
Mean length2.9989071
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 914
99.9%
휴업 1
 
0.1%

Length

2023-12-12T17:25:03.968518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:25:04.085695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 914
99.9%
휴업 1
 
0.1%
Distinct763
Distinct (%)83.4%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
2023-12-12T17:25:04.352141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length10.987978
Min length7

Characters and Unicode

Total characters10054
Distinct characters402
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique674 ?
Unique (%)73.7%

Sample

1st row금화부동산중개
2nd row둥지부동산중개사무소
3rd row천지부동산중개사무소
4th row한국부동산중개인사무소
5th row광주부동산중개
ValueCountFrequency (%)
공인중개사사무소 44
 
4.5%
한국공인중개사사무소 6
 
0.6%
대박공인중개사사무소 5
 
0.5%
예스공인중개사사무소 5
 
0.5%
우리공인중개사사무소 5
 
0.5%
삼성공인중개사사무소 5
 
0.5%
으뜸공인중개사사무소 4
 
0.4%
사무소 4
 
0.4%
행복공인중개사사무소 4
 
0.4%
명가공인중개사사무소 4
 
0.4%
Other values (764) 888
91.2%
2023-12-12T17:25:04.930565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1740
17.3%
926
 
9.2%
925
 
9.2%
921
 
9.2%
891
 
8.9%
872
 
8.7%
844
 
8.4%
132
 
1.3%
119
 
1.2%
111
 
1.1%
Other values (392) 2573
25.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9705
96.5%
Decimal Number 108
 
1.1%
Uppercase Letter 76
 
0.8%
Space Separator 59
 
0.6%
Lowercase Letter 39
 
0.4%
Open Punctuation 19
 
0.2%
Close Punctuation 19
 
0.2%
Letter Number 12
 
0.1%
Math Symbol 8
 
0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1740
17.9%
926
9.5%
925
9.5%
921
9.5%
891
9.2%
872
 
9.0%
844
 
8.7%
132
 
1.4%
119
 
1.2%
111
 
1.1%
Other values (342) 2224
22.9%
Uppercase Letter
ValueCountFrequency (%)
K 24
31.6%
S 13
17.1%
O 10
13.2%
T 6
 
7.9%
M 5
 
6.6%
J 4
 
5.3%
D 3
 
3.9%
N 3
 
3.9%
Y 2
 
2.6%
H 1
 
1.3%
Other values (5) 5
 
6.6%
Lowercase Letter
ValueCountFrequency (%)
e 16
41.0%
h 5
 
12.8%
a 3
 
7.7%
t 3
 
7.7%
l 2
 
5.1%
b 2
 
5.1%
u 1
 
2.6%
w 1
 
2.6%
k 1
 
2.6%
n 1
 
2.6%
Other values (4) 4
 
10.3%
Decimal Number
ValueCountFrequency (%)
1 72
66.7%
3 9
 
8.3%
5 8
 
7.4%
4 6
 
5.6%
2 5
 
4.6%
6 4
 
3.7%
0 2
 
1.9%
9 2
 
1.9%
Letter Number
ValueCountFrequency (%)
9
75.0%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
· 2
40.0%
1
20.0%
# 1
20.0%
& 1
20.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Math Symbol
ValueCountFrequency (%)
+ 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9700
96.5%
Common 222
 
2.2%
Latin 127
 
1.3%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1740
17.9%
926
9.5%
925
9.5%
921
9.5%
891
9.2%
872
 
9.0%
844
 
8.7%
132
 
1.4%
119
 
1.2%
111
 
1.1%
Other values (337) 2219
22.9%
Latin
ValueCountFrequency (%)
K 24
18.9%
e 16
12.6%
S 13
10.2%
O 10
 
7.9%
9
 
7.1%
T 6
 
4.7%
h 5
 
3.9%
M 5
 
3.9%
J 4
 
3.1%
a 3
 
2.4%
Other values (23) 32
25.2%
Common
ValueCountFrequency (%)
1 72
32.4%
59
26.6%
( 19
 
8.6%
) 19
 
8.6%
3 9
 
4.1%
5 8
 
3.6%
+ 8
 
3.6%
4 6
 
2.7%
2 5
 
2.3%
- 4
 
1.8%
Other values (7) 13
 
5.9%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9700
96.5%
ASCII 334
 
3.3%
Number Forms 12
 
0.1%
CJK 5
 
< 0.1%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1740
17.9%
926
9.5%
925
9.5%
921
9.5%
891
9.2%
872
 
9.0%
844
 
8.7%
132
 
1.4%
119
 
1.2%
111
 
1.1%
Other values (337) 2219
22.9%
ASCII
ValueCountFrequency (%)
1 72
21.6%
59
17.7%
K 24
 
7.2%
( 19
 
5.7%
) 19
 
5.7%
e 16
 
4.8%
S 13
 
3.9%
O 10
 
3.0%
3 9
 
2.7%
5 8
 
2.4%
Other values (34) 85
25.4%
Number Forms
ValueCountFrequency (%)
9
75.0%
1
 
8.3%
1
 
8.3%
1
 
8.3%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

중개업자구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
공인중개사
886 
중개인
 
15
법인
 
14

Length

Max length5
Median length5
Mean length4.9213115
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중개인
2nd row중개인
3rd row중개인
4th row중개인
5th row중개인

Common Values

ValueCountFrequency (%)
공인중개사 886
96.8%
중개인 15
 
1.6%
법인 14
 
1.5%

Length

2023-12-12T17:25:05.135266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:25:05.269686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공인중개사 886
96.8%
중개인 15
 
1.6%
법인 14
 
1.5%
Distinct809
Distinct (%)88.5%
Missing1
Missing (%)0.1%
Memory size7.3 KiB
2023-12-12T17:25:05.534316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length41
Mean length31.044858
Min length10

Characters and Unicode

Total characters28375
Distinct characters260
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique728 ?
Unique (%)79.6%

Sample

1st row광주광역시 서구 화정로253번길 30(농성동)
2nd row광주광역시 서구 화운로 35, 대주아파트상가(화정동)
3rd row광주광역시 서구 치평로 20, 가동 4층 404호(치평동)
4th row광주광역시 서구 풍서좌로 153(매월동)
5th row광주광역시 서구 군분로 182-1(농성동)
ValueCountFrequency (%)
광주광역시 910
 
17.9%
서구 910
 
17.9%
상가동 127
 
2.5%
1층(쌍촌동 48
 
0.9%
시청로 45
 
0.9%
1층(화정동 39
 
0.8%
치평로 37
 
0.7%
1층 35
 
0.7%
상무대로 34
 
0.7%
1층(광천동 33
 
0.6%
Other values (1069) 2863
56.3%
2023-12-12T17:25:06.043662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4167
 
14.7%
1915
 
6.7%
1 1636
 
5.8%
1189
 
4.2%
1005
 
3.5%
962
 
3.4%
925
 
3.3%
914
 
3.2%
910
 
3.2%
896
 
3.2%
Other values (250) 13856
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16629
58.6%
Decimal Number 4873
 
17.2%
Space Separator 4167
 
14.7%
Close Punctuation 877
 
3.1%
Open Punctuation 877
 
3.1%
Other Punctuation 749
 
2.6%
Dash Punctuation 130
 
0.5%
Uppercase Letter 68
 
0.2%
Lowercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1915
 
11.5%
1189
 
7.2%
1005
 
6.0%
962
 
5.8%
925
 
5.6%
914
 
5.5%
910
 
5.5%
896
 
5.4%
592
 
3.6%
478
 
2.9%
Other values (221) 6843
41.2%
Decimal Number
ValueCountFrequency (%)
1 1636
33.6%
0 577
 
11.8%
2 551
 
11.3%
3 437
 
9.0%
4 394
 
8.1%
7 309
 
6.3%
5 286
 
5.9%
9 237
 
4.9%
6 230
 
4.7%
8 216
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
A 22
32.4%
S 16
23.5%
K 11
16.2%
B 9
13.2%
C 5
 
7.4%
W 1
 
1.5%
P 1
 
1.5%
I 1
 
1.5%
V 1
 
1.5%
E 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 743
99.2%
& 3
 
0.4%
@ 3
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
80.0%
s 1
 
20.0%
Space Separator
ValueCountFrequency (%)
4167
100.0%
Close Punctuation
ValueCountFrequency (%)
) 877
100.0%
Open Punctuation
ValueCountFrequency (%)
( 877
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16629
58.6%
Common 11673
41.1%
Latin 73
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1915
 
11.5%
1189
 
7.2%
1005
 
6.0%
962
 
5.8%
925
 
5.6%
914
 
5.5%
910
 
5.5%
896
 
5.4%
592
 
3.6%
478
 
2.9%
Other values (221) 6843
41.2%
Common
ValueCountFrequency (%)
4167
35.7%
1 1636
 
14.0%
) 877
 
7.5%
( 877
 
7.5%
, 743
 
6.4%
0 577
 
4.9%
2 551
 
4.7%
3 437
 
3.7%
4 394
 
3.4%
7 309
 
2.6%
Other values (7) 1105
 
9.5%
Latin
ValueCountFrequency (%)
A 22
30.1%
S 16
21.9%
K 11
15.1%
B 9
12.3%
C 5
 
6.8%
e 4
 
5.5%
s 1
 
1.4%
W 1
 
1.4%
P 1
 
1.4%
I 1
 
1.4%
Other values (2) 2
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16629
58.6%
ASCII 11746
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4167
35.5%
1 1636
 
13.9%
) 877
 
7.5%
( 877
 
7.5%
, 743
 
6.3%
0 577
 
4.9%
2 551
 
4.7%
3 437
 
3.7%
4 394
 
3.4%
7 309
 
2.6%
Other values (19) 1178
 
10.0%
Hangul
ValueCountFrequency (%)
1915
 
11.5%
1189
 
7.2%
1005
 
6.0%
962
 
5.8%
925
 
5.6%
914
 
5.5%
910
 
5.5%
896
 
5.4%
592
 
3.6%
478
 
2.9%
Other values (221) 6843
41.2%

Correlations

2023-12-12T17:25:06.155032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정처분상태중개업자구분
행정처분상태1.0000.000
중개업자구분0.0001.000
2023-12-12T17:25:06.249221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중개업자구분행정처분상태
중개업자구분1.0000.000
행정처분상태0.0001.000
2023-12-12T17:25:06.368907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정처분상태중개업자구분
행정처분상태1.0000.000
중개업자구분0.0001.000

Missing values

2023-12-12T17:25:02.883107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:25:03.048249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호행정처분상태사무소명중개업자구분도로명주소
0가4104-389영업중금화부동산중개중개인광주광역시 서구 화정로253번길 30(농성동)
129140-2016-00130영업중둥지부동산중개사무소중개인광주광역시 서구 화운로 35, 대주아파트상가(화정동)
2가4104-430영업중천지부동산중개사무소중개인광주광역시 서구 치평로 20, 가동 4층 404호(치평동)
3가4104-450영업중한국부동산중개인사무소중개인광주광역시 서구 풍서좌로 153(매월동)
4가4104-134영업중광주부동산중개중개인광주광역시 서구 군분로 182-1(농성동)
5가4104-192영업중무등부동산중개사무소중개인광주광역시 서구 월드컵4강로182번길 3(내방동)
629140-2015-00122영업중요한부동산중개사무소중개인광주광역시 서구 군분로169번길 17-1(화정동)
729140-2015-00242영업중가온부동산중개사무소중개인광주광역시 서구 상무중앙로 84, 201호(치평동)
8가4104-1937영업중태흥부동산중개사무소중개인광주광역시 서구 상무대로1206번길 6-2, 1층(농성동)
9나4104-103영업중금호부동산중개인사무소중개인광주광역시 서구 경열로 95-1(농성동)
등록번호행정처분상태사무소명중개업자구분도로명주소
90529140-2018-00044영업중(주)스마일부동산중개법인법인광주광역시 서구 금호운천길 84, 1층(쌍촌동)
90629140-2017-00204영업중(주)화정역부동산중개법인법인광주광역시 서구 상무대로 1046, 1층(화정동)
90729140-2017-00180영업중(주)정직한부동산중개법인법인광주광역시 서구 매월2로15번길 16, 매월종합상가 205-220(매월동)
90829140-2018-00203영업중유한회사모아부동산중개법인법인광주광역시 서구 화정로 129, 모아아파트 상가 105동 113호(화정동)
90929140-2019-00040영업중주식회사조은날부동산중개법인광주광역시 서구 내방로 433, 농성지오스테이션 101호(농성동)
91029140-2018-00195영업중(주)스카이부동산중개법인법인광주광역시 서구 죽봉대로119번길 11, 1층(광천동)
91129140-2020-00125영업중(주)에스앤에스부동산중개법인법인광주광역시 서구 죽봉대로 105, 6층(광천동)
91229140-2021-00027영업중드림부동산공인중개사사무소법인광주광역시 서구 무진대로 503, 2층(덕흥동)
913가4104-2222영업중중개법인 리치부동산관리(주)법인광주광역시 서구 화운로 187, 3층(화정동)
91429140-2019-00091휴업친절한공인중개사사무소공인중개사광주광역시 서구 월드컵4강로 32, 1층(화정동)