Overview

Dataset statistics

Number of variables5
Number of observations933
Missing cells245
Missing cells (%)5.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory36.6 KiB
Average record size in memory40.1 B

Variable types

Categorical1
Text4

Dataset

Description부산진구 관내 소재 부동산 중개업소 현황에 관한 데이터이며 등록번호, 사무소 명칭, 대표자,사무소 연락처 및 주소 정보를 포함하고 있습니다.
Author부산광역시 부산진구
URLhttps://www.data.go.kr/data/15007198/fileData.do

Alerts

시군구 has constant value ""Constant
사무소전화번호 has 245 (26.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 09:14:13.955975
Analysis finished2023-12-12 09:14:14.645448
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
부산광역시 부산진구
933 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 부산진구
2nd row부산광역시 부산진구
3rd row부산광역시 부산진구
4th row부산광역시 부산진구
5th row부산광역시 부산진구

Common Values

ValueCountFrequency (%)
부산광역시 부산진구 933
100.0%

Length

2023-12-12T18:14:14.708600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:14:14.801152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 933
50.0%
부산진구 933
50.0%
Distinct932
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-12T18:14:15.072926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length14.090032
Min length7

Characters and Unicode

Total characters13146
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique931 ?
Unique (%)99.8%

Sample

1st row가-05-230
2nd row가-05-2376
3rd row나-05-86
4th row나-05-91
5th row26230-2019-00168
ValueCountFrequency (%)
26230-2017-00146 2
 
0.2%
26230-2021-00029 1
 
0.1%
26230-2021-00048 1
 
0.1%
26230-2021-00031 1
 
0.1%
26230-2021-00009 1
 
0.1%
26230-2021-00010 1
 
0.1%
26230-2021-00011 1
 
0.1%
26230-2021-00014 1
 
0.1%
26230-2021-00015 1
 
0.1%
26230-2021-00016 1
 
0.1%
Other values (922) 922
98.8%
2023-12-12T18:14:15.619666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3667
27.9%
2 2931
22.3%
- 1869
14.2%
3 992
 
7.5%
6 922
 
7.0%
1 914
 
7.0%
5 500
 
3.8%
4 288
 
2.2%
9 277
 
2.1%
8 276
 
2.1%
Other values (3) 510
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11022
83.8%
Dash Punctuation 1869
 
14.2%
Other Letter 255
 
1.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3667
33.3%
2 2931
26.6%
3 992
 
9.0%
6 922
 
8.4%
1 914
 
8.3%
5 500
 
4.5%
4 288
 
2.6%
9 277
 
2.5%
8 276
 
2.5%
7 255
 
2.3%
Other Letter
ValueCountFrequency (%)
251
98.4%
4
 
1.6%
Dash Punctuation
ValueCountFrequency (%)
- 1869
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12891
98.1%
Hangul 255
 
1.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3667
28.4%
2 2931
22.7%
- 1869
14.5%
3 992
 
7.7%
6 922
 
7.2%
1 914
 
7.1%
5 500
 
3.9%
4 288
 
2.2%
9 277
 
2.1%
8 276
 
2.1%
Hangul
ValueCountFrequency (%)
251
98.4%
4
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12891
98.1%
Hangul 255
 
1.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3667
28.4%
2 2931
22.7%
- 1869
14.5%
3 992
 
7.7%
6 922
 
7.2%
1 914
 
7.1%
5 500
 
3.9%
4 288
 
2.2%
9 277
 
2.1%
8 276
 
2.1%
Hangul
ValueCountFrequency (%)
251
98.4%
4
 
1.6%
Distinct824
Distinct (%)88.3%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-12T18:14:15.934882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length11.108253
Min length6

Characters and Unicode

Total characters10364
Distinct characters375
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique740 ?
Unique (%)79.3%

Sample

1st row동원부동산중개사무소
2nd row영광부동산중개
3rd row남산부동산중개사무소
4th row유신부동산중개사무소
5th rowB.S.서면부동산중개인사무소
ValueCountFrequency (%)
주식회사 10
 
1.0%
현대공인중개사사무소 7
 
0.7%
태양공인중개사사무소 6
 
0.6%
삼성공인중개사사무소 4
 
0.4%
미래공인중개사사무소 4
 
0.4%
공인중개사사무소 4
 
0.4%
굿모닝공인중개사사무소 3
 
0.3%
하늘공인중개사사무소 3
 
0.3%
프로합동공인중개사사무소 3
 
0.3%
대성공인중개사사무소 3
 
0.3%
Other values (822) 909
95.1%
2023-12-12T18:14:16.406455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1573
15.2%
944
 
9.1%
938
 
9.1%
831
 
8.0%
829
 
8.0%
771
 
7.4%
739
 
7.1%
374
 
3.6%
354
 
3.4%
338
 
3.3%
Other values (365) 2673
25.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10207
98.5%
Uppercase Letter 45
 
0.4%
Decimal Number 40
 
0.4%
Lowercase Letter 32
 
0.3%
Space Separator 23
 
0.2%
Open Punctuation 6
 
0.1%
Close Punctuation 6
 
0.1%
Dash Punctuation 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1573
15.4%
944
 
9.2%
938
 
9.2%
831
 
8.1%
829
 
8.1%
771
 
7.6%
739
 
7.2%
374
 
3.7%
354
 
3.5%
338
 
3.3%
Other values (328) 2516
24.6%
Uppercase Letter
ValueCountFrequency (%)
T 9
20.0%
K 7
15.6%
O 5
11.1%
H 4
8.9%
S 4
8.9%
N 3
 
6.7%
E 3
 
6.7%
D 3
 
6.7%
B 3
 
6.7%
C 2
 
4.4%
Other values (2) 2
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
e 16
50.0%
h 6
 
18.8%
w 3
 
9.4%
s 1
 
3.1%
u 1
 
3.1%
p 1
 
3.1%
l 1
 
3.1%
n 1
 
3.1%
r 1
 
3.1%
a 1
 
3.1%
Decimal Number
ValueCountFrequency (%)
1 15
37.5%
8 6
 
15.0%
2 5
 
12.5%
3 4
 
10.0%
5 4
 
10.0%
6 2
 
5.0%
4 2
 
5.0%
0 1
 
2.5%
9 1
 
2.5%
Space Separator
ValueCountFrequency (%)
23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10206
98.5%
Common 79
 
0.8%
Latin 78
 
0.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1573
15.4%
944
 
9.2%
938
 
9.2%
831
 
8.1%
829
 
8.1%
771
 
7.6%
739
 
7.2%
374
 
3.7%
354
 
3.5%
338
 
3.3%
Other values (327) 2515
24.6%
Latin
ValueCountFrequency (%)
e 16
20.5%
T 9
11.5%
K 7
 
9.0%
h 6
 
7.7%
O 5
 
6.4%
H 4
 
5.1%
S 4
 
5.1%
N 3
 
3.8%
E 3
 
3.8%
w 3
 
3.8%
Other values (13) 18
23.1%
Common
ValueCountFrequency (%)
23
29.1%
1 15
19.0%
8 6
 
7.6%
( 6
 
7.6%
) 6
 
7.6%
2 5
 
6.3%
3 4
 
5.1%
5 4
 
5.1%
- 2
 
2.5%
. 2
 
2.5%
Other values (4) 6
 
7.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10206
98.5%
ASCII 156
 
1.5%
CJK 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1573
15.4%
944
 
9.2%
938
 
9.2%
831
 
8.1%
829
 
8.1%
771
 
7.6%
739
 
7.2%
374
 
3.7%
354
 
3.5%
338
 
3.3%
Other values (327) 2515
24.6%
ASCII
ValueCountFrequency (%)
23
14.7%
e 16
 
10.3%
1 15
 
9.6%
T 9
 
5.8%
K 7
 
4.5%
8 6
 
3.8%
( 6
 
3.8%
h 6
 
3.8%
) 6
 
3.8%
2 5
 
3.2%
Other values (26) 57
36.5%
CJK
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

사무소전화번호
Text

MISSING 

Distinct668
Distinct (%)97.1%
Missing245
Missing (%)26.3%
Memory size7.4 KiB
2023-12-12T18:14:16.776277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.111919
Min length9

Characters and Unicode

Total characters8333
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique650 ?
Unique (%)94.5%

Sample

1st row051-897-3343
2nd row051-803-9100
3rd row051-806-0456
4th row051-918-8898
5th row051-896-5500
ValueCountFrequency (%)
051-867-4949 3
 
0.4%
051-862-8080 3
 
0.4%
051-863-3100 3
 
0.4%
051-809-0045 2
 
0.3%
051-923-5554 2
 
0.3%
051-803-5566 2
 
0.3%
051-806-4988 2
 
0.3%
051-918-8898 2
 
0.3%
051-997-1001 2
 
0.3%
051-808-6689 2
 
0.3%
Other values (652) 665
96.7%
2023-12-12T18:14:17.277398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1583
19.0%
- 1374
16.5%
1 1106
13.3%
5 1033
12.4%
8 1023
12.3%
9 499
 
6.0%
3 344
 
4.1%
6 340
 
4.1%
7 336
 
4.0%
4 327
 
3.9%
Other values (3) 368
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6880
82.6%
Dash Punctuation 1374
 
16.5%
Space Separator 78
 
0.9%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1583
23.0%
1 1106
16.1%
5 1033
15.0%
8 1023
14.9%
9 499
 
7.3%
3 344
 
5.0%
6 340
 
4.9%
7 336
 
4.9%
4 327
 
4.8%
2 289
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 1374
100.0%
Space Separator
ValueCountFrequency (%)
78
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8333
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1583
19.0%
- 1374
16.5%
1 1106
13.3%
5 1033
12.4%
8 1023
12.3%
9 499
 
6.0%
3 344
 
4.1%
6 340
 
4.1%
7 336
 
4.0%
4 327
 
3.9%
Other values (3) 368
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8333
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1583
19.0%
- 1374
16.5%
1 1106
13.3%
5 1033
12.4%
8 1023
12.3%
9 499
 
6.0%
3 344
 
4.1%
6 340
 
4.1%
7 336
 
4.0%
4 327
 
3.9%
Other values (3) 368
 
4.4%
Distinct817
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-12T18:14:17.627057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length50
Mean length33.891747
Min length18

Characters and Unicode

Total characters31621
Distinct characters302
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique729 ?
Unique (%)78.1%

Sample

1st row부산광역시 부산진구 동평로50번길 10, 1층(당감동)
2nd row부산광역시 부산진구 서면로 1, 2층(부전동)
3rd row부산광역시 부산진구 범전로5번길 6, 2층(부전동)
4th row부산광역시 부산진구 성지로 111-1, 2층(초읍동)
5th row부산광역시 부산진구 서면로 64, 14층(부전동)
ValueCountFrequency (%)
부산광역시 933
 
18.0%
부산진구 929
 
17.9%
중앙대로 67
 
1.3%
1층(양정동 63
 
1.2%
3층(부전동 46
 
0.9%
상가동 42
 
0.8%
새싹로 42
 
0.8%
가야대로 41
 
0.8%
1층(연지동 38
 
0.7%
연수로 37
 
0.7%
Other values (1137) 2946
56.8%
2023-12-12T18:14:18.192877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4251
 
13.4%
2138
 
6.8%
1893
 
6.0%
1 1525
 
4.8%
, 1280
 
4.0%
1217
 
3.8%
966
 
3.1%
957
 
3.0%
954
 
3.0%
949
 
3.0%
Other values (292) 15491
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18973
60.0%
Decimal Number 5120
 
16.2%
Space Separator 4251
 
13.4%
Other Punctuation 1282
 
4.1%
Close Punctuation 887
 
2.8%
Open Punctuation 886
 
2.8%
Dash Punctuation 138
 
0.4%
Uppercase Letter 84
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2138
 
11.3%
1893
 
10.0%
1217
 
6.4%
966
 
5.1%
957
 
5.0%
954
 
5.0%
949
 
5.0%
944
 
5.0%
936
 
4.9%
523
 
2.8%
Other values (259) 7496
39.5%
Uppercase Letter
ValueCountFrequency (%)
B 27
32.1%
S 11
13.1%
K 6
 
7.1%
D 5
 
6.0%
A 5
 
6.0%
Y 4
 
4.8%
I 4
 
4.8%
U 4
 
4.8%
T 4
 
4.8%
H 3
 
3.6%
Other values (7) 11
13.1%
Decimal Number
ValueCountFrequency (%)
1 1525
29.8%
2 704
13.8%
0 606
 
11.8%
3 468
 
9.1%
5 393
 
7.7%
4 381
 
7.4%
6 298
 
5.8%
8 267
 
5.2%
9 248
 
4.8%
7 230
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 1280
99.8%
/ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
4251
100.0%
Close Punctuation
ValueCountFrequency (%)
) 887
100.0%
Open Punctuation
ValueCountFrequency (%)
( 886
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18973
60.0%
Common 12564
39.7%
Latin 84
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2138
 
11.3%
1893
 
10.0%
1217
 
6.4%
966
 
5.1%
957
 
5.0%
954
 
5.0%
949
 
5.0%
944
 
5.0%
936
 
4.9%
523
 
2.8%
Other values (259) 7496
39.5%
Latin
ValueCountFrequency (%)
B 27
32.1%
S 11
13.1%
K 6
 
7.1%
D 5
 
6.0%
A 5
 
6.0%
Y 4
 
4.8%
I 4
 
4.8%
U 4
 
4.8%
T 4
 
4.8%
H 3
 
3.6%
Other values (7) 11
13.1%
Common
ValueCountFrequency (%)
4251
33.8%
1 1525
 
12.1%
, 1280
 
10.2%
) 887
 
7.1%
( 886
 
7.1%
2 704
 
5.6%
0 606
 
4.8%
3 468
 
3.7%
5 393
 
3.1%
4 381
 
3.0%
Other values (6) 1183
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18973
60.0%
ASCII 12648
40.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4251
33.6%
1 1525
 
12.1%
, 1280
 
10.1%
) 887
 
7.0%
( 886
 
7.0%
2 704
 
5.6%
0 606
 
4.8%
3 468
 
3.7%
5 393
 
3.1%
4 381
 
3.0%
Other values (23) 1267
 
10.0%
Hangul
ValueCountFrequency (%)
2138
 
11.3%
1893
 
10.0%
1217
 
6.4%
966
 
5.1%
957
 
5.0%
954
 
5.0%
949
 
5.0%
944
 
5.0%
936
 
4.9%
523
 
2.8%
Other values (259) 7496
39.5%

Missing values

2023-12-12T18:14:14.467430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:14:14.606016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구등록번호사무소명사무소전화번호사무소주소(도로명)
0부산광역시 부산진구가-05-230동원부동산중개사무소051-897-3343부산광역시 부산진구 동평로50번길 10, 1층(당감동)
1부산광역시 부산진구가-05-2376영광부동산중개<NA>부산광역시 부산진구 서면로 1, 2층(부전동)
2부산광역시 부산진구나-05-86남산부동산중개사무소051-803-9100부산광역시 부산진구 범전로5번길 6, 2층(부전동)
3부산광역시 부산진구나-05-91유신부동산중개사무소051-806-0456부산광역시 부산진구 성지로 111-1, 2층(초읍동)
4부산광역시 부산진구26230-2019-00168B.S.서면부동산중개인사무소051-918-8898부산광역시 부산진구 서면로 64, 14층(부전동)
5부산광역시 부산진구나-05-242우리부동산중개사무소051-896-5500부산광역시 부산진구 백양관문로 3, 113호(개금동,주공복합상가)
6부산광역시 부산진구가-05-1833제일부동산중개사무소051-862-2240부산광역시 부산진구 연수로 41, 2층 (양정동)
7부산광역시 부산진구가-05-2576부산부동산중개사무소051-806-9875부산광역시 부산진구 새싹로28번길 23, 3층 301호(부전동)
8부산광역시 부산진구나-05-155밀양부동산중개사무소051-897-0674부산광역시 부산진구 당감서로 55, 2층(당감동)
9부산광역시 부산진구가-05-360공인중개사송재진사무소051-897-3060부산광역시 부산진구 당감로 10(당감동)
시군구등록번호사무소명사무소전화번호사무소주소(도로명)
923부산광역시 부산진구26230-2019-00188-001주식회사 오케이부동산중개법인 분사무소051-753-5545부산광역시 수영구 좌수영로83번길 14, B동(수영동)
924부산광역시 부산진구26230-2021-00160주식회사 제이앤유부동산중개법인051-861-2689부산광역시 부산진구 연수로 5, 5층(양정동)
925부산광역시 부산진구26230-2021-00172주식회사 공간부동산중개법인051-714-6033부산광역시 부산진구 동평로 416, 102호(양정동,대원플러스빌)
926부산광역시 부산진구26230-2021-00194주식회사 명가부동산중개법인<NA>부산광역시 부산진구 중앙대로 754, 8층(부전동)
927부산광역시 부산진구26230-2022-00067탑케이부동산중개 주식회사<NA>부산광역시 부산진구 거제천로 56, 401호(양정동,도운빌딩)
928부산광역시 부산진구26230-2022-00089부동산서베이부동산중개법인 주식회사051-852-2588부산광역시 부산진구 서면문화로 27, 507호(부전동,유원골든타워오피스텔)
929부산광역시 부산진구26230-2022-00127뉴신세기부동산중개법인주식회사051-714-5004부산광역시 부산진구 중앙대로990번길 8, 1층 102-1호, 102-2호(양정동, 남명씨티밸리)
930부산광역시 부산진구26230-2022-00131(주)해강부동산중개법인051-231-4032부산광역시 부산진구 동천로 4, 4069호
931부산광역시 부산진구26230-2022-00136(주)타워부동산중개법인051-809-6004부산광역시 부산진구 중앙대로666번길 50, 상가동 102-3호
932부산광역시 부산진구가-05-3626(주)금문부동산중개051-710-9500부산광역시 부산진구 서면문화로 25, 4층(부전동)