Overview

Dataset statistics

Number of variables6
Number of observations96
Missing cells140
Missing cells (%)24.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory50.4 B

Variable types

Text5
Unsupported1

Dataset

Description부산광역시_동래구_옥외광고업현황_20230816
Author부산광역시 동래구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3079382

Alerts

업소명 has 1 (1.0%) missing valuesMissing
대표자명 has 1 (1.0%) missing valuesMissing
영업장전화번호 has 40 (41.7%) missing valuesMissing
영업장도로명주소 has 1 (1.0%) missing valuesMissing
영업내용 has 1 (1.0%) missing valuesMissing
Unnamed: 5 has 96 (100.0%) missing valuesMissing
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 17:12:47.837635
Analysis finished2023-12-10 17:12:50.384223
Duration2.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

MISSING 

Distinct95
Distinct (%)100.0%
Missing1
Missing (%)1.0%
Memory size900.0 B
2023-12-11T02:12:50.721957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length5.3157895
Min length2

Characters and Unicode

Total characters505
Distinct characters155
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row대신애드
2nd row주식회사 써머트리
3rd row현디자인
4th row성진광고
5th row맥스리뷰
ValueCountFrequency (%)
주식회사 3
 
2.9%
디자인 2
 
1.9%
하나광고 1
 
1.0%
shop 1
 
1.0%
루미led 1
 
1.0%
아크릴메이커 1
 
1.0%
곰디자인 1
 
1.0%
한양광고 1
 
1.0%
불새기획 1
 
1.0%
우리광고 1
 
1.0%
Other values (92) 92
87.6%
2023-12-11T02:12:51.712171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
6.3%
31
 
6.1%
19
 
3.8%
19
 
3.8%
18
 
3.6%
17
 
3.4%
16
 
3.2%
16
 
3.2%
10
 
2.0%
10
 
2.0%
Other values (145) 317
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
88.5%
Uppercase Letter 25
 
5.0%
Space Separator 10
 
2.0%
Open Punctuation 9
 
1.8%
Close Punctuation 9
 
1.8%
Lowercase Letter 4
 
0.8%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
7.2%
31
 
6.9%
19
 
4.3%
19
 
4.3%
18
 
4.0%
17
 
3.8%
16
 
3.6%
16
 
3.6%
10
 
2.2%
10
 
2.2%
Other values (124) 259
57.9%
Uppercase Letter
ValueCountFrequency (%)
D 4
16.0%
E 3
12.0%
N 3
12.0%
I 2
8.0%
P 2
8.0%
S 2
8.0%
G 2
8.0%
L 2
8.0%
J 1
 
4.0%
A 1
 
4.0%
Other values (3) 3
12.0%
Lowercase Letter
ValueCountFrequency (%)
p 1
25.0%
o 1
25.0%
h 1
25.0%
s 1
25.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
88.5%
Common 29
 
5.7%
Latin 29
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
7.2%
31
 
6.9%
19
 
4.3%
19
 
4.3%
18
 
4.0%
17
 
3.8%
16
 
3.6%
16
 
3.6%
10
 
2.2%
10
 
2.2%
Other values (124) 259
57.9%
Latin
ValueCountFrequency (%)
D 4
13.8%
E 3
10.3%
N 3
10.3%
I 2
 
6.9%
P 2
 
6.9%
S 2
 
6.9%
G 2
 
6.9%
L 2
 
6.9%
J 1
 
3.4%
p 1
 
3.4%
Other values (7) 7
24.1%
Common
ValueCountFrequency (%)
10
34.5%
( 9
31.0%
) 9
31.0%
: 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
88.5%
ASCII 58
 
11.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
7.2%
31
 
6.9%
19
 
4.3%
19
 
4.3%
18
 
4.0%
17
 
3.8%
16
 
3.6%
16
 
3.6%
10
 
2.2%
10
 
2.2%
Other values (124) 259
57.9%
ASCII
ValueCountFrequency (%)
10
17.2%
( 9
15.5%
) 9
15.5%
D 4
 
6.9%
E 3
 
5.2%
N 3
 
5.2%
I 2
 
3.4%
P 2
 
3.4%
S 2
 
3.4%
G 2
 
3.4%
Other values (11) 12
20.7%

대표자명
Text

MISSING 

Distinct94
Distinct (%)98.9%
Missing1
Missing (%)1.0%
Memory size900.0 B
2023-12-11T02:12:52.386016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9789474
Min length2

Characters and Unicode

Total characters283
Distinct characters108
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)97.9%

Sample

1st row채대수
2nd row김선영
3rd row김태윤
4th row최윤성
5th row김성희
ValueCountFrequency (%)
이종진 2
 
2.1%
안용배 1
 
1.1%
서규희 1
 
1.1%
이수진 1
 
1.1%
이춘식 1
 
1.1%
김재규 1
 
1.1%
이상문 1
 
1.1%
홍석열 1
 
1.1%
차장길 1
 
1.1%
배용국 1
 
1.1%
Other values (84) 84
88.4%
2023-12-11T02:12:54.725763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
8.8%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (98) 191
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 283
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
8.8%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (98) 191
67.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 283
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
8.8%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (98) 191
67.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 283
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
8.8%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (98) 191
67.5%

영업장전화번호
Text

MISSING 

Distinct56
Distinct (%)100.0%
Missing40
Missing (%)41.7%
Memory size900.0 B
2023-12-11T02:12:55.671623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.875
Min length12

Characters and Unicode

Total characters777
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)100.0%

Sample

1st row051 -638 -0777
2nd row051 -5532-658
3rd row051 -333 -7670
4th row051 -990 -1010
5th row051 -503 -2394
ValueCountFrequency (%)
051 55
34.2%
505 4
 
2.5%
503 4
 
2.5%
553 3
 
1.9%
502 3
 
1.9%
554 3
 
1.9%
558 3
 
1.9%
501 3
 
1.9%
557 2
 
1.2%
528 2
 
1.2%
Other values (76) 79
49.1%
2023-12-11T02:12:57.267244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 145
18.7%
- 112
14.4%
105
13.5%
0 100
12.9%
1 85
10.9%
3 53
 
6.8%
2 44
 
5.7%
7 34
 
4.4%
4 30
 
3.9%
6 26
 
3.3%
Other values (2) 43
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 560
72.1%
Dash Punctuation 112
 
14.4%
Space Separator 105
 
13.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 145
25.9%
0 100
17.9%
1 85
15.2%
3 53
 
9.5%
2 44
 
7.9%
7 34
 
6.1%
4 30
 
5.4%
6 26
 
4.6%
8 26
 
4.6%
9 17
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%
Space Separator
ValueCountFrequency (%)
105
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 777
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 145
18.7%
- 112
14.4%
105
13.5%
0 100
12.9%
1 85
10.9%
3 53
 
6.8%
2 44
 
5.7%
7 34
 
4.4%
4 30
 
3.9%
6 26
 
3.3%
Other values (2) 43
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 777
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 145
18.7%
- 112
14.4%
105
13.5%
0 100
12.9%
1 85
10.9%
3 53
 
6.8%
2 44
 
5.7%
7 34
 
4.4%
4 30
 
3.9%
6 26
 
3.3%
Other values (2) 43
 
5.5%
Distinct92
Distinct (%)96.8%
Missing1
Missing (%)1.0%
Memory size900.0 B
2023-12-11T02:12:58.349445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length28.789474
Min length22

Characters and Unicode

Total characters2735
Distinct characters103
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)93.7%

Sample

1st row부산광역시 동래구 명안로17번길 14-1 (안락동)
2nd row부산광역시 동래구 석사로 21, 11층 1106호 (사직동, 오름파크)
3rd row부산광역시 동래구 명장로 65, 109동 2302호 (명장동, e편한세상 동래명장)
4th row부산광역시 동래구 명륜로112번길 68 (수안동)
5th row부산광역시 동래구 충렬대로 144 (온천동)
ValueCountFrequency (%)
부산광역시 95
 
17.8%
동래구 95
 
17.8%
사직동 25
 
4.7%
안락동 24
 
4.5%
온천동 14
 
2.6%
명장동 12
 
2.3%
1층 8
 
1.5%
수안동 8
 
1.5%
2층 6
 
1.1%
명륜동 6
 
1.1%
Other values (174) 240
45.0%
2023-12-11T02:13:00.126689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
441
 
16.1%
201
 
7.3%
1 101
 
3.7%
98
 
3.6%
98
 
3.6%
96
 
3.5%
95
 
3.5%
) 95
 
3.5%
( 95
 
3.5%
95
 
3.5%
Other values (93) 1320
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1603
58.6%
Space Separator 441
 
16.1%
Decimal Number 435
 
15.9%
Close Punctuation 95
 
3.5%
Open Punctuation 95
 
3.5%
Other Punctuation 42
 
1.5%
Dash Punctuation 20
 
0.7%
Uppercase Letter 3
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
12.5%
98
 
6.1%
98
 
6.1%
96
 
6.0%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
49
 
3.1%
Other values (76) 586
36.6%
Decimal Number
ValueCountFrequency (%)
1 101
23.2%
2 75
17.2%
3 46
10.6%
4 45
10.3%
5 39
 
9.0%
6 32
 
7.4%
8 29
 
6.7%
9 25
 
5.7%
0 24
 
5.5%
7 19
 
4.4%
Space Separator
ValueCountFrequency (%)
441
100.0%
Close Punctuation
ValueCountFrequency (%)
) 95
100.0%
Open Punctuation
ValueCountFrequency (%)
( 95
100.0%
Other Punctuation
ValueCountFrequency (%)
, 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1603
58.6%
Common 1128
41.2%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
12.5%
98
 
6.1%
98
 
6.1%
96
 
6.0%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
49
 
3.1%
Other values (76) 586
36.6%
Common
ValueCountFrequency (%)
441
39.1%
1 101
 
9.0%
) 95
 
8.4%
( 95
 
8.4%
2 75
 
6.6%
3 46
 
4.1%
4 45
 
4.0%
, 42
 
3.7%
5 39
 
3.5%
6 32
 
2.8%
Other values (5) 117
 
10.4%
Latin
ValueCountFrequency (%)
A 3
75.0%
e 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1603
58.6%
ASCII 1132
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
441
39.0%
1 101
 
8.9%
) 95
 
8.4%
( 95
 
8.4%
2 75
 
6.6%
3 46
 
4.1%
4 45
 
4.0%
, 42
 
3.7%
5 39
 
3.4%
6 32
 
2.8%
Other values (7) 121
 
10.7%
Hangul
ValueCountFrequency (%)
201
 
12.5%
98
 
6.1%
98
 
6.1%
96
 
6.0%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
95
 
5.9%
49
 
3.1%
Other values (76) 586
36.6%

영업내용
Text

MISSING 

Distinct51
Distinct (%)53.7%
Missing1
Missing (%)1.0%
Memory size900.0 B
2023-12-11T02:13:01.023771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length8.3368421
Min length2

Characters and Unicode

Total characters792
Distinct characters60
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)38.9%

Sample

1st row광고물, 간판, 현수막 제작
2nd row광고디자인, 광고대행
3rd row광고물디자인,설치
4th row간판제작, 인테리어
5th row명함, 전단제작
ValueCountFrequency (%)
제작 24
 
13.0%
광고물제작 16
 
8.6%
간판 13
 
7.0%
13
 
7.0%
광고물 11
 
5.9%
옥외광고물등제작 11
 
5.9%
간판제작 9
 
4.9%
옥외광고물 8
 
4.3%
옥외광고물제작 7
 
3.8%
설치 5
 
2.7%
Other values (46) 68
36.8%
2023-12-11T02:13:01.821221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
11.4%
82
10.4%
81
10.2%
77
9.7%
75
9.5%
63
 
8.0%
42
 
5.3%
41
 
5.2%
29
 
3.7%
26
 
3.3%
Other values (50) 186
23.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 673
85.0%
Space Separator 90
 
11.4%
Other Punctuation 21
 
2.7%
Uppercase Letter 6
 
0.8%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
12.2%
81
12.0%
77
11.4%
75
11.1%
63
9.4%
42
 
6.2%
41
 
6.1%
29
 
4.3%
26
 
3.9%
16
 
2.4%
Other values (43) 141
21.0%
Uppercase Letter
ValueCountFrequency (%)
L 2
33.3%
D 2
33.3%
E 2
33.3%
Space Separator
ValueCountFrequency (%)
90
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 673
85.0%
Common 113
 
14.3%
Latin 6
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
12.2%
81
12.0%
77
11.4%
75
11.1%
63
9.4%
42
 
6.2%
41
 
6.1%
29
 
4.3%
26
 
3.9%
16
 
2.4%
Other values (43) 141
21.0%
Common
ValueCountFrequency (%)
90
79.6%
, 21
 
18.6%
) 1
 
0.9%
( 1
 
0.9%
Latin
ValueCountFrequency (%)
L 2
33.3%
D 2
33.3%
E 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 673
85.0%
ASCII 119
 
15.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
90
75.6%
, 21
 
17.6%
L 2
 
1.7%
D 2
 
1.7%
E 2
 
1.7%
) 1
 
0.8%
( 1
 
0.8%
Hangul
ValueCountFrequency (%)
82
12.2%
81
12.0%
77
11.4%
75
11.1%
63
9.4%
42
 
6.2%
41
 
6.1%
29
 
4.3%
26
 
3.9%
16
 
2.4%
Other values (43) 141
21.0%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing96
Missing (%)100.0%
Memory size996.0 B

Correlations

2023-12-11T02:13:02.052597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명대표자명영업장전화번호영업장도로명주소영업내용
업소명1.0001.0001.0001.0001.000
대표자명1.0001.0001.0000.9970.999
영업장전화번호1.0001.0001.0001.0001.000
영업장도로명주소1.0000.9971.0001.0000.735
영업내용1.0000.9991.0000.7351.000

Missing values

2023-12-11T02:12:49.783291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:12:50.020635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T02:12:50.237313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업소명대표자명영업장전화번호영업장도로명주소영업내용Unnamed: 5
0대신애드채대수<NA>부산광역시 동래구 명안로17번길 14-1 (안락동)광고물, 간판, 현수막 제작<NA>
1주식회사 써머트리김선영051 -638 -0777부산광역시 동래구 석사로 21, 11층 1106호 (사직동, 오름파크)광고디자인, 광고대행<NA>
2현디자인김태윤<NA>부산광역시 동래구 명장로 65, 109동 2302호 (명장동, e편한세상 동래명장)광고물디자인,설치<NA>
3성진광고최윤성051 -5532-658부산광역시 동래구 명륜로112번길 68 (수안동)간판제작, 인테리어<NA>
4맥스리뷰김성희<NA>부산광역시 동래구 충렬대로 144 (온천동)명함, 전단제작<NA>
5천우최상노051 -333 -7670부산광역시 동래구 명장로63번길 124 (명장동)광고물 제작<NA>
6디펜더스광고채송묵<NA>부산광역시 동래구 명륜로 238 (명륜동)옥외광고물제작,설치<NA>
7(주)엠쓰리미디어곽병익051 -990 -1010부산광역시 동래구 여고로 1, 삼주빌딩 101호 (사직동)옥외광고물 표시, 대행<NA>
8단아디자인김연서<NA>부산광역시 동래구 사직북로34번길 13, 1층 (사직동)옥내외 사인물 디자인 및 제작<NA>
9옥외광고사김태한<NA>부산광역시 동래구 명장로22번길 56 (명장동)옥외광고<NA>
업소명대표자명영업장전화번호영업장도로명주소영업내용Unnamed: 5
86미성애드이승기051 -502 -5654부산광역시 동래구 여고로 66 (사직동)옥외광고물제작<NA>
87성예사박봉명051 -558 -4232부산광역시 동래구 충렬대로237번길 72 (복천동)광고물제작<NA>
88금영광고사우황구051 -528 -0484부산광역시 동래구 명장로 86 (명장동)옥외광고물등제작<NA>
89장원애드산업차판수051 -529 -1909부산광역시 동래구 안락로 90 (안락동)광고물제작<NA>
90영진광고황원규051 -523 -0363부산광역시 동래구 반송로 284 (명장동)옥외 광고물제작<NA>
91대진광고문대식051 -554 -8182부산광역시 동래구 충렬대로237번길 120-4 (명륜동)옥외광고물등제작<NA>
92신라광고최경완051 -552 -2323부산광역시 동래구 중앙대로 1495-1 (온천동)광고물제작<NA>
93(주)지앤비조기범051- 851-1515부산광역시 동래구 여고로 50 (사직동)광고물 제작 및 광고 대행<NA>
94세움광고김동진<NA>부산광역시 동래구 아시아드대로161번길 38 (사직동,1층)간판제작 및 시공<NA>
95<NA><NA><NA><NA><NA><NA>