Overview

Dataset statistics

Number of variables5
Number of observations4811
Missing cells1945
Missing cells (%)8.1%
Duplicate rows61
Duplicate rows (%)1.3%
Total size in memory192.8 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description부산광역시 동래구 옥외광고물 허가관리 정보에 대한 데이터로 업소명, 표시장소(도로명주소), 광고물종류, 구분 등에 대한 정보를 제공합니다.
Author부산광역시 동래구
URLhttps://www.data.go.kr/data/15086600/fileData.do

Alerts

Dataset has 61 (1.3%) duplicate rowsDuplicates
구분 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
광고물종류 is highly overall correlated with 구분High correlation
순번 is highly overall correlated with 구분High correlation
순번 has 1943 (40.4%) missing valuesMissing

Reproduction

Analysis started2024-03-14 13:22:59.292849
Analysis finished2024-03-14 13:23:01.515124
Duration2.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2868
Distinct (%)100.0%
Missing1943
Missing (%)40.4%
Infinite0
Infinite (%)0.0%
Mean1434.5
Minimum1
Maximum2868
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size42.4 KiB
2024-03-14T22:23:01.737056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile144.35
Q1717.75
median1434.5
Q32151.25
95-th percentile2724.65
Maximum2868
Range2867
Interquartile range (IQR)1433.5

Descriptive statistics

Standard deviation828.06461
Coefficient of variation (CV)0.57724964
Kurtosis-1.2
Mean1434.5
Median Absolute Deviation (MAD)717
Skewness0
Sum4114146
Variance685691
MonotonicityStrictly increasing
2024-03-14T22:23:02.176314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1929 1
 
< 0.1%
1909 1
 
< 0.1%
1910 1
 
< 0.1%
1911 1
 
< 0.1%
1912 1
 
< 0.1%
1913 1
 
< 0.1%
1914 1
 
< 0.1%
1915 1
 
< 0.1%
1916 1
 
< 0.1%
1917 1
 
< 0.1%
Other values (2858) 2858
59.4%
(Missing) 1943
40.4%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2868 1
< 0.1%
2867 1
< 0.1%
2866 1
< 0.1%
2865 1
< 0.1%
2864 1
< 0.1%
2863 1
< 0.1%
2862 1
< 0.1%
2861 1
< 0.1%
2860 1
< 0.1%
2859 1
< 0.1%
Distinct3561
Distinct (%)74.0%
Missing1
Missing (%)< 0.1%
Memory size37.7 KiB
2024-03-14T22:23:03.185920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length34
Mean length6.9939709
Min length1

Characters and Unicode

Total characters33641
Distinct characters836
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2946 ?
Unique (%)61.2%

Sample

1st row바른몸청담한의원
2nd row바른몸청담한의원
3rd rowT world
4th row르하임스터디카페
5th row동센부동산
ValueCountFrequency (%)
미주치과 269
 
4.8%
미주치과병원 48
 
0.9%
김병준흉부외과 33
 
0.6%
세흥병원 32
 
0.6%
주)애드스토리 31
 
0.6%
cu 24
 
0.4%
광혜병원 20
 
0.4%
주)비지에프리테일 17
 
0.3%
동래점 16
 
0.3%
세계로병원 14
 
0.3%
Other values (3878) 5081
91.0%
2024-03-14T22:23:04.309383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1022
 
3.0%
816
 
2.4%
771
 
2.3%
653
 
1.9%
627
 
1.9%
560
 
1.7%
555
 
1.6%
548
 
1.6%
500
 
1.5%
485
 
1.4%
Other values (826) 27104
80.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29822
88.6%
Uppercase Letter 1364
 
4.1%
Space Separator 816
 
2.4%
Lowercase Letter 471
 
1.4%
Decimal Number 435
 
1.3%
Close Punctuation 303
 
0.9%
Open Punctuation 301
 
0.9%
Other Punctuation 75
 
0.2%
Dash Punctuation 38
 
0.1%
Math Symbol 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1022
 
3.4%
771
 
2.6%
653
 
2.2%
627
 
2.1%
560
 
1.9%
555
 
1.9%
548
 
1.8%
500
 
1.7%
485
 
1.6%
480
 
1.6%
Other values (750) 23621
79.2%
Uppercase Letter
ValueCountFrequency (%)
S 156
 
11.4%
E 111
 
8.1%
G 91
 
6.7%
K 88
 
6.5%
C 83
 
6.1%
T 82
 
6.0%
O 76
 
5.6%
P 72
 
5.3%
B 70
 
5.1%
N 67
 
4.9%
Other values (16) 468
34.3%
Lowercase Letter
ValueCountFrequency (%)
e 60
12.7%
a 41
 
8.7%
o 38
 
8.1%
s 38
 
8.1%
c 32
 
6.8%
i 30
 
6.4%
t 30
 
6.4%
l 28
 
5.9%
n 25
 
5.3%
h 23
 
4.9%
Other values (14) 126
26.8%
Decimal Number
ValueCountFrequency (%)
2 122
28.0%
5 64
14.7%
1 54
12.4%
4 51
11.7%
0 46
 
10.6%
7 28
 
6.4%
3 26
 
6.0%
8 17
 
3.9%
6 14
 
3.2%
9 13
 
3.0%
Other Punctuation
ValueCountFrequency (%)
& 27
36.0%
. 25
33.3%
# 6
 
8.0%
' 5
 
6.7%
/ 5
 
6.7%
? 4
 
5.3%
! 3
 
4.0%
Math Symbol
ValueCountFrequency (%)
> 9
60.0%
~ 2
 
13.3%
2
 
13.3%
+ 2
 
13.3%
Space Separator
ValueCountFrequency (%)
816
100.0%
Close Punctuation
ValueCountFrequency (%)
) 303
100.0%
Open Punctuation
ValueCountFrequency (%)
( 301
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29818
88.6%
Common 1983
 
5.9%
Latin 1836
 
5.5%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1022
 
3.4%
771
 
2.6%
653
 
2.2%
627
 
2.1%
560
 
1.9%
555
 
1.9%
548
 
1.8%
500
 
1.7%
485
 
1.6%
480
 
1.6%
Other values (746) 23617
79.2%
Latin
ValueCountFrequency (%)
S 156
 
8.5%
E 111
 
6.0%
G 91
 
5.0%
K 88
 
4.8%
C 83
 
4.5%
T 82
 
4.5%
O 76
 
4.1%
P 72
 
3.9%
B 70
 
3.8%
N 67
 
3.6%
Other values (41) 940
51.2%
Common
ValueCountFrequency (%)
816
41.1%
) 303
 
15.3%
( 301
 
15.2%
2 122
 
6.2%
5 64
 
3.2%
1 54
 
2.7%
4 51
 
2.6%
0 46
 
2.3%
- 38
 
1.9%
7 28
 
1.4%
Other values (15) 160
 
8.1%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29818
88.6%
ASCII 3816
 
11.3%
CJK 4
 
< 0.1%
Arrows 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1022
 
3.4%
771
 
2.6%
653
 
2.2%
627
 
2.1%
560
 
1.9%
555
 
1.9%
548
 
1.8%
500
 
1.7%
485
 
1.6%
480
 
1.6%
Other values (746) 23617
79.2%
ASCII
ValueCountFrequency (%)
816
21.4%
) 303
 
7.9%
( 301
 
7.9%
S 156
 
4.1%
2 122
 
3.2%
E 111
 
2.9%
G 91
 
2.4%
K 88
 
2.3%
C 83
 
2.2%
T 82
 
2.1%
Other values (64) 1663
43.6%
Arrows
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct2945
Distinct (%)61.2%
Missing1
Missing (%)< 0.1%
Memory size37.7 KiB
2024-03-14T22:23:05.300390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length54
Mean length26.694179
Min length1

Characters and Unicode

Total characters128399
Distinct characters362
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2078 ?
Unique (%)43.2%

Sample

1st row부산광역시 동래구 사직북로 24 가람빌딩 3층 (사직동)
2nd row부산광역시 동래구 사직북로 24 가람빌딩 3층 (사직동)
3rd row부산광역시 동래구 명장로20번길 120-1 (안락동)
4th row부산광역시 동래구 충렬대로 207 (명륜동)
5th row부산광역시 동래구 명륜로 208-1 (명륜동)
ValueCountFrequency (%)
부산광역시 4638
18.8%
동래구 4607
18.7%
온천동 1123
 
4.6%
사직동 778
 
3.2%
안락동 708
 
2.9%
충렬대로 533
 
2.2%
명륜동 523
 
2.1%
수안동 328
 
1.3%
명장동 270
 
1.1%
아시아드대로 217
 
0.9%
Other values (1949) 10936
44.3%
2024-03-14T22:23:06.588025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21344
 
16.6%
9645
 
7.5%
5201
 
4.1%
4819
 
3.8%
( 4686
 
3.6%
) 4686
 
3.6%
4682
 
3.6%
4682
 
3.6%
4659
 
3.6%
4657
 
3.6%
Other values (352) 59338
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 79463
61.9%
Space Separator 21344
 
16.6%
Decimal Number 17372
 
13.5%
Open Punctuation 4686
 
3.6%
Close Punctuation 4686
 
3.6%
Dash Punctuation 693
 
0.5%
Uppercase Letter 120
 
0.1%
Lowercase Letter 27
 
< 0.1%
Other Punctuation 6
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9645
 
12.1%
5201
 
6.5%
4819
 
6.1%
4682
 
5.9%
4682
 
5.9%
4659
 
5.9%
4657
 
5.9%
4642
 
5.8%
4631
 
5.8%
2071
 
2.6%
Other values (308) 29774
37.5%
Uppercase Letter
ValueCountFrequency (%)
K 23
19.2%
S 22
18.3%
B 21
17.5%
H 13
10.8%
U 8
 
6.7%
R 5
 
4.2%
Y 5
 
4.2%
G 5
 
4.2%
T 4
 
3.3%
L 3
 
2.5%
Other values (8) 11
9.2%
Decimal Number
ValueCountFrequency (%)
1 4001
23.0%
2 2602
15.0%
3 1977
11.4%
4 1594
 
9.2%
5 1434
 
8.3%
7 1212
 
7.0%
9 1210
 
7.0%
0 1184
 
6.8%
8 1146
 
6.6%
6 1012
 
5.8%
Lowercase Letter
ValueCountFrequency (%)
b 10
37.0%
o 3
 
11.1%
l 3
 
11.1%
v 3
 
11.1%
e 3
 
11.1%
i 3
 
11.1%
k 1
 
3.7%
t 1
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 3
50.0%
· 3
50.0%
Space Separator
ValueCountFrequency (%)
21344
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4686
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4686
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 693
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 79463
61.9%
Common 48788
38.0%
Latin 148
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9645
 
12.1%
5201
 
6.5%
4819
 
6.1%
4682
 
5.9%
4682
 
5.9%
4659
 
5.9%
4657
 
5.9%
4642
 
5.8%
4631
 
5.8%
2071
 
2.6%
Other values (308) 29774
37.5%
Latin
ValueCountFrequency (%)
K 23
15.5%
S 22
14.9%
B 21
14.2%
H 13
8.8%
b 10
 
6.8%
U 8
 
5.4%
R 5
 
3.4%
Y 5
 
3.4%
G 5
 
3.4%
T 4
 
2.7%
Other values (17) 32
21.6%
Common
ValueCountFrequency (%)
21344
43.7%
( 4686
 
9.6%
) 4686
 
9.6%
1 4001
 
8.2%
2 2602
 
5.3%
3 1977
 
4.1%
4 1594
 
3.3%
5 1434
 
2.9%
7 1212
 
2.5%
9 1210
 
2.5%
Other values (7) 4042
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 79463
61.9%
ASCII 48932
38.1%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21344
43.6%
( 4686
 
9.6%
) 4686
 
9.6%
1 4001
 
8.2%
2 2602
 
5.3%
3 1977
 
4.0%
4 1594
 
3.3%
5 1434
 
2.9%
7 1212
 
2.5%
9 1210
 
2.5%
Other values (32) 4186
 
8.6%
Hangul
ValueCountFrequency (%)
9645
 
12.1%
5201
 
6.5%
4819
 
6.1%
4682
 
5.9%
4682
 
5.9%
4659
 
5.9%
4657
 
5.9%
4642
 
5.8%
4631
 
5.8%
2071
 
2.6%
Other values (308) 29774
37.5%
None
ValueCountFrequency (%)
· 3
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

광고물종류
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size37.7 KiB
돌출간판
2907 
벽면이용간판(가로형)
696 
공공시설물이용 광고물
354 
가로형간판_입체형
329 
교통수단이용 광고물
 
275
Other values (7)
 
250

Length

Max length11
Median length4
Mean length6.3610476
Min length4

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row돌출간판
2nd row가로형간판_입체형
3rd row돌출간판
4th row돌출간판
5th row돌출간판

Common Values

ValueCountFrequency (%)
돌출간판 2907
60.4%
벽면이용간판(가로형) 696
 
14.5%
공공시설물이용 광고물 354
 
7.4%
가로형간판_입체형 329
 
6.8%
교통수단이용 광고물 275
 
5.7%
지주이용 간판 171
 
3.6%
옥상간판 44
 
0.9%
벽면이용간판(세로형) 25
 
0.5%
현수막게시틀 7
 
0.1%
교통시설이용 광고물 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Length

2024-03-14T22:23:07.088579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
돌출간판 2907
51.8%
벽면이용간판(가로형 696
 
12.4%
광고물 630
 
11.2%
공공시설물이용 354
 
6.3%
가로형간판_입체형 329
 
5.9%
교통수단이용 275
 
4.9%
지주이용 171
 
3.0%
간판 171
 
3.0%
옥상간판 44
 
0.8%
벽면이용간판(세로형 25
 
0.4%
Other values (4) 10
 
0.2%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size37.7 KiB
허가
2868 
<NA>
1943 

Length

Max length4
Median length2
Mean length2.8077323
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row허가
2nd row허가
3rd row허가
4th row허가
5th row허가

Common Values

ValueCountFrequency (%)
허가 2868
59.6%
<NA> 1943
40.4%

Length

2024-03-14T22:23:07.521542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T22:23:07.859684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
허가 2868
59.6%
na 1943
40.4%

Interactions

2024-03-14T22:23:00.364782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T22:23:08.036286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번광고물종류
순번1.0000.356
광고물종류0.3561.000
2024-03-14T22:23:08.250509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분광고물종류
구분1.0001.000
광고물종류1.0001.000
2024-03-14T22:23:08.397356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번광고물종류구분
순번1.0000.1161.000
광고물종류0.1161.0001.000
구분1.0001.0001.000

Missing values

2024-03-14T22:23:00.711595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T22:23:01.026121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T22:23:01.334619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번업소명표시장소_도로명광고물종류구분
01바른몸청담한의원부산광역시 동래구 사직북로 24 가람빌딩 3층 (사직동)돌출간판허가
12바른몸청담한의원부산광역시 동래구 사직북로 24 가람빌딩 3층 (사직동)가로형간판_입체형허가
23T world부산광역시 동래구 명장로20번길 120-1 (안락동)돌출간판허가
34르하임스터디카페부산광역시 동래구 충렬대로 207 (명륜동)돌출간판허가
45동센부동산부산광역시 동래구 명륜로 208-1 (명륜동)돌출간판허가
56링구아교육컨설팅부산광역시 동래구 충렬대로237번길 74 2층 (복천동)가로형간판_입체형허가
67링구아틴즈부산광역시 동래구 충렬대로237번길 74 (복천동)돌출간판허가
78링구아교육부산광역시 동래구 충렬대로237번길 74 4층 (복천동)가로형간판_입체형허가
89링구아어학원부산광역시 동래구 충렬대로237번길 74 3층 (복천동)가로형간판_입체형허가
910링구아어학원부산광역시 동래구 충렬대로237번길 74 (복천동)돌출간판허가
순번업소명표시장소_도로명광고물종류구분
4801<NA>세정한의원부산광역시 동래구 충렬대로108번길 11 (온천동)벽면이용간판(가로형)<NA>
4802<NA>금강 골프연습장부산광역시 동래구 금강로 67 (온천동)돌출간판<NA>
4803<NA>정휘트니스헬스부산광역시 동래구 충렬대로 259 (낙민동)돌출간판<NA>
4804<NA>하늘노래연습장부산광역시 동래구 사직북로28번길 201 (온천동)돌출간판<NA>
4805<NA>동영 태권도부산광역시 동래구 여고북로 199 (온천동)돌출간판<NA>
4806<NA>SK부산광역시 동래구 여고로 131 (사직동)지주이용 간판<NA>
4807<NA>세금장 모텔부산광역시 동래구 금강로 125 (온천동)돌출간판<NA>
4808<NA>미남 복 매운탕부산광역시 동래구 아시아드대로247번길 8 (온천동)돌출간판<NA>
4809<NA>롯데수퍼마부산광역시 동래구 여고북로 8 (사직동)돌출간판<NA>
4810<NA>금강놀이방부산광역시 동래구 사직북로28번길 152 (온천동)돌출간판<NA>

Duplicate rows

Most frequently occurring

순번업소명표시장소_도로명광고물종류구분# duplicates
17<NA>미주치과-공공시설물이용 광고물<NA>118
18<NA>미주치과부산광역시 동래구 사직북로5번길 49 온누리택시 (사직동)교통수단이용 광고물<NA>111
19<NA>미주치과부산광역시 동래구 연안로81번길 71 대륙교통 (안락동)교통수단이용 광고물<NA>40
34<NA>세흥병원부산광역시 동래구 복천로 133 (명장동)교통수단이용 광고물<NA>30
25<NA>삼원기업부산광역시 금정구 금샘로 431 (구서동)공공시설물이용 광고물<NA>4
40<NA>우리가족한의원-공공시설물이용 광고물<NA>4
29<NA>세계로병원부산광역시 동래구 명서로 114 (주)성신여객 (명장동)교통수단이용 광고물<NA>3
30<NA>세계로병원부산광역시 동래구 미남로 58 학성여객 (사직동)교통수단이용 광고물<NA>3
32<NA>세계로병원부산광역시 동래구 쇠미로17번길 16 (주)신아교통 (사직동)교통수단이용 광고물<NA>3
53<NA>코코모텔부산광역시 동래구 명륜로112번가길 6 (명륜동)가로형간판_입체형<NA>3