Overview

Dataset statistics

Number of variables5
Number of observations65
Missing cells21
Missing cells (%)6.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory43.0 B

Variable types

Numeric1
Text4

Dataset

Description부산광역시남구_옥외광고사업자현황_20220412
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15055984

Alerts

영업장전화번호 has 21 (32.3%) missing valuesMissing
순번 has unique valuesUnique
성명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:43:19.622151
Analysis finished2023-12-10 17:43:21.210238
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33
Minimum1
Maximum65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-11T02:43:21.356563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.2
Q117
median33
Q349
95-th percentile61.8
Maximum65
Range64
Interquartile range (IQR)32

Descriptive statistics

Standard deviation18.90767
Coefficient of variation (CV)0.57295971
Kurtosis-1.2
Mean33
Median Absolute Deviation (MAD)16
Skewness0
Sum2145
Variance357.5
MonotonicityStrictly increasing
2023-12-11T02:43:21.623482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
50 1
 
1.5%
36 1
 
1.5%
37 1
 
1.5%
38 1
 
1.5%
39 1
 
1.5%
40 1
 
1.5%
41 1
 
1.5%
42 1
 
1.5%
43 1
 
1.5%
Other values (55) 55
84.6%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%
58 1
1.5%
57 1
1.5%
56 1
1.5%
Distinct64
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-11T02:43:22.092996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length5.4153846
Min length2

Characters and Unicode

Total characters352
Distinct characters125
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)96.9%

Sample

1st row지음
2nd row거성광고
3rd row플로스
4th row우리디자인
5th row유진테크
ValueCountFrequency (%)
거성광고 2
 
2.8%
광고기획 2
 
2.8%
아이앤에스 1
 
1.4%
디자인 1
 
1.4%
돼랑 1
 
1.4%
서린 1
 
1.4%
종합광고 1
 
1.4%
해인디자인 1
 
1.4%
제일애드 1
 
1.4%
가람광고기획 1
 
1.4%
Other values (59) 59
83.1%
2023-12-11T02:43:22.770155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
7.4%
26
 
7.4%
18
 
5.1%
16
 
4.5%
14
 
4.0%
13
 
3.7%
13
 
3.7%
11
 
3.1%
11
 
3.1%
7
 
2.0%
Other values (115) 197
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 323
91.8%
Uppercase Letter 10
 
2.8%
Open Punctuation 6
 
1.7%
Close Punctuation 6
 
1.7%
Space Separator 6
 
1.7%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
8.0%
26
 
8.0%
18
 
5.6%
16
 
5.0%
14
 
4.3%
13
 
4.0%
13
 
4.0%
11
 
3.4%
11
 
3.4%
7
 
2.2%
Other values (103) 168
52.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
20.0%
A 2
20.0%
P 1
10.0%
U 1
10.0%
E 1
10.0%
L 1
10.0%
S 1
10.0%
M 1
10.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 323
91.8%
Common 19
 
5.4%
Latin 10
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
8.0%
26
 
8.0%
18
 
5.6%
16
 
5.0%
14
 
4.3%
13
 
4.0%
13
 
4.0%
11
 
3.4%
11
 
3.4%
7
 
2.2%
Other values (103) 168
52.0%
Latin
ValueCountFrequency (%)
C 2
20.0%
A 2
20.0%
P 1
10.0%
U 1
10.0%
E 1
10.0%
L 1
10.0%
S 1
10.0%
M 1
10.0%
Common
ValueCountFrequency (%)
( 6
31.6%
) 6
31.6%
6
31.6%
, 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 323
91.8%
ASCII 29
 
8.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
8.0%
26
 
8.0%
18
 
5.6%
16
 
5.0%
14
 
4.3%
13
 
4.0%
13
 
4.0%
11
 
3.4%
11
 
3.4%
7
 
2.2%
Other values (103) 168
52.0%
ASCII
ValueCountFrequency (%)
( 6
20.7%
) 6
20.7%
6
20.7%
C 2
 
6.9%
A 2
 
6.9%
P 1
 
3.4%
U 1
 
3.4%
E 1
 
3.4%
L 1
 
3.4%
S 1
 
3.4%
Other values (2) 2
 
6.9%

성명
Text

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-11T02:43:23.250161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9538462
Min length2

Characters and Unicode

Total characters192
Distinct characters81
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)100.0%

Sample

1st row박경태
2nd row안성민
3rd row문지영
4th row이지수
5th row장태진
ValueCountFrequency (%)
박경태 1
 
1.5%
이상수 1
 
1.5%
황선호 1
 
1.5%
이헌운 1
 
1.5%
구본열 1
 
1.5%
노기석 1
 
1.5%
김태오 1
 
1.5%
김수민 1
 
1.5%
김춘좌 1
 
1.5%
강남석 1
 
1.5%
Other values (55) 55
84.6%
2023-12-11T02:43:23.913294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
10.9%
10
 
5.2%
9
 
4.7%
8
 
4.2%
7
 
3.6%
6
 
3.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (71) 115
59.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 192
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
10.9%
10
 
5.2%
9
 
4.7%
8
 
4.2%
7
 
3.6%
6
 
3.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (71) 115
59.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 192
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
10.9%
10
 
5.2%
9
 
4.7%
8
 
4.2%
7
 
3.6%
6
 
3.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (71) 115
59.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 192
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
 
10.9%
10
 
5.2%
9
 
4.7%
8
 
4.2%
7
 
3.6%
6
 
3.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (71) 115
59.9%

영업장전화번호
Text

MISSING 

Distinct44
Distinct (%)100.0%
Missing21
Missing (%)32.3%
Memory size652.0 B
2023-12-11T02:43:24.290228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.840909
Min length12

Characters and Unicode

Total characters609
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row051 -919 -8567
2nd row051 -742 -0143
3rd row051-817-9114
4th row051 -621 -3800
5th row051 -642 -0090
ValueCountFrequency (%)
051 41
32.8%
626 5
 
4.0%
628 5
 
4.0%
633 3
 
2.4%
611 2
 
1.6%
634 2
 
1.6%
621 2
 
1.6%
644 2
 
1.6%
631 2
 
1.6%
4404 2
 
1.6%
Other values (56) 59
47.2%
2023-12-11T02:43:24.993057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 88
14.4%
81
13.3%
1 76
12.5%
0 70
11.5%
5 63
10.3%
6 57
9.4%
2 40
6.6%
4 35
 
5.7%
3 32
 
5.3%
8 28
 
4.6%
Other values (2) 39
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 440
72.2%
Dash Punctuation 88
 
14.4%
Space Separator 81
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 76
17.3%
0 70
15.9%
5 63
14.3%
6 57
13.0%
2 40
9.1%
4 35
8.0%
3 32
7.3%
8 28
 
6.4%
9 22
 
5.0%
7 17
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Space Separator
ValueCountFrequency (%)
81
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 609
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 88
14.4%
81
13.3%
1 76
12.5%
0 70
11.5%
5 63
10.3%
6 57
9.4%
2 40
6.6%
4 35
 
5.7%
3 32
 
5.3%
8 28
 
4.6%
Other values (2) 39
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 609
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 88
14.4%
81
13.3%
1 76
12.5%
0 70
11.5%
5 63
10.3%
6 57
9.4%
2 40
6.6%
4 35
 
5.7%
3 32
 
5.3%
8 28
 
4.6%
Other values (2) 39
6.4%
Distinct64
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-11T02:43:25.500786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length40
Mean length27.969231
Min length21

Characters and Unicode

Total characters1818
Distinct characters124
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)96.9%

Sample

1st row부산광역시 남구 유엔로201번길 48 (대연동)
2nd row부산광역시 남구 우암로 248-1 (우암동)
3rd row부산광역시 남구 신선로 428, 동명대학교 학생복지관 333호 (용당동)
4th row부산광역시 남구 유엔평화로 122 (대연동)
5th row부산광역시 남구 전포대로 122 (문현동)
ValueCountFrequency (%)
부산광역시 65
17.9%
남구 65
17.9%
대연동 28
 
7.7%
문현동 15
 
4.1%
용호동 8
 
2.2%
유엔평화로 6
 
1.7%
용당동 6
 
1.7%
수영로 6
 
1.7%
14 3
 
0.8%
신선로 3
 
0.8%
Other values (137) 158
43.5%
2023-12-11T02:43:26.298654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
298
 
16.4%
80
 
4.4%
67
 
3.7%
66
 
3.6%
66
 
3.6%
66
 
3.6%
( 66
 
3.6%
) 66
 
3.6%
66
 
3.6%
65
 
3.6%
Other values (114) 912
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1068
58.7%
Space Separator 298
 
16.4%
Decimal Number 283
 
15.6%
Open Punctuation 66
 
3.6%
Close Punctuation 66
 
3.6%
Other Punctuation 24
 
1.3%
Dash Punctuation 13
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
7.5%
67
 
6.3%
66
 
6.2%
66
 
6.2%
66
 
6.2%
66
 
6.2%
65
 
6.1%
65
 
6.1%
65
 
6.1%
36
 
3.4%
Other values (99) 426
39.9%
Decimal Number
ValueCountFrequency (%)
1 64
22.6%
2 46
16.3%
3 33
11.7%
0 31
11.0%
6 22
 
7.8%
4 22
 
7.8%
9 19
 
6.7%
8 16
 
5.7%
5 16
 
5.7%
7 14
 
4.9%
Space Separator
ValueCountFrequency (%)
298
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Other Punctuation
ValueCountFrequency (%)
, 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1068
58.7%
Common 750
41.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
7.5%
67
 
6.3%
66
 
6.2%
66
 
6.2%
66
 
6.2%
66
 
6.2%
65
 
6.1%
65
 
6.1%
65
 
6.1%
36
 
3.4%
Other values (99) 426
39.9%
Common
ValueCountFrequency (%)
298
39.7%
( 66
 
8.8%
) 66
 
8.8%
1 64
 
8.5%
2 46
 
6.1%
3 33
 
4.4%
0 31
 
4.1%
, 24
 
3.2%
6 22
 
2.9%
4 22
 
2.9%
Other values (5) 78
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1068
58.7%
ASCII 750
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
298
39.7%
( 66
 
8.8%
) 66
 
8.8%
1 64
 
8.5%
2 46
 
6.1%
3 33
 
4.4%
0 31
 
4.1%
, 24
 
3.2%
6 22
 
2.9%
4 22
 
2.9%
Other values (5) 78
 
10.4%
Hangul
ValueCountFrequency (%)
80
 
7.5%
67
 
6.3%
66
 
6.2%
66
 
6.2%
66
 
6.2%
66
 
6.2%
65
 
6.1%
65
 
6.1%
65
 
6.1%
36
 
3.4%
Other values (99) 426
39.9%

Interactions

2023-12-11T02:43:20.599927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:43:26.512475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업소명성명영업장전화번호영업장도로명주소
순번1.0000.9351.0001.0000.945
업소명0.9351.0001.0001.0000.998
성명1.0001.0001.0001.0001.000
영업장전화번호1.0001.0001.0001.0001.000
영업장도로명주소0.9450.9981.0001.0001.000

Missing values

2023-12-11T02:43:20.891693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:43:21.126255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번업소명성명영업장전화번호영업장도로명주소
01지음박경태<NA>부산광역시 남구 유엔로201번길 48 (대연동)
12거성광고안성민<NA>부산광역시 남구 우암로 248-1 (우암동)
23플로스문지영<NA>부산광역시 남구 신선로 428, 동명대학교 학생복지관 333호 (용당동)
34우리디자인이지수<NA>부산광역시 남구 유엔평화로 122 (대연동)
45유진테크장태진<NA>부산광역시 남구 전포대로 122 (문현동)
56와우미디어조성영051 -919 -8567부산광역시 남구 유엔평화로76번길 26, 가람빌딩 701호 (대연동)
67(주)디자인엑스투김광051 -742 -0143부산광역시 남구 신선로 365, 6공학관 209호 부경대학교용당캠퍼스 (용당동)
78햇빛디자인김성태<NA>부산광역시 남구 유엔평화로 157 (용당동)
89부성애드최종열<NA>부산광역시 남구 유엔로 226 (대연동)
910에이엠씨(AMC)서룡051-817-9114부산광역시 남구 수영로 12, 상가동 106호 (문현동, 세종그랑시아아파트)
순번업소명성명영업장전화번호영업장도로명주소
5556이왕광고기획,현수막김상일051 -636 -8489부산광역시 남구 자성로 149 (문현동)
5657오색기획나현숙051 -626 -2722부산광역시 남구 유엔평화로 113-1 (대연동)
5758(주)시드애드컴강문봉051 -611 -9496부산광역시 남구 황령대로353번길 9-32 (대연동)
5859해바라기광고사황지근051 -628 -5300부산광역시 남구 수영로366번길 14 (대연동)
5960사인탤우수현051 -624 -6535부산광역시 남구 동명로112번길 14 (용호동)
6061착한디자인정은애051 -647 -9199부산광역시 남구 석포로 20 (감만동)
6162해오름광고오민수051 -644 -5411부산광역시 남구 고동골로 55-1 (문현동)
6263대구광고사이용희051 -627 -5858부산광역시 남구 유엔평화로 40 (대연동)
6364성산광고기획조성호051 -622 -2058부산광역시 남구 천제등로 35 (대연동)
6465누리광고기획김태용<NA>부산광역시 남구 유엔평화로38번길 63(대연동)