Overview

Dataset statistics

Number of variables5
Number of observations118
Missing cells62
Missing cells (%)10.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory41.1 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description부산광역시_사하구_출판사및인쇄사현황_20221124
Author부산광역시 사하구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045772

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 62 (52.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 17:23:55.830514
Analysis finished2023-12-10 17:23:56.738910
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
출판사
95 
인쇄사
23 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 95
80.5%
인쇄사 23
 
19.5%

Length

2023-12-11T02:23:56.903091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:23:57.132201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 95
80.5%
인쇄사 23
 
19.5%
Distinct116
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T02:23:57.645386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length12
Mean length6.6610169
Min length2

Characters and Unicode

Total characters786
Distinct characters262
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)96.6%

Sample

1st row태극도 출판부
2nd row동문출판기획
3rd row도서출판 동아기획
4th row(주)도시인쇄문화사
5th row동아대학교출판사
ValueCountFrequency (%)
도서출판 11
 
6.3%
주식회사 5
 
2.9%
출판사 4
 
2.3%
동문출판기획 2
 
1.1%
출판 2
 
1.1%
디자인 2
 
1.1%
인쇄사 2
 
1.1%
동아기획 2
 
1.1%
글꽃 2
 
1.1%
예람 1
 
0.6%
Other values (141) 141
81.0%
2023-12-11T02:23:58.665442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
 
7.1%
26
 
3.3%
25
 
3.2%
25
 
3.2%
23
 
2.9%
15
 
1.9%
15
 
1.9%
15
 
1.9%
14
 
1.8%
14
 
1.8%
Other values (252) 558
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 603
76.7%
Uppercase Letter 60
 
7.6%
Space Separator 56
 
7.1%
Lowercase Letter 37
 
4.7%
Open Punctuation 11
 
1.4%
Close Punctuation 11
 
1.4%
Decimal Number 5
 
0.6%
Other Punctuation 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
4.3%
25
 
4.1%
25
 
4.1%
23
 
3.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
13
 
2.2%
Other values (210) 418
69.3%
Uppercase Letter
ValueCountFrequency (%)
S 8
13.3%
D 5
 
8.3%
A 5
 
8.3%
O 4
 
6.7%
N 4
 
6.7%
C 4
 
6.7%
E 3
 
5.0%
T 3
 
5.0%
R 3
 
5.0%
J 3
 
5.0%
Other values (12) 18
30.0%
Lowercase Letter
ValueCountFrequency (%)
e 6
16.2%
s 5
13.5%
l 5
13.5%
a 5
13.5%
i 3
8.1%
n 3
8.1%
o 3
8.1%
r 2
 
5.4%
g 2
 
5.4%
t 1
 
2.7%
Other values (2) 2
 
5.4%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
4 1
 
20.0%
5 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
' 1
33.3%
Space Separator
ValueCountFrequency (%)
56
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 601
76.5%
Latin 97
 
12.3%
Common 86
 
10.9%
Han 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
4.3%
25
 
4.2%
25
 
4.2%
23
 
3.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
13
 
2.2%
Other values (208) 416
69.2%
Latin
ValueCountFrequency (%)
S 8
 
8.2%
e 6
 
6.2%
s 5
 
5.2%
l 5
 
5.2%
a 5
 
5.2%
D 5
 
5.2%
A 5
 
5.2%
O 4
 
4.1%
N 4
 
4.1%
C 4
 
4.1%
Other values (24) 46
47.4%
Common
ValueCountFrequency (%)
56
65.1%
( 11
 
12.8%
) 11
 
12.8%
1 3
 
3.5%
. 2
 
2.3%
4 1
 
1.2%
5 1
 
1.2%
' 1
 
1.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 600
76.3%
ASCII 183
 
23.3%
CJK 2
 
0.3%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
56
30.6%
( 11
 
6.0%
) 11
 
6.0%
S 8
 
4.4%
e 6
 
3.3%
s 5
 
2.7%
l 5
 
2.7%
a 5
 
2.7%
D 5
 
2.7%
A 5
 
2.7%
Other values (32) 66
36.1%
Hangul
ValueCountFrequency (%)
26
 
4.3%
25
 
4.2%
25
 
4.2%
23
 
3.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
13
 
2.2%
Other values (207) 415
69.2%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct110
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T02:23:59.298063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length36.076271
Min length21

Characters and Unicode

Total characters4257
Distinct characters156
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)87.3%

Sample

1st row부산광역시 사하구 감천로142번길 25-4 (감천동)
2nd row부산광역시 사하구 낙동대로520번길 1 (하단동)
3rd row부산광역시 사하구 낙동대로 536 (하단동)
4th row부산광역시 사하구 다대로170번길 13 (신평동)
5th row부산광역시 사하구 낙동대로550번길 37 (하단동)
ValueCountFrequency (%)
부산광역시 118
 
15.0%
사하구 118
 
15.0%
하단동 28
 
3.6%
다대동 23
 
2.9%
장림동 17
 
2.2%
괴정동 16
 
2.0%
당리동 15
 
1.9%
신평동 14
 
1.8%
낙동대로 13
 
1.7%
다대낙조2길 9
 
1.1%
Other values (271) 416
52.9%
2023-12-11T02:24:00.329918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
669
 
15.7%
197
 
4.6%
1 187
 
4.4%
169
 
4.0%
0 133
 
3.1%
, 131
 
3.1%
127
 
3.0%
124
 
2.9%
121
 
2.8%
121
 
2.8%
Other values (146) 2278
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2421
56.9%
Decimal Number 788
 
18.5%
Space Separator 669
 
15.7%
Other Punctuation 131
 
3.1%
Close Punctuation 118
 
2.8%
Open Punctuation 118
 
2.8%
Dash Punctuation 8
 
0.2%
Uppercase Letter 3
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
197
 
8.1%
169
 
7.0%
127
 
5.2%
124
 
5.1%
121
 
5.0%
121
 
5.0%
119
 
4.9%
118
 
4.9%
118
 
4.9%
106
 
4.4%
Other values (129) 1101
45.5%
Decimal Number
ValueCountFrequency (%)
1 187
23.7%
0 133
16.9%
2 109
13.8%
3 104
13.2%
4 62
 
7.9%
5 59
 
7.5%
7 43
 
5.5%
6 41
 
5.2%
8 29
 
3.7%
9 21
 
2.7%
Space Separator
ValueCountFrequency (%)
669
100.0%
Other Punctuation
ValueCountFrequency (%)
, 131
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2421
56.9%
Common 1832
43.0%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
197
 
8.1%
169
 
7.0%
127
 
5.2%
124
 
5.1%
121
 
5.0%
121
 
5.0%
119
 
4.9%
118
 
4.9%
118
 
4.9%
106
 
4.4%
Other values (129) 1101
45.5%
Common
ValueCountFrequency (%)
669
36.5%
1 187
 
10.2%
0 133
 
7.3%
, 131
 
7.2%
) 118
 
6.4%
( 118
 
6.4%
2 109
 
5.9%
3 104
 
5.7%
4 62
 
3.4%
5 59
 
3.2%
Other values (5) 142
 
7.8%
Latin
ValueCountFrequency (%)
A 3
75.0%
e 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2421
56.9%
ASCII 1836
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
669
36.4%
1 187
 
10.2%
0 133
 
7.2%
, 131
 
7.1%
) 118
 
6.4%
( 118
 
6.4%
2 109
 
5.9%
3 104
 
5.7%
4 62
 
3.4%
5 59
 
3.2%
Other values (7) 146
 
8.0%
Hangul
ValueCountFrequency (%)
197
 
8.1%
169
 
7.0%
127
 
5.2%
124
 
5.1%
121
 
5.0%
121
 
5.0%
119
 
4.9%
118
 
4.9%
118
 
4.9%
106
 
4.4%
Other values (129) 1101
45.5%

전화번호
Text

MISSING 

Distinct54
Distinct (%)96.4%
Missing62
Missing (%)52.5%
Memory size1.1 KiB
2023-12-11T02:24:00.856079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.089286
Min length12

Characters and Unicode

Total characters677
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)92.9%

Sample

1st row051-292-0571
2nd row051-206-9785
3rd row051-291-7655
4th row051-292-1177
5th row051-200-6391
ValueCountFrequency (%)
051-292-1177 2
 
3.6%
051-206-9785 2
 
3.6%
051-292-4583 1
 
1.8%
051-631-3032 1
 
1.8%
051-203-0570 1
 
1.8%
070-8881-3825 1
 
1.8%
070-4197-6693 1
 
1.8%
051-714-3935 1
 
1.8%
051-206-1891 1
 
1.8%
050-6435-6394 1
 
1.8%
Other values (44) 44
78.6%
2023-12-11T02:24:01.620144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 116
17.1%
- 112
16.5%
1 96
14.2%
5 85
12.6%
2 71
10.5%
9 39
 
5.8%
7 38
 
5.6%
3 36
 
5.3%
6 29
 
4.3%
4 28
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 565
83.5%
Dash Punctuation 112
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 116
20.5%
1 96
17.0%
5 85
15.0%
2 71
12.6%
9 39
 
6.9%
7 38
 
6.7%
3 36
 
6.4%
6 29
 
5.1%
4 28
 
5.0%
8 27
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 677
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 116
17.1%
- 112
16.5%
1 96
14.2%
5 85
12.6%
2 71
10.5%
9 39
 
5.8%
7 38
 
5.6%
3 36
 
5.3%
6 29
 
4.3%
4 28
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 677
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 116
17.1%
- 112
16.5%
1 96
14.2%
5 85
12.6%
2 71
10.5%
9 39
 
5.8%
7 38
 
5.6%
3 36
 
5.3%
6 29
 
4.3%
4 28
 
4.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2022-11-24 00:00:00
Maximum2022-11-24 00:00:00
2023-12-11T02:24:01.868000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:24:02.090736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-11T02:24:02.226156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종전화번호
업종1.0000.000
전화번호0.0001.000

Missing values

2023-12-11T02:23:56.412201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:23:56.650255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종사업체명칭도로명주소전화번호데이터기준일자
0출판사태극도 출판부부산광역시 사하구 감천로142번길 25-4 (감천동)051-292-05712022-11-24
1출판사동문출판기획부산광역시 사하구 낙동대로520번길 1 (하단동)051-206-97852022-11-24
2출판사도서출판 동아기획부산광역시 사하구 낙동대로 536 (하단동)051-291-76552022-11-24
3출판사(주)도시인쇄문화사부산광역시 사하구 다대로170번길 13 (신평동)051-292-11772022-11-24
4출판사동아대학교출판사부산광역시 사하구 낙동대로550번길 37 (하단동)051-200-63912022-11-24
5출판사힌트출판사부산광역시 사하구 사리로 47 (괴정동)051-207-25302022-11-24
6출판사(주)켑스부산광역시 사하구 낙동대로550번길 37 (하단동)051-203-54902022-11-24
7출판사한국인간재활공학연구소부산광역시 사하구 승학로3번길 87 (하단동)051-204-50852022-11-24
8출판사국민전화번호부 출판부산광역시 사하구 회화나무길 67 (괴정동)051-204-71142022-11-24
9출판사시르영어웹부산광역시 사하구 하신번영로 365, 145-212호 (하단동, 가락상가)051-292-33712022-11-24
업종사업체명칭도로명주소전화번호데이터기준일자
108인쇄사(주)페이퍼박스부산광역시 사하구 장평로 66 (장림동)051-262-52042022-11-24
109인쇄사주식회사 동아피앤피부산광역시 사하구 하신중앙로27번길 6 (장림동)051-807-06002022-11-24
110인쇄사주식회사 동아위드부산광역시 사하구 낙동대로 542, 지하 3층 301호 (하단동, 대우에덴프라자)051-291-09112022-11-24
111인쇄사디자인 글꽃부산광역시 사하구 낙동대로 535 (하단동)<NA>2022-11-24
112인쇄사주식회사 다정플러스부산광역시 사하구 하신번영로 294, 티파니빌라트 2층 202호 (하단동, 티파니빌라트)070-5208-88802022-11-24
113인쇄사주식회사 예람부산광역시 사하구 낙동대로 542, 138,139,140호 (하단동, 대우에덴프라자)051-631-30322022-11-24
114인쇄사도서출판 책이야기부산광역시 사하구 장림번영로104번길 110, 2층 (장림동)<NA>2022-11-24
115인쇄사예스 CTP부산광역시 사하구 장림번영로104번길 110 (장림동)<NA>2022-11-24
116인쇄사(주)유승인쇄제책부산광역시 사하구 장림번영로104번길 110 (장림동)051-463-67372022-11-24
117인쇄사알스코(ARSHCO)부산광역시 사하구 하신중앙로 324, 보해이브빌2차 2층 (하단동)051-201-01912022-11-24