Overview

Dataset statistics

Number of variables: 4
Number of observations: 182
Missing cells: 131
Missing cells (%): 18.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.8 KiB
Average record size in memory: 32.7 B
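
The missing-cell percentage can be checked from the counts above. The sketch below assumes the share is computed as missing cells divided by (variables × observations); the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 4
n_observations = 182
missing_cells = 131

# Total number of cells in the table.
total_cells = n_variables * n_observations

# Share of missing cells as a percentage, rounded to one decimal place.
missing_pct = round(100 * missing_cells / total_cells, 1)

print(total_cells, missing_pct)
```

Under that assumption the result agrees with the 18.0% reported above.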

Variable types

Categorical: 2
Text: 2

Dataset

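Description: Data on publishers and printing businesses located in Yeonje-gu, Busan Metropolitan City. It provides fields such as business type, business name, location, and phone number.
URL: https://www.data.go.kr/data/3040708/fileData.do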

Alerts

전화번호 (phone number) has 131 (72.0%) missing values. Alert type: Missing

Reproduction

Analysis started: 2023-12-12 14:03:44.572355
Analysis finished: 2023-12-12 14:03:44.939268
Duration: 0.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종 (business type)
Categorical

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
출판사 (publisher) | 153
인쇄사 (printing business) | 29

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%
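
The Distinct and Unique figures measure different things. The sketch below assumes "Distinct" counts the different values present and "Unique" counts values that occur exactly once; the column is rebuilt from the counts reported for 업종.

```python
from collections import Counter

# Rebuild the 업종 column from the counts reported above.
values = ["출판사"] * 153 + ["인쇄사"] * 29

counts = Counter(values)

# Distinct: how many different values occur at all.
distinct = len(counts)

# Unique: how many values occur exactly once.
unique = sum(1 for c in counts.values() if c == 1)

# Both figures expressed as a share of all observations.
distinct_pct = round(100 * distinct / len(values), 1)
unique_pct = round(100 * unique / len(values), 1)

print(distinct, distinct_pct, unique, unique_pct)
```

With only two values that both repeat many times, the column has two distinct values and no unique ones.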

Sample

1st row: 출판사
2nd row: 출판사
3rd row: 출판사
4th row: 출판사
5th row: 출판사

Common Values

Value | Count | Frequency (%)
출판사 | 153 | 84.1%
인쇄사 | 29 | 15.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
출판사 | 153 | 84.1%
인쇄사 | 29 | 15.9%

사업체명칭 (business name)
Text

Distinct: 169
Distinct (%): 92.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 31
Median length: 13
Mean length: 6.9065934
Min length: 2

Characters and Unicode

Total characters: 1257
Distinct characters: 296
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
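
As an illustration of such character properties, the sketch below counts the Unicode general category of each character in the first five sample business names listed for this variable. It uses Python's standard `unicodedata` module, which exposes categories but not scripts or blocks, so only the category breakdown is shown.

```python
import unicodedata
from collections import Counter

# First five sample business names listed for this variable.
names = ["교지출판사", "도서출판 교문사", "도서출판 동혁", "(주)국제신문", "대학어학연구원"]

# Count the Unicode general category of every character.
# Lo = Other Letter, Zs = Space Separator,
# Ps = Open Punctuation, Pe = Close Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

print(category_counts)
```

Hangul syllables fall under "Other Letter" (Lo), which is why that category dominates the full-column figures in this report.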

Unique

Unique: 156
Unique (%): 85.7%

Sample

1st row: 교지출판사
2nd row: 도서출판 교문사
3rd row: 도서출판 동혁
4th row: (주)국제신문
5th row: 대학어학연구원

Value | Count | Frequency (%)
도서출판 | 24 | 9.3%
주식회사 | 11 | 4.3%
에프 | 2 | 0.8%
다솔 | 2 | 0.8%
디자인 | 2 | 0.8%
스튜디오 | 2 | 0.8%
출판사 | 2 | 0.8%
예감 | 2 | 0.8%
시선 | 2 | 0.8%
주)미리내에이앤씨 | 2 | 0.8%
Other values (198) | 207 | 80.2%

Most occurring characters

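Value | Count | Frequency (%)
(space) | 76 | 6.0%
(not preserved) | 44 | 3.5%
(not preserved) | 44 | 3.5%
(not preserved) | 39 | 3.1%
(not preserved) | 35 | 2.8%
(not preserved) | 30 | 2.4%
(not preserved) | 29 | 2.3%
(not preserved) | 27 | 2.1%
) | 26 | 2.1%
( | 25 | 2.0%
Other values (286) | 882 | 70.2%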

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1028 | 81.8%
Space Separator | 76 | 6.0%
Lowercase Letter | 60 | 4.8%
Uppercase Letter | 37 | 2.9%
Close Punctuation | 26 | 2.1%
Open Punctuation | 25 | 2.0%
Dash Punctuation | 2 | 0.2%
Other Punctuation | 2 | 0.2%
Decimal Number | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 44 | 4.3%
(not preserved) | 44 | 4.3%
(not preserved) | 39 | 3.8%
(not preserved) | 35 | 3.4%
(not preserved) | 30 | 2.9%
(not preserved) | 29 | 2.8%
(not preserved) | 27 | 2.6%
(not preserved) | 24 | 2.3%
(not preserved) | 24 | 2.3%
(not preserved) | 22 | 2.1%
Other values (245) | 710 | 69.1%

Lowercase Letter
Value | Count | Frequency (%)
o | 11 | 18.3%
s | 8 | 13.3%
i | 7 | 11.7%
a | 5 | 8.3%
n | 5 | 8.3%
l | 4 | 6.7%
t | 3 | 5.0%
d | 3 | 5.0%
e | 3 | 5.0%
r | 2 | 3.3%
Other values (7) | 9 | 15.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 4 | 10.8%
E | 4 | 10.8%
S | 3 | 8.1%
G | 3 | 8.1%
B | 3 | 8.1%
P | 3 | 8.1%
N | 2 | 5.4%
C | 2 | 5.4%
T | 2 | 5.4%
D | 2 | 5.4%
Other values (7) | 9 | 24.3%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 50.0%
' | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 76 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 26 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 25 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1028 | 81.8%
Common | 132 | 10.5%
Latin | 97 | 7.7%

Most frequent character per script

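Hangul
Value | Count | Frequency (%)
(not preserved) | 44 | 4.3%
(not preserved) | 44 | 4.3%
(not preserved) | 39 | 3.8%
(not preserved) | 35 | 3.4%
(not preserved) | 30 | 2.9%
(not preserved) | 29 | 2.8%
(not preserved) | 27 | 2.6%
(not preserved) | 24 | 2.3%
(not preserved) | 24 | 2.3%
(not preserved) | 22 | 2.1%
Other values (245) | 710 | 69.1%

Latin
Value | Count | Frequency (%)
o | 11 | 11.3%
s | 8 | 8.2%
i | 7 | 7.2%
a | 5 | 5.2%
n | 5 | 5.2%
l | 4 | 4.1%
A | 4 | 4.1%
E | 4 | 4.1%
S | 3 | 3.1%
G | 3 | 3.1%
Other values (24) | 43 | 44.3%

Common
Value | Count | Frequency (%)
(space) | 76 | 57.6%
) | 26 | 19.7%
( | 25 | 18.9%
- | 2 | 1.5%
1 | 1 | 0.8%
, | 1 | 0.8%
' | 1 | 0.8%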

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1028 | 81.8%
ASCII | 229 | 18.2%

Most frequent character per block

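ASCII
Value | Count | Frequency (%)
(space) | 76 | 33.2%
) | 26 | 11.4%
( | 25 | 10.9%
o | 11 | 4.8%
s | 8 | 3.5%
i | 7 | 3.1%
a | 5 | 2.2%
n | 5 | 2.2%
l | 4 | 1.7%
A | 4 | 1.7%
Other values (31) | 58 | 25.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 44 | 4.3%
(not preserved) | 44 | 4.3%
(not preserved) | 39 | 3.8%
(not preserved) | 35 | 3.4%
(not preserved) | 30 | 2.9%
(not preserved) | 29 | 2.8%
(not preserved) | 27 | 2.6%
(not preserved) | 24 | 2.3%
(not preserved) | 24 | 2.3%
(not preserved) | 22 | 2.1%
Other values (245) | 710 | 69.1%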

사업체소재지 (business location)
Categorical

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
부산광역시 연제구 연산동 | 99
부산광역시 연제구 거제동 | 83

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산광역시 연제구 연산동
2nd row: 부산광역시 연제구 거제동
3rd row: 부산광역시 연제구 거제동
4th row: 부산광역시 연제구 거제동
5th row: 부산광역시 연제구 연산동

Common Values

Value | Count | Frequency (%)
부산광역시 연제구 연산동 | 99 | 54.4%
부산광역시 연제구 거제동 | 83 | 45.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
부산광역시 | 182 | 33.3%
연제구 | 182 | 33.3%
연산동 | 99 | 18.1%
거제동 | 83 | 15.2%
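
The last table above looks like a count of whitespace-separated tokens rather than of whole values. The sketch below reproduces it under that assumption, rebuilding the column from the two address counts reported for 사업체소재지.

```python
from collections import Counter

# Rebuild the 사업체소재지 column from the counts reported above.
addresses = ["부산광역시 연제구 연산동"] * 99 + ["부산광역시 연제구 거제동"] * 83

# Split each address on whitespace and count the resulting tokens.
token_counts = Counter(tok for addr in addresses for tok in addr.split())
total_tokens = sum(token_counts.values())

# Print each token with its count and its share of all tokens.
for token, count in token_counts.most_common():
    print(token, count, f"{100 * count / total_tokens:.1f}%")
```

Each address contributes three tokens, so the percentages are shares of 546 tokens rather than of 182 rows.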

전화번호 (phone number)
Text

Alert: Missing

Distinct: 44
Distinct (%): 86.3%
Missing: 131
Missing (%): 72.0%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.039216
Min length: 12

Characters and Unicode

Total characters: 614
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 72.5%

Sample

1st row: 051-850-1049
2nd row: 051-756-5337
3rd row: 051-865-3338
4th row: 051-863-4800
5th row: 051-852-2357

Value | Count | Frequency (%)
051-807-9935 | 2 | 3.9%
051-866-6988 | 2 | 3.9%
051-865-9090 | 2 | 3.9%
051-853-8787 | 2 | 3.9%
051-253-0001 | 2 | 3.9%
051-756-5337 | 2 | 3.9%
051-624-4486 | 2 | 3.9%
051-623-6232 | 1 | 2.0%
051-502-4060 | 1 | 2.0%
051-623-4430 | 1 | 2.0%
Other values (34) | 34 | 66.7%
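
The sample values and the reported lengths of 12 to 13 characters suggest a dash-separated format. The sketch below checks the five sample numbers against a hypothetical pattern (area code, 3- or 4-digit exchange, 4-digit line number); the pattern is an assumption for illustration, not something the report states.

```python
import re

# Sample phone numbers listed for this variable.
phones = ["051-850-1049", "051-756-5337", "051-865-3338", "051-863-4800", "051-852-2357"]

# Hypothetical pattern: 2-3 digit area code, 3-4 digit exchange, 4-digit line number.
PHONE_RE = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

# Keep only the values that match the whole pattern.
valid = [p for p in phones if PHONE_RE.fullmatch(p)]

# Distinct string lengths and total number of dashes in the sample.
lengths = sorted({len(p) for p in phones})
dash_count = sum(p.count("-") for p in phones)

print(len(valid), lengths, dash_count)
```

A check like this could be run on the full column to flag values that deviate from the expected format.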

Most occurring characters

Value | Count | Frequency (%)
- | 102 | 16.6%
0 | 93 | 15.1%
5 | 87 | 14.2%
1 | 86 | 14.0%
8 | 46 | 7.5%
6 | 46 | 7.5%
7 | 36 | 5.9%
3 | 35 | 5.7%
4 | 32 | 5.2%
2 | 28 | 4.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 512 | 83.4%
Dash Punctuation | 102 | 16.6%
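
These character counts are consistent with the length and missing-value figures reported for 전화번호. A short arithmetic check, using only numbers from this report (the variable names are illustrative):

```python
# Figures copied from the report.
n_observations = 182
n_missing = 131
digit_chars = 512
dash_chars = 102

# Rows that actually carry a phone number.
n_present = n_observations - n_missing

# Total characters across all present phone numbers.
total_chars = digit_chars + dash_chars

# Derived figures to compare against the report.
mean_length = round(total_chars / n_present, 6)
digit_pct = round(100 * digit_chars / total_chars, 1)
dash_pct = round(100 * dash_chars / total_chars, 1)
dashes_per_number = dash_chars / n_present

print(n_present, total_chars, mean_length, digit_pct, dash_pct, dashes_per_number)
```

The mean length and the two category shares match the reported values, and the dash count works out to two dashes per present number.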

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 93 | 18.2%
5 | 87 | 17.0%
1 | 86 | 16.8%
8 | 46 | 9.0%
6 | 46 | 9.0%
7 | 36 | 7.0%
3 | 35 | 6.8%
4 | 32 | 6.2%
2 | 28 | 5.5%
9 | 23 | 4.5%

Dash Punctuation
Value | Count | Frequency (%)
- | 102 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 614 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 102 | 16.6%
0 | 93 | 15.1%
5 | 87 | 14.2%
1 | 86 | 14.0%
8 | 46 | 7.5%
6 | 46 | 7.5%
7 | 36 | 5.9%
3 | 35 | 5.7%
4 | 32 | 5.2%
2 | 28 | 4.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 614 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 102 | 16.6%
0 | 93 | 15.1%
5 | 87 | 14.2%
1 | 86 | 14.0%
8 | 46 | 7.5%
6 | 46 | 7.5%
7 | 36 | 5.9%
3 | 35 | 5.7%
4 | 32 | 5.2%
2 | 28 | 4.6%

Correlations

[Figure: correlation heatmap]

Variable | 업종 | 사업체소재지 | 전화번호
업종 | 1.000 | 0.000 | 0.000
사업체소재지 | 0.000 | 1.000 | 0.962
전화번호 | 0.000 | 0.962 | 1.000

[Figure: correlation heatmap]

Variable | 사업체소재지 | 업종
사업체소재지 | 1.000 | 0.000
업종 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 업종 | 사업체소재지
업종 | 1.000 | 0.000
사업체소재지 | 0.000 | 1.000
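
The report does not say which association measure produced these matrices. One common choice for pairs of categorical variables is Cramér's V; the sketch below is a plain, uncorrected version in pure Python, shown on hypothetical toy data. It is an illustration of the measure, not a reproduction of the report's own computation.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts = Counter(xs)
    y_counts = Counter(ys)

    # Pearson chi-square statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by the smaller table dimension minus one.
    k = min(len(x_counts), len(y_counts)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

# Hypothetical toy data: one perfectly associated pair, one independent pair.
v_perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(v_perfect, v_independent)
```

Values near 0 indicate no association and values near 1 indicate a near-deterministic relationship, which is how the 0.000 and 0.962 entries above would be read under this assumption.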

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]
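
Both views can be approximated in text form. The sketch below uses the first ten rows shown in the Sample section of this report, with `None` standing in for `<NA>`; the `#` and `.` symbols are an arbitrary choice for illustration.

```python
# Column names and first ten rows from the Sample section; None marks <NA>.
columns = ["업종", "사업체명칭", "사업체소재지", "전화번호"]
rows = [
    ("출판사", "교지출판사", "부산광역시 연제구 연산동", None),
    ("출판사", "도서출판 교문사", "부산광역시 연제구 거제동", None),
    ("출판사", "도서출판 동혁", "부산광역시 연제구 거제동", None),
    ("출판사", "(주)국제신문", "부산광역시 연제구 거제동", None),
    ("출판사", "대학어학연구원", "부산광역시 연제구 연산동", None),
    ("출판사", "부산경상대학 출판부", "부산광역시 연제구 연산동", "051-850-1049"),
    ("출판사", "법연", "부산광역시 연제구 연산동", None),
    ("출판사", "도서출판 대원", "부산광역시 연제구 거제동", None),
    ("출판사", "고스음악출판사", "부산광역시 연제구 연산동", None),
    ("출판사", "(주)뱅크코리아", "부산광역시 연제구 연산동", None),
]

# Nullity by column: number of missing cells per column.
missing_by_column = {
    col: sum(1 for row in rows if row[i] is None)
    for i, col in enumerate(columns)
}

# Nullity matrix: one line per row, '#' for a present cell and '.' for a missing one.
matrix_lines = ["".join("." if v is None else "#" for v in row) for row in rows]

print(missing_by_column)
print("\n".join(matrix_lines))
```

In this ten-row excerpt only 전화번호 has gaps, which mirrors the single Missing alert raised for the full dataset.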

Sample

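First rows

Index | 업종 (business type) | 사업체명칭 (business name) | 사업체소재지 (business location) | 전화번호 (phone number)
0 | 출판사 | 교지출판사 | 부산광역시 연제구 연산동 | <NA>
1 | 출판사 | 도서출판 교문사 | 부산광역시 연제구 거제동 | <NA>
2 | 출판사 | 도서출판 동혁 | 부산광역시 연제구 거제동 | <NA>
3 | 출판사 | (주)국제신문 | 부산광역시 연제구 거제동 | <NA>
4 | 출판사 | 대학어학연구원 | 부산광역시 연제구 연산동 | <NA>
5 | 출판사 | 부산경상대학 출판부 | 부산광역시 연제구 연산동 | 051-850-1049
6 | 출판사 | 법연 | 부산광역시 연제구 연산동 | <NA>
7 | 출판사 | 도서출판 대원 | 부산광역시 연제구 거제동 | <NA>
8 | 출판사 | 고스음악출판사 | 부산광역시 연제구 연산동 | <NA>
9 | 출판사 | (주)뱅크코리아 | 부산광역시 연제구 연산동 | <NA>

Last rows

Index | 업종 (business type) | 사업체명칭 (business name) | 사업체소재지 (business location) | 전화번호 (phone number)
172 | 인쇄사 | 지누애드 | 부산광역시 연제구 연산동 | 051-852-1694
173 | 인쇄사 | 유일기획인쇄 | 부산광역시 연제구 연산동 | 051-754-1576
174 | 인쇄사 | 주식회사 시선 | 부산광역시 연제구 거제동 | <NA>
175 | 인쇄사 | 엠에스상사 | 부산광역시 연제구 거제동 | <NA>
176 | 인쇄사 | 미주애드 | 부산광역시 연제구 거제동 | 051-807-9935
177 | 인쇄사 | 주식회사 디자인에이원 | 부산광역시 연제구 연산동 | 051-624-4486
178 | 인쇄사 | (주)참한디자인 | 부산광역시 연제구 거제동 | 051-711-4512
179 | 인쇄사 | 디자인 예감 | 부산광역시 연제구 거제동 | 051-464-4244
180 | 인쇄사 | 반석인쇄출판사 | 부산광역시 연제구 거제동 | <NA>
181 | 인쇄사 | 디올아트 | 부산광역시 연제구 거제동 | <NA>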