Overview

Dataset statistics

Number of variables4
Number of observations92
Missing cells64
Missing cells (%)17.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory33.4 B

Variable types

Categorical2
Text2

Dataset

Description부산광역시 기장군청에서 관리하는 자료로, 기장군 내에 등록된 출판사 및 인쇄사의 업체명, 주소, 업종 등의 정보를 제공하는 자료입니다.
Author부산광역시 기장군
URLhttps://www.data.go.kr/data/15077536/fileData.do

Alerts

사업체명칭 has 1 (1.1%) missing valuesMissing
전화번호 has 63 (68.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:27:49.198109
Analysis finished2023-12-12 14:27:49.707694
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
출판사
67 
인쇄사
24 
<NA>
 
1

Length

Max length4
Median length3
Mean length3.0108696
Min length3

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 67
72.8%
인쇄사 24
 
26.1%
<NA> 1
 
1.1%

Length

2023-12-12T23:27:49.774262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:27:49.872474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 67
72.8%
인쇄사 24
 
26.1%
na 1
 
1.1%

사업체명칭
Text

MISSING 

Distinct86
Distinct (%)94.5%
Missing1
Missing (%)1.1%
Memory size868.0 B
2023-12-12T23:27:50.161845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length6.5714286
Min length3

Characters and Unicode

Total characters598
Distinct characters225
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)89.0%

Sample

1st row놀이속의 세상
2nd row영상교육
3rd row수다과학연구소
4th row경호무술출판사
5th row장원(차문화)
ValueCountFrequency (%)
도서출판 6
 
5.0%
주식회사 4
 
3.4%
현대이앤지 2
 
1.7%
사인몰 2
 
1.7%
나라테크 2
 
1.7%
한국능력개발진흥원 2
 
1.7%
힘찬문서 2
 
1.7%
1
 
0.8%
출판 1
 
0.8%
식스북 1
 
0.8%
Other values (96) 96
80.7%
2023-12-12T23:27:50.690460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
4.7%
24
 
4.0%
( 13
 
2.2%
) 13
 
2.2%
12
 
2.0%
12
 
2.0%
11
 
1.8%
11
 
1.8%
11
 
1.8%
11
 
1.8%
Other values (215) 452
75.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 469
78.4%
Uppercase Letter 41
 
6.9%
Lowercase Letter 31
 
5.2%
Space Separator 28
 
4.7%
Open Punctuation 13
 
2.2%
Close Punctuation 13
 
2.2%
Other Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
5.1%
12
 
2.6%
12
 
2.6%
11
 
2.3%
11
 
2.3%
11
 
2.3%
11
 
2.3%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (180) 350
74.6%
Uppercase Letter
ValueCountFrequency (%)
S 6
14.6%
A 4
9.8%
D 3
 
7.3%
Y 3
 
7.3%
I 3
 
7.3%
W 3
 
7.3%
R 3
 
7.3%
P 3
 
7.3%
O 3
 
7.3%
E 2
 
4.9%
Other values (6) 8
19.5%
Lowercase Letter
ValueCountFrequency (%)
s 5
16.1%
o 5
16.1%
e 4
12.9%
m 3
9.7%
b 2
 
6.5%
u 2
 
6.5%
k 2
 
6.5%
i 2
 
6.5%
p 1
 
3.2%
a 1
 
3.2%
Other values (4) 4
12.9%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
· 1
33.3%
Space Separator
ValueCountFrequency (%)
28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 469
78.4%
Latin 72
 
12.0%
Common 57
 
9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
5.1%
12
 
2.6%
12
 
2.6%
11
 
2.3%
11
 
2.3%
11
 
2.3%
11
 
2.3%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (180) 350
74.6%
Latin
ValueCountFrequency (%)
S 6
 
8.3%
s 5
 
6.9%
o 5
 
6.9%
A 4
 
5.6%
e 4
 
5.6%
D 3
 
4.2%
m 3
 
4.2%
Y 3
 
4.2%
I 3
 
4.2%
W 3
 
4.2%
Other values (20) 33
45.8%
Common
ValueCountFrequency (%)
28
49.1%
( 13
22.8%
) 13
22.8%
. 2
 
3.5%
· 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 469
78.4%
ASCII 128
 
21.4%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28
21.9%
( 13
 
10.2%
) 13
 
10.2%
S 6
 
4.7%
s 5
 
3.9%
o 5
 
3.9%
A 4
 
3.1%
e 4
 
3.1%
D 3
 
2.3%
m 3
 
2.3%
Other values (24) 44
34.4%
Hangul
ValueCountFrequency (%)
24
 
5.1%
12
 
2.6%
12
 
2.6%
11
 
2.3%
11
 
2.3%
11
 
2.3%
11
 
2.3%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (180) 350
74.6%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct6
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size868.0 B
부산광역시 기장군 기장읍
36 
부산광역시 기장군 정관읍
32 
부산광역시 기장군 장안읍
12 
부산광역시 기장군 철마면
부산광역시 기장군 일광읍

Length

Max length13
Median length13
Mean length12.902174
Min length4

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row부산광역시 기장군 기장읍
2nd row부산광역시 기장군 정관읍
3rd row부산광역시 기장군 기장읍
4th row부산광역시 기장군 기장읍
5th row부산광역시 기장군 기장읍

Common Values

ValueCountFrequency (%)
부산광역시 기장군 기장읍 36
39.1%
부산광역시 기장군 정관읍 32
34.8%
부산광역시 기장군 장안읍 12
 
13.0%
부산광역시 기장군 철마면 6
 
6.5%
부산광역시 기장군 일광읍 5
 
5.4%
<NA> 1
 
1.1%

Length

2023-12-12T23:27:50.843469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:27:50.951409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 91
33.2%
기장군 91
33.2%
기장읍 36
 
13.1%
정관읍 32
 
11.7%
장안읍 12
 
4.4%
철마면 6
 
2.2%
일광읍 5
 
1.8%
na 1
 
0.4%

전화번호
Text

MISSING 

Distinct26
Distinct (%)89.7%
Missing63
Missing (%)68.5%
Memory size868.0 B
2023-12-12T23:27:51.175938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters348
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)79.3%

Sample

1st row051-724-6723
2nd row051-959-9979
3rd row051-722-0316
4th row051-333-4007
5th row051-722-9881
ValueCountFrequency (%)
051-722-9881 2
 
6.9%
051-333-4007 2
 
6.9%
051-727-4074 2
 
6.9%
051-515-0345 1
 
3.4%
051-724-6723 1
 
3.4%
051-721-3648 1
 
3.4%
051-516-9350 1
 
3.4%
051-724-8279 1
 
3.4%
051-898-2456 1
 
3.4%
051-722-0620 1
 
3.4%
Other values (16) 16
55.2%
2023-12-12T23:27:51.546762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 58
16.7%
5 50
14.4%
1 49
14.1%
0 48
13.8%
7 32
9.2%
2 32
9.2%
4 21
 
6.0%
8 17
 
4.9%
3 15
 
4.3%
6 14
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 290
83.3%
Dash Punctuation 58
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 50
17.2%
1 49
16.9%
0 48
16.6%
7 32
11.0%
2 32
11.0%
4 21
7.2%
8 17
 
5.9%
3 15
 
5.2%
6 14
 
4.8%
9 12
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 348
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 58
16.7%
5 50
14.4%
1 49
14.1%
0 48
13.8%
7 32
9.2%
2 32
9.2%
4 21
 
6.0%
8 17
 
4.9%
3 15
 
4.3%
6 14
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 348
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 58
16.7%
5 50
14.4%
1 49
14.1%
0 48
13.8%
7 32
9.2%
2 32
9.2%
4 21
 
6.0%
8 17
 
4.9%
3 15
 
4.3%
6 14
 
4.0%

Correlations

2023-12-12T23:27:51.661454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종사업체명칭사업체소재지전화번호
업종1.0000.0000.2180.000
사업체명칭0.0001.0000.9941.000
사업체소재지0.2180.9941.0001.000
전화번호0.0001.0001.0001.000
2023-12-12T23:27:51.800771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업체소재지업종
사업체소재지1.0000.261
업종0.2611.000
2023-12-12T23:27:51.890102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종사업체소재지
업종1.0000.261
사업체소재지0.2611.000

Missing values

2023-12-12T23:27:49.460164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:27:49.553675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:27:49.650637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종사업체명칭사업체소재지전화번호
0출판사놀이속의 세상부산광역시 기장군 기장읍051-724-6723
1출판사영상교육부산광역시 기장군 정관읍<NA>
2출판사수다과학연구소부산광역시 기장군 기장읍051-959-9979
3출판사경호무술출판사부산광역시 기장군 기장읍<NA>
4출판사장원(차문화)부산광역시 기장군 기장읍<NA>
5출판사해광식품 주식회사부산광역시 기장군 기장읍<NA>
6출판사도서출판 마루부산광역시 기장군 기장읍<NA>
7출판사가이오부산광역시 기장군 장안읍<NA>
8출판사부울경뉴스 협동조합부산광역시 기장군 기장읍051-722-0316
9출판사(주)한울림부산광역시 기장군 기장읍<NA>
업종사업체명칭사업체소재지전화번호
82인쇄사나라테크부산광역시 기장군 장안읍051-333-4007
83인쇄사현대이앤지부산광역시 기장군 기장읍051-722-9881
84인쇄사디지털뱅크부산광역시 기장군 기장읍051-722-0620
85인쇄사뱅크OA 시스템부산광역시 기장군 정관읍051-898-2456
86인쇄사힘찬문서부산광역시 기장군 기장읍051-724-8279
87인쇄사컴닥터복사나라부산광역시 기장군 정관읍<NA>
88인쇄사(주)시화부산광역시 기장군 정관읍051-516-9350
89인쇄사주식회사디앤디부산광역시 기장군 정관읍<NA>
90인쇄사(주)세대ITS부산광역시 기장군 정관읍051-515-1441
91<NA><NA><NA><NA>