Overview

Dataset statistics

Number of variables5
Number of observations22
Missing cells9
Missing cells (%)8.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1012.0 B
Average record size in memory46.0 B

Variable types

Text4
Categorical1

Dataset

Description부산광역시_강서구_출판인쇄업현황_20180930
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15023174

Alerts

전화번호 has 9 (40.9%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:40:55.061328
Analysis finished2023-12-10 16:40:55.695414
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct20
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-11T01:40:55.894881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length7.5
Mean length5.8181818
Min length2

Characters and Unicode

Total characters128
Distinct characters78
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)81.8%

Sample

1st row천지문화사
2nd row프리죤
3rd row다음세대
4th row리얼라이프북스
5th row정림
ValueCountFrequency (%)
영신사 2
 
7.1%
도서출판 2
 
7.1%
참기획 2
 
7.1%
books 1
 
3.6%
천지문화사 1
 
3.6%
프리죤 1
 
3.6%
서흥페케이지 1
 
3.6%
대진종합인쇄소 1
 
3.6%
천지종합인쇄소 1
 
3.6%
동진인쇄소 1
 
3.6%
Other values (15) 15
53.6%
2023-12-11T01:40:56.409138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
4.7%
4
 
3.1%
4
 
3.1%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (68) 91
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108
84.4%
Lowercase Letter 10
 
7.8%
Space Separator 6
 
4.7%
Uppercase Letter 2
 
1.6%
Close Punctuation 1
 
0.8%
Open Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
3.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (55) 74
68.5%
Lowercase Letter
ValueCountFrequency (%)
s 2
20.0%
o 2
20.0%
k 1
10.0%
y 1
10.0%
n 1
10.0%
e 1
10.0%
t 1
10.0%
i 1
10.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
D 1
50.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108
84.4%
Latin 12
 
9.4%
Common 8
 
6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
3.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (55) 74
68.5%
Latin
ValueCountFrequency (%)
s 2
16.7%
o 2
16.7%
k 1
8.3%
B 1
8.3%
y 1
8.3%
n 1
8.3%
D 1
8.3%
e 1
8.3%
t 1
8.3%
i 1
8.3%
Common
ValueCountFrequency (%)
6
75.0%
) 1
 
12.5%
( 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108
84.4%
ASCII 20
 
15.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6
30.0%
s 2
 
10.0%
o 2
 
10.0%
k 1
 
5.0%
) 1
 
5.0%
B 1
 
5.0%
y 1
 
5.0%
n 1
 
5.0%
( 1
 
5.0%
D 1
 
5.0%
Other values (3) 3
15.0%
Hangul
ValueCountFrequency (%)
4
 
3.7%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (55) 74
68.5%
Distinct19
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-11T01:40:56.759403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length49
Mean length35.454545
Min length23

Characters and Unicode

Total characters780
Distinct characters83
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)72.7%

Sample

1st row부산광역시 강서구 대저로 253-1 (대저1동)
2nd row부산광역시 강서구 녹산산단232로 38-26 (송정동)
3rd row부산광역시 강서구 제도로 230 (명지동)
4th row부산광역시 강서구 명지오션시티10로 16, 202동 1202호 (명지동, 영어도시 퀸덤1차)
5th row부산광역시 강서구 낙동남로 1032 (명지동)
ValueCountFrequency (%)
부산광역시 22
 
15.9%
강서구 22
 
15.9%
명지동 8
 
5.8%
대저2동 6
 
4.3%
대저1동 5
 
3.6%
대저로 4
 
2.9%
253-1 2
 
1.4%
맥도길377번길 2
 
1.4%
158-15 2
 
1.4%
울만로 2
 
1.4%
Other values (54) 63
45.7%
2023-12-11T01:40:57.306556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
116
 
14.9%
1 47
 
6.0%
2 32
 
4.1%
32
 
4.1%
28
 
3.6%
28
 
3.6%
24
 
3.1%
23
 
2.9%
0 22
 
2.8%
( 22
 
2.8%
Other values (73) 406
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 429
55.0%
Decimal Number 165
 
21.2%
Space Separator 116
 
14.9%
Open Punctuation 22
 
2.8%
Close Punctuation 22
 
2.8%
Other Punctuation 17
 
2.2%
Dash Punctuation 9
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
7.5%
28
 
6.5%
28
 
6.5%
24
 
5.6%
23
 
5.4%
22
 
5.1%
22
 
5.1%
22
 
5.1%
22
 
5.1%
21
 
4.9%
Other values (58) 185
43.1%
Decimal Number
ValueCountFrequency (%)
1 47
28.5%
2 32
19.4%
0 22
13.3%
3 17
 
10.3%
5 14
 
8.5%
6 11
 
6.7%
7 7
 
4.2%
8 6
 
3.6%
9 6
 
3.6%
4 3
 
1.8%
Space Separator
ValueCountFrequency (%)
116
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 429
55.0%
Common 351
45.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
7.5%
28
 
6.5%
28
 
6.5%
24
 
5.6%
23
 
5.4%
22
 
5.1%
22
 
5.1%
22
 
5.1%
22
 
5.1%
21
 
4.9%
Other values (58) 185
43.1%
Common
ValueCountFrequency (%)
116
33.0%
1 47
13.4%
2 32
 
9.1%
0 22
 
6.3%
( 22
 
6.3%
) 22
 
6.3%
3 17
 
4.8%
, 17
 
4.8%
5 14
 
4.0%
6 11
 
3.1%
Other values (5) 31
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 429
55.0%
ASCII 351
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
116
33.0%
1 47
13.4%
2 32
 
9.1%
0 22
 
6.3%
( 22
 
6.3%
) 22
 
6.3%
3 17
 
4.8%
, 17
 
4.8%
5 14
 
4.0%
6 11
 
3.1%
Other values (5) 31
 
8.8%
Hangul
ValueCountFrequency (%)
32
 
7.5%
28
 
6.5%
28
 
6.5%
24
 
5.6%
23
 
5.4%
22
 
5.1%
22
 
5.1%
22
 
5.1%
22
 
5.1%
21
 
4.9%
Other values (58) 185
43.1%
Distinct20
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-11T01:40:57.636918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length39
Mean length27.181818
Min length21

Characters and Unicode

Total characters598
Distinct characters62
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)81.8%

Sample

1st row부산광역시 강서구 대저1동 2377-5번지
2nd row부산광역시 강서구 송정동 1709-2번지 부산중소기원종합지원센터 3층
3rd row부산광역시 강서구 명지동 457-681번지 6통3반
4th row부산광역시 강서구 명지동 3231번지 영어도시 퀸덤1차 202동 1202호
5th row부산광역시 강서구 명지동 115-3번지
ValueCountFrequency (%)
부산광역시 22
21.2%
강서구 22
21.2%
명지동 8
 
7.7%
대저2동 6
 
5.8%
대저1동 5
 
4.8%
송정동 2
 
1.9%
202동 2
 
1.9%
3639-6번지 2
 
1.9%
5469번지 2
 
1.9%
2377-5번지 2
 
1.9%
Other values (31) 31
29.8%
2023-12-11T01:40:58.119366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
16.9%
34
 
5.7%
2 30
 
5.0%
1 29
 
4.8%
3 26
 
4.3%
26
 
4.3%
24
 
4.0%
23
 
3.8%
23
 
3.8%
23
 
3.8%
Other values (52) 259
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 339
56.7%
Decimal Number 142
23.7%
Space Separator 101
 
16.9%
Dash Punctuation 16
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
10.0%
26
 
7.7%
24
 
7.1%
23
 
6.8%
23
 
6.8%
23
 
6.8%
22
 
6.5%
22
 
6.5%
22
 
6.5%
22
 
6.5%
Other values (40) 98
28.9%
Decimal Number
ValueCountFrequency (%)
2 30
21.1%
1 29
20.4%
3 26
18.3%
7 12
 
8.5%
6 11
 
7.7%
5 10
 
7.0%
0 8
 
5.6%
9 7
 
4.9%
4 6
 
4.2%
8 3
 
2.1%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 339
56.7%
Common 259
43.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
10.0%
26
 
7.7%
24
 
7.1%
23
 
6.8%
23
 
6.8%
23
 
6.8%
22
 
6.5%
22
 
6.5%
22
 
6.5%
22
 
6.5%
Other values (40) 98
28.9%
Common
ValueCountFrequency (%)
101
39.0%
2 30
 
11.6%
1 29
 
11.2%
3 26
 
10.0%
- 16
 
6.2%
7 12
 
4.6%
6 11
 
4.2%
5 10
 
3.9%
0 8
 
3.1%
9 7
 
2.7%
Other values (2) 9
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 339
56.7%
ASCII 259
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
101
39.0%
2 30
 
11.6%
1 29
 
11.2%
3 26
 
10.0%
- 16
 
6.2%
7 12
 
4.6%
6 11
 
4.2%
5 10
 
3.9%
0 8
 
3.1%
9 7
 
2.7%
Other values (2) 9
 
3.5%
Hangul
ValueCountFrequency (%)
34
 
10.0%
26
 
7.7%
24
 
7.1%
23
 
6.8%
23
 
6.8%
23
 
6.8%
22
 
6.5%
22
 
6.5%
22
 
6.5%
22
 
6.5%
Other values (40) 98
28.9%

전화번호
Text

MISSING 

Distinct11
Distinct (%)84.6%
Missing9
Missing (%)40.9%
Memory size308.0 B
2023-12-11T01:40:58.392047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters156
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)69.2%

Sample

1st row051-971-2233
2nd row051-831-8545
3rd row051-271-0030
4th row051-971-4704
5th row051-862-6000
ValueCountFrequency (%)
051-971-2233 2
15.4%
051-604-8921 2
15.4%
051-831-8545 1
7.7%
051-271-0030 1
7.7%
051-971-4704 1
7.7%
051-862-6000 1
7.7%
051-248-1588 1
7.7%
051-973-8656 1
7.7%
051-972-0202 1
7.7%
051-971-3705 1
7.7%
2023-12-11T01:40:59.182434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 27
17.3%
- 26
16.7%
1 23
14.7%
5 18
11.5%
2 14
9.0%
7 10
 
6.4%
8 9
 
5.8%
9 8
 
5.1%
3 8
 
5.1%
6 7
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 130
83.3%
Dash Punctuation 26
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 27
20.8%
1 23
17.7%
5 18
13.8%
2 14
10.8%
7 10
 
7.7%
8 9
 
6.9%
9 8
 
6.2%
3 8
 
6.2%
6 7
 
5.4%
4 6
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 156
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 27
17.3%
- 26
16.7%
1 23
14.7%
5 18
11.5%
2 14
9.0%
7 10
 
6.4%
8 9
 
5.8%
9 8
 
5.1%
3 8
 
5.1%
6 7
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 156
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 27
17.3%
- 26
16.7%
1 23
14.7%
5 18
11.5%
2 14
9.0%
7 10
 
6.4%
8 9
 
5.8%
9 8
 
5.1%
3 8
 
5.1%
6 7
 
4.5%

업종
Categorical

Distinct2
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size308.0 B
출판사
15 
인쇄사

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 15
68.2%
인쇄사 7
31.8%

Length

2023-12-11T01:40:59.336918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:40:59.493062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 15
68.2%
인쇄사 7
31.8%

Correlations

2023-12-11T01:40:59.593051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업체명칭도로명주소지번주소전화번호업종
사업체명칭1.0001.0000.9931.0000.000
도로명주소1.0001.0001.0001.0000.000
지번주소0.9931.0001.0001.0000.000
전화번호1.0001.0001.0001.0000.000
업종0.0000.0000.0000.0001.000

Missing values

2023-12-11T01:40:55.480797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:40:55.635412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업체명칭도로명주소지번주소전화번호업종
0천지문화사부산광역시 강서구 대저로 253-1 (대저1동)부산광역시 강서구 대저1동 2377-5번지051-971-2233출판사
1프리죤부산광역시 강서구 녹산산단232로 38-26 (송정동)부산광역시 강서구 송정동 1709-2번지 부산중소기원종합지원센터 3층051-831-8545출판사
2다음세대부산광역시 강서구 제도로 230 (명지동)부산광역시 강서구 명지동 457-681번지 6통3반051-271-0030출판사
3리얼라이프북스부산광역시 강서구 명지오션시티10로 16, 202동 1202호 (명지동, 영어도시 퀸덤1차)부산광역시 강서구 명지동 3231번지 영어도시 퀸덤1차 202동 1202호<NA>출판사
4정림부산광역시 강서구 낙동남로 1032 (명지동)부산광역시 강서구 명지동 115-3번지051-971-4704출판사
5도서출판 한국선급부산광역시 강서구 명지오션시티9로 36 (명지동)부산광역시 강서구 명지동 3229-22번지051-862-6000출판사
6작은통일부산광역시 강서구 명지오션시티10로 17 (명지동, 퀸덤1차)부산광역시 강서구 명지동 3230-11번지051-248-1588출판사
7신현출판사부산광역시 강서구 명지국제5로 30, 105동 1501호 (명지동, 명지대방노블랜드오션뷰1차)부산광역시 강서구 명지동 2527번지<NA>출판사
8귀를 닫은 토끼부산광역시 강서구 유통단지1로 41-46721, 109동 206호 (대저2동, 티플렉스)부산광역시 강서구 대저2동 3153-1번지 티플렉스 109동 206호<NA>출판사
9데스티니 북스(Destiny Books)부산광역시 강서구 명지국제5로 109, 206동 2401호 (명지동, 명지2차금강펜테리움센트럴파크)부산광역시 강서구 명지동 3343번지<NA>출판사
사업체명칭도로명주소지번주소전화번호업종
12영신사부산광역시 강서구 울만로 308-1, 1동 (대저2동)부산광역시 강서구 대저2동 3639-6번지051-604-8921출판사
13도서출판 파랑새부산광역시 강서구 과학산단2로20번길 69, 117동 1901호 (지사동, 지사금강펜테리움)부산광역시 강서구 지사동 1184-1번지 지사금강펜테리움<NA>출판사
14넉넉부산광역시 강서구 명지오션시티1로 155, 128동 3층 306호 (명지동, 명지오션시티 한신휴플러스)부산광역시 강서구 명지동 3236번지 명지오션시티 한신휴플러스<NA>출판사
15동진인쇄소부산광역시 강서구 대저로 266 (대저1동)부산광역시 강서구 대저1동 2372-2번지051-973-8656인쇄사
16천지종합인쇄소부산광역시 강서구 대저로 253-1 (대저1동)부산광역시 강서구 대저1동 2377-5번지051-971-2233인쇄사
17대진종합인쇄소부산광역시 강서구 대저로 259-1 (대저1동)부산광역시 강서구 대저1동 2375-1번지051-972-0202인쇄사
18서흥페케이지부산광역시 강서구 공항로 1223 (대저1동)부산광역시 강서구 대저1동 2706-5번지051-971-3705인쇄사
19참기획부산광역시 강서구 맥도길377번길 158-15 (대저2동)부산광역시 강서구 대저2동 5469번지<NA>인쇄사
20영신사부산광역시 강서구 울만로 308-1, 1동 (대저2동)부산광역시 강서구 대저2동 3639-6번지 1동051-604-8921인쇄사
21성광정판인쇄부산광역시 강서구 녹산산단362로 30 (송정동)부산광역시 강서구 송정동 1737-14번지051-261-0027인쇄사