Overview

Dataset statistics

Number of variables3
Number of observations216
Missing cells1
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory24.6 B

Variable types

Text2
Categorical1

Dataset

Description부산광역시 금정구 관내 출판업 등록 업체 현황입니다. 해당 내용의 구성요소로는 소재지, 업체명, 업종 등을 포함하고있습니다.
Author부산광역시 금정구
URLhttps://www.data.go.kr/data/3055406/fileData.do

Alerts

업종 has constant value ""Constant

Reproduction

Analysis started2024-03-14 21:15:25.402935
Analysis finished2024-03-14 21:15:26.309862
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct215
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-15T06:15:27.038340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length6.4537037
Min length2

Characters and Unicode

Total characters1394
Distinct characters360
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)99.1%

Sample

1st row사단법인 부산대학교 출판문화원
2nd row제일출판인쇄
3rd row만수출판사
4th row도서출판 늘함께
5th row월간불교세계출판부
ValueCountFrequency (%)
도서출판 19
 
6.6%
주식회사 10
 
3.5%
디자인 3
 
1.0%
세컨리폼 2
 
0.7%
출판부 2
 
0.7%
사단법인 2
 
0.7%
club 2
 
0.7%
가을 2
 
0.7%
광진출판사 1
 
0.3%
영테크 1
 
0.3%
Other values (243) 243
84.7%
2024-03-15T06:15:28.578009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
5.1%
50
 
3.6%
50
 
3.6%
49
 
3.5%
30
 
2.2%
30
 
2.2%
28
 
2.0%
22
 
1.6%
20
 
1.4%
20
 
1.4%
Other values (350) 1024
73.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1171
84.0%
Space Separator 71
 
5.1%
Lowercase Letter 60
 
4.3%
Uppercase Letter 45
 
3.2%
Close Punctuation 15
 
1.1%
Open Punctuation 15
 
1.1%
Decimal Number 9
 
0.6%
Other Punctuation 6
 
0.4%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
4.3%
50
 
4.3%
49
 
4.2%
30
 
2.6%
30
 
2.6%
28
 
2.4%
22
 
1.9%
20
 
1.7%
20
 
1.7%
20
 
1.7%
Other values (301) 852
72.8%
Lowercase Letter
ValueCountFrequency (%)
n 8
13.3%
r 7
11.7%
t 6
10.0%
a 6
10.0%
e 5
 
8.3%
i 5
 
8.3%
s 3
 
5.0%
o 2
 
3.3%
k 2
 
3.3%
p 2
 
3.3%
Other values (11) 14
23.3%
Uppercase Letter
ValueCountFrequency (%)
S 6
13.3%
O 5
11.1%
C 4
8.9%
E 4
8.9%
M 3
 
6.7%
T 3
 
6.7%
H 3
 
6.7%
U 3
 
6.7%
B 3
 
6.7%
R 2
 
4.4%
Other values (6) 9
20.0%
Decimal Number
ValueCountFrequency (%)
0 3
33.3%
3 2
22.2%
1 2
22.2%
4 1
 
11.1%
2 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
. 3
50.0%
& 2
33.3%
% 1
 
16.7%
Space Separator
ValueCountFrequency (%)
71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1161
83.3%
Common 118
 
8.5%
Latin 105
 
7.5%
Han 10
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
4.3%
50
 
4.3%
49
 
4.2%
30
 
2.6%
30
 
2.6%
28
 
2.4%
22
 
1.9%
20
 
1.7%
20
 
1.7%
20
 
1.7%
Other values (291) 842
72.5%
Latin
ValueCountFrequency (%)
n 8
 
7.6%
r 7
 
6.7%
t 6
 
5.7%
S 6
 
5.7%
a 6
 
5.7%
e 5
 
4.8%
O 5
 
4.8%
i 5
 
4.8%
C 4
 
3.8%
E 4
 
3.8%
Other values (27) 49
46.7%
Common
ValueCountFrequency (%)
71
60.2%
) 15
 
12.7%
( 15
 
12.7%
0 3
 
2.5%
. 3
 
2.5%
- 2
 
1.7%
& 2
 
1.7%
3 2
 
1.7%
1 2
 
1.7%
4 1
 
0.8%
Other values (2) 2
 
1.7%
Han
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1161
83.3%
ASCII 223
 
16.0%
CJK 10
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71
31.8%
) 15
 
6.7%
( 15
 
6.7%
n 8
 
3.6%
r 7
 
3.1%
t 6
 
2.7%
S 6
 
2.7%
a 6
 
2.7%
e 5
 
2.2%
O 5
 
2.2%
Other values (39) 79
35.4%
Hangul
ValueCountFrequency (%)
50
 
4.3%
50
 
4.3%
49
 
4.2%
30
 
2.6%
30
 
2.6%
28
 
2.4%
22
 
1.9%
20
 
1.7%
20
 
1.7%
20
 
1.7%
Other values (291) 842
72.5%
CJK
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Distinct211
Distinct (%)98.1%
Missing1
Missing (%)0.5%
Memory size1.8 KiB
2024-03-15T06:15:29.715431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length44
Mean length34.069767
Min length21

Characters and Unicode

Total characters7325
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique207 ?
Unique (%)96.3%

Sample

1st row부산광역시 금정구 부산대학로63번길 2 (장전동)
2nd row부산광역시 금정구 부산대학로 10, 103동 23층 2301호 (부곡동, 대우아파트)
3rd row부산광역시 금정구 부산대학로64번길 14-7 (장전동)
4th row부산광역시 금정구 두실로 16 (남산동)
5th row부산광역시 금정구 수림로 132 (장전동)
ValueCountFrequency (%)
부산광역시 215
 
15.8%
금정구 215
 
15.8%
장전동 79
 
5.8%
구서동 38
 
2.8%
부곡동 31
 
2.3%
남산동 31
 
2.3%
금강로 21
 
1.5%
2층 12
 
0.9%
1층 11
 
0.8%
금정로 11
 
0.8%
Other values (450) 694
51.1%
2024-03-15T06:15:31.253543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1201
 
16.4%
299
 
4.1%
297
 
4.1%
294
 
4.0%
281
 
3.8%
267
 
3.6%
1 259
 
3.5%
238
 
3.2%
219
 
3.0%
217
 
3.0%
Other values (188) 3753
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4139
56.5%
Decimal Number 1277
 
17.4%
Space Separator 1201
 
16.4%
Close Punctuation 217
 
3.0%
Open Punctuation 217
 
3.0%
Other Punctuation 217
 
3.0%
Dash Punctuation 42
 
0.6%
Uppercase Letter 13
 
0.2%
Lowercase Letter 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
299
 
7.2%
297
 
7.2%
294
 
7.1%
281
 
6.8%
267
 
6.5%
238
 
5.8%
219
 
5.3%
217
 
5.2%
216
 
5.2%
216
 
5.2%
Other values (164) 1595
38.5%
Decimal Number
ValueCountFrequency (%)
1 259
20.3%
2 186
14.6%
0 167
13.1%
3 136
10.6%
5 112
8.8%
4 107
8.4%
7 94
 
7.4%
6 86
 
6.7%
9 71
 
5.6%
8 59
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
B 6
46.2%
A 3
23.1%
D 1
 
7.7%
F 1
 
7.7%
T 1
 
7.7%
P 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 216
99.5%
/ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1201
100.0%
Close Punctuation
ValueCountFrequency (%)
) 217
100.0%
Open Punctuation
ValueCountFrequency (%)
( 217
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4139
56.5%
Common 3171
43.3%
Latin 15
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
299
 
7.2%
297
 
7.2%
294
 
7.1%
281
 
6.8%
267
 
6.5%
238
 
5.8%
219
 
5.3%
217
 
5.2%
216
 
5.2%
216
 
5.2%
Other values (164) 1595
38.5%
Common
ValueCountFrequency (%)
1201
37.9%
1 259
 
8.2%
) 217
 
6.8%
( 217
 
6.8%
, 216
 
6.8%
2 186
 
5.9%
0 167
 
5.3%
3 136
 
4.3%
5 112
 
3.5%
4 107
 
3.4%
Other values (6) 353
 
11.1%
Latin
ValueCountFrequency (%)
B 6
40.0%
A 3
20.0%
b 1
 
6.7%
1
 
6.7%
D 1
 
6.7%
F 1
 
6.7%
T 1
 
6.7%
P 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4139
56.5%
ASCII 3185
43.5%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1201
37.7%
1 259
 
8.1%
) 217
 
6.8%
( 217
 
6.8%
, 216
 
6.8%
2 186
 
5.8%
0 167
 
5.2%
3 136
 
4.3%
5 112
 
3.5%
4 107
 
3.4%
Other values (13) 367
 
11.5%
Hangul
ValueCountFrequency (%)
299
 
7.2%
297
 
7.2%
294
 
7.1%
281
 
6.8%
267
 
6.5%
238
 
5.8%
219
 
5.3%
217
 
5.2%
216
 
5.2%
216
 
5.2%
Other values (164) 1595
38.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
출판사
216 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 216
100.0%

Length

2024-03-15T06:15:31.702526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T06:15:32.022072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 216
100.0%

Missing values

2024-03-15T06:15:25.944591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T06:15:26.202591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업체명칭사업체소재지(도로명)업종
0사단법인 부산대학교 출판문화원부산광역시 금정구 부산대학로63번길 2 (장전동)출판사
1제일출판인쇄부산광역시 금정구 부산대학로 10, 103동 23층 2301호 (부곡동, 대우아파트)출판사
2만수출판사부산광역시 금정구 부산대학로64번길 14-7 (장전동)출판사
3도서출판 늘함께부산광역시 금정구 두실로 16 (남산동)출판사
4월간불교세계출판부부산광역시 금정구 수림로 132 (장전동)출판사
5도서출판미래원부산광역시 금정구 중앙대로1841번길 65, 1층 103호 (구서동, 구서골드1상가)출판사
6동성출판사부산광역시 금정구 서부로 74-6 (서동)출판사
7광진출판사부산광역시 금정구 부산대학로 60-1 (장전동)출판사
8한둘학력개발연구소부산광역시 금정구 중앙대로1959번길 11 (구서동)출판사
9시공연출부산광역시 금정구 부곡로 1 (부곡동)출판사
사업체명칭사업체소재지(도로명)업종
206금정미술관숙제학원부산광역시 금정구 금강로194번길 7-3, 1층 (장전동)출판사
207만지작(作) Ent.부산광역시 금정구 장전로 49, b1층 (장전동)출판사
208부또황부산광역시 금정구 부곡로156번길 24, 401호 (부곡동, 만복빌)출판사
209OAES CLUB부산광역시 금정구 금샘로438번길 7, 302호 (구서동, 대성빌라)출판사
210(주)금정문화네트워크부산광역시 금정구 장전온천천로 87 (장전동)출판사
211동사서독부산광역시 금정구 금정로 87 (장전동)출판사
212잉글리쉬올라부산광역시 금정구 금샘로229번길 29, 102동 705호 (구서동, 금강부광아파트)출판사
213에덴부산광역시 금정구 수림로20번길 19, 효정빌딩 4층 (부곡동)출판사
214재와니의 취미생활부산광역시 금정구 동현로43번길 29, 201호 (부곡동, 동림아트빌라)출판사
215디자인 뜰엔숲부산광역시 금정구 중앙대로 2021-6, 3층 (남산동)출판사