Overview

Dataset statistics

Number of variables5
Number of observations170
Missing cells49
Missing cells (%)5.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory41.8 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시남구출판사및인쇄소현황_20200731
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3034659

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
전화번호 has 49 (28.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:25:50.248137
Analysis finished2023-12-10 16:25:50.809586
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct170
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean85.5
Minimum1
Maximum170
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-11T01:25:50.899926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.45
Q143.25
median85.5
Q3127.75
95-th percentile161.55
Maximum170
Range169
Interquartile range (IQR)84.5

Descriptive statistics

Standard deviation49.218899
Coefficient of variation (CV)0.57565964
Kurtosis-1.2
Mean85.5
Median Absolute Deviation (MAD)42.5
Skewness0
Sum14535
Variance2422.5
MonotonicityStrictly increasing
2023-12-11T01:25:51.057105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
118 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
117 1
 
0.6%
Other values (160) 160
94.1%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
170 1
0.6%
169 1
0.6%
168 1
0.6%
167 1
0.6%
166 1
0.6%
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
출판사
144 
인쇄사
26 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 144
84.7%
인쇄사 26
 
15.3%

Length

2023-12-11T01:25:51.192236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:25:51.302460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 144
84.7%
인쇄사 26
 
15.3%
Distinct159
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T01:25:51.575222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length14
Mean length6.7764706
Min length2

Characters and Unicode

Total characters1152
Distinct characters307
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique149 ?
Unique (%)87.6%

Sample

1st row경성대학교출판부
2nd row부경대학교 출판부
3rd row도서출판 에이맨
4th row아베마리아출판사
5th row폰테고 출판사
ValueCountFrequency (%)
도서출판 12
 
5.4%
주식회사 6
 
2.7%
출판사 6
 
2.7%
이너스 3
 
1.4%
디자인통두손컴부설연구소 2
 
0.9%
디자인 2
 
0.9%
golden 2
 
0.9%
음악 2
 
0.9%
플랑 2
 
0.9%
출판부 2
 
0.9%
Other values (174) 182
82.4%
2023-12-11T01:25:51.994021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
4.4%
33
 
2.9%
31
 
2.7%
30
 
2.6%
28
 
2.4%
) 26
 
2.3%
( 26
 
2.3%
24
 
2.1%
21
 
1.8%
20
 
1.7%
Other values (297) 862
74.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 932
80.9%
Uppercase Letter 56
 
4.9%
Lowercase Letter 53
 
4.6%
Space Separator 51
 
4.4%
Close Punctuation 26
 
2.3%
Open Punctuation 26
 
2.3%
Other Punctuation 4
 
0.3%
Decimal Number 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
3.5%
31
 
3.3%
30
 
3.2%
28
 
3.0%
24
 
2.6%
21
 
2.3%
20
 
2.1%
19
 
2.0%
18
 
1.9%
17
 
1.8%
Other values (253) 691
74.1%
Uppercase Letter
ValueCountFrequency (%)
D 7
12.5%
E 6
10.7%
N 5
 
8.9%
C 5
 
8.9%
P 5
 
8.9%
G 4
 
7.1%
J 3
 
5.4%
I 3
 
5.4%
A 3
 
5.4%
R 2
 
3.6%
Other values (9) 13
23.2%
Lowercase Letter
ValueCountFrequency (%)
n 7
13.2%
o 7
13.2%
e 6
11.3%
a 4
7.5%
r 4
7.5%
i 4
7.5%
g 4
7.5%
d 3
 
5.7%
b 3
 
5.7%
l 2
 
3.8%
Other values (8) 9
17.0%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
& 2
50.0%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 924
80.2%
Common 111
 
9.6%
Latin 109
 
9.5%
Han 8
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
3.6%
31
 
3.4%
30
 
3.2%
28
 
3.0%
24
 
2.6%
21
 
2.3%
20
 
2.2%
19
 
2.1%
18
 
1.9%
17
 
1.8%
Other values (245) 683
73.9%
Latin
ValueCountFrequency (%)
n 7
 
6.4%
D 7
 
6.4%
o 7
 
6.4%
E 6
 
5.5%
e 6
 
5.5%
N 5
 
4.6%
C 5
 
4.6%
P 5
 
4.6%
G 4
 
3.7%
a 4
 
3.7%
Other values (27) 53
48.6%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Common
ValueCountFrequency (%)
51
45.9%
) 26
23.4%
( 26
23.4%
. 2
 
1.8%
1 2
 
1.8%
2 2
 
1.8%
& 2
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 924
80.2%
ASCII 220
 
19.1%
CJK 8
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51
23.2%
) 26
 
11.8%
( 26
 
11.8%
n 7
 
3.2%
D 7
 
3.2%
o 7
 
3.2%
E 6
 
2.7%
e 6
 
2.7%
N 5
 
2.3%
C 5
 
2.3%
Other values (34) 74
33.6%
Hangul
ValueCountFrequency (%)
33
 
3.6%
31
 
3.4%
30
 
3.2%
28
 
3.0%
24
 
2.6%
21
 
2.3%
20
 
2.2%
19
 
2.1%
18
 
1.9%
17
 
1.8%
Other values (245) 683
73.9%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct159
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T01:25:52.329240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length46
Mean length35.770588
Min length21

Characters and Unicode

Total characters6081
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique149 ?
Unique (%)87.6%

Sample

1st row부산광역시 남구 수영로 309 (대연동)
2nd row부산광역시 남구 용소로 45 (대연동)
3rd row부산광역시 남구 수영로 25 (문현동)
4th row부산광역시 남구 장고개로16번길 13 (우암동)
5th row부산광역시 남구 조각공원로 42, 503호 (대연동, 우성)
ValueCountFrequency (%)
부산광역시 170
 
14.6%
남구 170
 
14.6%
대연동 86
 
7.4%
수영로 38
 
3.3%
문현동 25
 
2.1%
용호동 23
 
2.0%
용당동 19
 
1.6%
312 13
 
1.1%
신선로 13
 
1.1%
감만동 12
 
1.0%
Other values (365) 594
51.1%
2023-12-11T01:25:52.988004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
994
 
16.3%
1 278
 
4.6%
232
 
3.8%
, 204
 
3.4%
197
 
3.2%
182
 
3.0%
181
 
3.0%
180
 
3.0%
2 174
 
2.9%
173
 
2.8%
Other values (172) 3286
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3382
55.6%
Decimal Number 1110
 
18.3%
Space Separator 994
 
16.3%
Other Punctuation 204
 
3.4%
Close Punctuation 172
 
2.8%
Open Punctuation 172
 
2.8%
Dash Punctuation 28
 
0.5%
Uppercase Letter 18
 
0.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
232
 
6.9%
197
 
5.8%
182
 
5.4%
181
 
5.4%
180
 
5.3%
173
 
5.1%
171
 
5.1%
170
 
5.0%
170
 
5.0%
149
 
4.4%
Other values (150) 1577
46.6%
Decimal Number
ValueCountFrequency (%)
1 278
25.0%
2 174
15.7%
0 147
13.2%
3 133
12.0%
4 81
 
7.3%
6 80
 
7.2%
5 76
 
6.8%
9 68
 
6.1%
8 44
 
4.0%
7 29
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
B 5
27.8%
G 4
22.2%
L 4
22.2%
J 2
 
11.1%
O 2
 
11.1%
A 1
 
5.6%
Space Separator
ValueCountFrequency (%)
994
100.0%
Other Punctuation
ValueCountFrequency (%)
, 204
100.0%
Close Punctuation
ValueCountFrequency (%)
) 172
100.0%
Open Punctuation
ValueCountFrequency (%)
( 172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3382
55.6%
Common 2681
44.1%
Latin 18
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
232
 
6.9%
197
 
5.8%
182
 
5.4%
181
 
5.4%
180
 
5.3%
173
 
5.1%
171
 
5.1%
170
 
5.0%
170
 
5.0%
149
 
4.4%
Other values (150) 1577
46.6%
Common
ValueCountFrequency (%)
994
37.1%
1 278
 
10.4%
, 204
 
7.6%
2 174
 
6.5%
) 172
 
6.4%
( 172
 
6.4%
0 147
 
5.5%
3 133
 
5.0%
4 81
 
3.0%
6 80
 
3.0%
Other values (6) 246
 
9.2%
Latin
ValueCountFrequency (%)
B 5
27.8%
G 4
22.2%
L 4
22.2%
J 2
 
11.1%
O 2
 
11.1%
A 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3382
55.6%
ASCII 2699
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
994
36.8%
1 278
 
10.3%
, 204
 
7.6%
2 174
 
6.4%
) 172
 
6.4%
( 172
 
6.4%
0 147
 
5.4%
3 133
 
4.9%
4 81
 
3.0%
6 80
 
3.0%
Other values (12) 264
 
9.8%
Hangul
ValueCountFrequency (%)
232
 
6.9%
197
 
5.8%
182
 
5.4%
181
 
5.4%
180
 
5.3%
173
 
5.1%
171
 
5.1%
170
 
5.0%
170
 
5.0%
149
 
4.4%
Other values (150) 1577
46.6%

전화번호
Text

MISSING 

Distinct106
Distinct (%)87.6%
Missing49
Missing (%)28.8%
Memory size1.5 KiB
2023-12-11T01:25:53.329585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.033058
Min length9

Characters and Unicode

Total characters1456
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)76.0%

Sample

1st row051-620-4355
2nd row051-620-1325
3rd row051-645-9801
4th row051-625-7373
5th row051-631-8101
ValueCountFrequency (%)
051-632-9005 3
 
2.5%
051-248-3699 2
 
1.7%
051-626-0777 2
 
1.7%
051-936-1408 2
 
1.7%
051-631-9907 2
 
1.7%
051-623-8003 2
 
1.7%
051-624-4620 2
 
1.7%
051-646-2392 2
 
1.7%
051-623-7733 2
 
1.7%
051-464-1230 2
 
1.7%
Other values (96) 100
82.6%
2023-12-11T01:25:53.844301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 241
16.6%
1 217
14.9%
0 208
14.3%
5 161
11.1%
6 151
10.4%
2 109
7.5%
3 89
 
6.1%
4 82
 
5.6%
8 71
 
4.9%
7 68
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1215
83.4%
Dash Punctuation 241
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 217
17.9%
0 208
17.1%
5 161
13.3%
6 151
12.4%
2 109
9.0%
3 89
7.3%
4 82
 
6.7%
8 71
 
5.8%
7 68
 
5.6%
9 59
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 241
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1456
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 241
16.6%
1 217
14.9%
0 208
14.3%
5 161
11.1%
6 151
10.4%
2 109
7.5%
3 89
 
6.1%
4 82
 
5.6%
8 71
 
4.9%
7 68
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1456
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 241
16.6%
1 217
14.9%
0 208
14.3%
5 161
11.1%
6 151
10.4%
2 109
7.5%
3 89
 
6.1%
4 82
 
5.6%
8 71
 
4.9%
7 68
 
4.7%

Interactions

2023-12-11T01:25:50.539202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:25:53.972565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.985
업종0.9851.000
2023-12-11T01:25:54.109902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.871
업종0.8711.000

Missing values

2023-12-11T01:25:50.665712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:25:50.765863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종사업체명칭사업체소재지(도로명)전화번호
01출판사경성대학교출판부부산광역시 남구 수영로 309 (대연동)051-620-4355
12출판사부경대학교 출판부부산광역시 남구 용소로 45 (대연동)051-620-1325
23출판사도서출판 에이맨부산광역시 남구 수영로 25 (문현동)051-645-9801
34출판사아베마리아출판사부산광역시 남구 장고개로16번길 13 (우암동)<NA>
45출판사폰테고 출판사부산광역시 남구 조각공원로 42, 503호 (대연동, 우성)051-625-7373
56출판사기러기문화원부산광역시 남구 수영로39번길 35, 코리아학원 (문현동)<NA>
67출판사(재)한국경제정책연구원부산광역시 남구 자성로 152, 1611호 (문현동, 한일오피스텔)051-631-8101
78출판사도서출판 논문의집부산광역시 남구 용소로 40 (대연동)<NA>
89출판사도서출판글초롱부산광역시 남구 수영로334번길 4 (대연동)051-623-7733
910출판사만나출판사부산광역시 남구 동명로164번길 28-1 (용호동)051-622-8536
연번업종사업체명칭사업체소재지(도로명)전화번호
160161인쇄사담앤북스(부산)부산광역시 남구 진남로 82 (대연동)051-244-1251
161162인쇄사동지프린텍부산광역시 남구 양지골로 79 (감만동)051-624-6211
162163인쇄사한길기획부산광역시 남구 황령대로 355-13 (대연동)051-624-8898
163164인쇄사PDJ MEDIA(피디제이미디어)부산광역시 남구 신선로 365, 부경대학교용당캠퍼스 407호 (용당동)051-702-1113
164165인쇄사플랑부산광역시 남구 전포대로 133, 비아이시티 1914호 (문현동)051-631-9907
165166인쇄사디자인통두손컴부설연구소부산광역시 남구 용소로46번길 7, 청년창조발전소 고고씽JOB 2층 (대연동)051-623-8003
166167인쇄사주식회사 유니온키드부산광역시 남구 용소로46번길 7, 청년창조발전소 지하1층 (대연동)070-8676-7196
167168인쇄사주식회사 재원부산광역시 남구 용소로46번길 4, 아이노스오피스텔 102호 (대연동)051-248-3699
168169인쇄사수애드부산광역시 남구 동명로170번길 93, 6동 1층 102호 (용호동, 용호동일타운)<NA>
169170인쇄사이너스부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 1905,1906호 (대연동)051-632-9005