Overview

Dataset statistics

Number of variables5
Number of observations160
Missing cells47
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory41.8 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시남구출판사및인쇄소현황
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3034659

Alerts

순번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 순번High correlation
전화번호 has 47 (29.4%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:25:56.317805
Analysis finished2023-12-10 16:25:57.445587
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct160
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.5
Minimum1
Maximum160
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-11T01:25:57.559902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.95
Q140.75
median80.5
Q3120.25
95-th percentile152.05
Maximum160
Range159
Interquartile range (IQR)79.5

Descriptive statistics

Standard deviation46.332134
Coefficient of variation (CV)0.57555446
Kurtosis-1.2
Mean80.5
Median Absolute Deviation (MAD)40
Skewness0
Sum12880
Variance2146.6667
MonotonicityStrictly increasing
2023-12-11T01:25:57.798302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
82 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
Other values (150) 150
93.8%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
152 1
0.6%
151 1
0.6%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
출판사
136 
인쇄사
24 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 136
85.0%
인쇄사 24
 
15.0%

Length

2023-12-11T01:25:57.956959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:25:58.089096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 136
85.0%
인쇄사 24
 
15.0%
Distinct151
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-11T01:25:58.430841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length14
Mean length6.85
Min length2

Characters and Unicode

Total characters1096
Distinct characters299
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)88.8%

Sample

1st row경성대학교출판부
2nd row부경대학교 출판부
3rd row도서출판 에이맨
4th row아베마리아출판사
5th row폰테고 출판사
ValueCountFrequency (%)
도서출판 14
 
6.6%
출판사 7
 
3.3%
주식회사 6
 
2.8%
플랑 2
 
0.9%
한길기획 2
 
0.9%
재원 2
 
0.9%
디자인통두손컴부설연구소 2
 
0.9%
디자인 2
 
0.9%
다인커뮤니케이션 2
 
0.9%
출판부 2
 
0.9%
Other values (165) 172
80.8%
2023-12-11T01:25:59.056304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53
 
4.8%
33
 
3.0%
32
 
2.9%
30
 
2.7%
24
 
2.2%
) 21
 
1.9%
( 21
 
1.9%
20
 
1.8%
19
 
1.7%
19
 
1.7%
Other values (289) 824
75.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 890
81.2%
Uppercase Letter 55
 
5.0%
Space Separator 53
 
4.8%
Lowercase Letter 48
 
4.4%
Close Punctuation 21
 
1.9%
Open Punctuation 21
 
1.9%
Other Punctuation 4
 
0.4%
Decimal Number 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
3.7%
32
 
3.6%
30
 
3.4%
24
 
2.7%
20
 
2.2%
19
 
2.1%
19
 
2.1%
19
 
2.1%
18
 
2.0%
18
 
2.0%
Other values (245) 658
73.9%
Uppercase Letter
ValueCountFrequency (%)
D 7
12.7%
E 6
10.9%
N 5
 
9.1%
C 5
 
9.1%
G 4
 
7.3%
P 4
 
7.3%
I 3
 
5.5%
A 3
 
5.5%
J 3
 
5.5%
R 2
 
3.6%
Other values (9) 13
23.6%
Lowercase Letter
ValueCountFrequency (%)
n 7
14.6%
o 6
12.5%
g 4
8.3%
a 4
8.3%
i 4
8.3%
e 4
8.3%
r 4
8.3%
d 3
 
6.2%
l 2
 
4.2%
b 2
 
4.2%
Other values (8) 8
16.7%
Other Punctuation
ValueCountFrequency (%)
& 2
50.0%
. 2
50.0%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%
Space Separator
ValueCountFrequency (%)
53
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 882
80.5%
Common 103
 
9.4%
Latin 103
 
9.4%
Han 8
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
3.7%
32
 
3.6%
30
 
3.4%
24
 
2.7%
20
 
2.3%
19
 
2.2%
19
 
2.2%
19
 
2.2%
18
 
2.0%
18
 
2.0%
Other values (237) 650
73.7%
Latin
ValueCountFrequency (%)
n 7
 
6.8%
D 7
 
6.8%
E 6
 
5.8%
o 6
 
5.8%
N 5
 
4.9%
C 5
 
4.9%
G 4
 
3.9%
P 4
 
3.9%
g 4
 
3.9%
a 4
 
3.9%
Other values (27) 51
49.5%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Common
ValueCountFrequency (%)
53
51.5%
) 21
 
20.4%
( 21
 
20.4%
& 2
 
1.9%
. 2
 
1.9%
1 2
 
1.9%
2 2
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 882
80.5%
ASCII 206
 
18.8%
CJK 8
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53
25.7%
) 21
 
10.2%
( 21
 
10.2%
n 7
 
3.4%
D 7
 
3.4%
E 6
 
2.9%
o 6
 
2.9%
N 5
 
2.4%
C 5
 
2.4%
G 4
 
1.9%
Other values (34) 71
34.5%
Hangul
ValueCountFrequency (%)
33
 
3.7%
32
 
3.6%
30
 
3.4%
24
 
2.7%
20
 
2.3%
19
 
2.2%
19
 
2.2%
19
 
2.2%
18
 
2.0%
18
 
2.0%
Other values (237) 650
73.7%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct152
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-11T01:25:59.474381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length45
Mean length35.59375
Min length21

Characters and Unicode

Total characters5695
Distinct characters190
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)90.0%

Sample

1st row부산광역시 남구 수영로 309 (대연동)
2nd row부산광역시 남구 용소로 45 (대연동)
3rd row부산광역시 남구 수영로 25 (문현동)
4th row부산광역시 남구 장고개로16번길 13 (우암동)
5th row부산광역시 남구 조각공원로 42, 503호 (대연동, 우성)
ValueCountFrequency (%)
부산광역시 160
 
14.8%
남구 160
 
14.8%
대연동 82
 
7.6%
수영로 32
 
3.0%
용호동 23
 
2.1%
문현동 20
 
1.9%
용당동 19
 
1.8%
신선로 13
 
1.2%
365 12
 
1.1%
감만동 12
 
1.1%
Other values (351) 548
50.7%
2023-12-11T01:26:00.116814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
922
 
16.2%
1 255
 
4.5%
215
 
3.8%
, 192
 
3.4%
180
 
3.2%
174
 
3.1%
171
 
3.0%
169
 
3.0%
164
 
2.9%
) 162
 
2.8%
Other values (180) 3091
54.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3196
56.1%
Decimal Number 1016
 
17.8%
Space Separator 922
 
16.2%
Other Punctuation 192
 
3.4%
Close Punctuation 162
 
2.8%
Open Punctuation 162
 
2.8%
Dash Punctuation 26
 
0.5%
Uppercase Letter 18
 
0.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
215
 
6.7%
180
 
5.6%
174
 
5.4%
171
 
5.4%
169
 
5.3%
164
 
5.1%
161
 
5.0%
160
 
5.0%
160
 
5.0%
140
 
4.4%
Other values (158) 1502
47.0%
Decimal Number
ValueCountFrequency (%)
1 255
25.1%
2 159
15.6%
0 132
13.0%
3 116
11.4%
4 79
 
7.8%
5 75
 
7.4%
6 72
 
7.1%
9 56
 
5.5%
8 46
 
4.5%
7 26
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
B 5
27.8%
L 4
22.2%
G 4
22.2%
J 2
 
11.1%
O 2
 
11.1%
A 1
 
5.6%
Space Separator
ValueCountFrequency (%)
922
100.0%
Other Punctuation
ValueCountFrequency (%)
, 192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 162
100.0%
Open Punctuation
ValueCountFrequency (%)
( 162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3196
56.1%
Common 2481
43.6%
Latin 18
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
215
 
6.7%
180
 
5.6%
174
 
5.4%
171
 
5.4%
169
 
5.3%
164
 
5.1%
161
 
5.0%
160
 
5.0%
160
 
5.0%
140
 
4.4%
Other values (158) 1502
47.0%
Common
ValueCountFrequency (%)
922
37.2%
1 255
 
10.3%
, 192
 
7.7%
) 162
 
6.5%
( 162
 
6.5%
2 159
 
6.4%
0 132
 
5.3%
3 116
 
4.7%
4 79
 
3.2%
5 75
 
3.0%
Other values (6) 227
 
9.1%
Latin
ValueCountFrequency (%)
B 5
27.8%
L 4
22.2%
G 4
22.2%
J 2
 
11.1%
O 2
 
11.1%
A 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3196
56.1%
ASCII 2499
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
922
36.9%
1 255
 
10.2%
, 192
 
7.7%
) 162
 
6.5%
( 162
 
6.5%
2 159
 
6.4%
0 132
 
5.3%
3 116
 
4.6%
4 79
 
3.2%
5 75
 
3.0%
Other values (12) 245
 
9.8%
Hangul
ValueCountFrequency (%)
215
 
6.7%
180
 
5.6%
174
 
5.4%
171
 
5.4%
169
 
5.3%
164
 
5.1%
161
 
5.0%
160
 
5.0%
160
 
5.0%
140
 
4.4%
Other values (158) 1502
47.0%

전화번호
Text

MISSING 

Distinct103
Distinct (%)91.2%
Missing47
Missing (%)29.4%
Memory size1.4 KiB
2023-12-11T01:26:00.407017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035398
Min length9

Characters and Unicode

Total characters1360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)82.3%

Sample

1st row051-620-4355
2nd row051-620-1325
3rd row051-645-9801
4th row051-625-7373
5th row051-631-8101
ValueCountFrequency (%)
051-623-7733 2
 
1.8%
051-611-3951 2
 
1.8%
051-626-0777 2
 
1.8%
051-631-9907 2
 
1.8%
051-646-2392 2
 
1.8%
051-624-4620 2
 
1.8%
051-702-1113 2
 
1.8%
051-248-3699 2
 
1.8%
051-624-8898 2
 
1.8%
051-244-1251 2
 
1.8%
Other values (93) 93
82.3%
2023-12-11T01:26:00.867154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 225
16.5%
1 204
15.0%
0 189
13.9%
5 151
11.1%
6 149
11.0%
2 104
7.6%
3 82
 
6.0%
4 73
 
5.4%
8 67
 
4.9%
7 63
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1135
83.5%
Dash Punctuation 225
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 204
18.0%
0 189
16.7%
5 151
13.3%
6 149
13.1%
2 104
9.2%
3 82
7.2%
4 73
 
6.4%
8 67
 
5.9%
7 63
 
5.6%
9 53
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 225
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 225
16.5%
1 204
15.0%
0 189
13.9%
5 151
11.1%
6 149
11.0%
2 104
7.6%
3 82
 
6.0%
4 73
 
5.4%
8 67
 
4.9%
7 63
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 225
16.5%
1 204
15.0%
0 189
13.9%
5 151
11.1%
6 149
11.0%
2 104
7.6%
3 82
 
6.0%
4 73
 
5.4%
8 67
 
4.9%
7 63
 
4.6%

Interactions

2023-12-11T01:25:57.114452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:26:01.010753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업종
순번1.0000.984
업종0.9841.000
2023-12-11T01:26:01.107439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업종
순번1.0000.867
업종0.8671.000

Missing values

2023-12-11T01:25:57.274180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:25:57.393774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번업종상호명소재지전화번호
01출판사경성대학교출판부부산광역시 남구 수영로 309 (대연동)051-620-4355
12출판사부경대학교 출판부부산광역시 남구 용소로 45 (대연동)051-620-1325
23출판사도서출판 에이맨부산광역시 남구 수영로 25 (문현동)051-645-9801
34출판사아베마리아출판사부산광역시 남구 장고개로16번길 13 (우암동)<NA>
45출판사폰테고 출판사부산광역시 남구 조각공원로 42, 503호 (대연동, 우성)051-625-7373
56출판사기러기문화원부산광역시 남구 수영로39번길 35, 코리아학원 (문현동)<NA>
67출판사(재)한국경제정책연구원부산광역시 남구 자성로 152, 1611호 (문현동, 한일오피스텔)051-631-8101
78출판사도서출판 논문의집부산광역시 남구 용소로 42, 1층 (대연동, 필통노래연습장)051-626-8047
89출판사도서출판글초롱부산광역시 남구 수영로334번길 4 (대연동)051-623-7733
910출판사만나출판사부산광역시 남구 동명로164번길 28-1 (용호동)051-622-8536
순번업종상호명소재지전화번호
150151인쇄사(주)디알부산광역시 남구 황령대로319번가길 159 (대연동)051-624-4620
151152인쇄사(주)한명전산부산광역시 남구 신선로356번길 21 (용당동)051-524-6800
152153인쇄사현대북스부산광역시 남구 진남로 82 (대연동)051-244-1251
153154인쇄사동지프린텍부산광역시 남구 양지골로 79 (감만동)051-624-6211
154155인쇄사한길기획부산광역시 남구 황령대로 355-13 (대연동)051-624-8898
155156인쇄사PDJ MEDIA(피디제이미디어)부산광역시 남구 신선로 365, 부경대학교용당캠퍼스 407호 (용당동)051-702-1113
156157인쇄사플랑부산광역시 남구 전포대로110번길 5, 금융센터 디온플레이스 1610호 (문현동)051-631-9907
157158인쇄사디자인통두손컴부설연구소부산광역시 남구 용소로46번길 7, 청년창조발전소 고고씽JOB 2층 (대연동)<NA>
158159인쇄사주식회사 유니온키드부산광역시 남구 용소로46번길 7, 청년창조발전소 지하1층 (대연동)070-8676-7196
159160인쇄사주식회사 재원부산광역시 남구 용소로46번길 4, 아이노스오피스텔 102호 (대연동)051-248-3699