Overview

Dataset statistics

Number of variables4
Number of observations391
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.7 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Text1
Categorical2

Dataset

Description부산광역시부산진구출판사및인쇄사현황_20230822
Author부산광역시 부산진구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025579

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:33:46.679298
Analysis finished2023-12-10 17:33:47.988923
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct391
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196
Minimum1
Maximum391
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-11T02:33:48.159860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.5
Q198.5
median196
Q3293.5
95-th percentile371.5
Maximum391
Range390
Interquartile range (IQR)195

Descriptive statistics

Standard deviation113.01622
Coefficient of variation (CV)0.57661338
Kurtosis-1.2
Mean196
Median Absolute Deviation (MAD)98
Skewness0
Sum76636
Variance12772.667
MonotonicityStrictly increasing
2023-12-11T02:33:48.516842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
270 1
 
0.3%
268 1
 
0.3%
267 1
 
0.3%
266 1
 
0.3%
265 1
 
0.3%
264 1
 
0.3%
263 1
 
0.3%
262 1
 
0.3%
261 1
 
0.3%
Other values (381) 381
97.4%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
391 1
0.3%
390 1
0.3%
389 1
0.3%
388 1
0.3%
387 1
0.3%
386 1
0.3%
385 1
0.3%
384 1
0.3%
383 1
0.3%
382 1
0.3%

명칭
Text

Distinct346
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-11T02:33:49.008815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length26
Mean length7.859335
Min length2

Characters and Unicode

Total characters3073
Distinct characters393
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique301 ?
Unique (%)77.0%

Sample

1st row도서출판 보훈사
2nd row문화사
3rd row동의과학대학교출판부
4th row도서출판계림
5th row도서출판 성문
ValueCountFrequency (%)
무점포 55
 
10.2%
도서출판 31
 
5.7%
주식회사 11
 
2.0%
디자인 4
 
0.7%
출판사(1인 3
 
0.6%
레브드디자인 2
 
0.4%
디자인글꼴 2
 
0.4%
주)참이즈 2
 
0.4%
디자인제로 2
 
0.4%
대훈기획 2
 
0.4%
Other values (381) 427
78.9%
2023-12-11T02:33:50.376084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
168
 
5.5%
150
 
4.9%
) 128
 
4.2%
( 128
 
4.2%
87
 
2.8%
80
 
2.6%
78
 
2.5%
1 77
 
2.5%
75
 
2.4%
66
 
2.1%
Other values (383) 2036
66.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2357
76.7%
Space Separator 150
 
4.9%
Close Punctuation 130
 
4.2%
Open Punctuation 130
 
4.2%
Uppercase Letter 111
 
3.6%
Lowercase Letter 101
 
3.3%
Decimal Number 85
 
2.8%
Other Punctuation 7
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
168
 
7.1%
87
 
3.7%
80
 
3.4%
78
 
3.3%
75
 
3.2%
66
 
2.8%
58
 
2.5%
54
 
2.3%
47
 
2.0%
45
 
1.9%
Other values (323) 1599
67.8%
Lowercase Letter
ValueCountFrequency (%)
n 14
13.9%
e 14
13.9%
o 12
11.9%
a 11
10.9%
r 6
 
5.9%
t 6
 
5.9%
i 5
 
5.0%
u 5
 
5.0%
m 4
 
4.0%
d 3
 
3.0%
Other values (12) 21
20.8%
Uppercase Letter
ValueCountFrequency (%)
C 12
 
10.8%
A 10
 
9.0%
S 9
 
8.1%
E 9
 
8.1%
P 9
 
8.1%
D 7
 
6.3%
T 7
 
6.3%
O 6
 
5.4%
R 6
 
5.4%
B 5
 
4.5%
Other values (12) 31
27.9%
Decimal Number
ValueCountFrequency (%)
1 77
90.6%
4 2
 
2.4%
2 2
 
2.4%
3 1
 
1.2%
7 1
 
1.2%
8 1
 
1.2%
0 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 3
42.9%
& 3
42.9%
, 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 128
98.5%
] 2
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 128
98.5%
[ 2
 
1.5%
Space Separator
ValueCountFrequency (%)
150
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2349
76.4%
Common 504
 
16.4%
Latin 212
 
6.9%
Han 8
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
168
 
7.2%
87
 
3.7%
80
 
3.4%
78
 
3.3%
75
 
3.2%
66
 
2.8%
58
 
2.5%
54
 
2.3%
47
 
2.0%
45
 
1.9%
Other values (315) 1591
67.7%
Latin
ValueCountFrequency (%)
n 14
 
6.6%
e 14
 
6.6%
o 12
 
5.7%
C 12
 
5.7%
a 11
 
5.2%
A 10
 
4.7%
S 9
 
4.2%
E 9
 
4.2%
P 9
 
4.2%
D 7
 
3.3%
Other values (34) 105
49.5%
Common
ValueCountFrequency (%)
150
29.8%
) 128
25.4%
( 128
25.4%
1 77
15.3%
. 3
 
0.6%
& 3
 
0.6%
- 2
 
0.4%
] 2
 
0.4%
[ 2
 
0.4%
4 2
 
0.4%
Other values (6) 7
 
1.4%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2349
76.4%
ASCII 716
 
23.3%
CJK 8
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
168
 
7.2%
87
 
3.7%
80
 
3.4%
78
 
3.3%
75
 
3.2%
66
 
2.8%
58
 
2.5%
54
 
2.3%
47
 
2.0%
45
 
1.9%
Other values (315) 1591
67.7%
ASCII
ValueCountFrequency (%)
150
20.9%
) 128
17.9%
( 128
17.9%
1 77
10.8%
n 14
 
2.0%
e 14
 
2.0%
o 12
 
1.7%
C 12
 
1.7%
a 11
 
1.5%
A 10
 
1.4%
Other values (50) 160
22.3%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

소재지
Categorical

Distinct11
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
부산광역시 부산진구 부전동
136 
부산광역시 부산진구 범천동
92 
부산광역시 부산진구 양정동
50 
부산광역시 부산진구 전포동
39 
부산광역시 부산진구 개금동
19 
Other values (6)
55 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 부산진구 범천동
2nd row부산광역시 부산진구 범천동
3rd row부산광역시 부산진구 양정동
4th row부산광역시 부산진구 부전동
5th row부산광역시 부산진구 범천동

Common Values

ValueCountFrequency (%)
부산광역시 부산진구 부전동 136
34.8%
부산광역시 부산진구 범천동 92
23.5%
부산광역시 부산진구 양정동 50
 
12.8%
부산광역시 부산진구 전포동 39
 
10.0%
부산광역시 부산진구 개금동 19
 
4.9%
부산광역시 부산진구 초읍동 12
 
3.1%
부산광역시 부산진구 가야동 12
 
3.1%
부산광역시 부산진구 당감동 12
 
3.1%
부산광역시 부산진구 부암동 10
 
2.6%
부산광역시 부산진구 범전동 5
 
1.3%

Length

2023-12-11T02:33:50.662488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 391
33.3%
부산진구 391
33.3%
부전동 136
 
11.6%
범천동 92
 
7.8%
양정동 50
 
4.3%
전포동 39
 
3.3%
개금동 19
 
1.6%
초읍동 12
 
1.0%
가야동 12
 
1.0%
당감동 12
 
1.0%
Other values (3) 19
 
1.6%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
출판사
249 
인쇄사
142 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 249
63.7%
인쇄사 142
36.3%

Length

2023-12-11T02:33:50.911636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:33:51.130812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 249
63.7%
인쇄사 142
36.3%

Interactions

2023-12-11T02:33:47.310883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:33:51.313678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지업종
연번1.0000.3730.997
소재지0.3731.0000.323
업종0.9970.3231.000
2023-12-11T02:33:51.550061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종소재지
업종1.0000.306
소재지0.3061.000
2023-12-11T02:33:51.742124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지업종
연번1.0000.1630.938
소재지0.1631.0000.306
업종0.9380.3061.000

Missing values

2023-12-11T02:33:47.648139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:33:47.911186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번명칭소재지업종
01도서출판 보훈사부산광역시 부산진구 범천동출판사
12문화사부산광역시 부산진구 범천동출판사
23동의과학대학교출판부부산광역시 부산진구 양정동출판사
34도서출판계림부산광역시 부산진구 부전동출판사
45도서출판 성문부산광역시 부산진구 범천동출판사
56도서출판부다가야부산광역시 부산진구 양정동출판사
67광명인쇄출판사부산광역시 부산진구 전포동출판사
78영신애드부산광역시 부산진구 부전동출판사
89대성인쇄사부산광역시 부산진구 부전동출판사
910도서출판 한일부산광역시 부산진구 부전동출판사
연번명칭소재지업종
381382동아디앤피부산광역시 부산진구 범천동인쇄사
382383(주)디자인거북골부산광역시 부산진구 부전동인쇄사
383384주식회사 부산기획부산광역시 부산진구 범천동인쇄사
384385광명애드부산광역시 부산진구 부전동인쇄사
385386프레스바이부산광역시 부산진구 부전동인쇄사
386387이노디자인부산광역시 부산진구 당감동인쇄사
387388크리콤부산광역시 부산진구 범천동인쇄사
388389광명정판부산광역시 부산진구 부전동인쇄사
389390주식회사 디자인제로부산광역시 부산진구 부전동인쇄사
390391(주)비손부산광역시 부산진구 양정동인쇄사