Overview

Dataset statistics

Number of variables6
Number of observations367
Missing cells11
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.7 KiB
Average record size in memory49.4 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description부산광역시부산진구출판사및인쇄사현황_20201019
Author부산광역시 부산진구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025579

Alerts

관리부서 has constant value ""Constant
기준일자 has constant value ""Constant
연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
사업체소재지(도로명) has 11 (3.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:33:53.849486
Analysis finished2023-12-10 17:33:55.488583
Duration1.64 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct367
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean184
Minimum1
Maximum367
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2023-12-11T02:33:55.724382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.3
Q192.5
median184
Q3275.5
95-th percentile348.7
Maximum367
Range366
Interquartile range (IQR)183

Descriptive statistics

Standard deviation106.08801
Coefficient of variation (CV)0.57656529
Kurtosis-1.2
Mean184
Median Absolute Deviation (MAD)92
Skewness0
Sum67528
Variance11254.667
MonotonicityStrictly increasing
2023-12-11T02:33:56.159512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
243 1
 
0.3%
252 1
 
0.3%
251 1
 
0.3%
250 1
 
0.3%
249 1
 
0.3%
248 1
 
0.3%
247 1
 
0.3%
246 1
 
0.3%
245 1
 
0.3%
Other values (357) 357
97.3%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
367 1
0.3%
366 1
0.3%
365 1
0.3%
364 1
0.3%
363 1
0.3%
362 1
0.3%
361 1
0.3%
360 1
0.3%
359 1
0.3%
358 1
0.3%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
출판사
222 
인쇄사
145 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 222
60.5%
인쇄사 145
39.5%

Length

2023-12-11T02:33:56.585639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:33:56.838865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 222
60.5%
인쇄사 145
39.5%
Distinct325
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-11T02:33:57.437573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length19
Mean length6.9564033
Min length1

Characters and Unicode

Total characters2553
Distinct characters353
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique283 ?
Unique (%)77.1%

Sample

1st row도서출판 보훈사
2nd row문화사
3rd row동의과학대학교출판부
4th row도서출판계림
5th row도서출판 성문
ValueCountFrequency (%)
도서출판 30
 
6.5%
무점포 18
 
3.9%
주식회사 6
 
1.3%
디자인 6
 
1.3%
주)코이노린트 2
 
0.4%
대훈기획 2
 
0.4%
미라클 2
 
0.4%
2
 
0.4%
애드랙디엠굿디자인 2
 
0.4%
오투네트웍스 2
 
0.4%
Other values (350) 390
84.4%
2023-12-11T02:33:58.427799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
132
 
5.2%
95
 
3.7%
( 85
 
3.3%
) 85
 
3.3%
76
 
3.0%
62
 
2.4%
55
 
2.2%
48
 
1.9%
45
 
1.8%
45
 
1.8%
Other values (343) 1825
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2093
82.0%
Space Separator 95
 
3.7%
Open Punctuation 86
 
3.4%
Close Punctuation 86
 
3.4%
Lowercase Letter 75
 
2.9%
Uppercase Letter 61
 
2.4%
Decimal Number 51
 
2.0%
Other Punctuation 4
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
132
 
6.3%
76
 
3.6%
62
 
3.0%
55
 
2.6%
48
 
2.3%
45
 
2.2%
45
 
2.2%
43
 
2.1%
42
 
2.0%
40
 
1.9%
Other values (290) 1505
71.9%
Lowercase Letter
ValueCountFrequency (%)
o 13
17.3%
e 12
16.0%
n 9
12.0%
a 5
 
6.7%
u 4
 
5.3%
k 4
 
5.3%
s 4
 
5.3%
i 3
 
4.0%
b 3
 
4.0%
r 3
 
4.0%
Other values (11) 15
20.0%
Uppercase Letter
ValueCountFrequency (%)
C 9
14.8%
A 7
11.5%
T 6
9.8%
O 5
8.2%
P 5
8.2%
S 5
8.2%
R 5
8.2%
D 4
6.6%
B 4
6.6%
K 2
 
3.3%
Other values (8) 9
14.8%
Decimal Number
ValueCountFrequency (%)
1 42
82.4%
2 3
 
5.9%
0 2
 
3.9%
4 2
 
3.9%
9 1
 
2.0%
8 1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
( 85
98.8%
[ 1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 85
98.8%
] 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
, 1
 
25.0%
Space Separator
ValueCountFrequency (%)
95
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2087
81.7%
Common 324
 
12.7%
Latin 136
 
5.3%
Han 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
132
 
6.3%
76
 
3.6%
62
 
3.0%
55
 
2.6%
48
 
2.3%
45
 
2.2%
45
 
2.2%
43
 
2.1%
42
 
2.0%
40
 
1.9%
Other values (284) 1499
71.8%
Latin
ValueCountFrequency (%)
o 13
 
9.6%
e 12
 
8.8%
C 9
 
6.6%
n 9
 
6.6%
A 7
 
5.1%
T 6
 
4.4%
O 5
 
3.7%
a 5
 
3.7%
P 5
 
3.7%
S 5
 
3.7%
Other values (29) 60
44.1%
Common
ValueCountFrequency (%)
95
29.3%
( 85
26.2%
) 85
26.2%
1 42
13.0%
2 3
 
0.9%
& 3
 
0.9%
0 2
 
0.6%
4 2
 
0.6%
- 2
 
0.6%
] 1
 
0.3%
Other values (4) 4
 
1.2%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2087
81.7%
ASCII 460
 
18.0%
CJK 6
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
132
 
6.3%
76
 
3.6%
62
 
3.0%
55
 
2.6%
48
 
2.3%
45
 
2.2%
45
 
2.2%
43
 
2.1%
42
 
2.0%
40
 
1.9%
Other values (284) 1499
71.8%
ASCII
ValueCountFrequency (%)
95
20.7%
( 85
18.5%
) 85
18.5%
1 42
9.1%
o 13
 
2.8%
e 12
 
2.6%
C 9
 
2.0%
n 9
 
2.0%
A 7
 
1.5%
T 6
 
1.3%
Other values (43) 97
21.1%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct287
Distinct (%)80.6%
Missing11
Missing (%)3.0%
Memory size3.0 KiB
2023-12-11T02:33:58.995765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length48
Mean length31.404494
Min length22

Characters and Unicode

Total characters11180
Distinct characters220
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)64.9%

Sample

1st row부산광역시 부산진구 양지로 54 (양정동)
2nd row부산광역시 부산진구 부전로 15 (부전동)
3rd row부산광역시 부산진구 신천대로 87, 1층 (범천동)
4th row부산광역시 부산진구 중앙대로 940 (양정동)
5th row부산광역시 부산진구 동성로 143 (전포동)
ValueCountFrequency (%)
부산광역시 356
 
17.1%
부산진구 356
 
17.1%
부전동 111
 
5.3%
범천동 83
 
4.0%
부전로 64
 
3.1%
양정동 48
 
2.3%
전포동 33
 
1.6%
중앙대로 23
 
1.1%
3층 22
 
1.1%
신천대로65번길 22
 
1.1%
Other values (446) 958
46.1%
2023-12-11T02:34:00.014370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1817
 
16.3%
915
 
8.2%
721
 
6.4%
438
 
3.9%
1 379
 
3.4%
368
 
3.3%
363
 
3.2%
361
 
3.2%
) 359
 
3.2%
( 359
 
3.2%
Other values (210) 5100
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6623
59.2%
Space Separator 1817
 
16.3%
Decimal Number 1671
 
14.9%
Close Punctuation 359
 
3.2%
Open Punctuation 359
 
3.2%
Other Punctuation 242
 
2.2%
Dash Punctuation 92
 
0.8%
Uppercase Letter 17
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
915
13.8%
721
 
10.9%
438
 
6.6%
368
 
5.6%
363
 
5.5%
361
 
5.5%
358
 
5.4%
356
 
5.4%
356
 
5.4%
247
 
3.7%
Other values (182) 2140
32.3%
Uppercase Letter
ValueCountFrequency (%)
L 3
17.6%
A 2
11.8%
D 2
11.8%
T 2
11.8%
O 2
11.8%
H 1
 
5.9%
M 1
 
5.9%
I 1
 
5.9%
V 1
 
5.9%
S 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 379
22.7%
5 204
12.2%
2 196
11.7%
0 185
11.1%
3 167
10.0%
6 161
9.6%
4 123
 
7.4%
7 96
 
5.7%
8 83
 
5.0%
9 77
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 240
99.2%
· 1
 
0.4%
/ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
1817
100.0%
Close Punctuation
ValueCountFrequency (%)
) 359
100.0%
Open Punctuation
ValueCountFrequency (%)
( 359
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6623
59.2%
Common 4540
40.6%
Latin 17
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
915
13.8%
721
 
10.9%
438
 
6.6%
368
 
5.6%
363
 
5.5%
361
 
5.5%
358
 
5.4%
356
 
5.4%
356
 
5.4%
247
 
3.7%
Other values (182) 2140
32.3%
Common
ValueCountFrequency (%)
1817
40.0%
1 379
 
8.3%
) 359
 
7.9%
( 359
 
7.9%
, 240
 
5.3%
5 204
 
4.5%
2 196
 
4.3%
0 185
 
4.1%
3 167
 
3.7%
6 161
 
3.5%
Other values (7) 473
 
10.4%
Latin
ValueCountFrequency (%)
L 3
17.6%
A 2
11.8%
D 2
11.8%
T 2
11.8%
O 2
11.8%
H 1
 
5.9%
M 1
 
5.9%
I 1
 
5.9%
V 1
 
5.9%
S 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6623
59.2%
ASCII 4556
40.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1817
39.9%
1 379
 
8.3%
) 359
 
7.9%
( 359
 
7.9%
, 240
 
5.3%
5 204
 
4.5%
2 196
 
4.3%
0 185
 
4.1%
3 167
 
3.7%
6 161
 
3.5%
Other values (17) 489
 
10.7%
Hangul
ValueCountFrequency (%)
915
13.8%
721
 
10.9%
438
 
6.6%
368
 
5.6%
363
 
5.5%
361
 
5.5%
358
 
5.4%
356
 
5.4%
356
 
5.4%
247
 
3.7%
Other values (182) 2140
32.3%
None
ValueCountFrequency (%)
· 1
100.0%

관리부서
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
부산광역시 부산진구
367 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 부산진구
2nd row부산광역시 부산진구
3rd row부산광역시 부산진구
4th row부산광역시 부산진구
5th row부산광역시 부산진구

Common Values

ValueCountFrequency (%)
부산광역시 부산진구 367
100.0%

Length

2023-12-11T02:34:00.339324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:34:00.582785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 367
50.0%
부산진구 367
50.0%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2020-10-19 00:00:00
Maximum2020-10-19 00:00:00
2023-12-11T02:34:00.831573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:34:01.128487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-11T02:33:54.633275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:34:01.324264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0001.000
업종1.0001.000
2023-12-11T02:34:01.542975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.978
업종0.9781.000

Missing values

2023-12-11T02:33:55.009106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:33:55.364484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종사업체명칭사업체소재지(도로명)관리부서기준일자
01출판사도서출판 보훈사<NA>부산광역시 부산진구2020-10-19
12출판사문화사<NA>부산광역시 부산진구2020-10-19
23출판사동의과학대학교출판부부산광역시 부산진구 양지로 54 (양정동)부산광역시 부산진구2020-10-19
34출판사도서출판계림부산광역시 부산진구 부전로 15 (부전동)부산광역시 부산진구2020-10-19
45출판사도서출판 성문부산광역시 부산진구 신천대로 87, 1층 (범천동)부산광역시 부산진구2020-10-19
56출판사도서출판부다가야부산광역시 부산진구 중앙대로 940 (양정동)부산광역시 부산진구2020-10-19
67출판사광명인쇄출판사부산광역시 부산진구 동성로 143 (전포동)부산광역시 부산진구2020-10-19
78출판사영신애드부산광역시 부산진구 부전로 59-1 (부전동)부산광역시 부산진구2020-10-19
89출판사대성인쇄사부산광역시 부산진구 서면문화로53번길 15-6 (부전동)부산광역시 부산진구2020-10-19
910출판사도서출판 한일부산광역시 부산진구 중앙대로755번길 10 (부전동)부산광역시 부산진구2020-10-19
연번업종사업체명칭사업체소재지(도로명)관리부서기준일자
357358인쇄사태경인쇄부산광역시 부산진구 신천대로65번길 26 (범천동)부산광역시 부산진구2020-10-19
358359인쇄사레브드디자인부산광역시 부산진구 양지로 28, 3층 (양정동)부산광역시 부산진구2020-10-19
359360인쇄사Ace문화사부산광역시 부산진구 신천대로71번길 33, 1층 (범천동)부산광역시 부산진구2020-10-19
360361인쇄사부산기획부산광역시 부산진구 중앙대로635번길 9 (범천동)부산광역시 부산진구2020-10-19
361362인쇄사에비션(evition)부산광역시 부산진구 진남로 525, 유정빌딩 5층 (양정동)부산광역시 부산진구2020-10-19
362363인쇄사동아디앤피부산광역시 부산진구 신천대로65번길 37, 3층 (범천동)부산광역시 부산진구2020-10-19
363364인쇄사(주)디자인거북골부산광역시 부산진구 부전로 5-1, 402호 (부전동)부산광역시 부산진구2020-10-19
364365인쇄사주식회사 부산기획부산광역시 부산진구 중앙대로635번길 9, 3층 (범천동)부산광역시 부산진구2020-10-19
365366인쇄사광명애드부산광역시 부산진구 신천대로78번길 8 (부전동)부산광역시 부산진구2020-10-19
366367인쇄사프레스바이부산광역시 부산진구 서면로 5, 1층 (부전동)부산광역시 부산진구2020-10-19