Overview

Dataset statistics

Number of variables5
Number of observations75
Missing cells47
Missing cells (%)12.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory42.8 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description부산광역시기장군_출판사및인쇄사현황_20210303
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15077536

Alerts

시설면적(㎡) has 47 (62.7%) missing valuesMissing
신고(등록)번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:47:52.499541
Analysis finished2023-12-10 16:47:53.113379
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct75
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-11T01:47:53.308521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length17
Mean length17
Min length17

Characters and Unicode

Total characters1275
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)100.0%

Sample

1st row25100-2009-000001
2nd row25100-2009-000002
3rd row25100-2010-000001
4th row25100-2010-000002
5th row25100-2010-000003
ValueCountFrequency (%)
25100-2009-000001 1
 
1.3%
25100-2018-000002 1
 
1.3%
25200-1997-000001 1
 
1.3%
25200-1995-000001 1
 
1.3%
25200-1993-000001 1
 
1.3%
25200-1987-000001 1
 
1.3%
25100-2020-000002 1
 
1.3%
25100-2020-000001 1
 
1.3%
25100-2019-000007 1
 
1.3%
25100-2019-000006 1
 
1.3%
Other values (65) 65
86.7%
2023-12-11T01:47:53.687440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 608
47.7%
2 188
 
14.7%
1 151
 
11.8%
- 150
 
11.8%
5 84
 
6.6%
4 21
 
1.6%
9 19
 
1.5%
3 19
 
1.5%
7 19
 
1.5%
8 8
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1125
88.2%
Dash Punctuation 150
 
11.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 608
54.0%
2 188
 
16.7%
1 151
 
13.4%
5 84
 
7.5%
4 21
 
1.9%
9 19
 
1.7%
3 19
 
1.7%
7 19
 
1.7%
8 8
 
0.7%
6 8
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 150
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1275
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 608
47.7%
2 188
 
14.7%
1 151
 
11.8%
- 150
 
11.8%
5 84
 
6.6%
4 21
 
1.6%
9 19
 
1.5%
3 19
 
1.5%
7 19
 
1.5%
8 8
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1275
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 608
47.7%
2 188
 
14.7%
1 151
 
11.8%
- 150
 
11.8%
5 84
 
6.6%
4 21
 
1.6%
9 19
 
1.5%
3 19
 
1.5%
7 19
 
1.5%
8 8
 
0.6%
Distinct71
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-11T01:47:53.968188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length6.44
Min length3

Characters and Unicode

Total characters483
Distinct characters198
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)89.3%

Sample

1st row놀이속의 세상
2nd row영상교육
3rd row주식회사천북
4th row수다과학연구소
5th row경호무술출판사
ValueCountFrequency (%)
도서출판 9
 
9.2%
사인몰 2
 
2.0%
부산기장협동조합 2
 
2.0%
나라테크 2
 
2.0%
현대이앤지 2
 
2.0%
한국심리과학연구소 1
 
1.0%
빨간집(red 1
 
1.0%
정관타임스live 1
 
1.0%
글고은 1
 
1.0%
오소호필름 1
 
1.0%
Other values (76) 76
77.6%
2023-12-11T01:47:54.467804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
4.8%
19
 
3.9%
13
 
2.7%
13
 
2.7%
11
 
2.3%
11
 
2.3%
11
 
2.3%
10
 
2.1%
10
 
2.1%
10
 
2.1%
Other values (188) 352
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 394
81.6%
Uppercase Letter 26
 
5.4%
Lowercase Letter 25
 
5.2%
Space Separator 23
 
4.8%
Close Punctuation 6
 
1.2%
Open Punctuation 6
 
1.2%
Other Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
4.8%
13
 
3.3%
13
 
3.3%
11
 
2.8%
11
 
2.8%
11
 
2.8%
10
 
2.5%
10
 
2.5%
10
 
2.5%
7
 
1.8%
Other values (154) 279
70.8%
Uppercase Letter
ValueCountFrequency (%)
O 4
15.4%
P 2
 
7.7%
D 2
 
7.7%
R 2
 
7.7%
S 2
 
7.7%
J 2
 
7.7%
M 2
 
7.7%
Y 1
 
3.8%
I 1
 
3.8%
G 1
 
3.8%
Other values (7) 7
26.9%
Lowercase Letter
ValueCountFrequency (%)
o 5
20.0%
e 4
16.0%
b 3
12.0%
s 3
12.0%
i 2
 
8.0%
k 2
 
8.0%
u 1
 
4.0%
d 1
 
4.0%
v 1
 
4.0%
l 1
 
4.0%
Other values (2) 2
 
8.0%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
· 1
33.3%
Space Separator
ValueCountFrequency (%)
23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 394
81.6%
Latin 51
 
10.6%
Common 38
 
7.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
4.8%
13
 
3.3%
13
 
3.3%
11
 
2.8%
11
 
2.8%
11
 
2.8%
10
 
2.5%
10
 
2.5%
10
 
2.5%
7
 
1.8%
Other values (154) 279
70.8%
Latin
ValueCountFrequency (%)
o 5
 
9.8%
O 4
 
7.8%
e 4
 
7.8%
b 3
 
5.9%
s 3
 
5.9%
P 2
 
3.9%
D 2
 
3.9%
R 2
 
3.9%
i 2
 
3.9%
S 2
 
3.9%
Other values (19) 22
43.1%
Common
ValueCountFrequency (%)
23
60.5%
) 6
 
15.8%
( 6
 
15.8%
. 2
 
5.3%
· 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 394
81.6%
ASCII 88
 
18.2%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23
26.1%
) 6
 
6.8%
( 6
 
6.8%
o 5
 
5.7%
O 4
 
4.5%
e 4
 
4.5%
b 3
 
3.4%
s 3
 
3.4%
P 2
 
2.3%
D 2
 
2.3%
Other values (23) 30
34.1%
Hangul
ValueCountFrequency (%)
19
 
4.8%
13
 
3.3%
13
 
3.3%
11
 
2.8%
11
 
2.8%
11
 
2.8%
10
 
2.5%
10
 
2.5%
10
 
2.5%
7
 
1.8%
Other values (154) 279
70.8%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct70
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-11T01:47:54.819687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length44
Mean length30.76
Min length19

Characters and Unicode

Total characters2307
Distinct characters140
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)86.7%

Sample

1st row부산광역시 기장군 기장읍 차성서로101번길 6-3
2nd row부산광역시 기장군 정관읍 달산1길 49, 202호
3rd row부산광역시 기장군 정관면 달음산길 66
4th row부산광역시 기장군 기장읍 차성남로89번길 8
5th row부산광역시 기장군 기장읍 차성동로 140
ValueCountFrequency (%)
부산광역시 75
 
15.5%
기장군 75
 
15.5%
기장읍 31
 
6.4%
정관읍 19
 
3.9%
장안읍 10
 
2.1%
정관면 6
 
1.2%
일광면 6
 
1.2%
정관신도시 4
 
0.8%
기장대로 4
 
0.8%
9 4
 
0.8%
Other values (191) 249
51.6%
2023-12-11T01:47:55.267908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
408
 
17.7%
120
 
5.2%
110
 
4.8%
82
 
3.6%
1 82
 
3.6%
81
 
3.5%
81
 
3.5%
76
 
3.3%
75
 
3.3%
75
 
3.3%
Other values (130) 1117
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1377
59.7%
Decimal Number 414
 
17.9%
Space Separator 408
 
17.7%
Other Punctuation 36
 
1.6%
Close Punctuation 30
 
1.3%
Open Punctuation 30
 
1.3%
Dash Punctuation 10
 
0.4%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
8.7%
110
 
8.0%
82
 
6.0%
81
 
5.9%
81
 
5.9%
76
 
5.5%
75
 
5.4%
75
 
5.4%
62
 
4.5%
60
 
4.4%
Other values (113) 555
40.3%
Decimal Number
ValueCountFrequency (%)
1 82
19.8%
0 69
16.7%
2 59
14.3%
3 52
12.6%
4 33
8.0%
5 27
 
6.5%
8 27
 
6.5%
6 23
 
5.6%
7 22
 
5.3%
9 20
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
H 1
50.0%
Space Separator
ValueCountFrequency (%)
408
100.0%
Other Punctuation
ValueCountFrequency (%)
, 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1377
59.7%
Common 928
40.2%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
8.7%
110
 
8.0%
82
 
6.0%
81
 
5.9%
81
 
5.9%
76
 
5.5%
75
 
5.4%
75
 
5.4%
62
 
4.5%
60
 
4.4%
Other values (113) 555
40.3%
Common
ValueCountFrequency (%)
408
44.0%
1 82
 
8.8%
0 69
 
7.4%
2 59
 
6.4%
3 52
 
5.6%
, 36
 
3.9%
4 33
 
3.6%
) 30
 
3.2%
( 30
 
3.2%
5 27
 
2.9%
Other values (5) 102
 
11.0%
Latin
ValueCountFrequency (%)
L 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1377
59.7%
ASCII 930
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
408
43.9%
1 82
 
8.8%
0 69
 
7.4%
2 59
 
6.3%
3 52
 
5.6%
, 36
 
3.9%
4 33
 
3.5%
) 30
 
3.2%
( 30
 
3.2%
5 27
 
2.9%
Other values (7) 104
 
11.2%
Hangul
ValueCountFrequency (%)
120
 
8.7%
110
 
8.0%
82
 
6.0%
81
 
5.9%
81
 
5.9%
76
 
5.5%
75
 
5.4%
75
 
5.4%
62
 
4.5%
60
 
4.4%
Other values (113) 555
40.3%

업종
Categorical

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
출판사
54 
인쇄사
21 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 54
72.0%
인쇄사 21
 
28.0%

Length

2023-12-11T01:47:55.428982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:47:55.561224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 54
72.0%
인쇄사 21
 
28.0%

시설면적(㎡)
Real number (ℝ)

MISSING 

Distinct26
Distinct (%)92.9%
Missing47
Missing (%)62.7%
Infinite0
Infinite (%)0.0%
Mean71.961071
Minimum20
Maximum209
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size807.0 B
2023-12-11T01:47:55.674816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile21.7
Q138.8375
median59.965
Q385.985
95-th percentile175.8165
Maximum209
Range189
Interquartile range (IQR)47.1475

Descriptive statistics

Standard deviation47.151854
Coefficient of variation (CV)0.65524113
Kurtosis2.6856495
Mean71.961071
Median Absolute Deviation (MAD)24.845
Skewness1.5740146
Sum2014.91
Variance2223.2973
MonotonicityNot monotonic
2023-12-11T01:47:55.821146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
89.0 2
 
2.7%
57.0 2
 
2.7%
25.5 1
 
1.3%
66.76 1
 
1.3%
117.83 1
 
1.3%
60.0 1
 
1.3%
42.12 1
 
1.3%
39.6 1
 
1.3%
209.0 1
 
1.3%
35.0 1
 
1.3%
Other values (16) 16
 
21.3%
(Missing) 47
62.7%
ValueCountFrequency (%)
20.0 1
1.3%
21.0 1
1.3%
23.0 1
1.3%
25.5 1
1.3%
25.8 1
1.3%
35.0 1
1.3%
36.55 1
1.3%
39.6 1
1.3%
42.12 1
1.3%
57.0 2
2.7%
ValueCountFrequency (%)
209.0 1
1.3%
195.63 1
1.3%
139.02 1
1.3%
117.83 1
1.3%
97.2 1
1.3%
89.0 2
2.7%
84.98 1
1.3%
84.69 1
1.3%
84.53 1
1.3%
69.39 1
1.3%

Interactions

2023-12-11T01:47:52.815809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:47:55.921215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고(등록)번호사업체명칭사업체소재지(도로명)업종시설면적(㎡)
신고(등록)번호1.0001.0001.0001.0001.000
사업체명칭1.0001.0001.0000.0001.000
사업체소재지(도로명)1.0001.0001.0000.0001.000
업종1.0000.0000.0001.0000.000
시설면적(㎡)1.0001.0001.0000.0001.000
2023-12-11T01:47:56.039244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설면적(㎡)업종
시설면적(㎡)1.0000.000
업종0.0001.000

Missing values

2023-12-11T01:47:52.941730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:47:53.062176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

신고(등록)번호사업체명칭사업체소재지(도로명)업종시설면적(㎡)
025100-2009-000001놀이속의 세상부산광역시 기장군 기장읍 차성서로101번길 6-3출판사<NA>
125100-2009-000002영상교육부산광역시 기장군 정관읍 달산1길 49, 202호출판사<NA>
225100-2010-000001주식회사천북부산광역시 기장군 정관면 달음산길 66출판사66.0
325100-2010-000002수다과학연구소부산광역시 기장군 기장읍 차성남로89번길 8출판사20.0
425100-2010-000003경호무술출판사부산광역시 기장군 기장읍 차성동로 140출판사195.63
525100-2011-000001사람들부산광역시 기장군 정관면 정관2로 40출판사84.98
625100-2011-000003장원차문화교류회부산광역시 기장군 기장읍 기장대로 563출판사84.69
725100-2011-000004해광식품 주식회사부산광역시 기장군 기장읍 청강로43번길 36출판사<NA>
825100-2012-000002도서출판 마루부산광역시 기장군 기장읍 동암1길 17-4출판사<NA>
925100-2012-000004가이오부산광역시 기장군 기장읍 차성로441번길 7출판사<NA>
신고(등록)번호사업체명칭사업체소재지(도로명)업종시설면적(㎡)
6525200-2014-000002계림인쇄부산광역시 기장군 장안읍 좌천2길 45인쇄사39.6
6625200-2014-000003문일문화사부산광역시 기장군 기장읍 차성남로 104 (은성상가)인쇄사42.12
6725200-2014-000004나라테크부산광역시 기장군 장안읍 길천길 73, 303호인쇄사<NA>
6825200-2015-000001현대이앤지부산광역시 기장군 기장읍 기장대로 506인쇄사57.0
6925200-2015-000002디지털뱅크부산광역시 기장군 기장읍 차성로253번길 8인쇄사60.0
7025200-2015-000003뱅크OA 시스템부산광역시 기장군 정관읍 양수길 22인쇄사117.83
7125200-2020-000001힘찬문서부산광역시 기장군 기장읍 차성동로45번길 7, 2층인쇄사66.76
7225100-2020-000003도서출판 주드부산광역시 기장군 일광면 이화로23, 104호(금강한스빌)출판사<NA>
7325100-2020-000004조이스프링(JOY SPRING)부산광역시 기장군 정관읍 방곡3로 11, 304동 203로(서희스타힐스)출판사<NA>
7425100-2021-000002도서출판 참놀부산광역시 기장군 기장읍 대청로 13, 나동 304호(삼홍하이츠)출판사<NA>