Overview

Dataset statistics

Number of variables6
Number of observations69
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory50.9 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description부산광역시 영도구 소재의 출판업소 및 인쇄업소의 현황(연번, 업체명, 도로명주소 등)등의 데이터를 제공하고 있습니다.
Author부산광역시 영도구
URLhttps://www.data.go.kr/data/3069124/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
업종 is highly imbalanced (57.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 16:12:53.074460
Analysis finished2024-03-14 16:12:54.388384
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct69
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35
Minimum1
Maximum69
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size749.0 B
2024-03-15T01:12:54.518725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.4
Q118
median35
Q352
95-th percentile65.6
Maximum69
Range68
Interquartile range (IQR)34

Descriptive statistics

Standard deviation20.062403
Coefficient of variation (CV)0.5732115
Kurtosis-1.2
Mean35
Median Absolute Deviation (MAD)17
Skewness0
Sum2415
Variance402.5
MonotonicityStrictly increasing
2024-03-15T01:12:54.963753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
45 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
46 1
 
1.4%
44 1
 
1.4%
53 1
 
1.4%
Other values (59) 59
85.5%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%
61 1
1.4%
60 1
1.4%

업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size680.0 B
출판사
63 
인쇄사
 
6

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 63
91.3%
인쇄사 6
 
8.7%

Length

2024-03-15T01:12:55.394822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:12:55.712285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 63
91.3%
인쇄사 6
 
8.7%
Distinct66
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size680.0 B
2024-03-15T01:12:56.744192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length7.2608696
Min length2

Characters and Unicode

Total characters501
Distinct characters198
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)91.3%

Sample

1st row국제해양문제연구소
2nd row고신대학교출판부
3rd row한국해양대학교출판부
4th row창경사
5th rowCL&D
ValueCountFrequency (%)
도서출판 4
 
4.6%
주)애드원플러스 2
 
2.3%
주)이도시스템 2
 
2.3%
동원문화사i 2
 
2.3%
초아주역연구원 1
 
1.1%
한국해양문학연구소 1
 
1.1%
영도 1
 
1.1%
책의 1
 
1.1%
범강 1
 
1.1%
고래섬 1
 
1.1%
Other values (71) 71
81.6%
2024-03-15T01:12:58.064691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
3.6%
14
 
2.8%
14
 
2.8%
14
 
2.8%
( 12
 
2.4%
) 12
 
2.4%
12
 
2.4%
11
 
2.2%
11
 
2.2%
10
 
2.0%
Other values (188) 373
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 402
80.2%
Uppercase Letter 30
 
6.0%
Lowercase Letter 23
 
4.6%
Space Separator 18
 
3.6%
Open Punctuation 12
 
2.4%
Close Punctuation 12
 
2.4%
Other Punctuation 3
 
0.6%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
3.5%
14
 
3.5%
14
 
3.5%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
9
 
2.2%
8
 
2.0%
Other values (151) 289
71.9%
Lowercase Letter
ValueCountFrequency (%)
i 3
13.0%
e 3
13.0%
r 2
 
8.7%
o 2
 
8.7%
d 2
 
8.7%
q 1
 
4.3%
u 1
 
4.3%
s 1
 
4.3%
n 1
 
4.3%
h 1
 
4.3%
Other values (6) 6
26.1%
Uppercase Letter
ValueCountFrequency (%)
M 4
13.3%
C 4
13.3%
A 3
10.0%
B 3
10.0%
E 3
10.0%
O 2
 
6.7%
T 2
 
6.7%
L 2
 
6.7%
P 1
 
3.3%
I 1
 
3.3%
Other values (5) 5
16.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 402
80.2%
Latin 53
 
10.6%
Common 46
 
9.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
3.5%
14
 
3.5%
14
 
3.5%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
9
 
2.2%
8
 
2.0%
Other values (151) 289
71.9%
Latin
ValueCountFrequency (%)
M 4
 
7.5%
C 4
 
7.5%
i 3
 
5.7%
e 3
 
5.7%
A 3
 
5.7%
B 3
 
5.7%
E 3
 
5.7%
r 2
 
3.8%
O 2
 
3.8%
T 2
 
3.8%
Other values (21) 24
45.3%
Common
ValueCountFrequency (%)
18
39.1%
( 12
26.1%
) 12
26.1%
. 2
 
4.3%
& 1
 
2.2%
- 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 402
80.2%
ASCII 99
 
19.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18
18.2%
( 12
 
12.1%
) 12
 
12.1%
M 4
 
4.0%
C 4
 
4.0%
i 3
 
3.0%
e 3
 
3.0%
A 3
 
3.0%
B 3
 
3.0%
E 3
 
3.0%
Other values (27) 34
34.3%
Hangul
ValueCountFrequency (%)
14
 
3.5%
14
 
3.5%
14
 
3.5%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
9
 
2.2%
8
 
2.0%
Other values (151) 289
71.9%
Distinct61
Distinct (%)88.4%
Missing0
Missing (%)0.0%
Memory size680.0 B
2024-03-15T01:12:59.428122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length46
Mean length33.869565
Min length21

Characters and Unicode

Total characters2337
Distinct characters134
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)78.3%

Sample

1st row부산광역시 영도구 태종로 727 (동삼동)
2nd row부산광역시 영도구 와치로 194 (동삼동)
3rd row부산광역시 영도구 태종로 727 (동삼동)
4th row부산광역시 영도구 태종로 594 (동삼동)
5th row부산광역시 영도구 와치로 194 (동삼동)
ValueCountFrequency (%)
부산광역시 69
 
15.0%
영도구 69
 
15.0%
동삼동 40
 
8.7%
태종로 18
 
3.9%
청학동 11
 
2.4%
조내기로 6
 
1.3%
봉래동2가 5
 
1.1%
103동 5
 
1.1%
동삼서로 5
 
1.1%
101동 5
 
1.1%
Other values (155) 227
49.3%
2024-03-15T01:13:01.589820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
395
 
16.9%
146
 
6.2%
1 91
 
3.9%
84
 
3.6%
77
 
3.3%
) 71
 
3.0%
71
 
3.0%
( 71
 
3.0%
70
 
3.0%
69
 
3.0%
Other values (124) 1192
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1328
56.8%
Space Separator 395
 
16.9%
Decimal Number 378
 
16.2%
Close Punctuation 71
 
3.0%
Open Punctuation 71
 
3.0%
Other Punctuation 69
 
3.0%
Lowercase Letter 14
 
0.6%
Dash Punctuation 6
 
0.3%
Uppercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
11.0%
84
 
6.3%
77
 
5.8%
71
 
5.3%
70
 
5.3%
69
 
5.2%
69
 
5.2%
69
 
5.2%
69
 
5.2%
65
 
4.9%
Other values (94) 539
40.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
21.4%
o 2
14.3%
a 1
 
7.1%
i 1
 
7.1%
d 1
 
7.1%
m 1
 
7.1%
p 1
 
7.1%
l 1
 
7.1%
x 1
 
7.1%
w 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 91
24.1%
0 60
15.9%
2 46
12.2%
3 42
11.1%
4 33
 
8.7%
5 31
 
8.2%
6 30
 
7.9%
7 22
 
5.8%
9 14
 
3.7%
8 9
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
S 2
40.0%
C 1
20.0%
B 1
20.0%
T 1
20.0%
Space Separator
ValueCountFrequency (%)
395
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Other Punctuation
ValueCountFrequency (%)
, 69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1328
56.8%
Common 990
42.4%
Latin 19
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
11.0%
84
 
6.3%
77
 
5.8%
71
 
5.3%
70
 
5.3%
69
 
5.2%
69
 
5.2%
69
 
5.2%
69
 
5.2%
65
 
4.9%
Other values (94) 539
40.6%
Common
ValueCountFrequency (%)
395
39.9%
1 91
 
9.2%
) 71
 
7.2%
( 71
 
7.2%
, 69
 
7.0%
0 60
 
6.1%
2 46
 
4.6%
3 42
 
4.2%
4 33
 
3.3%
5 31
 
3.1%
Other values (5) 81
 
8.2%
Latin
ValueCountFrequency (%)
e 3
15.8%
o 2
 
10.5%
S 2
 
10.5%
C 1
 
5.3%
a 1
 
5.3%
i 1
 
5.3%
d 1
 
5.3%
B 1
 
5.3%
m 1
 
5.3%
p 1
 
5.3%
Other values (5) 5
26.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1328
56.8%
ASCII 1009
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
395
39.1%
1 91
 
9.0%
) 71
 
7.0%
( 71
 
7.0%
, 69
 
6.8%
0 60
 
5.9%
2 46
 
4.6%
3 42
 
4.2%
4 33
 
3.3%
5 31
 
3.1%
Other values (20) 100
 
9.9%
Hangul
ValueCountFrequency (%)
146
 
11.0%
84
 
6.3%
77
 
5.8%
71
 
5.3%
70
 
5.3%
69
 
5.2%
69
 
5.2%
69
 
5.2%
69
 
5.2%
65
 
4.9%
Other values (94) 539
40.6%
Distinct62
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size680.0 B
2024-03-15T01:13:02.538177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.0869565
Min length2

Characters and Unicode

Total characters213
Distinct characters95
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)81.2%

Sample

1st row정문수
2nd row이병수
3rd row박한일
4th row김형주
5th row조상래
ValueCountFrequency (%)
신경규 3
 
4.2%
황병률 2
 
2.8%
옥영주 2
 
2.8%
한창호 2
 
2.8%
최석 2
 
2.8%
최정수 2
 
2.8%
황서영 1
 
1.4%
조상제 1
 
1.4%
신상규 1
 
1.4%
김치업 1
 
1.4%
Other values (54) 54
76.1%
2024-03-15T01:13:04.123671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
5.2%
11
 
5.2%
9
 
4.2%
7
 
3.3%
7
 
3.3%
6
 
2.8%
6
 
2.8%
6
 
2.8%
5
 
2.3%
4
 
1.9%
Other values (85) 141
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 208
97.7%
Space Separator 2
 
0.9%
Open Punctuation 1
 
0.5%
Decimal Number 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
5.3%
11
 
5.3%
9
 
4.3%
7
 
3.4%
7
 
3.4%
6
 
2.9%
6
 
2.9%
6
 
2.9%
5
 
2.4%
4
 
1.9%
Other values (81) 136
65.4%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 208
97.7%
Common 5
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
5.3%
11
 
5.3%
9
 
4.3%
7
 
3.4%
7
 
3.4%
6
 
2.9%
6
 
2.9%
6
 
2.9%
5
 
2.4%
4
 
1.9%
Other values (81) 136
65.4%
Common
ValueCountFrequency (%)
2
40.0%
( 1
20.0%
1 1
20.0%
) 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 208
97.7%
ASCII 5
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
5.3%
11
 
5.3%
9
 
4.3%
7
 
3.4%
7
 
3.4%
6
 
2.9%
6
 
2.9%
6
 
2.9%
5
 
2.4%
4
 
1.9%
Other values (81) 136
65.4%
ASCII
ValueCountFrequency (%)
2
40.0%
( 1
20.0%
1 1
20.0%
) 1
20.0%

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size680.0 B
Minimum2024-03-04 00:00:00
Maximum2024-03-04 00:00:00
2024-03-15T01:13:04.469982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:13:04.804585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T01:12:53.791444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:13:04.939691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종사업체명칭사업체소재지(도로명)대표자
연번1.0000.9880.8060.9080.871
업종0.9881.0000.0000.0000.000
사업체명칭0.8060.0001.0001.0001.000
사업체소재지(도로명)0.9080.0001.0001.0000.999
대표자0.8710.0001.0000.9991.000
2024-03-15T01:13:05.129908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.850
업종0.8501.000

Missing values

2024-03-15T01:12:54.101462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:12:54.313531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종사업체명칭사업체소재지(도로명)대표자데이터 기준일자
01출판사국제해양문제연구소부산광역시 영도구 태종로 727 (동삼동)정문수2024-03-04
12출판사고신대학교출판부부산광역시 영도구 와치로 194 (동삼동)이병수2024-03-04
23출판사한국해양대학교출판부부산광역시 영도구 태종로 727 (동삼동)박한일2024-03-04
34출판사창경사부산광역시 영도구 태종로 594 (동삼동)김형주2024-03-04
45출판사CL&D부산광역시 영도구 와치로 194 (동삼동)조상래2024-03-04
56출판사엠.이.시(MEC)부산광역시 영도구 절영로85번길 1 (남항동2가)임현미2024-03-04
67출판사(재)한국조선해양기자재연구원부산광역시 영도구 해양로 435 (동삼동)김정렬2024-03-04
78출판사도서출판 산하부산광역시 영도구 태종로 170-1 (봉래동4가)정진우2024-03-04
89출판사행복을만드는사람들부산광역시 영도구 상리로 41, 202동 324호 (동삼동,주공아파트)권오용2024-03-04
910출판사해양 인문 사회 공동체부산광역시 영도구 동삼서로 61, 103동 305호 (동삼동, 도시아파트)안영숙2024-03-04
연번업종사업체명칭사업체소재지(도로명)대표자데이터 기준일자
5960출판사(주) 북앤아트부산광역시 영도구 대교로46번길 46, 6층 (봉래동2가)박지선2024-03-04
6061출판사꽃기리네부산광역시 영도구 하나길 596 (신선동2가)이유미2024-03-04
6162출판사북인도 요가 명상부산광역시 영도구 함지로 8, 107동 604호 (동삼동, 함지그린아파트)은성원2024-03-04
6263출판사에스에스엘부산광역시 영도구 태종로 405, 청학 우성스마트시티 뷰 101동 1603호 (청학동)박지향2024-03-04
6364인쇄사대교인쇄사부산광역시 영도구 대평로 7-4 (대평동1가)이현영2024-03-04
6465인쇄사해양마크사부산광역시 영도구 태종로 709 (동삼동)김창균2024-03-04
6566인쇄사성지문화사부산광역시 영도구 절영로13번길 25 (봉래동2가)정향자2024-03-04
6667인쇄사(주)애드원플러스부산광역시 영도구 동삼서로 52, 지하1층 101호 (동삼동)옥영주2024-03-04
6768인쇄사동원문화사i부산광역시 영도구 태종로 662 (동삼동)황병률2024-03-04
6869인쇄사(주)이도시스템부산광역시 영도구 태종로 736, 5층 (동삼동)한창호2024-03-04