Overview

Dataset statistics

Number of variables5
Number of observations264
Missing cells16
Missing cells (%)1.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.4 KiB
Average record size in memory40.5 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description대구광역시 북구 출판사및인쇄소_20190725
Author대구광역시 북구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15006402&dataSetDetailId=150064022cb93e6187445_201908051744&provdMethod=FILE

Alerts

데이터기준일자 has constant value ""Constant
소재지도로명주소 has 15 (5.7%) missing valuesMissing

Reproduction

Analysis started2024-04-20 16:59:39.701680
Analysis finished2024-04-20 16:59:40.738042
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
출판사
185 
인쇄사
79 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 185
70.1%
인쇄사 79
29.9%

Length

2024-04-21T01:59:40.848207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T01:59:41.009069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 185
70.1%
인쇄사 79
29.9%
Distinct245
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-21T01:59:41.769505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length14
Mean length6.5719697
Min length1

Characters and Unicode

Total characters1735
Distinct characters330
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)86.0%

Sample

1st row경북대학교 출판부
2nd row배영출판사
3rd row(주)금구
4th row한진출판사
5th row태평양기획
ValueCountFrequency (%)
주식회사 14
 
4.1%
도서출판 11
 
3.2%
디자인 6
 
1.8%
출판사 4
 
1.2%
4
 
1.2%
하나문화사 3
 
0.9%
종이와연필 2
 
0.6%
금강디지탈프린텍 2
 
0.6%
경북대학교 2
 
0.6%
집현전 2
 
0.6%
Other values (273) 290
85.3%
2024-04-21T01:59:42.809956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
76
 
4.4%
69
 
4.0%
68
 
3.9%
53
 
3.1%
43
 
2.5%
) 43
 
2.5%
42
 
2.4%
( 41
 
2.4%
40
 
2.3%
34
 
2.0%
Other values (320) 1226
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1491
85.9%
Space Separator 76
 
4.4%
Lowercase Letter 55
 
3.2%
Close Punctuation 43
 
2.5%
Open Punctuation 41
 
2.4%
Uppercase Letter 21
 
1.2%
Decimal Number 6
 
0.3%
Other Punctuation 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
4.6%
68
 
4.6%
53
 
3.6%
43
 
2.9%
42
 
2.8%
40
 
2.7%
34
 
2.3%
31
 
2.1%
27
 
1.8%
25
 
1.7%
Other values (280) 1059
71.0%
Lowercase Letter
ValueCountFrequency (%)
i 5
 
9.1%
a 4
 
7.3%
e 4
 
7.3%
l 4
 
7.3%
o 4
 
7.3%
g 4
 
7.3%
n 4
 
7.3%
u 3
 
5.5%
d 3
 
5.5%
m 3
 
5.5%
Other values (10) 17
30.9%
Uppercase Letter
ValueCountFrequency (%)
M 4
19.0%
C 3
14.3%
P 3
14.3%
K 2
9.5%
H 2
9.5%
S 1
 
4.8%
U 1
 
4.8%
N 1
 
4.8%
O 1
 
4.8%
I 1
 
4.8%
Other values (2) 2
9.5%
Decimal Number
ValueCountFrequency (%)
6 2
33.3%
4 2
33.3%
1 2
33.3%
Space Separator
ValueCountFrequency (%)
76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1486
85.6%
Common 168
 
9.7%
Latin 76
 
4.4%
Han 5
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
4.6%
68
 
4.6%
53
 
3.6%
43
 
2.9%
42
 
2.8%
40
 
2.7%
34
 
2.3%
31
 
2.1%
27
 
1.8%
25
 
1.7%
Other values (275) 1054
70.9%
Latin
ValueCountFrequency (%)
i 5
 
6.6%
M 4
 
5.3%
a 4
 
5.3%
e 4
 
5.3%
l 4
 
5.3%
o 4
 
5.3%
g 4
 
5.3%
n 4
 
5.3%
u 3
 
3.9%
d 3
 
3.9%
Other values (22) 37
48.7%
Common
ValueCountFrequency (%)
76
45.2%
) 43
25.6%
( 41
24.4%
6 2
 
1.2%
4 2
 
1.2%
1 2
 
1.2%
. 1
 
0.6%
- 1
 
0.6%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1486
85.6%
ASCII 244
 
14.1%
CJK 5
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
76
31.1%
) 43
17.6%
( 41
16.8%
i 5
 
2.0%
M 4
 
1.6%
a 4
 
1.6%
e 4
 
1.6%
l 4
 
1.6%
o 4
 
1.6%
g 4
 
1.6%
Other values (30) 55
22.5%
Hangul
ValueCountFrequency (%)
69
 
4.6%
68
 
4.6%
53
 
3.6%
43
 
2.9%
42
 
2.8%
40
 
2.7%
34
 
2.3%
31
 
2.1%
27
 
1.8%
25
 
1.7%
Other values (275) 1054
70.9%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct210
Distinct (%)84.3%
Missing15
Missing (%)5.7%
Memory size2.2 KiB
2024-04-21T01:59:43.804411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length48
Mean length28.931727
Min length20

Characters and Unicode

Total characters7204
Distinct characters185
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)71.5%

Sample

1st row대구광역시 북구 대학로 80 (산격동)
2nd row대구광역시 북구 중앙대로 487 (칠성동2가)
3rd row대구광역시 북구 칠성남로37길 29 (칠성동2가)
4th row대구광역시 북구 원대로23길 9 (노원동1가)
5th row대구광역시 북구 침산남로 80 (침산동)
ValueCountFrequency (%)
대구광역시 249
 
17.1%
북구 249
 
17.1%
산격동 40
 
2.8%
대현동 31
 
2.1%
복현동 26
 
1.8%
침산동 25
 
1.7%
노원동3가 20
 
1.4%
칠성동2가 18
 
1.2%
대학로 15
 
1.0%
대현로9길 14
 
1.0%
Other values (424) 767
52.8%
2024-04-21T01:59:45.086921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1205
 
16.7%
521
 
7.2%
376
 
5.2%
341
 
4.7%
1 279
 
3.9%
277
 
3.8%
255
 
3.5%
251
 
3.5%
250
 
3.5%
250
 
3.5%
Other values (175) 3199
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4090
56.8%
Decimal Number 1214
 
16.9%
Space Separator 1205
 
16.7%
Close Punctuation 249
 
3.5%
Open Punctuation 249
 
3.5%
Other Punctuation 132
 
1.8%
Dash Punctuation 57
 
0.8%
Uppercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
521
 
12.7%
376
 
9.2%
341
 
8.3%
277
 
6.8%
255
 
6.2%
251
 
6.1%
250
 
6.1%
250
 
6.1%
120
 
2.9%
95
 
2.3%
Other values (156) 1354
33.1%
Decimal Number
ValueCountFrequency (%)
1 279
23.0%
2 177
14.6%
0 161
13.3%
3 146
12.0%
4 98
 
8.1%
5 85
 
7.0%
6 77
 
6.3%
7 74
 
6.1%
9 68
 
5.6%
8 49
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
I 3
37.5%
T 3
37.5%
D 1
 
12.5%
C 1
 
12.5%
Space Separator
ValueCountFrequency (%)
1205
100.0%
Close Punctuation
ValueCountFrequency (%)
) 249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 249
100.0%
Other Punctuation
ValueCountFrequency (%)
, 132
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4090
56.8%
Common 3106
43.1%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
521
 
12.7%
376
 
9.2%
341
 
8.3%
277
 
6.8%
255
 
6.2%
251
 
6.1%
250
 
6.1%
250
 
6.1%
120
 
2.9%
95
 
2.3%
Other values (156) 1354
33.1%
Common
ValueCountFrequency (%)
1205
38.8%
1 279
 
9.0%
) 249
 
8.0%
( 249
 
8.0%
2 177
 
5.7%
0 161
 
5.2%
3 146
 
4.7%
, 132
 
4.2%
4 98
 
3.2%
5 85
 
2.7%
Other values (5) 325
 
10.5%
Latin
ValueCountFrequency (%)
I 3
37.5%
T 3
37.5%
D 1
 
12.5%
C 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4090
56.8%
ASCII 3114
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1205
38.7%
1 279
 
9.0%
) 249
 
8.0%
( 249
 
8.0%
2 177
 
5.7%
0 161
 
5.2%
3 146
 
4.7%
, 132
 
4.2%
4 98
 
3.1%
5 85
 
2.7%
Other values (9) 333
 
10.7%
Hangul
ValueCountFrequency (%)
521
 
12.7%
376
 
9.2%
341
 
8.3%
277
 
6.8%
255
 
6.2%
251
 
6.1%
250
 
6.1%
250
 
6.1%
120
 
2.9%
95
 
2.3%
Other values (156) 1354
33.1%
Distinct224
Distinct (%)85.2%
Missing1
Missing (%)0.4%
Memory size2.2 KiB
2024-04-21T01:59:45.970596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length41
Mean length24.410646
Min length19

Characters and Unicode

Total characters6420
Distinct characters160
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique192 ?
Unique (%)73.0%

Sample

1st row대구광역시 북구 산격동 1370번지
2nd row대구광역시 북구 칠성동2가 302-300번지
3rd row대구광역시 북구 칠성동2가 548-3번지
4th row대구광역시 북구 노원동1가 415번지
5th row대구광역시 북구 읍내동 955-14번지
ValueCountFrequency (%)
대구광역시 263
22.0%
북구 263
22.0%
산격동 41
 
3.4%
대현동 32
 
2.7%
침산동 28
 
2.3%
복현동 26
 
2.2%
노원동3가 23
 
1.9%
칠성동2가 21
 
1.8%
서변동 13
 
1.1%
태전동 12
 
1.0%
Other values (350) 475
39.7%
2024-04-21T01:59:47.136443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1165
18.1%
540
 
8.4%
313
 
4.9%
304
 
4.7%
1 284
 
4.4%
269
 
4.2%
269
 
4.2%
266
 
4.1%
264
 
4.1%
264
 
4.1%
Other values (150) 2482
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3652
56.9%
Decimal Number 1396
 
21.7%
Space Separator 1165
 
18.1%
Dash Punctuation 202
 
3.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
540
14.8%
313
 
8.6%
304
 
8.3%
269
 
7.4%
269
 
7.4%
266
 
7.3%
264
 
7.2%
264
 
7.2%
263
 
7.2%
74
 
2.0%
Other values (135) 826
22.6%
Decimal Number
ValueCountFrequency (%)
1 284
20.3%
3 199
14.3%
2 186
13.3%
0 141
10.1%
4 125
9.0%
7 119
8.5%
5 107
 
7.7%
9 87
 
6.2%
6 74
 
5.3%
8 74
 
5.3%
Space Separator
ValueCountFrequency (%)
1165
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 202
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3652
56.9%
Common 2767
43.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
540
14.8%
313
 
8.6%
304
 
8.3%
269
 
7.4%
269
 
7.4%
266
 
7.3%
264
 
7.2%
264
 
7.2%
263
 
7.2%
74
 
2.0%
Other values (135) 826
22.6%
Common
ValueCountFrequency (%)
1165
42.1%
1 284
 
10.3%
- 202
 
7.3%
3 199
 
7.2%
2 186
 
6.7%
0 141
 
5.1%
4 125
 
4.5%
7 119
 
4.3%
5 107
 
3.9%
9 87
 
3.1%
Other values (4) 152
 
5.5%
Latin
ValueCountFrequency (%)
C 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3652
56.9%
ASCII 2768
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1165
42.1%
1 284
 
10.3%
- 202
 
7.3%
3 199
 
7.2%
2 186
 
6.7%
0 141
 
5.1%
4 125
 
4.5%
7 119
 
4.3%
5 107
 
3.9%
9 87
 
3.1%
Other values (5) 153
 
5.5%
Hangul
ValueCountFrequency (%)
540
14.8%
313
 
8.6%
304
 
8.3%
269
 
7.4%
269
 
7.4%
266
 
7.3%
264
 
7.2%
264
 
7.2%
263
 
7.2%
74
 
2.0%
Other values (135) 826
22.6%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
Minimum2019-07-25 00:00:00
Maximum2019-07-25 00:00:00
2024-04-21T01:59:47.323150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T01:59:47.493489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2024-04-21T01:59:40.347183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T01:59:40.514017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T01:59:40.660421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종사업체명칭소재지도로명주소소재지지번주소데이터기준일자
0출판사경북대학교 출판부대구광역시 북구 대학로 80 (산격동)대구광역시 북구 산격동 1370번지2019-07-25
1출판사배영출판사대구광역시 북구 중앙대로 487 (칠성동2가)대구광역시 북구 칠성동2가 302-300번지2019-07-25
2출판사(주)금구대구광역시 북구 칠성남로37길 29 (칠성동2가)대구광역시 북구 칠성동2가 548-3번지2019-07-25
3출판사한진출판사대구광역시 북구 원대로23길 9 (노원동1가)대구광역시 북구 노원동1가 415번지2019-07-25
4출판사태평양기획<NA>대구광역시 북구 읍내동 955-14번지2019-07-25
5출판사매일관광문화사<NA>대구광역시 북구 노원동3가 1102-2번지2019-07-25
6출판사(주)한국종합기술대구광역시 북구 침산남로 80 (침산동)대구광역시 북구 침산동 443-2번지2019-07-25
7출판사영진전문대학교 출판부대구광역시 북구 복현로 35 (복현동)대구광역시 북구 복현동 218번지2019-07-25
8출판사대학생성경읽기 출판사<NA>대구광역시 북구 대현동 247번지2019-07-25
9출판사도서출판청림대구광역시 북구 복현로2길 16 (복현동)대구광역시 북구 복현동 375-4번지2019-07-25
업종사업체명칭소재지도로명주소소재지지번주소데이터기준일자
254인쇄사제이에스 파트너스대구광역시 북구 대학로 10 (산격동)대구광역시 북구 산격동 1382-93번지2019-07-25
255인쇄사상상146대구광역시 북구 호암로 51, 203호 (침산동, 대구삼성창조경제단지)대구광역시 북구 침산동 105-2번지2019-07-25
256인쇄사(주)시스픽대구광역시 북구 노원로47길 9 (침산동)대구광역시 북구 침산동 838-2번지2019-07-25
257인쇄사한울기획인쇄대구광역시 북구 고성로34길 39, 2층 201호 (고성동2가)대구광역시 북구 고성동2가 58-1번지2019-07-25
258인쇄사대구제판인쇄대구광역시 북구 원대로23길 9 (노원동1가)대구광역시 북구 노원동1가 416번지2019-07-25
259인쇄사(주)세명 에스 앤 더블유대구광역시 북구 유통단지로8길 121-16, 2층 (산격동)대구광역시 북구 산격동 1778번지 2층2019-07-25
260인쇄사디자인 풍경대구광역시 북구 칠곡중앙대로 653-1 (읍내동)대구광역시 북구 읍내동 359-3번지2019-07-25
261인쇄사디자인 해오름대구광역시 북구 동암로 25-20 (읍내동)<NA>2019-07-25
262인쇄사주식회사 렛츠스카이대구광역시 북구 동북로 117, 507호 (산격동, 소프트웨어 벤처타워)대구광역시 북구 산격동 505-7번지2019-07-25
263인쇄사티플디자인대구광역시 북구 유통단지로8길 66, 3층 (산격동)대구광역시 북구 산격동 179-4번지2019-07-25