Overview

Dataset statistics

Number of variables: 5
Number of observations: 343
Missing cells: 16
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.5 KiB
Average record size in memory: 40.4 B
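
The percentage and per-record figures above follow from the raw counts. A minimal sketch, assuming the missing-cell percentage is the number of missing cells divided by the total cell count (variables × observations); the function names are illustrative and not part of ydata-profiling:

```python
def missing_cells_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of missing cells among all cells, as a percentage."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


def avg_record_size_bytes(total_kib: float, n_observations: int) -> float:
    """Average in-memory size of one record in bytes (1 KiB = 1024 B)."""
    return total_kib * 1024 / n_observations


# Values taken from the dataset statistics in this report.
pct = missing_cells_pct(missing_cells=16, n_variables=5, n_observations=343)
print(f"{pct:.1f}%")  # 16 missing cells out of 5 * 343 = 1715 cells
```

Applying `avg_record_size_bytes(13.5, 343)` gives roughly 40.3 B; the 40.4 B shown in the report presumably derives from the unrounded total size.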

Variable types

Categorical: 1
Text: 3
DateTime: 1

Dataset

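Description: Provides information on publishers and printing shops operating within Buk-gu, Daegu Metropolitan City (business type, business name, road-name address, lot-number address, etc.).
URL: https://www.data.go.kr/data/15006402/fileData.do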

Alerts

데이터 기준 일자 (data reference date) has a constant value [Constant]
사업체소재지(도로명) (business location, road-name address) has 15 (4.4%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 15:17:17.571563
Analysis finished: 2023-12-12 15:17:18.336939
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
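
A report like this one can be regenerated from the source CSV. The following is a minimal sketch, not the configuration actually used for this report: the file names, the report title, and the `cp949` encoding are assumptions (files from data.go.kr are often distributed in that encoding, but this one may differ).

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the report as HTML.

    The third-party imports live inside the function so that this module can
    be loaded even where pandas / ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the CSV is comma-separated with a header row.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Publishers and printers in Buk-gu, Daegu")
    profile.to_file(html_path)


# Hypothetical file names, shown for illustration only:
# build_report("bukgu_publishers_printers.csv", "report.html")
```

Timings and memory figures will differ from run to run, so only the statistics themselves should be expected to match.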

Variables

업종 (business type)
Categorical

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
출판사 (publisher): 254
인쇄사 (printer): 89

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출판사
2nd row: 출판사
3rd row: 출판사
4th row: 출판사
5th row: 출판사

Common Values

Value | Count | Frequency (%)
출판사 | 254 | 74.1%
인쇄사 | 89 | 25.9%
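
The frequency column is each count divided by the total number of observations. A minimal sketch of that calculation, rebuilding the 업종 column from the counts shown above (the helper name `frequency_table` is illustrative):

```python
from collections import Counter


def frequency_table(values):
    """Return (value, count, percent) rows, most common first."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(value, count, round(100 * count / total, 1))
            for value, count in counts.most_common()]


# Rebuild the column from the counts reported for 업종.
business_types = ["출판사"] * 254 + ["인쇄사"] * 89
for row in frequency_table(business_types):
    print(row)
```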

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 업종]

사업체명칭 (business name)
Text

Distinct: 319
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 31
Median length: 18
Mean length: 6.5510204
Min length: 2

Characters and Unicode

Total characters: 2247
Distinct characters: 374
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
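
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories over the five sample business names shown for this variable; it is not the profiler's own implementation, and the script and block properties are not available from `unicodedata`.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over all characters."""
    counts = Counter()
    for value in values:
        # Normalize so that each Hangul syllable is a single code point.
        text = unicodedata.normalize("NFC", value)
        counts.update(unicodedata.category(ch) for ch in text)
    return counts


sample_names = ["경북대학교 출판부", "배영출판사", "(주)금구", "한진출판사", "태평양기획"]
counts = category_counts(sample_names)
# 'Lo' = Other Letter, 'Zs' = Space Separator,
# 'Ps' = Open Punctuation, 'Pe' = Close Punctuation
print(counts)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for this variable.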

Unique

Unique: 295
Unique (%): 86.0%

Sample

1st row: 경북대학교 출판부
2nd row: 배영출판사
3rd row: (주)금구
4th row: 한진출판사
5th row: 태평양기획

Words

Value | Count | Frequency (%)
주식회사 | 24 | 5.3%
도서출판 | 13 | 2.9%
디자인 | 7 | 1.6%
출판사 | 5 | 1.1%
(value not rendered) | 4 | 0.9%
종이와연필 | 2 | 0.4%
렛츠스카이 | 2 | 0.4%
대구제판인쇄 | 2 | 0.4%
삼지출판사 | 2 | 0.4%
한영인쇄 | 2 | 0.4%
Other values (360) | 387 | 86.0%

Most occurring characters

ValueCountFrequency (%)
107
 
4.8%
87
 
3.9%
80
 
3.6%
69
 
3.1%
) 55
 
2.4%
( 53
 
2.4%
50
 
2.2%
48
 
2.1%
45
 
2.0%
44
 
2.0%
Other values (364) 1609
71.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1889 | 84.1%
Space Separator | 107 | 4.8%
Uppercase Letter | 64 | 2.8%
Lowercase Letter | 64 | 2.8%
Close Punctuation | 55 | 2.4%
Open Punctuation | 53 | 2.4%
Other Punctuation | 7 | 0.3%
Decimal Number | 7 | 0.3%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
4.6%
80
 
4.2%
69
 
3.7%
50
 
2.6%
48
 
2.5%
45
 
2.4%
44
 
2.3%
42
 
2.2%
35
 
1.9%
28
 
1.5%
Other values (317) 1361
72.0%
Uppercase Letter
ValueCountFrequency (%)
M 7
 
10.9%
I 6
 
9.4%
A 5
 
7.8%
T 4
 
6.2%
H 4
 
6.2%
S 4
 
6.2%
C 4
 
6.2%
D 3
 
4.7%
E 3
 
4.7%
K 3
 
4.7%
Other values (10) 21
32.8%
Lowercase Letter
ValueCountFrequency (%)
e 7
10.9%
o 6
 
9.4%
i 5
 
7.8%
a 5
 
7.8%
t 4
 
6.2%
l 4
 
6.2%
n 4
 
6.2%
u 4
 
6.2%
s 4
 
6.2%
d 4
 
6.2%
Other values (9) 17
26.6%
Decimal Number
ValueCountFrequency (%)
4 3
42.9%
6 2
28.6%
1 2
28.6%
Space Separator
ValueCountFrequency (%)
107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1884 | 83.8%
Common | 230 | 10.2%
Latin | 128 | 5.7%
Han | 5 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
4.6%
80
 
4.2%
69
 
3.7%
50
 
2.7%
48
 
2.5%
45
 
2.4%
44
 
2.3%
42
 
2.2%
35
 
1.9%
28
 
1.5%
Other values (312) 1356
72.0%
Latin
ValueCountFrequency (%)
M 7
 
5.5%
e 7
 
5.5%
I 6
 
4.7%
o 6
 
4.7%
i 5
 
3.9%
a 5
 
3.9%
A 5
 
3.9%
t 4
 
3.1%
l 4
 
3.1%
n 4
 
3.1%
Other values (29) 75
58.6%
Common
ValueCountFrequency (%)
107
46.5%
) 55
23.9%
( 53
23.0%
. 7
 
3.0%
4 3
 
1.3%
6 2
 
0.9%
1 2
 
0.9%
- 1
 
0.4%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1884 | 83.8%
ASCII | 358 | 15.9%
CJK | 5 | 0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
107
29.9%
) 55
15.4%
( 53
14.8%
. 7
 
2.0%
M 7
 
2.0%
e 7
 
2.0%
I 6
 
1.7%
o 6
 
1.7%
i 5
 
1.4%
a 5
 
1.4%
Other values (37) 100
27.9%
Hangul
ValueCountFrequency (%)
87
 
4.6%
80
 
4.2%
69
 
3.7%
50
 
2.7%
48
 
2.5%
45
 
2.4%
44
 
2.3%
42
 
2.2%
35
 
1.9%
28
 
1.5%
Other values (312) 1356
72.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

사업체소재지(도로명) (business location, road-name address)
Text

Distinct: 281
Distinct (%): 85.7%
Missing: 15
Missing (%): 4.4%
Memory size: 2.8 KiB

Length

Max length: 46
Median length: 38
Mean length: 26.77439
Min length: 20

Characters and Unicode

Total characters: 8782
Distinct characters: 215
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
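
The length and character figures for this variable are mutually consistent: the reported mean length equals the total character count divided by the number of non-missing rows. A short check using only numbers from this report (the helper name is illustrative):

```python
def mean_length(total_characters: int, n_rows: int, n_missing: int) -> float:
    """Mean string length over the non-missing rows."""
    return total_characters / (n_rows - n_missing)


# 사업체소재지(도로명): 8782 characters, 343 rows, 15 missing.
print(round(mean_length(8782, 343, 15), 5))
```

The same relation holds for the other two text variables (2247 characters over 343 rows, and 7408 characters over 342 non-missing rows).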

Unique

Unique: 243
Unique (%): 74.1%

Sample

1st row: 대구광역시 북구 대학로 80 (산격동)
2nd row: 대구광역시 북구 중앙대로 487 (칠성동2가)
3rd row: 대구광역시 북구 연경중앙로 55 (연경동, 대구연경 연경숲)
4th row: 대구광역시 북구 원대로23길 9 (노원동1가)
5th row: 대구광역시 북구 침산남로 80 (침산동)

Words

Value | Count | Frequency (%)
대구광역시 | 328 | 18.9%
북구 | 328 | 18.9%
산격동 | 56 | 3.2%
침산동 | 29 | 1.7%
대현동 | 29 | 1.7%
복현동 | 24 | 1.4%
대학로 | 21 | 1.2%
노원동3가 | 20 | 1.2%
칠성동2가 | 18 | 1.0%
읍내동 | 17 | 1.0%
Other values (469) | 861 | 49.7%

Most occurring characters

ValueCountFrequency (%)
1403
 
16.0%
695
 
7.9%
493
 
5.6%
406
 
4.6%
361
 
4.1%
338
 
3.8%
331
 
3.8%
330
 
3.8%
328
 
3.7%
) 328
 
3.7%
Other values (205) 3769
42.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5440 | 61.9%
Space Separator | 1403 | 16.0%
Decimal Number | 1104 | 12.6%
Close Punctuation | 328 | 3.7%
Open Punctuation | 328 | 3.7%
Other Punctuation | 108 | 1.2%
Dash Punctuation | 65 | 0.7%
Uppercase Letter | 4 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
695
 
12.8%
493
 
9.1%
406
 
7.5%
361
 
6.6%
338
 
6.2%
331
 
6.1%
330
 
6.1%
328
 
6.0%
148
 
2.7%
145
 
2.7%
Other values (187) 1865
34.3%
Decimal Number
ValueCountFrequency (%)
1 239
21.6%
2 155
14.0%
3 123
11.1%
5 106
9.6%
0 97
8.8%
4 96
8.7%
7 86
 
7.8%
6 75
 
6.8%
9 68
 
6.2%
8 59
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
I 2
50.0%
T 2
50.0%
Space Separator
ValueCountFrequency (%)
1403
100.0%
Close Punctuation
ValueCountFrequency (%)
) 328
100.0%
Open Punctuation
ValueCountFrequency (%)
( 328
100.0%
Other Punctuation
ValueCountFrequency (%)
, 108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5440 | 61.9%
Common | 3336 | 38.0%
Latin | 6 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
695
 
12.8%
493
 
9.1%
406
 
7.5%
361
 
6.6%
338
 
6.2%
331
 
6.1%
330
 
6.1%
328
 
6.0%
148
 
2.7%
145
 
2.7%
Other values (187) 1865
34.3%
Common
ValueCountFrequency (%)
1403
42.1%
) 328
 
9.8%
( 328
 
9.8%
1 239
 
7.2%
2 155
 
4.6%
3 123
 
3.7%
, 108
 
3.2%
5 106
 
3.2%
0 97
 
2.9%
4 96
 
2.9%
Other values (5) 353
 
10.6%
Latin
ValueCountFrequency (%)
I 2
33.3%
T 2
33.3%
e 2
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5440 | 61.9%
ASCII | 3342 | 38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1403
42.0%
) 328
 
9.8%
( 328
 
9.8%
1 239
 
7.2%
2 155
 
4.6%
3 123
 
3.7%
, 108
 
3.2%
5 106
 
3.2%
0 97
 
2.9%
4 96
 
2.9%
Other values (8) 359
 
10.7%
Hangul
ValueCountFrequency (%)
695
 
12.8%
493
 
9.1%
406
 
7.5%
361
 
6.6%
338
 
6.2%
331
 
6.1%
330
 
6.1%
328
 
6.0%
148
 
2.7%
145
 
2.7%
Other values (187) 1865
34.3%

사업체소재지(지번) (business location, lot-number address)
Text

Distinct: 291
Distinct (%): 85.1%
Missing: 1
Missing (%): 0.3%
Memory size: 2.8 KiB

Length

Max length: 37
Median length: 35
Mean length: 21.660819
Min length: 16

Characters and Unicode

Total characters: 7408
Distinct characters: 195
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 247
Unique (%): 72.2%

Sample

1st row: 대구광역시 북구 산격동 1370
2nd row: 대구광역시 북구 칠성동2가 302-300
3rd row: 대구광역시 북구 연경동 923 대구연경 연경숲
4th row: 대구광역시 북구 노원동1가 415
5th row: 대구광역시 북구 읍내동 955-14

Words

Value | Count | Frequency (%)
대구광역시 | 342 | 22.8%
북구 | 342 | 22.8%
산격동 | 62 | 4.1%
침산동 | 39 | 2.6%
대현동 | 32 | 2.1%
복현동 | 28 | 1.9%
칠성동2가 | 25 | 1.7%
노원동3가 | 23 | 1.5%
태전동 | 17 | 1.1%
읍내동 | 17 | 1.1%
Other values (395) | 573 | 38.2%

Most occurring characters

ValueCountFrequency (%)
1420
19.2%
710
 
9.6%
398
 
5.4%
376
 
5.1%
350
 
4.7%
350
 
4.7%
343
 
4.6%
342
 
4.6%
1 319
 
4.3%
- 249
 
3.4%
Other values (185) 2551
34.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4219 | 57.0%
Decimal Number | 1516 | 20.5%
Space Separator | 1420 | 19.2%
Dash Punctuation | 249 | 3.4%
Other Punctuation | 2 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
710
16.8%
398
 
9.4%
376
 
8.9%
350
 
8.3%
350
 
8.3%
343
 
8.1%
342
 
8.1%
115
 
2.7%
70
 
1.7%
68
 
1.6%
Other values (171) 1097
26.0%
Decimal Number
ValueCountFrequency (%)
1 319
21.0%
3 217
14.3%
2 199
13.1%
4 133
8.8%
5 125
 
8.2%
7 124
 
8.2%
9 117
 
7.7%
0 106
 
7.0%
8 88
 
5.8%
6 88
 
5.8%
Space Separator
ValueCountFrequency (%)
1420
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 249
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4219 | 57.0%
Common | 3187 | 43.0%
Latin | 2 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
710
16.8%
398
 
9.4%
376
 
8.9%
350
 
8.3%
350
 
8.3%
343
 
8.1%
342
 
8.1%
115
 
2.7%
70
 
1.7%
68
 
1.6%
Other values (171) 1097
26.0%
Common
ValueCountFrequency (%)
1420
44.6%
1 319
 
10.0%
- 249
 
7.8%
3 217
 
6.8%
2 199
 
6.2%
4 133
 
4.2%
5 125
 
3.9%
7 124
 
3.9%
9 117
 
3.7%
0 106
 
3.3%
Other values (3) 178
 
5.6%
Latin
ValueCountFrequency (%)
e 2
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4219 | 57.0%
ASCII | 3189 | 43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1420
44.5%
1 319
 
10.0%
- 249
 
7.8%
3 217
 
6.8%
2 199
 
6.2%
4 133
 
4.2%
5 125
 
3.9%
7 124
 
3.9%
9 117
 
3.7%
0 106
 
3.3%
Other values (4) 180
 
5.6%
Hangul
ValueCountFrequency (%)
710
16.8%
398
 
9.4%
376
 
8.9%
350
 
8.3%
350
 
8.3%
343
 
8.1%
342
 
8.1%
115
 
2.7%
70
 
1.7%
68
 
1.6%
Other values (171) 1097
26.0%

데이터 기준 일자 (data reference date)
DateTime

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
Minimum: 2023-07-25 00:00:00
Maximum: 2023-07-25 00:00:00

[Figure: Histogram with fixed size bins (bins=1)]

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
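
Nullity correlation can be read as the ordinary Pearson correlation between two columns' missing-value indicators. A minimal sketch under that assumption; the column data below is hypothetical, not taken from this dataset, and the helper is not the plotting library's API:

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Missing entries are represented by None. Both columns must have the same
    non-zero length. Returns None when either column is fully present or
    fully missing, because the correlation is undefined in that case.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Hypothetical columns: both are missing in exactly the same rows.
road = ["A", None, "B", None]
lot = ["C", None, "D", None]
print(nullity_correlation(road, lot))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one is present where the other is missing, and columns with no missing values (such as 업종 here) have no defined nullity correlation.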

Sample

First rows

 | 업종 | 사업체명칭 | 사업체소재지(도로명) | 사업체소재지(지번) | 데이터 기준 일자
0 | 출판사 | 경북대학교 출판부 | 대구광역시 북구 대학로 80 (산격동) | 대구광역시 북구 산격동 1370 | 2023-07-25
1 | 출판사 | 배영출판사 | 대구광역시 북구 중앙대로 487 (칠성동2가) | 대구광역시 북구 칠성동2가 302-300 | 2023-07-25
2 | 출판사 | (주)금구 | 대구광역시 북구 연경중앙로 55 (연경동, 대구연경 연경숲) | 대구광역시 북구 연경동 923 대구연경 연경숲 | 2023-07-25
3 | 출판사 | 한진출판사 | 대구광역시 북구 원대로23길 9 (노원동1가) | 대구광역시 북구 노원동1가 415 | 2023-07-25
4 | 출판사 | 태평양기획 | <NA> | 대구광역시 북구 읍내동 955-14 | 2023-07-25
5 | 출판사 | 매일관광문화사 | <NA> | 대구광역시 북구 노원동3가 1102-2 | 2023-07-25
6 | 출판사 | (주)한국종합기술 | 대구광역시 북구 침산남로 80 (침산동) | 대구광역시 북구 침산동 443-2 | 2023-07-25
7 | 출판사 | 영진전문대학교 출판부 | 대구광역시 북구 복현로 35 (복현동) | 대구광역시 북구 복현동 218 | 2023-07-25
8 | 출판사 | 대학생성경읽기 출판사 | <NA> | 대구광역시 북구 대현동 247 | 2023-07-25
9 | 출판사 | 도서출판청림 | 대구광역시 북구 복현로2길 16 (복현동) | 대구광역시 북구 복현동 375-4 | 2023-07-25

Last rows

 | 업종 | 사업체명칭 | 사업체소재지(도로명) | 사업체소재지(지번) | 데이터 기준 일자
333 | 인쇄사 | 반석보호작업장 | 대구광역시 북구 검단로27길 58-10 (검단동) | 대구광역시 북구 검단동 838-20 | 2023-07-25
334 | 인쇄사 | 성심인쇄사 | 대구광역시 북구 태암로 60 (구암동) | 대구광역시 북구 구암동 687-1 | 2023-07-25
335 | 인쇄사 | 삼화프린텍 | 대구광역시 북구 노원로42길 30 (침산동) | 대구광역시 북구 침산동 1096 | 2023-07-25
336 | 인쇄사 | 광고회사 유(U) | 대구광역시 북구 한강로8길 19, 페스트빌딩 (금호동) | 대구광역시 북구 금호동 803 페스트빌딩 | 2023-07-25
337 | 인쇄사 | 채움 | 대구광역시 북구 태암로 84 (구암동) | 대구광역시 북구 구암동 686-4 | 2023-07-25
338 | 인쇄사 | 주식회사 디엔디 | 대구광역시 북구 연암로 74 (산격동) | 대구광역시 북구 산격동 976 | 2023-07-25
339 | 인쇄사 | 주식회사 이투 | 대구광역시 북구 연암로42길 60 (산격동) | 대구광역시 북구 산격동 620-9 | 2023-07-25
340 | 인쇄사 | 새희망 | 대구광역시 북구 대학로 15-1 (산격동) | 대구광역시 북구 산격동 1398-2 | 2023-07-25
341 | 인쇄사 | 장온 | 대구광역시 북구 검단로 50 (복현동, 복현서한타운) | 대구광역시 북구 복현동 539-110 복현서한타운 | 2023-07-25
342 | 인쇄사 | 호림기획 | 대구광역시 북구 옥산로 87, 태왕아너스로뎀플러스 (칠성동2가) | 대구광역시 북구 칠성동2가 89-1 태왕아너스로뎀플러스 | 2023-07-25