Overview

Dataset statistics

Number of variables: 6
Number of observations: 643
Missing cells: 459
Missing cells (%): 11.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 30.3 KiB
Average record size in memory: 48.2 B
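
The missing-cell figures can be checked by hand. The sketch below is illustrative only: the per-column missing counts (4 and 455) are copied from the variable sections of this report, and the helper name `missing_pct` is hypothetical.

```python
# Figures copied from this report; variable and function names are illustrative.
n_variables = 6
n_observations = 643
missing_per_column = {"사업체소재지(도로명주소)": 4, "전화번호": 455}

def missing_pct(missing: int, total: int) -> float:
    """Share of missing entries as a percentage, rounded to one decimal."""
    return round(100 * missing / total, 1)

total_cells = n_variables * n_observations        # 6 * 643 cells in the table
missing_cells = sum(missing_per_column.values())  # 4 + 455

print(missing_cells, missing_pct(missing_cells, total_cells))
print(missing_pct(missing_per_column["전화번호"], n_observations))
```

The two columns with gaps account for all 459 missing cells, which is why the overall rate (11.9%) is far lower than the rate for 전화번호 alone (70.8%).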

Variable types

Text: 3
Categorical: 1
DateTime: 2

Dataset

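Description: Status of the publishing and printing industry (business name, telephone number, address, location). Examples of entries in this dataset include 북세종인쇄사, 광일인쇄사, 중앙인쇄사, 한국법제연구원, 산업연구원, and 금강인쇄사.
Author: 세종특별자치시 (Sejong Special Self-Governing City)
URL: https://www.data.go.kr/data/3037718/fileData.do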

Alerts

등록기준일 (registration reference date) has constant value "2024-04-30" (Constant)
전화번호 (telephone number) has 455 (70.8%) missing values (Missing)
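
As a rough illustration of how alerts like these two can be derived, the sketch below flags a column as constant or as having missing values. The rules, the function name `column_alerts`, and the `missing_threshold` parameter are assumptions made for this example; the report does not show the thresholds ydata-profiling actually applied.

```python
from typing import List, Optional, Sequence

def column_alerts(values: Sequence[Optional[str]],
                  missing_threshold: float = 0.0) -> List[str]:
    """Return alert labels for one column (hypothetical rules, not the library's)."""
    alerts = []
    present = [v for v in values if v is not None]
    # Constant: every non-missing entry holds the same value.
    if present and len(set(present)) == 1:
        alerts.append("Constant")
    # Missing: the share of missing entries exceeds the threshold.
    n_missing = len(values) - len(present)
    if values and n_missing / len(values) > missing_threshold:
        alerts.append("Missing")
    return alerts

# The first ten rows shown in the Sample section of this report.
phones = ["044-866-8787", "044-861-3142", "044-553-7097", None, None,
          "044-866-6044+044-866-6045", "044-865-5818", None, None,
          "044-865-1357"]
dates = ["2024-04-30"] * 10

print(column_alerts(phones))
print(column_alerts(dates))
```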

Reproduction

Analysis started: 2024-05-04 08:01:02.700709
Analysis finished: 2024-05-04 08:01:05.716447
Duration: 3.02 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업체명칭
Text

Distinct: 548
Distinct (%): 85.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB

Length

Max length: 35
Median length: 24
Mean length: 6.9813375
Min length: 1

Characters and Unicode

Total characters: 4489
Distinct characters: 470
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
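
A minimal sketch of this kind of analysis, using Python's standard `unicodedata` module to count general categories over the five sample values listed for this variable. The function name `category_counts` is illustrative, and this is not necessarily how the report itself computes its tables.

```python
import unicodedata
from collections import Counter

def category_counts(texts):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over all characters."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)

sample = ["금강출판사", "(주)미래엔", "하나로출판사", "대학가출판사", "도서출판 은진"]
counts = category_counts(sample)

# 'Lo' = Other Letter (Hangul syllables), 'Zs' = Space Separator,
# 'Ps' = Open Punctuation, 'Pe' = Close Punctuation
for category, n in counts.most_common():
    print(category, n)
```

The standard library exposes general categories but not scripts or blocks, so the script and block breakdowns would need an additional data source.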

Unique

Unique: 456
Unique (%): 70.9%
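
The figures suggest that "Distinct" counts how many different values occur (548), while "Unique" counts values that occur exactly once (456), each divided by the 643 observations. The sketch below illustrates that reading on invented toy data; the function name is hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy data (not taken from the dataset): one name repeats,
# so it is distinct but not unique.
names = ["금강출판사", "나모기획", "나모기획", "(주)미래엔"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)

# Percentages as reported for this variable.
print(round(100 * 548 / 643, 1), round(100 * 456 / 643, 1))
```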

Sample

1st row: 금강출판사
2nd row: (주)미래엔
3rd row: 하나로출판사
4th row: 대학가출판사
5th row: 도서출판 은진
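
| Value | Count | Frequency (%) |
|---|---|---|
| 주식회사 | 53 | 6.3% |
| 도서출판 | 21 | 2.5% |
| 출판사 | 8 | 1.0% |
| 디자인 | 7 | 0.8% |
| 디자인시티 | 4 | 0.5% |
| 사단법인 | 4 | 0.5% |
| 세종지점 | 4 | 0.5% |
| 나모기획 | 4 | 0.5% |
| 협동조합 | 3 | 0.4% |
| 기획 | 3 | 0.4% |
| Other values (620) | 729 | 86.8% |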
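
The counts in this table sum to 840 and 53/840 ≈ 6.3%, which is consistent with splitting each business name on whitespace and reporting each word's share of all words. The sketch below assumes that tokenization (the report does not state it) and uses four names from the Sample section; `word_frequencies` is an illustrative name.

```python
from collections import Counter

def word_frequencies(texts):
    """Split each text on whitespace and return (word, count, percent) tuples."""
    counts = Counter(word for text in texts for word in text.split())
    total = sum(counts.values())
    return [(word, n, round(100 * n / total, 1))
            for word, n in counts.most_common()]

names = ["도서출판 은진", "주식회사 아홉거리 미디어 윌",
         "길기획 주식회사", "주식회사 다인스"]

for word, n, pct in word_frequencies(names):
    print(word, n, f"{pct}%")
```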

Most occurring characters

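| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 198 | 4.4% |
| (not captured) | 167 | 3.7% |
| (not captured) | 130 | 2.9% |
| (not captured) | 128 | 2.9% |
| (not captured) | 118 | 2.6% |
| (not captured) | 117 | 2.6% |
| ) | 112 | 2.5% |
| ( | 110 | 2.5% |
| (not captured) | 94 | 2.1% |
| (not captured) | 83 | 1.8% |
| Other values (460) | 3232 | 72.0% |

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Other Letter | 3731 | 83.1% |
| Space Separator | 198 | 4.4% |
| Lowercase Letter | 167 | 3.7% |
| Uppercase Letter | 151 | 3.4% |
| Close Punctuation | 112 | 2.5% |
| Open Punctuation | 110 | 2.5% |
| Other Punctuation | 10 | 0.2% |
| Decimal Number | 8 | 0.2% |
| Dash Punctuation | 2 | < 0.1% |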

Most frequent character per category

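Other Letter

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 167 | 4.5% |
| (not captured) | 130 | 3.5% |
| (not captured) | 128 | 3.4% |
| (not captured) | 118 | 3.2% |
| (not captured) | 117 | 3.1% |
| (not captured) | 94 | 2.5% |
| (not captured) | 83 | 2.2% |
| (not captured) | 75 | 2.0% |
| (not captured) | 71 | 1.9% |
| (not captured) | 70 | 1.9% |
| Other values (405) | 2678 | 71.8% |

Uppercase Letter

| Value | Count | Frequency (%) |
|---|---|---|
| M | 14 | 9.3% |
| A | 12 | 7.9% |
| S | 11 | 7.3% |
| O | 10 | 6.6% |
| P | 9 | 6.0% |
| L | 9 | 6.0% |
| I | 8 | 5.3% |
| G | 8 | 5.3% |
| C | 8 | 5.3% |
| E | 8 | 5.3% |
| Other values (13) | 54 | 35.8% |

Lowercase Letter

| Value | Count | Frequency (%) |
|---|---|---|
| i | 19 | 11.4% |
| n | 19 | 11.4% |
| a | 19 | 11.4% |
| o | 14 | 8.4% |
| e | 14 | 8.4% |
| t | 12 | 7.2% |
| s | 10 | 6.0% |
| r | 9 | 5.4% |
| d | 8 | 4.8% |
| l | 6 | 3.6% |
| Other values (10) | 37 | 22.2% |

Other Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| , | 5 | 50.0% |
| . | 2 | 20.0% |
| & | 2 | 20.0% |
| / | 1 | 10.0% |

Decimal Number

| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 3 | 37.5% |
| 1 | 2 | 25.0% |
| 0 | 2 | 25.0% |
| 4 | 1 | 12.5% |

Space Separator

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 198 | 100.0% |

Close Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| ) | 112 | 100.0% |

Open Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| ( | 110 | 100.0% |

Dash Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| - | 2 | 100.0% |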

Most occurring scripts

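| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 3727 | 83.0% |
| Common | 440 | 9.8% |
| Latin | 318 | 7.1% |
| Han | 4 | 0.1% |

Most frequent character per script

Hangul

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 167 | 4.5% |
| (not captured) | 130 | 3.5% |
| (not captured) | 128 | 3.4% |
| (not captured) | 118 | 3.2% |
| (not captured) | 117 | 3.1% |
| (not captured) | 94 | 2.5% |
| (not captured) | 83 | 2.2% |
| (not captured) | 75 | 2.0% |
| (not captured) | 71 | 1.9% |
| (not captured) | 70 | 1.9% |
| Other values (403) | 2674 | 71.7% |

Latin

| Value | Count | Frequency (%) |
|---|---|---|
| i | 19 | 6.0% |
| n | 19 | 6.0% |
| a | 19 | 6.0% |
| o | 14 | 4.4% |
| e | 14 | 4.4% |
| M | 14 | 4.4% |
| A | 12 | 3.8% |
| t | 12 | 3.8% |
| S | 11 | 3.5% |
| s | 10 | 3.1% |
| Other values (33) | 174 | 54.7% |

Common

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 198 | 45.0% |
| ) | 112 | 25.5% |
| ( | 110 | 25.0% |
| , | 5 | 1.1% |
| 2 | 3 | 0.7% |
| 1 | 2 | 0.5% |
| 0 | 2 | 0.5% |
| . | 2 | 0.5% |
| & | 2 | 0.5% |
| - | 2 | 0.5% |
| Other values (2) | 2 | 0.5% |

Han

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 2 | 50.0% |
| (not captured) | 2 | 50.0% |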

Most occurring blocks

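| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 3727 | 83.0% |
| ASCII | 758 | 16.9% |
| CJK Compat Ideographs | 2 | < 0.1% |
| CJK | 2 | < 0.1% |

Most frequent character per block

ASCII

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 198 | 26.1% |
| ) | 112 | 14.8% |
| ( | 110 | 14.5% |
| i | 19 | 2.5% |
| n | 19 | 2.5% |
| a | 19 | 2.5% |
| o | 14 | 1.8% |
| e | 14 | 1.8% |
| M | 14 | 1.8% |
| A | 12 | 1.6% |
| Other values (45) | 227 | 29.9% |

Hangul

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 167 | 4.5% |
| (not captured) | 130 | 3.5% |
| (not captured) | 128 | 3.4% |
| (not captured) | 118 | 3.2% |
| (not captured) | 117 | 3.1% |
| (not captured) | 94 | 2.5% |
| (not captured) | 83 | 2.2% |
| (not captured) | 75 | 2.0% |
| (not captured) | 71 | 1.9% |
| (not captured) | 70 | 1.9% |
| Other values (403) | 2674 | 71.7% |

CJK Compat Ideographs

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 2 | 100.0% |

CJK

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 2 | 100.0% |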

사업체소재지(도로명주소)
Text

Distinct: 337
Distinct (%): 52.7%
Missing: 4
Missing (%): 0.6%
Memory size: 5.2 KiB

Length

Max length: 25
Median length: 21
Mean length: 17.99374
Min length: 14

Characters and Unicode

Total characters: 11498
Distinct characters: 162
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 198
Unique (%): 31.0%

Sample

1st row: 세종특별자치시 금남면 금병로 31-9
2nd row: 세종특별자치시 연동면 청연로 492-14
3rd row: 세종특별자치시 소정면 세종로 4283
4th row: 세종특별자치시 전동면 하노장5길 2
5th row: 세종특별자치시 조치원읍 충현로 159

| Value | Count | Frequency (%) |
|---|---|---|
| 세종특별자치시 | 639 | 28.9% |
| 조치원읍 | 103 | 4.7% |
| 장군면 | 80 | 3.6% |
| 한누리대로 | 53 | 2.4% |
| 연기면 | 27 | 1.2% |
| 연서면 | 25 | 1.1% |
| 갈매로 | 23 | 1.0% |
| 금남면 | 22 | 1.0% |
| 6 | 20 | 0.9% |
| 시청대로 | 19 | 0.9% |
| Other values (401) | 1201 | 54.3% |

Most occurring characters

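| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1585 | 13.8% |
| (not captured) | 748 | 6.5% |
| (not captured) | 670 | 5.8% |
| (not captured) | 669 | 5.8% |
| (not captured) | 659 | 5.7% |
| (not captured) | 639 | 5.6% |
| (not captured) | 639 | 5.6% |
| (not captured) | 639 | 5.6% |
| (not captured) | 456 | 4.0% |
| 1 | 430 | 3.7% |
| Other values (152) | 4364 | 38.0% |

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Other Letter | 7780 | 67.7% |
| Decimal Number | 1979 | 17.2% |
| Space Separator | 1585 | 13.8% |
| Dash Punctuation | 154 | 1.3% |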

Most frequent character per category

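Other Letter

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 748 | 9.6% |
| (not captured) | 670 | 8.6% |
| (not captured) | 669 | 8.6% |
| (not captured) | 659 | 8.5% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 456 | 5.9% |
| (not captured) | 192 | 2.5% |
| (not captured) | 184 | 2.4% |
| Other values (140) | 2285 | 29.4% |

Decimal Number

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 430 | 21.7% |
| 2 | 302 | 15.3% |
| 3 | 248 | 12.5% |
| 5 | 177 | 8.9% |
| 4 | 159 | 8.0% |
| 0 | 150 | 7.6% |
| 9 | 144 | 7.3% |
| 8 | 133 | 6.7% |
| 7 | 125 | 6.3% |
| 6 | 111 | 5.6% |

Space Separator

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1585 | 100.0% |

Dash Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| - | 154 | 100.0% |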

Most occurring scripts

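| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 7780 | 67.7% |
| Common | 3718 | 32.3% |

Most frequent character per script

Hangul

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 748 | 9.6% |
| (not captured) | 670 | 8.6% |
| (not captured) | 669 | 8.6% |
| (not captured) | 659 | 8.5% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 456 | 5.9% |
| (not captured) | 192 | 2.5% |
| (not captured) | 184 | 2.4% |
| Other values (140) | 2285 | 29.4% |

Common

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1585 | 42.6% |
| 1 | 430 | 11.6% |
| 2 | 302 | 8.1% |
| 3 | 248 | 6.7% |
| 5 | 177 | 4.8% |
| 4 | 159 | 4.3% |
| - | 154 | 4.1% |
| 0 | 150 | 4.0% |
| 9 | 144 | 3.9% |
| 8 | 133 | 3.6% |
| Other values (2) | 236 | 6.3% |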

Most occurring blocks

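| Value | Count | Frequency (%) |
|---|---|---|
| Hangul | 7780 | 67.7% |
| ASCII | 3718 | 32.3% |

Most frequent character per block

ASCII

| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1585 | 42.6% |
| 1 | 430 | 11.6% |
| 2 | 302 | 8.1% |
| 3 | 248 | 6.7% |
| 5 | 177 | 4.8% |
| 4 | 159 | 4.3% |
| - | 154 | 4.1% |
| 0 | 150 | 4.0% |
| 9 | 144 | 3.9% |
| 8 | 133 | 3.6% |
| Other values (2) | 236 | 6.3% |

Hangul

| Value | Count | Frequency (%) |
|---|---|---|
| (not captured) | 748 | 9.6% |
| (not captured) | 670 | 8.6% |
| (not captured) | 669 | 8.6% |
| (not captured) | 659 | 8.5% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 639 | 8.2% |
| (not captured) | 456 | 5.9% |
| (not captured) | 192 | 2.5% |
| (not captured) | 184 | 2.4% |
| Other values (140) | 2285 | 29.4% |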

업종
Categorical

Distinct: 2
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB
출판사: 462
인쇄사: 181

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출판사
2nd row: 출판사
3rd row: 출판사
4th row: 출판사
5th row: 출판사

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 출판사 | 462 | 71.9% |
| 인쇄사 | 181 | 28.1% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
|---|---|---|
| 출판사 | 462 | 71.9% |
| 인쇄사 | 181 | 28.1% |

전화번호
Text

MISSING 

Distinct: 150
Distinct (%): 79.8%
Missing: 455
Missing (%): 70.8%
Memory size: 5.2 KiB

Length

Max length: 25
Median length: 12
Mean length: 12.207447
Min length: 9

Characters and Unicode

Total characters: 2295
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 116
Unique (%): 61.7%

Sample

1st row: 044-866-8787
2nd row: 044-861-3142
3rd row: 044-553-7097
4th row: 044-866-6044+044-866-6045
5th row: 044-865-5818

| Value | Count | Frequency (%) |
|---|---|---|
| 044-868-7542 | 4 | 2.1% |
| 044-865-2243 | 3 | 1.6% |
| 044-867-5130 | 3 | 1.6% |
| 044-858-3100 | 2 | 1.1% |
| 044-864-5577 | 2 | 1.1% |
| 044-866-8787 | 2 | 1.1% |
| 044-864-9548 | 2 | 1.1% |
| 070-8289-8337 | 2 | 1.1% |
| 044-862-7949 | 2 | 1.1% |
| 044-866-3011 | 2 | 1.1% |
| Other values (140) | 164 | 87.2% |

Most occurring characters

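| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 441 | 19.2% |
| - | 379 | 16.5% |
| 0 | 305 | 13.3% |
| 8 | 274 | 11.9% |
| 6 | 272 | 11.9% |
| 7 | 130 | 5.7% |
| 5 | 115 | 5.0% |
| 3 | 97 | 4.2% |
| 1 | 95 | 4.1% |
| 2 | 94 | 4.1% |
| Other values (2) | 93 | 4.1% |

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 1914 | 83.4% |
| Dash Punctuation | 379 | 16.5% |
| Math Symbol | 2 | 0.1% |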

Most frequent character per category

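Decimal Number

| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 441 | 23.0% |
| 0 | 305 | 15.9% |
| 8 | 274 | 14.3% |
| 6 | 272 | 14.2% |
| 7 | 130 | 6.8% |
| 5 | 115 | 6.0% |
| 3 | 97 | 5.1% |
| 1 | 95 | 5.0% |
| 2 | 94 | 4.9% |
| 9 | 91 | 4.8% |

Dash Punctuation

| Value | Count | Frequency (%) |
|---|---|---|
| - | 379 | 100.0% |

Math Symbol

| Value | Count | Frequency (%) |
|---|---|---|
| + | 2 | 100.0% |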
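
The two "+" characters come from entries that join two numbers, such as the sample value 044-866-6044+044-866-6045, which also explains the maximum length of 25. A cleaning step might split such entries before further analysis. The sketch below is one possible approach; the function name and the accepted number shape (`PHONE_PATTERN`) are assumptions, and values that do not match the pattern are dropped.

```python
import re
from typing import List, Optional

# Assumed shape: 2-3 digit prefix, 3-4 digit middle, 4 digit suffix.
PHONE_PATTERN = re.compile(r"^\d{2,3}-\d{3,4}-\d{4}$")

def split_phones(raw: Optional[str]) -> List[str]:
    """Split a '+'-joined entry into individual numbers matching PHONE_PATTERN."""
    if raw is None:
        return []
    parts = [part.strip() for part in raw.split("+")]
    return [part for part in parts if PHONE_PATTERN.match(part)]

print(split_phones("044-866-6044+044-866-6045"))
print(split_phones("070-8289-8337"))
print(split_phones(None))
```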

Most occurring scripts

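| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2295 | 100.0% |

Most frequent character per script

Common

| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 441 | 19.2% |
| - | 379 | 16.5% |
| 0 | 305 | 13.3% |
| 8 | 274 | 11.9% |
| 6 | 272 | 11.9% |
| 7 | 130 | 5.7% |
| 5 | 115 | 5.0% |
| 3 | 97 | 4.2% |
| 1 | 95 | 4.1% |
| 2 | 94 | 4.1% |
| Other values (2) | 93 | 4.1% |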

Most occurring blocks

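| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2295 | 100.0% |

Most frequent character per block

ASCII

| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 441 | 19.2% |
| - | 379 | 16.5% |
| 0 | 305 | 13.3% |
| 8 | 274 | 11.9% |
| 6 | 272 | 11.9% |
| 7 | 130 | 5.7% |
| 5 | 115 | 5.0% |
| 3 | 97 | 4.2% |
| 1 | 95 | 4.1% |
| 2 | 94 | 4.1% |
| Other values (2) | 93 | 4.1% |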

신고일자
Date

Distinct: 506
Distinct (%): 78.7%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB
Minimum: 1969-02-12 00:00:00
Maximum: 2024-04-19 00:00:00
Histogram with fixed size bins (bins=50)
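
To give a sense of scale for this histogram, the sketch below computes the bin width under the assumption that the 50 bins are equal-width intervals between the Minimum and Maximum dates above. The plotting code itself is not part of this report, so this is an approximation of the layout, not a reproduction.

```python
from datetime import datetime

minimum = datetime(1969, 2, 12)
maximum = datetime(2024, 4, 19)
bins = 50

span = maximum - minimum   # total range covered by the histogram
width = span / bins        # width of one equal-sized bin (a timedelta)
edges = [minimum + i * width for i in range(bins + 1)]

print(span.days, "days in total")
print(width, "per bin")
print(edges[0], "to", edges[1], "is the first bin")
```

Each bin therefore covers a little over 13 months of report dates.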

등록기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.2 KiB
Minimum: 2024-04-30 00:00:00
Maximum: 2024-04-30 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
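
One common way to compute a nullity correlation is the Pearson correlation between the presence indicators (1 = present, 0 = missing) of two columns. The sketch below implements that definition with the standard library only; the function name and the toy columns are illustrative, and the report does not confirm that its heatmap uses exactly this formula.

```python
from math import isnan, sqrt
from typing import Optional, Sequence

def nullity_correlation(a: Sequence[Optional[object]],
                        b: Sequence[Optional[object]]) -> float:
    """Pearson correlation between the presence indicators of two columns."""
    x = [0.0 if v is None else 1.0 for v in a]
    y = [0.0 if v is None else 1.0 for v in b]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    var_y = sum((yi - mean_y) ** 2 for yi in y)
    if var_x == 0 or var_y == 0:
        # A column that is always present (or always missing) has no variance.
        return float("nan")
    return cov / sqrt(var_x * var_y)

# Toy columns (not from the dataset).
left = ["a", None, "b", None]
same = ["x", None, "y", None]          # missing in the same rows as `left`
opposite = [None, "x", None, "y"]      # missing exactly where `left` is present
complete = ["p", "q", "r", "s"]        # never missing

print(nullity_correlation(left, same))
print(nullity_correlation(left, opposite))
print(isnan(nullity_correlation(left, complete)))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one is present when the other is missing, and fully populated columns have no defined correlation.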

Sample

First rows

| | 사업체명칭 | 사업체소재지(도로명주소) | 업종 | 전화번호 | 신고일자 | 등록기준일 |
|---|---|---|---|---|---|---|
| 0 | 금강출판사 | 세종특별자치시 금남면 금병로 31-9 | 출판사 | 044-866-8787 | 1985-09-16 | 2024-04-30 |
| 1 | (주)미래엔 | 세종특별자치시 연동면 청연로 492-14 | 출판사 | 044-861-3142 | 1999-05-17 | 2024-04-30 |
| 2 | 하나로출판사 | 세종특별자치시 소정면 세종로 4283 | 출판사 | 044-553-7097 | 1999-09-23 | 2024-04-30 |
| 3 | 대학가출판사 | 세종특별자치시 전동면 하노장5길 2 | 출판사 | <NA> | 1999-02-10 | 2024-04-30 |
| 4 | 도서출판 은진 | 세종특별자치시 조치원읍 충현로 159 | 출판사 | <NA> | 2004-10-15 | 2024-04-30 |
| 5 | 주식회사 아홉거리 미디어 윌 | 세종특별자치시 조치원읍 신안새동네길 6 | 출판사 | 044-866-6044+044-866-6045 | 2005-09-08 | 2024-04-30 |
| 6 | 명현디자인기획 | 세종특별자치시 조치원읍 충현로 209 | 출판사 | 044-865-5818 | 2005-11-02 | 2024-04-30 |
| 7 | 나와우리 | 세종특별자치시 조치원읍 새내8길 22-13 | 출판사 | <NA> | 2006-01-12 | 2024-04-30 |
| 8 | 알파오피스 | 세종특별자치시 조치원읍 새내15길 2 | 출판사 | <NA> | 2006-01-12 | 2024-04-30 |
| 9 | 세종아이콘(주) | 세종특별자치시 조치원읍 세종로 2639 | 출판사 | 044-865-1357 | 2006-07-18 | 2024-04-30 |

Last rows

| | 사업체명칭 | 사업체소재지(도로명주소) | 업종 | 전화번호 | 신고일자 | 등록기준일 |
|---|---|---|---|---|---|---|
| 633 | 대승문화 | 세종특별자치시 연기면 공단로 184-28 | 인쇄사 | <NA> | 2022-11-01 | 2024-04-30 |
| 634 | (주)어진기획 | 세종특별자치시 마음로 272-9 | 인쇄사 | <NA> | 2022-11-17 | 2024-04-30 |
| 635 | 세종프린팅 | 세종특별자치시 금남면 금병로 903 | 인쇄사 | 044-999-4103 | 2022-11-28 | 2024-04-30 |
| 636 | 주식회사 다인스 | 세종특별자치시 장군면 월현윗길 63-3 | 인쇄사 | 044-868-4447 | 2013-03-20 | 2024-04-30 |
| 637 | 세종인쇄제본 | 세종특별자치시 절재로 194 | 인쇄사 | 070-7576-0650 | 2023-08-10 | 2024-04-30 |
| 638 | 다나기획 | 세종특별자치시 연서면 봉암길 18 | 인쇄사 | 044-866-4415 | 2023-10-11 | 2024-04-30 |
| 639 | (사)한국나눔복지연합회 아이씨티사업소 | 세종특별자치시 전의면 부거실길 52 | 인쇄사 | <NA> | 2023-10-26 | 2024-04-30 |
| 640 | 크리커뮤니케이션 | 세종특별자치시 연서면 함박로 312 | 인쇄사 | <NA> | 2024-02-08 | 2024-04-30 |
| 641 | 창조기획 | 세종특별자치시 장군면 장척로 397-15 | 인쇄사 | <NA> | 2024-03-05 | 2024-04-30 |
| 642 | 길기획 주식회사 | 세종특별자치시 장군면 장척로 397-15 | 인쇄사 | <NA> | 2024-03-05 | 2024-04-30 |