Overview

Dataset statistics

Number of variables4
Number of observations365
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory11.5 KiB
Average record size in memory32.4 B

Variable types

Categorical2
Text2

Dataset

Description대구광역시 수성구 관내 출판사현황에 대한 데이터로 사업체 명칭, 사업체 소재지, 영업상태에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15054718/fileData.do

Alerts

업종 has constant value ""Constant
영업상태 has constant value ""Constant
Dataset has 1 (0.3%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 21:57:18.927487
Analysis finished2023-12-12 21:57:19.406226
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
출판사
365 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 365
100.0%

Length

2023-12-13T06:57:19.487828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:19.601045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 365
100.0%
Distinct364
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-13T06:57:19.850234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length21
Mean length7.369863
Min length2

Characters and Unicode

Total characters2690
Distinct characters416
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique363 ?
Unique (%)99.5%

Sample

1st row무법이행원출판사
2nd row도서출판 아름다운 사람들
3rd row재단법인동일문화장학재단
4th row대성출판사
5th row도서출판 신세계
ValueCountFrequency (%)
도서출판 59
 
10.5%
주식회사 24
 
4.3%
출판사 10
 
1.8%
디자인 4
 
0.7%
출판 4
 
0.7%
사단법인 3
 
0.5%
북스 3
 
0.5%
에듀 2
 
0.4%
문깡 2
 
0.4%
2
 
0.4%
Other values (442) 449
79.9%
2023-12-13T06:57:20.260739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
197
 
7.3%
106
 
3.9%
106
 
3.9%
74
 
2.8%
71
 
2.6%
70
 
2.6%
55
 
2.0%
52
 
1.9%
47
 
1.7%
( 46
 
1.7%
Other values (406) 1866
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2129
79.1%
Space Separator 197
 
7.3%
Lowercase Letter 140
 
5.2%
Uppercase Letter 112
 
4.2%
Open Punctuation 46
 
1.7%
Close Punctuation 46
 
1.7%
Decimal Number 12
 
0.4%
Other Punctuation 6
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
5.0%
106
 
5.0%
74
 
3.5%
71
 
3.3%
70
 
3.3%
55
 
2.6%
52
 
2.4%
47
 
2.2%
39
 
1.8%
36
 
1.7%
Other values (350) 1473
69.2%
Uppercase Letter
ValueCountFrequency (%)
E 11
 
9.8%
M 10
 
8.9%
S 9
 
8.0%
A 8
 
7.1%
L 7
 
6.2%
O 7
 
6.2%
B 6
 
5.4%
C 6
 
5.4%
D 6
 
5.4%
F 5
 
4.5%
Other values (13) 37
33.0%
Lowercase Letter
ValueCountFrequency (%)
e 16
11.4%
n 16
11.4%
o 15
10.7%
i 14
10.0%
a 11
 
7.9%
t 11
 
7.9%
s 8
 
5.7%
r 6
 
4.3%
c 6
 
4.3%
m 6
 
4.3%
Other values (9) 31
22.1%
Decimal Number
ValueCountFrequency (%)
2 3
25.0%
1 2
16.7%
5 2
16.7%
9 2
16.7%
3 2
16.7%
7 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 2
33.3%
' 2
33.3%
& 1
16.7%
% 1
16.7%
Space Separator
ValueCountFrequency (%)
197
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2123
78.9%
Common 309
 
11.5%
Latin 252
 
9.4%
Han 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
5.0%
106
 
5.0%
74
 
3.5%
71
 
3.3%
70
 
3.3%
55
 
2.6%
52
 
2.4%
47
 
2.2%
39
 
1.8%
36
 
1.7%
Other values (344) 1467
69.1%
Latin
ValueCountFrequency (%)
e 16
 
6.3%
n 16
 
6.3%
o 15
 
6.0%
i 14
 
5.6%
a 11
 
4.4%
t 11
 
4.4%
E 11
 
4.4%
M 10
 
4.0%
S 9
 
3.6%
s 8
 
3.2%
Other values (32) 131
52.0%
Common
ValueCountFrequency (%)
197
63.8%
( 46
 
14.9%
) 46
 
14.9%
2 3
 
1.0%
1 2
 
0.6%
5 2
 
0.6%
9 2
 
0.6%
. 2
 
0.6%
3 2
 
0.6%
' 2
 
0.6%
Other values (4) 5
 
1.6%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2123
78.9%
ASCII 561
 
20.9%
CJK 6
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
197
35.1%
( 46
 
8.2%
) 46
 
8.2%
e 16
 
2.9%
n 16
 
2.9%
o 15
 
2.7%
i 14
 
2.5%
a 11
 
2.0%
t 11
 
2.0%
E 11
 
2.0%
Other values (46) 178
31.7%
Hangul
ValueCountFrequency (%)
106
 
5.0%
106
 
5.0%
74
 
3.5%
71
 
3.3%
70
 
3.3%
55
 
2.6%
52
 
2.4%
47
 
2.2%
39
 
1.8%
36
 
1.7%
Other values (344) 1467
69.1%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct151
Distinct (%)41.4%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-13T06:57:20.554278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length19
Mean length14.90137
Min length13

Characters and Unicode

Total characters5439
Distinct characters76
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)25.5%

Sample

1st row대구광역시 수성구 파동로25길 수성구 파동로25길
2nd row대구광역시 수성구 파동로44길
3rd row대구광역시 수성구 달구벌대로
4th row대구광역시 수성구 교학로
5th row대구광역시 수성구 수성로
ValueCountFrequency (%)
수성구 366
33.4%
대구광역시 365
33.3%
달구벌대로 26
 
2.4%
동대구로 25
 
2.3%
청수로 13
 
1.2%
용학로 11
 
1.0%
수성로 11
 
1.0%
들안로 10
 
0.9%
청호로 9
 
0.8%
화랑로 9
 
0.8%
Other values (142) 252
23.0%
2023-12-13T06:57:21.063147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
827
15.2%
732
13.5%
461
8.5%
413
7.6%
391
 
7.2%
370
 
6.8%
365
 
6.7%
365
 
6.7%
365
 
6.7%
152
 
2.8%
Other values (66) 998
18.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4377
80.5%
Space Separator 732
 
13.5%
Decimal Number 330
 
6.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
827
18.9%
461
10.5%
413
9.4%
391
8.9%
370
8.5%
365
8.3%
365
8.3%
365
8.3%
152
 
3.5%
63
 
1.4%
Other values (55) 605
13.8%
Decimal Number
ValueCountFrequency (%)
4 54
16.4%
2 49
14.8%
1 49
14.8%
5 38
11.5%
6 36
10.9%
0 26
7.9%
8 23
7.0%
3 20
 
6.1%
7 18
 
5.5%
9 17
 
5.2%
Space Separator
ValueCountFrequency (%)
732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4377
80.5%
Common 1062
 
19.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
827
18.9%
461
10.5%
413
9.4%
391
8.9%
370
8.5%
365
8.3%
365
8.3%
365
8.3%
152
 
3.5%
63
 
1.4%
Other values (55) 605
13.8%
Common
ValueCountFrequency (%)
732
68.9%
4 54
 
5.1%
2 49
 
4.6%
1 49
 
4.6%
5 38
 
3.6%
6 36
 
3.4%
0 26
 
2.4%
8 23
 
2.2%
3 20
 
1.9%
7 18
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4377
80.5%
ASCII 1062
 
19.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
827
18.9%
461
10.5%
413
9.4%
391
8.9%
370
8.5%
365
8.3%
365
8.3%
365
8.3%
152
 
3.5%
63
 
1.4%
Other values (55) 605
13.8%
ASCII
ValueCountFrequency (%)
732
68.9%
4 54
 
5.1%
2 49
 
4.6%
1 49
 
4.6%
5 38
 
3.6%
6 36
 
3.4%
0 26
 
2.4%
8 23
 
2.2%
3 20
 
1.9%
7 18
 
1.7%

영업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
영업중
365 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 365
100.0%

Length

2023-12-13T06:57:21.220083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:21.342669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 365
100.0%

Missing values

2023-12-13T06:57:19.263117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:57:19.368613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종사업체명칭사업체소재지영업상태
0출판사무법이행원출판사대구광역시 수성구 파동로25길 수성구 파동로25길영업중
1출판사도서출판 아름다운 사람들대구광역시 수성구 파동로44길영업중
2출판사재단법인동일문화장학재단대구광역시 수성구 달구벌대로영업중
3출판사대성출판사대구광역시 수성구 교학로영업중
4출판사도서출판 신세계대구광역시 수성구 수성로영업중
5출판사도서출판 신기원대구광역시 수성구 동대구로영업중
6출판사청산문화사대구광역시 수성구 상록로영업중
7출판사도서출판 작가콜로퀴엄대구광역시 수성구 용학로48길영업중
8출판사한국기획대구광역시 수성구 들안로영업중
9출판사도서출판 사이대구광역시 수성구 들안로19길영업중
업종사업체명칭사업체소재지영업상태
355출판사주식회사 엠지뉴턴대구광역시 수성구 명덕로영업중
356출판사클레이키위대구광역시 수성구 청수로24길영업중
357출판사다원엠디에스출판대구광역시 수성구 달구벌대로영업중
358출판사주식회사 대경아이앤씨대구광역시 수성구 달구벌대로영업중
359출판사장졸당대구광역시 수성구 달구벌대로영업중
360출판사주식회사 지웍스대구광역시 수성구 청수로영업중
361출판사주식회사 스마트업대구광역시 수성구 알파시티1로31길영업중
362출판사디자인 마냑대구광역시 수성구 달구벌대로511길영업중
363출판사에하드대구광역시 수성구 청호로영업중
364출판사행운대구광역시 수성구 범어로26길영업중

Duplicate rows

Most frequently occurring

업종사업체명칭사업체소재지영업상태# duplicates
0출판사제이컴대구광역시 수성구 화랑로영업중2