Overview

Dataset statistics

Number of variables5
Number of observations1354
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory54.3 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical3
Text1

Dataset

Description2022년 6월 21일 기준 광주광역시에 등록 신고 및 영업 중인 출판사 현황 데이터입니다. 출판사의 상호(명), 주소(구단위) 등의 항목을 제공합니다. 실시간 정보 검색 기능은 [문화체육관광부 출판사/인쇄사 검색 시스템]에서 이용 가능합니다. ※ 주소 링크 : http://book.mcst.go.kr/html/searchList.php?search_area=6290000&search_state=1&search_kind=1&search_type=&search_word=&x=36&y=46
Author광주광역시
URLhttps://www.data.go.kr/data/15056464/fileData.do

Alerts

등록구분 has constant value ""Constant
데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 주소High correlation
주소 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:26:50.200858
Analysis finished2023-12-12 14:26:50.823706
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1354
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean677.5
Minimum1
Maximum1354
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.0 KiB
2023-12-12T23:26:50.900763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile68.65
Q1339.25
median677.5
Q31015.75
95-th percentile1286.35
Maximum1354
Range1353
Interquartile range (IQR)676.5

Descriptive statistics

Standard deviation391.01044
Coefficient of variation (CV)0.57713719
Kurtosis-1.2
Mean677.5
Median Absolute Deviation (MAD)338.5
Skewness0
Sum917335
Variance152889.17
MonotonicityStrictly increasing
2023-12-12T23:26:51.052935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
901 1
 
0.1%
909 1
 
0.1%
908 1
 
0.1%
907 1
 
0.1%
906 1
 
0.1%
905 1
 
0.1%
904 1
 
0.1%
903 1
 
0.1%
902 1
 
0.1%
Other values (1344) 1344
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1354 1
0.1%
1353 1
0.1%
1352 1
0.1%
1351 1
0.1%
1350 1
0.1%
1349 1
0.1%
1348 1
0.1%
1347 1
0.1%
1346 1
0.1%
1345 1
0.1%

등록구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
출판사
1354 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 1354
100.0%

Length

2023-12-12T23:26:51.215538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:26:51.322733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 1354
100.0%

상호
Text

Distinct1324
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
2023-12-12T23:26:51.592299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length23
Mean length6.711226
Min length1

Characters and Unicode

Total characters9087
Distinct characters602
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1295 ?
Unique (%)95.6%

Sample

1st row호남 문화사
2nd row한국복음문서협회출판사
3rd row태광출판사
4th row복음 문화사
5th row광일문화사
ValueCountFrequency (%)
주식회사 79
 
4.5%
도서출판 77
 
4.4%
디자인 24
 
1.4%
출판사 15
 
0.9%
사단법인 6
 
0.3%
유한회사 5
 
0.3%
무점포 5
 
0.3%
기획 5
 
0.3%
스튜디오 5
 
0.3%
4
 
0.2%
Other values (1464) 1534
87.2%
2023-12-12T23:26:52.059867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
405
 
4.5%
332
 
3.7%
296
 
3.3%
280
 
3.1%
261
 
2.9%
259
 
2.9%
220
 
2.4%
) 220
 
2.4%
( 216
 
2.4%
177
 
1.9%
Other values (592) 6421
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7641
84.1%
Space Separator 405
 
4.5%
Lowercase Letter 285
 
3.1%
Uppercase Letter 253
 
2.8%
Close Punctuation 220
 
2.4%
Open Punctuation 216
 
2.4%
Decimal Number 38
 
0.4%
Other Punctuation 22
 
0.2%
Dash Punctuation 6
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
332
 
4.3%
296
 
3.9%
280
 
3.7%
261
 
3.4%
259
 
3.4%
220
 
2.9%
177
 
2.3%
155
 
2.0%
155
 
2.0%
151
 
2.0%
Other values (527) 5355
70.1%
Uppercase Letter
ValueCountFrequency (%)
S 27
 
10.7%
A 20
 
7.9%
M 17
 
6.7%
C 15
 
5.9%
O 15
 
5.9%
P 14
 
5.5%
E 14
 
5.5%
T 13
 
5.1%
B 12
 
4.7%
L 12
 
4.7%
Other values (14) 94
37.2%
Lowercase Letter
ValueCountFrequency (%)
i 30
10.5%
e 30
10.5%
o 28
 
9.8%
a 25
 
8.8%
s 21
 
7.4%
u 17
 
6.0%
n 17
 
6.0%
t 17
 
6.0%
l 15
 
5.3%
r 15
 
5.3%
Other values (12) 70
24.6%
Decimal Number
ValueCountFrequency (%)
1 20
52.6%
5 5
 
13.2%
2 4
 
10.5%
8 4
 
10.5%
0 1
 
2.6%
4 1
 
2.6%
3 1
 
2.6%
7 1
 
2.6%
9 1
 
2.6%
Other Punctuation
ValueCountFrequency (%)
. 10
45.5%
& 7
31.8%
/ 3
 
13.6%
' 1
 
4.5%
, 1
 
4.5%
Space Separator
ValueCountFrequency (%)
405
100.0%
Close Punctuation
ValueCountFrequency (%)
) 220
100.0%
Open Punctuation
ValueCountFrequency (%)
( 216
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7627
83.9%
Common 908
 
10.0%
Latin 538
 
5.9%
Han 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
332
 
4.4%
296
 
3.9%
280
 
3.7%
261
 
3.4%
259
 
3.4%
220
 
2.9%
177
 
2.3%
155
 
2.0%
155
 
2.0%
151
 
2.0%
Other values (513) 5341
70.0%
Latin
ValueCountFrequency (%)
i 30
 
5.6%
e 30
 
5.6%
o 28
 
5.2%
S 27
 
5.0%
a 25
 
4.6%
s 21
 
3.9%
A 20
 
3.7%
u 17
 
3.2%
n 17
 
3.2%
t 17
 
3.2%
Other values (36) 306
56.9%
Common
ValueCountFrequency (%)
405
44.6%
) 220
24.2%
( 216
23.8%
1 20
 
2.2%
. 10
 
1.1%
& 7
 
0.8%
- 6
 
0.7%
5 5
 
0.6%
2 4
 
0.4%
8 4
 
0.4%
Other values (9) 11
 
1.2%
Han
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7627
83.9%
ASCII 1446
 
15.9%
CJK 13
 
0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
405
28.0%
) 220
15.2%
( 216
14.9%
i 30
 
2.1%
e 30
 
2.1%
o 28
 
1.9%
S 27
 
1.9%
a 25
 
1.7%
s 21
 
1.5%
1 20
 
1.4%
Other values (55) 424
29.3%
Hangul
ValueCountFrequency (%)
332
 
4.4%
296
 
3.9%
280
 
3.7%
261
 
3.4%
259
 
3.4%
220
 
2.9%
177
 
2.3%
155
 
2.0%
155
 
2.0%
151
 
2.0%
Other values (513) 5341
70.0%
CJK
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

주소
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
광주광역시 동구
577 
광주광역시 북구
243 
광주광역시 남구
220 
광주광역시 서구
180 
광주광역시 광산구
134 

Length

Max length9
Median length8
Mean length8.098966
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주광역시 동구
2nd row광주광역시 동구
3rd row광주광역시 동구
4th row광주광역시 동구
5th row광주광역시 동구

Common Values

ValueCountFrequency (%)
광주광역시 동구 577
42.6%
광주광역시 북구 243
17.9%
광주광역시 남구 220
 
16.2%
광주광역시 서구 180
 
13.3%
광주광역시 광산구 134
 
9.9%

Length

2023-12-12T23:26:52.241728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:26:52.349525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광주광역시 1354
50.0%
동구 577
21.3%
북구 243
 
9.0%
남구 220
 
8.1%
서구 180
 
6.6%
광산구 134
 
4.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
2022-06-21
1354 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-06-21
2nd row2022-06-21
3rd row2022-06-21
4th row2022-06-21
5th row2022-06-21

Common Values

ValueCountFrequency (%)
2022-06-21 1354
100.0%

Length

2023-12-12T23:26:52.476372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:26:52.596163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-06-21 1354
100.0%

Interactions

2023-12-12T23:26:50.553164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:26:52.668586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주소
연번1.0000.995
주소0.9951.000
2023-12-12T23:26:52.746276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주소
연번1.0000.901
주소0.9011.000

Missing values

2023-12-12T23:26:50.686448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:26:50.782315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번등록구분상호주소데이터기준일자
01출판사호남 문화사광주광역시 동구2022-06-21
12출판사한국복음문서협회출판사광주광역시 동구2022-06-21
23출판사태광출판사광주광역시 동구2022-06-21
34출판사복음 문화사광주광역시 동구2022-06-21
45출판사광일문화사광주광역시 동구2022-06-21
56출판사삼화문화사광주광역시 동구2022-06-21
67출판사광신출판사광주광역시 동구2022-06-21
78출판사도서출판 예원광주광역시 동구2022-06-21
89출판사국제칼라광주광역시 동구2022-06-21
910출판사새날 출판사광주광역시 동구2022-06-21
연번등록구분상호주소데이터기준일자
13441345출판사(주)티엑스포광주광역시 광산구2022-06-21
13451346출판사나빌레라광주광역시 광산구2022-06-21
13461347출판사다미광주광역시 광산구2022-06-21
13471348출판사다미광주광역시 광산구2022-06-21
13481349출판사사유서(사유書)광주광역시 광산구2022-06-21
13491350출판사디자인나울광주광역시 광산구2022-06-21
13501351출판사씨줄과날줄광주광역시 광산구2022-06-21
13511352출판사반조광주광역시 광산구2022-06-21
13521353출판사북팟(Book pot)광주광역시 광산구2022-06-21
13531354출판사(주)디자인아이엠광주광역시 광산구2022-06-21