Overview

Dataset statistics

Number of variables5
Number of observations241
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory9.5 KiB
Average record size in memory40.5 B

Variable types

Categorical2
Text3

Dataset

Description울산광역시 남구 구립도서관 추천도서에 대한 데이터로 도서관구분, 추천도서명, 저자, 출판사, 추천년월 등의 항목을 제공합니다.
Author울산광역시 남구
URLhttps://www.data.go.kr/data/15049366/fileData.do

Alerts

Dataset has 1 (0.4%) duplicate rowsDuplicates
도서관 구분 is highly overall correlated with 추천년월일High correlation
추천년월일 is highly overall correlated with 도서관 구분High correlation

Reproduction

Analysis started2024-03-14 16:38:11.683783
Analysis finished2024-03-14 16:38:12.864967
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서관 구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
신복
71 
도산
68 
월봉
57 
옥현
45 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신복
2nd row신복
3rd row신복
4th row신복
5th row신복

Common Values

ValueCountFrequency (%)
신복 71
29.5%
도산 68
28.2%
월봉 57
23.7%
옥현 45
18.7%

Length

2024-03-15T01:38:13.058946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:38:13.254456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신복 71
29.5%
도산 68
28.2%
월봉 57
23.7%
옥현 45
18.7%
Distinct235
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-03-15T01:38:14.814833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length29
Mean length11.759336
Min length2

Characters and Unicode

Total characters2834
Distinct characters457
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)95.9%

Sample

1st row문과 남자의 과학 공부
2nd row꿀벌의 예언 1
3rd row꿀벌의 예언 2
4th row10대를 위한 첫 아바타 경제 수업
5th row과학자가 되는 시간
ValueCountFrequency (%)
13
 
1.6%
삭제 9
 
1.1%
나는 9
 
1.1%
위한 8
 
1.0%
6
 
0.7%
공부 6
 
0.7%
있는 5
 
0.6%
5
 
0.6%
미술관 5
 
0.6%
밖은 5
 
0.6%
Other values (630) 755
91.4%
2024-03-15T01:38:16.843345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
609
 
21.5%
75
 
2.6%
55
 
1.9%
51
 
1.8%
45
 
1.6%
44
 
1.6%
35
 
1.2%
30
 
1.1%
29
 
1.0%
29
 
1.0%
Other values (447) 1832
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2118
74.7%
Space Separator 609
 
21.5%
Decimal Number 36
 
1.3%
Other Punctuation 34
 
1.2%
Open Punctuation 12
 
0.4%
Close Punctuation 12
 
0.4%
Uppercase Letter 11
 
0.4%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
3.5%
55
 
2.6%
51
 
2.4%
45
 
2.1%
44
 
2.1%
35
 
1.7%
30
 
1.4%
29
 
1.4%
29
 
1.4%
27
 
1.3%
Other values (422) 1698
80.2%
Uppercase Letter
ValueCountFrequency (%)
S 3
27.3%
B 2
18.2%
E 1
 
9.1%
N 1
 
9.1%
H 1
 
9.1%
G 1
 
9.1%
P 1
 
9.1%
T 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
0 16
44.4%
1 10
27.8%
2 4
 
11.1%
3 3
 
8.3%
5 2
 
5.6%
4 1
 
2.8%
Other Punctuation
ValueCountFrequency (%)
, 15
44.1%
? 8
23.5%
: 5
 
14.7%
! 5
 
14.7%
· 1
 
2.9%
Open Punctuation
ValueCountFrequency (%)
[ 9
75.0%
( 3
 
25.0%
Close Punctuation
ValueCountFrequency (%)
] 9
75.0%
) 3
 
25.0%
Space Separator
ValueCountFrequency (%)
609
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2118
74.7%
Common 705
 
24.9%
Latin 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
3.5%
55
 
2.6%
51
 
2.4%
45
 
2.1%
44
 
2.1%
35
 
1.7%
30
 
1.4%
29
 
1.4%
29
 
1.4%
27
 
1.3%
Other values (422) 1698
80.2%
Common
ValueCountFrequency (%)
609
86.4%
0 16
 
2.3%
, 15
 
2.1%
1 10
 
1.4%
[ 9
 
1.3%
] 9
 
1.3%
? 8
 
1.1%
: 5
 
0.7%
! 5
 
0.7%
2 4
 
0.6%
Other values (7) 15
 
2.1%
Latin
ValueCountFrequency (%)
S 3
27.3%
B 2
18.2%
E 1
 
9.1%
N 1
 
9.1%
H 1
 
9.1%
G 1
 
9.1%
P 1
 
9.1%
T 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2118
74.7%
ASCII 715
 
25.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
609
85.2%
0 16
 
2.2%
, 15
 
2.1%
1 10
 
1.4%
[ 9
 
1.3%
] 9
 
1.3%
? 8
 
1.1%
: 5
 
0.7%
! 5
 
0.7%
2 4
 
0.6%
Other values (14) 25
 
3.5%
Hangul
ValueCountFrequency (%)
75
 
3.5%
55
 
2.6%
51
 
2.4%
45
 
2.1%
44
 
2.1%
35
 
1.7%
30
 
1.4%
29
 
1.4%
29
 
1.4%
27
 
1.3%
Other values (422) 1698
80.2%
None
ValueCountFrequency (%)
· 1
100.0%

저자
Text

Distinct228
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-03-15T01:38:17.839771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length33
Mean length7.659751
Min length2

Characters and Unicode

Total characters1846
Distinct characters279
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)90.5%

Sample

1st row유시민
2nd row베르나르 베르베르
3rd row베르나르 베르베르
4th row신진상
5th row템플 그랜딘
ValueCountFrequency (%)
그림 25
 
5.1%
17
 
3.5%
14
 
2.8%
지은이 14
 
2.8%
9
 
1.8%
9
 
1.8%
옮긴이 5
 
1.0%
옮김 5
 
1.0%
시빌 5
 
1.0%
들라크루아 5
 
1.0%
Other values (346) 384
78.0%
2024-03-15T01:38:19.096528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
263
 
14.2%
90
 
4.9%
57
 
3.1%
55
 
3.0%
) 44
 
2.4%
( 44
 
2.4%
42
 
2.3%
39
 
2.1%
+ 39
 
2.1%
38
 
2.1%
Other values (269) 1135
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1400
75.8%
Space Separator 263
 
14.2%
Other Punctuation 45
 
2.4%
Close Punctuation 44
 
2.4%
Open Punctuation 44
 
2.4%
Math Symbol 39
 
2.1%
Uppercase Letter 8
 
0.4%
Decimal Number 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
6.4%
57
 
4.1%
55
 
3.9%
42
 
3.0%
39
 
2.8%
38
 
2.7%
30
 
2.1%
26
 
1.9%
25
 
1.8%
23
 
1.6%
Other values (251) 975
69.6%
Uppercase Letter
ValueCountFrequency (%)
L 2
25.0%
S 1
12.5%
E 1
12.5%
B 1
12.5%
T 1
12.5%
O 1
12.5%
D 1
12.5%
Other Punctuation
ValueCountFrequency (%)
; 23
51.1%
/ 16
35.6%
. 5
 
11.1%
· 1
 
2.2%
Decimal Number
ValueCountFrequency (%)
6 1
33.3%
8 1
33.3%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
263
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Math Symbol
ValueCountFrequency (%)
+ 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1400
75.8%
Common 438
 
23.7%
Latin 8
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
6.4%
57
 
4.1%
55
 
3.9%
42
 
3.0%
39
 
2.8%
38
 
2.7%
30
 
2.1%
26
 
1.9%
25
 
1.8%
23
 
1.6%
Other values (251) 975
69.6%
Common
ValueCountFrequency (%)
263
60.0%
) 44
 
10.0%
( 44
 
10.0%
+ 39
 
8.9%
; 23
 
5.3%
/ 16
 
3.7%
. 5
 
1.1%
6 1
 
0.2%
8 1
 
0.2%
1 1
 
0.2%
Latin
ValueCountFrequency (%)
L 2
25.0%
S 1
12.5%
E 1
12.5%
B 1
12.5%
T 1
12.5%
O 1
12.5%
D 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1400
75.8%
ASCII 445
 
24.1%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
263
59.1%
) 44
 
9.9%
( 44
 
9.9%
+ 39
 
8.8%
; 23
 
5.2%
/ 16
 
3.6%
. 5
 
1.1%
L 2
 
0.4%
S 1
 
0.2%
E 1
 
0.2%
Other values (7) 7
 
1.6%
Hangul
ValueCountFrequency (%)
90
 
6.4%
57
 
4.1%
55
 
3.9%
42
 
3.0%
39
 
2.8%
38
 
2.7%
30
 
2.1%
26
 
1.9%
25
 
1.8%
23
 
1.6%
Other values (251) 975
69.6%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct145
Distinct (%)60.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-03-15T01:38:20.490848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length4.2904564
Min length2

Characters and Unicode

Total characters1034
Distinct characters225
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)45.2%

Sample

1st row돌베개
2nd row열린책들
3rd row열린책들
4th row체인지업
5th row창비
ValueCountFrequency (%)
창비 16
 
6.3%
문학동네 12
 
4.7%
책읽는곰 9
 
3.5%
위즈덤하우스 8
 
3.1%
웅진주니어 6
 
2.4%
풀빛 6
 
2.4%
북극곰 5
 
2.0%
비룡소 4
 
1.6%
자음과모음 4
 
1.6%
우리학교 4
 
1.6%
Other values (139) 181
71.0%
2024-03-15T01:38:22.049120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
4.1%
29
 
2.8%
27
 
2.6%
27
 
2.6%
25
 
2.4%
24
 
2.3%
22
 
2.1%
22
 
2.1%
20
 
1.9%
19
 
1.8%
Other values (215) 777
75.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 975
94.3%
Space Separator 25
 
2.4%
Uppercase Letter 14
 
1.4%
Decimal Number 8
 
0.8%
Lowercase Letter 5
 
0.5%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.3%
29
 
3.0%
27
 
2.8%
27
 
2.8%
24
 
2.5%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
1.9%
17
 
1.7%
Other values (199) 726
74.5%
Uppercase Letter
ValueCountFrequency (%)
K 4
28.6%
H 3
21.4%
R 3
21.4%
O 2
14.3%
B 1
 
7.1%
S 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
o 2
40.0%
s 1
20.0%
k 1
20.0%
b 1
20.0%
Decimal Number
ValueCountFrequency (%)
1 5
62.5%
2 3
37.5%
Space Separator
ValueCountFrequency (%)
25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
% 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 975
94.3%
Common 40
 
3.9%
Latin 19
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.3%
29
 
3.0%
27
 
2.8%
27
 
2.8%
24
 
2.5%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
1.9%
17
 
1.7%
Other values (199) 726
74.5%
Latin
ValueCountFrequency (%)
K 4
21.1%
H 3
15.8%
R 3
15.8%
o 2
10.5%
O 2
10.5%
B 1
 
5.3%
S 1
 
5.3%
s 1
 
5.3%
k 1
 
5.3%
b 1
 
5.3%
Common
ValueCountFrequency (%)
25
62.5%
1 5
 
12.5%
2 3
 
7.5%
( 3
 
7.5%
) 3
 
7.5%
% 1
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 975
94.3%
ASCII 59
 
5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
4.3%
29
 
3.0%
27
 
2.8%
27
 
2.8%
24
 
2.5%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
1.9%
17
 
1.7%
Other values (199) 726
74.5%
ASCII
ValueCountFrequency (%)
25
42.4%
1 5
 
8.5%
K 4
 
6.8%
H 3
 
5.1%
R 3
 
5.1%
2 3
 
5.1%
( 3
 
5.1%
) 3
 
5.1%
o 2
 
3.4%
O 2
 
3.4%
Other values (6) 6
 
10.2%

추천년월일
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-10-29
22 
2023-08-29
 
15
2023-08-01
 
14
2023-09-26
 
13
2023-12-28
 
13
Other values (24)
164 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-01
2nd row2023-08-01
3rd row2023-08-01
4th row2023-08-01
5th row2023-08-01

Common Values

ValueCountFrequency (%)
2023-10-29 22
 
9.1%
2023-08-29 15
 
6.2%
2023-08-01 14
 
5.8%
2023-09-26 13
 
5.4%
2023-12-28 13
 
5.4%
2023-12-01 13
 
5.4%
2023-10-31 12
 
5.0%
2023-11-28 11
 
4.6%
2023-12-29 11
 
4.6%
2023-09-23 11
 
4.6%
Other values (19) 106
44.0%

Length

2024-03-15T01:38:22.460309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2023-10-29 22
 
9.1%
2023-08-29 15
 
6.2%
2023-08-01 14
 
5.8%
2023-09-26 13
 
5.4%
2023-12-28 13
 
5.4%
2023-12-01 13
 
5.4%
2023-10-31 12
 
5.0%
2023-11-28 11
 
4.6%
2023-12-29 11
 
4.6%
2023-09-23 11
 
4.6%
Other values (19) 106
44.0%

Correlations

2024-03-15T01:38:22.680352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관 구분추천년월일
도서관 구분1.0000.937
추천년월일0.9371.000
2024-03-15T01:38:22.836398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관 구분추천년월일
도서관 구분1.0000.750
추천년월일0.7501.000
2024-03-15T01:38:22.975233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관 구분추천년월일
도서관 구분1.0000.750
추천년월일0.7501.000

Missing values

2024-03-15T01:38:12.565130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:38:12.797489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서관 구분추천도서명저자출판사추천년월일
0신복문과 남자의 과학 공부유시민돌베개2023-08-01
1신복꿀벌의 예언 1베르나르 베르베르열린책들2023-08-01
2신복꿀벌의 예언 2베르나르 베르베르열린책들2023-08-01
3신복10대를 위한 첫 아바타 경제 수업신진상체인지업2023-08-01
4신복과학자가 되는 시간템플 그랜딘창비2023-08-01
5신복호랑이가 눈뜰 때이윤하창비2023-08-01
6월봉한 명김숨현대문학2023-08-01
7월봉동주 : 하늘과 바람과 별과 시윤동주문예춘추사2023-08-01
8월봉게으른 십대를 위한 작은 습관의 힘장근영메이트북스2023-08-01
9월봉이런 공부법은 처음이야신종호21세기북스2023-08-01
도서관 구분추천도서명저자출판사추천년월일
231도산이제는 대학이 아니라 직업이다손영배생각비행2024-01-30
232도산세상 좀 바꾸고 갈게요제이미 마골린 저/정아영 역서해문집2024-01-30
233도산작은 일에 상처받지 않고 용기 있는 아이로 키우는 법스즈키 하야토 저/이선주 역다산에듀2024-01-30
234도산박상미의 가족 상담소- 모르면 오해하기 쉽고, 알면 사랑하기 쉽다박상미특별한서재2024-01-30
235도산노력의 배신김영훈21세기북스2024-01-30
236도산기분을 관리하면 인생이 관리된다김다슬클라우디아2024-01-30
237도산탄소가 기후 위기랑 무슨 상관이야정지윤파란의자2024-01-31
238도산나는 정말 어디에 있는 걸까요시타케 신스케주니어 김영사2024-01-31
239도산여름비신경아논장2024-01-31
240도산마녀식당김신희북극곰2024-01-31

Duplicate rows

Most frequently occurring

도서관 구분추천도서명저자출판사추천년월일# duplicates
0신복[삭제] 창 밖은 미술관시빌 들라크루아 글 그림책읽는곰2023-12-293