Overview

Dataset statistics

Number of variables: 4
Number of observations: 83
Missing cells: 8
Missing cells (%): 2.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 34.6 B
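
The missing-cell percentage follows from the counts above: 8 missing cells out of 4 × 83 = 332 cells. A minimal sketch of that arithmetic (the variable names are illustrative and not part of the report):

```python
# Counts taken from the dataset statistics above.
n_variables = 4
n_observations = 83
missing_cells = 8

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```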

Variable types

Text: 3
Categorical: 1

Dataset

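Description: Gyeonggi-do Goyang-si Library Center review score information (컨텐츠한글명, 점수): reviews and scores from readers who used e-books provided by the Goyang City Library Center
Author: Goyang-si, Gyeonggi-do
URL: https://www.data.go.kr/data/15086037/fileData.do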

Alerts

리뷰내용 has 8 (9.6%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 20:18:05.496744
Analysis finished: 2023-12-12 20:18:06.195696
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
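
The reported duration can be checked against the two timestamps above. A small sketch using only the standard library (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

# Timestamp layout assumed from the "Analysis started/finished" lines.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 20:18:05.496744", FMT)
finished = datetime.strptime("2023-12-12 20:18:06.195696", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```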

Variables

바코드
Text

Distinct: 78
Distinct (%): 94.0%
Missing: 0
Missing (%): 0.0%
Memory size: 796.0 B

Length

Max length: 14
Median length: 14
Mean length: 12.60241
Min length: 7
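
The length statistics are plain summaries of per-value string lengths. A minimal sketch on the five sample values the report displays for this variable (the helper name `length_stats` is hypothetical, and the displayed values may differ from the stored ones, for example through whitespace, so the output is not expected to match the full-column figures):

```python
from statistics import mean, median

def length_stats(values):
    """Summarise the string lengths of a list of text values."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The five sample rows shown in the report, as displayed.
sample = ["3060101", "104396016", "115216313", "4801130612240", "4801130619300"]
stats = length_stats(sample)
print(stats)

# For the full column, the mean length equals total characters / observations.
full_mean = 1046 / 83
print(round(full_mean, 5))
```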

Characters and Unicode

Total characters: 1046
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
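
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point (it does not expose scripts or blocks). The sketch below counts categories for two example strings; the trailing space in the second string is a hypothetical input chosen to show a Space Separator, not a value confirmed by the report:

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in the values."""
    return Counter(unicodedata.category(ch) for v in values for ch in v)

# "Nd" = Decimal Number, "Lu" = Uppercase Letter, "Zs" = Space Separator.
counts = category_counts(["WEB0081", "4801130612240 "])
print(counts)
```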

Unique

Unique: 74
Unique (%): 89.2%
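
"Distinct" counts how many different values occur, while "Unique" counts the values that occur exactly once. A minimal sketch with synthetic values (one value three times, three values twice, 74 values once), which reproduces Distinct 78 and Unique 74 for 83 observations; the helper name is hypothetical:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Synthetic stand-ins for the 83 observations, not the real barcodes.
values = ["a"] * 3 + ["b"] * 2 + ["c"] * 2 + ["d"] * 2 + [f"v{i}" for i in range(74)]
distinct, unique = distinct_and_unique(values)
print(len(values), distinct, unique)
```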

Sample

1st row: 3060101
2nd row: 104396016
3rd row: 115216313
4th row: 4801130612240
5th row: 4801130619300
Value              Count  Frequency (%)
4801165226610      3      3.6%
4808954654753      2      2.4%
web0081            2      2.4%
web0084            2      2.4%
4808954622035      1      1.2%
4808965962205      1      1.2%
4808990982407      1      1.2%
4808982818226      1      1.2%
4808980104802      1      1.2%
4808979141276      1      1.2%
Other values (68)  68     81.9%

Most occurring characters

Value             Count  Frequency (%)
0                 169    16.2%
8                 142    13.6%
1                 129    12.3%
4                 112    10.7%
9                 84     8.0%
6                 72     6.9%
(space)           69     6.6%
2                 67     6.4%
5                 64     6.1%
7                 56     5.4%
Other values (7)  82     7.8%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    949    90.7%
Space Separator   69     6.6%
Uppercase Letter  28     2.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      169    17.8%
8      142    15.0%
1      129    13.6%
4      112    11.8%
9      84     8.9%
6      72     7.6%
2      67     7.1%
5      64     6.7%
7      56     5.9%
3      54     5.7%

Uppercase Letter
Value  Count  Frequency (%)
W      7      25.0%
B      7      25.0%
E      7      25.0%
X      3      10.7%
D      2      7.1%
N      2      7.1%

Space Separator
Value    Count  Frequency (%)
(space)  69     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  1018   97.3%
Latin   28     2.7%

Most frequent character per script

Common
Value    Count  Frequency (%)
0        169    16.6%
8        142    13.9%
1        129    12.7%
4        112    11.0%
9        84     8.3%
6        72     7.1%
(space)  69     6.8%
2        67     6.6%
5        64     6.3%
7        56     5.5%

Latin
Value  Count  Frequency (%)
W      7      25.0%
B      7      25.0%
E      7      25.0%
X      3      10.7%
D      2      7.1%
N      2      7.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1046   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 169    16.2%
8                 142    13.6%
1                 129    12.3%
4                 112    10.7%
9                 84     8.0%
6                 72     6.9%
(space)           69     6.6%
2                 67     6.4%
5                 64     6.1%
7                 56     5.4%
Other values (7)  82     7.8%

컨텐츠한글명
Text

Distinct: 78
Distinct (%): 94.0%
Missing: 0
Missing (%): 0.0%
Memory size: 796.0 B

Length

Max length: 48
Median length: 26
Mean length: 13.144578
Min length: 2

Characters and Unicode

Total characters: 1091
Distinct characters: 305
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 74
Unique (%): 89.2%

Sample

1st row: 베트남 10,000일의 전쟁
2nd row: 지구 끝의 온실
3rd row: 사라진 여자들
4th row: 레버리지
5th row: 마흔에게
Value               Count  Frequency (%)
1                   6      2.1%
[illegible]         4      1.4%
50대                3      1.0%
전자책              3      1.0%
만들기              3      1.0%
왕초보              3      1.0%
아이의              3      1.0%
말투                2      0.7%
[illegible]         2      0.7%
기적의              2      0.7%
Other values (235)  258    89.3%

Most occurring characters

Value               Count  Frequency (%)
(space)             206    18.9%
[illegible]         21     1.9%
[illegible]         20     1.8%
[illegible]         20     1.8%
[illegible]         17     1.6%
[illegible]         15     1.4%
[illegible]         14     1.3%
[illegible]         14     1.3%
1                   12     1.1%
0                   11     1.0%
Other values (295)  741    67.9%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       746    68.4%
Space Separator    206    18.9%
Lowercase Letter   49     4.5%
Decimal Number     32     2.9%
Other Punctuation  22     2.0%
Uppercase Letter   18     1.6%
Open Punctuation   8      0.7%
Close Punctuation  8      0.7%
Dash Punctuation   2      0.2%

Most frequent character per category

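Other Letter
Value               Count  Frequency (%)
[illegible]         21     2.8%
[illegible]         20     2.7%
[illegible]         20     2.7%
[illegible]         17     2.3%
[illegible]         15     2.0%
[illegible]         14     1.9%
[illegible]         14     1.9%
[illegible]         11     1.5%
[illegible]         11     1.5%
[illegible]         10     1.3%
Other values (250)  593    79.5%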

Lowercase Letter
Value             Count  Frequency (%)
n                 8      16.3%
a                 5      10.2%
o                 5      10.2%
e                 4      8.2%
i                 3      6.1%
v                 3      6.1%
t                 3      6.1%
g                 3      6.1%
h                 3      6.1%
r                 3      6.1%
Other values (6)  9      18.4%

Uppercase Letter
Value  Count  Frequency (%)
W      3      16.7%
Y      3      16.7%
V      2      11.1%
G      2      11.1%
E      2      11.1%
H      2      11.1%
N      1      5.6%
I      1      5.6%
D      1      5.6%
M      1      5.6%

Decimal Number
Value  Count  Frequency (%)
1      12     37.5%
0      11     34.4%
5      4      12.5%
3      1      3.1%
2      1      3.1%
7      1      3.1%
4      1      3.1%
8      1      3.1%

Other Punctuation
Value  Count  Frequency (%)
,      7      31.8%
:      5      22.7%
.      5      22.7%
?      4      18.2%
!      1      4.5%

Open Punctuation
Value  Count  Frequency (%)
(      7      87.5%
[      1      12.5%

Close Punctuation
Value  Count  Frequency (%)
)      7      87.5%
]      1      12.5%

Space Separator
Value    Count  Frequency (%)
(space)  206    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  746    68.4%
Common  278    25.5%
Latin   67     6.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[illegible]         21     2.8%
[illegible]         20     2.7%
[illegible]         20     2.7%
[illegible]         17     2.3%
[illegible]         15     2.0%
[illegible]         14     1.9%
[illegible]         14     1.9%
[illegible]         11     1.5%
[illegible]         11     1.5%
[illegible]         10     1.3%
Other values (250)  593    79.5%

Latin
Value              Count  Frequency (%)
n                  8      11.9%
a                  5      7.5%
o                  5      7.5%
e                  4      6.0%
i                  3      4.5%
v                  3      4.5%
t                  3      4.5%
g                  3      4.5%
h                  3      4.5%
W                  3      4.5%
Other values (16)  27     40.3%

Common
Value             Count  Frequency (%)
(space)           206    74.1%
1                 12     4.3%
0                 11     4.0%
(                 7      2.5%
)                 7      2.5%
,                 7      2.5%
:                 5      1.8%
.                 5      1.8%
5                 4      1.4%
?                 4      1.4%
Other values (9)  10     3.6%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  746    68.4%
ASCII   345    31.6%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
(space)            206    59.7%
1                  12     3.5%
0                  11     3.2%
n                  8      2.3%
(                  7      2.0%
)                  7      2.0%
,                  7      2.0%
:                  5      1.4%
.                  5      1.4%
a                  5      1.4%
Other values (35)  72     20.9%

Hangul
Value               Count  Frequency (%)
[illegible]         21     2.8%
[illegible]         20     2.7%
[illegible]         20     2.7%
[illegible]         17     2.3%
[illegible]         15     2.0%
[illegible]         14     1.9%
[illegible]         14     1.9%
[illegible]         11     1.5%
[illegible]         11     1.5%
[illegible]         10     1.3%
Other values (250)  593    79.5%

리뷰내용
Text

MISSING 

Distinct: 75
Distinct (%): 100.0%
Missing: 8
Missing (%): 9.6%
Memory size: 796.0 B

Length

Max length: 160
Median length: 61
Mean length: 43.746667
Min length: 1

Characters and Unicode

Total characters: 3281
Distinct characters: 446
Distinct categories: 14
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 75
Unique (%): 100.0%

Sample

1st row: 목차의 중간부분까지만 있어서 읽다가보면 사기당한 기분 참 더러워!!!
2nd row: 절망의 끝에서도 내일을 살아가는 사람들. 결말까지 책을 손에서 놓지 못했다.
3rd row: 정말 재밌습니다.강추
4th row: 그래도 뭔가 있겠지 하며 끝까지 읽어낸 시간이 아까웠던 책
5th row: Good read
Value               Count  Frequency (%)
[illegible]         7      0.9%
너무                7      0.9%
책을                6      0.8%
[illegible]         5      0.7%
많은                5      0.7%
[illegible]         4      0.5%
[illegible]         4      0.5%
[illegible]         4      0.5%
읽은                4      0.5%
도움이              4      0.5%
Other values (625)  691    93.3%

Most occurring characters

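Value               Count  Frequency (%)
(space)             696    21.2%
.                   101    3.1%
[illegible]         91     2.8%
[illegible]         75     2.3%
[illegible]         51     1.6%
[illegible]         50     1.5%
[illegible]         50     1.5%
[illegible]         44     1.3%
[illegible]         41     1.2%
[illegible]         41     1.2%
Other values (436)  2041   62.2%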

Most occurring categories

Value              Count  Frequency (%)
Other Letter       2366   72.1%
Space Separator    696    21.2%
Other Punctuation  130    4.0%
Lowercase Letter   37     1.1%
Decimal Number     19     0.6%
Math Symbol        14     0.4%
Close Punctuation  4      0.1%
Open Punctuation   3      0.1%
Modifier Symbol    3      0.1%
Dash Punctuation   2      0.1%
Other values (4)   7      0.2%

Most frequent character per category

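Other Letter
Value               Count  Frequency (%)
[illegible]         91     3.8%
[illegible]         75     3.2%
[illegible]         51     2.2%
[illegible]         50     2.1%
[illegible]         50     2.1%
[illegible]         44     1.9%
[illegible]         41     1.7%
[illegible]         41     1.7%
[illegible]         39     1.6%
[illegible]         39     1.6%
Other values (397)  1845   78.0%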

Lowercase Letter
Value             Count  Frequency (%)
d                 5      13.5%
a                 5      13.5%
o                 4      10.8%
n                 4      10.8%
e                 3      8.1%
r                 3      8.1%
t                 3      8.1%
f                 3      8.1%
g                 2      5.4%
i                 2      5.4%
Other values (2)  3      8.1%

Decimal Number
Value  Count  Frequency (%)
0      6      31.6%
1      4      21.1%
3      3      15.8%
4      3      15.8%
8      1      5.3%
5      1      5.3%
9      1      5.3%

Other Punctuation
Value  Count  Frequency (%)
.      101    77.7%
!      19     14.6%
?      6      4.6%
:      2      1.5%
,      2      1.5%

Modifier Symbol
Value  Count  Frequency (%)
´      1      33.3%
`      1      33.3%
^      1      33.3%

Math Symbol
Value        Count  Frequency (%)
~            13     92.9%
[illegible]  1      7.1%

Close Punctuation
Value        Count  Frequency (%)
)            3      75.0%
[illegible]  1      25.0%

Uppercase Letter
Value  Count  Frequency (%)
G      1      50.0%
A      1      50.0%

Space Separator
Value    Count  Frequency (%)
(space)  696    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      3      100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      2      100.0%

Final Punctuation
Value  Count  Frequency (%)
’      2      100.0%

Initial Punctuation
Value  Count  Frequency (%)
‘      2      100.0%

Connector Punctuation
Value  Count  Frequency (%)
_      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  2366   72.1%
Common  876    26.7%
Latin   39     1.2%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[illegible]         91     3.8%
[illegible]         75     3.2%
[illegible]         51     2.2%
[illegible]         50     2.1%
[illegible]         50     2.1%
[illegible]         44     1.9%
[illegible]         41     1.7%
[illegible]         41     1.7%
[illegible]         39     1.6%
[illegible]         39     1.6%
Other values (397)  1845   78.0%

Common
Value              Count  Frequency (%)
(space)            696    79.5%
.                  101    11.5%
!                  19     2.2%
~                  13     1.5%
0                  6      0.7%
?                  6      0.7%
1                  4      0.5%
)                  3      0.3%
(                  3      0.3%
3                  3      0.3%
Other values (15)  22     2.5%

Latin
Value             Count  Frequency (%)
d                 5      12.8%
a                 5      12.8%
o                 4      10.3%
n                 4      10.3%
e                 3      7.7%
r                 3      7.7%
t                 3      7.7%
f                 3      7.7%
g                 2      5.1%
i                 2      5.1%
Other values (4)  5      12.8%

Most occurring blocks

Value           Count  Frequency (%)
Hangul          2357   71.8%
ASCII           908    27.7%
Compat Jamo     9      0.3%
Punctuation     4      0.1%
None            2      0.1%
Math Operators  1      < 0.1%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
(space)            696    76.7%
.                  101    11.1%
!                  19     2.1%
~                  13     1.4%
0                  6      0.7%
?                  6      0.7%
d                  5      0.6%
a                  5      0.6%
o                  4      0.4%
n                  4      0.4%
Other values (24)  49     5.4%

Hangul
Value               Count  Frequency (%)
[illegible]         91     3.9%
[illegible]         75     3.2%
[illegible]         51     2.2%
[illegible]         50     2.1%
[illegible]         50     2.1%
[illegible]         44     1.9%
[illegible]         41     1.7%
[illegible]         41     1.7%
[illegible]         39     1.7%
[illegible]         39     1.7%
Other values (393)  1836   77.9%

Compat Jamo
Value        Count  Frequency (%)
[illegible]  4      44.4%
[illegible]  2      22.2%
[illegible]  2      22.2%
[illegible]  1      11.1%

Punctuation
Value        Count  Frequency (%)
[illegible]  2      50.0%
[illegible]  2      50.0%

None
Value        Count  Frequency (%)
´            1      50.0%
[illegible]  1      50.0%

Math Operators
Value        Count  Frequency (%)
[illegible]  1      100.0%

점수
Categorical

Distinct: 5
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 796.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 5
3rd row: 5
4th row: 1
5th row: 2

Common Values

Value  Count  Frequency (%)
5      46     55.4%
4      16     19.3%
1      13     15.7%
3      5      6.0%
2      3      3.6%
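
The frequency column is each count divided by the 83 observations. A short sketch of that calculation, using the counts reported for 점수 (the dictionary layout is illustrative):

```python
# Counts per score value as listed in the Common Values table.
counts = {"5": 46, "4": 16, "1": 13, "3": 5, "2": 3}

total = sum(counts.values())  # should equal the number of observations
frequency = {score: round(100 * n / total, 1) for score, n in counts.items()}

for score, pct in frequency.items():
    print(f"{score}: {counts[score]} ({pct}%)")
```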

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot: bar chart of the common values of 점수 (5: 46, 4: 16, 1: 13, 3: 5, 2: 3)]

Correlations

              바코드  컨텐츠한글명  리뷰내용  점수
바코드        1.000   1.000         1.000     0.991
컨텐츠한글명  1.000   1.000         1.000     0.991
리뷰내용      1.000   1.000         1.000     1.000
점수          0.991   0.991         1.000     1.000
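
The report does not name the association measure behind this matrix. The sketch below assumes an uncorrected Cramér's V for categorical pairs (an assumption, not something the report states) and illustrates why near-unique columns such as barcodes or free-text reviews produce coefficients at or near 1: when every value of one column is distinct, it trivially determines the other column. The function name and the synthetic data are hypothetical.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row, col = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Synthetic data: 20 all-distinct identifiers against 5 score levels.
ids = [f"id{i}" for i in range(20)]
scores = [str(i % 5 + 1) for i in range(20)]
v_ids = cramers_v(ids, scores)

# Two balanced, independent columns for contrast.
v_indep = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(v_ids, v_indep)
```

Under this assumption, the high values in the matrix say more about the cardinality of the text columns than about any real relationship with 점수.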

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
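
Both views reduce to a per-cell "is this value present?" test. A minimal sketch on three rows copied from the report's sample, with `None` standing in for `<NA>` (the variable names are illustrative):

```python
# Three rows from the sample table; None represents <NA>.
rows = [
    {"바코드": "4801130612240", "컨텐츠한글명": "레버리지",
     "리뷰내용": "그래도 뭔가 있겠지 하며 끝까지 읽어낸 시간이 아까웠던 책", "점수": "1"},
    {"바코드": "4801130619300", "컨텐츠한글명": "마흔에게",
     "리뷰내용": None, "점수": "2"},
    {"바코드": "4801130623109", "컨텐츠한글명": "철학이 필요한 순간",
     "리뷰내용": "Good read", "점수": "4"},
]
columns = list(rows[0])

# Nullity by column: number of missing values per column.
missing_per_column = {c: sum(r[c] is None for r in rows) for c in columns}

# Nullity matrix: one row per record, True where a value is present.
nullity_matrix = [[r[c] is not None for c in columns] for r in rows]

print(missing_per_column)
print(nullity_matrix)

# For the full dataset, 8 missing reviews out of 83 rows gives the reported share.
print(round(100 * 8 / 83, 1))
```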

Sample

Index | 바코드 | 컨텐츠한글명 | 리뷰내용 | 점수
0 | 3060101 | 베트남 10,000일의 전쟁 | 목차의 중간부분까지만 있어서 읽다가보면 사기당한 기분 참 더러워!!! | 1
1 | 104396016 | 지구 끝의 온실 | 절망의 끝에서도 내일을 살아가는 사람들. 결말까지 책을 손에서 놓지 못했다. | 5
2 | 115216313 | 사라진 여자들 | 정말 재밌습니다.강추 | 5
3 | 4801130612240 | 레버리지 | 그래도 뭔가 있겠지 하며 끝까지 읽어낸 시간이 아까웠던 책 | 1
4 | 4801130619300 | 마흔에게 | <NA> | 2
5 | 4801130623109 | 철학이 필요한 순간 | Good read | 4
6 | 4801130629033 | 침입자들 | 정혁용 작가님. 처음 보는 작가님 글인데 아주 묘한 매력이 있네요.잘 읽었습니다.감사합니다. | 4
7 | 4801155122977 | 심장을 쏘다. 1 | 심장을 쏘다 읽을만합니다.. | 5
8 | 4801155811888 | 나는 내가 죽었다고 생각했습니다 | 지금까지 읽은 책 중 이렇게 몰입해서 순식간에 다 읽은 책은 이게 유일해요!! ‘뇌과학자에게 하루아침에 찾아온 뇌졸중’ 주제부터가 재미없을 수가 없죠 ‘^’ | 5
9 | 4801156333075 | 곽재식의 미래를 파는 상점 | <NA> | 5

Index | 바코드 | 컨텐츠한글명 | 리뷰내용 | 점수
73 | WEB0027 | 네트워크관리사 1급 | 404에러구먼... not found...빨리 조치하시던지...아예 삭제를 하시던지... | 1
74 | WEB0066 | [NEW] 파워 포인트 2007 | 정말 차근차근 상세하고 꼼꼼하게 설명해 주셔서 학습에 많은 도움이 되었습니다. 미처 몰랐던 기본적이고 중요한 점들도 제대로 잘 배웠습니다. 지정하는 마우스 위치가 좀 더 잘 보인다면 금상첨화이겠습니다. | 5
75 | WEB0081 | 빈센트 반고흐(Gogh, Vincent van) | 큐레이터의 설명을 들으며 작쿰을 감상하니 작품이해가 훨씬 잘됩니다. 고희의 다른 많은 작품도 소개해주시면 좋겠습니다.. | 5
76 | WEB0081 | 빈센트 반고흐(Gogh, Vincent van) | 반고흐는 자상화를 그리기어려어서 자기 귀를 잘라버린것 모두아시조 빈센트반고희는 해바라기는 정말이쁜것있조 고희를 존경하고 미술관을만이가시기 바랍니다 | 5
77 | WEB0082 | 피에르 오그스트 르느와르 | 큐레이터의 설명을 들으며 작품감상하니 훨씬 더 이해가 잘되고 행복했습니다.좋아하는 르느와르의 더 많은 작품 소개 바랍니다.... | 5
78 | WEB0084 | 신나는 과학애니메이션 WHY? | 이거 애 안돼나요? | 1
79 | WEB0084 | 신나는 과학애니메이션 WHY? | 안되는데요? | 1
80 | X0013743 | 48분 기적의 독서법 - 인생역전 책 읽기 프로젝트 | 책읽기에 빠지고 싶은 사람이 반드시 읽게 될 책이라고 생각한다. 이 책을 읽으면 또다른 많은 필독서들을 만날 수 있다. | 5
81 | X0013899 | 마시멜로 이야기 (원본 완역) | 이 책은 목표와 성공을 이루기 위해 지금 감정이나 욕심에 좌우지되어 당장 하고 싶은-마시멜로로 비유될 수 있는-것들을 참아낼때 어떤 놀라운 인생의 변화가 일어나는가에 관한 얘기. 자극이 되는 책이다. | 5
82 | X0019368 | 모리스의 월요일 - 절망이 희망으로 바뀌는 기적의 날 | 크게 기대하지 않고 대출한 책인데 진한 감동을 주네요 순간의배려로 시작한 8년이 한사람의 인생을 바꾸었다는것에 감동받았습니다 | 5