Overview

Dataset statistics

Number of variables8
Number of observations1000
Missing cells519
Missing cells (%)6.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory65.6 KiB
Average record size in memory67.1 B

Variable types

Categorical1
Numeric3
Text4

Dataset

Description경기도 공공-지역 도서관 인기대출도서 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=4K9YEYMEGUD0N4ZIATSR24291536&infSeq=1

Alerts

기준년월 has constant value ""Constant
출판년도 has 31 (3.1%) missing valuesMissing
권수(권) has 487 (48.7%) missing valuesMissing

Reproduction

Analysis started2023-12-10 21:53:41.425852
Analysis finished2023-12-10 21:53:43.739668
Duration2.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-05
1000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-05
2nd row2023-05
3rd row2023-05
4th row2023-05
5th row2023-05

Common Values

ValueCountFrequency (%)
2023-05 1000
100.0%

Length

2023-12-11T06:53:43.796149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:53:43.878827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-05 1000
100.0%

순위번호
Real number (ℝ)

Distinct572
Distinct (%)57.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean499.747
Minimum1
Maximum997
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-11T06:53:43.981563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile50.95
Q1250.75
median500
Q3749
95-th percentile947.2
Maximum997
Range996
Interquartile range (IQR)498.25

Descriptive statistics

Standard deviation288.3381
Coefficient of variation (CV)0.57696815
Kurtosis-1.1998889
Mean499.747
Median Absolute Deviation (MAD)249
Skewness-0.00031204352
Sum499747
Variance83138.862
MonotonicityIncreasing
2023-12-11T06:53:44.132779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
790 8
 
0.8%
740 7
 
0.7%
886 7
 
0.7%
986 7
 
0.7%
814 6
 
0.6%
870 6
 
0.6%
556 6
 
0.6%
847 6
 
0.6%
896 6
 
0.6%
623 5
 
0.5%
Other values (562) 936
93.6%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
997 4
0.4%
995 2
 
0.2%
993 2
 
0.2%
986 7
0.7%
982 4
0.4%
978 4
0.4%
976 2
 
0.2%
973 3
0.3%
968 5
0.5%
963 5
0.5%
Distinct641
Distinct (%)64.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-11T06:53:44.510156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length51
Mean length20.267
Min length1

Characters and Unicode

Total characters20267
Distinct characters777
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique567 ?
Unique (%)56.7%

Sample

1st row불편한 편의점 :김호연 장편소설
2nd row아버지의 해방일지 :정지아 장편소설
3rd row불편한 편의점 :김호연 장편소설
4th row달러구트 꿈 백화점.이미예 장편소설
5th row어서오세요, 휴남동 서점입니다 :황보름 장편소설
ValueCountFrequency (%)
84
 
1.6%
장편소설 75
 
1.4%
go 60
 
1.1%
설민석의 57
 
1.1%
대모험 54
 
1.0%
그리스 43
 
0.8%
신화 43
 
0.8%
로마 43
 
0.8%
흔한남매 42
 
0.8%
세계 41
 
0.8%
Other values (2227) 4735
89.7%
2023-12-11T06:53:45.048720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4277
 
21.1%
: 443
 
2.2%
402
 
2.0%
382
 
1.9%
276
 
1.4%
255
 
1.3%
240
 
1.2%
232
 
1.1%
222
 
1.1%
210
 
1.0%
Other values (767) 13328
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13893
68.5%
Space Separator 4277
 
21.1%
Other Punctuation 676
 
3.3%
Lowercase Letter 624
 
3.1%
Decimal Number 240
 
1.2%
Open Punctuation 160
 
0.8%
Close Punctuation 160
 
0.8%
Uppercase Letter 141
 
0.7%
Dash Punctuation 58
 
0.3%
Math Symbol 36
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
402
 
2.9%
382
 
2.7%
276
 
2.0%
255
 
1.8%
240
 
1.7%
232
 
1.7%
222
 
1.6%
210
 
1.5%
186
 
1.3%
175
 
1.3%
Other values (694) 11313
81.4%
Lowercase Letter
ValueCountFrequency (%)
o 126
20.2%
g 77
12.3%
e 46
 
7.4%
t 41
 
6.6%
r 40
 
6.4%
i 34
 
5.4%
s 33
 
5.3%
y 29
 
4.6%
n 28
 
4.5%
h 27
 
4.3%
Other values (14) 143
22.9%
Uppercase Letter
ValueCountFrequency (%)
G 42
29.8%
N 20
14.2%
T 13
 
9.2%
E 11
 
7.8%
F 10
 
7.1%
S 9
 
6.4%
I 7
 
5.0%
K 6
 
4.3%
B 4
 
2.8%
M 4
 
2.8%
Other values (9) 15
 
10.6%
Other Punctuation
ValueCountFrequency (%)
: 443
65.5%
, 71
 
10.5%
! 44
 
6.5%
· 43
 
6.4%
/ 24
 
3.6%
. 23
 
3.4%
? 11
 
1.6%
& 10
 
1.5%
' 5
 
0.7%
% 2
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 79
32.9%
0 44
18.3%
2 30
 
12.5%
3 24
 
10.0%
5 13
 
5.4%
7 12
 
5.0%
4 12
 
5.0%
9 9
 
3.8%
8 9
 
3.8%
6 8
 
3.3%
Math Symbol
ValueCountFrequency (%)
= 33
91.7%
+ 2
 
5.6%
~ 1
 
2.8%
Open Punctuation
ValueCountFrequency (%)
( 159
99.4%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 159
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
4277
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13893
68.5%
Common 5609
27.7%
Latin 765
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
402
 
2.9%
382
 
2.7%
276
 
2.0%
255
 
1.8%
240
 
1.7%
232
 
1.7%
222
 
1.6%
210
 
1.5%
186
 
1.3%
175
 
1.3%
Other values (694) 11313
81.4%
Latin
ValueCountFrequency (%)
o 126
16.5%
g 77
 
10.1%
e 46
 
6.0%
G 42
 
5.5%
t 41
 
5.4%
r 40
 
5.2%
i 34
 
4.4%
s 33
 
4.3%
y 29
 
3.8%
n 28
 
3.7%
Other values (33) 269
35.2%
Common
ValueCountFrequency (%)
4277
76.3%
: 443
 
7.9%
( 159
 
2.8%
) 159
 
2.8%
1 79
 
1.4%
, 71
 
1.3%
- 58
 
1.0%
! 44
 
0.8%
0 44
 
0.8%
· 43
 
0.8%
Other values (20) 232
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13890
68.5%
ASCII 6327
31.2%
None 45
 
0.2%
Compat Jamo 3
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4277
67.6%
: 443
 
7.0%
( 159
 
2.5%
) 159
 
2.5%
o 126
 
2.0%
1 79
 
1.2%
g 77
 
1.2%
, 71
 
1.1%
- 58
 
0.9%
e 46
 
0.7%
Other values (59) 832
 
13.1%
Hangul
ValueCountFrequency (%)
402
 
2.9%
382
 
2.8%
276
 
2.0%
255
 
1.8%
240
 
1.7%
232
 
1.7%
222
 
1.6%
210
 
1.5%
186
 
1.3%
175
 
1.3%
Other values (692) 11310
81.4%
None
ValueCountFrequency (%)
· 43
95.6%
1
 
2.2%
1
 
2.2%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Distinct520
Distinct (%)52.1%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
2023-12-11T06:53:45.409294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length50
Mean length14.808809
Min length2

Characters and Unicode

Total characters14794
Distinct characters364
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique396 ?
Unique (%)39.6%

Sample

1st row지은이: 김호연
2nd row지은이: 정지아
3rd row지은이: 김호연
4th row지은이: 이미예
5th row지은이: 황보름
ValueCountFrequency (%)
387
 
10.6%
그림 384
 
10.5%
지은이 232
 
6.3%
지음 144
 
3.9%
옮김 139
 
3.8%
글·그림 101
 
2.8%
원작 81
 
2.2%
옮긴이 63
 
1.7%
흔한남매 47
 
1.3%
김정화 43
 
1.2%
Other values (770) 2042
55.7%
2023-12-11T06:53:45.960461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2664
 
18.0%
: 910
 
6.2%
588
 
4.0%
564
 
3.8%
553
 
3.7%
; 517
 
3.5%
496
 
3.4%
482
 
3.3%
425
 
2.9%
, 310
 
2.1%
Other values (354) 7285
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9761
66.0%
Space Separator 2664
 
18.0%
Other Punctuation 1867
 
12.6%
Close Punctuation 225
 
1.5%
Open Punctuation 225
 
1.5%
Lowercase Letter 42
 
0.3%
Uppercase Letter 8
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
588
 
6.0%
564
 
5.8%
553
 
5.7%
496
 
5.1%
482
 
4.9%
425
 
4.4%
303
 
3.1%
283
 
2.9%
204
 
2.1%
151
 
1.5%
Other values (331) 5712
58.5%
Uppercase Letter
ValueCountFrequency (%)
J 2
25.0%
R 1
12.5%
A 1
12.5%
B 1
12.5%
K 1
12.5%
V 1
12.5%
T 1
12.5%
Other Punctuation
ValueCountFrequency (%)
: 910
48.7%
; 517
27.7%
, 310
 
16.6%
· 108
 
5.8%
. 22
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
o 20
47.6%
c 19
45.2%
a 1
 
2.4%
e 1
 
2.4%
r 1
 
2.4%
Close Punctuation
ValueCountFrequency (%)
) 224
99.6%
] 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 224
99.6%
[ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
2664
100.0%
Math Symbol
ValueCountFrequency (%)
| 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9761
66.0%
Common 4983
33.7%
Latin 50
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
588
 
6.0%
564
 
5.8%
553
 
5.7%
496
 
5.1%
482
 
4.9%
425
 
4.4%
303
 
3.1%
283
 
2.9%
204
 
2.1%
151
 
1.5%
Other values (331) 5712
58.5%
Latin
ValueCountFrequency (%)
o 20
40.0%
c 19
38.0%
J 2
 
4.0%
R 1
 
2.0%
A 1
 
2.0%
B 1
 
2.0%
a 1
 
2.0%
e 1
 
2.0%
r 1
 
2.0%
K 1
 
2.0%
Other values (2) 2
 
4.0%
Common
ValueCountFrequency (%)
2664
53.5%
: 910
 
18.3%
; 517
 
10.4%
, 310
 
6.2%
) 224
 
4.5%
( 224
 
4.5%
· 108
 
2.2%
. 22
 
0.4%
| 2
 
< 0.1%
[ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9761
66.0%
ASCII 4925
33.3%
None 108
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2664
54.1%
: 910
 
18.5%
; 517
 
10.5%
, 310
 
6.3%
) 224
 
4.5%
( 224
 
4.5%
. 22
 
0.4%
o 20
 
0.4%
c 19
 
0.4%
J 2
 
< 0.1%
Other values (12) 13
 
0.3%
Hangul
ValueCountFrequency (%)
588
 
6.0%
564
 
5.8%
553
 
5.7%
496
 
5.1%
482
 
4.9%
425
 
4.4%
303
 
3.1%
283
 
2.9%
204
 
2.1%
151
 
1.5%
Other values (331) 5712
58.5%
None
ValueCountFrequency (%)
· 108
100.0%
Distinct227
Distinct (%)22.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-11T06:53:46.198377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length4.946
Min length1

Characters and Unicode

Total characters4946
Distinct characters309
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)13.4%

Sample

1st row나무옆의자
2nd row창비
3rd row나무옆의자
4th row팩토리나인
5th row클레이하우스
ValueCountFrequency (%)
북이십일 96
 
8.7%
미래엔 91
 
8.3%
위즈덤하우스 48
 
4.4%
서울문화사 44
 
4.0%
창비 41
 
3.7%
김영사 37
 
3.4%
다산북스 34
 
3.1%
비룡소 33
 
3.0%
문학동네 32
 
2.9%
미디어그룹 27
 
2.4%
Other values (230) 620
56.2%
2023-12-11T06:53:46.648518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
222
 
4.5%
219
 
4.4%
181
 
3.7%
155
 
3.1%
143
 
2.9%
106
 
2.1%
103
 
2.1%
100
 
2.0%
99
 
2.0%
99
 
2.0%
Other values (299) 3519
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4312
87.2%
Lowercase Letter 297
 
6.0%
Space Separator 103
 
2.1%
Uppercase Letter 98
 
2.0%
Open Punctuation 62
 
1.3%
Close Punctuation 62
 
1.3%
Decimal Number 11
 
0.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
222
 
5.1%
219
 
5.1%
181
 
4.2%
155
 
3.6%
143
 
3.3%
106
 
2.5%
100
 
2.3%
99
 
2.3%
99
 
2.3%
99
 
2.3%
Other values (256) 2889
67.0%
Lowercase Letter
ValueCountFrequency (%)
k 45
15.2%
a 39
13.1%
i 39
13.1%
o 34
11.4%
n 29
9.8%
u 18
 
6.1%
m 17
 
5.7%
e 17
 
5.7%
r 17
 
5.7%
s 9
 
3.0%
Other values (11) 33
11.1%
Uppercase Letter
ValueCountFrequency (%)
D 21
21.4%
M 15
15.3%
B 14
14.3%
N 13
13.3%
R 7
 
7.1%
H 7
 
7.1%
K 7
 
7.1%
P 4
 
4.1%
O 2
 
2.0%
F 2
 
2.0%
Other values (5) 6
 
6.1%
Decimal Number
ValueCountFrequency (%)
2 6
54.5%
1 4
36.4%
6 1
 
9.1%
Space Separator
ValueCountFrequency (%)
103
100.0%
Open Punctuation
ValueCountFrequency (%)
( 62
100.0%
Close Punctuation
ValueCountFrequency (%)
) 62
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4312
87.2%
Latin 395
 
8.0%
Common 239
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
222
 
5.1%
219
 
5.1%
181
 
4.2%
155
 
3.6%
143
 
3.3%
106
 
2.5%
100
 
2.3%
99
 
2.3%
99
 
2.3%
99
 
2.3%
Other values (256) 2889
67.0%
Latin
ValueCountFrequency (%)
k 45
 
11.4%
a 39
 
9.9%
i 39
 
9.9%
o 34
 
8.6%
n 29
 
7.3%
D 21
 
5.3%
u 18
 
4.6%
m 17
 
4.3%
e 17
 
4.3%
r 17
 
4.3%
Other values (26) 119
30.1%
Common
ValueCountFrequency (%)
103
43.1%
( 62
25.9%
) 62
25.9%
2 6
 
2.5%
1 4
 
1.7%
· 1
 
0.4%
6 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4312
87.2%
ASCII 633
 
12.8%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
222
 
5.1%
219
 
5.1%
181
 
4.2%
155
 
3.6%
143
 
3.3%
106
 
2.5%
100
 
2.3%
99
 
2.3%
99
 
2.3%
99
 
2.3%
Other values (256) 2889
67.0%
ASCII
ValueCountFrequency (%)
103
16.3%
( 62
 
9.8%
) 62
 
9.8%
k 45
 
7.1%
a 39
 
6.2%
i 39
 
6.2%
o 34
 
5.4%
n 29
 
4.6%
D 21
 
3.3%
u 18
 
2.8%
Other values (32) 181
28.6%
None
ValueCountFrequency (%)
· 1
100.0%

출판년도
Real number (ℝ)

MISSING 

Distinct23
Distinct (%)2.4%
Missing31
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean2019.3756
Minimum2000
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-11T06:53:46.815578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2011
Q12018
median2020
Q32022
95-th percentile2023
Maximum2023
Range23
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.7583605
Coefficient of variation (CV)0.0018611498
Kurtosis5.8485622
Mean2019.3756
Median Absolute Deviation (MAD)2
Skewness-2.1408314
Sum1956775
Variance14.125274
MonotonicityNot monotonic
2023-12-11T06:53:46.954985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
2022 200
20.0%
2021 159
15.9%
2023 121
12.1%
2020 109
10.9%
2019 109
10.9%
2017 75
 
7.5%
2018 60
 
6.0%
2016 42
 
4.2%
2011 20
 
2.0%
2014 14
 
1.4%
Other values (13) 60
 
6.0%
(Missing) 31
 
3.1%
ValueCountFrequency (%)
2000 2
 
0.2%
2001 3
 
0.3%
2003 1
 
0.1%
2004 1
 
0.1%
2005 13
1.3%
2006 2
 
0.2%
2007 2
 
0.2%
2008 3
 
0.3%
2009 2
 
0.2%
2010 3
 
0.3%
ValueCountFrequency (%)
2023 121
12.1%
2022 200
20.0%
2021 159
15.9%
2020 109
10.9%
2019 109
10.9%
2018 60
 
6.0%
2017 75
 
7.5%
2016 42
 
4.2%
2015 14
 
1.4%
2014 14
 
1.4%

권수(권)
Real number (ℝ)

MISSING 

Distinct92
Distinct (%)17.9%
Missing487
Missing (%)48.7%
Infinite0
Infinite (%)0.0%
Mean20.167641
Minimum1
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-11T06:53:47.086476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median5
Q314
95-th percentile75.4
Maximum2023
Range2022
Interquartile range (IQR)12

Descriptive statistics

Standard deviation94.588785
Coefficient of variation (CV)4.6901263
Kurtosis394.89406
Mean20.167641
Median Absolute Deviation (MAD)4
Skewness18.844312
Sum10346
Variance8947.0382
MonotonicityNot monotonic
2023-12-11T06:53:47.214716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 95
 
9.5%
2 67
 
6.7%
3 44
 
4.4%
4 31
 
3.1%
5 28
 
2.8%
6 22
 
2.2%
7 20
 
2.0%
9 16
 
1.6%
8 15
 
1.5%
10 13
 
1.3%
Other values (82) 162
 
16.2%
(Missing) 487
48.7%
ValueCountFrequency (%)
1 95
9.5%
2 67
6.7%
3 44
4.4%
4 31
 
3.1%
5 28
 
2.8%
6 22
 
2.2%
7 20
 
2.0%
8 15
 
1.5%
9 16
 
1.6%
10 13
 
1.3%
ValueCountFrequency (%)
2023 1
0.1%
354 1
0.1%
305 1
0.1%
280 1
0.1%
265 1
0.1%
115 1
0.1%
92 1
0.1%
91 2
0.2%
90 1
0.1%
89 1
0.1%
Distinct990
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-11T06:53:47.481103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length66
Mean length67.391
Min length42

Characters and Unicode

Total characters67391
Distinct characters41
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique980 ?
Unique (%)98.0%

Sample

1st rowhttps://image.aladin.co.kr/product/26942/84/cover/k582730818_1.jpg
2nd rowhttps://image.aladin.co.kr/product/30048/51/cover/8936438832_1.jpg
3rd rowhttps://image.aladin.co.kr/product/29858/98/cover/k432838027_1.jpg
4th rowhttps://image.aladin.co.kr/product/24512/70/cover/k392630952_1.jpg
5th rowhttps://image.aladin.co.kr/product/28685/95/cover/k362836265_1.jpg
ValueCountFrequency (%)
https://image.aladin.co.kr/product/29252/35/cover/k462837397_1.jpg 2
 
0.2%
http://image.aladin.co.kr/product/3568/34/cover/8993242984_1.jpg 2
 
0.2%
https://image.aladin.co.kr/product/29787/59/cover/k712838601_1.jpg 2
 
0.2%
https://bookthumb-phinf.pstatic.net/cover/062/895/06289560.jpg?type=m1&udate=20180324 2
 
0.2%
http://image.aladin.co.kr/product/10586/14/cover/k422530533_1.jpg 2
 
0.2%
https://image.aladin.co.kr/product/26302/71/cover/8954677150_1.jpg 2
 
0.2%
https://bookthumb-phinf.pstatic.net/cover/069/922/06992205.jpg?type=m1&udate=20180120 2
 
0.2%
http://image.aladin.co.kr/product/7918/85/cover/k352434031_1.jpg 2
 
0.2%
https://image.aladin.co.kr/product/30169/22/cover/8959897094_2.jpg 2
 
0.2%
http://image.aladin.co.kr/product/6232/31/cover/8936442805_1.jpg 2
 
0.2%
Other values (980) 980
98.0%
2023-12-11T06:53:47.827092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 6890
 
10.2%
. 3920
 
5.8%
t 3417
 
5.1%
p 3184
 
4.7%
o 2981
 
4.4%
a 2911
 
4.3%
c 2900
 
4.3%
r 2811
 
4.2%
1 2579
 
3.8%
2 2419
 
3.6%
Other values (31) 33379
49.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 36474
54.1%
Decimal Number 17778
26.4%
Other Punctuation 11970
 
17.8%
Connector Punctuation 920
 
1.4%
Math Symbol 160
 
0.2%
Dash Punctuation 89
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 3417
 
9.4%
p 3184
 
8.7%
o 2981
 
8.2%
a 2911
 
8.0%
c 2900
 
8.0%
r 2811
 
7.7%
e 2150
 
5.9%
i 2022
 
5.5%
g 1922
 
5.3%
d 1901
 
5.2%
Other values (13) 10275
28.2%
Decimal Number
ValueCountFrequency (%)
1 2579
14.5%
2 2419
13.6%
3 2171
12.2%
8 1972
11.1%
9 1710
9.6%
0 1578
8.9%
6 1379
7.8%
5 1373
7.7%
4 1311
7.4%
7 1286
7.2%
Other Punctuation
ValueCountFrequency (%)
/ 6890
57.6%
. 3920
32.7%
: 1000
 
8.4%
? 80
 
0.7%
& 80
 
0.7%
Connector Punctuation
ValueCountFrequency (%)
_ 920
100.0%
Math Symbol
ValueCountFrequency (%)
= 160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 36474
54.1%
Common 30917
45.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 3417
 
9.4%
p 3184
 
8.7%
o 2981
 
8.2%
a 2911
 
8.0%
c 2900
 
8.0%
r 2811
 
7.7%
e 2150
 
5.9%
i 2022
 
5.5%
g 1922
 
5.3%
d 1901
 
5.2%
Other values (13) 10275
28.2%
Common
ValueCountFrequency (%)
/ 6890
22.3%
. 3920
12.7%
1 2579
 
8.3%
2 2419
 
7.8%
3 2171
 
7.0%
8 1972
 
6.4%
9 1710
 
5.5%
0 1578
 
5.1%
6 1379
 
4.5%
5 1373
 
4.4%
Other values (8) 4926
15.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 67391
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 6890
 
10.2%
. 3920
 
5.8%
t 3417
 
5.1%
p 3184
 
4.7%
o 2981
 
4.4%
a 2911
 
4.3%
c 2900
 
4.3%
r 2811
 
4.2%
1 2579
 
3.8%
2 2419
 
3.6%
Other values (31) 33379
49.5%

Interactions

2023-12-11T06:53:42.743403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.171301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.440129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.851210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.255013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.558009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.961457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.340090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:53:42.638656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:53:47.907625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위번호출판년도권수(권)
순위번호1.0000.3580.126
출판년도0.3581.0000.143
권수(권)0.1260.1431.000
2023-12-11T06:53:47.985729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위번호출판년도권수(권)
순위번호1.0000.0830.017
출판년도0.0831.000-0.448
권수(권)0.017-0.4481.000

Missing values

2023-12-11T06:53:43.431936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:53:43.577727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:53:43.686912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년월순위번호도서명정보저자명정보출판사명출판년도권수(권)도서이미지URL
02023-051불편한 편의점 :김호연 장편소설지은이: 김호연나무옆의자2021<NA>https://image.aladin.co.kr/product/26942/84/cover/k582730818_1.jpg
12023-052아버지의 해방일지 :정지아 장편소설지은이: 정지아창비2022<NA>https://image.aladin.co.kr/product/30048/51/cover/8936438832_1.jpg
22023-053불편한 편의점 :김호연 장편소설지은이: 김호연나무옆의자20222https://image.aladin.co.kr/product/29858/98/cover/k432838027_1.jpg
32023-054달러구트 꿈 백화점.이미예 장편소설지은이: 이미예팩토리나인2020<NA>https://image.aladin.co.kr/product/24512/70/cover/k392630952_1.jpg
42023-055어서오세요, 휴남동 서점입니다 :황보름 장편소설지은이: 황보름클레이하우스2022<NA>https://image.aladin.co.kr/product/28685/95/cover/k362836265_1.jpg
52023-056역행자 :돈·시간·운명으로부터 완전한 자유를 얻는 7단계 인생 공략집자청 지음웅진씽크빅2022<NA>https://image.aladin.co.kr/product/29521/63/cover/8901260719_1.jpg
62023-057하얼빈 :김훈 장편소설지은이: 김훈문학동네2022<NA>https://image.aladin.co.kr/product/29857/0/cover/895469991x_1.jpg
72023-058(이미 늦었다고 생각하는 당신을 위한) 김미경의 마흔 수업김미경 지음엠케이유니버스2023<NA>https://image.aladin.co.kr/product/30995/11/cover/k672831500_1.jpg
82023-059흔한남매원작: 흔한남매 ;그림: 유난희미래엔20193https://image.aladin.co.kr/product/22327/46/cover/k782636924_1.jpg
92023-0510흔한남매원작: 흔한남매 ;그림: 유난희미래엔20197https://image.aladin.co.kr/product/26556/36/cover/k602738939_1.jpg
기준년월순위번호도서명정보저자명정보출판사명출판년도권수(권)도서이미지URL
9902023-05986내일은 발명왕 :본격 대결 과학발명 만화글: 곰돌이 co. ;그림: 홍종현Mirae N 아이세움201135https://image.aladin.co.kr/product/28408/12/cover/k312835916_1.jpg
9912023-05986(읽으면서 바로 써먹는) 어린이 속담글·그림: 한날파란정원2020<NA>https://image.aladin.co.kr/product/25785/13/cover/k122736716_1.jpg
9922023-05993여행의 시간 :도시건축가 김진애의 인생 여행법지은이: 김진애창비2023<NA>https://image.aladin.co.kr/product/31249/42/cover/8936479253_1.jpg
9932023-05993알로하, 나의 엄마들 :이금이 장편소설지은이: 이금이창비2020<NA>https://image.aladin.co.kr/product/23646/67/cover/8936456954_1.jpg
9942023-05995지적 대화를 위한 넓고 얕은 지식 1 - 현실 편 : 역사 / 경제 / 정치 / 사회 / 윤리채사장 (지은이)웨일북20201https://image.aladin.co.kr/product/22872/79/cover/k992636841_2.jpg
9952023-05995열두 살 경제학교 :부자가 되고 싶은 어린이를 위한 경제 교육 동화권오상 지음 ;손수정 그림카시오페아2022<NA>https://image.aladin.co.kr/product/29797/98/cover/k262838010_1.jpg
9962023-05997데미안지은이: 헤르만 헤세 ;옮긴이: 전영애민음사200044https://bookthumb-phinf.pstatic.net/cover/000/051/00005186.jpg?type=m1&udate=20160509
9972023-05997이상한 엄마백희나책읽는곰201633http://image.aladin.co.kr/product/7918/85/cover/k352434031_1.jpg
9982023-05997수학을 잘하고 싶어졌습니다 :서울대 3번 입학, 14년을 다니며 깨달은 공부의 본질서준석 지음다산북스2022<NA>https://image.aladin.co.kr/product/30705/78/cover/k282830149_1.jpg
9992023-05997엄마 까투리권정생 글 ;김세현 그림낮은산2008<NA>http://image.aladin.co.kr/product/213/52/cover/8989646480_1.jpg