Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 120
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 781.2 KiB
Average record size in memory: 80.0 B
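
The two derived figures in this block can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all rows × columns cells and that 1 KiB = 1024 B; the function names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 9
n_observations = 10_000
missing_cells = 120
total_size_kib = 781.2


def missing_cell_pct(missing, rows, cols):
    """Share of empty cells among all rows * cols cells, in percent."""
    return 100.0 * missing / (rows * cols)


def avg_record_size_bytes(total_kib, rows):
    """Average in-memory size of one row in bytes (assumes 1 KiB = 1024 B)."""
    return total_kib * 1024 / rows


pct = missing_cell_pct(missing_cells, n_observations, n_variables)
avg = avg_record_size_bytes(total_size_kib, n_observations)
print(f"{pct:.1f}%")   # 0.1%
print(f"{avg:.1f} B")  # 80.0 B
```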

Variable types

Categorical: 3
Text: 6

Dataset

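Description: Data on the e-books and audiobooks held by the Seocho-gu Electronic Library (Seocho-gu, Seoul). Columns: 도서관명 (library name), 도서관홈페이지url (library homepage URL), 도서명 (book title), 저자명 (author), isbn, 형식 (format), 카테고리 (category), 출판사 (publisher), 표지url (cover image URL).
Author: Seocho-gu, Seoul Metropolitan City
URL: https://www.data.go.kr/data/15112631/fileData.do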

Alerts

도서관명 has constant value "서초구전자도서관" (Constant)
도서관홈페이지url has constant value "https://e-book.seocholib.or.kr/main" (Constant)
형식 is highly imbalanced (90.1%) (Imbalance)
isbn has 120 (1.2%) missing values (Missing)
표지url has unique values (Unique)
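
The 90.1% imbalance figure for 형식 can be reproduced from its two class counts (9872 and 128). The sketch below assumes the score is one minus the Shannon entropy normalized by its maximum, which is how the profiling tool is understood to define it; the function name and the single-class convention are illustrative choices, not taken from the report.

```python
import math


def imbalance_score(counts):
    """1 - (Shannon entropy / maximum entropy); 0 = balanced, near 1 = one class dominates."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumed convention: a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts of 형식 as listed in this report: 전자책 9872, 오디오북 128.
score = imbalance_score([9872, 128])
print(f"{100 * score:.1f}%")  # 90.1%
```

A perfectly balanced two-class column scores 0 under this definition, so a value of about 0.90 indicates that nearly all rows share one format.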

Reproduction

Analysis started: 2023-12-12 05:37:47.736332
Analysis finished: 2023-12-12 05:37:51.196748
Duration: 3.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

도서관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
서초구전자도서관: 10000

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서초구전자도서관
2nd row: 서초구전자도서관
3rd row: 서초구전자도서관
4th row: 서초구전자도서관
5th row: 서초구전자도서관

Common Values

Value | Count | Frequency (%)
서초구전자도서관 | 10000 | 100.0%

[Figure: histogram of lengths of the category]

[Figure: common values plot]

도서관홈페이지url
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
https://e-book.seocholib.or.kr/main: 10000

Length

Max length: 35
Median length: 35
Mean length: 35
Min length: 35

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: https://e-book.seocholib.or.kr/main
2nd row: https://e-book.seocholib.or.kr/main
3rd row: https://e-book.seocholib.or.kr/main
4th row: https://e-book.seocholib.or.kr/main
5th row: https://e-book.seocholib.or.kr/main

Common Values

Value | Count | Frequency (%)
https://e-book.seocholib.or.kr/main | 10000 | 100.0%

[Figure: histogram of lengths of the category]

[Figure: common values plot]

도서명
Text

Distinct: 9958
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 77
Median length: 55
Mean length: 12.9164
Min length: 1

Characters and Unicode

Total characters: 129164
Distinct characters: 1413
Distinct categories: 16
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
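
As an illustration of these character properties, the sketch below counts the characters of one sample title by Unicode general category using Python's standard `unicodedata` module. It is not the profiling tool's own implementation; the mapping from two-letter category codes to the long names used in this report is written out by hand and covers only a few categories.

```python
import unicodedata
from collections import Counter

# Long names for a few general-category codes, matching the labels in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)  # e.g. "Lo" for a Hangul syllable
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


counts = category_counts("경제학 100 문장")  # 3rd sample row of 도서명
print(counts)
```

Hangul syllables fall under Other Letter, which is why that category dominates the Korean-language columns in this report.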

Unique

Unique: 9921
Unique (%): 99.2%
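
Distinct and Unique measure different things: here Distinct is 9958 and Unique is 9921. The sketch below assumes that Distinct counts the different values present and Unique counts the values that occur exactly once, which is consistent with the constant columns (Distinct 1, Unique 0). The toy column is hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical toy column: "소설" repeats, the other two values occur once.
toy_column = ["소설", "소설", "경영", "에세이"]
print(distinct_and_unique(toy_column))  # (3, 2)
```

Under this reading, the difference of 37 between the two figures is the number of titles that appear more than once.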

Sample

1st row: 포스퀘어 스토리
2nd row: 어른이 되면 무슨 일을 할까요
3rd row: 경제학 100 문장
4th row: 내 친구의 집
5th row: 더 테이블

Value | Count | Frequency (%)
이야기 | 319 | 0.9%
1 | 255 | 0.7%
2 | 248 | 0.7%
나는 | 157 | 0.5%
 | 153 | 0.4%
 | 141 | 0.4%
들려주는 | 121 | 0.4%
위한 | 120 | 0.3%
3 | 103 | 0.3%
우리 | 92 | 0.3%
Other values (16048) | 32825 | 95.1%

Most occurring characters

ValueCountFrequency (%)
24909
 
19.3%
2769
 
2.1%
2498
 
1.9%
2195
 
1.7%
1619
 
1.3%
1576
 
1.2%
1329
 
1.0%
1308
 
1.0%
1258
 
1.0%
1222
 
0.9%
Other values (1403) 88481
68.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 92621 | 71.7%
Space Separator | 24909 | 19.3%
Decimal Number | 4023 | 3.1%
Uppercase Letter | 2066 | 1.6%
Other Punctuation | 1593 | 1.2%
Lowercase Letter | 1484 | 1.1%
Open Punctuation | 1084 | 0.8%
Close Punctuation | 1081 | 0.8%
Dash Punctuation | 190 | 0.1%
Letter Number | 39 | < 0.1%
Other values (6) | 74 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2769
 
3.0%
2498
 
2.7%
2195
 
2.4%
1619
 
1.7%
1576
 
1.7%
1329
 
1.4%
1308
 
1.4%
1258
 
1.4%
1222
 
1.3%
1206
 
1.3%
Other values (1302) 75641
81.7%
Uppercase Letter
ValueCountFrequency (%)
E 212
 
10.3%
S 190
 
9.2%
T 143
 
6.9%
A 140
 
6.8%
O 132
 
6.4%
N 121
 
5.9%
I 115
 
5.6%
B 96
 
4.6%
C 96
 
4.6%
R 89
 
4.3%
Other values (16) 732
35.4%
Lowercase Letter
ValueCountFrequency (%)
e 201
13.5%
t 123
 
8.3%
o 121
 
8.2%
s 121
 
8.2%
a 120
 
8.1%
i 100
 
6.7%
r 95
 
6.4%
l 81
 
5.5%
n 81
 
5.5%
h 66
 
4.4%
Other values (14) 375
25.3%
Other Punctuation
ValueCountFrequency (%)
. 620
38.9%
, 395
24.8%
: 234
 
14.7%
! 150
 
9.4%
? 112
 
7.0%
% 27
 
1.7%
& 20
 
1.3%
/ 16
 
1.0%
· 10
 
0.6%
* 3
 
0.2%
Other values (4) 6
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 990
24.6%
0 775
19.3%
2 687
17.1%
3 408
10.1%
5 299
 
7.4%
4 243
 
6.0%
6 178
 
4.4%
7 161
 
4.0%
9 143
 
3.6%
8 139
 
3.5%
Letter Number
ValueCountFrequency (%)
16
41.0%
8
20.5%
5
 
12.8%
4
 
10.3%
3
 
7.7%
2
 
5.1%
1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 981
90.5%
[ 98
 
9.0%
4
 
0.4%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 978
90.5%
] 98
 
9.1%
4
 
0.4%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 12
44.4%
+ 11
40.7%
< 2
 
7.4%
> 2
 
7.4%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
24909
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 190
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 34
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 92551 | 71.7%
Common | 32954 | 25.5%
Latin | 3589 | 2.8%
Han | 70 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2769
 
3.0%
2498
 
2.7%
2195
 
2.4%
1619
 
1.7%
1576
 
1.7%
1329
 
1.4%
1308
 
1.4%
1258
 
1.4%
1222
 
1.3%
1206
 
1.3%
Other values (1242) 75571
81.7%
Han
ValueCountFrequency (%)
4
 
5.7%
3
 
4.3%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (50) 50
71.4%
Latin
ValueCountFrequency (%)
E 212
 
5.9%
e 201
 
5.6%
S 190
 
5.3%
T 143
 
4.0%
A 140
 
3.9%
O 132
 
3.7%
t 123
 
3.4%
o 121
 
3.4%
s 121
 
3.4%
N 121
 
3.4%
Other values (47) 2085
58.1%
Common
ValueCountFrequency (%)
24909
75.6%
1 990
 
3.0%
( 981
 
3.0%
) 978
 
3.0%
0 775
 
2.4%
2 687
 
2.1%
. 620
 
1.9%
3 408
 
1.2%
, 395
 
1.2%
5 299
 
0.9%
Other values (34) 1912
 
5.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 92550 | 71.7%
ASCII | 36476 | 28.2%
CJK | 70 | 0.1%
Number Forms | 39 | < 0.1%
None | 21 | < 0.1%
Punctuation | 6 | < 0.1%
Letterlike Symbols | 1 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24909
68.3%
1 990
 
2.7%
( 981
 
2.7%
) 978
 
2.7%
0 775
 
2.1%
2 687
 
1.9%
. 620
 
1.7%
3 408
 
1.1%
, 395
 
1.1%
5 299
 
0.8%
Other values (73) 5434
 
14.9%
Hangul
ValueCountFrequency (%)
2769
 
3.0%
2498
 
2.7%
2195
 
2.4%
1619
 
1.7%
1576
 
1.7%
1329
 
1.4%
1308
 
1.4%
1258
 
1.4%
1222
 
1.3%
1206
 
1.3%
Other values (1241) 75570
81.7%
Number Forms
ValueCountFrequency (%)
16
41.0%
8
20.5%
5
 
12.8%
4
 
10.3%
3
 
7.7%
2
 
5.1%
1
 
2.6%
None
ValueCountFrequency (%)
· 10
47.6%
4
 
19.0%
4
 
19.0%
1
 
4.8%
1
 
4.8%
1
 
4.8%
CJK
ValueCountFrequency (%)
4
 
5.7%
3
 
4.3%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (50) 50
71.4%
Punctuation
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

저자명
Text

Distinct: 7140
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 44
Median length: 3
Mean length: 4.6547
Min length: 1

Characters and Unicode

Total characters: 46547
Distinct characters: 901
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5784
Unique (%): 57.8%

Sample

1st row: 임홍택
2nd row: 어린이경제교육연구회
3rd row: 댄 스미스
4th row: 우미옥
5th row: 김종관

Value | Count | Frequency (%)
편집부 | 167 | 1.2%
지음 | 106 | 0.8%
 | 84 | 0.6%
베리타스알파 | 47 | 0.3%
그림 | 44 | 0.3%
 | 37 | 0.3%
 | 36 | 0.3%
데이비드 | 35 | 0.3%
로버트 | 32 | 0.2%
옮김 | 31 | 0.2%
Other values (8334) | 13202 | 95.5%

Most occurring characters

ValueCountFrequency (%)
3836
 
8.2%
1793
 
3.9%
1381
 
3.0%
887
 
1.9%
783
 
1.7%
712
 
1.5%
585
 
1.3%
512
 
1.1%
504
 
1.1%
410
 
0.9%
Other values (891) 35144
75.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39817
85.5%
Space Separator 3836
 
8.2%
Lowercase Letter 1128
 
2.4%
Uppercase Letter 1048
 
2.3%
Other Punctuation 367
 
0.8%
Close Punctuation 127
 
0.3%
Open Punctuation 127
 
0.3%
Decimal Number 71
 
0.2%
Math Symbol 20
 
< 0.1%
Dash Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1793
 
4.5%
1381
 
3.5%
887
 
2.2%
783
 
2.0%
712
 
1.8%
585
 
1.5%
512
 
1.3%
504
 
1.3%
410
 
1.0%
407
 
1.0%
Other values (810) 31843
80.0%
Uppercase Letter
ValueCountFrequency (%)
S 103
 
9.8%
A 87
 
8.3%
E 77
 
7.3%
K 73
 
7.0%
B 67
 
6.4%
R 65
 
6.2%
M 65
 
6.2%
C 62
 
5.9%
J 51
 
4.9%
D 47
 
4.5%
Other values (16) 351
33.5%
Lowercase Letter
ValueCountFrequency (%)
e 133
11.8%
a 120
10.6%
n 110
 
9.8%
r 104
 
9.2%
i 98
 
8.7%
o 80
 
7.1%
s 57
 
5.1%
h 51
 
4.5%
l 50
 
4.4%
c 43
 
3.8%
Other values (15) 282
25.0%
Decimal Number
ValueCountFrequency (%)
4 13
18.3%
1 11
15.5%
0 10
14.1%
2 10
14.1%
9 9
12.7%
8 7
9.9%
5 4
 
5.6%
6 3
 
4.2%
3 3
 
4.2%
7 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 198
54.0%
; 77
 
21.0%
, 44
 
12.0%
: 31
 
8.4%
& 7
 
1.9%
/ 5
 
1.4%
# 3
 
0.8%
· 2
 
0.5%
Math Symbol
ValueCountFrequency (%)
< 9
45.0%
> 9
45.0%
| 1
 
5.0%
~ 1
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 121
95.3%
5
 
3.9%
] 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 121
95.3%
5
 
3.9%
[ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
3836
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39805
85.5%
Common 4554
 
9.8%
Latin 2176
 
4.7%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1793
 
4.5%
1381
 
3.5%
887
 
2.2%
783
 
2.0%
712
 
1.8%
585
 
1.5%
512
 
1.3%
504
 
1.3%
410
 
1.0%
407
 
1.0%
Other values (798) 31831
80.0%
Latin
ValueCountFrequency (%)
e 133
 
6.1%
a 120
 
5.5%
n 110
 
5.1%
r 104
 
4.8%
S 103
 
4.7%
i 98
 
4.5%
A 87
 
4.0%
o 80
 
3.7%
E 77
 
3.5%
K 73
 
3.4%
Other values (41) 1191
54.7%
Common
ValueCountFrequency (%)
3836
84.2%
. 198
 
4.3%
) 121
 
2.7%
( 121
 
2.7%
; 77
 
1.7%
, 44
 
1.0%
: 31
 
0.7%
4 13
 
0.3%
1 11
 
0.2%
0 10
 
0.2%
Other values (20) 92
 
2.0%
Han
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39803
85.5%
ASCII 6718
 
14.4%
None 12
 
< 0.1%
CJK 12
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3836
57.1%
. 198
 
2.9%
e 133
 
2.0%
) 121
 
1.8%
( 121
 
1.8%
a 120
 
1.8%
n 110
 
1.6%
r 104
 
1.5%
S 103
 
1.5%
i 98
 
1.5%
Other values (68) 1774
26.4%
Hangul
ValueCountFrequency (%)
1793
 
4.5%
1381
 
3.5%
887
 
2.2%
783
 
2.0%
712
 
1.8%
585
 
1.5%
512
 
1.3%
504
 
1.3%
410
 
1.0%
407
 
1.0%
Other values (797) 31829
80.0%
None
ValueCountFrequency (%)
5
41.7%
5
41.7%
· 2
 
16.7%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%

isbn
Text

MISSING 

Distinct: 9646
Distinct (%): 97.6%
Missing: 120
Missing (%): 1.2%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.8419028
Min length: 1

Characters and Unicode

Total characters: 97238
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9644
Unique (%): 97.6%

Sample

1st row: 8992168691
2nd row: 8959802387
3rd row: 1191464334
4th row: 1160946876
5th row: 8950971984

Value | Count | Frequency (%)
9.79e+12 | 202 | 2.0%
1.00e+12 | 34 | 0.3%
8991731570 | 1 | < 0.1%
8956895007 | 1 | < 0.1%
477 | 1 | < 0.1%
8931911254 | 1 | < 0.1%
8992168691 | 1 | < 0.1%
8940012186 | 1 | < 0.1%
116386028x | 1 | < 0.1%
8969020152 | 1 | < 0.1%
Other values (9636) | 9636 | 97.5%
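
The two most frequent isbn values, 9.79e+12 (202 rows) and 1.00e+12 (34 rows), are not ISBNs but numbers written in scientific notation, and 477 is too short to be one. A plausible cause, stated here only as an assumption, is that longer identifiers were converted to numbers by a spreadsheet before export. The sketch below separates valid ISBN-10 strings from such values using the standard mod-11 check digit; the function names and the three labels are illustrative.

```python
import re

# Matches values such as "9.79e+12": a number rendered in scientific notation.
SCI_NOTATION = re.compile(r"^\d(\.\d+)?e\+\d+$", re.IGNORECASE)


def is_valid_isbn10(value):
    """Check length, characters, and the mod-11 check digit of an ISBN-10."""
    value = value.upper()
    if not re.fullmatch(r"\d{9}[\dX]", value):
        return False
    digits = [10 if ch == "X" else int(ch) for ch in value]
    # Weights run 10, 9, ..., 1 from the first to the last character.
    return sum(w * d for w, d in zip(range(10, 0, -1), digits)) % 11 == 0


def classify_isbn(value):
    """Label a raw isbn cell as 'isbn10', 'scientific', or 'malformed'."""
    value = value.strip()
    if SCI_NOTATION.match(value):
        return "scientific"
    return "isbn10" if is_valid_isbn10(value) else "malformed"


for raw in ["8992168691", "898019885X", "9.79e+12", "477"]:
    print(raw, "->", classify_isbn(raw))
```

Rows labelled `scientific` cannot be repaired from this column alone, because the trailing digits were lost when the value was rounded.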

Most occurring characters

ValueCountFrequency (%)
9 15721
16.2%
8 13322
13.7%
1 12571
12.9%
0 10346
10.6%
5 7926
8.2%
6 7909
8.1%
2 7239
7.4%
3 6931
7.1%
7 6874
7.1%
4 6613
6.8%
Other values (8) 1786
 
1.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 95452 | 98.2%
Uppercase Letter | 1314 | 1.4%
Math Symbol | 236 | 0.2%
Other Punctuation | 236 | 0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 15721
16.5%
8 13322
14.0%
1 12571
13.2%
0 10346
10.8%
5 7926
8.3%
6 7909
8.3%
2 7239
7.6%
3 6931
7.3%
7 6874
7.2%
4 6613
6.9%
Uppercase Letter
ValueCountFrequency (%)
X 805
61.3%
D 265
 
20.2%
E 237
 
18.0%
P 5
 
0.4%
A 1
 
0.1%
B 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 236
100.0%
Other Punctuation
ValueCountFrequency (%)
. 236
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 95924 | 98.6%
Latin | 1314 | 1.4%

Most frequent character per script

Common
ValueCountFrequency (%)
9 15721
16.4%
8 13322
13.9%
1 12571
13.1%
0 10346
10.8%
5 7926
8.3%
6 7909
8.2%
2 7239
7.5%
3 6931
7.2%
7 6874
7.2%
4 6613
6.9%
Other values (2) 472
 
0.5%
Latin
ValueCountFrequency (%)
X 805
61.3%
D 265
 
20.2%
E 237
 
18.0%
P 5
 
0.4%
A 1
 
0.1%
B 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97238
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 15721
16.2%
8 13322
13.7%
1 12571
12.9%
0 10346
10.6%
5 7926
8.2%
6 7909
8.1%
2 7239
7.4%
3 6931
7.1%
7 6874
7.1%
4 6613
6.8%
Other values (8) 1786
 
1.8%

형식
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
전자책: 9872
오디오북: 128

Length

Max length: 4
Median length: 3
Mean length: 3.0128
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전자책
2nd row: 전자책
3rd row: 전자책
4th row: 전자책
5th row: 전자책

Common Values

Value | Count | Frequency (%)
전자책 | 9872 | 98.7%
오디오북 | 128 | 1.3%

[Figure: histogram of lengths of the category]

[Figure: common values plot]

카테고리
Text

Distinct: 210
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 20
Median length: 12
Mean length: 4.1126
Min length: 1

Characters and Unicode

Total characters: 41126
Distinct characters: 235
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 0.4%

Sample

1st row: 경영
2nd row: 어린이(초등)
3rd row: 경영
4th row: 어린이(초등)
5th row: 영화/연극

Value | Count | Frequency (%)
소설 | 1217 | 11.9%
자기계발 | 888 | 8.7%
경영 | 782 | 7.6%
에세이 | 753 | 7.4%
어린이학습 | 742 | 7.3%
한국동화 | 737 | 7.2%
그림책 | 309 | 3.0%
장르소설 | 286 | 2.8%
신화/전설 | 255 | 2.5%
인문교양 | 249 | 2.4%
Other values (204) | 4013 | 39.2%

Most occurring characters

ValueCountFrequency (%)
/ 2498
 
6.1%
2235
 
5.4%
1778
 
4.3%
1760
 
4.3%
1598
 
3.9%
1569
 
3.8%
1407
 
3.4%
1069
 
2.6%
1055
 
2.6%
1055
 
2.6%
Other values (225) 25102
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38002
92.4%
Other Punctuation 2499
 
6.1%
Space Separator 231
 
0.6%
Close Punctuation 131
 
0.3%
Open Punctuation 131
 
0.3%
Lowercase Letter 90
 
0.2%
Uppercase Letter 42
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2235
 
5.9%
1778
 
4.7%
1760
 
4.6%
1598
 
4.2%
1569
 
4.1%
1407
 
3.7%
1069
 
2.8%
1055
 
2.8%
1055
 
2.8%
1044
 
2.7%
Other values (204) 23432
61.7%
Lowercase Letter
ValueCountFrequency (%)
g 10
11.1%
n 10
11.1%
w 10
11.1%
e 10
11.1%
t 10
11.1%
i 10
11.1%
k 10
11.1%
o 10
11.1%
r 10
11.1%
Uppercase Letter
ValueCountFrequency (%)
S 12
28.6%
O 12
28.6%
N 10
23.8%
T 2
 
4.8%
I 2
 
4.8%
F 2
 
4.8%
A 2
 
4.8%
Other Punctuation
ValueCountFrequency (%)
/ 2498
> 99.9%
, 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 131
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38002
92.4%
Common 2992
 
7.3%
Latin 132
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2235
 
5.9%
1778
 
4.7%
1760
 
4.6%
1598
 
4.2%
1569
 
4.1%
1407
 
3.7%
1069
 
2.8%
1055
 
2.8%
1055
 
2.8%
1044
 
2.7%
Other values (204) 23432
61.7%
Latin
ValueCountFrequency (%)
S 12
9.1%
O 12
9.1%
g 10
 
7.6%
n 10
 
7.6%
w 10
 
7.6%
N 10
 
7.6%
e 10
 
7.6%
t 10
 
7.6%
i 10
 
7.6%
k 10
 
7.6%
Other values (6) 28
21.2%
Common
ValueCountFrequency (%)
/ 2498
83.5%
231
 
7.7%
) 131
 
4.4%
( 131
 
4.4%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38002
92.4%
ASCII 3124
 
7.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 2498
80.0%
231
 
7.4%
) 131
 
4.2%
( 131
 
4.2%
S 12
 
0.4%
O 12
 
0.4%
g 10
 
0.3%
n 10
 
0.3%
w 10
 
0.3%
N 10
 
0.3%
Other values (11) 69
 
2.2%
Hangul
ValueCountFrequency (%)
2235
 
5.9%
1778
 
4.7%
1760
 
4.6%
1598
 
4.2%
1569
 
4.1%
1407
 
3.7%
1069
 
2.8%
1055
 
2.8%
1055
 
2.8%
1044
 
2.7%
Other values (204) 23432
61.7%

출판사
Text

Distinct: 1835
Distinct (%): 18.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 34
Median length: 29
Mean length: 4.5972
Min length: 1

Characters and Unicode

Total characters: 45972
Distinct characters: 676
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 743
Unique (%): 7.4%

Sample

1st row: e비즈북스
2nd row: 예손미디어
3rd row: 미래의창
4th row: 사계절
5th row: 아르테(arte)

Value | Count | Frequency (%)
한국교육문화연구원 | 187 | 1.8%
문학동네 | 131 | 1.3%
위즈덤하우스 | 128 | 1.3%
자음과모음 | 92 | 0.9%
21세기북스 | 86 | 0.8%
지경사 | 83 | 0.8%
이북코리아 | 82 | 0.8%
삼성출판사 | 74 | 0.7%
대교출판 | 73 | 0.7%
알에이치코리아 | 67 | 0.7%
Other values (1867) | 9153 | 90.1%

Most occurring characters

ValueCountFrequency (%)
2374
 
5.2%
1653
 
3.6%
1286
 
2.8%
1265
 
2.8%
960
 
2.1%
925
 
2.0%
853
 
1.9%
710
 
1.5%
602
 
1.3%
578
 
1.3%
Other values (666) 34766
75.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43022
93.6%
Uppercase Letter 1188
 
2.6%
Lowercase Letter 901
 
2.0%
Decimal Number 245
 
0.5%
Open Punctuation 188
 
0.4%
Close Punctuation 188
 
0.4%
Space Separator 157
 
0.3%
Other Punctuation 82
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2374
 
5.5%
1653
 
3.8%
1286
 
3.0%
1265
 
2.9%
960
 
2.2%
925
 
2.2%
853
 
2.0%
710
 
1.7%
602
 
1.4%
578
 
1.3%
Other values (601) 31816
74.0%
Uppercase Letter
ValueCountFrequency (%)
O 181
15.2%
B 164
13.8%
S 130
10.9%
K 98
 
8.2%
M 85
 
7.2%
P 67
 
5.6%
H 67
 
5.6%
I 49
 
4.1%
E 49
 
4.1%
A 36
 
3.0%
Other values (15) 262
22.1%
Lowercase Letter
ValueCountFrequency (%)
e 135
15.0%
o 108
12.0%
r 98
10.9%
s 86
9.5%
a 76
8.4%
l 48
 
5.3%
i 48
 
5.3%
t 45
 
5.0%
n 38
 
4.2%
h 32
 
3.6%
Other values (13) 187
20.8%
Decimal Number
ValueCountFrequency (%)
2 108
44.1%
1 100
40.8%
0 11
 
4.5%
3 8
 
3.3%
4 7
 
2.9%
6 6
 
2.4%
5 2
 
0.8%
9 1
 
0.4%
7 1
 
0.4%
8 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
& 65
79.3%
. 14
 
17.1%
: 3
 
3.7%
Open Punctuation
ValueCountFrequency (%)
( 188
100.0%
Close Punctuation
ValueCountFrequency (%)
) 188
100.0%
Space Separator
ValueCountFrequency (%)
157
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43021
93.6%
Latin 2089
 
4.5%
Common 861
 
1.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2374
 
5.5%
1653
 
3.8%
1286
 
3.0%
1265
 
2.9%
960
 
2.2%
925
 
2.2%
853
 
2.0%
710
 
1.7%
602
 
1.4%
578
 
1.3%
Other values (600) 31815
74.0%
Latin
ValueCountFrequency (%)
O 181
 
8.7%
B 164
 
7.9%
e 135
 
6.5%
S 130
 
6.2%
o 108
 
5.2%
r 98
 
4.7%
K 98
 
4.7%
s 86
 
4.1%
M 85
 
4.1%
a 76
 
3.6%
Other values (38) 928
44.4%
Common
ValueCountFrequency (%)
( 188
21.8%
) 188
21.8%
157
18.2%
2 108
12.5%
1 100
11.6%
& 65
 
7.5%
. 14
 
1.6%
0 11
 
1.3%
3 8
 
0.9%
4 7
 
0.8%
Other values (7) 15
 
1.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43021
93.6%
ASCII 2950
 
6.4%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2374
 
5.5%
1653
 
3.8%
1286
 
3.0%
1265
 
2.9%
960
 
2.2%
925
 
2.2%
853
 
2.0%
710
 
1.7%
602
 
1.4%
578
 
1.3%
Other values (600) 31815
74.0%
ASCII
ValueCountFrequency (%)
( 188
 
6.4%
) 188
 
6.4%
O 181
 
6.1%
B 164
 
5.6%
157
 
5.3%
e 135
 
4.6%
S 130
 
4.4%
2 108
 
3.7%
o 108
 
3.7%
1 100
 
3.4%
Other values (55) 1491
50.5%
CJK
ValueCountFrequency (%)
1
100.0%

표지url
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 90
Median length: 89
Mean length: 86.7655
Min length: 26

Characters and Unicode

Total characters: 867655
Distinct characters: 49
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: " http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808992168694/L4808992168694.jpg"
2nd row: " http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808959802388/L4808959802388.jpg"
3rd row: "https://ebook.seocholib.or.kr/upload/20553/content/ebook/4801191464338/L4801191464338.jpg"
4th row: "https://ebook.seocholib.or.kr/upload/20553/content//ebook/4801160946872/L4801160946872.jpg"
5th row: " http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808950971984/L4808950971984.jpg"

Value | Count | Frequency (%)
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808992168694/l4808992168694.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808940012185/l4808940012185.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808931911251/l4808931911251.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808955861549/l4808955861549.jpg | 1 | < 0.1%
https://ebook.seocholib.or.kr/upload/20553/content//ebook/4801163860281/l4801163860281.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808969020154/l4808969020154.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808957571088/l4808957571088.jpg | 1 | < 0.1%
http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808963701448/l4808963701448.jpg | 1 | < 0.1%
https://ebook.seocholib.or.kr/upload/20553/content/ebook/4801189550685/l4801189550685.jpg | 1 | < 0.1%
res/img/eco/lsize/prd000141149.jpg | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%
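
The sample values show three irregularities in 표지url: a leading space before some URLs, a doubled slash in the path (`content//ebook`), and at least one relative path (`res/img/eco/lsize/prd000141149.jpg`). The sketch below is one possible cleanup, not part of the report: it strips whitespace, collapses repeated slashes in the path, and flags relative values instead of guessing their base URL. The function name is hypothetical.

```python
import re
from urllib.parse import urlsplit, urlunsplit


def normalize_cover_url(raw):
    """Return (cleaned value, is_absolute) for one 표지url cell."""
    value = raw.strip()  # drop leading/trailing whitespace
    parts = urlsplit(value)
    if not (parts.scheme and parts.netloc):
        return value, False  # relative path: leave as-is and flag it
    path = re.sub(r"/{2,}", "/", parts.path)  # "content//ebook" -> "content/ebook"
    cleaned = urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))
    return cleaned, True


print(normalize_cover_url(
    "https://ebook.seocholib.or.kr/upload/20553/content//ebook/4801160946872/L4801160946872.jpg"
))
print(normalize_cover_url("res/img/eco/lsize/prd000141149.jpg"))
```

The scheme is left untouched on purpose: whether the `http://` entries should be rewritten to `https://` depends on the server and cannot be decided from the report.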

Most occurring characters

ValueCountFrequency (%)
o 86370
 
10.0%
/ 79120
 
9.1%
0 50575
 
5.8%
8 46430
 
5.4%
e 39097
 
4.5%
. 38833
 
4.5%
t 38446
 
4.4%
5 35929
 
4.1%
4 32742
 
3.8%
9 30983
 
3.6%
Other values (39) 389130
44.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 418495
48.2%
Decimal Number 299726
34.5%
Other Punctuation 127564
 
14.7%
Uppercase Letter 13665
 
1.6%
Space Separator 8205
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 86370
20.6%
e 39097
9.3%
t 38446
9.2%
p 29188
 
7.0%
b 28705
 
6.9%
k 28704
 
6.9%
r 19614
 
4.7%
n 19225
 
4.6%
h 19223
 
4.6%
l 19223
 
4.6%
Other values (12) 90700
21.7%
Uppercase Letter
ValueCountFrequency (%)
L 9999
73.2%
D 810
 
5.9%
G 423
 
3.1%
E 391
 
2.9%
I 389
 
2.8%
M 389
 
2.8%
C 389
 
2.8%
O 389
 
2.8%
P 246
 
1.8%
R 202
 
1.5%
Other values (3) 38
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 50575
16.9%
8 46430
15.5%
5 35929
12.0%
4 32742
10.9%
9 30983
10.3%
1 25073
8.4%
2 24029
8.0%
3 23795
7.9%
6 16400
 
5.5%
7 13770
 
4.6%
Other Punctuation
ValueCountFrequency (%)
/ 79120
62.0%
. 38833
30.4%
: 9611
 
7.5%
Space Separator
ValueCountFrequency (%)
8205
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 435495
50.2%
Latin 432160
49.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 86370
20.0%
e 39097
 
9.0%
t 38446
 
8.9%
p 29188
 
6.8%
b 28705
 
6.6%
k 28704
 
6.6%
r 19614
 
4.5%
n 19225
 
4.4%
h 19223
 
4.4%
l 19223
 
4.4%
Other values (25) 104365
24.1%
Common
ValueCountFrequency (%)
/ 79120
18.2%
0 50575
11.6%
8 46430
10.7%
. 38833
8.9%
5 35929
8.3%
4 32742
7.5%
9 30983
 
7.1%
1 25073
 
5.8%
2 24029
 
5.5%
3 23795
 
5.5%
Other values (4) 47986
11.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 867655
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 86370
 
10.0%
/ 79120
 
9.1%
0 50575
 
5.8%
8 46430
 
5.4%
e 39097
 
4.5%
. 38833
 
4.5%
t 38446
 
4.4%
5 35929
 
4.1%
4 32742
 
3.8%
9 30983
 
3.6%
Other values (39) 389130
44.8%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
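
The per-column nullity that these figures plot amounts to counting empty cells in each column. The sketch below does this for rows held as dictionaries, with `None` standing in for the `<NA>` marker; the two rows are abridged from values shown in this report, and the function name is illustrative.

```python
def missing_by_column(rows, columns):
    """Count cells that are None in each column of a list of dict rows."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}


# Two abridged rows; None stands for <NA>.
rows = [
    {"도서명": "포스퀘어 스토리", "isbn": "8992168691", "형식": "전자책"},
    {"도서명": "로빈슨 크루소", "isbn": None, "형식": "오디오북"},
]
nullity = missing_by_column(rows, ["도서명", "isbn", "형식"])
print(nullity)  # {'도서명': 0, 'isbn': 1, '형식': 0}
```

In the full dataset only isbn has missing values (120 of 10000 rows), so its bar is the only one that falls short of the row count.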

Sample

First rows

도서관명 | 도서관홈페이지url | 도서명 | 저자명 | isbn | 형식 | 카테고리 | 출판사 | 표지url
4418서초구전자도서관https://e-book.seocholib.or.kr/main포스퀘어 스토리임홍택8992168691전자책경영e비즈북스http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808992168694/L4808992168694.jpg
14344서초구전자도서관https://e-book.seocholib.or.kr/main어른이 되면 무슨 일을 할까요어린이경제교육연구회8959802387전자책어린이(초등)예손미디어http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808959802388/L4808959802388.jpg
2902서초구전자도서관https://e-book.seocholib.or.kr/main경제학 100 문장댄 스미스1191464334전자책경영미래의창https://ebook.seocholib.or.kr/upload/20553/content/ebook/4801191464338/L4801191464338.jpg
14078서초구전자도서관https://e-book.seocholib.or.kr/main내 친구의 집우미옥1160946876전자책어린이(초등)사계절https://ebook.seocholib.or.kr/upload/20553/content//ebook/4801160946872/L4801160946872.jpg
12001서초구전자도서관https://e-book.seocholib.or.kr/main더 테이블김종관8950971984전자책영화/연극아르테(arte)http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808950971984/L4808950971984.jpg
32670서초구전자도서관https://e-book.seocholib.or.kr/main이직의 정석정구철1188331671전자책자기계발스노우폭스북스http://ebook.seocholib.or.kr/upload/20553/content/ebook/4801188331674/L4801188331674.jpg
2444서초구전자도서관https://e-book.seocholib.or.kr/main나의 첫 모빌리티 수업조정희1167850742전자책경영슬로디미디어https://ebook.seocholib.or.kr/upload/20553/content/ebook/4801167850745/L4801167850745.jpg
10857서초구전자도서관https://e-book.seocholib.or.kr/mainENJOY 제주 Part1 지역여행강석균8959943177전자책취미생활넥서스BOOKShttp://ebook.seocholib.or.kr/upload/20553/content/ebook/4808959943173/L4808959943173.jpg
18934서초구전자도서관https://e-book.seocholib.or.kr/main나만의 박물관에마 루이스1186670908전자책한국동화책속물고기http://ebook.seocholib.or.kr/upload/20553/content/ebook/4801186670904/L4801186670904.jpg
34770서초구전자도서관https://e-book.seocholib.or.kr/main달란트이야기이종선8992060068전자책자기계발토네이도http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808992060066/L4808992060066.jpg

Last rows

도서관명 | 도서관홈페이지url | 도서명 | 저자명 | isbn | 형식 | 카테고리 | 출판사 | 표지url
23391서초구전자도서관https://e-book.seocholib.or.kr/main메피스토클라우스 만8901106876전자책소설펭귄클래식코리아http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808901106878/L4808901106878.jpg
27471서초구전자도서관https://e-book.seocholib.or.kr/main대지의 힘류재상9070036270전자책한국문학도서관http://ebook.seocholib.or.kr/upload/20553/content/ebook/4809070036270/L4809070036270.jpg
23379서초구전자도서관https://e-book.seocholib.or.kr/main신문물검역소강지영8901099772전자책소설시작http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808901099774/L4808901099774.jpg
18025서초구전자도서관https://e-book.seocholib.or.kr/main고구려 소년 담덕 유목민 소년 테무친을 만나다김용만8992010192전자책어린이학습스콜라http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808992010191/L4808992010191.jpg
23411서초구전자도서관https://e-book.seocholib.or.kr/main나를 사랑한 스파이 007 시리즈이언 플레밍8901122979전자책소설http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808901122977/L4808901122977.jpg
7488서초구전자도서관https://e-book.seocholib.or.kr/main인간과 자아 그 딜레마와 그림자서광조8995979364전자책심리학과학과철학http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808995979365/L4808995979365.jpg
5174서초구전자도서관https://e-book.seocholib.or.kr/main신기술 실패의 원인 [도서요약]Pip Coburn9070026050전자책경제Portfolio Hardcoverhttp://ebook.seocholib.or.kr/upload/20553/content/ebook/4809070026059/L4809070026059.jpg
7809서초구전자도서관https://e-book.seocholib.or.kr/main자살자가본 사후세계(심령과학17)안동민 역3366001992전자책기타종교서음출판사http://ebook.seocholib.or.kr/upload/20553/content/ebook/4803366001997/L4803366001997.jpg
39474서초구전자도서관https://e-book.seocholib.or.kr/main로빈슨 크루소다니엘 디포<NA>오디오북드라마 세계명작이북코리아http://ebook.seocholib.or.kr/upload/20553/content/audio/5800046861390/L5800046861390.jpg
17614서초구전자도서관https://e-book.seocholib.or.kr/main우리 아이 첫 경주여행 1박광일898019885X전자책어린이학습삼성당아이http://ebook.seocholib.or.kr/upload/20553/content/ebook/4808980198856/L4808980198856.jpg