Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells2106
Missing cells (%)2.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory781.2 KiB
Average record size in memory80.0 B

Variable types

Categorical4
Text5

Dataset

Description서울특별시 강남구의 전자책 보유 목록입니다. 기타 자세한 사항은 서울특별시 강남구 문화도시과(02-3423-5954)로 주시면 자세히 안내해 드리겠습니다.
URLhttps://www.data.go.kr/data/15112734/fileData.do

Alerts

도서관명 has constant value ""Constant
도서관홈페이지 주소 has constant value ""Constant
국제표준도서번호 has 1982 (19.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:08:08.298618
Analysis finished2023-12-12 14:08:10.751874
Duration2.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
강남구전자도서관
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강남구전자도서관
2nd row강남구전자도서관
3rd row강남구전자도서관
4th row강남구전자도서관
5th row강남구전자도서관

Common Values

ValueCountFrequency (%)
강남구전자도서관 10000
100.0%

Length

2023-12-12T23:08:10.828082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:08:10.931495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강남구전자도서관 10000
100.0%

도서관홈페이지 주소
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
https://ebook.gangnam.go.kr
10000 

Length

Max length27
Median length27
Mean length27
Min length27

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://ebook.gangnam.go.kr
2nd rowhttps://ebook.gangnam.go.kr
3rd rowhttps://ebook.gangnam.go.kr
4th rowhttps://ebook.gangnam.go.kr
5th rowhttps://ebook.gangnam.go.kr

Common Values

ValueCountFrequency (%)
https://ebook.gangnam.go.kr 10000
100.0%

Length

2023-12-12T23:08:11.052069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:08:11.151556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://ebook.gangnam.go.kr 10000
100.0%
Distinct9955
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:08:11.502347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length58
Mean length14.7026
Min length1

Characters and Unicode

Total characters147026
Distinct characters1392
Distinct categories17 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9912 ?
Unique (%)99.1%

Sample

1st row송건호 전집 20 - 역사에서 배운다
2nd row벌꿀 공장
3rd row화룡의 군주 2 - 갈림길
4th row나의 살인자에게
5th row니체와 장자는 이렇게 말했다
ValueCountFrequency (%)
2091
 
5.3%
2 352
 
0.9%
1 340
 
0.9%
이야기 287
 
0.7%
3 190
 
0.5%
나는 180
 
0.5%
위한 147
 
0.4%
읽는 139
 
0.3%
the 139
 
0.3%
129
 
0.3%
Other values (16785) 35819
90.0%
2023-12-12T23:08:12.139071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29813
 
20.3%
2987
 
2.0%
2792
 
1.9%
2205
 
1.5%
- 2032
 
1.4%
1649
 
1.1%
1579
 
1.1%
1469
 
1.0%
1432
 
1.0%
1300
 
0.9%
Other values (1382) 99768
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 98172
66.8%
Space Separator 29813
 
20.3%
Lowercase Letter 6831
 
4.6%
Decimal Number 4614
 
3.1%
Uppercase Letter 2563
 
1.7%
Dash Punctuation 2032
 
1.4%
Other Punctuation 1694
 
1.2%
Open Punctuation 593
 
0.4%
Close Punctuation 592
 
0.4%
Math Symbol 48
 
< 0.1%
Other values (7) 74
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2987
 
3.0%
2792
 
2.8%
2205
 
2.2%
1649
 
1.7%
1579
 
1.6%
1469
 
1.5%
1432
 
1.5%
1300
 
1.3%
1287
 
1.3%
1182
 
1.2%
Other values (1278) 80290
81.8%
Lowercase Letter
ValueCountFrequency (%)
e 964
14.1%
a 586
 
8.6%
o 586
 
8.6%
n 576
 
8.4%
i 562
 
8.2%
t 468
 
6.9%
s 463
 
6.8%
r 414
 
6.1%
h 346
 
5.1%
l 316
 
4.6%
Other values (16) 1550
22.7%
Uppercase Letter
ValueCountFrequency (%)
T 295
 
11.5%
E 199
 
7.8%
A 192
 
7.5%
S 188
 
7.3%
C 159
 
6.2%
O 154
 
6.0%
M 141
 
5.5%
I 139
 
5.4%
L 113
 
4.4%
B 110
 
4.3%
Other values (16) 873
34.1%
Other Punctuation
ValueCountFrequency (%)
, 676
39.9%
: 364
21.5%
. 224
 
13.2%
? 147
 
8.7%
! 102
 
6.0%
' 42
 
2.5%
/ 39
 
2.3%
· 39
 
2.3%
& 23
 
1.4%
% 16
 
0.9%
Other values (6) 22
 
1.3%
Decimal Number
ValueCountFrequency (%)
1 1161
25.2%
2 901
19.5%
0 701
15.2%
3 553
12.0%
4 319
 
6.9%
5 311
 
6.7%
6 198
 
4.3%
9 169
 
3.7%
8 151
 
3.3%
7 150
 
3.3%
Math Symbol
ValueCountFrequency (%)
~ 31
64.6%
+ 10
 
20.8%
| 4
 
8.3%
= 2
 
4.2%
÷ 1
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 473
79.8%
[ 113
 
19.1%
4
 
0.7%
3
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 472
79.7%
] 113
 
19.1%
4
 
0.7%
3
 
0.5%
Letter Number
ValueCountFrequency (%)
14
51.9%
9
33.3%
4
 
14.8%
Other Symbol
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Space Separator
ValueCountFrequency (%)
29813
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2032
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 21
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 8
100.0%
Final Punctuation
ValueCountFrequency (%)
8
100.0%
Initial Punctuation
ValueCountFrequency (%)
6
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 98110
66.7%
Common 39433
26.8%
Latin 9421
 
6.4%
Han 62
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2987
 
3.0%
2792
 
2.8%
2205
 
2.2%
1649
 
1.7%
1579
 
1.6%
1469
 
1.5%
1432
 
1.5%
1300
 
1.3%
1287
 
1.3%
1182
 
1.2%
Other values (1228) 80228
81.8%
Latin
ValueCountFrequency (%)
e 964
 
10.2%
a 586
 
6.2%
o 586
 
6.2%
n 576
 
6.1%
i 562
 
6.0%
t 468
 
5.0%
s 463
 
4.9%
r 414
 
4.4%
h 346
 
3.7%
l 316
 
3.4%
Other values (45) 4140
43.9%
Han
ValueCountFrequency (%)
4
 
6.5%
3
 
4.8%
3
 
4.8%
2
 
3.2%
2
 
3.2%
2
 
3.2%
2
 
3.2%
2
 
3.2%
1
 
1.6%
1
 
1.6%
Other values (40) 40
64.5%
Common
ValueCountFrequency (%)
29813
75.6%
- 2032
 
5.2%
1 1161
 
2.9%
2 901
 
2.3%
0 701
 
1.8%
, 676
 
1.7%
3 553
 
1.4%
( 473
 
1.2%
) 472
 
1.2%
: 364
 
0.9%
Other values (39) 2287
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 98070
66.7%
ASCII 48745
33.2%
CJK 60
 
< 0.1%
None 58
 
< 0.1%
Compat Jamo 40
 
< 0.1%
Number Forms 27
 
< 0.1%
Punctuation 20
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29813
61.2%
- 2032
 
4.2%
1 1161
 
2.4%
e 964
 
2.0%
2 901
 
1.8%
0 701
 
1.4%
, 676
 
1.4%
a 586
 
1.2%
o 586
 
1.2%
n 576
 
1.2%
Other values (75) 10749
 
22.1%
Hangul
ValueCountFrequency (%)
2987
 
3.0%
2792
 
2.8%
2205
 
2.2%
1649
 
1.7%
1579
 
1.6%
1469
 
1.5%
1432
 
1.5%
1300
 
1.3%
1287
 
1.3%
1182
 
1.2%
Other values (1222) 80188
81.8%
None
ValueCountFrequency (%)
· 39
67.2%
4
 
6.9%
4
 
6.9%
3
 
5.2%
3
 
5.2%
2
 
3.4%
1
 
1.7%
1
 
1.7%
÷ 1
 
1.7%
Compat Jamo
ValueCountFrequency (%)
33
82.5%
2
 
5.0%
2
 
5.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%
Number Forms
ValueCountFrequency (%)
14
51.9%
9
33.3%
4
 
14.8%
Punctuation
ValueCountFrequency (%)
8
40.0%
6
30.0%
6
30.0%
CJK
ValueCountFrequency (%)
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
1
 
1.7%
Other values (38) 38
63.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Distinct6487
Distinct (%)65.2%
Missing46
Missing (%)0.5%
Memory size156.2 KiB
2023-12-12T23:08:12.489633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length53
Mean length6.793048
Min length1

Characters and Unicode

Total characters67618
Distinct characters955
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5395 ?
Unique (%)54.2%

Sample

1st row송건호
2nd row위르겐 타우츠 외/유영미 역
3rd row김형준
4th row아스트리드 홀레이더르/김지원 역
5th row양승권
ValueCountFrequency (%)
1988
 
11.4%
편집부 925
 
5.3%
376
 
2.2%
classic 122
 
0.7%
house 122
 
0.7%
두산동아편집부 81
 
0.5%
이북코리아 76
 
0.4%
엑스트라클래스 75
 
0.4%
이키드북 72
 
0.4%
도토리편집부 56
 
0.3%
Other values (8071) 13529
77.7%
2023-12-12T23:08:13.322270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7502
 
11.1%
2286
 
3.4%
/ 2103
 
3.1%
2007
 
3.0%
1462
 
2.2%
1277
 
1.9%
1256
 
1.9%
1209
 
1.8%
1146
 
1.7%
1010
 
1.5%
Other values (945) 46360
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52512
77.7%
Space Separator 7502
 
11.1%
Lowercase Letter 3276
 
4.8%
Other Punctuation 2631
 
3.9%
Uppercase Letter 1171
 
1.7%
Close Punctuation 185
 
0.3%
Open Punctuation 185
 
0.3%
Math Symbol 87
 
0.1%
Decimal Number 51
 
0.1%
Dash Punctuation 12
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2286
 
4.4%
2007
 
3.8%
1462
 
2.8%
1277
 
2.4%
1256
 
2.4%
1209
 
2.3%
1146
 
2.2%
1010
 
1.9%
869
 
1.7%
755
 
1.4%
Other values (860) 39235
74.7%
Lowercase Letter
ValueCountFrequency (%)
s 474
14.5%
e 431
13.2%
a 338
10.3%
i 298
9.1%
o 255
7.8%
l 212
 
6.5%
n 169
 
5.2%
u 164
 
5.0%
r 162
 
4.9%
c 159
 
4.9%
Other values (16) 614
18.7%
Uppercase Letter
ValueCountFrequency (%)
C 174
14.9%
H 162
13.8%
B 83
 
7.1%
S 75
 
6.4%
M 74
 
6.3%
T 70
 
6.0%
P 65
 
5.6%
A 61
 
5.2%
J 54
 
4.6%
E 50
 
4.3%
Other values (15) 303
25.9%
Decimal Number
ValueCountFrequency (%)
2 10
19.6%
1 9
17.6%
4 8
15.7%
0 6
11.8%
3 6
11.8%
6 5
9.8%
5 4
 
7.8%
7 2
 
3.9%
9 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
/ 2103
79.9%
, 299
 
11.4%
. 215
 
8.2%
: 7
 
0.3%
& 3
 
0.1%
? 2
 
0.1%
; 1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 179
96.8%
] 2
 
1.1%
2
 
1.1%
2
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 179
96.8%
2
 
1.1%
[ 2
 
1.1%
2
 
1.1%
Math Symbol
ValueCountFrequency (%)
| 81
93.1%
= 4
 
4.6%
< 1
 
1.1%
> 1
 
1.1%
Space Separator
ValueCountFrequency (%)
7502
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52478
77.6%
Common 10659
 
15.8%
Latin 4447
 
6.6%
Han 34
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2286
 
4.4%
2007
 
3.8%
1462
 
2.8%
1277
 
2.4%
1256
 
2.4%
1209
 
2.3%
1146
 
2.2%
1010
 
1.9%
869
 
1.7%
755
 
1.4%
Other values (840) 39201
74.7%
Latin
ValueCountFrequency (%)
s 474
 
10.7%
e 431
 
9.7%
a 338
 
7.6%
i 298
 
6.7%
o 255
 
5.7%
l 212
 
4.8%
C 174
 
3.9%
n 169
 
3.8%
u 164
 
3.7%
r 162
 
3.6%
Other values (41) 1770
39.8%
Common
ValueCountFrequency (%)
7502
70.4%
/ 2103
 
19.7%
, 299
 
2.8%
. 215
 
2.0%
) 179
 
1.7%
( 179
 
1.7%
| 81
 
0.8%
- 12
 
0.1%
2 10
 
0.1%
1 9
 
0.1%
Other values (24) 70
 
0.7%
Han
ValueCountFrequency (%)
5
14.7%
5
14.7%
3
 
8.8%
3
 
8.8%
3
 
8.8%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (10) 10
29.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52477
77.6%
ASCII 15095
 
22.3%
CJK 33
 
< 0.1%
None 9
 
< 0.1%
Punctuation 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7502
49.7%
/ 2103
 
13.9%
s 474
 
3.1%
e 431
 
2.9%
a 338
 
2.2%
, 299
 
2.0%
i 298
 
2.0%
o 255
 
1.7%
. 215
 
1.4%
l 212
 
1.4%
Other values (68) 2968
 
19.7%
Hangul
ValueCountFrequency (%)
2286
 
4.4%
2007
 
3.8%
1462
 
2.8%
1277
 
2.4%
1256
 
2.4%
1209
 
2.3%
1146
 
2.2%
1010
 
1.9%
869
 
1.7%
755
 
1.4%
Other values (839) 39200
74.7%
CJK
ValueCountFrequency (%)
5
15.2%
5
15.2%
3
 
9.1%
3
 
9.1%
3
 
9.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (9) 9
27.3%
None
ValueCountFrequency (%)
2
22.2%
2
22.2%
2
22.2%
2
22.2%
1
11.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct2086
Distinct (%)26.0%
Missing1982
Missing (%)19.8%
Memory size156.2 KiB
2023-12-12T23:08:13.603832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.213644
Min length10

Characters and Unicode

Total characters105947
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2047 ?
Unique (%)25.5%

Sample

1st row9788940000000
2nd row9788930000000
3rd row9788960000000
4th row9791130000000
5th row9791190000000
ValueCountFrequency (%)
9788950000000 559
 
7.0%
9791190000000 557
 
6.9%
9788960000000 543
 
6.8%
5550300000000 437
 
5.5%
9788970000000 435
 
5.4%
9788990000000 401
 
5.0%
9791160000000 374
 
4.7%
9788980000000 288
 
3.6%
9789000000000 240
 
3.0%
5550110000000 224
 
2.8%
Other values (2076) 3960
49.4%
2023-12-12T23:08:13.992569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 47033
44.4%
9 12136
 
11.5%
8 8653
 
8.2%
7827
 
7.4%
1 7509
 
7.1%
5 7257
 
6.8%
7 6382
 
6.0%
6 2769
 
2.6%
3 2316
 
2.2%
2 2037
 
1.9%
Other values (4) 2028
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 97929
92.4%
Space Separator 7827
 
7.4%
Uppercase Letter 191
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 47033
48.0%
9 12136
 
12.4%
8 8653
 
8.8%
1 7509
 
7.7%
5 7257
 
7.4%
7 6382
 
6.5%
6 2769
 
2.8%
3 2316
 
2.4%
2 2037
 
2.1%
4 1837
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
X 164
85.9%
D 26
 
13.6%
K 1
 
0.5%
Space Separator
ValueCountFrequency (%)
7827
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 105756
99.8%
Latin 191
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 47033
44.5%
9 12136
 
11.5%
8 8653
 
8.2%
7827
 
7.4%
1 7509
 
7.1%
5 7257
 
6.9%
7 6382
 
6.0%
6 2769
 
2.6%
3 2316
 
2.2%
2 2037
 
1.9%
Latin
ValueCountFrequency (%)
X 164
85.9%
D 26
 
13.6%
K 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 105947
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 47033
44.4%
9 12136
 
11.5%
8 8653
 
8.2%
7827
 
7.4%
1 7509
 
7.1%
5 7257
 
6.8%
7 6382
 
6.0%
6 2769
 
2.6%
3 2316
 
2.2%
2 2037
 
1.9%
Other values (4) 2028
 
1.9%
Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
EPUB
3495 
ezPDF
2301 
EPUB(Y)
1459 
XML
999 
Multimedia
653 
Other values (6)
1093 

Length

Max length10
Median length7
Mean length4.8617
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEPUB
2nd rowEPUB(Y)
3rd rowEPUB
4th rowEPUB(Y)
5th rowEPUB(Y)

Common Values

ValueCountFrequency (%)
EPUB 3495
34.9%
ezPDF 2301
23.0%
EPUB(Y) 1459
14.6%
XML 999
 
10.0%
Multimedia 653
 
6.5%
PDF 571
 
5.7%
XDF 261
 
2.6%
SWF 165
 
1.7%
<NA> 73
 
0.7%
FLASH 20
 
0.2%

Length

2023-12-12T23:08:14.151836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
epub 3495
34.9%
ezpdf 2301
23.0%
epub(y 1459
14.6%
xml 999
 
10.0%
multimedia 653
 
6.5%
pdf 571
 
5.7%
xdf 261
 
2.6%
swf 165
 
1.7%
na 73
 
0.7%
flash 20
 
0.2%

카테고리
Categorical

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
문학
3169 
어린이
1785 
사회과학
1775 
철학
809 
기술과학
571 
Other values (7)
1891 

Length

Max length4
Median length2
Mean length2.6939
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row총류
2nd row순수과학
3rd row문학
4th row문학
5th row철학

Common Values

ValueCountFrequency (%)
문학 3169
31.7%
어린이 1785
17.8%
사회과학 1775
17.8%
철학 809
 
8.1%
기술과학 571
 
5.7%
역사 470
 
4.7%
언어 417
 
4.2%
예술 260
 
2.6%
총류 231
 
2.3%
종교 228
 
2.3%
Other values (2) 285
 
2.9%

Length

2023-12-12T23:08:14.304849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문학 3169
31.7%
어린이 1785
17.8%
사회과학 1775
17.8%
철학 809
 
8.1%
기술과학 571
 
5.7%
역사 470
 
4.7%
언어 417
 
4.2%
예술 260
 
2.6%
총류 231
 
2.3%
종교 228
 
2.3%
Other values (2) 285
 
2.9%
Distinct1465
Distinct (%)14.8%
Missing78
Missing (%)0.8%
Memory size156.2 KiB
2023-12-12T23:08:14.645017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length26
Mean length4.7152792
Min length1

Characters and Unicode

Total characters46785
Distinct characters627
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique583 ?
Unique (%)5.9%

Sample

1st row한길사
2nd row열린책들
3rd row자음과모음
4th row다산책방
5th row페이퍼로드
ValueCountFrequency (%)
자음과모음 238
 
2.3%
작가문화 221
 
2.2%
문학동네 217
 
2.1%
위즈덤하우스 172
 
1.7%
21세기북스 166
 
1.6%
두산동아 132
 
1.3%
북토피아 124
 
1.2%
주)도서출판푸른숲 120
 
1.2%
한길사 118
 
1.2%
성바오로 110
 
1.1%
Other values (1480) 8546
84.1%
2023-12-12T23:08:15.264197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2110
 
4.5%
1557
 
3.3%
1408
 
3.0%
1190
 
2.5%
1035
 
2.2%
1015
 
2.2%
723
 
1.5%
648
 
1.4%
647
 
1.4%
550
 
1.2%
Other values (617) 35902
76.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41348
88.4%
Lowercase Letter 2543
 
5.4%
Uppercase Letter 1121
 
2.4%
Decimal Number 506
 
1.1%
Close Punctuation 476
 
1.0%
Open Punctuation 472
 
1.0%
Space Separator 242
 
0.5%
Other Punctuation 45
 
0.1%
Other Symbol 16
 
< 0.1%
Dash Punctuation 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2110
 
5.1%
1557
 
3.8%
1408
 
3.4%
1190
 
2.9%
1035
 
2.5%
1015
 
2.5%
723
 
1.7%
648
 
1.6%
647
 
1.6%
550
 
1.3%
Other values (550) 30465
73.7%
Uppercase Letter
ValueCountFrequency (%)
B 157
14.0%
M 98
 
8.7%
Y 78
 
7.0%
T 78
 
7.0%
C 77
 
6.9%
P 67
 
6.0%
L 66
 
5.9%
E 65
 
5.8%
A 62
 
5.5%
K 60
 
5.4%
Other values (15) 313
27.9%
Lowercase Letter
ValueCountFrequency (%)
e 321
12.6%
o 289
11.4%
a 264
10.4%
i 233
 
9.2%
s 202
 
7.9%
r 169
 
6.6%
h 107
 
4.2%
n 107
 
4.2%
l 104
 
4.1%
c 100
 
3.9%
Other values (13) 647
25.4%
Decimal Number
ValueCountFrequency (%)
2 224
44.3%
1 215
42.5%
6 16
 
3.2%
4 13
 
2.6%
0 11
 
2.2%
3 11
 
2.2%
8 6
 
1.2%
5 6
 
1.2%
9 3
 
0.6%
7 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 24
53.3%
& 12
26.7%
6
 
13.3%
# 3
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 476
100.0%
Open Punctuation
ValueCountFrequency (%)
( 472
100.0%
Space Separator
ValueCountFrequency (%)
242
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41354
88.4%
Latin 3664
 
7.8%
Common 1757
 
3.8%
Han 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2110
 
5.1%
1557
 
3.8%
1408
 
3.4%
1190
 
2.9%
1035
 
2.5%
1015
 
2.5%
723
 
1.7%
648
 
1.6%
647
 
1.6%
550
 
1.3%
Other values (549) 30471
73.7%
Latin
ValueCountFrequency (%)
e 321
 
8.8%
o 289
 
7.9%
a 264
 
7.2%
i 233
 
6.4%
s 202
 
5.5%
r 169
 
4.6%
B 157
 
4.3%
h 107
 
2.9%
n 107
 
2.9%
l 104
 
2.8%
Other values (38) 1711
46.7%
Common
ValueCountFrequency (%)
) 476
27.1%
( 472
26.9%
242
13.8%
2 224
12.7%
1 215
12.2%
. 24
 
1.4%
- 16
 
0.9%
6 16
 
0.9%
4 13
 
0.7%
& 12
 
0.7%
Other values (8) 47
 
2.7%
Han
ValueCountFrequency (%)
5
50.0%
5
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41338
88.4%
ASCII 5415
 
11.6%
None 22
 
< 0.1%
CJK 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2110
 
5.1%
1557
 
3.8%
1408
 
3.4%
1190
 
2.9%
1035
 
2.5%
1015
 
2.5%
723
 
1.7%
648
 
1.6%
647
 
1.6%
550
 
1.3%
Other values (548) 30455
73.7%
ASCII
ValueCountFrequency (%)
) 476
 
8.8%
( 472
 
8.7%
e 321
 
5.9%
o 289
 
5.3%
a 264
 
4.9%
242
 
4.5%
i 233
 
4.3%
2 224
 
4.1%
1 215
 
4.0%
s 202
 
3.7%
Other values (55) 2477
45.7%
None
ValueCountFrequency (%)
16
72.7%
6
 
27.3%
CJK
ValueCountFrequency (%)
5
50.0%
5
50.0%
Distinct9990
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:08:15.487531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length72
Mean length67.3478
Min length48

Characters and Unicode

Total characters673478
Distinct characters67
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9988 ?
Unique (%)99.9%

Sample

1st rowhttp://ebook.gangnam.go.kr/images/bookimg/submain/B555/B5550302209300.gif
2nd rowhttp://ebook.gangnam.go.kr/images/bookimg/YES24/344758866.jpg
3rd rowhttp://ebook.gangnam.go.kr/images/bookimg/submain/B555/B5550306049600.gif
4th rowhttp://ebook.gangnam.go.kr/images/bookimg/YES24/M_1002830.jpg
5th rowhttp://ebook.gangnam.go.kr/images/bookimg/YES24/M_1153182.jpg
ValueCountFrequency (%)
http://ebook.gangnam.go.kr/imagesookimg/submain/0204/02040190.gif 9
 
0.1%
https://ebook.gangnam.go.kr/imagesookimg/interpark 3
 
< 0.1%
http://ebook.gangnam.go.kr/images/bookimg/yes24/m_245035.jpg 1
 
< 0.1%
http://ebook.gangnam.go.kr/images/bookimg/yes24/m_673197.jpg 1
 
< 0.1%
http://ebook.gangnam.go.kr/images/bookimg/submain/kb00/kb004332.gif 1
 
< 0.1%
http://ebook.gangnam.go.kr/imagesookimg/submain/0302/03020859.gif 1
 
< 0.1%
http://ebook.gangnam.go.kr/images/bookimg/submain/b978/b9788962860207.gif 1
 
< 0.1%
http://ebook.gangnam.go.kr/images/bookimg/submain/b555/b5550105030500.gif 1
 
< 0.1%
http://ebook.gangnam.go.kr/nuri/cover/ebookkorea/f0803420.jpg 1
 
< 0.1%
http://ebook.gangnam.go.kr/imagesookimg/submain/a000/a0002692.gif 1
 
< 0.1%
Other values (9980) 9980
99.8%
2023-12-12T23:08:15.842773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 65844
 
9.8%
g 59124
 
8.8%
o 49997
 
7.4%
. 39997
 
5.9%
a 37809
 
5.6%
0 37362
 
5.5%
m 37143
 
5.5%
i 35664
 
5.3%
k 29924
 
4.4%
n 28509
 
4.2%
Other values (57) 252105
37.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 414313
61.5%
Decimal Number 118680
 
17.6%
Other Punctuation 115841
 
17.2%
Uppercase Letter 23545
 
3.5%
Connector Punctuation 1059
 
0.2%
Open Punctuation 20
 
< 0.1%
Close Punctuation 20
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 11715
49.8%
K 4603
 
19.5%
S 1541
 
6.5%
Y 1463
 
6.2%
E 1460
 
6.2%
M 659
 
2.8%
N 549
 
2.3%
A 527
 
2.2%
L 307
 
1.3%
X 220
 
0.9%
Other values (16) 501
 
2.1%
Lowercase Letter
ValueCountFrequency (%)
g 59124
14.3%
o 49997
12.1%
a 37809
9.1%
m 37143
9.0%
i 35664
8.6%
k 29924
7.2%
n 28509
6.9%
b 25606
 
6.2%
e 20300
 
4.9%
t 20086
 
4.8%
Other values (15) 70151
16.9%
Decimal Number
ValueCountFrequency (%)
0 37362
31.5%
5 19971
16.8%
1 11086
 
9.3%
2 9165
 
7.7%
3 8114
 
6.8%
9 7826
 
6.6%
4 7544
 
6.4%
8 6822
 
5.7%
7 6347
 
5.3%
6 4443
 
3.7%
Other Punctuation
ValueCountFrequency (%)
/ 65844
56.8%
. 39997
34.5%
: 10000
 
8.6%
Connector Punctuation
ValueCountFrequency (%)
_ 1059
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 437858
65.0%
Common 235620
35.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
g 59124
13.5%
o 49997
11.4%
a 37809
8.6%
m 37143
 
8.5%
i 35664
 
8.1%
k 29924
 
6.8%
n 28509
 
6.5%
b 25606
 
5.8%
e 20300
 
4.6%
t 20086
 
4.6%
Other values (41) 93696
21.4%
Common
ValueCountFrequency (%)
/ 65844
27.9%
. 39997
17.0%
0 37362
15.9%
5 19971
 
8.5%
1 11086
 
4.7%
: 10000
 
4.2%
2 9165
 
3.9%
3 8114
 
3.4%
9 7826
 
3.3%
4 7544
 
3.2%
Other values (6) 18711
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 673478
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 65844
 
9.8%
g 59124
 
8.8%
o 49997
 
7.4%
. 39997
 
5.9%
a 37809
 
5.6%
0 37362
 
5.5%
m 37143
 
5.5%
i 35664
 
5.3%
k 29924
 
4.4%
n 28509
 
4.2%
Other values (57) 252105
37.4%

Correlations

2023-12-12T23:08:15.927203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
형식(전자책)카테고리
형식(전자책)1.0000.661
카테고리0.6611.000
2023-12-12T23:08:16.029329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리형식(전자책)
카테고리1.0000.351
형식(전자책)0.3511.000
2023-12-12T23:08:16.125662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
형식(전자책)카테고리
형식(전자책)1.0000.351
카테고리0.3511.000

Missing values

2023-12-12T23:08:10.373747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:08:10.548038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:08:10.680008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도서관명도서관홈페이지 주소도서명저자명국제표준도서번호형식(전자책)카테고리출판사표지주소
6442강남구전자도서관https://ebook.gangnam.go.kr송건호 전집 20 - 역사에서 배운다송건호9788940000000EPUB총류한길사http://ebook.gangnam.go.kr/images/bookimg/submain/B555/B5550302209300.gif
24798강남구전자도서관https://ebook.gangnam.go.kr벌꿀 공장위르겐 타우츠 외/유영미 역9788930000000EPUB(Y)순수과학열린책들http://ebook.gangnam.go.kr/images/bookimg/YES24/344758866.jpg
6794강남구전자도서관https://ebook.gangnam.go.kr화룡의 군주 2 - 갈림길김형준9788960000000EPUB문학자음과모음http://ebook.gangnam.go.kr/images/bookimg/submain/B555/B5550306049600.gif
20769강남구전자도서관https://ebook.gangnam.go.kr나의 살인자에게아스트리드 홀레이더르/김지원 역9791130000000EPUB(Y)문학다산책방http://ebook.gangnam.go.kr/images/bookimg/YES24/M_1002830.jpg
22157강남구전자도서관https://ebook.gangnam.go.kr니체와 장자는 이렇게 말했다양승권9791190000000EPUB(Y)철학페이퍼로드http://ebook.gangnam.go.kr/images/bookimg/YES24/M_1153182.jpg
14576강남구전자도서관https://ebook.gangnam.go.kr[New 다이나믹 일본어] 일본어 중급 다지기 New 다이나믹 일본어 Step5오현정 외9788930000000PDF언어다락원http://ebook.gangnam.go.kr/nuri/cover/N1112/N1112351.jpg
27207강남구전자도서관https://ebook.gangnam.go.kr일 잘하는 사람은 단순하게 말합니다박소연1165211386ezPDF문학더퀘스트http://ebook.gangnam.go.kr/images/bookimg/submain/KB00/KB005849.gif
25068강남구전자도서관https://ebook.gangnam.go.kr불교성전대한불교조계종 불교성전편찬추진위원회1155801571ezPDF종교조계종출판사http://ebook.gangnam.go.kr/images/bookimg/submain/KB00/KB004700.gif
17099강남구전자도서관https://ebook.gangnam.go.kr제3의 살 - 젊고 건강한 몸매로 만드는 안티셀룰라이트 다이어트김세현9788930000000XML기술과학RHKhttp://ebook.gangnam.go.kr/imagesookimg/submain/X008/X0083456.gif
14992강남구전자도서관https://ebook.gangnam.go.kr아주 오래된 농담박완서9788940000000PDF문학실천문학사http://ebook.gangnam.go.kr/nuri/cover/N0608/N0608040.jpg
도서관명도서관홈페이지 주소도서명저자명국제표준도서번호형식(전자책)카테고리출판사표지주소
18016강남구전자도서관https://ebook.gangnam.go.kr화이트 나이트저자 : 오사 라르손역자 : 이수영9788950000000EPUB문학artehttp://ebook.gangnam.go.kr/images/bookimg/submain/B978/B9788950962500.gif
18852강남구전자도서관https://ebook.gangnam.go.kr하루 5분으로 만나는 일분문학 대표작가 단편선: 귤아쿠타가와 류노스케 외9791160000000EPUB문학아이웰콘텐츠http://ebook.gangnam.go.kr/images/bookimg/submain/B979/B9791155573822.gif
893강남구전자도서관https://ebook.gangnam.go.kr퍼즐 게임북 - 개미와 배짱이 외아기별 편집부<NA>Multimedia어린이아기별http://ebook.gangnam.go.kr/imagesookimg/submain/0106/01060170.gif
23239강남구전자도서관https://ebook.gangnam.go.kr초등 혼자 매일 공부김은영1196848173ezPDF사회과학블루무스http://ebook.gangnam.go.kr/images/bookimg/submain/KB00/KB003665.gif
12609강남구전자도서관https://ebook.gangnam.go.kr문학아카데미시선 223 - 별까지 걸어가다김나무9788940000000EPUB문학문학아카데미http://ebook.gangnam.go.kr/images/bookimg/submain/B978/B9788940072233.gif
20146강남구전자도서관https://ebook.gangnam.go.kr자동 부자 습관데이비드 바크9791200000000EPUB사회과학마인드빌딩http://ebook.gangnam.go.kr/images/bookimg/submain/B979/B9791196339029.gif
1037강남구전자도서관https://ebook.gangnam.go.kr이솝 이야기 - 여우와 구렁이흰돌 편집부<NA>Multimedia어린이흰돌http://ebook.gangnam.go.kr/imagesookimg/submain/0108/01080152.gif
2960강남구전자도서관https://ebook.gangnam.go.kr요술정원 1 - 콕콕 찍어 들려주는 명작 리스닝서승진5550210000000EPUB어린이(주)다락원http://ebook.gangnam.go.kr/images/bookimg/submain/B555/B5550208035900.gif
17538강남구전자도서관https://ebook.gangnam.go.kr채널예스가 만난 사람들 vol.6 최효종의 추파채널예스 편집부9788970000000EPUB(Y)문학그래출판http://ebook.gangnam.go.kr/images/bookimg/YES24/M_219948.jpg
21480강남구전자도서관https://ebook.gangnam.go.krAFTER 애프터 5안나 토드9791190000000EPUB(Y)문학콤마http://ebook.gangnam.go.kr/images/bookimg/YES24/M_1030781.jpg