Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
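
The average record size follows from the total size and the row count. A minimal sketch of that arithmetic, assuming KiB means 1024 bytes (the function name is illustrative, not part of the profiling tool):

```python
# Figures copied from the dataset statistics above.
TOTAL_SIZE_KIB = 556.6     # total size in memory, in KiB
N_OBSERVATIONS = 10_000    # number of rows

BYTES_PER_KIB = 1024       # assumption: binary kibibyte


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the average in-memory size of one row, in bytes."""
    return total_kib * BYTES_PER_KIB / n_rows


avg_bytes = average_record_size(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")  # prints "57.0 B"
```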

Variable types

Numeric: 1
Text: 5

Dataset

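Description: Provides the book list of the Yeonje Library collection (Yeonje-gu, Busan Metropolitan City): registration number, title, author, publisher, and call number.
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: https://www.data.go.kr/data/15048049/fileData.do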

Alerts

번호 (No.) has unique values (Unique)

Reproduction

Analysis started: 2024-04-21 01:13:30.937367
Analysis finished: 2024-04-21 01:13:33.235634
Duration: 2.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (No.)
Real number (ℝ)

UNIQUE

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45755.261
Minimum: 2
Maximum: 91733
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 4498.85
Q1: 22813.5
Median: 45987.5
Q3: 68379.75
95-th percentile: 87093.75
Maximum: 91733
Range: 91731
Interquartile range (IQR): 45566.25

Descriptive statistics

Standard deviation: 26425.105
Coefficient of variation (CV): 0.5775315
Kurtosis: -1.1907126
Mean: 45755.261
Median Absolute Deviation (MAD): 22760
Skewness: -0.010209941
Sum: 4.5755261 × 10^8
Variance: 6.9828617 × 10^8
Monotonicity: Not monotonic
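
Several of the figures above are derived from others, so they can be cross-checked. The sketch below recomputes range, IQR, CV, variance, and sum from the reported minimum, maximum, quartiles, mean, and standard deviation. The inputs are the rounded values printed in this report, so agreement is only expected to within rounding.

```python
import math

# Values copied from the quantile and descriptive statistics of 번호.
minimum, maximum = 2, 91733
q1, q3 = 22813.5, 68379.75
mean, std = 45755.261, 26425.105
n = 10_000

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
total = mean * n                  # Sum = Mean * number of observations

print(value_range)                # 91731
print(iqr)                        # 45566.25
print(f"{cv:.7f}")
print(f"{variance:.7e}")
print(f"{total:.7e}")

# Compare against the figures printed in the report (rounded inputs).
assert math.isclose(cv, 0.5775315, rel_tol=1e-6)
assert math.isclose(variance, 6.9828617e8, rel_tol=1e-6)
assert math.isclose(total, 4.5755261e8, rel_tol=1e-6)
```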
Histogram with fixed size bins (bins=50)
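
The report states only that the histogram uses 50 fixed-size bins. As a rough illustration, the sketch below assumes equal-width bins spanning the minimum to the maximum, with the maximum counted in the last bin. The function names are hypothetical and this is not the profiling tool's own code.

```python
def bin_width(lo: float, hi: float, bins: int) -> float:
    """Width of each bin when [lo, hi] is split into equal parts."""
    return (hi - lo) / bins


def fixed_width_histogram(values, bins):
    """Count values per equal-width bin; the maximum falls in the last bin."""
    lo, hi = min(values), max(values)
    width = bin_width(lo, hi, bins)
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything goes in the first bin
        else:
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


# With the reported minimum (2) and maximum (91733), 50 bins are
# about 1834.62 wide under this assumption.
print(round(bin_width(2, 91733, 50), 2))

# Small demonstration using the ten smallest 번호 values listed in this report.
edges, counts = fixed_width_histogram([2, 9, 10, 13, 16, 27, 31, 60, 61, 70], bins=5)
print(counts)  # [4, 2, 1, 0, 3]
```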
Value | Count | Frequency (%)
55873 | 1 | < 0.1%
83275 | 1 | < 0.1%
1980 | 1 | < 0.1%
14024 | 1 | < 0.1%
63587 | 1 | < 0.1%
73889 | 1 | < 0.1%
40045 | 1 | < 0.1%
81378 | 1 | < 0.1%
59768 | 1 | < 0.1%
31159 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%
Value | Count | Frequency (%)
2 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%
13 | 1 | < 0.1%
16 | 1 | < 0.1%
27 | 1 | < 0.1%
31 | 1 | < 0.1%
60 | 1 | < 0.1%
61 | 1 | < 0.1%
70 | 1 | < 0.1%
Value | Count | Frequency (%)
91733 | 1 | < 0.1%
91729 | 1 | < 0.1%
91709 | 1 | < 0.1%
91705 | 1 | < 0.1%
91697 | 1 | < 0.1%
91693 | 1 | < 0.1%
91690 | 1 | < 0.1%
91683 | 1 | < 0.1%
91668 | 1 | < 0.1%
91649 | 1 | < 0.1%

등록번호 (registration number)
Text

Distinct: 9960
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
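
As a small illustration of the character properties mentioned above, the sketch below uses Python's standard `unicodedata` module to count general categories in one of the sample values. The standard library exposes the general category but not the script or block, so only categories are shown. The helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lu', 'Nd') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample value taken from this report.
sample = "ABN000083498"
sample_counts = category_counts(sample)
print(sample_counts["Lu"])  # 3 uppercase letters (A, B, N)
print(sample_counts["Nd"])  # 9 decimal digits

# Hangul syllables fall under "Lo" (Other Letter).
print(unicodedata.category("박"))  # Lo
```

For this sample the 3:9 split of uppercase letters to digits matches the 25.0% / 75.0% category shares reported for the 12-character values.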

Unique

Unique: 9920
Unique (%): 99.2%
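
"Distinct" and "Unique" are different counts. The sketch below assumes "Distinct" is the number of different values and "Unique" is the number of values that occur exactly once, and shows how the two relate for this variable. The helper name and the short example list are illustrative only.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique


# Tiny illustrative list: one value repeated, two values seen once.
example = ["ABN000039044", "ABN000039044", "ABN000083498", "ABN000139176"]
result = distinct_and_unique(example)
print(result)  # (3, 2)

# Applying the same definitions to the reported figures:
n_rows, n_distinct, n_unique = 10_000, 9960, 9920
repeated_values = n_distinct - n_unique   # values that occur more than once
rows_in_repeats = n_rows - n_unique       # rows taken up by those values
print(repeated_values, rows_in_repeats)   # 40 80
```

Under these definitions, 40 values account for 80 rows, an average of two rows each, which is consistent with the count of 2 shown for the most frequent values.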

Sample

1st row: ABN000083498
2nd row: ABN000139176
3rd row: ABN000101702
4th row: ABN000061532
5th row: ABN000121806
Value | Count | Frequency (%)
abn000039044 | 2 | < 0.1%
abn000039185 | 2 | < 0.1%
abn000040789 | 2 | < 0.1%
abn000041310 | 2 | < 0.1%
abn000041181 | 2 | < 0.1%
abn000038743 | 2 | < 0.1%
abn000041176 | 2 | < 0.1%
abn000042493 | 2 | < 0.1%
abn000038526 | 2 | < 0.1%
abn000041858 | 2 | < 0.1%
Other values (9950) | 9980 | 99.8%

Most occurring characters

ValueCountFrequency (%)
0 39686
33.1%
A 10000
 
8.3%
B 10000
 
8.3%
N 10000
 
8.3%
1 9815
 
8.2%
4 5651
 
4.7%
5 5638
 
4.7%
6 5600
 
4.7%
3 5079
 
4.2%
2 4735
 
3.9%
Other values (3) 13796
 
11.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 90000
75.0%
Uppercase Letter 30000
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 39686
44.1%
1 9815
 
10.9%
4 5651
 
6.3%
5 5638
 
6.3%
6 5600
 
6.2%
3 5079
 
5.6%
2 4735
 
5.3%
8 4680
 
5.2%
7 4635
 
5.1%
9 4481
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 10000
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 90000
75.0%
Latin 30000
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 39686
44.1%
1 9815
 
10.9%
4 5651
 
6.3%
5 5638
 
6.3%
6 5600
 
6.2%
3 5079
 
5.6%
2 4735
 
5.3%
8 4680
 
5.2%
7 4635
 
5.1%
9 4481
 
5.0%
Latin
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 10000
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 39686
33.1%
A 10000
 
8.3%
B 10000
 
8.3%
N 10000
 
8.3%
1 9815
 
8.2%
4 5651
 
4.7%
5 5638
 
4.7%
6 5600
 
4.7%
3 5079
 
4.2%
2 4735
 
3.9%
Other values (3) 13796
 
11.5%

서명 (title)
Text

Distinct: 9910
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 118
Median length: 84
Mean length: 23.1229
Min length: 1

Characters and Unicode

Total characters: 231229
Distinct characters: 1604
Distinct categories: 17
Distinct scripts: 6
Distinct blocks: 13

Unique

Unique: 9821
Unique (%): 98.2%

Sample

1st row: (외대부고 공신들의)진짜 1등 공부법 : 진학 야전사령과 박인호 선생님의 SKY 입시공략의 비밀
2nd row: (내 손으로 만드는)내 삶을 위한 정치 : 청소년을 위한 대한민국 정치 사용 설명서
3rd row: 인사이드 아웃 : 사람이 만드는 기업의 미래
4th row: (더책)문제가 생겼어요
5th row: 미국 주식 스타터팩 : 미국 주식 초심자를 위한 토탈 솔루션
ValueCountFrequency (%)
3988
 
6.9%
이야기 373
 
0.6%
1 308
 
0.5%
위한 283
 
0.5%
2 279
 
0.5%
the 214
 
0.4%
186
 
0.3%
3 153
 
0.3%
152
 
0.3%
나는 150
 
0.3%
Other values (23950) 52127
89.5%

Most occurring characters

ValueCountFrequency (%)
48242
 
20.9%
4116
 
1.8%
3859
 
1.7%
: 3842
 
1.7%
3359
 
1.5%
e 2462
 
1.1%
2200
 
1.0%
, 2082
 
0.9%
2070
 
0.9%
1996
 
0.9%
Other values (1594) 157001
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140249
60.7%
Space Separator 48242
 
20.9%
Lowercase Letter 20173
 
8.7%
Other Punctuation 9437
 
4.1%
Decimal Number 4382
 
1.9%
Uppercase Letter 3233
 
1.4%
Close Punctuation 2378
 
1.0%
Open Punctuation 2377
 
1.0%
Math Symbol 596
 
0.3%
Dash Punctuation 120
 
0.1%
Other values (7) 42
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4116
 
2.9%
3859
 
2.8%
3359
 
2.4%
2200
 
1.6%
2070
 
1.5%
1996
 
1.4%
1931
 
1.4%
1892
 
1.3%
1876
 
1.3%
1765
 
1.3%
Other values (1455) 115185
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 2462
12.2%
o 1799
 
8.9%
a 1782
 
8.8%
n 1483
 
7.4%
i 1447
 
7.2%
t 1440
 
7.1%
r 1436
 
7.1%
s 1312
 
6.5%
l 913
 
4.5%
h 897
 
4.4%
Other values (31) 5202
25.8%
Uppercase Letter
ValueCountFrequency (%)
T 362
 
11.2%
S 297
 
9.2%
B 243
 
7.5%
A 209
 
6.5%
M 192
 
5.9%
C 176
 
5.4%
D 163
 
5.0%
W 161
 
5.0%
P 147
 
4.5%
I 135
 
4.2%
Other values (18) 1148
35.5%
Other Punctuation
ValueCountFrequency (%)
: 3842
40.7%
, 2082
22.1%
. 1497
 
15.9%
! 743
 
7.9%
? 615
 
6.5%
' 263
 
2.8%
· 223
 
2.4%
& 39
 
0.4%
% 22
 
0.2%
21
 
0.2%
Other values (14) 90
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 1118
25.5%
0 807
18.4%
2 734
16.8%
3 450
10.3%
5 302
 
6.9%
4 264
 
6.0%
9 198
 
4.5%
6 195
 
4.5%
7 158
 
3.6%
8 156
 
3.6%
Math Symbol
ValueCountFrequency (%)
= 485
81.4%
~ 53
 
8.9%
+ 19
 
3.2%
> 11
 
1.8%
< 11
 
1.8%
| 8
 
1.3%
× 8
 
1.3%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1883
79.2%
] 478
 
20.1%
11
 
0.5%
2
 
0.1%
2
 
0.1%
2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1883
79.2%
[ 478
 
20.1%
11
 
0.5%
2
 
0.1%
2
 
0.1%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
9
52.9%
2
 
11.8%
2
 
11.8%
2
 
11.8%
® 1
 
5.9%
1
 
5.9%
Letter Number
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
2
 
20.0%
Space Separator
ValueCountFrequency (%)
48242
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 120
100.0%
Final Punctuation
ValueCountFrequency (%)
7
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140015
60.6%
Common 67564
29.2%
Latin 23384
 
10.1%
Han 234
 
0.1%
Cyrillic 31
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4116
 
2.9%
3859
 
2.8%
3359
 
2.4%
2200
 
1.6%
2070
 
1.5%
1996
 
1.4%
1931
 
1.4%
1892
 
1.4%
1876
 
1.3%
1765
 
1.3%
Other values (1289) 114951
82.1%
Han
ValueCountFrequency (%)
7
 
3.0%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
3
 
1.3%
3
 
1.3%
3
 
1.3%
Other values (156) 192
82.1%
Common
ValueCountFrequency (%)
48242
71.4%
: 3842
 
5.7%
, 2082
 
3.1%
) 1883
 
2.8%
( 1883
 
2.8%
. 1497
 
2.2%
1 1118
 
1.7%
0 807
 
1.2%
! 743
 
1.1%
2 734
 
1.1%
Other values (57) 4733
 
7.0%
Latin
ValueCountFrequency (%)
e 2462
 
10.5%
o 1799
 
7.7%
a 1782
 
7.6%
n 1483
 
6.3%
i 1447
 
6.2%
t 1440
 
6.2%
r 1436
 
6.1%
s 1312
 
5.6%
l 913
 
3.9%
h 897
 
3.8%
Other values (45) 8413
36.0%
Cyrillic
ValueCountFrequency (%)
е 3
9.7%
к 3
9.7%
у 3
9.7%
о 3
9.7%
р 3
9.7%
а 2
 
6.5%
т 2
 
6.5%
с 2
 
6.5%
и 2
 
6.5%
л 2
 
6.5%
Other values (6) 6
19.4%
Greek
ValueCountFrequency (%)
π 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140002
60.5%
ASCII 90590
39.2%
None 313
 
0.1%
CJK 226
 
0.1%
Cyrillic 31
 
< 0.1%
Punctuation 19
 
< 0.1%
Compat Jamo 13
 
< 0.1%
Number Forms 10
 
< 0.1%
Misc Symbols 9
 
< 0.1%
CJK Compat Ideographs 8
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48242
53.3%
: 3842
 
4.2%
e 2462
 
2.7%
, 2082
 
2.3%
) 1883
 
2.1%
( 1883
 
2.1%
o 1799
 
2.0%
a 1782
 
2.0%
. 1497
 
1.7%
n 1483
 
1.6%
Other values (80) 23635
26.1%
Hangul
ValueCountFrequency (%)
4116
 
2.9%
3859
 
2.8%
3359
 
2.4%
2200
 
1.6%
2070
 
1.5%
1996
 
1.4%
1931
 
1.4%
1892
 
1.4%
1876
 
1.3%
1765
 
1.3%
Other values (1283) 114938
82.1%
None
ValueCountFrequency (%)
· 223
71.2%
21
 
6.7%
18
 
5.8%
11
 
3.5%
11
 
3.5%
× 8
 
2.6%
3
 
1.0%
2
 
0.6%
2
 
0.6%
2
 
0.6%
Other values (10) 12
 
3.8%
Misc Symbols
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
7
 
3.1%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
3
 
1.3%
Other values (151) 184
81.4%
Punctuation
ValueCountFrequency (%)
7
36.8%
7
36.8%
4
21.1%
1
 
5.3%
Number Forms
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
2
 
20.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
37.5%
樂 2
25.0%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
3
23.1%
3
23.1%
3
23.1%
2
15.4%
1
 
7.7%
1
 
7.7%
Cyrillic
ValueCountFrequency (%)
е 3
9.7%
к 3
9.7%
у 3
9.7%
о 3
9.7%
р 3
9.7%
а 2
 
6.5%
т 2
 
6.5%
с 2
 
6.5%
и 2
 
6.5%
л 2
 
6.5%
Other values (6) 6
19.4%
Geometric Shapes
ValueCountFrequency (%)
2
66.7%
1
33.3%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
66.7%
1
33.3%

저자 (author)
Text

Distinct: 8973
Distinct (%): 89.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 126
Median length: 109
Mean length: 17.0593
Min length: 3

Characters and Unicode

Total characters: 170593
Distinct characters: 1058
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 7

Unique

Unique: 8290
Unique (%): 82.9%

Sample

1st row: 박인호 지음
2nd row: 박선민 지음
3rd row: 강성춘 지음
4th row: 이보나 흐미엘레프스카 글·그림 ; 이지원 옮김
5th row: 정두현 지음
ValueCountFrequency (%)
7362
 
15.0%
지음 5361
 
10.9%
옮김 3056
 
6.2%
그림 2746
 
5.6%
2272
 
4.6%
by 1004
 
2.0%
글·그림 616
 
1.3%
illustrated 336
 
0.7%
공]지음 312
 
0.6%
188
 
0.4%
Other values (13667) 25783
52.6%

Most occurring characters

ValueCountFrequency (%)
39112
22.9%
; 7343
 
4.3%
6623
 
3.9%
5887
 
3.5%
5490
 
3.2%
3652
 
2.1%
3563
 
2.1%
3156
 
1.9%
3141
 
1.8%
3022
 
1.8%
Other values (1048) 89604
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 96230
56.4%
Space Separator 39112
22.9%
Lowercase Letter 20214
 
11.8%
Other Punctuation 9833
 
5.8%
Uppercase Letter 3116
 
1.8%
Open Punctuation 995
 
0.6%
Close Punctuation 994
 
0.6%
Decimal Number 41
 
< 0.1%
Dash Punctuation 39
 
< 0.1%
Math Symbol 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6623
 
6.9%
5887
 
6.1%
5490
 
5.7%
3652
 
3.8%
3563
 
3.7%
3156
 
3.3%
3141
 
3.3%
3022
 
3.1%
1610
 
1.7%
1572
 
1.6%
Other values (957) 58514
60.8%
Lowercase Letter
ValueCountFrequency (%)
a 2012
10.0%
e 1999
9.9%
t 1699
 
8.4%
l 1588
 
7.9%
r 1576
 
7.8%
i 1562
 
7.7%
y 1393
 
6.9%
n 1347
 
6.7%
b 1151
 
5.7%
s 1079
 
5.3%
Other values (16) 4808
23.8%
Uppercase Letter
ValueCountFrequency (%)
S 290
 
9.3%
M 274
 
8.8%
J 269
 
8.6%
B 239
 
7.7%
C 236
 
7.6%
D 177
 
5.7%
A 173
 
5.6%
R 160
 
5.1%
T 137
 
4.4%
P 133
 
4.3%
Other values (16) 1028
33.0%
Other Punctuation
ValueCountFrequency (%)
; 7343
74.7%
, 1038
 
10.6%
· 716
 
7.3%
. 533
 
5.4%
? 104
 
1.1%
: 72
 
0.7%
' 13
 
0.1%
& 8
 
0.1%
/ 2
 
< 0.1%
% 2
 
< 0.1%
Other values (2) 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 12
29.3%
3 7
17.1%
5 6
14.6%
1 4
 
9.8%
4 4
 
9.8%
6 3
 
7.3%
9 2
 
4.9%
8 2
 
4.9%
7 1
 
2.4%
Open Punctuation
ValueCountFrequency (%)
[ 981
98.6%
( 8
 
0.8%
4
 
0.4%
1
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 980
98.6%
) 8
 
0.8%
4
 
0.4%
1
 
0.1%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
< 7
41.2%
> 7
41.2%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Space Separator
ValueCountFrequency (%)
39112
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96177
56.4%
Common 51033
29.9%
Latin 23330
 
13.7%
Han 53
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6623
 
6.9%
5887
 
6.1%
5490
 
5.7%
3652
 
3.8%
3563
 
3.7%
3156
 
3.3%
3141
 
3.3%
3022
 
3.1%
1610
 
1.7%
1572
 
1.6%
Other values (919) 58461
60.8%
Latin
ValueCountFrequency (%)
a 2012
 
8.6%
e 1999
 
8.6%
t 1699
 
7.3%
l 1588
 
6.8%
r 1576
 
6.8%
i 1562
 
6.7%
y 1393
 
6.0%
n 1347
 
5.8%
b 1151
 
4.9%
s 1079
 
4.6%
Other values (42) 7924
34.0%
Common
ValueCountFrequency (%)
39112
76.6%
; 7343
 
14.4%
, 1038
 
2.0%
[ 981
 
1.9%
] 980
 
1.9%
· 716
 
1.4%
. 533
 
1.0%
? 104
 
0.2%
: 72
 
0.1%
- 39
 
0.1%
Other values (29) 115
 
0.2%
Han
ValueCountFrequency (%)
6
 
11.3%
4
 
7.5%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Other values (28) 28
52.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96173
56.4%
ASCII 73628
43.2%
None 733
 
0.4%
CJK 52
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39112
53.1%
; 7343
 
10.0%
a 2012
 
2.7%
e 1999
 
2.7%
t 1699
 
2.3%
l 1588
 
2.2%
r 1576
 
2.1%
i 1562
 
2.1%
y 1393
 
1.9%
n 1347
 
1.8%
Other values (68) 13997
 
19.0%
Hangul
ValueCountFrequency (%)
6623
 
6.9%
5887
 
6.1%
5490
 
5.7%
3652
 
3.8%
3563
 
3.7%
3156
 
3.3%
3141
 
3.3%
3022
 
3.1%
1610
 
1.7%
1572
 
1.6%
Other values (918) 58457
60.8%
None
ValueCountFrequency (%)
· 716
97.7%
4
 
0.5%
4
 
0.5%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other values (2) 2
 
0.3%
CJK
ValueCountFrequency (%)
6
 
11.5%
4
 
7.7%
3
 
5.8%
3
 
5.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Other values (27) 27
51.9%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

발행자 (publisher)
Text

Distinct: 2695
Distinct (%): 27.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 45
Median length: 32
Mean length: 4.8321
Min length: 1

Characters and Unicode

Total characters: 48321
Distinct characters: 777
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5

Unique

Unique: 1442
Unique (%): 14.4%

Sample

1st row: 글로세움
2nd row: 곰곰
3rd row: 21세기북스
4th row: 논장
5th row: BJpublic(비제이퍼블릭)
Value | Count | Frequency (%)
문학동네 | 157 | 1.4%
창비 | 154 | 1.4%
books | 133 | 1.2%
위즈덤하우스 | 107 | 1.0%
민음사 | 99 | 0.9%
서울문화사 | 99 | 0.9%
비룡소 | 97 | 0.9%
house | 79 | 0.7%
웅진주니어 | 75 | 0.7%
시공주니어 | 69 | 0.6%
Other values (2727) | 9788 | 90.2%

Most occurring characters

ValueCountFrequency (%)
1515
 
3.1%
1281
 
2.7%
1236
 
2.6%
1183
 
2.4%
o 1074
 
2.2%
929
 
1.9%
857
 
1.8%
e 749
 
1.6%
s 698
 
1.4%
687
 
1.4%
Other values (767) 38112
78.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37230
77.0%
Lowercase Letter 7841
 
16.2%
Uppercase Letter 1835
 
3.8%
Space Separator 857
 
1.8%
Other Punctuation 239
 
0.5%
Decimal Number 153
 
0.3%
Open Punctuation 70
 
0.1%
Close Punctuation 70
 
0.1%
Dash Punctuation 25
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1515
 
4.1%
1281
 
3.4%
1236
 
3.3%
1183
 
3.2%
929
 
2.5%
687
 
1.8%
652
 
1.8%
639
 
1.7%
555
 
1.5%
498
 
1.3%
Other values (691) 28055
75.4%
Lowercase Letter
ValueCountFrequency (%)
o 1074
13.7%
e 749
 
9.6%
s 698
 
8.9%
r 633
 
8.1%
i 561
 
7.2%
n 557
 
7.1%
a 548
 
7.0%
l 404
 
5.2%
d 338
 
4.3%
t 330
 
4.2%
Other values (16) 1949
24.9%
Uppercase Letter
ValueCountFrequency (%)
B 272
14.8%
H 218
11.9%
R 183
10.0%
P 156
 
8.5%
S 142
 
7.7%
C 130
 
7.1%
M 105
 
5.7%
K 93
 
5.1%
L 67
 
3.7%
A 66
 
3.6%
Other values (14) 403
22.0%
Other Punctuation
ValueCountFrequency (%)
? 89
37.2%
& 49
20.5%
' 30
 
12.6%
. 20
 
8.4%
· 18
 
7.5%
, 14
 
5.9%
# 8
 
3.3%
7
 
2.9%
; 3
 
1.3%
! 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 66
43.1%
1 59
38.6%
4 7
 
4.6%
3 5
 
3.3%
0 4
 
2.6%
6 4
 
2.6%
8 3
 
2.0%
9 3
 
2.0%
5 2
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 69
98.6%
[ 1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 69
98.6%
] 1
 
1.4%
Space Separator
ValueCountFrequency (%)
857
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37192
77.0%
Latin 9676
 
20.0%
Common 1415
 
2.9%
Han 38
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1515
 
4.1%
1281
 
3.4%
1236
 
3.3%
1183
 
3.2%
929
 
2.5%
687
 
1.8%
652
 
1.8%
639
 
1.7%
555
 
1.5%
498
 
1.3%
Other values (663) 28017
75.3%
Latin
ValueCountFrequency (%)
o 1074
 
11.1%
e 749
 
7.7%
s 698
 
7.2%
r 633
 
6.5%
i 561
 
5.8%
n 557
 
5.8%
a 548
 
5.7%
l 404
 
4.2%
d 338
 
3.5%
t 330
 
3.4%
Other values (40) 3784
39.1%
Han
ValueCountFrequency (%)
5
 
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Other values (18) 18
47.4%
Common
ValueCountFrequency (%)
857
60.6%
? 89
 
6.3%
( 69
 
4.9%
) 69
 
4.9%
2 66
 
4.7%
1 59
 
4.2%
& 49
 
3.5%
' 30
 
2.1%
- 25
 
1.8%
. 20
 
1.4%
Other values (16) 82
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37189
77.0%
ASCII 11066
 
22.9%
CJK 38
 
0.1%
None 25
 
0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1515
 
4.1%
1281
 
3.4%
1236
 
3.3%
1183
 
3.2%
929
 
2.5%
687
 
1.8%
652
 
1.8%
639
 
1.7%
555
 
1.5%
498
 
1.3%
Other values (660) 28014
75.3%
ASCII
ValueCountFrequency (%)
o 1074
 
9.7%
857
 
7.7%
e 749
 
6.8%
s 698
 
6.3%
r 633
 
5.7%
i 561
 
5.1%
n 557
 
5.0%
a 548
 
5.0%
l 404
 
3.7%
d 338
 
3.1%
Other values (64) 4647
42.0%
None
ValueCountFrequency (%)
· 18
72.0%
7
 
28.0%
CJK
ValueCountFrequency (%)
5
 
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Other values (18) 18
47.4%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

청구기호 (call number)
Text

Distinct: 9960
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 19
Median length: 16
Mean length: 10.5385
Min length: 5

Characters and Unicode

Total characters: 105385
Distinct characters: 43
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 9920
Unique (%): 99.2%

Sample

1st row: 373.4-148
2nd row: 340.13-14
3rd row: 325.1-316
4th row: 더책 892.95-1-1
5th row: 327.856-202
Value | Count | Frequency (%)
아동 | 2462 | 16.6%
그림책 | 1089 | 7.3%
영어 | 567 | 3.8%
더책 | 251 | 1.7%
시니어 | 190 | 1.3%
큰글자 | 101 | 0.7%
보드북 | 75 | 0.5%
mom | 67 | 0.5%
아동점자 | 46 | 0.3%
아세안 | 13 | 0.1%
Other values (9901) | 10013 | 67.3%

Most occurring characters

ValueCountFrequency (%)
- 13545
12.9%
1 12207
11.6%
3 9946
 
9.4%
8 9872
 
9.4%
2 7794
 
7.4%
4 6278
 
6.0%
. 5772
 
5.5%
0 5051
 
4.8%
5 5010
 
4.8%
4874
 
4.6%
Other values (33) 25036
23.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69126
65.6%
Dash Punctuation 13545
 
12.9%
Other Letter 11174
 
10.6%
Other Punctuation 5772
 
5.5%
Space Separator 4874
 
4.6%
Math Symbol 685
 
0.6%
Uppercase Letter 202
 
0.2%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2521
22.6%
2508
22.4%
1340
12.0%
1089
9.7%
1089
9.7%
757
 
6.8%
567
 
5.1%
251
 
2.2%
190
 
1.7%
190
 
1.7%
Other values (13) 672
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 12207
17.7%
3 9946
14.4%
8 9872
14.3%
2 7794
11.3%
4 6278
9.1%
0 5051
7.3%
5 5010
7.2%
9 4779
 
6.9%
7 4676
 
6.8%
6 3513
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
M 134
66.3%
O 67
33.2%
A 1
 
0.5%
Dash Punctuation
ValueCountFrequency (%)
- 13545
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5772
100.0%
Space Separator
ValueCountFrequency (%)
4874
100.0%
Math Symbol
ValueCountFrequency (%)
= 685
100.0%
Close Punctuation
ValueCountFrequency (%)
] 3
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
v 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 94008
89.2%
Hangul 11174
 
10.6%
Latin 203
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2521
22.6%
2508
22.4%
1340
12.0%
1089
9.7%
1089
9.7%
757
 
6.8%
567
 
5.1%
251
 
2.2%
190
 
1.7%
190
 
1.7%
Other values (13) 672
 
6.0%
Common
ValueCountFrequency (%)
- 13545
14.4%
1 12207
13.0%
3 9946
10.6%
8 9872
10.5%
2 7794
8.3%
4 6278
6.7%
. 5772
6.1%
0 5051
 
5.4%
5 5010
 
5.3%
4874
 
5.2%
Other values (6) 13659
14.5%
Latin
ValueCountFrequency (%)
M 134
66.0%
O 67
33.0%
A 1
 
0.5%
v 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94211
89.4%
Hangul 11174
 
10.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 13545
14.4%
1 12207
13.0%
3 9946
10.6%
8 9872
10.5%
2 7794
8.3%
4 6278
6.7%
. 5772
6.1%
0 5051
 
5.4%
5 5010
 
5.3%
4874
 
5.2%
Other values (10) 13862
14.7%
Hangul
ValueCountFrequency (%)
2521
22.6%
2508
22.4%
1340
12.0%
1089
9.7%
1089
9.7%
757
 
6.8%
567
 
5.1%
251
 
2.2%
190
 
1.7%
190
 
1.7%
Other values (13) 672
 
6.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
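
The two views described above can be approximated in a few lines. This is a minimal sketch, assuming a cell is "missing" when it is `None`; the column names and rows are hypothetical toy data, not values from this dataset (which reports 0 missing cells).

```python
def nullity_by_column(rows, columns):
    """Count missing (None) cells per column."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}


def nullity_matrix(rows, columns):
    """One text line per row: '#' marks a present cell, '.' a missing one."""
    return [
        "".join("." if row.get(col) is None else "#" for col in columns)
        for row in rows
    ]


# Hypothetical toy data for illustration only.
columns = ["a", "b", "c"]
rows = [
    {"a": 1, "b": "x", "c": 3.0},
    {"a": 2, "b": None, "c": 4.0},
]

missing_counts = nullity_by_column(rows, columns)
matrix_lines = nullity_matrix(rows, columns)
print(missing_counts)  # {'a': 0, 'b': 1, 'c': 0}
print(matrix_lines)    # ['###', '#.#']
```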

Sample

(index) | 번호 | 등록번호 | 서명 | 저자 | 발행자 | 청구기호
55872 | 55873 | ABN000083498 | (외대부고 공신들의)진짜 1등 공부법 : 진학 야전사령과 박인호 선생님의 SKY 입시공략의 비밀 | 박인호 지음 | 글로세움 | 373.4-148
23590 | 23591 | ABN000139176 | (내 손으로 만드는)내 삶을 위한 정치 : 청소년을 위한 대한민국 정치 사용 설명서 | 박선민 지음 | 곰곰 | 340.13-14
46170 | 46171 | ABN000101702 | 인사이드 아웃 : 사람이 만드는 기업의 미래 | 강성춘 지음 | 21세기북스 | 325.1-316
70974 | 70975 | ABN000061532 | (더책)문제가 생겼어요 | 이보나 흐미엘레프스카 글·그림 ; 이지원 옮김 | 논장 | 더책 892.95-1-1
34945 | 34946 | ABN000121806 | 미국 주식 스타터팩 : 미국 주식 초심자를 위한 토탈 솔루션 | 정두현 지음 | BJpublic(비제이퍼블릭) | 327.856-202
83061 | 83062 | ABN000043687 | 빅히트 : 구매 버튼을 누르게 하는 마케팅 신기술 | 필 바든 지음 ; 이현주 옮김 | 씨앗을뿌리는사람 | 325.5-53
53870 | 53871 | ABN000086935 | 배드 블러드 : 테라노스의 비밀과 거짓말 | 존 캐리루 지음 ; 박아린 옮김 | 와이즈베리 | 325.04-226
50516 | 50517 | ABN000094282 | 옥중서신. 1, 김대중이 이희호에게 : 편지로 새긴 사랑, 자유, 민주주의 | 김대중 지음 | 시대의창 | 816.7-286-1
63949 | 63950 | ABN000071357 | 밀리미터 학교 | 정휘창 지음 ; 황정혜 그림 | 소소담담 | 아동 813.8-2103
61982 | 61983 | ABN000074394 | 시를 읽는 오후 : 시인 최영미, 생의 길목에서 만난 마흔네 편의 시 | 최영미 지음 | 해냄 | 809.1-8

(index) | 번호 | 등록번호 | 서명 | 저자 | 발행자 | 청구기호
85059 | 85060 | ABN000042527 | Big Egg | by Molly Coxe | Random House | 영어 808-15-1-30
79960 | 79961 | ABN000048199 | 우주 레시피 : 지구인을 위한 달콤한 우주 특강 | 손영종 지음 | 오르트 | 443.1-33
5128 | 5129 | ABN000162515 | (천황과 무사의 나라) 일본 | 박혜정 글 ; 김옥재 그림 | 휴먼어린이 | 그림책 909-3-11
41132 | 41133 | ABN000108718 | [아가맘]Alpabet Phonics Book:130Words | 블루래빗 편 | 블루래빗 | 보드북 740-1-1
9245 | 9246 | ABN000157964 | 사이다 고민툰 : 답답하고 불안한 사춘기 속마음 처방전 | 안태일 글 ; 옥이샘 만화 | 지식프레임 | 아동 183.3-3
39509 | 39510 | ABN000112502 | 화성 연대기 | 레이 브래드버리 지음 ; 조호근 옮김 | 현대문학 | 843-1621
11217 | 11218 | ABN000155897 | (지금 잘 살고 있나 싶을 때)나를 리뷰하는 법 | 김혜원 지음 | 유영 | 818-1941
69972 | 69973 | ABN000062813 | 개미에게 배우는 부지런함 : 개미의 직업 | 최재천 글 ; 박상현 그림 | 리젬 | 그림책 495-20-3
56759 | 56760 | ABN000081447 | (비밀이야의)맛있는 프랑스 : 블로거 비밀이야의 프랑스 미식여행 가이드 | 배동렬 글·사진 | BR미디어 | 594.019-36
52027 | 52028 | ABN000090212 | 시바견 곤 이야기. 4 | 가게야마 나오미 글·그림 ; 김수현 옮김 | 한겨레출판 | 838-142-4