Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells13
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Numeric1
Text5
DateTime1

Dataset

Description송파어린이영어도서관 보유 영어도서 목록에 대한 등록번호, 청구기호, 도서명, 저자명, 발행자명, 기준일자 데이터를 제공합니다.
Author서울특별시 송파구
URLhttps://www.data.go.kr/data/15112402/fileData.do

Alerts

기준일자 has constant value ""Constant
연번 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:29:44.286753
Analysis finished2023-12-12 22:29:45.940828
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14764.645
Minimum5
Maximum29590
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:46.020124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile1470.8
Q17343.75
median14752
Q322115.5
95-th percentile28166.1
Maximum29590
Range29585
Interquartile range (IQR)14771.75

Descriptive statistics

Standard deviation8549.954
Coefficient of variation (CV)0.57908292
Kurtosis-1.2013744
Mean14764.645
Median Absolute Deviation (MAD)7392.5
Skewness0.0081907472
Sum1.4764645 × 108
Variance73101713
MonotonicityNot monotonic
2023-12-13T07:29:46.169219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13340 1
 
< 0.1%
4128 1
 
< 0.1%
24503 1
 
< 0.1%
10216 1
 
< 0.1%
12840 1
 
< 0.1%
24700 1
 
< 0.1%
19300 1
 
< 0.1%
7331 1
 
< 0.1%
1563 1
 
< 0.1%
20206 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
5 1
< 0.1%
12 1
< 0.1%
21 1
< 0.1%
26 1
< 0.1%
31 1
< 0.1%
33 1
< 0.1%
35 1
< 0.1%
37 1
< 0.1%
38 1
< 0.1%
39 1
< 0.1%
ValueCountFrequency (%)
29590 1
< 0.1%
29589 1
< 0.1%
29587 1
< 0.1%
29585 1
< 0.1%
29584 1
< 0.1%
29576 1
< 0.1%
29573 1
< 0.1%
29571 1
< 0.1%
29570 1
< 0.1%
29560 1
< 0.1%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:29:46.384615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowIM0000016815
2nd rowIM0000008302
3rd rowIM0000023762
4th rowIM0000031052
5th rowIM0000017023
ValueCountFrequency (%)
im0000016815 1
 
< 0.1%
im0000008316 1
 
< 0.1%
im0000007152 1
 
< 0.1%
im0000010448 1
 
< 0.1%
im0000026286 1
 
< 0.1%
im0000012642 1
 
< 0.1%
im0000018054 1
 
< 0.1%
im0000026171 1
 
< 0.1%
xs0000000731 1
 
< 0.1%
im0000003596 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T07:29:46.704256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 58274
48.6%
M 9369
 
7.8%
I 9066
 
7.6%
1 7325
 
6.1%
2 6928
 
5.8%
3 4528
 
3.8%
5 3918
 
3.3%
8 3880
 
3.2%
7 3874
 
3.2%
9 3801
 
3.2%
Other values (5) 9037
 
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 58274
58.3%
1 7325
 
7.3%
2 6928
 
6.9%
3 4528
 
4.5%
5 3918
 
3.9%
8 3880
 
3.9%
7 3874
 
3.9%
9 3801
 
3.8%
6 3743
 
3.7%
4 3729
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
M 9369
46.8%
I 9066
45.3%
X 631
 
3.2%
S 631
 
3.2%
H 303
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 58274
58.3%
1 7325
 
7.3%
2 6928
 
6.9%
3 4528
 
4.5%
5 3918
 
3.9%
8 3880
 
3.9%
7 3874
 
3.9%
9 3801
 
3.8%
6 3743
 
3.7%
4 3729
 
3.7%
Latin
ValueCountFrequency (%)
M 9369
46.8%
I 9066
45.3%
X 631
 
3.2%
S 631
 
3.2%
H 303
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 58274
48.6%
M 9369
 
7.8%
I 9066
 
7.6%
1 7325
 
6.1%
2 6928
 
5.8%
3 4528
 
3.8%
5 3918
 
3.3%
8 3880
 
3.2%
7 3874
 
3.2%
9 3801
 
3.2%
Other values (5) 9037
 
7.5%
Distinct9282
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:29:46.954318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length16.2113
Min length11

Characters and Unicode

Total characters162113
Distinct characters88
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8921 ?
Unique (%)89.2%

Sample

1st rowAU 843.5-W651f=2
2nd rowAU 843.5-B966wb
3rd rowBET 843.5-J12c=2
4th rowAU 843.6-C278v-el
5th rowGR 843-S368l-L1-ab
ValueCountFrequency (%)
gr 2005
 
10.0%
ch 1766
 
8.8%
bet 1691
 
8.4%
nf 867
 
4.3%
au 793
 
4.0%
pk 656
 
3.3%
lil 583
 
2.9%
grl 550
 
2.7%
wds 383
 
1.9%
fav 287
 
1.4%
Other values (9234) 10488
52.3%
2023-12-13T07:29:47.347215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 16643
 
10.3%
8 12915
 
8.0%
3 11749
 
7.2%
4 11699
 
7.2%
10069
 
6.2%
6 8970
 
5.5%
. 7149
 
4.4%
2 5664
 
3.5%
1 5093
 
3.1%
5 4947
 
3.1%
Other values (78) 67215
41.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 72014
44.4%
Uppercase Letter 37230
23.0%
Lowercase Letter 17028
 
10.5%
Dash Punctuation 16643
 
10.3%
Space Separator 10069
 
6.2%
Other Punctuation 7157
 
4.4%
Math Symbol 1923
 
1.2%
Other Letter 34
 
< 0.1%
Connector Punctuation 15
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 3975
 
10.7%
G 3210
 
8.6%
B 2832
 
7.6%
C 2727
 
7.3%
H 2560
 
6.9%
S 2439
 
6.6%
T 2317
 
6.2%
L 2292
 
6.2%
E 2071
 
5.6%
A 1777
 
4.8%
Other values (16) 11030
29.6%
Lowercase Letter
ValueCountFrequency (%)
s 1218
 
7.2%
a 1201
 
7.1%
m 1078
 
6.3%
t 982
 
5.8%
h 972
 
5.7%
i 957
 
5.6%
e 957
 
5.6%
p 915
 
5.4%
l 871
 
5.1%
c 864
 
5.1%
Other values (16) 7013
41.2%
Other Letter
ValueCountFrequency (%)
12
35.3%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%
Decimal Number
ValueCountFrequency (%)
8 12915
17.9%
3 11749
16.3%
4 11699
16.2%
6 8970
12.5%
2 5664
7.9%
1 5093
 
7.1%
5 4947
 
6.9%
7 4885
 
6.8%
9 4540
 
6.3%
0 1552
 
2.2%
Other Punctuation
ValueCountFrequency (%)
. 7149
99.9%
, 7
 
0.1%
/ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 1851
96.3%
+ 72
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 16643
100.0%
Space Separator
ValueCountFrequency (%)
10069
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 107821
66.5%
Latin 54258
33.5%
Hangul 34
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 3975
 
7.3%
G 3210
 
5.9%
B 2832
 
5.2%
C 2727
 
5.0%
H 2560
 
4.7%
S 2439
 
4.5%
T 2317
 
4.3%
L 2292
 
4.2%
E 2071
 
3.8%
A 1777
 
3.3%
Other values (42) 28058
51.7%
Common
ValueCountFrequency (%)
- 16643
15.4%
8 12915
12.0%
3 11749
10.9%
4 11699
10.9%
10069
9.3%
6 8970
8.3%
. 7149
6.6%
2 5664
 
5.3%
1 5093
 
4.7%
5 4947
 
4.6%
Other values (8) 12923
12.0%
Hangul
ValueCountFrequency (%)
12
35.3%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 162079
> 99.9%
Hangul 32
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 16643
 
10.3%
8 12915
 
8.0%
3 11749
 
7.2%
4 11699
 
7.2%
10069
 
6.2%
6 8970
 
5.5%
. 7149
 
4.4%
2 5664
 
3.5%
1 5093
 
3.1%
5 4947
 
3.1%
Other values (60) 67181
41.4%
Hangul
ValueCountFrequency (%)
12
37.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (6) 6
18.8%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct9202
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:29:47.677468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length166
Median length92
Mean length26.6691
Min length2

Characters and Unicode

Total characters266691
Distinct characters248
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8554 ?
Unique (%)85.5%

Sample

1st rowFree Fall
2nd rowWhere's Julius?
3rd row(The) Caterpillar and the Polliwog
4th row(The) Very Hungry Caterpillar Eats Lunch : A Colors Book
5th row(Lego) City, All Aboard!
ValueCountFrequency (%)
the 3980
 
8.3%
1729
 
3.6%
and 1310
 
2.7%
a 1165
 
2.4%
of 1076
 
2.2%
to 507
 
1.1%
in 482
 
1.0%
book 341
 
0.7%
is 293
 
0.6%
my 280
 
0.6%
Other values (7728) 36732
76.7%
2023-12-13T07:29:48.101058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38043
 
14.3%
e 23465
 
8.8%
o 15697
 
5.9%
a 15593
 
5.8%
t 13174
 
4.9%
r 13174
 
4.9%
n 12821
 
4.8%
i 12623
 
4.7%
s 11740
 
4.4%
h 9095
 
3.4%
Other values (238) 101266
38.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 179081
67.1%
Space Separator 38043
 
14.3%
Uppercase Letter 33497
 
12.6%
Other Punctuation 8108
 
3.0%
Close Punctuation 2587
 
1.0%
Open Punctuation 2587
 
1.0%
Decimal Number 1928
 
0.7%
Dash Punctuation 535
 
0.2%
Other Letter 270
 
0.1%
Math Symbol 48
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
2.6%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (145) 217
80.4%
Lowercase Letter
ValueCountFrequency (%)
e 23465
13.1%
o 15697
 
8.8%
a 15593
 
8.7%
t 13174
 
7.4%
r 13174
 
7.4%
n 12821
 
7.2%
i 12623
 
7.0%
s 11740
 
6.6%
h 9095
 
5.1%
l 8238
 
4.6%
Other values (16) 43461
24.3%
Uppercase Letter
ValueCountFrequency (%)
T 4062
 
12.1%
S 3048
 
9.1%
B 2425
 
7.2%
M 2310
 
6.9%
A 2304
 
6.9%
W 1914
 
5.7%
C 1907
 
5.7%
D 1676
 
5.0%
P 1676
 
5.0%
F 1583
 
4.7%
Other values (16) 10592
31.6%
Other Punctuation
ValueCountFrequency (%)
, 1929
23.8%
. 1642
20.3%
: 1564
19.3%
' 1272
15.7%
! 1004
12.4%
? 455
 
5.6%
& 174
 
2.1%
; 18
 
0.2%
/ 15
 
0.2%
" 12
 
0.1%
Other values (6) 23
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 512
26.6%
2 311
16.1%
3 259
13.4%
0 193
 
10.0%
4 166
 
8.6%
5 122
 
6.3%
6 103
 
5.3%
9 98
 
5.1%
7 83
 
4.3%
8 81
 
4.2%
Math Symbol
ValueCountFrequency (%)
= 24
50.0%
+ 19
39.6%
~ 3
 
6.2%
> 1
 
2.1%
< 1
 
2.1%
Close Punctuation
ValueCountFrequency (%)
) 2573
99.5%
] 14
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 2573
99.5%
[ 14
 
0.5%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
38043
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 535
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 212581
79.7%
Common 53840
 
20.2%
Hangul 268
 
0.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
2.6%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (143) 215
80.2%
Latin
ValueCountFrequency (%)
e 23465
 
11.0%
o 15697
 
7.4%
a 15593
 
7.3%
t 13174
 
6.2%
r 13174
 
6.2%
n 12821
 
6.0%
i 12623
 
5.9%
s 11740
 
5.5%
h 9095
 
4.3%
l 8238
 
3.9%
Other values (44) 76961
36.2%
Common
ValueCountFrequency (%)
38043
70.7%
) 2573
 
4.8%
( 2573
 
4.8%
, 1929
 
3.6%
. 1642
 
3.0%
: 1564
 
2.9%
' 1272
 
2.4%
! 1004
 
1.9%
- 535
 
1.0%
1 512
 
1.0%
Other values (29) 2193
 
4.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 266404
99.9%
Hangul 268
 
0.1%
None 8
 
< 0.1%
Punctuation 6
 
< 0.1%
Number Forms 3
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38043
 
14.3%
e 23465
 
8.8%
o 15697
 
5.9%
a 15593
 
5.9%
t 13174
 
4.9%
r 13174
 
4.9%
n 12821
 
4.8%
i 12623
 
4.7%
s 11740
 
4.4%
h 9095
 
3.4%
Other values (76) 100979
37.9%
Hangul
ValueCountFrequency (%)
7
 
2.6%
7
 
2.6%
6
 
2.2%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (143) 215
80.2%
None
ValueCountFrequency (%)
· 5
62.5%
2
 
25.0%
1
 
12.5%
Punctuation
ValueCountFrequency (%)
4
66.7%
2
33.3%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct5426
Distinct (%)54.3%
Missing11
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T07:29:48.400277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length237
Median length147
Mean length29.347282
Min length1

Characters and Unicode

Total characters293150
Distinct characters170
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4077 ?
Unique (%)40.8%

Sample

1st rowDavid Wiesner
2nd rowJohn Burningham
3rd rowby Jack Kent
4th rowEric Carle
5th rowQuinlan B. Lee
ValueCountFrequency (%)
5312
 
11.0%
by 4594
 
9.5%
illustrated 3096
 
6.4%
david 329
 
0.7%
pictures 318
 
0.7%
alex 312
 
0.6%
hunt 292
 
0.6%
roderick 283
 
0.6%
brychta 278
 
0.6%
john 241
 
0.5%
Other values (6245) 33398
68.9%
2023-12-13T07:29:48.811088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38776
 
13.2%
e 24067
 
8.2%
a 22259
 
7.6%
r 18437
 
6.3%
l 17417
 
5.9%
i 16618
 
5.7%
t 16090
 
5.5%
n 15976
 
5.4%
o 11504
 
3.9%
s 11450
 
3.9%
Other values (160) 100556
34.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 212433
72.5%
Space Separator 38776
 
13.2%
Uppercase Letter 34187
 
11.7%
Other Punctuation 7061
 
2.4%
Other Letter 412
 
0.1%
Dash Punctuation 159
 
0.1%
Open Punctuation 58
 
< 0.1%
Close Punctuation 56
 
< 0.1%
Final Punctuation 3
 
< 0.1%
Math Symbol 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
5.8%
19
 
4.6%
17
 
4.1%
15
 
3.6%
15
 
3.6%
15
 
3.6%
13
 
3.2%
12
 
2.9%
12
 
2.9%
12
 
2.9%
Other values (85) 258
62.6%
Lowercase Letter
ValueCountFrequency (%)
e 24067
11.3%
a 22259
10.5%
r 18437
 
8.7%
l 17417
 
8.2%
i 16618
 
7.8%
t 16090
 
7.6%
n 15976
 
7.5%
o 11504
 
5.4%
s 11450
 
5.4%
y 9497
 
4.5%
Other values (17) 49118
23.1%
Uppercase Letter
ValueCountFrequency (%)
M 3084
 
9.0%
S 2911
 
8.5%
B 2699
 
7.9%
J 2386
 
7.0%
A 2274
 
6.7%
D 2219
 
6.5%
C 2155
 
6.3%
R 2012
 
5.9%
L 1982
 
5.8%
H 1736
 
5.1%
Other values (16) 10729
31.4%
Other Punctuation
ValueCountFrequency (%)
; 5246
74.3%
. 1139
 
16.1%
, 436
 
6.2%
' 130
 
1.8%
& 52
 
0.7%
: 24
 
0.3%
? 21
 
0.3%
/ 9
 
0.1%
2
 
< 0.1%
2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 49
84.5%
( 9
 
15.5%
Close Punctuation
ValueCountFrequency (%)
] 47
83.9%
) 9
 
16.1%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
38776
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 159
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 246621
84.1%
Common 46117
 
15.7%
Hangul 409
 
0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
5.9%
19
 
4.6%
17
 
4.2%
15
 
3.7%
15
 
3.7%
15
 
3.7%
13
 
3.2%
12
 
2.9%
12
 
2.9%
12
 
2.9%
Other values (82) 255
62.3%
Latin
ValueCountFrequency (%)
e 24067
 
9.8%
a 22259
 
9.0%
r 18437
 
7.5%
l 17417
 
7.1%
i 16618
 
6.7%
t 16090
 
6.5%
n 15976
 
6.5%
o 11504
 
4.7%
s 11450
 
4.6%
y 9497
 
3.9%
Other values (44) 83306
33.8%
Common
ValueCountFrequency (%)
38776
84.1%
; 5246
 
11.4%
. 1139
 
2.5%
, 436
 
0.9%
- 159
 
0.3%
' 130
 
0.3%
& 52
 
0.1%
[ 49
 
0.1%
] 47
 
0.1%
: 24
 
0.1%
Other values (11) 59
 
0.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 292727
99.9%
Hangul 409
 
0.1%
None 7
 
< 0.1%
Punctuation 3
 
< 0.1%
CJK 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38776
 
13.2%
e 24067
 
8.2%
a 22259
 
7.6%
r 18437
 
6.3%
l 17417
 
5.9%
i 16618
 
5.7%
t 16090
 
5.5%
n 15976
 
5.5%
o 11504
 
3.9%
s 11450
 
3.9%
Other values (60) 100133
34.2%
Hangul
ValueCountFrequency (%)
24
 
5.9%
19
 
4.6%
17
 
4.2%
15
 
3.7%
15
 
3.7%
15
 
3.7%
13
 
3.2%
12
 
2.9%
12
 
2.9%
12
 
2.9%
Other values (82) 255
62.3%
Punctuation
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
ø 3
42.9%
2
28.6%
2
28.6%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct1277
Distinct (%)12.8%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T07:29:49.021898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length77
Median length47
Mean length14.493399
Min length1

Characters and Unicode

Total characters144905
Distinct characters137
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique666 ?
Unique (%)6.7%

Sample

1st rowHarperCollins
2nd rowRed fox
3rd rowSimon & Schuster
4th rowPenguin Young Readers
5th rowScholastic
ValueCountFrequency (%)
books 2125
 
10.5%
scholastic 1316
 
6.5%
press 945
 
4.7%
house 465
 
2.3%
463
 
2.3%
oxford 459
 
2.3%
puffin 450
 
2.2%
random 407
 
2.0%
university 404
 
2.0%
harpercollins 350
 
1.7%
Other values (956) 12928
63.6%
2023-12-13T07:29:49.373197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 13702
 
9.5%
s 10623
 
7.3%
10340
 
7.1%
r 9593
 
6.6%
e 9016
 
6.2%
i 8753
 
6.0%
n 8157
 
5.6%
a 7464
 
5.2%
l 6973
 
4.8%
t 4935
 
3.4%
Other values (127) 55349
38.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 110990
76.6%
Uppercase Letter 21130
 
14.6%
Space Separator 10340
 
7.1%
Other Punctuation 1691
 
1.2%
Other Letter 656
 
0.5%
Dash Punctuation 28
 
< 0.1%
Close Punctuation 22
 
< 0.1%
Open Punctuation 22
 
< 0.1%
Math Symbol 15
 
< 0.1%
Decimal Number 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
12.3%
69
10.5%
68
10.4%
66
10.1%
66
10.1%
65
9.9%
65
9.9%
65
9.9%
16
 
2.4%
13
 
2.0%
Other values (52) 82
12.5%
Lowercase Letter
ValueCountFrequency (%)
o 13702
12.3%
s 10623
 
9.6%
r 9593
 
8.6%
e 9016
 
8.1%
i 8753
 
7.9%
n 8157
 
7.3%
a 7464
 
6.7%
l 6973
 
6.3%
t 4935
 
4.4%
c 4477
 
4.0%
Other values (16) 27297
24.6%
Uppercase Letter
ValueCountFrequency (%)
B 2741
13.0%
P 2577
12.2%
S 2448
11.6%
H 2161
10.2%
C 1959
 
9.3%
R 1046
 
5.0%
M 825
 
3.9%
O 767
 
3.6%
A 756
 
3.6%
D 752
 
3.6%
Other values (16) 5098
24.1%
Other Punctuation
ValueCountFrequency (%)
& 485
28.7%
' 342
20.2%
. 308
18.2%
: 297
17.6%
, 181
 
10.7%
24
 
1.4%
/ 14
 
0.8%
· 13
 
0.8%
! 12
 
0.7%
; 11
 
0.7%
Decimal Number
ValueCountFrequency (%)
6 3
30.0%
3 3
30.0%
0 3
30.0%
4 1
 
10.0%
Close Punctuation
ValueCountFrequency (%)
) 21
95.5%
] 1
 
4.5%
Open Punctuation
ValueCountFrequency (%)
( 21
95.5%
[ 1
 
4.5%
Space Separator
ValueCountFrequency (%)
10340
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Math Symbol
ValueCountFrequency (%)
+ 15
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 132120
91.2%
Common 12129
 
8.4%
Hangul 656
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
12.3%
69
10.5%
68
10.4%
66
10.1%
66
10.1%
65
9.9%
65
9.9%
65
9.9%
16
 
2.4%
13
 
2.0%
Other values (52) 82
12.5%
Latin
ValueCountFrequency (%)
o 13702
 
10.4%
s 10623
 
8.0%
r 9593
 
7.3%
e 9016
 
6.8%
i 8753
 
6.6%
n 8157
 
6.2%
a 7464
 
5.6%
l 6973
 
5.3%
t 4935
 
3.7%
c 4477
 
3.4%
Other values (42) 48427
36.7%
Common
ValueCountFrequency (%)
10340
85.3%
& 485
 
4.0%
' 342
 
2.8%
. 308
 
2.5%
: 297
 
2.4%
, 181
 
1.5%
- 28
 
0.2%
24
 
0.2%
) 21
 
0.2%
( 21
 
0.2%
Other values (13) 82
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 144208
99.5%
Hangul 656
 
0.5%
None 41
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 13702
 
9.5%
s 10623
 
7.4%
10340
 
7.2%
r 9593
 
6.7%
e 9016
 
6.3%
i 8753
 
6.1%
n 8157
 
5.7%
a 7464
 
5.2%
l 6973
 
4.8%
t 4935
 
3.4%
Other values (62) 54652
37.9%
Hangul
ValueCountFrequency (%)
81
12.3%
69
10.5%
68
10.4%
66
10.1%
66
10.1%
65
9.9%
65
9.9%
65
9.9%
16
 
2.4%
13
 
2.0%
Other values (52) 82
12.5%
None
ValueCountFrequency (%)
24
58.5%
· 13
31.7%
4
 
9.8%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-02-17 00:00:00
Maximum2023-02-17 00:00:00
2023-12-13T07:29:49.473233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:29:49.545406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T07:29:45.473755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T07:29:45.617102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:29:45.755995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:29:45.886004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번등록번호청구기호도서명저자명발행자명기준일자
1333913340IM0000016815AU 843.5-W651f=2Free FallDavid WiesnerHarperCollins2023-02-17
72547255IM0000008302AU 843.5-B966wbWhere's Julius?John BurninghamRed fox2023-02-17
2127821279IM0000023762BET 843.5-J12c=2(The) Caterpillar and the Polliwogby Jack KentSimon & Schuster2023-02-17
2740127402IM0000031052AU 843.6-C278v-el(The) Very Hungry Caterpillar Eats Lunch : A Colors BookEric CarlePenguin Young Readers2023-02-17
1354413545IM0000017023GR 843-S368l-L1-ab(Lego) City, All Aboard!Quinlan B. LeeScholastic2023-02-17
1736317364IM0000020710BET 843.6-D276l(The) Legend of Rock Paper ScissorsDrew Daywalt ; illustrated by Adam RexBalzer + Bray2023-02-17
29252926IM0000001889GRL 744-H313r-SRed Balloons, TheCindy HarrisHMH2023-02-17
1482214823IM0000015936NF 408-H293l-LFS 1=2(A) Nest Full of EggsPriscilla Belz Jenkins ; illustrated by Lizzy RockwellHarperCollins2023-02-17
1201212013IM0000014770CH 808.9-S838c-1=3Classic starts: the adventures of Huckleberry FinnMark Twain ; original by Oliver Ho ; Illustrated by Dan AndreasenSterling2023-02-17
1493514936IM0000015904NF 408-N277sl-Kids-PrSleep, Bear!Shelby AlinskyNational Geographic2023-02-17
연번등록번호청구기호도서명저자명발행자명기준일자
2049520496IM0000022375CH 843.6-K14g-4=2Go Girl!. 4, (The) New GirlRowan McauleySquare Fish2023-02-17
12311232IM0000002138GRL 556-K192d-ADrive Toward The FutureAnn KaskeScholastic2023-02-17
49164917IM0000007248AU 843.6-C278wh-ba=2Baby Bear, Baby Bear, What do you See?Bill Martin ; pictures by Eric CarlePuffin Books2023-02-17
354355IM0000000736GR 843.6-G878m-All2Martin luther king, jr. and the march on washingtonFrances E. ruffinScholastic2023-02-17
2190721908IM0000023483CH 843.6-K94m-8Magic Bone. 8, Rootin' Tootin' Cow DogNancy Krulik ; illustrated by Sevastien BraunGrosset & Dunlap2023-02-17
43484349IM0000007362AU 843.5-D419mTomie dePaola's more mother goose favoritesTomie DePaolaGrosset & Dunlap2023-02-17
2809728098IM0000030522BET 843.6-B885d(A) Dark Dark TaleRuth BrownTWOPONDS2023-02-17
40254026IM0000003581GRL 082-R813p-S(The)panda bearDeborah ChilekRosen Pub.2023-02-17
53225323IM0000002981GR 843.6-G878l-All3Lightning : it's electrifyingJennifer Dussling ; Lori OsieckiGrosset & Dunlap2023-02-17
146147IM0000001636GRL 843-M216w-SWhen the king rides byMargaret Mahy ; Betina OgdenMondo2023-02-17