Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 2
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
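
The derived figures above follow from the raw counts. The following is a minimal sketch of that arithmetic, assuming that the memory size is expressed in KiB of 1024 bytes and that the missing-cell percentage is taken over all cells (variables × observations). The variable names are illustrative and not part of the report.

```python
# Raw figures copied from the dataset statistics above.
n_variables = 6
n_observations = 10_000
missing_cells = 2
total_size_kib = 556.6

# Share of missing cells over the whole table (assumed denominator: all cells).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is far below the 0.1% display threshold, which is why the report shows "< 0.1%" rather than a number.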

Variable types

Text: 5
Numeric: 1

Dataset

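Description: 부산광역시_연제구_자료관도서목록_20200916 (Busan Metropolitan City, Yeonje-gu, archive library book list, 2020-09-16)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15048049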

Alerts

등록번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 16:12:51.616636
Analysis finished: 2023-12-10 16:12:55.495079
Duration: 3.88 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
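
The reported duration is the difference between the two timestamps above. A small sketch using only the standard library shows the calculation; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2023-12-10 16:12:51.616636")
finished = datetime.fromisoformat("2023-12-10 16:12:55.495079")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```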

Variables

등록번호 (registration number)
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 16
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
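
As an illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation; the helper name `category_counts` is hypothetical. The standard library exposes the general category (for example `Lu` for Uppercase Letter and `Nd` for Decimal Number) but not the script or block, so those two properties would need another data source.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the values."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# Two registration numbers taken from the sample rows of this report.
sample = ["ABN000057095", "ABN000091178"]
counts = category_counts(sample)
print(counts)
```

Each sample value contributes three uppercase letters and nine decimal digits, which matches the 25% / 75% split reported for this variable.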

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: ABN000057095
2nd row: ABN000091178
3rd row: ABN000023272
4th row: ABN000030192
5th row: ABN000014114
Value | Count | Frequency (%)
abn000057095 | 1 | < 0.1%
abn000004880 | 1 | < 0.1%
abn000061218 | 1 | < 0.1%
abn000006078 | 1 | < 0.1%
abn000012637 | 1 | < 0.1%
abn000015519 | 1 | < 0.1%
abn000034703 | 1 | < 0.1%
abn000040443 | 1 | < 0.1%
abn000000455 | 1 | < 0.1%
abn000032016 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 45574 | 38.0%
A | 10000 | 8.3%
B | 10000 | 8.3%
N | 9326 | 7.8%
1 | 5171 | 4.3%
4 | 5119 | 4.3%
5 | 5034 | 4.2%
3 | 5033 | 4.2%
6 | 4988 | 4.2%
2 | 4971 | 4.1%
Other values (6) | 14784 | 12.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 90000 | 75.0%
Uppercase Letter | 30000 | 25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 45574
50.6%
1 5171
 
5.7%
4 5119
 
5.7%
5 5034
 
5.6%
3 5033
 
5.6%
6 4988
 
5.5%
2 4971
 
5.5%
7 4931
 
5.5%
8 4893
 
5.4%
9 4286
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 9326
31.1%
E 374
 
1.2%
F 225
 
0.8%
H 75
 
0.2%

Most occurring scripts

Value | Count | Frequency (%)
Common | 90000 | 75.0%
Latin | 30000 | 25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 45574
50.6%
1 5171
 
5.7%
4 5119
 
5.7%
5 5034
 
5.6%
3 5033
 
5.6%
6 4988
 
5.5%
2 4971
 
5.5%
7 4931
 
5.5%
8 4893
 
5.4%
9 4286
 
4.8%
Latin
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 9326
31.1%
E 374
 
1.2%
F 225
 
0.8%
H 75
 
0.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 120000 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 45574
38.0%
A 10000
 
8.3%
B 10000
 
8.3%
N 9326
 
7.8%
1 5171
 
4.3%
4 5119
 
4.3%
5 5034
 
4.2%
3 5033
 
4.2%
6 4988
 
4.2%
2 4971
 
4.1%
Other values (6) 14784
 
12.3%
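
The statistics for 등록번호 suggest a fixed format: every value is 12 characters long, with three uppercase letters per value (30000 letters over 10000 values) and nine decimal digits. The sketch below turns that into a validity check. The pattern is an assumption inferred from those counts and from the five sample values, not from any published specification, and the names `REGISTRATION_RE` and `looks_like_registration_number` are hypothetical.

```python
import re

# Inferred pattern (assumption): three uppercase ASCII letters, then nine ASCII digits.
REGISTRATION_RE = re.compile(r"[A-Z]{3}[0-9]{9}")


def looks_like_registration_number(value: str) -> bool:
    """Return True when the whole string matches the inferred 12-character format."""
    return REGISTRATION_RE.fullmatch(value) is not None


print(looks_like_registration_number("ABN000057095"))
```

A check like this could be run over the column before loading the data elsewhere; values that fail would point either to data-entry problems or to a format the inference above did not anticipate.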

서명 (title)
Text

Distinct: 9879
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 302
Median length: 88
Mean length: 21.7737
Min length: 1

Characters and Unicode

Total characters: 217737
Distinct characters: 1555
Distinct categories: 17
Distinct scripts: 8
Distinct blocks: 16
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9763
Unique (%): 97.6%
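
Distinct and Unique measure different things for this variable: there are 9879 distinct titles, but only 9763 of them occur exactly once. The sketch below computes both counts on toy data. It assumes that "unique" means a value that appears exactly once, which is consistent with the figures above but is an interpretation rather than something the report states. The helper name `distinct_and_unique` is hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy data: "A" and "D" occur once, "B" twice, "C" three times.
titles = ["A", "B", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(titles)
print(distinct, unique)
```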

Sample

1st row: 구글 애널리틱스로 모아보는 데이터 : 기본 보고서를 넘어 통합 마케팅 분석 센터로 가는 길
2nd row: 이끼. 1
3rd row: 미래학자의 통찰법
4th row: 바이올리니스트의 엄지 : 사랑과 전쟁과 천재성에 관한 DNA 이야기
5th row: 테마로 보는 서양미술
ValueCountFrequency (%)
3383
 
6.4%
the 446
 
0.8%
이야기 370
 
0.7%
2 287
 
0.5%
1 260
 
0.5%
and 208
 
0.4%
of 191
 
0.4%
190
 
0.4%
위한 157
 
0.3%
우리 154
 
0.3%
Other values (22534) 47260
89.3%

Most occurring characters

ValueCountFrequency (%)
43261
 
19.9%
e 3853
 
1.8%
3561
 
1.6%
: 3295
 
1.5%
3208
 
1.5%
a 2700
 
1.2%
o 2605
 
1.2%
2605
 
1.2%
, 2518
 
1.2%
n 2304
 
1.1%
Other values (1545) 147827
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117536
54.0%
Space Separator 43261
 
19.9%
Lowercase Letter 30821
 
14.2%
Other Punctuation 8993
 
4.1%
Uppercase Letter 6047
 
2.8%
Decimal Number 5916
 
2.7%
Open Punctuation 2193
 
1.0%
Close Punctuation 2193
 
1.0%
Math Symbol 503
 
0.2%
Dash Punctuation 242
 
0.1%
Other values (7) 32
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3561
 
3.0%
3208
 
2.7%
2605
 
2.2%
2013
 
1.7%
1732
 
1.5%
1660
 
1.4%
1657
 
1.4%
1567
 
1.3%
1520
 
1.3%
1494
 
1.3%
Other values (1402) 96519
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 3853
12.5%
a 2700
 
8.8%
o 2605
 
8.5%
n 2304
 
7.5%
t 2241
 
7.3%
i 2206
 
7.2%
r 2185
 
7.1%
s 2014
 
6.5%
h 1464
 
4.8%
l 1392
 
4.5%
Other values (38) 7857
25.5%
Uppercase Letter
ValueCountFrequency (%)
L 845
14.0%
T 588
 
9.7%
S 499
 
8.3%
B 341
 
5.6%
A 339
 
5.6%
M 339
 
5.6%
D 322
 
5.3%
C 319
 
5.3%
P 273
 
4.5%
W 253
 
4.2%
Other values (18) 1929
31.9%
Other Punctuation
ValueCountFrequency (%)
: 3295
36.6%
, 2518
28.0%
. 1365
15.2%
! 679
 
7.6%
? 409
 
4.5%
' 303
 
3.4%
· 211
 
2.3%
; 49
 
0.5%
/ 36
 
0.4%
& 36
 
0.4%
Other values (12) 92
 
1.0%
Decimal Number
ValueCountFrequency (%)
0 1570
26.5%
1 1105
18.7%
2 895
15.1%
3 591
 
10.0%
5 548
 
9.3%
4 392
 
6.6%
7 246
 
4.2%
6 211
 
3.6%
9 183
 
3.1%
8 175
 
3.0%
Math Symbol
ValueCountFrequency (%)
= 405
80.5%
+ 33
 
6.6%
~ 33
 
6.6%
< 9
 
1.8%
> 9
 
1.8%
5
 
1.0%
× 4
 
0.8%
| 3
 
0.6%
2
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 1816
82.8%
[ 358
 
16.3%
10
 
0.5%
4
 
0.2%
3
 
0.1%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1816
82.8%
] 358
 
16.3%
10
 
0.5%
4
 
0.2%
3
 
0.1%
2
 
0.1%
Letter Number
ValueCountFrequency (%)
5
50.0%
3
30.0%
1
 
10.0%
1
 
10.0%
Other Symbol
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
43261
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 242
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 6
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117337
53.9%
Common 63323
29.1%
Latin 36839
 
16.9%
Han 184
 
0.1%
Cyrillic 35
 
< 0.1%
Hiragana 12
 
< 0.1%
Greek 4
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3561
 
3.0%
3208
 
2.7%
2605
 
2.2%
2013
 
1.7%
1732
 
1.5%
1660
 
1.4%
1657
 
1.4%
1567
 
1.3%
1520
 
1.3%
1494
 
1.3%
Other values (1260) 96320
82.1%
Han
ValueCountFrequency (%)
7
 
3.8%
6
 
3.3%
5
 
2.7%
5
 
2.7%
4
 
2.2%
4
 
2.2%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (119) 141
76.6%
Common
ValueCountFrequency (%)
43261
68.3%
: 3295
 
5.2%
, 2518
 
4.0%
( 1816
 
2.9%
) 1816
 
2.9%
0 1570
 
2.5%
. 1365
 
2.2%
1 1105
 
1.7%
2 895
 
1.4%
! 679
 
1.1%
Other values (53) 5003
 
7.9%
Latin
ValueCountFrequency (%)
e 3853
 
10.5%
a 2700
 
7.3%
o 2605
 
7.1%
n 2304
 
6.3%
t 2241
 
6.1%
i 2206
 
6.0%
r 2185
 
5.9%
s 2014
 
5.5%
h 1464
 
4.0%
l 1392
 
3.8%
Other values (46) 13875
37.7%
Cyrillic
ValueCountFrequency (%)
е 3
 
8.6%
о 3
 
8.6%
с 3
 
8.6%
н 3
 
8.6%
ы 2
 
5.7%
а 2
 
5.7%
р 2
 
5.7%
у 2
 
5.7%
к 2
 
5.7%
т 1
 
2.9%
Other values (12) 12
34.3%
Hiragana
ValueCountFrequency (%)
3
25.0%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Greek
ValueCountFrequency (%)
π 3
75.0%
τ 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117336
53.9%
ASCII 99819
45.8%
None 302
 
0.1%
CJK 174
 
0.1%
Cyrillic 35
 
< 0.1%
Punctuation 22
 
< 0.1%
Hiragana 12
 
< 0.1%
Number Forms 10
 
< 0.1%
CJK Compat Ideographs 10
 
< 0.1%
Math Operators 5
 
< 0.1%
Other values (6) 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43261
43.3%
e 3853
 
3.9%
: 3295
 
3.3%
a 2700
 
2.7%
o 2605
 
2.6%
, 2518
 
2.5%
n 2304
 
2.3%
t 2241
 
2.2%
i 2206
 
2.2%
r 2185
 
2.2%
Other values (79) 32651
32.7%
Hangul
ValueCountFrequency (%)
3561
 
3.0%
3208
 
2.7%
2605
 
2.2%
2013
 
1.7%
1732
 
1.5%
1660
 
1.4%
1657
 
1.4%
1567
 
1.3%
1520
 
1.3%
1494
 
1.3%
Other values (1259) 96319
82.1%
None
ValueCountFrequency (%)
· 211
69.9%
20
 
6.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
4
 
1.3%
4
 
1.3%
× 4
 
1.3%
3
 
1.0%
Other values (9) 17
 
5.6%
Punctuation
ValueCountFrequency (%)
14
63.6%
4
 
18.2%
3
 
13.6%
1
 
4.5%
CJK
ValueCountFrequency (%)
7
 
4.0%
6
 
3.4%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (111) 131
75.3%
Number Forms
ValueCountFrequency (%)
5
50.0%
3
30.0%
1
 
10.0%
1
 
10.0%
Math Operators
ValueCountFrequency (%)
5
100.0%
Hiragana
ValueCountFrequency (%)
3
25.0%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Misc Symbols
ValueCountFrequency (%)
3
100.0%
Cyrillic
ValueCountFrequency (%)
е 3
 
8.6%
о 3
 
8.6%
с 3
 
8.6%
н 3
 
8.6%
ы 2
 
5.7%
а 2
 
5.7%
р 2
 
5.7%
у 2
 
5.7%
к 2
 
5.7%
т 1
 
2.9%
Other values (12) 12
34.3%
Arrows
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

저자명 (author name)
Text

Distinct: 8729
Distinct (%): 87.3%
Missing: 1
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 231
Median length: 118
Mean length: 17.730973
Min length: 2

Characters and Unicode

Total characters: 177292
Distinct characters: 1020
Distinct categories: 12
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
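
The mean length reported for this variable is consistent with dividing the total character count by the number of non-missing values (10000 observations minus the 1 missing value). The following sketch reproduces that check; treating the denominator this way is an inference from the numbers, and the variable names are illustrative.

```python
# Figures copied from the statistics for this variable.
n_observations = 10_000
n_missing = 1
total_characters = 177_292

# Mean length over the non-missing values only (assumed denominator).
mean_length = total_characters / (n_observations - n_missing)
print(round(mean_length, 6))
```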

Unique

Unique: 8042
Unique (%): 80.4%

Sample

1st row: 다니엘 와이스버그 지음 ; 송용근 옮김
2nd row: 윤태호 지음
3rd row: 최윤식 지음
4th row: 샘 킨 지음 ; 이충호 옮김
5th row: 권용준 지음
ValueCountFrequency (%)
7119
 
14.5%
지음 5137
 
10.5%
옮김 2829
 
5.8%
그림 2290
 
4.7%
1875
 
3.8%
by 1742
 
3.5%
illustrated 404
 
0.8%
글·그림 368
 
0.7%
공]지음 326
 
0.7%
190
 
0.4%
Other values (13750) 26835
54.6%

Most occurring characters

ValueCountFrequency (%)
39464
22.3%
; 7105
 
4.0%
6224
 
3.5%
5731
 
3.2%
5049
 
2.8%
e 3516
 
2.0%
2966
 
1.7%
2940
 
1.7%
2914
 
1.6%
a 2886
 
1.6%
Other values (1010) 98497
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89028
50.2%
Space Separator 39464
22.3%
Lowercase Letter 32787
 
18.5%
Other Punctuation 8925
 
5.0%
Uppercase Letter 4957
 
2.8%
Open Punctuation 996
 
0.6%
Close Punctuation 996
 
0.6%
Dash Punctuation 77
 
< 0.1%
Decimal Number 40
 
< 0.1%
Math Symbol 18
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6224
 
7.0%
5731
 
6.4%
5049
 
5.7%
2966
 
3.3%
2940
 
3.3%
2914
 
3.3%
2817
 
3.2%
2485
 
2.8%
1497
 
1.7%
1482
 
1.7%
Other values (921) 54923
61.7%
Lowercase Letter
ValueCountFrequency (%)
e 3516
10.7%
a 2886
 
8.8%
r 2697
 
8.2%
t 2556
 
7.8%
i 2447
 
7.5%
y 2414
 
7.4%
l 2298
 
7.0%
n 2183
 
6.7%
b 1933
 
5.9%
o 1813
 
5.5%
Other values (16) 8044
24.5%
Uppercase Letter
ValueCountFrequency (%)
B 550
 
11.1%
M 483
 
9.7%
S 411
 
8.3%
R 389
 
7.8%
J 317
 
6.4%
A 310
 
6.3%
C 272
 
5.5%
P 266
 
5.4%
H 264
 
5.3%
D 251
 
5.1%
Other values (16) 1444
29.1%
Other Punctuation
ValueCountFrequency (%)
; 7105
79.6%
, 693
 
7.8%
. 534
 
6.0%
· 459
 
5.1%
: 99
 
1.1%
& 14
 
0.2%
' 11
 
0.1%
/ 5
 
0.1%
4
 
< 0.1%
% 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 8
20.0%
6 6
15.0%
8 5
12.5%
0 5
12.5%
9 4
10.0%
3 4
10.0%
4 4
10.0%
2 2
 
5.0%
7 1
 
2.5%
5 1
 
2.5%
Open Punctuation
ValueCountFrequency (%)
[ 987
99.1%
( 5
 
0.5%
3
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 987
99.1%
) 5
 
0.5%
3
 
0.3%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 8
44.4%
< 8
44.4%
1
 
5.6%
1
 
5.6%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
39464
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 89008
50.2%
Common 50520
28.5%
Latin 37744
21.3%
Han 20
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6224
 
7.0%
5731
 
6.4%
5049
 
5.7%
2966
 
3.3%
2940
 
3.3%
2914
 
3.3%
2817
 
3.2%
2485
 
2.8%
1497
 
1.7%
1482
 
1.7%
Other values (904) 54903
61.7%
Latin
ValueCountFrequency (%)
e 3516
 
9.3%
a 2886
 
7.6%
r 2697
 
7.1%
t 2556
 
6.8%
i 2447
 
6.5%
y 2414
 
6.4%
l 2298
 
6.1%
n 2183
 
5.8%
b 1933
 
5.1%
o 1813
 
4.8%
Other values (42) 13001
34.4%
Common
ValueCountFrequency (%)
39464
78.1%
; 7105
 
14.1%
[ 987
 
2.0%
] 987
 
2.0%
, 693
 
1.4%
. 534
 
1.1%
· 459
 
0.9%
: 99
 
0.2%
- 77
 
0.2%
& 14
 
< 0.1%
Other values (27) 101
 
0.2%
Han
ValueCountFrequency (%)
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 89007
50.2%
ASCII 87787
49.5%
None 474
 
0.3%
CJK 20
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39464
45.0%
; 7105
 
8.1%
e 3516
 
4.0%
a 2886
 
3.3%
r 2697
 
3.1%
t 2556
 
2.9%
i 2447
 
2.8%
y 2414
 
2.7%
l 2298
 
2.6%
n 2183
 
2.5%
Other values (68) 20221
23.0%
Hangul
ValueCountFrequency (%)
6224
 
7.0%
5731
 
6.4%
5049
 
5.7%
2966
 
3.3%
2940
 
3.3%
2914
 
3.3%
2817
 
3.2%
2485
 
2.8%
1497
 
1.7%
1482
 
1.7%
Other values (903) 54902
61.7%
None
ValueCountFrequency (%)
· 459
96.8%
4
 
0.8%
3
 
0.6%
3
 
0.6%
1
 
0.2%
½ 1
 
0.2%
1
 
0.2%
1
 
0.2%
1
 
0.2%
CJK
ValueCountFrequency (%)
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행자명 (publisher name)
Text

Distinct: 2698
Distinct (%): 27.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 68
Median length: 54
Mean length: 5.5819
Min length: 1

Characters and Unicode

Total characters: 55819
Distinct characters: 736
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1435
Unique (%): 14.3%

Sample

1st row: 에이콘
2nd row: 재미주의
3rd row: 김영사
4th row: 해나무
5th row: 살림
Value | Count | Frequency (%)
books | 202 | 1.8%
문학동네 | 170 | 1.5%
창비 | 157 | 1.4%
비룡소 | 137 | 1.2%
scholastic | 125 | 1.1%
press | 105 | 0.9%
사계절 | 86 | 0.7%
시공주니어 | 83 | 0.7%
민음사 | 82 | 0.7%
살림 | 71 | 0.6%
Other values (2728) | 10308 | 89.4%

Most occurring characters

ValueCountFrequency (%)
o 1972
 
3.5%
1526
 
2.7%
e 1409
 
2.5%
r 1348
 
2.4%
s 1348
 
2.4%
1314
 
2.4%
1233
 
2.2%
n 1149
 
2.1%
i 1129
 
2.0%
1101
 
2.0%
Other values (726) 42290
75.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34969
62.6%
Lowercase Letter 15104
27.1%
Uppercase Letter 3325
 
6.0%
Space Separator 1526
 
2.7%
Other Punctuation 384
 
0.7%
Decimal Number 179
 
0.3%
Close Punctuation 155
 
0.3%
Open Punctuation 155
 
0.3%
Dash Punctuation 18
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1314
 
3.8%
1233
 
3.5%
1101
 
3.1%
1003
 
2.9%
876
 
2.5%
668
 
1.9%
665
 
1.9%
632
 
1.8%
549
 
1.6%
519
 
1.5%
Other values (645) 26409
75.5%
Uppercase Letter
ValueCountFrequency (%)
B 456
13.7%
P 338
10.2%
R 313
 
9.4%
S 299
 
9.0%
H 258
 
7.8%
C 255
 
7.7%
M 232
 
7.0%
A 150
 
4.5%
O 121
 
3.6%
U 118
 
3.5%
Other values (16) 785
23.6%
Lowercase Letter
ValueCountFrequency (%)
o 1972
13.1%
e 1409
 
9.3%
r 1348
 
8.9%
s 1348
 
8.9%
n 1149
 
7.6%
i 1129
 
7.5%
a 1002
 
6.6%
l 753
 
5.0%
d 589
 
3.9%
c 565
 
3.7%
Other values (15) 3840
25.4%
Other Punctuation
ValueCountFrequency (%)
: 106
27.6%
& 80
20.8%
. 49
12.8%
· 47
12.2%
' 39
 
10.2%
, 34
 
8.9%
11
 
2.9%
# 6
 
1.6%
/ 3
 
0.8%
; 3
 
0.8%
Other values (3) 6
 
1.6%
Decimal Number
ValueCountFrequency (%)
2 74
41.3%
1 68
38.0%
0 11
 
6.1%
3 9
 
5.0%
4 7
 
3.9%
6 4
 
2.2%
5 3
 
1.7%
8 2
 
1.1%
9 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
] 98
63.2%
) 57
36.8%
Open Punctuation
ValueCountFrequency (%)
[ 98
63.2%
( 57
36.8%
Space Separator
ValueCountFrequency (%)
1526
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34947
62.6%
Latin 18429
33.0%
Common 2421
 
4.3%
Han 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1314
 
3.8%
1233
 
3.5%
1101
 
3.2%
1003
 
2.9%
876
 
2.5%
668
 
1.9%
665
 
1.9%
632
 
1.8%
549
 
1.6%
519
 
1.5%
Other values (628) 26387
75.5%
Latin
ValueCountFrequency (%)
o 1972
 
10.7%
e 1409
 
7.6%
r 1348
 
7.3%
s 1348
 
7.3%
n 1149
 
6.2%
i 1129
 
6.1%
a 1002
 
5.4%
l 753
 
4.1%
d 589
 
3.2%
c 565
 
3.1%
Other values (41) 7165
38.9%
Common
ValueCountFrequency (%)
1526
63.0%
: 106
 
4.4%
] 98
 
4.0%
[ 98
 
4.0%
& 80
 
3.3%
2 74
 
3.1%
1 68
 
2.8%
) 57
 
2.4%
( 57
 
2.4%
. 49
 
2.0%
Other values (20) 208
 
8.6%
Han
ValueCountFrequency (%)
3
13.6%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (7) 7
31.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34947
62.6%
ASCII 20789
37.2%
None 61
 
0.1%
CJK 22
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 1972
 
9.5%
1526
 
7.3%
e 1409
 
6.8%
r 1348
 
6.5%
s 1348
 
6.5%
n 1149
 
5.5%
i 1129
 
5.4%
a 1002
 
4.8%
l 753
 
3.6%
d 589
 
2.8%
Other values (67) 8564
41.2%
Hangul
ValueCountFrequency (%)
1314
 
3.8%
1233
 
3.5%
1101
 
3.2%
1003
 
2.9%
876
 
2.5%
668
 
1.9%
665
 
1.9%
632
 
1.8%
549
 
1.6%
519
 
1.5%
Other values (628) 26387
75.5%
None
ValueCountFrequency (%)
· 47
77.0%
11
 
18.0%
2
 
3.3%
1
 
1.6%
CJK
ValueCountFrequency (%)
3
13.6%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (7) 7
31.8%

발행년 (publication year)
Real number (ℝ)

Distinct: 43
Distinct (%): 0.4%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2013.3442
Minimum: 1948
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1948
5-th percentile: 2006
Q1: 2011
Median: 2014
Q3: 2016
95-th percentile: 2019
Maximum: 2020
Range: 72
Interquartile range (IQR): 5
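
Range and IQR follow directly from the quantiles above. The sketch below recomputes them and, as an additional step that is not part of the report, derives Tukey's 1.5 × IQR fences, a common rule of thumb for flagging unusually early or late values. The variable names are illustrative.

```python
# Quantiles copied from the quantile statistics above.
minimum, q1, median, q3, maximum = 1948, 2011, 2014, 2016, 2020

# Range and interquartile range as reported.
value_range = maximum - minimum
iqr = q3 - q1

# Tukey fences (a swapped-in heuristic, not something the report computes).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, lower_fence, upper_fence)
```

Under this heuristic, publication years before the lower fence, such as the minimum of 1948, would be worth a manual look, although old publication dates are not necessarily errors in a library catalogue.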

Descriptive statistics

Standard deviation: 4.4617817
Coefficient of variation (CV): 0.0022161047
Kurtosis: 14.724329
Mean: 2013.3442
Median Absolute Deviation (MAD): 2
Skewness: -2.323977
Sum: 20131429
Variance: 19.907496
Monotonicity: Not monotonic
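
Several of the descriptive statistics above are linked by simple identities, which makes them easy to cross-check. The sketch below assumes that the mean is taken over the 9999 non-missing values, that the variance is the square of the standard deviation, and that the coefficient of variation is the standard deviation divided by the mean. The variable names are illustrative.

```python
# Figures copied from the statistics above.
n_observations = 10_000
n_missing = 1
total = 20_131_429      # Sum
std_dev = 4.4617817     # Standard deviation

# Mean over the non-missing values only (assumed denominator).
mean = total / (n_observations - n_missing)

# Variance as the square of the standard deviation.
variance = std_dev ** 2

# Coefficient of variation: standard deviation relative to the mean.
cv = std_dev / mean

print(f"mean={mean:.4f} variance={variance:.6f} cv={cv:.10f}")
```

The recomputed values agree with the reported ones up to the rounding of the inputs.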
Histogram with fixed size bins (bins=43)
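
A histogram with fixed-size bins splits the interval between the minimum and the maximum into equally wide bins; with 43 bins over 1948 to 2020, each bin would be about 72 / 43 ≈ 1.67 years wide. The sketch below is a plain-Python illustration of that binning, not the profiler's own code, and the function name `fixed_width_histogram` is hypothetical. The maximum is placed in the last bin, the same convention `numpy.histogram` uses.

```python
def fixed_width_histogram(values, bins):
    """Count values into `bins` equally wide bins between min and max."""
    lo, hi = min(values), max(values)
    # Fall back to a width of 1.0 when all values are identical.
    width = (hi - lo) / bins or 1.0
    counts = [0] * bins
    for v in values:
        # The maximum lies on the right edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return counts, edges


# Toy data, not the real column: five publication years in four bins.
years = [1948, 2014, 2014, 2015, 2020]
counts, edges = fixed_width_histogram(years, bins=4)
print(counts, edges)
```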
Value | Count | Frequency (%)
2014 | 1382 | 13.8%
2015 | 1358 | 13.6%
2018 | 970 | 9.7%
2016 | 896 | 9.0%
2017 | 839 | 8.4%
2012 | 828 | 8.3%
2013 | 695 | 7.0%
2011 | 527 | 5.3%
2019 | 504 | 5.0%
2010 | 409 | 4.1%
Other values (33) | 1591 | 15.9%

Value | Count | Frequency (%)
1948 | 1 | < 0.1%
1957 | 1 | < 0.1%
1964 | 1 | < 0.1%
1968 | 1 | < 0.1%
1979 | 1 | < 0.1%
1980 | 3 | < 0.1%
1981 | 1 | < 0.1%
1984 | 2 | < 0.1%
1985 | 4 | < 0.1%
1986 | 4 | < 0.1%

Value | Count | Frequency (%)
2020 | 21 | 0.2%
2019 | 504 | 5.0%
2018 | 970 | 9.7%
2017 | 839 | 8.4%
2016 | 896 | 9.0%
2015 | 1358 | 13.6%
2014 | 1382 | 13.8%
2013 | 695 | 7.0%
2012 | 828 | 8.3%
2011 | 527 | 5.3%

청구기호 (call number)
Text

Distinct: 9794
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 18
Median length: 15
Mean length: 10.0835
Min length: 5

Characters and Unicode

Total characters: 100835
Distinct characters: 39
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9730
Unique (%): 97.3%

Sample

1st row: 005.76-9
2nd row: 657.3-2-1
3rd row: 331.544-9=2
4th row: 474.3224-1
5th row: 031-3-176
Value | Count | Frequency (%)
아동 | 2024 | 13.7%
영어 | 1318 | 8.9%
그림책 | 934 | 6.3%
dvd | 225 | 1.5%
더책 | 98 | 0.7%
큰글자 | 47 | 0.3%
mom | 40 | 0.3%
보드북 | 38 | 0.3%
시니어 | 37 | 0.2%
808-3 | 24 | 0.2%
Other values (9645) | 10019 | 67.7%

Most occurring characters

ValueCountFrequency (%)
- 13742
13.6%
1 11747
11.6%
8 9739
9.7%
3 8846
 
8.8%
2 7067
 
7.0%
4 5952
 
5.9%
0 5388
 
5.3%
. 5305
 
5.3%
4804
 
4.8%
5 4543
 
4.5%
Other values (29) 23702
23.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 65147
64.6%
Dash Punctuation 13742
 
13.6%
Other Letter 10178
 
10.1%
Other Punctuation 5305
 
5.3%
Space Separator 4804
 
4.8%
Uppercase Letter 796
 
0.8%
Math Symbol 627
 
0.6%
Lowercase Letter 236
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2046
20.1%
2046
20.1%
1355
13.3%
1318
12.9%
1032
10.1%
934
9.2%
934
9.2%
98
 
1.0%
72
 
0.7%
47
 
0.5%
Other values (9) 296
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 11747
18.0%
8 9739
14.9%
3 8846
13.6%
2 7067
10.8%
4 5952
9.1%
0 5388
8.3%
5 4543
 
7.0%
9 4419
 
6.8%
7 4024
 
6.2%
6 3422
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
D 450
56.5%
V 225
28.3%
M 80
 
10.1%
O 40
 
5.0%
A 1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 13742
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5305
100.0%
Space Separator
ValueCountFrequency (%)
4804
100.0%
Math Symbol
ValueCountFrequency (%)
= 627
100.0%
Lowercase Letter
ValueCountFrequency (%)
v 236
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 89625
88.9%
Hangul 10178
 
10.1%
Latin 1032
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2046
20.1%
2046
20.1%
1355
13.3%
1318
12.9%
1032
10.1%
934
9.2%
934
9.2%
98
 
1.0%
72
 
0.7%
47
 
0.5%
Other values (9) 296
 
2.9%
Common
ValueCountFrequency (%)
- 13742
15.3%
1 11747
13.1%
8 9739
10.9%
3 8846
9.9%
2 7067
7.9%
4 5952
6.6%
0 5388
 
6.0%
. 5305
 
5.9%
4804
 
5.4%
5 4543
 
5.1%
Other values (4) 12492
13.9%
Latin
ValueCountFrequency (%)
D 450
43.6%
v 236
22.9%
V 225
21.8%
M 80
 
7.8%
O 40
 
3.9%
A 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90657
89.9%
Hangul 10178
 
10.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 13742
15.2%
1 11747
13.0%
8 9739
10.7%
3 8846
9.8%
2 7067
7.8%
4 5952
6.6%
0 5388
 
5.9%
. 5305
 
5.9%
4804
 
5.3%
5 4543
 
5.0%
Other values (10) 13524
14.9%
Hangul
ValueCountFrequency (%)
2046
20.1%
2046
20.1%
1355
13.3%
1318
12.9%
1032
10.1%
934
9.2%
934
9.2%
98
 
1.0%
72
 
0.7%
47
 
0.5%
Other values (9) 296
 
2.9%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
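
Nullity correlation can be sketched as the Pearson correlation between two 0/1 indicator columns, where 1 marks a missing value. The example below uses only the standard library and toy rows, not the real dataset. The function names `pearson` and `nullity_correlation` and the keys `author` and `year` are hypothetical, and returning `None` for a column whose indicator never varies is a design choice of this sketch.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences, or None if undefined."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        # An indicator that never changes has no defined correlation.
        return None
    return cov / sqrt(vx * vy)


def nullity_correlation(rows, col_a, col_b):
    """Correlate the missing-value indicators of two columns."""
    a = [1 if row[col_a] is None else 0 for row in rows]
    b = [1 if row[col_b] is None else 0 for row in rows]
    return pearson(a, b)


# Toy rows: both fields are missing together in the second and fourth row.
rows = [
    {"author": "x", "year": 2014},
    {"author": None, "year": None},
    {"author": "y", "year": 2015},
    {"author": None, "year": None},
]
print(nullity_correlation(rows, "author", "year"))
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one is missing when the other is present, and a value near 0 means the patterns are unrelated.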

Sample

(index) | 등록번호 | 서명 | 저자명 | 발행자명 | 발행년 | 청구기호
61172 | ABN000057095 | 구글 애널리틱스로 모아보는 데이터 : 기본 보고서를 넘어 통합 마케팅 분석 센터로 가는 길 | 다니엘 와이스버그 지음 ; 송용근 옮김 | 에이콘 | 2016 | 005.76-9
93298 | ABN000091178 | 이끼. 1 | 윤태호 지음 | 재미주의 | 2015 | 657.3-2-1
29329 | ABN000023272 | 미래학자의 통찰법 | 최윤식 지음 | 김영사 | 2014 | 331.544-9=2
35670 | ABN000030192 | 바이올리니스트의 엄지 : 사랑과 전쟁과 천재성에 관한 DNA 이야기 | 샘 킨 지음 ; 이충호 옮김 | 해나무 | 2014 | 474.3224-1
20329 | ABN000014114 | 테마로 보는 서양미술 | 권용준 지음 | 살림 | 2007 | 031-3-176
68122 | ABN000064286 | 나를 고백한다 : 존재에 대한 자문을 이끌어내는 논리적이고 사적인 고백 | 피에르 바야르 지음 ; 김병욱 옮김 | 여름언덕 | 2014 | 868-36
82995 | ABN000079794 | 위저드 베이커리 | 구병모 지음 | 창비 | 2018 | 813.7-286=2
85553 | ABN000083001 | 천사의 날개 | 크리스틴 리슨 글 ; 제인 채프먼 그림 ; 윤희선 옮김 | 세상모든책 | 2008 | 그림책 853-147
69548 | ABN000065790 | 메르타 할머니, 라스베이거스로 가다 | 카타리나 잉엘만순드베리 지음 ; 정장진 옮김 | 열린책들 | 2017 | 859.7-22
62687 | ABN000058661 | 바우돌리노. 상 | 움베르토 에코 지음 ; 이현경 옮김 | 열린책들 | 2013 | 883-20-1

(index) | 등록번호 | 서명 | 저자명 | 발행자명 | 발행년 | 청구기호
6627 | ABN000000316 | Addie's Bad Day | edited by HarperTrophy | HarperTrophy | 2008 | 영어 808-3-v2-51
89303 | ABN000086877 | 서랍 속 먼지 나라에 무슨 일이?! | 남동윤 지음 | 씨드북 | 2019 | 그림책 813-1447
80304 | ABN000076956 | 그래서 어디를 살까요 : 알면 돈 되는 신나는 부동산 잡학사전 | 빠숑, 서울휘, 아임해피 [공]지음 | 다산북스 | 2018 | 327.87-148
62283 | ABN000058243 | 펫 닥터스 = Pet doctors : 반려동물과 행복하게 오래 살기 위한 맞춤 지침서 | sky pet park <펫닥터스> 제작팀, 강무숙, 강종일, 권대현, 권영항, 김미령, 김선아, 김재영, 박지혜, 서상혁, 유경근, 윤병국, 윤홍준, 이민지, 장웅기, 한재웅[공]지음 | 비타북스 | 2016 | 527.386-8
20107 | ABN000013891 | (또 한권의)벽돌 : 건축가 서현의 난독일기 | 서현 글 | 효형출판 | 2011 | 029.1-32
52871 | ABN000048397 | 내가 정말 좋아하는 건? | 베아트리스 퐁타넬 글 ; 마르크 부타방 그림 ; 김영신 옮김 | 큰북작은북 | 2006 | 아동 863-102
37574 | ABN000032214 | Fairy sugar drop's sleepover | illustrated by Michelle Todd | Igloo | 2012 | 영어 843-1592
78462 | ABN000075070 | 시월의 말. 1 | 콜린 매컬로 지음 ; 강선재...[등]옮김 | 교유서가 | 2017 | 843-1038-1
52719 | ABN000048244 | (세계인의 버킷 리스트 여행지)아이슬란드 링로드 = Iceland ring road tour | 조대현 글·사진 | 다연 | 2015 | 982.3902-3
80191 | ABN000076838 | (5학년)교과서에 나오는 위인들 | 위인전 편찬위원회 편 | 자유토론 | 2010 | 아동 990.8-32-5