Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 104
Missing cells (%): 0.2%
Duplicate rows: 24
Duplicate rows (%): 0.2%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B
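
The percentages and the average record size above follow from the raw counts. The sketch below recomputes them, assuming that the missing-cell share is taken over all cells (rows times columns) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 10_000
missing_cells = 104
duplicate_rows = 24
total_size_kib = 478.5

# Share of missing cells over all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Share of rows flagged as duplicates.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 104 missing cells also equal the sum of the per-variable "Missing" counts reported further down (2 + 1 + 80 + 8 + 13).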

Variable types

Text: 4
Numeric: 1

Dataset

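Description: Provides catalog information (title, author, publisher, etc.) for materials held by the library of 한국형사법무정책연구원 (Korean Institute of Criminology and Justice). For details on individual holdings, please use the institute's electronic library website.
Author: 한국형사법무정책연구원 (Korean Institute of Criminology and Justice)
URL: https://www.data.go.kr/data/3038094/fileData.do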

Alerts

Dataset has 24 (0.2%) duplicate rows (alert type: Duplicates)

Reproduction

Analysis started: 2023-12-12 07:22:35.905501
Analysis finished: 2023-12-12 07:22:39.029885
Duration: 3.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
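
The reported duration can be checked from the two timestamps. This is a minimal sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" section.
started = datetime.fromisoformat("2023-12-12 07:22:35.905501")
finished = datetime.fromisoformat("2023-12-12 07:22:39.029885")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```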

Variables

서명 (title)
Text

Distinct: 8628
Distinct (%): 86.3%
Missing: 2
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 518
Median length: 208
Mean length: 37.239448
Min length: 2
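
The mean length is consistent with the total character count reported for this variable divided by the number of non-missing values. The sketch below assumes that missing values are excluded from the denominator, which is an inference from the numbers rather than something the report states.

```python
# Values copied from the report for the 서명 variable.
total_characters = 372_320  # "Total characters"
n_observations = 10_000     # "Number of observations"
n_missing = 2               # "Missing"

# Mean length over non-missing values only (assumption).
mean_length = total_characters / (n_observations - n_missing)
print(f"Mean length: {mean_length:.6f}")
```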

Characters and Unicode

Total characters: 372320
Distinct characters: 1917
Distinct categories: 16
Distinct scripts: 7
Distinct blocks: 12
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
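
As an illustration of these character properties, the sketch below counts general categories with Python's standard `unicodedata` module. It is not necessarily how the report computes its figures, and the helper name `category_counts` is hypothetical. The two-letter codes map onto the names used in the tables, for example `Ll` is Lowercase Letter, `Lo` is Other Letter, `Zs` is Space Separator, `Ps` is Open Punctuation and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample titles listed for this variable.
title = "(알기쉬운) 수사 형사 실무 :수사기술과 판례 중심으로"
for code, count in category_counts(title).most_common():
    print(code, count)
```

The standard library does not expose script or block properties, so those breakdowns would need a third-party package.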

Unique

Unique: 7982
Unique (%): 79.8%
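
"Distinct" and "Unique" are different counts. The sketch below assumes the usual convention that distinct is the number of different non-missing values and unique is the number of values that occur exactly once; the function name and the toy data are illustrative.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing values (None)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                               # different values
    unique = sum(1 for c in counts.values() if c == 1)   # values seen once
    return distinct, unique


toy = ["a", "b", "b", "c", None]
print(distinct_and_unique(toy))  # (3, 2)
```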

Sample

1st row: 한국사회 폭력문화의 구조화에 관한 연구
2nd row: Social psychology
3rd row: Betubungsmittelgesetz
4th row: (알기쉬운) 수사 형사 실무 :수사기술과 판례 중심으로
5th row: 분단독일의 정치사회학

| Value | Count | Frequency (%) |
|---|---|---|
| and | 1735 | 3.1% |
| the | 1560 | 2.8% |
| of | 1404 | 2.5% |
| in | 987 | 1.8% |
| und | 722 | 1.3% |
| der | 581 | 1.0% |
| 연구 | 555 | 1.0% |
|  | 470 | 0.8% |
| a | 468 | 0.8% |
| 관한 | 424 | 0.8% |
| Other values (16781) | 46920 | 84.0% |

Most occurring characters

ValueCountFrequency (%)
45856
 
12.3%
e 28648
 
7.7%
i 20901
 
5.6%
n 20612
 
5.5%
t 17479
 
4.7%
r 17204
 
4.6%
a 16026
 
4.3%
s 14759
 
4.0%
o 14438
 
3.9%
c 10719
 
2.9%
Other values (1907) 165678
44.5%

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 229424 | 61.6% |
| Other Letter | 65651 | 17.6% |
| Space Separator | 45856 | 12.3% |
| Uppercase Letter | 16627 | 4.5% |
| Decimal Number | 5408 | 1.5% |
| Other Punctuation | 5321 | 1.4% |
| Open Punctuation | 1467 | 0.4% |
| Close Punctuation | 1467 | 0.4% |
| Dash Punctuation | 937 | 0.3% |
| Letter Number | 88 | < 0.1% |
| Other values (6) | 74 | < 0.1% |

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2063
 
3.1%
1674
 
2.5%
1287
 
2.0%
1129
 
1.7%
1068
 
1.6%
1026
 
1.6%
1005
 
1.5%
932
 
1.4%
823
 
1.3%
819
 
1.2%
Other values (1788) 53825
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 28648
12.5%
i 20901
 
9.1%
n 20612
 
9.0%
t 17479
 
7.6%
r 17204
 
7.5%
a 16026
 
7.0%
s 14759
 
6.4%
o 14438
 
6.3%
c 10719
 
4.7%
l 10000
 
4.4%
Other values (18) 58638
25.6%
Uppercase Letter
ValueCountFrequency (%)
S 1975
 
11.9%
A 1239
 
7.5%
T 1193
 
7.2%
C 1148
 
6.9%
P 995
 
6.0%
I 975
 
5.9%
D 950
 
5.7%
E 922
 
5.5%
R 842
 
5.1%
G 755
 
4.5%
Other values (16) 5633
33.9%
Other Punctuation
ValueCountFrequency (%)
: 2535
47.6%
, 1685
31.7%
. 417
 
7.8%
· 176
 
3.3%
' 167
 
3.1%
§ 122
 
2.3%
" 67
 
1.3%
& 59
 
1.1%
/ 51
 
1.0%
; 14
 
0.3%
Other values (7) 28
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 1262
23.3%
2 943
17.4%
0 832
15.4%
9 605
11.2%
8 377
 
7.0%
3 357
 
6.6%
7 288
 
5.3%
4 281
 
5.2%
5 232
 
4.3%
6 231
 
4.3%
Letter Number
ValueCountFrequency (%)
28
31.8%
27
30.7%
14
15.9%
9
 
10.2%
4
 
4.5%
2
 
2.3%
2
 
2.3%
1
 
1.1%
1
 
1.1%
Math Symbol
ValueCountFrequency (%)
~ 11
28.2%
< 7
17.9%
> 7
17.9%
+ 6
15.4%
= 6
15.4%
1
 
2.6%
1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 1408
96.0%
[ 45
 
3.1%
9
 
0.6%
2
 
0.1%
2
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1407
95.9%
] 45
 
3.1%
10
 
0.7%
2
 
0.1%
2
 
0.1%
1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 923
98.5%
14
 
1.5%
Modifier Symbol
ValueCountFrequency (%)
˙ 13
92.9%
` 1
 
7.1%
Final Punctuation
ValueCountFrequency (%)
12
92.3%
1
 
7.7%
Space Separator
ValueCountFrequency (%)
45856
100.0%
Format
ValueCountFrequency (%)
­ 5
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 246133 | 66.1% |
| Common | 60530 | 16.3% |
| Hangul | 49810 | 13.4% |
| Han | 13793 | 3.7% |
| Hiragana | 1072 | 0.3% |
| Katakana | 976 | 0.3% |
| Greek | 6 | < 0.1% |

Most frequent character per script

Han
ValueCountFrequency (%)
932
 
6.8%
444
 
3.2%
330
 
2.4%
290
 
2.1%
287
 
2.1%
224
 
1.6%
218
 
1.6%
201
 
1.5%
169
 
1.2%
169
 
1.2%
Other values (947) 10529
76.3%
Hangul
ValueCountFrequency (%)
2063
 
4.1%
1674
 
3.4%
1287
 
2.6%
1129
 
2.3%
1068
 
2.1%
1026
 
2.1%
1005
 
2.0%
823
 
1.7%
819
 
1.6%
806
 
1.6%
Other values (707) 38110
76.5%
Katakana
ValueCountFrequency (%)
85
 
8.7%
62
 
6.4%
49
 
5.0%
41
 
4.2%
39
 
4.0%
39
 
4.0%
38
 
3.9%
35
 
3.6%
33
 
3.4%
32
 
3.3%
Other values (61) 523
53.6%
Latin
ValueCountFrequency (%)
e 28648
 
11.6%
i 20901
 
8.5%
n 20612
 
8.4%
t 17479
 
7.1%
r 17204
 
7.0%
a 16026
 
6.5%
s 14759
 
6.0%
o 14438
 
5.9%
c 10719
 
4.4%
l 10000
 
4.1%
Other values (52) 75347
30.6%
Common
ValueCountFrequency (%)
45856
75.8%
: 2535
 
4.2%
, 1685
 
2.8%
( 1408
 
2.3%
) 1407
 
2.3%
1 1262
 
2.1%
2 943
 
1.6%
- 923
 
1.5%
0 832
 
1.4%
9 605
 
1.0%
Other values (46) 3074
 
5.1%
Hiragana
ValueCountFrequency (%)
338
31.5%
181
16.9%
72
 
6.7%
53
 
4.9%
34
 
3.2%
33
 
3.1%
27
 
2.5%
23
 
2.1%
21
 
2.0%
19
 
1.8%
Other values (43) 271
25.3%
Greek
ValueCountFrequency (%)
β 6
100.0%

Most occurring blocks

| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 306052 | 82.2% |
| Hangul | 49784 | 13.4% |
| CJK | 13527 | 3.6% |
| Hiragana | 1072 | 0.3% |
| Katakana | 976 | 0.3% |
| None | 485 | 0.1% |
| CJK Compat Ideographs | 266 | 0.1% |
| Number Forms | 88 | < 0.1% |
| Punctuation | 29 | < 0.1% |
| Compat Jamo | 26 | < 0.1% |
| Other values (2) | 15 | < 0.1% |

Most frequent character per block

ASCII
ValueCountFrequency (%)
45856
15.0%
e 28648
 
9.4%
i 20901
 
6.8%
n 20612
 
6.7%
t 17479
 
5.7%
r 17204
 
5.6%
a 16026
 
5.2%
s 14759
 
4.8%
o 14438
 
4.7%
c 10719
 
3.5%
Other values (77) 99410
32.5%
Hangul
ValueCountFrequency (%)
2063
 
4.1%
1674
 
3.4%
1287
 
2.6%
1129
 
2.3%
1068
 
2.1%
1026
 
2.1%
1005
 
2.0%
823
 
1.7%
819
 
1.6%
806
 
1.6%
Other values (703) 38084
76.5%
CJK
ValueCountFrequency (%)
932
 
6.9%
444
 
3.3%
330
 
2.4%
290
 
2.1%
287
 
2.1%
224
 
1.7%
218
 
1.6%
201
 
1.5%
169
 
1.2%
169
 
1.2%
Other values (907) 10263
75.9%
Hiragana
ValueCountFrequency (%)
338
31.5%
181
16.9%
72
 
6.7%
53
 
4.9%
34
 
3.2%
33
 
3.1%
27
 
2.5%
23
 
2.1%
21
 
2.0%
19
 
1.8%
Other values (43) 271
25.3%
None
ValueCountFrequency (%)
· 176
36.3%
ß 136
28.0%
§ 122
25.2%
10
 
2.1%
9
 
1.9%
9
 
1.9%
β 6
 
1.2%
­ 5
 
1.0%
2
 
0.4%
2
 
0.4%
Other values (6) 8
 
1.6%
Katakana
ValueCountFrequency (%)
85
 
8.7%
62
 
6.4%
49
 
5.0%
41
 
4.2%
39
 
4.0%
39
 
4.0%
38
 
3.9%
35
 
3.6%
33
 
3.4%
32
 
3.3%
Other values (61) 523
53.6%
CJK Compat Ideographs
ValueCountFrequency (%)
73
27.4%
54
20.3%
23
 
8.6%
14
 
5.3%
13
 
4.9%
11
 
4.1%
10
 
3.8%
8
 
3.0%
6
 
2.3%
5
 
1.9%
Other values (30) 49
18.4%
Number Forms
ValueCountFrequency (%)
28
31.8%
27
30.7%
14
15.9%
9
 
10.2%
4
 
4.5%
2
 
2.3%
2
 
2.3%
1
 
1.1%
1
 
1.1%
Compat Jamo
ValueCountFrequency (%)
22
84.6%
2
 
7.7%
1
 
3.8%
1
 
3.8%
Punctuation
ValueCountFrequency (%)
14
48.3%
12
41.4%
2
 
6.9%
1
 
3.4%
Modifier Letters
ValueCountFrequency (%)
˙ 13
100.0%
Math Operators
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자 (author)
Text

Distinct: 6514
Distinct (%): 65.1%
Missing: 1
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 128
Median length: 93
Mean length: 12.428643
Min length: 1

Characters and Unicode

Total characters: 124274
Distinct characters: 1261
Distinct categories: 13
Distinct scripts: 7
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5775
Unique (%): 57.8%

Sample

1st row: 한국형사정책연구원
2nd row: Kenneth J., Gergen
3rd row: Harald Hans, Krner
4th row: 서주연
5th row: 다렌돌프,랄프

| Value | Count | Frequency (%) |
|---|---|---|
| 한국형사정책연구원 | 1220 | 6.1% |
| of | 173 | 0.9% |
| j | 166 | 0.8% |
| 법무부 | 157 | 0.8% |
| a | 148 | 0.7% |
| s | 146 | 0.7% |
| 대검찰청 | 139 | 0.7% |
| r | 131 | 0.7% |
| institute | 124 | 0.6% |
| justice | 120 | 0.6% |
| Other values (8088) | 17340 | 87.3% |

Most occurring characters

ValueCountFrequency (%)
9865
 
7.9%
e 8210
 
6.6%
a 6288
 
5.1%
r 6116
 
4.9%
n 6050
 
4.9%
i 5179
 
4.2%
, 5020
 
4.0%
o 4131
 
3.3%
t 3946
 
3.2%
l 3807
 
3.1%
Other values (1251) 65662
52.8%

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 64132 | 51.6% |
| Other Letter | 29783 | 24.0% |
| Uppercase Letter | 13958 | 11.2% |
| Space Separator | 9867 | 7.9% |
| Other Punctuation | 6149 | 4.9% |
| Dash Punctuation | 261 | 0.2% |
| Close Punctuation | 51 | < 0.1% |
| Open Punctuation | 50 | < 0.1% |
| Decimal Number | 15 | < 0.1% |
| Final Punctuation | 4 | < 0.1% |
| Other values (3) | 4 | < 0.1% |

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1639
 
5.5%
1593
 
5.3%
1582
 
5.3%
1567
 
5.3%
1554
 
5.2%
1506
 
5.1%
1497
 
5.0%
1353
 
4.5%
1259
 
4.2%
546
 
1.8%
Other values (1169) 15687
52.7%
Lowercase Letter
ValueCountFrequency (%)
e 8210
12.8%
a 6288
9.8%
r 6116
9.5%
n 6050
9.4%
i 5179
 
8.1%
o 4131
 
6.4%
t 3946
 
6.2%
l 3807
 
5.9%
s 3423
 
5.3%
h 2620
 
4.1%
Other values (19) 14362
22.4%
Uppercase Letter
ValueCountFrequency (%)
S 1087
 
7.8%
J 1025
 
7.3%
M 960
 
6.9%
R 901
 
6.5%
C 858
 
6.1%
H 846
 
6.1%
B 831
 
6.0%
A 806
 
5.8%
D 722
 
5.2%
L 646
 
4.6%
Other values (16) 5276
37.8%
Other Punctuation
ValueCountFrequency (%)
, 5020
81.6%
. 1047
 
17.0%
' 28
 
0.5%
· 21
 
0.3%
/ 16
 
0.3%
& 10
 
0.2%
; 5
 
0.1%
" 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
6 4
26.7%
3 4
26.7%
2 3
20.0%
0 2
13.3%
7 1
 
6.7%
1 1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 35
68.6%
] 15
29.4%
1
 
2.0%
Space Separator
ValueCountFrequency (%)
9865
> 99.9%
  2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 35
70.0%
[ 15
30.0%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 261
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 78089 | 62.8% |
| Hangul | 28030 | 22.6% |
| Common | 16401 | 13.2% |
| Han | 1525 | 1.2% |
| Katakana | 210 | 0.2% |
| Hiragana | 18 | < 0.1% |
| Greek | 1 | < 0.1% |

Most frequent character per script

Hangul
ValueCountFrequency (%)
1639
 
5.8%
1593
 
5.7%
1582
 
5.6%
1567
 
5.6%
1554
 
5.5%
1506
 
5.4%
1497
 
5.3%
1353
 
4.8%
1259
 
4.5%
546
 
1.9%
Other values (588) 13934
49.7%
Han
ValueCountFrequency (%)
38
 
2.5%
33
 
2.2%
31
 
2.0%
31
 
2.0%
22
 
1.4%
18
 
1.2%
17
 
1.1%
17
 
1.1%
15
 
1.0%
15
 
1.0%
Other values (500) 1288
84.5%
Katakana
ValueCountFrequency (%)
20
 
9.5%
13
 
6.2%
10
 
4.8%
9
 
4.3%
8
 
3.8%
7
 
3.3%
7
 
3.3%
7
 
3.3%
6
 
2.9%
6
 
2.9%
Other values (47) 117
55.7%
Latin
ValueCountFrequency (%)
e 8210
 
10.5%
a 6288
 
8.1%
r 6116
 
7.8%
n 6050
 
7.7%
i 5179
 
6.6%
o 4131
 
5.3%
t 3946
 
5.1%
l 3807
 
4.9%
s 3423
 
4.4%
h 2620
 
3.4%
Other values (44) 28319
36.3%
Common
ValueCountFrequency (%)
9865
60.1%
, 5020
30.6%
. 1047
 
6.4%
- 261
 
1.6%
( 35
 
0.2%
) 35
 
0.2%
' 28
 
0.2%
· 21
 
0.1%
/ 16
 
0.1%
] 15
 
0.1%
Other values (17) 58
 
0.4%
Hiragana
ValueCountFrequency (%)
2
11.1%
2
11.1%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%
Greek
ValueCountFrequency (%)
β 1
100.0%

Most occurring blocks

| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 94448 | 76.0% |
| Hangul | 28030 | 22.6% |
| CJK | 1513 | 1.2% |
| Katakana | 210 | 0.2% |
| None | 38 | < 0.1% |
| Hiragana | 18 | < 0.1% |
| CJK Compat Ideographs | 12 | < 0.1% |
| Punctuation | 5 | < 0.1% |

Most frequent character per block

ASCII
ValueCountFrequency (%)
9865
 
10.4%
e 8210
 
8.7%
a 6288
 
6.7%
r 6116
 
6.5%
n 6050
 
6.4%
i 5179
 
5.5%
, 5020
 
5.3%
o 4131
 
4.4%
t 3946
 
4.2%
l 3807
 
4.0%
Other values (64) 35836
37.9%
Hangul
ValueCountFrequency (%)
1639
 
5.8%
1593
 
5.7%
1582
 
5.6%
1567
 
5.6%
1554
 
5.5%
1506
 
5.4%
1497
 
5.3%
1353
 
4.8%
1259
 
4.5%
546
 
1.9%
Other values (588) 13934
49.7%
CJK
ValueCountFrequency (%)
38
 
2.5%
33
 
2.2%
31
 
2.0%
31
 
2.0%
22
 
1.5%
18
 
1.2%
17
 
1.1%
17
 
1.1%
15
 
1.0%
15
 
1.0%
Other values (492) 1276
84.3%
None
ValueCountFrequency (%)
· 21
55.3%
ß 11
28.9%
  2
 
5.3%
ʼn 2
 
5.3%
1
 
2.6%
β 1
 
2.6%
Katakana
ValueCountFrequency (%)
20
 
9.5%
13
 
6.2%
10
 
4.8%
9
 
4.3%
8
 
3.8%
7
 
3.3%
7
 
3.3%
7
 
3.3%
6
 
2.9%
6
 
2.9%
Other values (47) 117
55.7%
Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
25.0%
2
16.7%
2
16.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Hiragana
ValueCountFrequency (%)
2
11.1%
2
11.1%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%

출판사 (publisher)
Text

Distinct: 2687
Distinct (%): 27.1%
Missing: 80
Missing (%): 0.8%
Memory size: 156.2 KiB

Length

Max length: 100
Median length: 75
Mean length: 13.089415
Min length: 1

Characters and Unicode

Total characters: 129847
Distinct characters: 958
Distinct categories: 12
Distinct scripts: 6
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1722
Unique (%): 17.4%

Sample

1st row: 한국형사정책연구원
2nd row: Springer-Verlag
3rd row: C.H.Beck'sche Verlagsbuchhandlung
4th row: 국문사
5th row: 한길사

| Value | Count | Frequency (%) |
|---|---|---|
| 한국형사정책연구원 | 1279 | 6.7% |
| press | 985 | 5.2% |
| university | 565 | 3.0% |
| verlag | 415 | 2.2% |
|  | 412 | 2.2% |
| of | 318 | 1.7% |
| information | 305 | 1.6% |
| dissertation | 302 | 1.6% |
| umi | 302 | 1.6% |
| services | 289 | 1.5% |
| Other values (2649) | 13890 | 72.9% |

Most occurring characters

ValueCountFrequency (%)
9147
 
7.0%
e 8941
 
6.9%
r 7037
 
5.4%
s 6494
 
5.0%
i 6444
 
5.0%
n 6117
 
4.7%
a 5313
 
4.1%
t 5157
 
4.0%
o 4547
 
3.5%
l 3952
 
3.0%
Other values (948) 66698
51.4%

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 74306 | 57.2% |
| Other Letter | 29441 | 22.7% |
| Uppercase Letter | 14751 | 11.4% |
| Space Separator | 9147 | 7.0% |
| Other Punctuation | 1755 | 1.4% |
| Dash Punctuation | 325 | 0.3% |
| Decimal Number | 49 | < 0.1% |
| Close Punctuation | 33 | < 0.1% |
| Open Punctuation | 33 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
| Other values (2) | 4 | < 0.1% |

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1945
 
6.6%
1613
 
5.5%
1594
 
5.4%
1545
 
5.2%
1460
 
5.0%
1446
 
4.9%
1381
 
4.7%
1336
 
4.5%
1328
 
4.5%
553
 
1.9%
Other values (866) 15240
51.8%
Lowercase Letter
ValueCountFrequency (%)
e 8941
12.0%
r 7037
9.5%
s 6494
 
8.7%
i 6444
 
8.7%
n 6117
 
8.2%
a 5313
 
7.2%
t 5157
 
6.9%
o 4547
 
6.1%
l 3952
 
5.3%
c 2810
 
3.8%
Other values (17) 17494
23.5%
Uppercase Letter
ValueCountFrequency (%)
P 1984
13.4%
C 1205
 
8.2%
U 1194
 
8.1%
I 1187
 
8.0%
S 1174
 
8.0%
M 810
 
5.5%
H 784
 
5.3%
D 743
 
5.0%
B 683
 
4.6%
V 629
 
4.3%
Other values (16) 4358
29.5%
Decimal Number
ValueCountFrequency (%)
1 18
36.7%
9 10
20.4%
2 8
16.3%
6 3
 
6.1%
8 3
 
6.1%
4 2
 
4.1%
5 2
 
4.1%
3 1
 
2.0%
0 1
 
2.0%
7 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
. 948
54.0%
& 353
 
20.1%
, 220
 
12.5%
' 171
 
9.7%
/ 41
 
2.3%
· 10
 
0.6%
5
 
0.3%
; 4
 
0.2%
: 3
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 323
99.4%
1
 
0.3%
1
 
0.3%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
9147
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 89057 | 68.6% |
| Hangul | 21641 | 16.7% |
| Common | 11349 | 8.7% |
| Han | 7636 | 5.9% |
| Katakana | 105 | 0.1% |
| Hiragana | 59 | < 0.1% |

Most frequent character per script

Han
ValueCountFrequency (%)
553
 
7.2%
467
 
6.1%
293
 
3.8%
291
 
3.8%
242
 
3.2%
213
 
2.8%
211
 
2.8%
211
 
2.8%
180
 
2.4%
151
 
2.0%
Other values (416) 4824
63.2%
Hangul
ValueCountFrequency (%)
1945
 
9.0%
1613
 
7.5%
1594
 
7.4%
1545
 
7.1%
1460
 
6.7%
1446
 
6.7%
1381
 
6.4%
1336
 
6.2%
1328
 
6.1%
320
 
1.5%
Other values (391) 7673
35.5%
Latin
ValueCountFrequency (%)
e 8941
 
10.0%
r 7037
 
7.9%
s 6494
 
7.3%
i 6444
 
7.2%
n 6117
 
6.9%
a 5313
 
6.0%
t 5157
 
5.8%
o 4547
 
5.1%
l 3952
 
4.4%
c 2810
 
3.2%
Other values (43) 32245
36.2%
Katakana
ValueCountFrequency (%)
8
 
7.6%
7
 
6.7%
7
 
6.7%
7
 
6.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
5
 
4.8%
5
 
4.8%
Other values (26) 42
40.0%
Common
ValueCountFrequency (%)
9147
80.6%
. 948
 
8.4%
& 353
 
3.1%
- 323
 
2.8%
, 220
 
1.9%
' 171
 
1.5%
/ 41
 
0.4%
) 33
 
0.3%
( 33
 
0.3%
1 18
 
0.2%
Other values (19) 62
 
0.5%
Hiragana
ValueCountFrequency (%)
11
18.6%
10
16.9%
10
16.9%
10
16.9%
7
11.9%
3
 
5.1%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (3) 3
 
5.1%

Most occurring blocks

| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 100384 | 77.3% |
| Hangul | 21636 | 16.7% |
| CJK | 7591 | 5.8% |
| Katakana | 105 | 0.1% |
| Hiragana | 59 | < 0.1% |
| CJK Compat Ideographs | 45 | < 0.1% |
| None | 17 | < 0.1% |
| Compat Jamo | 5 | < 0.1% |
| Punctuation | 5 | < 0.1% |

Most frequent character per block

ASCII
ValueCountFrequency (%)
9147
 
9.1%
e 8941
 
8.9%
r 7037
 
7.0%
s 6494
 
6.5%
i 6444
 
6.4%
n 6117
 
6.1%
a 5313
 
5.3%
t 5157
 
5.1%
o 4547
 
4.5%
l 3952
 
3.9%
Other values (64) 37235
37.1%
Hangul
ValueCountFrequency (%)
1945
 
9.0%
1613
 
7.5%
1594
 
7.4%
1545
 
7.1%
1460
 
6.7%
1446
 
6.7%
1381
 
6.4%
1336
 
6.2%
1328
 
6.1%
320
 
1.5%
Other values (388) 7668
35.4%
CJK
ValueCountFrequency (%)
553
 
7.3%
467
 
6.2%
293
 
3.9%
291
 
3.8%
242
 
3.2%
213
 
2.8%
211
 
2.8%
211
 
2.8%
180
 
2.4%
151
 
2.0%
Other values (401) 4779
63.0%
CJK Compat Ideographs
ValueCountFrequency (%)
12
26.7%
6
13.3%
5
11.1%
5
11.1%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
Other values (5) 5
11.1%
Hiragana
ValueCountFrequency (%)
11
18.6%
10
16.9%
10
16.9%
10
16.9%
7
11.9%
3
 
5.1%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (3) 3
 
5.1%
None
ValueCountFrequency (%)
· 10
58.8%
5
29.4%
ß 1
 
5.9%
1
 
5.9%
Katakana
ValueCountFrequency (%)
8
 
7.6%
7
 
6.7%
7
 
6.7%
7
 
6.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
5
 
4.8%
5
 
4.8%
Other values (26) 42
40.0%
Compat Jamo
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Punctuation
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

출판년 (publication year)
Real number (ℝ)

Distinct: 116
Distinct (%): 1.2%
Missing: 8
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 1997.2622
Minimum: 1881
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1881
5-th percentile: 1974
Q1: 1987
Median: 1999
Q3: 2010
95-th percentile: 2018
Maximum: 2022
Range: 141
Interquartile range (IQR): 23

Descriptive statistics

Standard deviation: 16.163441
Coefficient of variation (CV): 0.0080927988
Kurtosis: 5.060253
Mean: 1997.2622
Median Absolute Deviation (MAD): 11
Skewness: -1.4084062
Sum: 19956644
Variance: 261.25683
Monotonicity: Not monotonic
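
Several of these figures are derived from one another. The sketch below recomputes them from the reported values, assuming the standard definitions (CV as standard deviation over mean, variance as the squared standard deviation, IQR as Q3 minus Q1) and assuming the mean is taken over the 9,992 non-missing values; the variable names are illustrative.

```python
import math

# Values copied from the report for the 출판년 variable.
mean = 1997.2622
std = 16.163441
q1, q3 = 1987, 2010
minimum, maximum = 1881, 2022
total = 19_956_644        # "Sum"
n_valid = 10_000 - 8      # observations minus missing values

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance from standard deviation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
mean_from_sum = total / n_valid   # mean recomputed from the sum

print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.5f}")
print(f"IQR: {iqr}, Range: {value_range}")
print(f"Mean from sum: {mean_from_sum:.4f}")
```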
Histogram with fixed size bins (bins=50)

Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 1988 | 462 | 4.6% |
| 1989 | 413 | 4.1% |
| 1986 | 346 | 3.5% |
| 1987 | 312 | 3.1% |
| 2014 | 289 | 2.9% |
| 2007 | 283 | 2.8% |
| 2011 | 272 | 2.7% |
| 2006 | 264 | 2.6% |
| 2003 | 264 | 2.6% |
| 2010 | 261 | 2.6% |
| Other values (106) | 6826 | 68.3% |

Minimum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 1881 | 1 | < 0.1% |
| 1895 | 2 | < 0.1% |
| 1896 | 1 | < 0.1% |
| 1897 | 1 | < 0.1% |
| 1898 | 1 | < 0.1% |
| 1899 | 2 | < 0.1% |
| 1901 | 2 | < 0.1% |
| 1902 | 1 | < 0.1% |
| 1903 | 2 | < 0.1% |
| 1904 | 1 | < 0.1% |

Maximum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 2022 | 20 | 0.2% |
| 2021 | 198 | 2.0% |
| 2020 | 158 | 1.6% |
| 2019 | 122 | 1.2% |
| 2018 | 125 | 1.2% |
| 2017 | 217 | 2.2% |
| 2016 | 201 | 2.0% |
| 2015 | 221 | 2.2% |
| 2014 | 289 | 2.9% |
| 2013 | 226 | 2.3% |

청구기호 (call number)
Text

Distinct: 9884
Distinct (%): 99.0%
Missing: 13
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 30
Median length: 26
Mean length: 13.575448
Min length: 1

Characters and Unicode

Total characters: 135578
Distinct characters: 529
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9804
Unique (%): 98.2%

Sample

1st row: KIC 08-13 3
2nd row: 302 .G367S
3rd row: 344.0440263 .K84B
4th row: 363.25 서77수
5th row: 306.20943 .다294분

| Value | Count | Frequency (%) |
|---|---|---|
| v | 1336 | 5.1% |
| kic | 1297 | 5.0% |
| 2 | 796 | 3.1% |
| 1 | 526 | 2.0% |
| c | 461 | 1.8% |
| 345.0074 | 288 | 1.1% |
| 3 | 268 | 1.0% |
| 345 | 235 | 0.9% |
| e | 231 | 0.9% |
| 364 | 181 | 0.7% |
| Other values (10432) | 20380 | 78.4% |

Most occurring characters

ValueCountFrequency (%)
16351
12.1%
3 12299
 
9.1%
. 11743
 
8.7%
4 9864
 
7.3%
0 8914
 
6.6%
1 8474
 
6.3%
2 8448
 
6.2%
6 7219
 
5.3%
5 6982
 
5.1%
9 5088
 
3.8%
Other values (519) 40196
29.6%

Most occurring categories

| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 76284 | 56.3% |
| Uppercase Letter | 17108 | 12.6% |
| Space Separator | 16351 | 12.1% |
| Other Punctuation | 12199 | 9.0% |
| Other Letter | 9529 | 7.0% |
| Lowercase Letter | 2336 | 1.7% |
| Dash Punctuation | 1751 | 1.3% |
| Letter Number | 6 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Other values (2) | 4 | < 0.1% |

Most frequent character per category

Other Letter
ValueCountFrequency (%)
702
 
7.4%
461
 
4.8%
410
 
4.3%
295
 
3.1%
261
 
2.7%
256
 
2.7%
233
 
2.4%
214
 
2.2%
186
 
2.0%
158
 
1.7%
Other values (451) 6353
66.7%
Uppercase Letter
ValueCountFrequency (%)
C 2278
13.3%
I 2108
12.3%
K 1821
 
10.6%
S 1525
 
8.9%
E 962
 
5.6%
M 897
 
5.2%
B 835
 
4.9%
A 816
 
4.8%
U 634
 
3.7%
P 607
 
3.5%
Other values (16) 4625
27.0%
Lowercase Letter
ValueCountFrequency (%)
v 1548
66.3%
c 706
30.2%
a 17
 
0.7%
d 16
 
0.7%
i 13
 
0.6%
l 12
 
0.5%
y 5
 
0.2%
o 5
 
0.2%
r 3
 
0.1%
t 2
 
0.1%
Other values (7) 9
 
0.4%
Decimal Number
ValueCountFrequency (%)
3 12299
16.1%
4 9864
12.9%
0 8914
11.7%
1 8474
11.1%
2 8448
11.1%
6 7219
9.5%
5 6982
9.2%
9 5088
6.7%
7 5022
6.6%
8 3974
 
5.2%
Other Punctuation
ValueCountFrequency (%)
. 11743
96.3%
/ 451
 
3.7%
' 3
 
< 0.1%
, 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Other Symbol
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Space Separator
ValueCountFrequency (%)
16351
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1751
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

| Value | Count | Frequency (%) |
|---|---|---|
| Common | 106599 | 78.6% |
| Latin | 19450 | 14.3% |
| Hangul | 9529 | 7.0% |

Most frequent character per script

Hangul
ValueCountFrequency (%)
702
 
7.4%
461
 
4.8%
410
 
4.3%
295
 
3.1%
261
 
2.7%
256
 
2.7%
233
 
2.4%
214
 
2.2%
186
 
2.0%
158
 
1.7%
Other values (451) 6353
66.7%
Latin
ValueCountFrequency (%)
C 2278
 
11.7%
I 2108
 
10.8%
K 1821
 
9.4%
v 1548
 
8.0%
S 1525
 
7.8%
E 962
 
4.9%
M 897
 
4.6%
B 835
 
4.3%
A 816
 
4.2%
c 706
 
3.6%
Other values (36) 5954
30.6%
Common
ValueCountFrequency (%)
16351
15.3%
3 12299
11.5%
. 11743
11.0%
4 9864
9.3%
0 8914
8.4%
1 8474
7.9%
2 8448
7.9%
6 7219
6.8%
5 6982
6.5%
9 5088
 
4.8%
Other values (12) 11217
10.5%

Most occurring blocks

| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 126040 | 93.0% |
| Hangul | 9528 | 7.0% |
| Number Forms | 6 | < 0.1% |
| Geometric Shapes | 2 | < 0.1% |
| Misc Symbols | 1 | < 0.1% |
| Compat Jamo | 1 | < 0.1% |

Most frequent character per block

ASCII
ValueCountFrequency (%)
16351
13.0%
3 12299
9.8%
. 11743
 
9.3%
4 9864
 
7.8%
0 8914
 
7.1%
1 8474
 
6.7%
2 8448
 
6.7%
6 7219
 
5.7%
5 6982
 
5.5%
9 5088
 
4.0%
Other values (52) 30658
24.3%
Hangul
ValueCountFrequency (%)
702
 
7.4%
461
 
4.8%
410
 
4.3%
295
 
3.1%
261
 
2.7%
256
 
2.7%
233
 
2.4%
214
 
2.2%
186
 
2.0%
158
 
1.7%
Other values (450) 6352
66.7%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
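
Nullity correlation can be sketched as the Pearson correlation between the presence indicators of two columns (1.0 when a value is present, 0.0 when it is missing). The function below is an illustrative standard-library implementation, not the report's own code, and the toy columns are hypothetical.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A column that is always present (or always missing) has no variance.
        return float("nan")
    return cov / sqrt(var_a * var_b)


# Hypothetical toy columns: both are missing in exactly the same rows.
publisher = ["A", None, "B", None]
year = [1990, None, 2001, None]
print(nullity_correlation(publisher, year))  # 1.0
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one tends to be present when the other is missing, and columns with no missing values have no defined correlation.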

Sample

First rows

| Index | 서명 | 저자 | 출판사 | 출판년 | 청구기호 |
|---|---|---|---|---|---|
| 29353 | 한국사회 폭력문화의 구조화에 관한 연구 | 한국형사정책연구원 | 한국형사정책연구원 | 2008 | KIC 08-13 3 |
| 10599 | Social psychology | Kenneth J., Gergen | Springer-Verlag | 1986 | 302 .G367S |
| 11619 | Betubungsmittelgesetz | Harald Hans, Krner | C.H.Beck'sche Verlagsbuchhandlung | 1990 | 344.0440263 .K84B |
| 27580 | (알기쉬운) 수사 형사 실무 :수사기술과 판례 중심으로 | 서주연 | 국문사 | 2006 | 363.25 서77수 |
| 628 | 분단독일의 정치사회학 | 다렌돌프,랄프 | 한길사 | 1986 | 306.20943 .다294분 |
| 24101 | 사이버로펌에 대한 합리적인 규제 및 입법방안 | 한국형사정책연구원 | 한국형사정책연구원 | 2003 | KIC 03-41 c. 2 |
| 35567 | Die Nebenfolge im System strafrechtlicher Sanktionen Eine Untersuchung zur Dogmatik der Nebenfolge sowie zur Einordnung von Normen als Nebenfolge | Sebastian Sobota | Duncker & Humblot | 2015 | 345.430773 S443N |
| 1496 | 民衆時代의 論理 | 노명식 | 展望社 | 1981 | 305.56 .노34민 |
| 6835 | Betrug als Wirtschaftsdelikt Eine dogmatischempirische Untersuchung anhand einer Aktenanalyse von 1696 Betrugsverfahren in der Bundesrepublik Deutschland aus den Jahren 1974-79 | Leßner, Johanna | Centaurus-Verlagsgesellschaft | 1984 | 364.168 L639B |
| 5465 | Richter und Staatsanwalt im Dienst fr den Brger Die Vortrge und Referate des Deutschen Richtertages 1987 in Hamburg | Deutscher, Richterbund | Carl Heymanns Verlag KG | 1988 | 345.05 .R535R |

Last rows

| Index | 서명 | 저자 | 출판사 | 출판년 | 청구기호 |
|---|---|---|---|---|---|
| 6209 | Komplikationsdichte arztlicher Eingriffe | Wolfgang,, Mattig | Gustav Fischer Verlag | 1983 | 617.01 .M444K |
| 2270 | 基礎社會學 | 안전삼랑 | 東洋經濟新報社 | 1988 | 301 .안74기 v. 3 1988 |
| 24497 | 국가마약퇴치전략과 소년형사정책 | 한국형사정책연구원 | 한국형사정책연구원 | 2004 | KIC 04-44 2 |
| 1915 | (The) Oxford English dictionary | J. A., Simpson | Clarendon Press | 1989 | 423 .S613O v. 20 |
| 10822 | Soviet and East European transport problems | John,, Ambler | St. Martin's Press | 1985 | 388.947 .A493S |
| 35563 | Europisches Strafrecht | Bernd Hecker | Springer Verlag | 2015 | 345.24 B524E E.5 |
| 26256 | 식품안전, 소비자의 마음에 답이 있다 :전 총리실 전문위원이 제안하는 식품안전진단서 | 곽노성 | (주)에세이 | 2008 | 363.192 곽25식 |
| 32226 | Being Realistic about Reasons | Scanlon, T. M | Oxford University | 2014 | 170 S283B |
| 34014 | 한·중 자유무역협정에 따른 형사정책 대응전략 연구 | 한국형사정책연구원 | 한국형사정책연구원 | 2012 | KIC 12-T16 |
| 2538 | Constitutional law, the American constitution, constitutional rights and liberties | William B., Lockhart | West Publishing Co | 1987 | 342.085 .L816C |

Duplicate rows

Most frequently occurring

| Index | 서명 | 저자 | 출판사 | 출판년 | 청구기호 | # duplicates |
|---|---|---|---|---|---|---|
| 8 | Statistiques criminelles internationales | International criminal police organization, [ed.] | Interpol | 1959 | 364.021 .I61S | 6 |
| 2 | Hong Kong Correctional services | The Commissioner of Correctional Service | The Commissioner of Correctional Services | 1990 | CCS .C824S | 5 |
| 5 | Materialien zum Bericht der Kommission zur Auswertung der Erfahrungen mit dem reformierten §218 StGB I-III | Der Bundesminister fr Jugend, Familie und Gesundheit Postfach | W. Kohlhammer | 1981 | 362.074 .K79S v. 92. | 3 |
| 14 | 現代搜査叢書 | 제일가제법령출판사 | 第一加除法令出版社 | 1987 | 363.25074 .제68현 v. II- | 3 |
| 17 | 검사의 기소재량에 관한 연구 | 한국형사정책연구원 | 한국형사정책연구원 | 1993 | KIC 92-16 | 3 |
| 0 | Comparative social research | Richard F., Tomasson | JAI Press Inc | 1980 | 301.05 T655C c. 1 | 2 |
| 1 | Epidemiologic trends in drug abuse :proceedings community epidemiology work group | National Institute on Drug Abuse | U. S. Department of Health and Human Services | 1989 | NIDA .E64 | 2 |
| 3 | In the Shadow of Justice :Postwar Liberalism and the Remaking of Political Philosophy | Katrina Forrester | Princeton University Press | 2019 | 320.011 K19I | 2 |
| 4 | Integration through law :Europe and the American federal experience | Mauro,, Cappelletti | Walter de Gruyter | 1985 | 340.2 .C247I v. 1. | 2 |
| 6 | Quellen zur Reform des Straf und Strafprozeßrechts | Schubert, Werner | Walter de Gruyter & Co | 1988 | 345.0074 .S384Q v. 2.2. | 2 |
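
A table like this one can be produced by grouping identical rows and keeping the groups that occur more than once. The sketch below uses only the standard library and hypothetical toy rows. It also computes the number of surplus rows (copies beyond the first in each group); whether the headline figure of 24 duplicate rows is counted this way is an assumption, not something the report states.

```python
from collections import Counter


def duplicate_summary(rows):
    """Return ({row: occurrences} for repeated rows, number of surplus rows)."""
    counts = Counter(tuple(row) for row in rows)
    groups = {row: n for row, n in counts.items() if n > 1}
    surplus = sum(n - 1 for n in groups.values())
    return groups, surplus


# Hypothetical toy records: (title, author, year).
rows = [
    ("Title A", "Author X", 1990),
    ("Title A", "Author X", 1990),
    ("Title A", "Author X", 1990),
    ("Title B", "Author Y", 2001),
    ("Title C", "Author Z", 2010),
    ("Title C", "Author Z", 2010),
]
groups, surplus = duplicate_summary(rows)
for row, n in sorted(groups.items(), key=lambda item: -item[1]):
    print(n, row)
print("surplus rows:", surplus)
```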