Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells6
Missing cells (%)< 0.1%
Duplicate rows39
Duplicate rows (%)0.4%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Text4

Dataset

Description경북학생문화회관 종합정보자료실에 소장 중인 도서 목록
Author경상북도교육청 경상북도교육청문화원
URLhttps://www.data.go.kr/data/3077714/fileData.do

Alerts

Dataset has 39 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 03:55:51.545968
Analysis finished2023-12-12 03:55:54.264471
Duration2.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

서명
Text

Distinct9839
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:55:54.666219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length190
Median length85
Mean length18.7888
Min length1

Characters and Unicode

Total characters187888
Distinct characters1551
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9738 ?
Unique (%)97.4%

Sample

1st row(잠수네) 프리스쿨 영어공부법 : 엄마와 아이가 모두 행복한 5세·6세·7세 로드맵
2nd row당신이 옳다 : 큰글자도서
3rd row내 거랑 바꿀래
4th row구리와 구라의 소풍
5th row고사성어 백과사전
ValueCountFrequency (%)
2973
 
6.2%
이야기 426
 
0.9%
2 308
 
0.6%
1 293
 
0.6%
장편소설 287
 
0.6%
위한 192
 
0.4%
우리 187
 
0.4%
the 173
 
0.4%
146
 
0.3%
3 134
 
0.3%
Other values (20765) 43055
89.4%
2023-12-12T12:55:55.370391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39494
 
21.0%
3423
 
1.8%
3380
 
1.8%
: 2993
 
1.6%
2443
 
1.3%
1938
 
1.0%
1726
 
0.9%
e 1676
 
0.9%
1655
 
0.9%
1640
 
0.9%
Other values (1541) 127520
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117118
62.3%
Space Separator 39494
 
21.0%
Lowercase Letter 13458
 
7.2%
Other Punctuation 7131
 
3.8%
Decimal Number 3941
 
2.1%
Uppercase Letter 2568
 
1.4%
Close Punctuation 1661
 
0.9%
Open Punctuation 1660
 
0.9%
Math Symbol 660
 
0.4%
Dash Punctuation 180
 
0.1%
Other values (2) 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3423
 
2.9%
3380
 
2.9%
2443
 
2.1%
1938
 
1.7%
1726
 
1.5%
1655
 
1.4%
1640
 
1.4%
1589
 
1.4%
1563
 
1.3%
1526
 
1.3%
Other values (1413) 96235
82.2%
Lowercase Letter
ValueCountFrequency (%)
e 1676
12.5%
o 1214
 
9.0%
i 1089
 
8.1%
a 1062
 
7.9%
n 1013
 
7.5%
t 921
 
6.8%
r 899
 
6.7%
s 824
 
6.1%
l 615
 
4.6%
h 596
 
4.4%
Other values (28) 3549
26.4%
Uppercase Letter
ValueCountFrequency (%)
T 243
 
9.5%
S 236
 
9.2%
C 178
 
6.9%
E 172
 
6.7%
W 166
 
6.5%
A 162
 
6.3%
I 127
 
4.9%
L 111
 
4.3%
P 111
 
4.3%
M 107
 
4.2%
Other values (18) 955
37.2%
Other Punctuation
ValueCountFrequency (%)
: 2993
42.0%
. 1453
20.4%
, 1376
19.3%
? 390
 
5.5%
! 380
 
5.3%
· 185
 
2.6%
' 96
 
1.3%
57
 
0.8%
49
 
0.7%
& 41
 
0.6%
Other values (13) 111
 
1.6%
Math Symbol
ValueCountFrequency (%)
= 552
83.6%
~ 43
 
6.5%
+ 30
 
4.5%
> 7
 
1.1%
< 7
 
1.1%
6
 
0.9%
6
 
0.9%
| 4
 
0.6%
3
 
0.5%
1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 1056
26.8%
2 731
18.5%
0 682
17.3%
3 372
 
9.4%
5 274
 
7.0%
4 248
 
6.3%
6 182
 
4.6%
7 140
 
3.6%
9 134
 
3.4%
8 122
 
3.1%
Close Punctuation
ValueCountFrequency (%)
) 1625
97.8%
] 29
 
1.7%
4
 
0.2%
2
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1624
97.8%
[ 29
 
1.7%
4
 
0.2%
2
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
Other Symbol
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
39494
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 180
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 116844
62.2%
Common 54734
29.1%
Latin 16017
 
8.5%
Han 269
 
0.1%
Cyrillic 19
 
< 0.1%
Katakana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3423
 
2.9%
3380
 
2.9%
2443
 
2.1%
1938
 
1.7%
1726
 
1.5%
1655
 
1.4%
1640
 
1.4%
1589
 
1.4%
1563
 
1.3%
1526
 
1.3%
Other values (1288) 95961
82.1%
Han
ValueCountFrequency (%)
19
 
7.1%
14
 
5.2%
13
 
4.8%
13
 
4.8%
11
 
4.1%
11
 
4.1%
9
 
3.3%
8
 
3.0%
6
 
2.2%
6
 
2.2%
Other values (110) 159
59.1%
Common
ValueCountFrequency (%)
39494
72.2%
: 2993
 
5.5%
) 1625
 
3.0%
( 1624
 
3.0%
. 1453
 
2.7%
, 1376
 
2.5%
1 1056
 
1.9%
2 731
 
1.3%
0 682
 
1.2%
= 552
 
1.0%
Other values (49) 3148
 
5.8%
Latin
ValueCountFrequency (%)
e 1676
 
10.5%
o 1214
 
7.6%
i 1089
 
6.8%
a 1062
 
6.6%
n 1013
 
6.3%
t 921
 
5.8%
r 899
 
5.6%
s 824
 
5.1%
l 615
 
3.8%
h 596
 
3.7%
Other values (45) 6108
38.1%
Cyrillic
ValueCountFrequency (%)
е 3
15.8%
н 2
10.5%
и 2
10.5%
о 2
10.5%
в 1
 
5.3%
з 1
 
5.3%
З 1
 
5.3%
д 1
 
5.3%
а 1
 
5.3%
п 1
 
5.3%
Other values (4) 4
21.1%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 116834
62.2%
ASCII 70348
37.4%
None 372
 
0.2%
CJK 257
 
0.1%
Cyrillic 19
 
< 0.1%
CJK Compat Ideographs 12
 
< 0.1%
Number Forms 10
 
< 0.1%
Compat Jamo 10
 
< 0.1%
Punctuation 7
 
< 0.1%
Math Operators 6
 
< 0.1%
Other values (5) 13
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39494
56.1%
: 2993
 
4.3%
e 1676
 
2.4%
) 1625
 
2.3%
( 1624
 
2.3%
. 1453
 
2.1%
, 1376
 
2.0%
o 1214
 
1.7%
i 1089
 
1.5%
a 1062
 
1.5%
Other values (76) 16742
23.8%
Hangul
ValueCountFrequency (%)
3423
 
2.9%
3380
 
2.9%
2443
 
2.1%
1938
 
1.7%
1726
 
1.5%
1655
 
1.4%
1640
 
1.4%
1589
 
1.4%
1563
 
1.3%
1526
 
1.3%
Other values (1284) 95951
82.1%
None
ValueCountFrequency (%)
· 185
49.7%
57
 
15.3%
49
 
13.2%
20
 
5.4%
16
 
4.3%
8
 
2.2%
6
 
1.6%
6
 
1.6%
4
 
1.1%
4
 
1.1%
Other values (9) 17
 
4.6%
CJK
ValueCountFrequency (%)
19
 
7.4%
14
 
5.4%
13
 
5.1%
13
 
5.1%
11
 
4.3%
11
 
4.3%
9
 
3.5%
6
 
2.3%
6
 
2.3%
5
 
1.9%
Other values (105) 150
58.4%
CJK Compat Ideographs
ValueCountFrequency (%)
8
66.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Punctuation
ValueCountFrequency (%)
7
100.0%
Number Forms
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
Math Operators
ValueCountFrequency (%)
6
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
4
40.0%
2
20.0%
2
20.0%
2
20.0%
Cyrillic
ValueCountFrequency (%)
е 3
15.8%
н 2
10.5%
и 2
10.5%
о 2
10.5%
в 1
 
5.3%
з 1
 
5.3%
З 1
 
5.3%
д 1
 
5.3%
а 1
 
5.3%
п 1
 
5.3%
Other values (4) 4
21.1%
Arrows
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Distinct8807
Distinct (%)88.1%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T12:55:55.876845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length138
Median length115
Mean length15.70227
Min length2

Characters and Unicode

Total characters157007
Distinct characters1052
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8153 ?
Unique (%)81.5%

Sample

1st row이신애 지음
2nd row정혜신 지음
3rd row엘자 드베르누아 글 ; 피에르 브이예 그림
4th row나카가와 리에코 글 ; 야마와키 유리코 그림 ; 고광미 옮김
5th row김원중 편저
ValueCountFrequency (%)
7636
 
15.8%
지음 4645
 
9.6%
3096
 
6.4%
그림 2959
 
6.1%
옮김 2702
 
5.6%
글·그림 350
 
0.7%
엮음 306
 
0.6%
by 216
 
0.4%
공]지음 195
 
0.4%
글.그림 190
 
0.4%
Other values (12732) 26063
53.9%
2023-12-12T12:55:56.667010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39113
24.9%
; 7627
 
4.9%
5573
 
3.5%
5280
 
3.4%
5116
 
3.3%
3919
 
2.5%
3871
 
2.5%
3729
 
2.4%
3271
 
2.1%
2793
 
1.8%
Other values (1042) 76715
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99105
63.1%
Space Separator 39113
 
24.9%
Other Punctuation 9146
 
5.8%
Lowercase Letter 6258
 
4.0%
Uppercase Letter 1655
 
1.1%
Open Punctuation 793
 
0.5%
Close Punctuation 788
 
0.5%
Dash Punctuation 63
 
< 0.1%
Decimal Number 46
 
< 0.1%
Math Symbol 35
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5573
 
5.6%
5280
 
5.3%
5116
 
5.2%
3919
 
4.0%
3871
 
3.9%
3729
 
3.8%
3271
 
3.3%
2793
 
2.8%
1675
 
1.7%
1591
 
1.6%
Other values (958) 62287
62.8%
Lowercase Letter
ValueCountFrequency (%)
e 714
11.4%
a 670
10.7%
n 641
10.2%
i 502
 
8.0%
r 455
 
7.3%
t 380
 
6.1%
o 376
 
6.0%
s 344
 
5.5%
l 328
 
5.2%
y 326
 
5.2%
Other values (16) 1522
24.3%
Uppercase Letter
ValueCountFrequency (%)
S 192
 
11.6%
M 166
 
10.0%
B 121
 
7.3%
D 117
 
7.1%
K 108
 
6.5%
J 106
 
6.4%
E 97
 
5.9%
L 96
 
5.8%
C 80
 
4.8%
R 75
 
4.5%
Other values (16) 497
30.0%
Decimal Number
ValueCountFrequency (%)
3 11
23.9%
1 9
19.6%
6 6
13.0%
2 5
10.9%
8 4
 
8.7%
0 4
 
8.7%
5 3
 
6.5%
4 3
 
6.5%
9 1
 
2.2%
Other Punctuation
ValueCountFrequency (%)
; 7627
83.4%
. 671
 
7.3%
· 427
 
4.7%
, 308
 
3.4%
: 94
 
1.0%
& 13
 
0.1%
' 5
 
0.1%
! 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 785
99.0%
3
 
0.4%
( 2
 
0.3%
2
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 781
99.1%
) 2
 
0.3%
2
 
0.3%
2
 
0.3%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 18
51.4%
< 17
48.6%
Space Separator
ValueCountFrequency (%)
39113
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99010
63.1%
Common 49989
31.8%
Latin 7913
 
5.0%
Han 95
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5573
 
5.6%
5280
 
5.3%
5116
 
5.2%
3919
 
4.0%
3871
 
3.9%
3729
 
3.8%
3271
 
3.3%
2793
 
2.8%
1675
 
1.7%
1591
 
1.6%
Other values (898) 62192
62.8%
Han
ValueCountFrequency (%)
17
 
17.9%
6
 
6.3%
5
 
5.3%
3
 
3.2%
3
 
3.2%
2
 
2.1%
2
 
2.1%
2
 
2.1%
2
 
2.1%
2
 
2.1%
Other values (50) 51
53.7%
Latin
ValueCountFrequency (%)
e 714
 
9.0%
a 670
 
8.5%
n 641
 
8.1%
i 502
 
6.3%
r 455
 
5.8%
t 380
 
4.8%
o 376
 
4.8%
s 344
 
4.3%
l 328
 
4.1%
y 326
 
4.1%
Other values (42) 3177
40.1%
Common
ValueCountFrequency (%)
39113
78.2%
; 7627
 
15.3%
[ 785
 
1.6%
] 781
 
1.6%
. 671
 
1.3%
· 427
 
0.9%
, 308
 
0.6%
: 94
 
0.2%
- 63
 
0.1%
> 18
 
< 0.1%
Other values (22) 102
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99004
63.1%
ASCII 57459
36.6%
None 438
 
0.3%
CJK 90
 
0.1%
Compat Jamo 6
 
< 0.1%
Enclosed Alphanum 5
 
< 0.1%
CJK Compat Ideographs 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39113
68.1%
; 7627
 
13.3%
[ 785
 
1.4%
] 781
 
1.4%
e 714
 
1.2%
. 671
 
1.2%
a 670
 
1.2%
n 641
 
1.1%
i 502
 
0.9%
r 455
 
0.8%
Other values (66) 5500
 
9.6%
Hangul
ValueCountFrequency (%)
5573
 
5.6%
5280
 
5.3%
5116
 
5.2%
3919
 
4.0%
3871
 
3.9%
3729
 
3.8%
3271
 
3.3%
2793
 
2.8%
1675
 
1.7%
1591
 
1.6%
Other values (897) 62186
62.8%
None
ValueCountFrequency (%)
· 427
97.5%
3
 
0.7%
2
 
0.5%
2
 
0.5%
2
 
0.5%
1
 
0.2%
1
 
0.2%
CJK
ValueCountFrequency (%)
17
 
18.9%
6
 
6.7%
5
 
5.6%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (47) 47
52.2%
Compat Jamo
ValueCountFrequency (%)
6
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
5
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Distinct2237
Distinct (%)22.4%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T12:55:57.200223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length4.4458784
Min length1

Characters and Unicode

Total characters44441
Distinct characters711
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1165 ?
Unique (%)11.7%

Sample

1st rowRHK
2nd row해냄
3rd row교원
4th row한림출판사
5th row민음사
ValueCountFrequency (%)
교원 317
 
3.0%
문학동네 182
 
1.7%
비룡소 164
 
1.6%
창비 133
 
1.3%
자음과모음 115
 
1.1%
사계절 110
 
1.1%
민음사 109
 
1.0%
시공주니어 109
 
1.0%
주니어김영사 109
 
1.0%
아이세움 105
 
1.0%
Other values (2291) 8969
86.1%
2023-12-12T12:55:57.908844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1670
 
3.8%
1251
 
2.8%
1247
 
2.8%
1014
 
2.3%
888
 
2.0%
710
 
1.6%
640
 
1.4%
629
 
1.4%
602
 
1.4%
576
 
1.3%
Other values (701) 35214
79.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39141
88.1%
Lowercase Letter 3444
 
7.7%
Uppercase Letter 1040
 
2.3%
Space Separator 426
 
1.0%
Decimal Number 157
 
0.4%
Other Punctuation 103
 
0.2%
Open Punctuation 59
 
0.1%
Close Punctuation 59
 
0.1%
Dash Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1670
 
4.3%
1251
 
3.2%
1247
 
3.2%
1014
 
2.6%
888
 
2.3%
710
 
1.8%
640
 
1.6%
629
 
1.6%
602
 
1.5%
576
 
1.5%
Other values (625) 29914
76.4%
Lowercase Letter
ValueCountFrequency (%)
o 393
11.4%
s 370
10.7%
i 321
 
9.3%
a 274
 
8.0%
n 272
 
7.9%
l 247
 
7.2%
e 220
 
6.4%
r 217
 
6.3%
h 149
 
4.3%
k 133
 
3.9%
Other values (15) 848
24.6%
Uppercase Letter
ValueCountFrequency (%)
B 155
14.9%
M 134
12.9%
K 92
 
8.8%
C 73
 
7.0%
P 62
 
6.0%
S 61
 
5.9%
H 60
 
5.8%
O 50
 
4.8%
D 45
 
4.3%
A 43
 
4.1%
Other values (15) 265
25.5%
Other Punctuation
ValueCountFrequency (%)
& 47
45.6%
· 11
 
10.7%
10
 
9.7%
. 9
 
8.7%
, 7
 
6.8%
@ 4
 
3.9%
? 4
 
3.9%
' 3
 
2.9%
; 3
 
2.9%
: 3
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 68
43.3%
2 67
42.7%
3 5
 
3.2%
6 5
 
3.2%
0 5
 
3.2%
7 2
 
1.3%
8 2
 
1.3%
5 2
 
1.3%
4 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 57
96.6%
[ 2
 
3.4%
Close Punctuation
ValueCountFrequency (%)
) 57
96.6%
] 2
 
3.4%
Space Separator
ValueCountFrequency (%)
426
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39091
88.0%
Latin 4484
 
10.1%
Common 816
 
1.8%
Han 50
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1670
 
4.3%
1251
 
3.2%
1247
 
3.2%
1014
 
2.6%
888
 
2.3%
710
 
1.8%
640
 
1.6%
629
 
1.6%
602
 
1.5%
576
 
1.5%
Other values (600) 29864
76.4%
Latin
ValueCountFrequency (%)
o 393
 
8.8%
s 370
 
8.3%
i 321
 
7.2%
a 274
 
6.1%
n 272
 
6.1%
l 247
 
5.5%
e 220
 
4.9%
r 217
 
4.8%
B 155
 
3.5%
h 149
 
3.3%
Other values (40) 1866
41.6%
Common
ValueCountFrequency (%)
426
52.2%
1 68
 
8.3%
2 67
 
8.2%
( 57
 
7.0%
) 57
 
7.0%
& 47
 
5.8%
- 12
 
1.5%
· 11
 
1.3%
10
 
1.2%
. 9
 
1.1%
Other values (16) 52
 
6.4%
Han
ValueCountFrequency (%)
8
16.0%
7
14.0%
4
 
8.0%
4
 
8.0%
4
 
8.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
1
 
2.0%
Other values (15) 15
30.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39091
88.0%
ASCII 5279
 
11.9%
CJK 50
 
0.1%
None 21
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1670
 
4.3%
1251
 
3.2%
1247
 
3.2%
1014
 
2.6%
888
 
2.3%
710
 
1.8%
640
 
1.6%
629
 
1.6%
602
 
1.5%
576
 
1.5%
Other values (600) 29864
76.4%
ASCII
ValueCountFrequency (%)
426
 
8.1%
o 393
 
7.4%
s 370
 
7.0%
i 321
 
6.1%
a 274
 
5.2%
n 272
 
5.2%
l 247
 
4.7%
e 220
 
4.2%
r 217
 
4.1%
B 155
 
2.9%
Other values (64) 2384
45.2%
None
ValueCountFrequency (%)
· 11
52.4%
10
47.6%
CJK
ValueCountFrequency (%)
8
16.0%
7
14.0%
4
 
8.0%
4
 
8.0%
4
 
8.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
1
 
2.0%
Other values (15) 15
30.0%
Distinct53
Distinct (%)0.5%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T12:55:58.195101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length4
Mean length4.0131013
Min length3

Characters and Unicode

Total characters40127
Distinct characters17
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)0.2%

Sample

1st row2014
2nd row2019
3rd row2009
4th row2007
5th row2007
ValueCountFrequency (%)
2010 1797
18.0%
2009 1243
12.4%
2011 968
9.7%
2008 836
 
8.4%
2007 573
 
5.7%
2006 523
 
5.2%
2005 481
 
4.8%
2017 467
 
4.7%
2014 424
 
4.2%
2016 374
 
3.7%
Other values (38) 2313
23.1%
2023-12-12T12:55:58.663524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 16068
40.0%
2 10516
26.2%
1 6743
16.8%
9 1681
 
4.2%
8 1183
 
2.9%
7 1045
 
2.6%
6 907
 
2.3%
5 809
 
2.0%
4 647
 
1.6%
3 440
 
1.1%
Other values (7) 88
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40039
99.8%
Open Punctuation 38
 
0.1%
Close Punctuation 38
 
0.1%
Dash Punctuation 10
 
< 0.1%
Other Letter 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 16068
40.1%
2 10516
26.3%
1 6743
16.8%
9 1681
 
4.2%
8 1183
 
3.0%
7 1045
 
2.6%
6 907
 
2.3%
5 809
 
2.0%
4 647
 
1.6%
3 440
 
1.1%
Open Punctuation
ValueCountFrequency (%)
[ 37
97.4%
( 1
 
2.6%
Close Punctuation
ValueCountFrequency (%)
] 37
97.4%
) 1
 
2.6%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40125
> 99.9%
Hangul 2
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 16068
40.0%
2 10516
26.2%
1 6743
16.8%
9 1681
 
4.2%
8 1183
 
2.9%
7 1045
 
2.6%
6 907
 
2.3%
5 809
 
2.0%
4 647
 
1.6%
3 440
 
1.1%
Other values (5) 86
 
0.2%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40125
> 99.9%
Hangul 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 16068
40.0%
2 10516
26.2%
1 6743
16.8%
9 1681
 
4.2%
8 1183
 
2.9%
7 1045
 
2.6%
6 907
 
2.3%
5 809
 
2.0%
4 647
 
1.6%
3 440
 
1.1%
Other values (5) 86
 
0.2%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Missing values

2023-12-12T12:55:53.950557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:55:54.055175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:55:54.184497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서명저작자발행자발행연도
38865(잠수네) 프리스쿨 영어공부법 : 엄마와 아이가 모두 행복한 5세·6세·7세 로드맵이신애 지음RHK2014
49134당신이 옳다 : 큰글자도서정혜신 지음해냄2019
1871내 거랑 바꿀래엘자 드베르누아 글 ; 피에르 브이예 그림교원2009
1356구리와 구라의 소풍나카가와 리에코 글 ; 야마와키 유리코 그림 ; 고광미 옮김한림출판사2007
15424고사성어 백과사전김원중 편저민음사2007
20373람세스 .2 영원의 신전자크,크리스티앙문학동네2010
12452스타벅스에서는 그란데를 사라요시모토 요시오 지음 ; 홍성민 옮김동아일보사2008
19801우리 서로 사랑할 수 있다면 : 용혜원 신작 시집용혜원 지음 ; 임효 그림나무생각2010
41915느영나영 제주조지욱 글 ; 김동성 그림나는별2015
3238초원의 집. 3로라 잉걸스 와일더 글 ; 가스 윌리엄스 그림 ; 김석희 옮김비룡소2005
서명저작자발행자발행연도
8326노란 누드최영주 지음미술문화2008
5339(이이화 선생님의)고구려 바로알기. 2, 장수왕에서 마지막 왕 보장왕까지이이화 원작 ; 최금락 구성 ; 원병조 그림해피북스2007
6060(푸름이)세계자연과학 = prumi science. 8, 관엽 식물 ~ 구황 식물한국자연생태과학원 엮음푸름이닷컴2005
33724숨겨진 심리학 : 최고의 프로파일러가 알려주는 설득과 협상의 비밀표창원 지음토네이도2011
32500(생각이 쑥쑥) 나의 첫 경제책. 1:, 돈이 뭐예요?클레어 레웰린 지음 ; 마이크 고든 그림 ; 최연순 옮김상상스쿨2011
20192자연주의 채식요리이양지 지음 ; 한지선 [외]요리리스컴2010
45811안녕, 웨이안 :칭산 소설칭산 지음한겨레출판2018
36847안네의 일기Anne Frank 지음 ; Kay Sam Shephard 옮김THE TEXT2009
59무슨 생각하니로버트 잉펜 글. 그림 ; 문우일 옮김국민서관2006
2194(21세기)먼나라 이웃나라. 11, 미국 2(역사편)이원복 글·그림김영사2004

Duplicate rows

Most frequently occurring

서명저작자발행자발행연도# duplicates
15과학자가 들려주는 과학이야기 . 1-100정완상 지음자음과모음200817
11EQ 휴먼 파워. 1-60정제광 외 글 ; 이형진 외 그림한국톨스토이201315
8(영유아 통합발달 프로그램)뽀삐프뢰벨유아교육연구소 글 ; 이의정 그림 ; 오은영 감수베틀북201210
31저학년 명작 도서관. 1-28편집부예림당20079
12Little classic book. 1-20그림 형제예림당20106
10(칼빈)聖經註釋. 1-20존 칼빈 原著 ; 존 칼빈 성경주석 출판위원회 편역성서연구원20125
14개구쟁이 아치. 1-19지은이: 기요노 사치코 ; 옮긴이: 고향옥비룡소2009-20104
38한국민족문화대백과사전. 1-28한국정신문화연구원,한국정신문화연구원19914
23무엇일까?[애플비편집부] 편애플비20093
0(At Home)in the cityby Sharon GordonMarshall Cavendish20082