Overview

Dataset statistics

Number of variables10
Number of observations384
Missing cells8
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory30.1 KiB
Average record size in memory80.3 B

Variable types

Text8
Categorical2

Dataset

Description중랑구시설관리공단 산하 도서관 12월 신착자료 정보
Author중랑구시설관리공단
URLhttps://www.data.go.kr/data/15044311/fileData.do

Alerts

Unnamed: 6 is highly overall correlated with Unnamed: 9High correlation
Unnamed: 9 is highly overall correlated with Unnamed: 6High correlation

Reproduction

Analysis started2023-12-12 20:40:00.639050
Analysis finished2023-12-12 20:40:02.522969
Duration1.88 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct383
Distinct (%)100.0%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:02.940279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.7180157
Min length1

Characters and Unicode

Total characters1041
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique383 ?
Unique (%)100.0%

Sample

1st rowNo.
2nd row1
3rd row2
4th row3
5th row4
ValueCountFrequency (%)
no 1
 
0.3%
262 1
 
0.3%
261 1
 
0.3%
260 1
 
0.3%
259 1
 
0.3%
258 1
 
0.3%
257 1
 
0.3%
256 1
 
0.3%
255 1
 
0.3%
254 1
 
0.3%
Other values (373) 373
97.4%
2023-12-13T05:40:03.566346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 179
17.2%
2 179
17.2%
3 161
15.5%
4 78
7.5%
5 78
7.5%
6 78
7.5%
7 78
7.5%
8 71
 
6.8%
9 68
 
6.5%
0 68
 
6.5%
Other values (3) 3
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1038
99.7%
Uppercase Letter 1
 
0.1%
Lowercase Letter 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 179
17.2%
2 179
17.2%
3 161
15.5%
4 78
7.5%
5 78
7.5%
6 78
7.5%
7 78
7.5%
8 71
 
6.8%
9 68
 
6.6%
0 68
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
o 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1039
99.8%
Latin 2
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 179
17.2%
2 179
17.2%
3 161
15.5%
4 78
7.5%
5 78
7.5%
6 78
7.5%
7 78
7.5%
8 71
 
6.8%
9 68
 
6.5%
0 68
 
6.5%
Latin
ValueCountFrequency (%)
N 1
50.0%
o 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1041
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 179
17.2%
2 179
17.2%
3 161
15.5%
4 78
7.5%
5 78
7.5%
6 78
7.5%
7 78
7.5%
8 71
 
6.8%
9 68
 
6.5%
0 68
 
6.5%
Other values (3) 3
 
0.3%
Distinct383
Distinct (%)100.0%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:03.815151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.979112
Min length4

Characters and Unicode

Total characters4588
Distinct characters17
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique383 ?
Unique (%)100.0%

Sample

1st row등록번호
2nd rowWM0000007658
3rd rowWM0000007659
4th rowWM0000007668
5th rowWM0000007669
ValueCountFrequency (%)
등록번호 1
 
0.3%
wm0000007916 1
 
0.3%
wm0000007915 1
 
0.3%
wm0000007914 1
 
0.3%
wm0000007913 1
 
0.3%
wm0000007912 1
 
0.3%
wm0000007922 1
 
0.3%
wm0000007921 1
 
0.3%
wm0000007920 1
 
0.3%
wm0000007911 1
 
0.3%
Other values (373) 373
97.4%
2023-12-13T05:40:04.243297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2219
48.4%
7 559
 
12.2%
M 382
 
8.3%
W 306
 
6.7%
8 240
 
5.2%
1 155
 
3.4%
2 155
 
3.4%
9 137
 
3.0%
6 123
 
2.7%
3 79
 
1.7%
Other values (7) 233
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3820
83.3%
Uppercase Letter 764
 
16.7%
Other Letter 4
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2219
58.1%
7 559
 
14.6%
8 240
 
6.3%
1 155
 
4.1%
2 155
 
4.1%
9 137
 
3.6%
6 123
 
3.2%
3 79
 
2.1%
5 77
 
2.0%
4 76
 
2.0%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Uppercase Letter
ValueCountFrequency (%)
M 382
50.0%
W 306
40.1%
K 76
 
9.9%

Most occurring scripts

ValueCountFrequency (%)
Common 3820
83.3%
Latin 764
 
16.7%
Hangul 4
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2219
58.1%
7 559
 
14.6%
8 240
 
6.3%
1 155
 
4.1%
2 155
 
4.1%
9 137
 
3.6%
6 123
 
3.2%
3 79
 
2.1%
5 77
 
2.0%
4 76
 
2.0%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Latin
ValueCountFrequency (%)
M 382
50.0%
W 306
40.1%
K 76
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4584
99.9%
Hangul 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2219
48.4%
7 559
 
12.2%
M 382
 
8.3%
W 306
 
6.7%
8 240
 
5.2%
1 155
 
3.4%
2 155
 
3.4%
9 137
 
3.0%
6 123
 
2.7%
3 79
 
1.7%
Other values (3) 229
 
5.0%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct383
Distinct (%)100.0%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:04.546928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length17.467363
Min length4

Characters and Unicode

Total characters6690
Distinct characters63
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique383 ?
Unique (%)100.0%

Sample

1st row청구기호
2nd rowCB 510-K65l-v.7
3rd rowCB 747-H955o-v.1-1
4th rowCB 747-H955o-v.1-10
5th rowCB 747-H955o-v.1-11
ValueCountFrequency (%)
cb 273
35.7%
ge 76
 
9.9%
ch 33
 
4.3%
747-h955o-v.8-9 1
 
0.1%
747-h955o-v.7-9 1
 
0.1%
747-h955o-v.9-5 1
 
0.1%
747-h955o-v.9-4 1
 
0.1%
747-h955o-v.9-3 1
 
0.1%
747-h955o-v.9-2 1
 
0.1%
747-h955o-v.9-12 1
 
0.1%
Other values (376) 376
49.2%
2023-12-13T05:40:05.009655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 964
14.4%
5 683
 
10.2%
7 651
 
9.7%
4 459
 
6.9%
382
 
5.7%
9 358
 
5.4%
. 351
 
5.2%
H 316
 
4.7%
C 306
 
4.6%
v 300
 
4.5%
Other values (53) 1920
28.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3160
47.2%
Uppercase Letter 1070
 
16.0%
Dash Punctuation 964
 
14.4%
Lowercase Letter 606
 
9.1%
Space Separator 382
 
5.7%
Other Punctuation 351
 
5.2%
Other Letter 156
 
2.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
19.9%
24
15.4%
17
10.9%
15
9.6%
14
9.0%
9
 
5.8%
8
 
5.1%
7
 
4.5%
6
 
3.8%
5
 
3.2%
Other values (13) 20
12.8%
Lowercase Letter
ValueCountFrequency (%)
v 300
49.5%
o 285
47.0%
i 3
 
0.5%
r 2
 
0.3%
c 2
 
0.3%
n 2
 
0.3%
b 2
 
0.3%
p 2
 
0.3%
g 2
 
0.3%
s 2
 
0.3%
Other values (4) 4
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
H 316
29.5%
C 306
28.6%
B 275
25.7%
G 79
 
7.4%
E 77
 
7.2%
S 5
 
0.5%
R 3
 
0.3%
M 3
 
0.3%
L 2
 
0.2%
K 2
 
0.2%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
5 683
21.6%
7 651
20.6%
4 459
14.5%
9 358
11.3%
3 255
 
8.1%
1 251
 
7.9%
2 213
 
6.7%
8 124
 
3.9%
6 92
 
2.9%
0 74
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 964
100.0%
Space Separator
ValueCountFrequency (%)
382
100.0%
Other Punctuation
ValueCountFrequency (%)
. 351
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4858
72.6%
Latin 1676
 
25.1%
Hangul 156
 
2.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
H 316
18.9%
C 306
18.3%
v 300
17.9%
o 285
17.0%
B 275
16.4%
G 79
 
4.7%
E 77
 
4.6%
S 5
 
0.3%
R 3
 
0.2%
M 3
 
0.2%
Other values (16) 27
 
1.6%
Hangul
ValueCountFrequency (%)
31
19.9%
24
15.4%
17
10.9%
15
9.6%
14
9.0%
9
 
5.8%
8
 
5.1%
7
 
4.5%
6
 
3.8%
5
 
3.2%
Other values (13) 20
12.8%
Common
ValueCountFrequency (%)
- 964
19.8%
5 683
14.1%
7 651
13.4%
4 459
9.4%
382
 
7.9%
9 358
 
7.4%
. 351
 
7.2%
3 255
 
5.2%
1 251
 
5.2%
2 213
 
4.4%
Other values (4) 291
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6534
97.7%
Compat Jamo 147
 
2.2%
Hangul 9
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 964
14.8%
5 683
 
10.5%
7 651
 
10.0%
4 459
 
7.0%
382
 
5.8%
9 358
 
5.5%
. 351
 
5.4%
H 316
 
4.8%
C 306
 
4.7%
v 300
 
4.6%
Other values (30) 1764
27.0%
Compat Jamo
ValueCountFrequency (%)
31
21.1%
24
16.3%
17
11.6%
15
10.2%
14
9.5%
9
 
6.1%
8
 
5.4%
7
 
4.8%
6
 
4.1%
5
 
3.4%
Other values (4) 11
 
7.5%
Hangul
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Distinct157
Distinct (%)41.0%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:05.307437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length51
Mean length18.715405
Min length2

Characters and Unicode

Total characters7168
Distinct characters439
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)26.4%

Sample

1st row서명
2nd rowSkin
3rd rowWhat a mess!
4th row(The) headache
5th row(The) headache
ValueCountFrequency (%)
the 174
 
11.4%
68
 
4.5%
a 24
 
1.6%
what 21
 
1.4%
in 20
 
1.3%
of 19
 
1.2%
egg 12
 
0.8%
hop 12
 
0.8%
got 12
 
0.8%
and 12
 
0.8%
Other values (658) 1148
75.4%
2023-12-13T05:40:05.765613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1139
 
15.9%
e 554
 
7.7%
a 305
 
4.3%
t 294
 
4.1%
h 251
 
3.5%
s 227
 
3.2%
o 217
 
3.0%
n 214
 
3.0%
i 213
 
3.0%
r 179
 
2.5%
Other values (429) 3575
49.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3588
50.1%
Other Letter 1640
22.9%
Space Separator 1139
 
15.9%
Uppercase Letter 328
 
4.6%
Other Punctuation 178
 
2.5%
Open Punctuation 133
 
1.9%
Close Punctuation 133
 
1.9%
Decimal Number 22
 
0.3%
Math Symbol 6
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
3.8%
46
 
2.8%
42
 
2.6%
38
 
2.3%
37
 
2.3%
27
 
1.6%
27
 
1.6%
27
 
1.6%
27
 
1.6%
23
 
1.4%
Other values (365) 1283
78.2%
Lowercase Letter
ValueCountFrequency (%)
e 554
15.4%
a 305
 
8.5%
t 294
 
8.2%
h 251
 
7.0%
s 227
 
6.3%
o 217
 
6.0%
n 214
 
6.0%
i 213
 
5.9%
r 179
 
5.0%
p 161
 
4.5%
Other values (15) 973
27.1%
Uppercase Letter
ValueCountFrequency (%)
T 125
38.1%
W 29
 
8.8%
H 24
 
7.3%
F 19
 
5.8%
K 19
 
5.8%
S 14
 
4.3%
G 13
 
4.0%
M 12
 
3.7%
P 12
 
3.7%
R 9
 
2.7%
Other values (9) 52
15.9%
Decimal Number
ValueCountFrequency (%)
1 7
31.8%
2 5
22.7%
0 3
13.6%
9 2
 
9.1%
6 2
 
9.1%
7 1
 
4.5%
3 1
 
4.5%
5 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
: 62
34.8%
' 32
18.0%
! 29
16.3%
, 23
 
12.9%
? 20
 
11.2%
· 7
 
3.9%
. 5
 
2.8%
Space Separator
ValueCountFrequency (%)
1139
100.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Close Punctuation
ValueCountFrequency (%)
) 133
100.0%
Math Symbol
ValueCountFrequency (%)
= 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3916
54.6%
Hangul 1640
22.9%
Common 1612
22.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
3.8%
46
 
2.8%
42
 
2.6%
38
 
2.3%
37
 
2.3%
27
 
1.6%
27
 
1.6%
27
 
1.6%
27
 
1.6%
23
 
1.4%
Other values (365) 1283
78.2%
Latin
ValueCountFrequency (%)
e 554
14.1%
a 305
 
7.8%
t 294
 
7.5%
h 251
 
6.4%
s 227
 
5.8%
o 217
 
5.5%
n 214
 
5.5%
i 213
 
5.4%
r 179
 
4.6%
p 161
 
4.1%
Other values (34) 1301
33.2%
Common
ValueCountFrequency (%)
1139
70.7%
( 133
 
8.3%
) 133
 
8.3%
: 62
 
3.8%
' 32
 
2.0%
! 29
 
1.8%
, 23
 
1.4%
? 20
 
1.2%
· 7
 
0.4%
1 7
 
0.4%
Other values (10) 27
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5521
77.0%
Hangul 1640
 
22.9%
None 7
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1139
20.6%
e 554
 
10.0%
a 305
 
5.5%
t 294
 
5.3%
h 251
 
4.5%
s 227
 
4.1%
o 217
 
3.9%
n 214
 
3.9%
i 213
 
3.9%
r 179
 
3.2%
Other values (53) 1928
34.9%
Hangul
ValueCountFrequency (%)
63
 
3.8%
46
 
2.8%
42
 
2.6%
38
 
2.3%
37
 
2.3%
27
 
1.6%
27
 
1.6%
27
 
1.6%
27
 
1.6%
23
 
1.4%
Other values (365) 1283
78.2%
None
ValueCountFrequency (%)
· 7
100.0%
Distinct109
Distinct (%)28.5%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:05.946393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length64
Mean length47.172324
Min length3

Characters and Unicode

Total characters18067
Distinct characters256
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)25.8%

Sample

1st row저작자
2nd rowby Cynthia Klingel, Robert B. Noyed ; photographs by Gregg Andersen
3rd rowwritten by Roderick Hunt ; illustrated by Alex Brychta
4th rowwritten by Roderick Hunt ; illustrated by Alex Brychta
5th rowwritten by Roderick Hunt ; illustrated by Alex Brychta
ValueCountFrequency (%)
by 602
19.3%
328
10.5%
written 288
9.3%
hunt 282
9.1%
alex 282
9.1%
brychta 282
9.1%
roderick 275
8.8%
illustrated 258
8.3%
story 72
 
2.3%
지음 70
 
2.2%
Other values (260) 373
12.0%
2023-12-13T05:40:06.330315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2936
16.3%
t 1835
 
10.2%
r 1255
 
6.9%
e 1161
 
6.4%
y 966
 
5.3%
i 951
 
5.3%
l 909
 
5.0%
n 646
 
3.6%
a 640
 
3.5%
b 606
 
3.4%
Other values (246) 6162
34.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12726
70.4%
Space Separator 2936
 
16.3%
Uppercase Letter 1223
 
6.8%
Other Letter 747
 
4.1%
Other Punctuation 389
 
2.2%
Close Punctuation 22
 
0.1%
Open Punctuation 22
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
 
9.9%
73
 
9.8%
40
 
5.4%
39
 
5.2%
25
 
3.3%
19
 
2.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (194) 439
58.8%
Lowercase Letter
ValueCountFrequency (%)
t 1835
14.4%
r 1255
 
9.9%
e 1161
 
9.1%
y 966
 
7.6%
i 951
 
7.5%
l 909
 
7.1%
n 646
 
5.1%
a 640
 
5.0%
b 606
 
4.8%
u 595
 
4.7%
Other values (15) 3162
24.8%
Uppercase Letter
ValueCountFrequency (%)
B 290
23.7%
A 285
23.3%
H 283
23.1%
R 283
23.1%
M 13
 
1.1%
S 13
 
1.1%
D 11
 
0.9%
C 7
 
0.6%
G 7
 
0.6%
L 6
 
0.5%
Other values (9) 25
 
2.0%
Other Punctuation
ValueCountFrequency (%)
; 328
84.3%
. 37
 
9.5%
, 24
 
6.2%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
2936
100.0%
Close Punctuation
ValueCountFrequency (%)
] 22
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 13949
77.2%
Common 3371
 
18.7%
Hangul 747
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
 
9.9%
73
 
9.8%
40
 
5.4%
39
 
5.2%
25
 
3.3%
19
 
2.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (194) 439
58.8%
Latin
ValueCountFrequency (%)
t 1835
13.2%
r 1255
 
9.0%
e 1161
 
8.3%
y 966
 
6.9%
i 951
 
6.8%
l 909
 
6.5%
n 646
 
4.6%
a 640
 
4.6%
b 606
 
4.3%
u 595
 
4.3%
Other values (34) 4385
31.4%
Common
ValueCountFrequency (%)
2936
87.1%
; 328
 
9.7%
. 37
 
1.1%
, 24
 
0.7%
] 22
 
0.7%
[ 22
 
0.7%
> 1
 
< 0.1%
< 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17320
95.9%
Hangul 747
 
4.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2936
17.0%
t 1835
 
10.6%
r 1255
 
7.2%
e 1161
 
6.7%
y 966
 
5.6%
i 951
 
5.5%
l 909
 
5.2%
n 646
 
3.7%
a 640
 
3.7%
b 606
 
3.5%
Other values (42) 5415
31.3%
Hangul
ValueCountFrequency (%)
74
 
9.9%
73
 
9.8%
40
 
5.4%
39
 
5.2%
25
 
3.3%
19
 
2.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (194) 439
58.8%
Distinct79
Distinct (%)20.6%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:06.642667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length23
Mean length18.853786
Min length1

Characters and Unicode

Total characters7221
Distinct characters171
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)17.5%

Sample

1st row발행자
2nd rowWeeklyReader Early Learning Library
3rd rowOxford University Press
4th rowOxford University Press
5th rowOxford University Press
ValueCountFrequency (%)
press 285
28.8%
oxford 282
28.5%
university 282
28.5%
알마 8
 
0.8%
오월의봄 4
 
0.4%
삼성경제연구소 4
 
0.4%
책세상 3
 
0.3%
alfred 3
 
0.3%
a 3
 
0.3%
knopf 3
 
0.3%
Other values (95) 111
 
11.2%
2023-12-13T05:40:07.160024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 874
 
12.1%
s 871
 
12.1%
605
 
8.4%
e 599
 
8.3%
i 585
 
8.1%
n 313
 
4.3%
o 305
 
4.2%
d 299
 
4.1%
t 293
 
4.1%
P 292
 
4.0%
Other values (161) 2185
30.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5402
74.8%
Uppercase Letter 911
 
12.6%
Space Separator 605
 
8.4%
Other Letter 292
 
4.0%
Other Punctuation 9
 
0.1%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
5.5%
10
 
3.4%
9
 
3.1%
8
 
2.7%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (114) 210
71.9%
Lowercase Letter
ValueCountFrequency (%)
r 874
16.2%
s 871
16.1%
e 599
11.1%
i 585
10.8%
n 313
 
5.8%
o 305
 
5.6%
d 299
 
5.5%
t 293
 
5.4%
f 292
 
5.4%
y 290
 
5.4%
Other values (12) 681
12.6%
Uppercase Letter
ValueCountFrequency (%)
P 292
32.1%
O 282
31.0%
U 282
31.0%
A 12
 
1.3%
H 8
 
0.9%
C 5
 
0.5%
G 4
 
0.4%
S 4
 
0.4%
W 4
 
0.4%
K 3
 
0.3%
Other values (9) 15
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 7
77.8%
' 1
 
11.1%
1
 
11.1%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
605
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 6313
87.4%
Common 616
 
8.5%
Hangul 292
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
5.5%
10
 
3.4%
9
 
3.1%
8
 
2.7%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (114) 210
71.9%
Latin
ValueCountFrequency (%)
r 874
13.8%
s 871
13.8%
e 599
 
9.5%
i 585
 
9.3%
n 313
 
5.0%
o 305
 
4.8%
d 299
 
4.7%
t 293
 
4.6%
P 292
 
4.6%
f 292
 
4.6%
Other values (31) 1590
25.2%
Common
ValueCountFrequency (%)
605
98.2%
. 7
 
1.1%
2 1
 
0.2%
1 1
 
0.2%
' 1
 
0.2%
1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6928
95.9%
Hangul 292
 
4.0%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 874
12.6%
s 871
12.6%
605
 
8.7%
e 599
 
8.6%
i 585
 
8.4%
n 313
 
4.5%
o 305
 
4.4%
d 299
 
4.3%
t 293
 
4.2%
P 292
 
4.2%
Other values (36) 1892
27.3%
Hangul
ValueCountFrequency (%)
16
 
5.5%
10
 
3.4%
9
 
3.1%
8
 
2.7%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (114) 210
71.9%
None
ValueCountFrequency (%)
1
100.0%

Unnamed: 6
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2011
195 
2008
72 
2014
55 
2012
 
14
2010
 
12
Other values (12)
36 

Length

Max length11
Median length4
Mean length4.4348958
Min length3

Unique

Unique5 ?
Unique (%)1.3%

Sample

1st row<NA>
2nd row발행년
3rd row2013, c2002
4th row2011
5th row2011

Common Values

ValueCountFrequency (%)
2011 195
50.8%
2008 72
 
18.8%
2014 55
 
14.3%
2012 14
 
3.6%
2010 12
 
3.1%
2013 10
 
2.6%
2013, c2009 6
 
1.6%
2013, c2007 5
 
1.3%
2013, c2010 4
 
1.0%
2013, c2008 2
 
0.5%
Other values (7) 9
 
2.3%

Length

2023-12-13T05:40:07.359904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2011 195
47.8%
2008 72
 
17.6%
2014 55
 
13.5%
2013 34
 
8.3%
2012 14
 
3.4%
2010 12
 
2.9%
c2009 6
 
1.5%
c2007 5
 
1.2%
c2010 4
 
1.0%
c2008 2
 
0.5%
Other values (7) 9
 
2.2%
Distinct52
Distinct (%)13.6%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:07.591095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.4281984
Min length4

Characters and Unicode

Total characters1696
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)5.7%

Sample

1st row 가격
2nd row25200
3rd row6583
4th row6583
5th row6583
ValueCountFrequency (%)
7333 84
21.9%
6583 48
12.5%
8333 36
9.4%
8300 36
9.4%
11028 36
9.4%
13333 24
 
6.3%
14555 18
 
4.7%
15000 15
 
3.9%
14000 6
 
1.6%
18000 5
 
1.3%
Other values (42) 75
19.6%
2023-12-13T05:40:07.945854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 559
33.0%
0 374
22.1%
8 181
 
10.7%
1 181
 
10.7%
5 134
 
7.9%
7 95
 
5.6%
2 69
 
4.1%
6 53
 
3.1%
4 29
 
1.7%
9 17
 
1.0%
Other values (3) 4
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1692
99.8%
Space Separator 2
 
0.1%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 559
33.0%
0 374
22.1%
8 181
 
10.7%
1 181
 
10.7%
5 134
 
7.9%
7 95
 
5.6%
2 69
 
4.1%
6 53
 
3.1%
4 29
 
1.7%
9 17
 
1.0%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1694
99.9%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
3 559
33.0%
0 374
22.1%
8 181
 
10.7%
1 181
 
10.7%
5 134
 
7.9%
7 95
 
5.6%
2 69
 
4.1%
6 53
 
3.1%
4 29
 
1.7%
9 17
 
1.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1694
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 559
33.0%
0 374
22.1%
8 181
 
10.7%
1 181
 
10.7%
5 134
 
7.9%
7 95
 
5.6%
2 69
 
4.1%
6 53
 
3.1%
4 29
 
1.7%
9 17
 
1.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct116
Distinct (%)30.3%
Missing1
Missing (%)0.3%
Memory size3.1 KiB
2023-12-13T05:40:08.275373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length28
Mean length18.083551
Min length4

Characters and Unicode

Total characters6926
Distinct characters45
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)24.5%

Sample

1st row형태사항
2nd row24p.:col. ill.;18cm
3rd row8p.:col. ill.;20cm+CD-ROM 1매
4th row8p.:col. ill.;20cm
5th row8p.:col. ill.;20cm
ValueCountFrequency (%)
ill.;20cm 144
21.1%
16p.:col 108
15.8%
1매 58
 
8.5%
8p.:col 42
 
6.1%
ill.;22cm 33
 
4.8%
24p.:col 31
 
4.5%
24p.:ill.;20cm 30
 
4.4%
ill.;20cm+cd-rom 24
 
3.5%
32p.:col 19
 
2.8%
1v.:col 17
 
2.5%
Other values (112) 177
25.9%
2023-12-13T05:40:08.855261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 919
13.3%
l 843
12.2%
2 632
 
9.1%
c 613
 
8.9%
; 382
 
5.5%
m 382
 
5.5%
p 363
 
5.2%
: 353
 
5.1%
i 306
 
4.4%
300
 
4.3%
Other values (35) 1833
26.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2757
39.8%
Other Punctuation 1665
24.0%
Decimal Number 1601
23.1%
Space Separator 300
 
4.3%
Uppercase Letter 282
 
4.1%
Other Letter 184
 
2.7%
Math Symbol 60
 
0.9%
Dash Punctuation 55
 
0.8%
Close Punctuation 11
 
0.2%
Open Punctuation 11
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
31.5%
48
26.1%
37
20.1%
11
 
6.0%
10
 
5.4%
10
 
5.4%
4
 
2.2%
2
 
1.1%
1
 
0.5%
1
 
0.5%
Other values (2) 2
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 632
39.5%
1 249
 
15.6%
0 217
 
13.6%
6 142
 
8.9%
3 123
 
7.7%
4 105
 
6.6%
8 59
 
3.7%
9 34
 
2.1%
7 20
 
1.2%
5 20
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
l 843
30.6%
c 613
22.2%
m 382
13.9%
p 363
13.2%
i 306
 
11.1%
o 231
 
8.4%
v 19
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
D 59
20.9%
C 57
20.2%
M 55
19.5%
O 55
19.5%
R 55
19.5%
V 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 919
55.2%
; 382
22.9%
: 353
 
21.2%
, 11
 
0.7%
Math Symbol
ValueCountFrequency (%)
+ 58
96.7%
× 2
 
3.3%
Space Separator
ValueCountFrequency (%)
300
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Close Punctuation
ValueCountFrequency (%)
] 11
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3703
53.5%
Latin 3039
43.9%
Hangul 184
 
2.7%

Most frequent character per script

Common
ValueCountFrequency (%)
. 919
24.8%
2 632
17.1%
; 382
10.3%
: 353
 
9.5%
300
 
8.1%
1 249
 
6.7%
0 217
 
5.9%
6 142
 
3.8%
3 123
 
3.3%
4 105
 
2.8%
Other values (10) 281
 
7.6%
Latin
ValueCountFrequency (%)
l 843
27.7%
c 613
20.2%
m 382
12.6%
p 363
11.9%
i 306
 
10.1%
o 231
 
7.6%
D 59
 
1.9%
C 57
 
1.9%
M 55
 
1.8%
O 55
 
1.8%
Other values (3) 75
 
2.5%
Hangul
ValueCountFrequency (%)
58
31.5%
48
26.1%
37
20.1%
11
 
6.0%
10
 
5.4%
10
 
5.4%
4
 
2.2%
2
 
1.1%
1
 
0.5%
1
 
0.5%
Other values (2) 2
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6740
97.3%
Hangul 184
 
2.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 919
13.6%
l 843
12.5%
2 632
 
9.4%
c 613
 
9.1%
; 382
 
5.7%
m 382
 
5.7%
p 363
 
5.4%
: 353
 
5.2%
i 306
 
4.5%
300
 
4.5%
Other values (22) 1647
24.4%
Hangul
ValueCountFrequency (%)
58
31.5%
48
26.1%
37
20.1%
11
 
6.0%
10
 
5.4%
10
 
5.4%
4
 
2.2%
2
 
1.1%
1
 
0.5%
1
 
0.5%
Other values (2) 2
 
1.1%
None
ValueCountFrequency (%)
× 2
100.0%

Unnamed: 9
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
[중랑]유아자료실
273 
[중랑]종합자료실
76 
[중랑]어린이자료실
33 
<NA>
 
1
자료실명
 
1

Length

Max length10
Median length9
Mean length9.0598958
Min length4

Unique

Unique2 ?
Unique (%)0.5%

Sample

1st row<NA>
2nd row자료실명
3rd row[중랑]유아자료실
4th row[중랑]유아자료실
5th row[중랑]유아자료실

Common Values

ValueCountFrequency (%)
[중랑]유아자료실 273
71.1%
[중랑]종합자료실 76
 
19.8%
[중랑]어린이자료실 33
 
8.6%
<NA> 1
 
0.3%
자료실명 1
 
0.3%

Length

2023-12-13T05:40:09.052313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:40:09.215894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중랑]유아자료실 273
71.1%
중랑]종합자료실 76
 
19.8%
중랑]어린이자료실 33
 
8.6%
na 1
 
0.3%
자료실명 1
 
0.3%

Correlations

2023-12-13T05:40:09.358415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 9
Unnamed: 51.0000.9890.9970.977
Unnamed: 60.9891.0000.9880.998
Unnamed: 70.9970.9881.0001.000
Unnamed: 90.9770.9981.0001.000
2023-12-13T05:40:09.485087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 6Unnamed: 9
Unnamed: 61.0000.929
Unnamed: 90.9291.000
2023-12-13T05:40:09.586789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 6Unnamed: 9
Unnamed: 61.0000.929
Unnamed: 90.9291.000

Missing values

2023-12-13T05:40:02.007642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:40:02.192143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:40:02.385077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

[2014년 12월] 중랑구립정보도서관 신착자료 목록Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
0<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1No.등록번호청구기호서명저작자발행자발행년가격형태사항자료실명
21WM0000007658CB 510-K65l-v.7Skinby Cynthia Klingel, Robert B. Noyed ; photographs by Gregg AndersenWeeklyReader Early Learning Library2013, c20022520024p.:col. ill.;18cm[중랑]유아자료실
32WM0000007659CB 747-H955o-v.1-1What a mess!written by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm+CD-ROM 1매[중랑]유아자료실
43WM0000007668CB 747-H955o-v.1-10(The) headachewritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm[중랑]유아자료실
54WM0000007669CB 747-H955o-v.1-11(The) headachewritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm[중랑]유아자료실
65WM0000007670CB 747-H955o-v.1-12(The) headachewritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm[중랑]유아자료실
76WM0000007671CB 747-H955o-v.1-13Hide and seekwritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm+CD-ROM 1매[중랑]유아자료실
87WM0000007672CB 747-H955o-v.1-14Hide and seekwritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm[중랑]유아자료실
98WM0000007673CB 747-H955o-v.1-15Hide and seekwritten by Roderick Hunt ; illustrated by Alex BrychtaOxford University Press201165838p.:col. ill.;20cm[중랑]유아자료실
[2014년 12월] 중랑구립정보도서관 신착자료 목록Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
374373KM0000172833GE 372-ㅂ526ㅈ(아이의 학력, 인성, 재능을 키워주는) 작은 학교의 힘박찬영 지음시공사201413000250p.:삽도;23cm[중랑]종합자료실
375374KM0000172865GE 375.3-ㅂ184ㅇ엄마가 모르는 초등 3학년 교실 : 공부 습관이 길러지고 사회성이 자라는 교실 생활 들여다보기박관수 지음평사리201313000237p.;23cm[중랑]종합자료실
376375KM0000172873GE 377.1-ㄴ69ㄱ기업가의 방문 : 어느 기업 대학에서 생긴 일노영수 지음후마니타스201415000263p.;22cm[중랑]종합자료실
377376KM0000172799GE 377.268-ㅇ728ㄷ대학생의 진로 멘토링 = Design your career : 미래를 알면 직업이 보인다이무근, 이찬 [공] 저교육과학사201414000342p.:도표;25cm[중랑]종합자료실
378377KM0000172797GE 377.268-ㅈ612ㄴ(재미있는 진로선택) 내 인생의 네비게이션조명실 지음계명대학교 출판부201412000169p.:삽도;25cm[중랑]종합자료실
379378KM0000172863GE 453.9-ㅇ475ㄴ날씨 충격 = Weather shock : 대한민국 기후변화 탐사 리포트온케이웨더 취재팀 지음코난북스201414000269p.:삽도;23cm[중랑]종합자료실
380379KM0000172807GE 470.4-ㅂ492ㅁ모든 생명은 서로 돕는다 : 수의사 아빠가 딸에게 들려주는 생명, 공존, 생태 이야기박종무 지음리수201417900292p.:삽도, 도표;23cm[중랑]종합자료실
381380KM0000172798GE 518.44-ㄱ425ㅂ불량 제약회사 : 제약회사는 어떻게 의사를 속이고 환자에게 해를 입히는가벤 골드에이커 지음 ; 안형식, 권민 [같이] 옮김공존201422000519p.:삽도;22cm[중랑]종합자료실
382381KM0000172866GE 598.1-ㄱ867ㅇ엄마의 꿈이 아이의 인생을 결정한다 : 잃어버린, 사라져버린, 포기해버린 나를 찾아서!김윤경 지음프롬북스201413800256p.:삽도;22cm[중랑]종합자료실
383382KM0000172855GE 691.15-ㄱ424ㅁ마인크래프트 이야기 : 블록, 픽셀, 페도라, 그리고 억만장자 되기다니엘 골드버그, 리누스 라르손 공저 ; 이진복 옮김인간희극201414800263p.:삽도;23cm[중랑]종합자료실