Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory546.9 KiB
Average record size in memory56.0 B

Variable types

Categorical1
Text5

Dataset

Description경산시 소속 도서관에서 구입한 도서목록 현황 데이터로 등록번호, 서명, 저작자, 발행자, 발행년의 데이터를 포함하고 있습니다.
Author경상북도 경산시
URLhttps://www.data.go.kr/data/3033237/fileData.do

Alerts

등록번호 has unique valuesUnique

Reproduction

Analysis started2024-03-23 06:03:46.011305
Analysis finished2024-03-23 06:03:52.272280
Duration6.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
작은도서관
5052 
시립도서관
2919 
장산도서관
2029 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row작은도서관
2nd row작은도서관
3rd row시립도서관
4th row작은도서관
5th row시립도서관

Common Values

ValueCountFrequency (%)
작은도서관 5052
50.5%
시립도서관 2919
29.2%
장산도서관 2029
20.3%

Length

2024-03-23T06:03:52.503685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:03:52.825841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
작은도서관 5052
50.5%
시립도서관 2919
29.2%
장산도서관 2029
20.3%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T06:03:53.390420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters21
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowJB0000023898
2nd rowJB0000023746
3rd rowGB0000106534
4th rowJE0000009626
5th rowGB0000106914
ValueCountFrequency (%)
jb0000023898 1
 
< 0.1%
gb0000107444 1
 
< 0.1%
jc0000010505 1
 
< 0.1%
je0000009919 1
 
< 0.1%
gb0000106592 1
 
< 0.1%
gb0000105374 1
 
< 0.1%
om0000051665 1
 
< 0.1%
je0000009470 1
 
< 0.1%
gb0000107800 1
 
< 0.1%
jc0000010864 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-03-23T06:03:54.602617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 58842
49.0%
1 8226
 
6.9%
J 5052
 
4.2%
5 4922
 
4.1%
7 4831
 
4.0%
8 4255
 
3.5%
2 4024
 
3.4%
6 3928
 
3.3%
9 3897
 
3.2%
4 3693
 
3.1%
Other values (11) 18330
 
15.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
J 5052
25.3%
B 3510
17.5%
E 2925
14.6%
G 2919
14.6%
O 2029
10.1%
M 1034
 
5.2%
K 995
 
5.0%
C 611
 
3.1%
A 558
 
2.8%
R 365
 
1.8%
Decimal Number
ValueCountFrequency (%)
0 58842
58.8%
1 8226
 
8.2%
5 4922
 
4.9%
7 4831
 
4.8%
8 4255
 
4.3%
2 4024
 
4.0%
6 3928
 
3.9%
9 3897
 
3.9%
4 3693
 
3.7%
3 3382
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
J 5052
25.3%
B 3510
17.5%
E 2925
14.6%
G 2919
14.6%
O 2029
10.1%
M 1034
 
5.2%
K 995
 
5.0%
C 611
 
3.1%
A 558
 
2.8%
R 365
 
1.8%
Common
ValueCountFrequency (%)
0 58842
58.8%
1 8226
 
8.2%
5 4922
 
4.9%
7 4831
 
4.8%
8 4255
 
4.3%
2 4024
 
4.0%
6 3928
 
3.9%
9 3897
 
3.9%
4 3693
 
3.7%
3 3382
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 58842
49.0%
1 8226
 
6.9%
J 5052
 
4.2%
5 4922
 
4.1%
7 4831
 
4.0%
8 4255
 
3.5%
2 4024
 
3.4%
6 3928
 
3.3%
9 3897
 
3.2%
4 3693
 
3.1%
Other values (11) 18330
 
15.3%

서명
Text

Distinct8951
Distinct (%)89.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T06:03:55.633446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length171
Median length87
Mean length25.8054
Min length1

Characters and Unicode

Total characters258054
Distinct characters1434
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8015 ?
Unique (%)80.2%

Sample

1st row나의 봄날인 너에게 : 인생의 꽃샘추위에 지지 않는 햇살 같은 위로
2nd row(The)first journey
3rd row한국의 인어들 : 전설 설화 속 신비한 인어를 찾아서
4th row[빅북] 할머니, 어디가요? 밤 주우러 간다! : 옥이네 가을 이야기
5th row언제나 함께
ValueCountFrequency (%)
5203
 
7.9%
the 633
 
1.0%
이야기 388
 
0.6%
위한 372
 
0.6%
장편소설 291
 
0.4%
1 253
 
0.4%
2 246
 
0.4%
and 195
 
0.3%
189
 
0.3%
그림책 187
 
0.3%
Other values (21009) 58137
88.0%
2024-03-23T06:03:57.546082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60218
 
23.3%
: 4734
 
1.8%
4296
 
1.7%
4126
 
1.6%
e 3744
 
1.5%
3600
 
1.4%
, 2451
 
0.9%
o 2334
 
0.9%
a 2333
 
0.9%
2311
 
0.9%
Other values (1424) 167907
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 145745
56.5%
Space Separator 60218
23.3%
Lowercase Letter 27879
 
10.8%
Other Punctuation 10759
 
4.2%
Decimal Number 4608
 
1.8%
Uppercase Letter 4580
 
1.8%
Open Punctuation 1793
 
0.7%
Close Punctuation 1793
 
0.7%
Math Symbol 431
 
0.2%
Dash Punctuation 225
 
0.1%
Other values (6) 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4296
 
2.9%
4126
 
2.8%
3600
 
2.5%
2311
 
1.6%
2184
 
1.5%
2069
 
1.4%
2045
 
1.4%
2043
 
1.4%
1994
 
1.4%
1781
 
1.2%
Other values (1317) 119296
81.9%
Lowercase Letter
ValueCountFrequency (%)
e 3744
13.4%
o 2334
 
8.4%
a 2333
 
8.4%
t 2268
 
8.1%
n 2031
 
7.3%
r 1789
 
6.4%
i 1762
 
6.3%
s 1581
 
5.7%
h 1448
 
5.2%
l 1240
 
4.4%
Other values (16) 7349
26.4%
Uppercase Letter
ValueCountFrequency (%)
T 665
14.5%
S 372
 
8.1%
M 289
 
6.3%
A 278
 
6.1%
G 273
 
6.0%
B 265
 
5.8%
I 245
 
5.3%
P 243
 
5.3%
D 207
 
4.5%
F 195
 
4.3%
Other values (16) 1548
33.8%
Other Punctuation
ValueCountFrequency (%)
: 4734
44.0%
, 2451
22.8%
. 1590
 
14.8%
! 863
 
8.0%
? 499
 
4.6%
' 298
 
2.8%
· 221
 
2.1%
& 47
 
0.4%
% 19
 
0.2%
; 10
 
0.1%
Other values (6) 27
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 1091
23.7%
2 804
17.4%
0 720
15.6%
3 502
10.9%
5 382
 
8.3%
4 317
 
6.9%
6 245
 
5.3%
7 189
 
4.1%
9 181
 
3.9%
8 177
 
3.8%
Math Symbol
ValueCountFrequency (%)
= 335
77.7%
~ 40
 
9.3%
+ 21
 
4.9%
< 12
 
2.8%
> 11
 
2.6%
× 6
 
1.4%
| 5
 
1.2%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1663
92.7%
[ 118
 
6.6%
6
 
0.3%
4
 
0.2%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1663
92.7%
] 118
 
6.6%
6
 
0.3%
4
 
0.2%
2
 
0.1%
Other Symbol
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Letter Number
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
60218
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 225
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 145688
56.5%
Common 79844
30.9%
Latin 32465
 
12.6%
Han 57
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4296
 
2.9%
4126
 
2.8%
3600
 
2.5%
2311
 
1.6%
2184
 
1.5%
2069
 
1.4%
2045
 
1.4%
2043
 
1.4%
1994
 
1.4%
1781
 
1.2%
Other values (1273) 119239
81.8%
Latin
ValueCountFrequency (%)
e 3744
 
11.5%
o 2334
 
7.2%
a 2333
 
7.2%
t 2268
 
7.0%
n 2031
 
6.3%
r 1789
 
5.5%
i 1762
 
5.4%
s 1581
 
4.9%
h 1448
 
4.5%
l 1240
 
3.8%
Other values (44) 11935
36.8%
Common
ValueCountFrequency (%)
60218
75.4%
: 4734
 
5.9%
, 2451
 
3.1%
( 1663
 
2.1%
) 1663
 
2.1%
. 1590
 
2.0%
1 1091
 
1.4%
! 863
 
1.1%
2 804
 
1.0%
0 720
 
0.9%
Other values (43) 4047
 
5.1%
Han
ValueCountFrequency (%)
6
 
10.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
Other values (34) 34
59.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 145683
56.5%
ASCII 112029
43.4%
None 259
 
0.1%
CJK 56
 
< 0.1%
Punctuation 9
 
< 0.1%
Number Forms 6
 
< 0.1%
Compat Jamo 5
 
< 0.1%
Misc Symbols 3
 
< 0.1%
Geometric Shapes 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
60218
53.8%
: 4734
 
4.2%
e 3744
 
3.3%
, 2451
 
2.2%
o 2334
 
2.1%
a 2333
 
2.1%
t 2268
 
2.0%
n 2031
 
1.8%
r 1789
 
1.6%
i 1762
 
1.6%
Other values (77) 28365
25.3%
Hangul
ValueCountFrequency (%)
4296
 
2.9%
4126
 
2.8%
3600
 
2.5%
2311
 
1.6%
2184
 
1.5%
2069
 
1.4%
2045
 
1.4%
2043
 
1.4%
1994
 
1.4%
1781
 
1.2%
Other values (1268) 119234
81.8%
None
ValueCountFrequency (%)
· 221
85.3%
× 6
 
2.3%
6
 
2.3%
6
 
2.3%
5
 
1.9%
4
 
1.5%
4
 
1.5%
2
 
0.8%
2
 
0.8%
1
 
0.4%
Other values (2) 2
 
0.8%
CJK
ValueCountFrequency (%)
6
 
10.7%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
1
 
1.8%
Other values (33) 33
58.9%
Number Forms
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Punctuation
ValueCountFrequency (%)
3
33.3%
3
33.3%
3
33.3%
Misc Symbols
ValueCountFrequency (%)
3
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct7530
Distinct (%)75.3%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-23T06:03:58.472925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length695
Median length121
Mean length19.109111
Min length2

Characters and Unicode

Total characters191072
Distinct characters995
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6521 ?
Unique (%)65.2%

Sample

1st row정혜영 지음
2nd rowDavid Shannon,Molly Bang
3rd row글쓴이: 차율이 ; 그린이: 가지
4th row조혜란 그리고 씀
5th row프셰므스와브 베흐테로비츠 글 ; 에밀리아 지우바크 그림 ; 초록햇비 옮김
ValueCountFrequency (%)
지음 3858
 
8.6%
3573
 
8.0%
옮김 2713
 
6.0%
그림 2512
 
5.6%
by 2094
 
4.7%
1413
 
3.1%
roderick 608
 
1.4%
brychta 593
 
1.3%
글·그림 528
 
1.2%
alex 504
 
1.1%
Other values (12085) 26477
59.0%
2024-03-23T06:04:00.056066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35611
 
18.6%
; 7061
 
3.7%
6079
 
3.2%
5081
 
2.7%
t 4926
 
2.6%
4869
 
2.5%
e 4458
 
2.3%
r 4016
 
2.1%
3614
 
1.9%
3551
 
1.9%
Other values (985) 111806
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 91128
47.7%
Lowercase Letter 45863
24.0%
Space Separator 35611
 
18.6%
Other Punctuation 11726
 
6.1%
Uppercase Letter 6176
 
3.2%
Close Punctuation 222
 
0.1%
Open Punctuation 222
 
0.1%
Dash Punctuation 64
 
< 0.1%
Decimal Number 31
 
< 0.1%
Math Symbol 28
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6079
 
6.7%
5081
 
5.6%
4869
 
5.3%
3614
 
4.0%
3551
 
3.9%
3481
 
3.8%
3135
 
3.4%
2920
 
3.2%
1647
 
1.8%
1367
 
1.5%
Other values (903) 55384
60.8%
Lowercase Letter
ValueCountFrequency (%)
t 4926
10.7%
e 4458
 
9.7%
r 4016
 
8.8%
a 3513
 
7.7%
l 3371
 
7.4%
i 3320
 
7.2%
y 3193
 
7.0%
n 2841
 
6.2%
b 2304
 
5.0%
o 1987
 
4.3%
Other values (16) 11934
26.0%
Uppercase Letter
ValueCountFrequency (%)
B 818
13.2%
A 808
13.1%
R 792
12.8%
H 741
12.0%
S 509
8.2%
M 355
 
5.7%
J 336
 
5.4%
D 244
 
4.0%
C 225
 
3.6%
G 179
 
2.9%
Other values (15) 1169
18.9%
Other Punctuation
ValueCountFrequency (%)
; 7061
60.2%
, 2357
 
20.1%
: 1192
 
10.2%
· 688
 
5.9%
. 407
 
3.5%
? 9
 
0.1%
' 5
 
< 0.1%
& 4
 
< 0.1%
/ 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 7
22.6%
1 7
22.6%
3 6
19.4%
0 3
9.7%
5 2
 
6.5%
4 2
 
6.5%
9 2
 
6.5%
7 1
 
3.2%
8 1
 
3.2%
Math Symbol
ValueCountFrequency (%)
< 13
46.4%
> 13
46.4%
1
 
3.6%
| 1
 
3.6%
Close Punctuation
ValueCountFrequency (%)
] 199
89.6%
) 21
 
9.5%
2
 
0.9%
Open Punctuation
ValueCountFrequency (%)
[ 199
89.6%
( 21
 
9.5%
2
 
0.9%
Space Separator
ValueCountFrequency (%)
35611
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 91097
47.7%
Latin 52039
27.2%
Common 47905
25.1%
Han 31
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6079
 
6.7%
5081
 
5.6%
4869
 
5.3%
3614
 
4.0%
3551
 
3.9%
3481
 
3.8%
3135
 
3.4%
2920
 
3.2%
1647
 
1.8%
1367
 
1.5%
Other values (876) 55353
60.8%
Latin
ValueCountFrequency (%)
t 4926
 
9.5%
e 4458
 
8.6%
r 4016
 
7.7%
a 3513
 
6.8%
l 3371
 
6.5%
i 3320
 
6.4%
y 3193
 
6.1%
n 2841
 
5.5%
b 2304
 
4.4%
o 1987
 
3.8%
Other values (41) 18110
34.8%
Common
ValueCountFrequency (%)
35611
74.3%
; 7061
 
14.7%
, 2357
 
4.9%
: 1192
 
2.5%
· 688
 
1.4%
. 407
 
0.8%
] 199
 
0.4%
[ 199
 
0.4%
- 64
 
0.1%
( 21
 
< 0.1%
Other values (21) 106
 
0.2%
Han
ValueCountFrequency (%)
3
 
9.7%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (17) 17
54.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99250
51.9%
Hangul 91096
47.7%
None 693
 
0.4%
CJK 31
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35611
35.9%
; 7061
 
7.1%
t 4926
 
5.0%
e 4458
 
4.5%
r 4016
 
4.0%
a 3513
 
3.5%
l 3371
 
3.4%
i 3320
 
3.3%
y 3193
 
3.2%
n 2841
 
2.9%
Other values (67) 26940
27.1%
Hangul
ValueCountFrequency (%)
6079
 
6.7%
5081
 
5.6%
4869
 
5.3%
3614
 
4.0%
3551
 
3.9%
3481
 
3.8%
3135
 
3.4%
2920
 
3.2%
1647
 
1.8%
1367
 
1.5%
Other values (875) 55352
60.8%
None
ValueCountFrequency (%)
· 688
99.3%
2
 
0.3%
2
 
0.3%
1
 
0.1%
CJK
ValueCountFrequency (%)
3
 
9.7%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (17) 17
54.8%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct2254
Distinct (%)22.5%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-23T06:04:00.746689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length44
Mean length7.9963993
Min length1

Characters and Unicode

Total characters79948
Distinct characters706
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1112 ?
Unique (%)11.1%

Sample

1st row놀(다산북스)
2nd rowScholastic
3rd row고래가숨쉬는도서관
4th row보리
5th row노랑꼬리별
ValueCountFrequency (%)
oxford 858
 
6.1%
press 820
 
5.8%
university 795
 
5.6%
books 212
 
1.5%
위즈덤하우스 150
 
1.1%
서울문화사 147
 
1.0%
창비 132
 
0.9%
문학동네 125
 
0.9%
아울북 121
 
0.9%
비룡소 115
 
0.8%
Other values (2237) 10661
75.4%
2024-03-23T06:04:02.148099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4151
 
5.2%
s 3257
 
4.1%
r 3150
 
3.9%
o 2457
 
3.1%
i 2430
 
3.0%
e 2422
 
3.0%
1856
 
2.3%
1554
 
1.9%
n 1552
 
1.9%
1547
 
1.9%
Other values (696) 55572
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42007
52.5%
Lowercase Letter 25158
31.5%
Uppercase Letter 5469
 
6.8%
Space Separator 4151
 
5.2%
Other Punctuation 1614
 
2.0%
Open Punctuation 662
 
0.8%
Close Punctuation 662
 
0.8%
Decimal Number 216
 
0.3%
Dash Punctuation 6
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1856
 
4.4%
1554
 
3.7%
1547
 
3.7%
1336
 
3.2%
1111
 
2.6%
805
 
1.9%
754
 
1.8%
613
 
1.5%
538
 
1.3%
532
 
1.3%
Other values (618) 31361
74.7%
Lowercase Letter
ValueCountFrequency (%)
s 3257
12.9%
r 3150
12.5%
o 2457
9.8%
i 2430
9.7%
e 2422
9.6%
n 1552
 
6.2%
t 1286
 
5.1%
d 1159
 
4.6%
f 949
 
3.8%
y 888
 
3.5%
Other values (16) 5608
22.3%
Uppercase Letter
ValueCountFrequency (%)
O 957
17.5%
P 878
16.1%
U 809
14.8%
B 456
8.3%
S 401
7.3%
H 306
 
5.6%
R 261
 
4.8%
C 158
 
2.9%
M 136
 
2.5%
K 127
 
2.3%
Other values (15) 980
17.9%
Decimal Number
ValueCountFrequency (%)
2 105
48.6%
1 73
33.8%
6 11
 
5.1%
4 9
 
4.2%
8 7
 
3.2%
9 6
 
2.8%
7 2
 
0.9%
5 1
 
0.5%
0 1
 
0.5%
3 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
: 1436
89.0%
& 65
 
4.0%
. 29
 
1.8%
· 22
 
1.4%
' 17
 
1.1%
16
 
1.0%
; 12
 
0.7%
, 11
 
0.7%
/ 6
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 661
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 661
99.8%
] 1
 
0.2%
Space Separator
ValueCountFrequency (%)
4151
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41999
52.5%
Latin 30627
38.3%
Common 7313
 
9.1%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1856
 
4.4%
1554
 
3.7%
1547
 
3.7%
1336
 
3.2%
1111
 
2.6%
805
 
1.9%
754
 
1.8%
613
 
1.5%
538
 
1.3%
532
 
1.3%
Other values (612) 31353
74.7%
Latin
ValueCountFrequency (%)
s 3257
 
10.6%
r 3150
 
10.3%
o 2457
 
8.0%
i 2430
 
7.9%
e 2422
 
7.9%
n 1552
 
5.1%
t 1286
 
4.2%
d 1159
 
3.8%
O 957
 
3.1%
f 949
 
3.1%
Other values (41) 11008
35.9%
Common
ValueCountFrequency (%)
4151
56.8%
: 1436
 
19.6%
( 661
 
9.0%
) 661
 
9.0%
2 105
 
1.4%
1 73
 
1.0%
& 65
 
0.9%
. 29
 
0.4%
· 22
 
0.3%
' 17
 
0.2%
Other values (16) 93
 
1.3%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41993
52.5%
ASCII 37902
47.4%
None 39
 
< 0.1%
CJK 9
 
< 0.1%
Compat Jamo 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4151
 
11.0%
s 3257
 
8.6%
r 3150
 
8.3%
o 2457
 
6.5%
i 2430
 
6.4%
e 2422
 
6.4%
n 1552
 
4.1%
: 1436
 
3.8%
t 1286
 
3.4%
d 1159
 
3.1%
Other values (65) 14602
38.5%
Hangul
ValueCountFrequency (%)
1856
 
4.4%
1554
 
3.7%
1547
 
3.7%
1336
 
3.2%
1111
 
2.6%
805
 
1.9%
754
 
1.8%
613
 
1.5%
538
 
1.3%
532
 
1.3%
Other values (607) 31347
74.6%
None
ValueCountFrequency (%)
· 22
56.4%
16
41.0%
1
 
2.6%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Compat Jamo
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Distinct52
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T06:04:02.520913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length4
Mean length4.0073
Min length4

Characters and Unicode

Total characters40073
Distinct characters16
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)0.2%

Sample

1st row2023
2nd row2022
3rd row2022
4th row2022
5th row2022
ValueCountFrequency (%)
2022 6661
66.6%
2023 2368
 
23.7%
2021 410
 
4.1%
2020 156
 
1.6%
2019 108
 
1.1%
2018 70
 
0.7%
2017 34
 
0.3%
2016 27
 
0.3%
2015 24
 
0.2%
2012 21
 
0.2%
Other values (31) 122
 
1.2%
2024-03-23T06:04:03.339774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 26272
65.6%
0 10209
 
25.5%
3 2393
 
6.0%
1 771
 
1.9%
9 143
 
0.4%
8 82
 
0.2%
7 50
 
0.1%
6 41
 
0.1%
5 27
 
0.1%
4 20
 
< 0.1%
Other values (6) 65
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40008
99.8%
Other Punctuation 21
 
0.1%
Open Punctuation 16
 
< 0.1%
Close Punctuation 16
 
< 0.1%
Lowercase Letter 11
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 26272
65.7%
0 10209
 
25.5%
3 2393
 
6.0%
1 771
 
1.9%
9 143
 
0.4%
8 82
 
0.2%
7 50
 
0.1%
6 41
 
0.1%
5 27
 
0.1%
4 20
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 19
90.5%
, 2
 
9.5%
Open Punctuation
ValueCountFrequency (%)
[ 16
100.0%
Close Punctuation
ValueCountFrequency (%)
] 16
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 11
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40062
> 99.9%
Latin 11
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 26272
65.6%
0 10209
 
25.5%
3 2393
 
6.0%
1 771
 
1.9%
9 143
 
0.4%
8 82
 
0.2%
7 50
 
0.1%
6 41
 
0.1%
5 27
 
0.1%
4 20
 
< 0.1%
Other values (5) 54
 
0.1%
Latin
ValueCountFrequency (%)
c 11
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40073
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 26272
65.6%
0 10209
 
25.5%
3 2393
 
6.0%
1 771
 
1.9%
9 143
 
0.4%
8 82
 
0.2%
7 50
 
0.1%
6 41
 
0.1%
5 27
 
0.1%
4 20
 
< 0.1%
Other values (6) 65
 
0.2%

Correlations

2024-03-23T06:04:03.687579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분발행년
구분1.0000.501
발행년0.5011.000

Missing values

2024-03-23T06:03:51.453735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:03:51.786942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T06:03:52.057056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분등록번호서명저작자발행자발행년
7177작은도서관JB0000023898나의 봄날인 너에게 : 인생의 꽃샘추위에 지지 않는 햇살 같은 위로정혜영 지음놀(다산북스)2023
7025작은도서관JB0000023746(The)first journeyDavid Shannon,Molly BangScholastic2022
2163시립도서관GB0000106534한국의 인어들 : 전설 설화 속 신비한 인어를 찾아서글쓴이: 차율이 ; 그린이: 가지고래가숨쉬는도서관2022
10418작은도서관JE0000009626[빅북] 할머니, 어디가요? 밤 주우러 간다! : 옥이네 가을 이야기조혜란 그리고 씀보리2022
2543시립도서관GB0000106914언제나 함께프셰므스와브 베흐테로비츠 글 ; 에밀리아 지우바크 그림 ; 초록햇비 옮김노랑꼬리별2022
10077작은도서관JE0000009285음식을 먹으면 어디로 갈까요? : 우리 아이 첫 과학책 : 소화와 영양소케이티 데이니스 글;댄 테일러 그림;신인수 옮김어스본코리아2022
3216시립도서관GB0000107587배드 가이즈. 10, 끝까지 뭉친다에런 블레이비 지음 ; 신수진 옮김비룡소2022
8054작은도서관JC0000010918파리의 미술관 = Les Mus?es ? Paris : 루브르에서 퐁피두까지 가장 아름다운 파리를 만나는 시간이혜준,임현승,정희태,최준호 지음 ;욘즈 일러스트Clove(클로브)2023
12095작은도서관JR0000007652Land of lettersby Roderick Hunt;illustrated by Alex BrychtaOxford University Press2022
5574장산도서관OM0000051685그래, 언젠가는김해우 글단비청소년2022
구분등록번호서명저작자발행자발행년
1137시립도서관GB0000105507자유죽음 : 살아가면서 선택할 수 있는 유일한 것에 대하여장 아메리 지음 ; 김희상 옮김위즈덤하우스2022
12035작은도서관JR0000007592(The)Minibeast zoowritten by Roderick Hunt,Alex BrychtaOxford University Press2022
6261작은도서관JA0000027969(초등학생이 알아야 할 참 쉬운)수학사라 헐,톰 뭄브레이 글;폴 보스턴 그림;송지혜 옮김UsborneKorea :비룡소인터내셔널2023
6621작은도서관JA0000028329선생님은 무섭단 말이야!안수민 글;김성영 그림리틀씨앤톡 :씨앤톡2021
6806작은도서관JB0000023527한 달 만에 블로그 일 방문자 수 1,000명 만들기권호영 지음;Freepik 일러스트푸른향기2023
1393시립도서관GB0000105763성공하려면 습관을 바꿔라 : 자신의 습관이 인생을 바꾼다이범준 엮음매월당2022
3948장산도서관OK0000000428산타클로스와 산타 마을의 일 년마우리 쿤나스 글·그림 ; 페트리 칼리올라 옮김북뱅크2022
1116시립도서관GB0000105486Little giraffe's big ideaJoshua George 외Little Hippo2018
5605장산도서관OM0000051716바람 불고 고요한 : 김명리 시집김명리 지음문학동네2022
3026시립도서관GB0000107397똑똑, 우리는 매일 문을 엽니다아네스 드 레스트라드 글 ; 마갈리 뒬랭 그림 ; 이정주 옮김씨드북2022