Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 29
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 781.2 KiB
Average record size in memory: 80.0 B
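
The derived figures in this table follow from the raw counts. The sketch below recomputes them; the variable names are illustrative and not part of the report, and the exact rounding the profiler applies is an assumption.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 9
n_observations = 10_000
missing_cells = 29
total_size_kib = 781.2

# Every cell is one (row, column) pair.
total_cells = n_variables * n_observations
# Share of cells that are missing, in percent.
missing_pct = 100 * missing_cells / total_cells
# Average memory footprint of one row, in bytes (1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.3f}")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The missing share comes out near 0.03%, which the report displays as "< 0.1%".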

Variable types

Categorical: 2
Text: 6
DateTime: 1

Dataset

Description: Provides the catalog and detailed information (title, author, publisher, etc.) for the holdings of the three municipal libraries located in Guri-si, Gyeonggi-do.
URL: https://www.data.go.kr/data/3038636/fileData.do

Alerts

도서관명 has constant value "" (Constant)
관리기관명 has constant value "" (Constant)
데이터기준일자 has constant value "" (Constant)
등록번호 has unique values (Unique)
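
As a rough illustration of what these alerts mean, the sketch below flags a column as constant when it holds one distinct value and as unique when every value is distinct. The function `classify_column` is hypothetical and is not the profiler's implementation; the input values are the five sample rows shown later in this report.

```python
def classify_column(values):
    """Return 'Constant', 'Unique', or None for a list of non-missing values."""
    distinct = len(set(values))
    if distinct == 1:
        return "Constant"   # a single distinct value across all rows
    if distinct == len(values):
        return "Unique"     # no value is repeated
    return None             # neither alert applies


# First five sample rows of two columns, copied from the report.
library_names = ["인창도서관"] * 5
registration_numbers = [
    "EM0000003380", "EM0000036353", "CM0000055167",
    "DE0000010191", "CR0000002467",
]

print(classify_column(library_names))
print(classify_column(registration_numbers))
```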

Reproduction

Analysis started: 2023-12-12 05:43:16.987183
Analysis finished: 2023-12-12 05:43:20.333830
Duration: 3.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
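
The reported duration is the difference between the two timestamps. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" section.
started = datetime.strptime("2023-12-12 05:43:16.987183", FMT)
finished = datetime.strptime("2023-12-12 05:43:20.333830", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```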

Variables

도서관명 (library name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
인창도서관: 10000

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인창도서관
2nd row: 인창도서관
3rd row: 인창도서관
4th row: 인창도서관
5th row: 인창도서관

Common Values

Value | Count | Frequency (%)
인창도서관 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
인창도서관 | 10000 | 100.0%

등록번호 (registration number)
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 15
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: EM0000003380
2nd row: EM0000036353
3rd row: CM0000055167
4th row: DE0000010191
5th row: CR0000002467

Value | Count | Frequency (%)
em0000003380 | 1 | < 0.1%
de0000006006 | 1 | < 0.1%
em0000027905 | 1 | < 0.1%
cr0000007915 | 1 | < 0.1%
cr0000005429 | 1 | < 0.1%
em0000003408 | 1 | < 0.1%
em0000039483 | 1 | < 0.1%
em0000008722 | 1 | < 0.1%
em0000000470 | 1 | < 0.1%
em0000049603 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

ValueCountFrequency (%)
0 57087
47.6%
M 7095
 
5.9%
E 6925
 
5.8%
1 6127
 
5.1%
2 5317
 
4.4%
4 5098
 
4.2%
3 4984
 
4.2%
5 4428
 
3.7%
9 4398
 
3.7%
6 4338
 
3.6%
Other values (5) 14203
 
11.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%
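
The category split above can be approximated with Python's standard `unicodedata` module. This is a sketch over the five sample registration numbers only, not the profiler's own code; the mapping from two-letter category codes to the names used in the report is written out by hand.

```python
import unicodedata
from collections import Counter

# Sample registration numbers copied from the report.
samples = [
    "EM0000003380", "EM0000036353", "CM0000055167",
    "DE0000010191", "CR0000002467",
]

# Hand-written mapping for the two general categories that occur here.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Lu": "Uppercase Letter"}

# Count the Unicode general category of every character.
counts = Counter(unicodedata.category(ch) for value in samples for ch in value)
total = sum(counts.values())

for code, count in counts.most_common():
    share = 100 * count / total
    print(f"{CATEGORY_NAMES.get(code, code)}: {count} ({share:.1f}%)")
```

Each 12-character value consists of two uppercase letters followed by ten digits, so the sample reproduces the 83.3% / 16.7% split reported for the full column.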

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 57087
57.1%
1 6127
 
6.1%
2 5317
 
5.3%
4 5098
 
5.1%
3 4984
 
5.0%
5 4428
 
4.4%
9 4398
 
4.4%
6 4338
 
4.3%
8 4152
 
4.2%
7 4071
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
M 7095
35.5%
E 6925
34.6%
C 2980
14.9%
D 2086
 
10.4%
R 914
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 57087
57.1%
1 6127
 
6.1%
2 5317
 
5.3%
4 5098
 
5.1%
3 4984
 
5.0%
5 4428
 
4.4%
9 4398
 
4.4%
6 4338
 
4.3%
8 4152
 
4.2%
7 4071
 
4.1%
Latin
ValueCountFrequency (%)
M 7095
35.5%
E 6925
34.6%
C 2980
14.9%
D 2086
 
10.4%
R 914
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 57087
47.6%
M 7095
 
5.9%
E 6925
 
5.8%
1 6127
 
5.1%
2 5317
 
4.4%
4 5098
 
4.2%
3 4984
 
4.2%
5 4428
 
3.7%
9 4398
 
3.7%
6 4338
 
3.6%
Other values (5) 14203
 
11.8%

서명 (title)
Text

Distinct: 9939
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 233
Median length: 124
Mean length: 16.2905
Min length: 1

Characters and Unicode

Total characters: 162905
Distinct characters: 2052
Distinct categories: 15
Distinct scripts: 6
Distinct blocks: 14
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9880
Unique (%): 98.8%

Sample

1st row: 우주로의 여행. 1
2nd row: 교육 및 상담 심리학
3rd row: 1학년 백점국어
4th row: 우호적인 무관심
5th row: 이집트 왕자
ValueCountFrequency (%)
2521
 
6.2%
1 412
 
1.0%
2 383
 
0.9%
이야기 260
 
0.6%
3 177
 
0.4%
the 169
 
0.4%
위한 144
 
0.4%
4 116
 
0.3%
of 116
 
0.3%
역사 110
 
0.3%
Other values (19719) 36032
89.1%

Most occurring characters

ValueCountFrequency (%)
30977
 
19.0%
3112
 
1.9%
2455
 
1.5%
: 2221
 
1.4%
. 1974
 
1.2%
e 1899
 
1.2%
1756
 
1.1%
( 1667
 
1.0%
) 1666
 
1.0%
1646
 
1.0%
Other values (2042) 113532
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99387
61.0%
Space Separator 30977
 
19.0%
Lowercase Letter 15133
 
9.3%
Other Punctuation 6180
 
3.8%
Decimal Number 4994
 
3.1%
Uppercase Letter 2093
 
1.3%
Open Punctuation 1746
 
1.1%
Close Punctuation 1745
 
1.1%
Math Symbol 344
 
0.2%
Dash Punctuation 231
 
0.1%
Other values (5) 75
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3112
 
3.1%
2455
 
2.5%
1756
 
1.8%
1646
 
1.7%
1516
 
1.5%
1434
 
1.4%
1415
 
1.4%
1199
 
1.2%
1189
 
1.2%
1136
 
1.1%
Other values (1900) 82529
83.0%
Lowercase Letter
ValueCountFrequency (%)
e 1899
12.5%
o 1311
 
8.7%
a 1274
 
8.4%
i 1158
 
7.7%
r 1121
 
7.4%
t 1075
 
7.1%
n 1072
 
7.1%
s 1027
 
6.8%
l 633
 
4.2%
d 607
 
4.0%
Other values (31) 3956
26.1%
Uppercase Letter
ValueCountFrequency (%)
S 208
 
9.9%
T 183
 
8.7%
A 159
 
7.6%
C 135
 
6.5%
D 119
 
5.7%
B 118
 
5.6%
E 101
 
4.8%
I 100
 
4.8%
M 97
 
4.6%
O 97
 
4.6%
Other values (30) 776
37.1%
Other Punctuation
ValueCountFrequency (%)
: 2221
35.9%
. 1974
31.9%
, 1164
18.8%
! 328
 
5.3%
· 267
 
4.3%
' 108
 
1.7%
& 34
 
0.6%
/ 25
 
0.4%
; 19
 
0.3%
10
 
0.2%
Other values (7) 30
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 1305
26.1%
2 1087
21.8%
0 817
16.4%
3 443
 
8.9%
5 320
 
6.4%
4 286
 
5.7%
9 226
 
4.5%
6 196
 
3.9%
7 160
 
3.2%
8 154
 
3.1%
Math Symbol
ValueCountFrequency (%)
= 259
75.3%
~ 60
 
17.4%
+ 16
 
4.7%
5
 
1.5%
× 2
 
0.6%
< 1
 
0.3%
> 1
 
0.3%
Letter Number
ValueCountFrequency (%)
23
35.9%
21
32.8%
13
20.3%
3
 
4.7%
2
 
3.1%
1
 
1.6%
1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 1667
95.5%
[ 64
 
3.7%
8
 
0.5%
4
 
0.2%
3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1666
95.5%
] 64
 
3.7%
8
 
0.5%
4
 
0.2%
3
 
0.2%
Other Symbol
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Number
ValueCountFrequency (%)
2
66.7%
² 1
33.3%
Space Separator
ValueCountFrequency (%)
30977
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 231
100.0%
Format
ValueCountFrequency (%)
­ 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96965
59.5%
Common 46228
28.4%
Latin 17243
 
10.6%
Han 2416
 
1.5%
Cyrillic 47
 
< 0.1%
Hiragana 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3112
 
3.2%
2455
 
2.5%
1756
 
1.8%
1646
 
1.7%
1516
 
1.6%
1434
 
1.5%
1415
 
1.5%
1199
 
1.2%
1189
 
1.2%
1136
 
1.2%
Other values (1229) 80107
82.6%
Han
ValueCountFrequency (%)
83
 
3.4%
60
 
2.5%
41
 
1.7%
41
 
1.7%
39
 
1.6%
36
 
1.5%
33
 
1.4%
32
 
1.3%
29
 
1.2%
28
 
1.2%
Other values (657) 1994
82.5%
Latin
ValueCountFrequency (%)
e 1899
 
11.0%
o 1311
 
7.6%
a 1274
 
7.4%
i 1158
 
6.7%
r 1121
 
6.5%
t 1075
 
6.2%
n 1072
 
6.2%
s 1027
 
6.0%
l 633
 
3.7%
d 607
 
3.5%
Other values (49) 6066
35.2%
Common
ValueCountFrequency (%)
30977
67.0%
: 2221
 
4.8%
. 1974
 
4.3%
( 1667
 
3.6%
) 1666
 
3.6%
1 1305
 
2.8%
, 1164
 
2.5%
2 1087
 
2.4%
0 817
 
1.8%
3 443
 
1.0%
Other values (44) 2907
 
6.3%
Cyrillic
ValueCountFrequency (%)
а 4
 
8.5%
С 4
 
8.5%
и 3
 
6.4%
К 3
 
6.4%
О 3
 
6.4%
Р 3
 
6.4%
Й 2
 
4.3%
А 2
 
4.3%
в 2
 
4.3%
л 2
 
4.3%
Other values (19) 19
40.4%
Hiragana
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96951
59.5%
ASCII 63069
38.7%
CJK 2362
 
1.4%
None 321
 
0.2%
Number Forms 64
 
< 0.1%
CJK Compat Ideographs 54
 
< 0.1%
Cyrillic 47
 
< 0.1%
Compat Jamo 14
 
< 0.1%
Punctuation 6
 
< 0.1%
Hiragana 6
 
< 0.1%
Other values (4) 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30977
49.1%
: 2221
 
3.5%
. 1974
 
3.1%
e 1899
 
3.0%
( 1667
 
2.6%
) 1666
 
2.6%
o 1311
 
2.1%
1 1305
 
2.1%
a 1274
 
2.0%
, 1164
 
1.8%
Other values (76) 17611
27.9%
Hangul
ValueCountFrequency (%)
3112
 
3.2%
2455
 
2.5%
1756
 
1.8%
1646
 
1.7%
1516
 
1.6%
1434
 
1.5%
1415
 
1.5%
1199
 
1.2%
1189
 
1.2%
1136
 
1.2%
Other values (1222) 80093
82.6%
None
ValueCountFrequency (%)
· 267
83.2%
10
 
3.1%
8
 
2.5%
8
 
2.5%
6
 
1.9%
4
 
1.2%
4
 
1.2%
3
 
0.9%
3
 
0.9%
­ 3
 
0.9%
Other values (3) 5
 
1.6%
CJK
ValueCountFrequency (%)
83
 
3.5%
60
 
2.5%
41
 
1.7%
41
 
1.7%
39
 
1.7%
36
 
1.5%
33
 
1.4%
32
 
1.4%
29
 
1.2%
28
 
1.2%
Other values (628) 1940
82.1%
Number Forms
ValueCountFrequency (%)
23
35.9%
21
32.8%
13
20.3%
3
 
4.7%
2
 
3.1%
1
 
1.6%
1
 
1.6%
CJK Compat Ideographs
ValueCountFrequency (%)
7
 
13.0%
5
 
9.3%
4
 
7.4%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
2
 
3.7%
2
 
3.7%
2
 
3.7%
Other values (19) 21
38.9%
Punctuation
ValueCountFrequency (%)
6
100.0%
Math Operators
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
5
35.7%
2
 
14.3%
2
 
14.3%
2
 
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Cyrillic
ValueCountFrequency (%)
а 4
 
8.5%
С 4
 
8.5%
и 3
 
6.4%
К 3
 
6.4%
О 3
 
6.4%
Р 3
 
6.4%
Й 2
 
4.3%
А 2
 
4.3%
в 2
 
4.3%
л 2
 
4.3%
Other values (19) 19
40.4%
Hiragana
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%
Enclosed Alphanum
ValueCountFrequency (%)
2
66.7%
1
33.3%
Misc Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

저자 (author)
Text

Distinct: 9026
Distinct (%): 90.5%
Missing: 26
Missing (%): 0.3%
Memory size: 156.2 KiB

Length

Max length: 125
Median length: 89
Mean length: 14.687588
Min length: 2

Characters and Unicode

Total characters: 146494
Distinct characters: 1546
Distinct categories: 11
Distinct scripts: 7
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8462
Unique (%): 84.8%

Sample

1st row: Andrew Fraknoi;David Morrison;Sidney Wolff [공]저;윤홍식...[등]역
2nd row: 이용남...[등]편
3rd row: 서지원 글 ; 이상미 그림
4th row: 최윤정 지음
5th row: 브렌다 챔프만
ValueCountFrequency (%)
4851
 
12.0%
지음 3905
 
9.7%
옮김 2226
 
5.5%
그림 1348
 
3.3%
1110
 
2.7%
614
 
1.5%
감독 555
 
1.4%
405
 
1.0%
by 332
 
0.8%
308
 
0.8%
Other values (14343) 24804
61.3%

Most occurring characters

ValueCountFrequency (%)
31207
 
21.3%
; 5849
 
4.0%
5253
 
3.6%
4992
 
3.4%
4223
 
2.9%
2719
 
1.9%
2341
 
1.6%
[ 2080
 
1.4%
] 2078
 
1.4%
1815
 
1.2%
Other values (1536) 83937
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89000
60.8%
Space Separator 31207
 
21.3%
Lowercase Letter 11500
 
7.9%
Other Punctuation 8114
 
5.5%
Uppercase Letter 2376
 
1.6%
Open Punctuation 2088
 
1.4%
Close Punctuation 2086
 
1.4%
Decimal Number 60
 
< 0.1%
Dash Punctuation 52
 
< 0.1%
Math Symbol 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5253
 
5.9%
4992
 
5.6%
4223
 
4.7%
2719
 
3.1%
2341
 
2.6%
1815
 
2.0%
1726
 
1.9%
1598
 
1.8%
1357
 
1.5%
1221
 
1.4%
Other values (1443) 61755
69.4%
Lowercase Letter
ValueCountFrequency (%)
e 1245
10.8%
a 1053
 
9.2%
r 985
 
8.6%
t 977
 
8.5%
i 920
 
8.0%
n 889
 
7.7%
l 751
 
6.5%
o 697
 
6.1%
y 572
 
5.0%
s 501
 
4.4%
Other values (23) 2910
25.3%
Uppercase Letter
ValueCountFrequency (%)
B 222
 
9.3%
S 201
 
8.5%
A 200
 
8.4%
R 174
 
7.3%
J 169
 
7.1%
M 160
 
6.7%
C 149
 
6.3%
H 141
 
5.9%
D 126
 
5.3%
L 96
 
4.0%
Other values (19) 738
31.1%
Other Punctuation
ValueCountFrequency (%)
; 5849
72.1%
. 1192
 
14.7%
, 499
 
6.1%
· 307
 
3.8%
: 230
 
2.8%
/ 19
 
0.2%
& 7
 
0.1%
' 6
 
0.1%
3
 
< 0.1%
2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 19
31.7%
1 16
26.7%
0 8
13.3%
4 5
 
8.3%
8 5
 
8.3%
5 4
 
6.7%
9 1
 
1.7%
3 1
 
1.7%
6 1
 
1.7%
Open Punctuation
ValueCountFrequency (%)
[ 2080
99.6%
( 7
 
0.3%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 2078
99.6%
) 7
 
0.3%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
< 4
40.0%
> 4
40.0%
+ 2
20.0%
Space Separator
ValueCountFrequency (%)
31207
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87004
59.4%
Common 43618
29.8%
Latin 13866
 
9.5%
Han 1974
 
1.3%
Katakana 12
 
< 0.1%
Hiragana 10
 
< 0.1%
Cyrillic 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5253
 
6.0%
4992
 
5.7%
4223
 
4.9%
2719
 
3.1%
2341
 
2.7%
1815
 
2.1%
1726
 
2.0%
1598
 
1.8%
1357
 
1.6%
1221
 
1.4%
Other values (889) 59759
68.7%
Han
ValueCountFrequency (%)
197
 
10.0%
87
 
4.4%
68
 
3.4%
63
 
3.2%
62
 
3.1%
36
 
1.8%
24
 
1.2%
24
 
1.2%
22
 
1.1%
21
 
1.1%
Other values (525) 1370
69.4%
Latin
ValueCountFrequency (%)
e 1245
 
9.0%
a 1053
 
7.6%
r 985
 
7.1%
t 977
 
7.0%
i 920
 
6.6%
n 889
 
6.4%
l 751
 
5.4%
o 697
 
5.0%
y 572
 
4.1%
s 501
 
3.6%
Other values (42) 5276
38.0%
Common
ValueCountFrequency (%)
31207
71.5%
; 5849
 
13.4%
[ 2080
 
4.8%
] 2078
 
4.8%
. 1192
 
2.7%
, 499
 
1.1%
· 307
 
0.7%
: 230
 
0.5%
- 52
 
0.1%
2 19
 
< 0.1%
Other values (21) 105
 
0.2%
Katakana
ValueCountFrequency (%)
2
16.7%
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Cyrillic
ValueCountFrequency (%)
Ю 1
10.0%
о 1
10.0%
к 1
10.0%
н 1
10.0%
е 1
10.0%
и 1
10.0%
с 1
10.0%
в 1
10.0%
О 1
10.0%
Г 1
10.0%
Hiragana
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87004
59.4%
ASCII 57169
39.0%
CJK 1883
 
1.3%
None 315
 
0.2%
CJK Compat Ideographs 91
 
0.1%
Katakana 12
 
< 0.1%
Hiragana 10
 
< 0.1%
Cyrillic 10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31207
54.6%
; 5849
 
10.2%
[ 2080
 
3.6%
] 2078
 
3.6%
e 1245
 
2.2%
. 1192
 
2.1%
a 1053
 
1.8%
r 985
 
1.7%
t 977
 
1.7%
i 920
 
1.6%
Other values (67) 9583
 
16.8%
Hangul
ValueCountFrequency (%)
5253
 
6.0%
4992
 
5.7%
4223
 
4.9%
2719
 
3.1%
2341
 
2.7%
1815
 
2.1%
1726
 
2.0%
1598
 
1.8%
1357
 
1.6%
1221
 
1.4%
Other values (889) 59759
68.7%
None
ValueCountFrequency (%)
· 307
97.5%
3
 
1.0%
2
 
0.6%
1
 
0.3%
1
 
0.3%
­ 1
 
0.3%
CJK
ValueCountFrequency (%)
197
 
10.5%
87
 
4.6%
68
 
3.6%
63
 
3.3%
36
 
1.9%
24
 
1.3%
24
 
1.3%
22
 
1.2%
21
 
1.1%
16
 
0.8%
Other values (508) 1325
70.4%
CJK Compat Ideographs
ValueCountFrequency (%)
62
68.1%
5
 
5.5%
4
 
4.4%
4
 
4.4%
2
 
2.2%
2
 
2.2%
2
 
2.2%
1
 
1.1%
1
 
1.1%
1
 
1.1%
Other values (7) 7
 
7.7%
Hiragana
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Katakana
ValueCountFrequency (%)
2
16.7%
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Cyrillic
ValueCountFrequency (%)
Ю 1
10.0%
о 1
10.0%
к 1
10.0%
н 1
10.0%
е 1
10.0%
и 1
10.0%
с 1
10.0%
в 1
10.0%
О 1
10.0%
Г 1
10.0%

출판사 (publisher)
Text

Distinct: 3220
Distinct (%): 32.2%
Missing: 3
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 40
Median length: 38
Mean length: 5.0835251
Min length: 1

Characters and Unicode

Total characters: 50820
Distinct characters: 893
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1804
Unique (%): 18.0%

Sample

1st row: 청범출판사
2nd row: 교육과학사
3rd row: 처음주니어
4th row: 바람의 아이들
5th row: CJ엔터테인먼트

Value | Count | Frequency (%)
김영사 | 82 | 0.7%
문학동네 | 80 | 0.7%
공급 | 79 | 0.7%
press | 73 | 0.7%
출판부 | 66 | 0.6%
민음사 | 66 | 0.6%
university | 63 | 0.6%
oxford | 63 | 0.6%
엔터테인먼트 | 61 | 0.5%
살림 | 59 | 0.5%
Other values (3290) | 10499 | 93.8%

Most occurring characters

ValueCountFrequency (%)
2140
 
4.2%
1194
 
2.3%
1103
 
2.2%
960
 
1.9%
946
 
1.9%
741
 
1.5%
699
 
1.4%
647
 
1.3%
630
 
1.2%
616
 
1.2%
Other values (883) 41144
81.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42461
83.6%
Lowercase Letter 4538
 
8.9%
Uppercase Letter 1455
 
2.9%
Space Separator 1194
 
2.3%
Open Punctuation 340
 
0.7%
Close Punctuation 339
 
0.7%
Other Punctuation 306
 
0.6%
Decimal Number 167
 
0.3%
Dash Punctuation 19
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2140
 
5.0%
1103
 
2.6%
960
 
2.3%
946
 
2.2%
741
 
1.7%
699
 
1.6%
647
 
1.5%
630
 
1.5%
616
 
1.5%
599
 
1.4%
Other values (804) 33380
78.6%
Uppercase Letter
ValueCountFrequency (%)
B 216
14.8%
M 175
12.0%
C 119
 
8.2%
S 118
 
8.1%
P 112
 
7.7%
O 95
 
6.5%
U 82
 
5.6%
H 74
 
5.1%
D 64
 
4.4%
E 52
 
3.6%
Other values (15) 348
23.9%
Lowercase Letter
ValueCountFrequency (%)
o 515
11.3%
s 467
10.3%
r 457
10.1%
e 411
 
9.1%
i 384
 
8.5%
n 300
 
6.6%
a 293
 
6.5%
t 240
 
5.3%
l 187
 
4.1%
d 164
 
3.6%
Other values (14) 1120
24.7%
Other Punctuation
ValueCountFrequency (%)
· 79
25.8%
/ 59
19.3%
47
15.4%
& 41
13.4%
. 29
 
9.5%
, 23
 
7.5%
' 10
 
3.3%
@ 5
 
1.6%
# 3
 
1.0%
; 3
 
1.0%
Other values (4) 7
 
2.3%
Decimal Number
ValueCountFrequency (%)
2 67
40.1%
0 49
29.3%
1 26
 
15.6%
8 7
 
4.2%
5 6
 
3.6%
4 4
 
2.4%
9 3
 
1.8%
3 3
 
1.8%
6 2
 
1.2%
Open Punctuation
ValueCountFrequency (%)
[ 323
95.0%
( 17
 
5.0%
Close Punctuation
ValueCountFrequency (%)
] 322
95.0%
) 17
 
5.0%
Space Separator
ValueCountFrequency (%)
1194
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41366
81.4%
Latin 5993
 
11.8%
Common 2366
 
4.7%
Han 1095
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2140
 
5.2%
1103
 
2.7%
960
 
2.3%
946
 
2.3%
741
 
1.8%
699
 
1.7%
647
 
1.6%
630
 
1.5%
616
 
1.5%
599
 
1.4%
Other values (616) 32285
78.0%
Han
ValueCountFrequency (%)
154
 
14.1%
124
 
11.3%
34
 
3.1%
33
 
3.0%
30
 
2.7%
25
 
2.3%
23
 
2.1%
23
 
2.1%
21
 
1.9%
20
 
1.8%
Other values (178) 608
55.5%
Latin
ValueCountFrequency (%)
o 515
 
8.6%
s 467
 
7.8%
r 457
 
7.6%
e 411
 
6.9%
i 384
 
6.4%
n 300
 
5.0%
a 293
 
4.9%
t 240
 
4.0%
B 216
 
3.6%
l 187
 
3.1%
Other values (39) 2523
42.1%
Common
ValueCountFrequency (%)
1194
50.5%
[ 323
 
13.7%
] 322
 
13.6%
· 79
 
3.3%
2 67
 
2.8%
/ 59
 
2.5%
0 49
 
2.1%
47
 
2.0%
& 41
 
1.7%
. 29
 
1.2%
Other values (20) 156
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41365
81.4%
ASCII 8227
 
16.2%
CJK 1091
 
2.1%
None 132
 
0.3%
CJK Compat Ideographs 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2140
 
5.2%
1103
 
2.7%
960
 
2.3%
946
 
2.3%
741
 
1.8%
699
 
1.7%
647
 
1.6%
630
 
1.5%
616
 
1.5%
599
 
1.4%
Other values (615) 32284
78.0%
ASCII
ValueCountFrequency (%)
1194
 
14.5%
o 515
 
6.3%
s 467
 
5.7%
r 457
 
5.6%
e 411
 
5.0%
i 384
 
4.7%
[ 323
 
3.9%
] 322
 
3.9%
n 300
 
3.6%
a 293
 
3.6%
Other values (64) 3561
43.3%
CJK
ValueCountFrequency (%)
154
 
14.1%
124
 
11.4%
34
 
3.1%
33
 
3.0%
30
 
2.7%
25
 
2.3%
23
 
2.1%
23
 
2.1%
21
 
1.9%
20
 
1.8%
Other values (176) 604
55.4%
None
ValueCountFrequency (%)
· 79
59.8%
47
35.6%
3
 
2.3%
2
 
1.5%
1
 
0.8%
CJK Compat Ideographs
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

출판년 (publication year)
Text

Distinct: 170
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 14
Median length: 4
Mean length: 4.1542
Min length: 4

Characters and Unicode

Total characters: 41542
Distinct characters: 30
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 93
Unique (%): 0.9%

Sample

1st row: 1998
2nd row: 2004
3rd row: 2011
4th row: 2012
5th row: 2002

Value | Count | Frequency (%)
2002 | 1072 | 10.6%
2004 | 1066 | 10.5%
2003 | 998 | 9.9%
2005 | 819 | 8.1%
2001 | 812 | 8.0%
2000 | 511 | 5.1%
2014 | 477 | 4.7%
2011 | 346 | 3.4%
2021 | 334 | 3.3%
2007 | 323 | 3.2%
Other values (143) | 3353 | 33.2%

Most occurring characters

ValueCountFrequency (%)
0 16649
40.1%
2 11346
27.3%
1 4839
 
11.6%
9 2143
 
5.2%
4 1634
 
3.9%
3 1319
 
3.2%
5 1059
 
2.5%
7 626
 
1.5%
6 575
 
1.4%
8 519
 
1.2%
Other values (20) 833
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40709
98.0%
Close Punctuation 257
 
0.6%
Open Punctuation 256
 
0.6%
Other Letter 200
 
0.5%
Space Separator 111
 
0.3%
Lowercase Letter 8
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
61.5%
27
 
13.5%
11
 
5.5%
11
 
5.5%
9
 
4.5%
8
 
4.0%
3
 
1.5%
3
 
1.5%
1
 
0.5%
1
 
0.5%
Other values (3) 3
 
1.5%
Decimal Number
ValueCountFrequency (%)
0 16649
40.9%
2 11346
27.9%
1 4839
 
11.9%
9 2143
 
5.3%
4 1634
 
4.0%
3 1319
 
3.2%
5 1059
 
2.6%
7 626
 
1.5%
6 575
 
1.4%
8 519
 
1.3%
Close Punctuation
ValueCountFrequency (%)
] 132
51.4%
) 125
48.6%
Open Punctuation
ValueCountFrequency (%)
[ 131
51.2%
( 125
48.8%
Space Separator
ValueCountFrequency (%)
111
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 41334
99.5%
Hangul 200
 
0.5%
Latin 8
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 16649
40.3%
2 11346
27.4%
1 4839
 
11.7%
9 2143
 
5.2%
4 1634
 
4.0%
3 1319
 
3.2%
5 1059
 
2.6%
7 626
 
1.5%
6 575
 
1.4%
8 519
 
1.3%
Other values (6) 625
 
1.5%
Hangul
ValueCountFrequency (%)
123
61.5%
27
 
13.5%
11
 
5.5%
11
 
5.5%
9
 
4.5%
8
 
4.0%
3
 
1.5%
3
 
1.5%
1
 
0.5%
1
 
0.5%
Other values (3) 3
 
1.5%
Latin
ValueCountFrequency (%)
c 8
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41342
99.5%
Hangul 200
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 16649
40.3%
2 11346
27.4%
1 4839
 
11.7%
9 2143
 
5.2%
4 1634
 
4.0%
3 1319
 
3.2%
5 1059
 
2.6%
7 626
 
1.5%
6 575
 
1.4%
8 519
 
1.3%
Other values (7) 633
 
1.5%
Hangul
ValueCountFrequency (%)
123
61.5%
27
 
13.5%
11
 
5.5%
11
 
5.5%
9
 
4.5%
8
 
4.0%
3
 
1.5%
3
 
1.5%
1
 
0.5%
1
 
0.5%
Other values (3) 3
 
1.5%

청구기호 (call number)
Text

Distinct: 9993
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 23
Median length: 20
Mean length: 11.6098
Min length: 7

Characters and Unicode

Total characters: 116098
Distinct characters: 586
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9986
Unique (%): 99.9%

Sample

1st row: 443.1-프232ㅇ-1
2nd row: 370.18-교67ㄱ
3rd row: C710-서79ㅇ
4th row: 814.6-최67ㅇ
5th row: DV 600-2467

Value | Count | Frequency (%)
dv | 1003 | 8.6%
ce | 604 | 5.2%
bi | 28 | 0.2%
t3 | 4 | < 0.1%
911.02-김19ㅎ | 2 | < 0.1%
991.1-강77ㅇ | 2 | < 0.1%
t4 | 2 | < 0.1%
vt | 2 | < 0.1%
t2 | 2 | < 0.1%
711.1-오73ㅎ | 2 | < 0.1%
Other values (9994) | 9998 | 85.8%
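
The word table for this variable lists lowercase tokens such as "dv" and "ce", which suggests that values are lowercased and split on whitespace before counting. The sketch below applies that assumed rule to the five sample call numbers; the profiler's actual tokenization may differ.

```python
from collections import Counter

# Sample call numbers copied from the report.
call_numbers = [
    "443.1-프232ㅇ-1", "370.18-교67ㄱ", "C710-서79ㅇ",
    "814.6-최67ㅇ", "DV 600-2467",
]

# Assumed rule: lowercase each value, then split on whitespace.
tokens = Counter(
    word for value in call_numbers for word in value.lower().split()
)

for word, count in tokens.most_common():
    print(word, count)
```

Under this rule, "DV 600-2467" contributes the two tokens "dv" and "600-2467", consistent with "dv" appearing as a frequent word in the full column.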

Most occurring characters

ValueCountFrequency (%)
- 13432
 
11.6%
1 10960
 
9.4%
3 9211
 
7.9%
8 8259
 
7.1%
2 7563
 
6.5%
9 6516
 
5.6%
. 6378
 
5.5%
6 6249
 
5.4%
0 6246
 
5.4%
5 5540
 
4.8%
Other values (576) 35744
30.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 70456
60.7%
Other Letter 18588
 
16.0%
Dash Punctuation 13432
 
11.6%
Other Punctuation 6379
 
5.5%
Uppercase Letter 5004
 
4.3%
Space Separator 1649
 
1.4%
Math Symbol 513
 
0.4%
Open Punctuation 36
 
< 0.1%
Close Punctuation 36
 
< 0.1%
Format 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1917
 
10.3%
1286
 
6.9%
1124
 
6.0%
926
 
5.0%
795
 
4.3%
775
 
4.2%
769
 
4.1%
573
 
3.1%
565
 
3.0%
545
 
2.9%
Other values (539) 9313
50.1%
Uppercase Letter
ValueCountFrequency (%)
C 1832
36.6%
V 1005
20.1%
D 1003
20.0%
E 605
 
12.1%
O 237
 
4.7%
R 178
 
3.6%
L 60
 
1.2%
B 31
 
0.6%
I 28
 
0.6%
T 12
 
0.2%
Other values (5) 13
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 10960
15.6%
3 9211
13.1%
8 8259
11.7%
2 7563
10.7%
9 6516
9.2%
6 6249
8.9%
0 6246
8.9%
5 5540
7.9%
4 5105
7.2%
7 4807
6.8%
Other Punctuation
ValueCountFrequency (%)
. 6378
> 99.9%
, 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 512
99.8%
~ 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 35
97.2%
( 1
 
2.8%
Close Punctuation
ValueCountFrequency (%)
] 35
97.2%
) 1
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 13432
100.0%
Space Separator
ValueCountFrequency (%)
1649
100.0%
Format
ValueCountFrequency (%)
­ 4
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 92505
79.7%
Hangul 18586
 
16.0%
Latin 5005
 
4.3%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1917
 
10.3%
1286
 
6.9%
1124
 
6.0%
926
 
5.0%
795
 
4.3%
775
 
4.2%
769
 
4.1%
573
 
3.1%
565
 
3.0%
545
 
2.9%
Other values (537) 9311
50.1%
Common
ValueCountFrequency (%)
- 13432
14.5%
1 10960
11.8%
3 9211
10.0%
8 8259
8.9%
2 7563
8.2%
9 6516
7.0%
. 6378
6.9%
6 6249
6.8%
0 6246
6.8%
5 5540
6.0%
Other values (11) 12151
13.1%
Latin
ValueCountFrequency (%)
C 1832
36.6%
V 1005
20.1%
D 1003
20.0%
E 605
 
12.1%
O 237
 
4.7%
R 178
 
3.6%
L 60
 
1.2%
B 31
 
0.6%
I 28
 
0.6%
T 12
 
0.2%
Other values (6) 14
 
0.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97505
84.0%
Compat Jamo 9514
 
8.2%
Hangul 9072
 
7.8%
None 4
 
< 0.1%
CJK 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 13432
13.8%
1 10960
11.2%
3 9211
9.4%
8 8259
8.5%
2 7563
7.8%
9 6516
 
6.7%
. 6378
 
6.5%
6 6249
 
6.4%
0 6246
 
6.4%
5 5540
 
5.7%
Other values (25) 17151
17.6%
Compat Jamo
ValueCountFrequency (%)
1917
20.1%
1286
13.5%
1124
11.8%
926
9.7%
769
8.1%
573
 
6.0%
565
 
5.9%
545
 
5.7%
466
 
4.9%
323
 
3.4%
Other values (9) 1020
10.7%
Hangul
ValueCountFrequency (%)
795
 
8.8%
775
 
8.5%
328
 
3.6%
264
 
2.9%
217
 
2.4%
172
 
1.9%
146
 
1.6%
136
 
1.5%
127
 
1.4%
116
 
1.3%
Other values (518) 5996
66.1%
None
ValueCountFrequency (%)
­ 4
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

관리기관명 (managing institution name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경기도 구리시청: 10000

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도 구리시청
2nd row: 경기도 구리시청
3rd row: 경기도 구리시청
4th row: 경기도 구리시청
5th row: 경기도 구리시청

Common Values

Value | Count | Frequency (%)
경기도 구리시청 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경기도 | 10000 | 50.0%
구리시청 | 10000 | 50.0%

데이터기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2023-07-12 00:00:00
Maximum: 2023-07-12 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
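
To make the idea of nullity correlation concrete, the sketch below computes the Pearson correlation between the "is present" indicators of two columns. The function `nullity_correlation` and the toy columns are hypothetical; they illustrate the measure and are not taken from this dataset or from the profiler's code.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between presence indicators (1 = present, 0 = missing)."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is fully present or fully missing.
        return None
    return cov / sqrt(var_a * var_b)


# Toy columns: None marks a missing value.
together = nullity_correlation(["a", None, "b", None], ["x", None, "y", None])
opposite = nullity_correlation(["a", None, "b", None], [None, "x", None, "y"])
complete = nullity_correlation(["a", "b", "c", "d"], ["x", None, "y", None])

print(together, opposite, complete)
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one is present when the other is missing, and fully populated columns have no defined nullity correlation.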

Sample

(row index) | 도서관명 | 등록번호 | 서명 | 저자 | 출판사 | 출판년 | 청구기호 | 관리기관명 | 데이터기준일자
38579 | 인창도서관 | EM0000003380 | 우주로의 여행. 1 | Andrew Fraknoi;David Morrison;Sidney Wolff [공]저;윤홍식...[등]역 | 청범출판사 | 1998 | 443.1-프232ㅇ-1 | 경기도 구리시청 | 2023-07-12
63994 | 인창도서관 | EM0000036353 | 교육 및 상담 심리학 | 이용남...[등]편 | 교육과학사 | 2004 | 370.18-교67ㄱ | 경기도 구리시청 | 2023-07-12
4057 | 인창도서관 | CM0000055167 | 1학년 백점국어 | 서지원 글 ; 이상미 그림 | 처음주니어 | 2011 | C710-서79ㅇ | 경기도 구리시청 | 2023-07-12
28136 | 인창도서관 | DE0000010191 | 우호적인 무관심 | 최윤정 지음 | 바람의 아이들 | 2012 | 814.6-최67ㅇ | 경기도 구리시청 | 2023-07-12
15680 | 인창도서관 | CR0000002467 | 이집트 왕자 | 브렌다 챔프만 | CJ엔터테인먼트 | 2002 | DV 600-2467 | 경기도 구리시청 | 2023-07-12
31171 | 인창도서관 | DE0000018897 | 거리에서, 문득 : 조규찬 에세이 | 조규찬 지음 | 안나푸르나 | 2015 | 814.6-조17ㄱ=2 | 경기도 구리시청 | 2023-07-12
19534 | 인창도서관 | CR0000009172 | 바다가 들린다 | 모치즈키 토모미 감독 | 대원DVD | 2008 | DV 600-9172 | 경기도 구리시청 | 2023-07-12
43640 | 인창도서관 | EM0000009167 | (중학생이 보는) 로빈슨 크루소 | 다니엘 디포 지음 ; 성낙수 ; 정근용 ; 김은정 엮음 ; 박영의 옮김 | 신원문화사 | 2001 | 808-디845ㄹ | 경기도 구리시청 | 2023-07-12
67013 | 인창도서관 | EM0000041205 | 사유하는 교사 : 교육학적 사유를 위한 안내서 | A. 플리트너;H. 쇼이얼 엮음;송순재 편역 | 내일을여는책 | 2000 | 370.1-플239ㅅ | 경기도 구리시청 | 2023-07-12
36036 | 인창도서관 | EM0000000425 | 효사상과 조상숭배 | 차용준 지음 | 신아출판사 | 2000 | 151-차66ㅎ | 경기도 구리시청 | 2023-07-12

(row index) | 도서관명 | 등록번호 | 서명 | 저자 | 출판사 | 출판년 | 청구기호 | 관리기관명 | 데이터기준일자
41557 | 인창도서관 | EM0000006727 | 정치광고의 이해와 활용 | 탁진영 [저] | 커뮤니케이션북스 | 1999 | 340.2-탁79ㅈ | 경기도 구리시청 | 2023-07-12
41800 | 인창도서관 | EM0000007013 | 아내의 기도로 남편을 돕는다 | 스토미 오마샨 ; 김태곤 옮김 | 생명의 말씀사 | 2002 | 237.2-오31ㅇ | 경기도 구리시청 | 2023-07-12
10811 | 인창도서관 | CM0000097733 | 전우치전 | 전상욱 글 ; 장선환 그림 | Humanist | 2021 | C813.5-초15ㅎ-2 | 경기도 구리시청 | 2023-07-12
39407 | 인창도서관 | EM0000004260 | (莊峰散稿)華嚴思想과 禪 | 金知見 著 | 민족사 | 2002 | 228.4-김79ㅎ | 경기도 구리시청 | 2023-07-12
60397 | 인창도서관 | EM0000031086 | 한국현대시인연구. 下 | 문덕수...[등]책임편집 | 푸른사상 | 2001 | 811.609-문223ㅎ-2 | 경기도 구리시청 | 2023-07-12
9428 | 인창도서관 | CM0000080524 | 폭력은 손에서 시작된단다 : 폭력에 대한 올바른 가치관 세우기 | 마틴 애거시 글 ; 마리카 하인렌 그림 ; 마술연필 옮김 | 보물창고 | 2016 | CE 334.23-애13ㅍ | 경기도 구리시청 | 2023-07-12
45967 | 인창도서관 | EM0000012177 | 명성황후. 2 | [강신재 지음] | 소담 | 2001 | 813.6-강59ㅁ-2 | 경기도 구리시청 | 2023-07-12
45053 | 인창도서관 | EM0000011023 | (강원도) 122명산 코스집 | 안경호 지음 | 평화 | 1996 | 699.1-안14ㅂ | 경기도 구리시청 | 2023-07-12
13444 | 인창도서관 | CM0000100368 | 드르렁 드르렁, 아빠는 왜 코를 골지 | 앙드레 부샤르 글·그림 ; 이정주 옮김 | 어린이작가정신 | 2020 | CE 863-부52ㄷ | 경기도 구리시청 | 2023-07-12
33740 | 인창도서관 | DE0000022147 | 1000일의 창, 음식이력서 : 이젠 알고 먹자 | 한스 콘라트 비잘스키 지음 ; 김완균 옮김 | 대원사 | 2018 | 517.5-비71ㅊ | 경기도 구리시청 | 2023-07-12