Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

Text3

Dataset

Description충남도립대학교 도서관에서 소장하고 있는 일반도서의 등록번호, 도서명 등 도서정보, 청구기호 정보가 포함된 목록 제공
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=400&beforeMenuCd=DOM_000000201001001000&publicdatapk=3046847

Alerts

등록번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:16:02.735182
Analysis finished2024-01-09 22:16:04.204716
Duration1.47 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:16:04.316937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters90000
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowKM0046880
2nd rowKM0021148
3rd rowKM0006239
4th rowKM0051204
5th rowKM0030413
ValueCountFrequency (%)
km0046880 1
 
< 0.1%
km0059591 1
 
< 0.1%
km0030537 1
 
< 0.1%
km0070405 1
 
< 0.1%
km0057690 1
 
< 0.1%
km0000105 1
 
< 0.1%
km0041848 1
 
< 0.1%
km0051116 1
 
< 0.1%
km0030908 1
 
< 0.1%
km0066661 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-01-10T07:16:04.568134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 25369
28.2%
K 10000
 
11.1%
M 10000
 
11.1%
1 5448
 
6.1%
2 5410
 
6.0%
3 5358
 
6.0%
6 5321
 
5.9%
4 5306
 
5.9%
5 5268
 
5.9%
7 4653
 
5.2%
Other values (2) 7867
 
8.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 70000
77.8%
Uppercase Letter 20000
 
22.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25369
36.2%
1 5448
 
7.8%
2 5410
 
7.7%
3 5358
 
7.7%
6 5321
 
7.6%
4 5306
 
7.6%
5 5268
 
7.5%
7 4653
 
6.6%
8 3947
 
5.6%
9 3920
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
K 10000
50.0%
M 10000
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70000
77.8%
Latin 20000
 
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 25369
36.2%
1 5448
 
7.8%
2 5410
 
7.7%
3 5358
 
7.7%
6 5321
 
7.6%
4 5306
 
7.6%
5 5268
 
7.5%
7 4653
 
6.6%
8 3947
 
5.6%
9 3920
 
5.6%
Latin
ValueCountFrequency (%)
K 10000
50.0%
M 10000
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25369
28.2%
K 10000
 
11.1%
M 10000
 
11.1%
1 5448
 
6.1%
2 5410
 
6.0%
3 5358
 
6.0%
6 5321
 
5.9%
4 5306
 
5.9%
5 5268
 
5.9%
7 4653
 
5.2%
Other values (2) 7867
 
8.7%
Distinct9343
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:16:04.816992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length213
Median length156
Mean length21.7894
Min length1

Characters and Unicode

Total characters217894
Distinct characters2202
Distinct categories15 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8752 ?
Unique (%)87.5%

Sample

1st row건재 정인승 : 국어사전 만들기에 일생을 바친
2nd row관광법규의 이해와 사례분석
3rd row(新編)都市計劃
4th row조선, 도덕(道德)의 성찰 : 조선시대 유학의 도덕철학
5th row한국생활사박물관. 7 : 고려생활관 1
ValueCountFrequency (%)
5115
 
10.2%
장편소설 300
 
0.6%
1 296
 
0.6%
2 288
 
0.6%
of 247
 
0.5%
위한 201
 
0.4%
the 198
 
0.4%
연구 191
 
0.4%
188
 
0.4%
3 162
 
0.3%
Other values (21965) 42718
85.6%
2024-01-10T07:16:05.186011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40362
 
18.5%
: 4255
 
2.0%
4039
 
1.9%
e 2958
 
1.4%
2590
 
1.2%
2432
 
1.1%
o 2367
 
1.1%
i 2356
 
1.1%
n 2346
 
1.1%
2298
 
1.1%
Other values (2192) 151891
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 124726
57.2%
Space Separator 40362
 
18.5%
Lowercase Letter 26090
 
12.0%
Other Punctuation 7869
 
3.6%
Decimal Number 7786
 
3.6%
Uppercase Letter 5432
 
2.5%
Open Punctuation 2076
 
1.0%
Close Punctuation 2074
 
1.0%
Math Symbol 943
 
0.4%
Dash Punctuation 421
 
0.2%
Other values (5) 115
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4039
 
3.2%
2590
 
2.1%
2432
 
1.9%
2298
 
1.8%
1935
 
1.6%
1577
 
1.3%
1490
 
1.2%
1459
 
1.2%
1388
 
1.1%
1359
 
1.1%
Other values (2073) 104159
83.5%
Lowercase Letter
ValueCountFrequency (%)
e 2958
11.3%
o 2367
 
9.1%
i 2356
 
9.0%
n 2346
 
9.0%
a 2239
 
8.6%
t 2100
 
8.0%
r 1797
 
6.9%
s 1483
 
5.7%
l 1203
 
4.6%
c 1067
 
4.1%
Other values (16) 6174
23.7%
Uppercase Letter
ValueCountFrequency (%)
C 503
 
9.3%
S 478
 
8.8%
A 410
 
7.5%
T 403
 
7.4%
I 380
 
7.0%
E 374
 
6.9%
P 294
 
5.4%
O 286
 
5.3%
N 247
 
4.5%
M 246
 
4.5%
Other values (16) 1811
33.3%
Other Punctuation
ValueCountFrequency (%)
: 4255
54.1%
. 1930
24.5%
, 845
 
10.7%
· 346
 
4.4%
' 130
 
1.7%
! 122
 
1.6%
/ 99
 
1.3%
& 77
 
1.0%
25
 
0.3%
" 12
 
0.2%
Other values (8) 28
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 1745
22.4%
0 1661
21.3%
2 1648
21.2%
3 598
 
7.7%
9 586
 
7.5%
5 349
 
4.5%
4 343
 
4.4%
7 293
 
3.8%
6 286
 
3.7%
8 277
 
3.6%
Math Symbol
ValueCountFrequency (%)
= 776
82.3%
+ 73
 
7.7%
~ 67
 
7.1%
< 8
 
0.8%
> 8
 
0.8%
4
 
0.4%
2
 
0.2%
2
 
0.2%
2
 
0.2%
× 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1990
95.9%
[ 66
 
3.2%
12
 
0.6%
4
 
0.2%
2
 
0.1%
1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 1989
95.9%
] 66
 
3.2%
12
 
0.6%
3
 
0.1%
2
 
0.1%
1
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
36
36.4%
35
35.4%
15
15.2%
9
 
9.1%
4
 
4.0%
Other Symbol
ValueCountFrequency (%)
5
62.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Other Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
40362
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 421
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117450
53.9%
Common 61547
28.2%
Latin 31621
 
14.5%
Han 7268
 
3.3%
Hiragana 4
 
< 0.1%
Katakana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4039
 
3.4%
2590
 
2.2%
2432
 
2.1%
2298
 
2.0%
1935
 
1.6%
1577
 
1.3%
1490
 
1.3%
1459
 
1.2%
1388
 
1.2%
1359
 
1.2%
Other values (1133) 96883
82.5%
Han
ValueCountFrequency (%)
265
 
3.6%
175
 
2.4%
172
 
2.4%
139
 
1.9%
107
 
1.5%
95
 
1.3%
85
 
1.2%
85
 
1.2%
82
 
1.1%
80
 
1.1%
Other values (922) 5983
82.3%
Common
ValueCountFrequency (%)
40362
65.6%
: 4255
 
6.9%
( 1990
 
3.2%
) 1989
 
3.2%
. 1930
 
3.1%
1 1745
 
2.8%
0 1661
 
2.7%
2 1648
 
2.7%
, 845
 
1.4%
= 776
 
1.3%
Other values (52) 4346
 
7.1%
Latin
ValueCountFrequency (%)
e 2958
 
9.4%
o 2367
 
7.5%
i 2356
 
7.5%
n 2346
 
7.4%
a 2239
 
7.1%
t 2100
 
6.6%
r 1797
 
5.7%
s 1483
 
4.7%
l 1203
 
3.8%
c 1067
 
3.4%
Other values (47) 11705
37.0%
Hiragana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117322
53.8%
ASCII 92621
42.5%
CJK 7081
 
3.2%
None 426
 
0.2%
CJK Compat Ideographs 187
 
0.1%
Compat Jamo 128
 
0.1%
Number Forms 99
 
< 0.1%
Math Operators 8
 
< 0.1%
Misc Symbols 5
 
< 0.1%
Enclosed Alphanum 5
 
< 0.1%
Other values (5) 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40362
43.6%
: 4255
 
4.6%
e 2958
 
3.2%
o 2367
 
2.6%
i 2356
 
2.5%
n 2346
 
2.5%
a 2239
 
2.4%
t 2100
 
2.3%
( 1990
 
2.1%
) 1989
 
2.1%
Other values (77) 29659
32.0%
Hangul
ValueCountFrequency (%)
4039
 
3.4%
2590
 
2.2%
2432
 
2.1%
2298
 
2.0%
1935
 
1.6%
1577
 
1.3%
1490
 
1.3%
1459
 
1.2%
1388
 
1.2%
1359
 
1.2%
Other values (1126) 96755
82.5%
None
ValueCountFrequency (%)
· 346
81.2%
25
 
5.9%
12
 
2.8%
12
 
2.8%
11
 
2.6%
4
 
0.9%
3
 
0.7%
2
 
0.5%
2
 
0.5%
2
 
0.5%
Other values (7) 7
 
1.6%
CJK
ValueCountFrequency (%)
265
 
3.7%
175
 
2.5%
172
 
2.4%
139
 
2.0%
107
 
1.5%
95
 
1.3%
85
 
1.2%
85
 
1.2%
82
 
1.2%
80
 
1.1%
Other values (880) 5796
81.9%
Compat Jamo
ValueCountFrequency (%)
117
91.4%
3
 
2.3%
2
 
1.6%
2
 
1.6%
2
 
1.6%
1
 
0.8%
1
 
0.8%
CJK Compat Ideographs
ValueCountFrequency (%)
45
24.1%
18
 
9.6%
12
 
6.4%
10
 
5.3%
8
 
4.3%
7
 
3.7%
7
 
3.7%
6
 
3.2%
6
 
3.2%
5
 
2.7%
Other values (32) 63
33.7%
Number Forms
ValueCountFrequency (%)
36
36.4%
35
35.4%
15
15.2%
9
 
9.1%
4
 
4.0%
Misc Symbols
ValueCountFrequency (%)
5
100.0%
Math Operators
ValueCountFrequency (%)
4
50.0%
2
25.0%
2
25.0%
Enclosed Alphanum
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Punctuation
ValueCountFrequency (%)
2
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct8589
Distinct (%)85.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:16:05.394103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length20
Mean length11.94
Min length8

Characters and Unicode

Total characters119400
Distinct characters520
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7696 ?
Unique (%)77.0%

Sample

1st row990-한654ㄱ-C
2nd row326.39-신664ㄱ
3rd row539.7-윤824ㄷ
4th row151.5-윤678ㅈ
5th row911.069-한529ㅎ
ValueCountFrequency (%)
080-대383-r 28
 
0.3%
337-충85ㅊ-g 27
 
0.3%
322.004-국464ㄱ-g 15
 
0.1%
833.6-아334ㅋ오-c 15
 
0.1%
810.8-태446ㅎ 13
 
0.1%
813.608-동875ㅎ 13
 
0.1%
710.77-이375ㅅ 12
 
0.1%
080-시316 11
 
0.1%
811.608-미735 11
 
0.1%
813.6-조852ㅇ 10
 
0.1%
Other values (8583) 9849
98.5%
2024-01-10T07:16:05.696190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 11564
 
9.7%
3 10632
 
8.9%
1 10198
 
8.5%
5 8175
 
6.8%
. 8084
 
6.8%
9 7648
 
6.4%
8 7477
 
6.3%
6 7151
 
6.0%
7 7124
 
6.0%
2 6389
 
5.4%
Other values (510) 34958
29.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 76197
63.8%
Other Letter 21975
 
18.4%
Dash Punctuation 11564
 
9.7%
Other Punctuation 8084
 
6.8%
Uppercase Letter 1544
 
1.3%
Lowercase Letter 21
 
< 0.1%
Space Separator 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1737
 
7.9%
1496
 
6.8%
1355
 
6.2%
1348
 
6.1%
1300
 
5.9%
1243
 
5.7%
1113
 
5.1%
651
 
3.0%
548
 
2.5%
545
 
2.5%
Other values (467) 10639
48.4%
Uppercase Letter
ValueCountFrequency (%)
G 508
32.9%
R 391
25.3%
C 275
17.8%
E 117
 
7.6%
X 117
 
7.6%
U 86
 
5.6%
A 21
 
1.4%
T 16
 
1.0%
B 5
 
0.3%
K 3
 
0.2%
Other values (4) 5
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
m 5
23.8%
o 3
14.3%
c 2
 
9.5%
g 2
 
9.5%
e 2
 
9.5%
k 1
 
4.8%
v 1
 
4.8%
b 1
 
4.8%
d 1
 
4.8%
a 1
 
4.8%
Other values (2) 2
 
9.5%
Decimal Number
ValueCountFrequency (%)
3 10632
14.0%
1 10198
13.4%
5 8175
10.7%
9 7648
10.0%
8 7477
9.8%
6 7151
9.4%
7 7124
9.3%
2 6389
8.4%
4 6196
8.1%
0 5207
6.8%
Open Punctuation
ValueCountFrequency (%)
( 4
80.0%
[ 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 4
80.0%
] 1
 
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 11564
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8084
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 95860
80.3%
Hangul 21975
 
18.4%
Latin 1565
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1737
 
7.9%
1496
 
6.8%
1355
 
6.2%
1348
 
6.1%
1300
 
5.9%
1243
 
5.7%
1113
 
5.1%
651
 
3.0%
548
 
2.5%
545
 
2.5%
Other values (467) 10639
48.4%
Latin
ValueCountFrequency (%)
G 508
32.5%
R 391
25.0%
C 275
17.6%
E 117
 
7.5%
X 117
 
7.5%
U 86
 
5.5%
A 21
 
1.3%
T 16
 
1.0%
m 5
 
0.3%
B 5
 
0.3%
Other values (16) 24
 
1.5%
Common
ValueCountFrequency (%)
- 11564
12.1%
3 10632
11.1%
1 10198
10.6%
5 8175
8.5%
. 8084
8.4%
9 7648
8.0%
8 7477
7.8%
6 7151
7.5%
7 7124
7.4%
2 6389
6.7%
Other values (7) 11418
11.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97425
81.6%
Hangul 11753
 
9.8%
Compat Jamo 10222
 
8.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 11564
11.9%
3 10632
10.9%
1 10198
10.5%
5 8175
8.4%
. 8084
8.3%
9 7648
7.9%
8 7477
7.7%
6 7151
7.3%
7 7124
7.3%
2 6389
6.6%
Other values (33) 12983
13.3%
Compat Jamo
ValueCountFrequency (%)
1737
17.0%
1355
13.3%
1300
12.7%
1243
12.2%
1113
10.9%
651
 
6.4%
548
 
5.4%
542
 
5.3%
420
 
4.1%
391
 
3.8%
Other values (9) 922
9.0%
Hangul
ValueCountFrequency (%)
1496
 
12.7%
1348
 
11.5%
545
 
4.6%
475
 
4.0%
352
 
3.0%
310
 
2.6%
307
 
2.6%
219
 
1.9%
211
 
1.8%
208
 
1.8%
Other values (448) 6282
53.5%

Missing values

2024-01-10T07:16:04.110622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:16:04.170741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호도서정보청구기호
42363KM0046880건재 정인승 : 국어사전 만들기에 일생을 바친990-한654ㄱ-C
19050KM0021148관광법규의 이해와 사례분석326.39-신664ㄱ
5433KM0006239(新編)都市計劃539.7-윤824ㄷ
46214KM0051204조선, 도덕(道德)의 성찰 : 조선시대 유학의 도덕철학151.5-윤678ㅈ
27477KM0030413한국생활사박물관. 7 : 고려생활관 1911.069-한529ㅎ
44142KM0048882옵티미스트 : 인생의 '되도록 밝은 면' 탐구 보고서 : 낙관주의자189-쇼766ㅇ
53062KM0058397경호학개론350.79-김444ㄱ
52489KM0057824포스트드라마 연극의 지각방식과 관객의 역할 : 수행적인 것의 미학의 성과와 한계 = The modus of perception and the role of the audience in the postdramatic theatre : focused on the aesthetics of the performative684.01-김957ㅍ
44051KM0048778꽃보다 남자. 8833.6-카155ㄲ-8-C
5703KM0006532책임의 원칙 : 기술 시대의 생태학적 윤리191.2-요947ㅊ이
등록번호도서정보청구기호
44847KM0049700민족어교육과 외국어교육의 이중성 : 러시아의 한국어교육710.7-장581ㅁ
64619KM0069988(Win-Q) 위험물산업기사 필기 : 단기완성530.98077-이245ㅇ
3901KM0004544中國通史. 上912-전789ㅈ
37710KM004168616ㆍ17세기 시조의 동향과 경향811.3509-김536ㅅ
39563KM0043724세계로, 미래로 : Toward the World, Toward the Future377.604-박547ㅅ
13512KM0015020한국사의 주체적 인물들990-이571ㅎ
41393KM0045759국민권익백서350.31-국373ㄱ-G
16295KM0018104호텔관리회계론596.81-정117ㅎㅌ
16525KM0018358종합미용이론593-미827ㅈ
59100KM0064458영원한 소년과 창조성185.51-프856ㅇ홍