Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory113.0 B

Variable types

Text5
Numeric1
Categorical5
Boolean1
DateTime1

Dataset

Description경기도 군포시에 소재하는 공공 도서관의 소장 도서 목록에 대한 데이터로 등록번호, 서지번호, 청구기호, 서명, 저자, 발행처, 분관, 서고, 소장상태, 언어, 서지유형, 딸림여부, 최종수정일 항목을 제공합니다.
Author경기도 군포시
URLhttps://www.data.go.kr/data/15070273/fileData.do

Alerts

분관 has constant value ""Constant
서고 has constant value ""Constant
소장상태 has constant value ""Constant
언어 is highly imbalanced (96.6%)Imbalance
서지유형 is highly imbalanced (99.3%)Imbalance
딸림여부 is highly imbalanced (70.0%)Imbalance
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:47:27.858707
Analysis finished2023-12-12 09:47:31.100160
Duration3.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:47:31.421313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters80000
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowJE168976
2nd rowJE119320
3rd rowJE113926
4th rowJE226020
5th rowJE145814
ValueCountFrequency (%)
je168976 1
 
< 0.1%
je194627 1
 
< 0.1%
je255022 1
 
< 0.1%
je091217 1
 
< 0.1%
je181010 1
 
< 0.1%
je209038 1
 
< 0.1%
je165689 1
 
< 0.1%
je203864 1
 
< 0.1%
je164280 1
 
< 0.1%
je155030 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T18:47:32.032428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 11058
13.8%
J 10000
12.5%
E 9999
12.5%
2 7621
9.5%
0 7004
8.8%
6 5191
6.5%
5 5162
6.5%
9 5039
6.3%
4 5030
6.3%
3 4818
6.0%
Other values (3) 9078
11.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
75.0%
Uppercase Letter 20000
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 11058
18.4%
2 7621
12.7%
0 7004
11.7%
6 5191
8.7%
5 5162
8.6%
9 5039
8.4%
4 5030
8.4%
3 4818
8.0%
7 4622
7.7%
8 4455
7.4%
Uppercase Letter
ValueCountFrequency (%)
J 10000
50.0%
E 9999
50.0%
R 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 60000
75.0%
Latin 20000
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 11058
18.4%
2 7621
12.7%
0 7004
11.7%
6 5191
8.7%
5 5162
8.6%
9 5039
8.4%
4 5030
8.4%
3 4818
8.0%
7 4622
7.7%
8 4455
7.4%
Latin
ValueCountFrequency (%)
J 10000
50.0%
E 9999
50.0%
R 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 11058
13.8%
J 10000
12.5%
E 9999
12.5%
2 7621
9.5%
0 7004
8.8%
6 5191
6.5%
5 5162
6.5%
9 5039
6.3%
4 5030
6.3%
3 4818
6.0%
Other values (3) 9078
11.3%

서지번호
Real number (ℝ)

Distinct9972
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1829537.6
Minimum279151
Maximum3336885
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:47:32.234252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum279151
5-th percentile688320.6
Q11182129.2
median1758257
Q32543948.2
95-th percentile3209963
Maximum3336885
Range3057734
Interquartile range (IQR)1361819

Descriptive statistics

Standard deviation784893.5
Coefficient of variation (CV)0.42901194
Kurtosis-0.96023897
Mean1829537.6
Median Absolute Deviation (MAD)646917
Skewness0.21409569
Sum1.8295376 × 1010
Variance6.160578 × 1011
MonotonicityNot monotonic
2023-12-12T18:47:32.408424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1679685 2
 
< 0.1%
2680607 2
 
< 0.1%
3192134 2
 
< 0.1%
2657813 2
 
< 0.1%
3166916 2
 
< 0.1%
2620772 2
 
< 0.1%
898227 2
 
< 0.1%
2705665 2
 
< 0.1%
3174199 2
 
< 0.1%
3222695 2
 
< 0.1%
Other values (9962) 9980
99.8%
ValueCountFrequency (%)
279151 1
< 0.1%
528713 1
< 0.1%
528730 1
< 0.1%
528742 1
< 0.1%
528966 1
< 0.1%
528992 1
< 0.1%
529130 1
< 0.1%
529198 1
< 0.1%
529210 1
< 0.1%
529238 1
< 0.1%
ValueCountFrequency (%)
3336885 1
< 0.1%
3257648 1
< 0.1%
3257645 1
< 0.1%
3257639 1
< 0.1%
3257636 1
< 0.1%
3257633 1
< 0.1%
3257630 1
< 0.1%
3257626 1
< 0.1%
3257589 1
< 0.1%
3257583 1
< 0.1%
Distinct9462
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:47:32.847119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length10.5401
Min length5

Characters and Unicode

Total characters105401
Distinct characters581
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9274 ?
Unique (%)92.7%

Sample

1st row325.04 류17ㅇ
2nd row567.43 장68ㄷ
3rd row327.04 노56ㅌ
4th row981.4602 양68마
5th row610.92 배23ㅅ
ValueCountFrequency (%)
082 271
 
1.4%
325.211 199
 
1.0%
325.04 182
 
0.9%
325.1 145
 
0.7%
598.1 135
 
0.7%
747.5 135
 
0.7%
지32 99
 
0.5%
327.87 97
 
0.5%
327.856 96
 
0.5%
594.5 80
 
0.4%
Other values (9791) 18578
92.8%
2023-12-12T18:47:33.392314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10018
 
9.5%
2 8866
 
8.4%
5 8712
 
8.3%
. 8258
 
7.8%
3 7586
 
7.2%
4 7027
 
6.7%
1 6606
 
6.3%
7 6407
 
6.1%
9 6148
 
5.8%
8 6078
 
5.8%
Other values (571) 29695
28.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 67789
64.3%
Other Letter 19335
 
18.3%
Space Separator 10018
 
9.5%
Other Punctuation 8258
 
7.8%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1725
 
8.9%
1089
 
5.6%
1087
 
5.6%
952
 
4.9%
870
 
4.5%
713
 
3.7%
572
 
3.0%
557
 
2.9%
528
 
2.7%
485
 
2.5%
Other values (558) 10757
55.6%
Decimal Number
ValueCountFrequency (%)
2 8866
13.1%
5 8712
12.9%
3 7586
11.2%
4 7027
10.4%
1 6606
9.7%
7 6407
9.5%
9 6148
9.1%
8 6078
9.0%
6 5651
8.3%
0 4708
6.9%
Space Separator
ValueCountFrequency (%)
10018
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8258
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 86066
81.7%
Hangul 19335
 
18.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1725
 
8.9%
1089
 
5.6%
1087
 
5.6%
952
 
4.9%
870
 
4.5%
713
 
3.7%
572
 
3.0%
557
 
2.9%
528
 
2.7%
485
 
2.5%
Other values (558) 10757
55.6%
Common
ValueCountFrequency (%)
10018
11.6%
2 8866
10.3%
5 8712
10.1%
. 8258
9.6%
3 7586
8.8%
4 7027
8.2%
1 6606
7.7%
7 6407
7.4%
9 6148
7.1%
8 6078
7.1%
Other values (3) 10360
12.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 86066
81.7%
Hangul 11134
 
10.6%
Compat Jamo 8201
 
7.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10018
11.6%
2 8866
10.3%
5 8712
10.1%
. 8258
9.6%
3 7586
8.8%
4 7027
8.2%
1 6606
7.7%
7 6407
7.4%
9 6148
7.1%
8 6078
7.1%
Other values (3) 10360
12.0%
Compat Jamo
ValueCountFrequency (%)
1725
21.0%
1087
13.3%
870
10.6%
713
8.7%
572
 
7.0%
557
 
6.8%
528
 
6.4%
485
 
5.9%
461
 
5.6%
324
 
4.0%
Other values (9) 879
10.7%
Hangul
ValueCountFrequency (%)
1089
 
9.8%
952
 
8.6%
419
 
3.8%
310
 
2.8%
244
 
2.2%
224
 
2.0%
209
 
1.9%
200
 
1.8%
192
 
1.7%
185
 
1.7%
Other values (539) 7110
63.9%

서명
Text

Distinct9842
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:47:33.816169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length50
Mean length12.7961
Min length1

Characters and Unicode

Total characters127961
Distinct characters1473
Distinct categories14 ?
Distinct scripts7 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9698 ?
Unique (%)97.0%

Sample

1st row(천재자산가 류근영의)억만장자처럼 행동하라
2nd row더 레코딩
3rd row퇴직 후의 인생설계 재무설계
4th row말레이시아
5th row(흐름으로 읽는)서양건축의 역사
ValueCountFrequency (%)
이야기 180
 
0.6%
나는 120
 
0.4%
위한 107
 
0.3%
어떻게 98
 
0.3%
영어 95
 
0.3%
92
 
0.3%
81
 
0.3%
역사 75
 
0.2%
여행 75
 
0.2%
74
 
0.2%
Other values (16288) 30256
96.8%
2023-12-12T18:47:34.401463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21263
 
16.6%
2665
 
2.1%
2533
 
2.0%
2185
 
1.7%
) 1886
 
1.5%
( 1884
 
1.5%
1558
 
1.2%
1349
 
1.1%
1308
 
1.0%
1307
 
1.0%
Other values (1463) 90023
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92155
72.0%
Space Separator 21263
 
16.6%
Lowercase Letter 4118
 
3.2%
Decimal Number 2783
 
2.2%
Close Punctuation 1902
 
1.5%
Open Punctuation 1900
 
1.5%
Other Punctuation 1819
 
1.4%
Uppercase Letter 1772
 
1.4%
Math Symbol 177
 
0.1%
Dash Punctuation 50
 
< 0.1%
Other values (4) 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2665
 
2.9%
2533
 
2.7%
2185
 
2.4%
1558
 
1.7%
1349
 
1.5%
1308
 
1.4%
1307
 
1.4%
1226
 
1.3%
1226
 
1.3%
1211
 
1.3%
Other values (1348) 75587
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 477
11.6%
o 373
 
9.1%
a 363
 
8.8%
i 333
 
8.1%
t 310
 
7.5%
n 299
 
7.3%
r 272
 
6.6%
s 236
 
5.7%
l 182
 
4.4%
c 142
 
3.4%
Other values (24) 1131
27.5%
Uppercase Letter
ValueCountFrequency (%)
S 169
 
9.5%
C 169
 
9.5%
T 127
 
7.2%
A 124
 
7.0%
E 110
 
6.2%
D 105
 
5.9%
I 90
 
5.1%
P 86
 
4.9%
M 82
 
4.6%
O 81
 
4.6%
Other values (16) 629
35.5%
Other Punctuation
ValueCountFrequency (%)
, 778
42.8%
. 293
 
16.1%
? 184
 
10.1%
! 159
 
8.7%
· 103
 
5.7%
: 97
 
5.3%
& 51
 
2.8%
' 45
 
2.5%
32
 
1.8%
% 24
 
1.3%
Other values (10) 53
 
2.9%
Decimal Number
ValueCountFrequency (%)
0 815
29.3%
1 653
23.5%
2 418
15.0%
3 244
 
8.8%
5 174
 
6.3%
4 127
 
4.6%
7 101
 
3.6%
6 95
 
3.4%
9 82
 
2.9%
8 74
 
2.7%
Other Symbol
ValueCountFrequency (%)
3
37.5%
2
25.0%
1
 
12.5%
° 1
 
12.5%
1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 1886
99.2%
] 14
 
0.7%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1884
99.2%
[ 14
 
0.7%
1
 
0.1%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 111
62.7%
+ 46
26.0%
~ 18
 
10.2%
× 2
 
1.1%
Letter Number
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
21263
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 5
100.0%
Other Number
ValueCountFrequency (%)
² 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 91863
71.8%
Common 29909
 
23.4%
Latin 5888
 
4.6%
Han 279
 
0.2%
Hiragana 11
 
< 0.1%
Greek 9
 
< 0.1%
Katakana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2665
 
2.9%
2533
 
2.8%
2185
 
2.4%
1558
 
1.7%
1349
 
1.5%
1308
 
1.4%
1307
 
1.4%
1226
 
1.3%
1226
 
1.3%
1211
 
1.3%
Other values (1152) 75295
82.0%
Han
ValueCountFrequency (%)
10
 
3.6%
7
 
2.5%
6
 
2.2%
5
 
1.8%
5
 
1.8%
5
 
1.8%
5
 
1.8%
4
 
1.4%
4
 
1.4%
4
 
1.4%
Other values (175) 224
80.3%
Latin
ValueCountFrequency (%)
e 477
 
8.1%
o 373
 
6.3%
a 363
 
6.2%
i 333
 
5.7%
t 310
 
5.3%
n 299
 
5.1%
r 272
 
4.6%
s 236
 
4.0%
l 182
 
3.1%
S 169
 
2.9%
Other values (46) 2874
48.8%
Common
ValueCountFrequency (%)
21263
71.1%
) 1886
 
6.3%
( 1884
 
6.3%
0 815
 
2.7%
, 778
 
2.6%
1 653
 
2.2%
2 418
 
1.4%
. 293
 
1.0%
3 244
 
0.8%
? 184
 
0.6%
Other values (41) 1491
 
5.0%
Hiragana
ValueCountFrequency (%)
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Greek
ValueCountFrequency (%)
α 2
22.2%
μ 1
11.1%
τ 1
11.1%
ρ 1
11.1%
κ 1
11.1%
ο 1
11.1%
η 1
11.1%
δ 1
11.1%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 91862
71.8%
ASCII 35619
 
27.8%
CJK 269
 
0.2%
None 172
 
0.1%
Hiragana 11
 
< 0.1%
CJK Compat Ideographs 10
 
< 0.1%
Number Forms 7
 
< 0.1%
Misc Symbols 4
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Katakana 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21263
59.7%
) 1886
 
5.3%
( 1884
 
5.3%
0 815
 
2.3%
, 778
 
2.2%
1 653
 
1.8%
e 477
 
1.3%
2 418
 
1.2%
o 373
 
1.0%
a 363
 
1.0%
Other values (76) 6709
 
18.8%
Hangul
ValueCountFrequency (%)
2665
 
2.9%
2533
 
2.8%
2185
 
2.4%
1558
 
1.7%
1349
 
1.5%
1308
 
1.4%
1307
 
1.4%
1226
 
1.3%
1226
 
1.3%
1211
 
1.3%
Other values (1151) 75294
82.0%
None
ValueCountFrequency (%)
· 103
59.9%
32
 
18.6%
16
 
9.3%
α 2
 
1.2%
² 2
 
1.2%
× 2
 
1.2%
2
 
1.2%
1
 
0.6%
1
 
0.6%
1
 
0.6%
Other values (10) 10
 
5.8%
CJK
ValueCountFrequency (%)
10
 
3.7%
7
 
2.6%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (167) 214
79.6%
Number Forms
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Misc Symbols
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Hiragana
ValueCountFrequency (%)
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8402
Distinct (%)84.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:47:34.801477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length3
Mean length4.6689
Min length2

Characters and Unicode

Total characters46689
Distinct characters890
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7312 ?
Unique (%)73.1%

Sample

1st row류근영
2nd row장인석
3rd row노순규
4th row양인선
5th row배대승
ValueCountFrequency (%)
74
 
0.5%
데이비드 43
 
0.3%
마이클 40
 
0.3%
리처드 35
 
0.3%
제임스 27
 
0.2%
j 26
 
0.2%
마크 25
 
0.2%
피터 23
 
0.2%
a 22
 
0.2%
22
 
0.2%
Other values (9440) 13146
97.5%
2023-12-12T18:47:35.367507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3484
 
7.5%
+ 2121
 
4.5%
1811
 
3.9%
1257
 
2.7%
1067
 
2.3%
761
 
1.6%
666
 
1.4%
569
 
1.2%
474
 
1.0%
442
 
0.9%
Other values (880) 34037
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40593
86.9%
Space Separator 3484
 
7.5%
Math Symbol 2125
 
4.6%
Uppercase Letter 290
 
0.6%
Lowercase Letter 95
 
0.2%
Other Punctuation 56
 
0.1%
Dash Punctuation 29
 
0.1%
Decimal Number 7
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1811
 
4.5%
1257
 
3.1%
1067
 
2.6%
761
 
1.9%
666
 
1.6%
569
 
1.4%
474
 
1.2%
442
 
1.1%
433
 
1.1%
406
 
1.0%
Other values (821) 32707
80.6%
Uppercase Letter
ValueCountFrequency (%)
J 37
12.8%
A 30
 
10.3%
L 23
 
7.9%
C 22
 
7.6%
S 19
 
6.6%
R 19
 
6.6%
M 16
 
5.5%
B 16
 
5.5%
H 15
 
5.2%
E 15
 
5.2%
Other values (13) 78
26.9%
Lowercase Letter
ValueCountFrequency (%)
a 10
10.5%
i 10
10.5%
n 9
9.5%
d 7
 
7.4%
h 7
 
7.4%
u 7
 
7.4%
e 7
 
7.4%
s 6
 
6.3%
r 6
 
6.3%
o 5
 
5.3%
Other values (10) 21
22.1%
Decimal Number
ValueCountFrequency (%)
1 3
42.9%
8 2
28.6%
2 1
 
14.3%
9 1
 
14.3%
Math Symbol
ValueCountFrequency (%)
+ 2121
99.8%
< 2
 
0.1%
> 2
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 50
89.3%
· 4
 
7.1%
& 2
 
3.6%
Open Punctuation
ValueCountFrequency (%)
[ 4
80.0%
( 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
] 4
80.0%
) 1
 
20.0%
Space Separator
ValueCountFrequency (%)
3484
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40585
86.9%
Common 5711
 
12.2%
Latin 385
 
0.8%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1811
 
4.5%
1257
 
3.1%
1067
 
2.6%
761
 
1.9%
666
 
1.6%
569
 
1.4%
474
 
1.2%
442
 
1.1%
433
 
1.1%
406
 
1.0%
Other values (813) 32699
80.6%
Latin
ValueCountFrequency (%)
J 37
 
9.6%
A 30
 
7.8%
L 23
 
6.0%
C 22
 
5.7%
S 19
 
4.9%
R 19
 
4.9%
M 16
 
4.2%
B 16
 
4.2%
H 15
 
3.9%
E 15
 
3.9%
Other values (33) 173
44.9%
Common
ValueCountFrequency (%)
3484
61.0%
+ 2121
37.1%
. 50
 
0.9%
- 29
 
0.5%
[ 4
 
0.1%
] 4
 
0.1%
· 4
 
0.1%
1 3
 
0.1%
< 2
 
< 0.1%
> 2
 
< 0.1%
Other values (6) 8
 
0.1%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40585
86.9%
ASCII 6092
 
13.0%
CJK 8
 
< 0.1%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3484
57.2%
+ 2121
34.8%
. 50
 
0.8%
J 37
 
0.6%
A 30
 
0.5%
- 29
 
0.5%
L 23
 
0.4%
C 22
 
0.4%
S 19
 
0.3%
R 19
 
0.3%
Other values (48) 258
 
4.2%
Hangul
ValueCountFrequency (%)
1811
 
4.5%
1257
 
3.1%
1067
 
2.6%
761
 
1.9%
666
 
1.6%
569
 
1.4%
474
 
1.2%
442
 
1.1%
433
 
1.1%
406
 
1.0%
Other values (813) 32699
80.6%
None
ValueCountFrequency (%)
· 4
100.0%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct3042
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:47:35.717789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length4.3606
Min length1

Characters and Unicode

Total characters43606
Distinct characters812
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1673 ?
Unique (%)16.7%

Sample

1st row날개미디어
2nd rowSRmusic
3rd row한국기업경영연구원
4th row꿈의지도
5th row대가
ValueCountFrequency (%)
21세기북스 80
 
0.8%
김영사 75
 
0.7%
지식을만드는지식 73
 
0.7%
한빛미디어 69
 
0.7%
길벗 63
 
0.6%
한스미디어 60
 
0.6%
살림 60
 
0.6%
학지사 57
 
0.6%
자음과모음 56
 
0.5%
매일경제신문사 54
 
0.5%
Other values (3079) 9637
93.7%
2023-12-12T18:47:36.229154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1958
 
4.5%
1354
 
3.1%
1218
 
2.8%
1000
 
2.3%
710
 
1.6%
o 640
 
1.5%
624
 
1.4%
606
 
1.4%
571
 
1.3%
563
 
1.3%
Other values (802) 34362
78.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37605
86.2%
Lowercase Letter 3724
 
8.5%
Uppercase Letter 1577
 
3.6%
Space Separator 284
 
0.7%
Decimal Number 248
 
0.6%
Other Punctuation 111
 
0.3%
Close Punctuation 23
 
0.1%
Open Punctuation 23
 
0.1%
Dash Punctuation 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1958
 
5.2%
1354
 
3.6%
1218
 
3.2%
1000
 
2.7%
710
 
1.9%
624
 
1.7%
606
 
1.6%
571
 
1.5%
563
 
1.5%
555
 
1.5%
Other values (727) 28446
75.6%
Uppercase Letter
ValueCountFrequency (%)
B 264
16.7%
M 140
 
8.9%
S 128
 
8.1%
K 111
 
7.0%
R 85
 
5.4%
H 82
 
5.2%
O 81
 
5.1%
C 79
 
5.0%
E 71
 
4.5%
P 65
 
4.1%
Other values (16) 471
29.9%
Lowercase Letter
ValueCountFrequency (%)
o 640
17.2%
s 365
9.8%
e 324
 
8.7%
a 270
 
7.3%
i 255
 
6.8%
n 254
 
6.8%
k 250
 
6.7%
b 185
 
5.0%
l 152
 
4.1%
r 152
 
4.1%
Other values (15) 877
23.5%
Decimal Number
ValueCountFrequency (%)
2 104
41.9%
1 95
38.3%
3 17
 
6.9%
0 12
 
4.8%
6 6
 
2.4%
5 6
 
2.4%
4 4
 
1.6%
8 2
 
0.8%
9 1
 
0.4%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 34
30.6%
& 29
26.1%
# 18
16.2%
: 11
 
9.9%
10
 
9.0%
· 6
 
5.4%
, 2
 
1.8%
' 1
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 17
73.9%
] 6
 
26.1%
Open Punctuation
ValueCountFrequency (%)
( 17
73.9%
[ 6
 
26.1%
Space Separator
ValueCountFrequency (%)
284
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37412
85.8%
Latin 5301
 
12.2%
Common 700
 
1.6%
Han 193
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1958
 
5.2%
1354
 
3.6%
1218
 
3.3%
1000
 
2.7%
710
 
1.9%
624
 
1.7%
606
 
1.6%
571
 
1.5%
563
 
1.5%
555
 
1.5%
Other values (656) 28253
75.5%
Han
ValueCountFrequency (%)
27
 
14.0%
22
 
11.4%
7
 
3.6%
7
 
3.6%
6
 
3.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.1%
Other values (61) 100
51.8%
Latin
ValueCountFrequency (%)
o 640
 
12.1%
s 365
 
6.9%
e 324
 
6.1%
a 270
 
5.1%
B 264
 
5.0%
i 255
 
4.8%
n 254
 
4.8%
k 250
 
4.7%
b 185
 
3.5%
l 152
 
2.9%
Other values (41) 2342
44.2%
Common
ValueCountFrequency (%)
284
40.6%
2 104
 
14.9%
1 95
 
13.6%
. 34
 
4.9%
& 29
 
4.1%
# 18
 
2.6%
) 17
 
2.4%
3 17
 
2.4%
( 17
 
2.4%
0 12
 
1.7%
Other values (14) 73
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37405
85.8%
ASCII 5985
 
13.7%
CJK 192
 
0.4%
None 16
 
< 0.1%
Compat Jamo 7
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1958
 
5.2%
1354
 
3.6%
1218
 
3.3%
1000
 
2.7%
710
 
1.9%
624
 
1.7%
606
 
1.6%
571
 
1.5%
563
 
1.5%
555
 
1.5%
Other values (651) 28246
75.5%
ASCII
ValueCountFrequency (%)
o 640
 
10.7%
s 365
 
6.1%
e 324
 
5.4%
284
 
4.7%
a 270
 
4.5%
B 264
 
4.4%
i 255
 
4.3%
n 254
 
4.2%
k 250
 
4.2%
b 185
 
3.1%
Other values (63) 2894
48.4%
CJK
ValueCountFrequency (%)
27
 
14.1%
22
 
11.5%
7
 
3.6%
7
 
3.6%
6
 
3.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.1%
Other values (60) 99
51.6%
None
ValueCountFrequency (%)
10
62.5%
· 6
37.5%
Compat Jamo
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

분관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
중앙도서관
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙도서관
2nd row중앙도서관
3rd row중앙도서관
4th row중앙도서관
5th row중앙도서관

Common Values

ValueCountFrequency (%)
중앙도서관 10000
100.0%

Length

2023-12-12T18:47:36.382977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:36.482762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중앙도서관 10000
100.0%

서고
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반자료실1(2층)
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반자료실1(2층)
2nd row일반자료실1(2층)
3rd row일반자료실1(2층)
4th row일반자료실1(2층)
5th row일반자료실1(2층)

Common Values

ValueCountFrequency (%)
일반자료실1(2층) 10000
100.0%

Length

2023-12-12T18:47:36.599670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:36.696596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반자료실1(2층 10000
100.0%

소장상태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
배가완료
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row배가완료
2nd row배가완료
3rd row배가완료
4th row배가완료
5th row배가완료

Common Values

ValueCountFrequency (%)
배가완료 10000
100.0%

Length

2023-12-12T18:47:36.799024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:36.907399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
배가완료 10000
100.0%

언어
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국내
9964 
국외
 
36

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내
2nd row국내
3rd row국내
4th row국내
5th row국내

Common Values

ValueCountFrequency (%)
국내 9964
99.6%
국외 36
 
0.4%

Length

2023-12-12T18:47:37.004050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:37.121710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내 9964
99.6%
국외 36
 
0.4%

서지유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반 단행본
9994 
아동도서
 
6

Length

Max length6
Median length6
Mean length5.9988
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반 단행본
2nd row일반 단행본
3rd row일반 단행본
4th row일반 단행본
5th row일반 단행본

Common Values

ValueCountFrequency (%)
일반 단행본 9994
99.9%
아동도서 6
 
0.1%

Length

2023-12-12T18:47:37.239539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:37.356960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9994
50.0%
단행본 9994
50.0%
아동도서 6
 
< 0.1%

딸림여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9468 
True
 
532
ValueCountFrequency (%)
False 9468
94.7%
True 532
 
5.3%
2023-12-12T18:47:37.457292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct1738
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2008-05-01 00:00:00
Maximum2023-11-23 00:00:00
2023-12-12T18:47:37.572149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:47:37.722523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:47:30.582886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:47:37.860285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서지번호언어서지유형딸림여부
서지번호1.0000.0300.0440.167
언어0.0301.0000.0000.039
서지유형0.0440.0001.0000.000
딸림여부0.1670.0390.0001.000
2023-12-12T18:47:37.959909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
언어서지유형딸림여부
언어1.0000.0000.025
서지유형0.0001.0000.000
딸림여부0.0250.0001.000
2023-12-12T18:47:38.044621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서지번호언어서지유형딸림여부
서지번호1.0000.0230.0330.128
언어0.0231.0000.0000.025
서지유형0.0330.0001.0000.000
딸림여부0.1280.0250.0001.000

Missing values

2023-12-12T18:47:30.772249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:47:30.987144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호서지번호청구기호서명저자발행처분관서고소장상태언어서지유형딸림여부최종수정일
50712JE1689761920689325.04 류17ㅇ(천재자산가 류근영의)억만장자처럼 행동하라류근영날개미디어중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2016-06-16
43127JE1193201146465567.43 장68ㄷ더 레코딩장인석SRmusic중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2017-04-30
43796JE1139261134569327.04 노56ㅌ퇴직 후의 인생설계 재무설계노순규한국기업경영연구원중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2014-04-27
70055JE2260202788553981.4602 양68마말레이시아양인선꿈의지도중앙도서관일반자료실1(2층)배가완료국내일반 단행본Y2019-10-23
35367JE1458141625948610.92 배23ㅅ(흐름으로 읽는)서양건축의 역사배대승대가중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2014-11-27
70299JE2212232740311235.7 프292ㅅ삶을 살리는 교육프란치스코+ 교황가톨릭대학교출판부중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2019-07-03
38973JE1474171632190982.702 문75ㅇ외로움, 힘껏 껴안다문종성어문학사중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2015-05-30
70891JE2297292822330747.5 이63읻Ashley의 사춘기 친구들을 위한 영어 회화 책이+ 애슐리아우룸중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2020-02-19
42148JE1187851147580747 와68ㅂ(CNN이 잘 들리는)받아쓰기 훈련와이비엠YBM중앙도서관일반자료실1(2층)배가완료국내일반 단행본Y2012-08-27
53243JE1712631945033376.5441 조62ㅈ중학 도형만점 공부법조안호행복한나무중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2016-10-25
등록번호서지번호청구기호서명저자발행처분관서고소장상태언어서지유형딸림여부최종수정일
24339JE1529821698110513.8914 바56ㅈ(자기심리학에 따른)정신치료바쉬+ 마이클 프란츠중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2015-05-11
10052JE096975862184321.329 부25부동산 설득심리이창석신광문화사중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2011-08-29
61276JE2104722608171982.7902 서63ㅅ세상의 서쪽 끝, 포르투갈서양수홍익중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2018-10-01
53JE2078122597637340.925 마48ㅇ어쨌거나 핑퐁마빌돌베개중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2018-07-26
66262JE2553013174222606.942 선25ㄱ그림들:모마 미술관 도슨트북선 도슨트나무의마음중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2022-04-07
54151JE1773971969407688.019 정44ㅊ천만 관객의 영화 천만 표의 정치정병기갈무리중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2016-12-27
38957JE1411831599447594.019 까192ㅎ홋카이도에 먹으러 가자까날니들북중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2014-07-22
15572JE089462833098005.3 최76어(내 업무 반으로 줄이는)엑셀 2010최준선멘토르중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2011-01-24
33524JE1560731724260024.4 이82ㅈ자료분류론이창수한국도서관협회중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2015-07-20
50196JE1723221956254029.8 이52ㄷ독서자본이상민서울문화사중앙도서관일반자료실1(2층)배가완료국내일반 단행본N2016-11-29