Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Numeric2
Text5
Categorical1

Dataset

Description한국수력원자력 경주 본사에서 운영하는 도서관(청심재) 소장자료 목록입니다. 책, CD, DVD의 서명 및 저작자, 청구기호 등을 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15119602/fileData.do

Alerts

비고 is highly imbalanced (86.0%)Imbalance
번호 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:50:34.884240
Analysis finished2023-12-12 10:50:41.539854
Duration6.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13078.029
Minimum1
Maximum26235
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T19:50:41.733746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1283.75
Q16456
median13118.5
Q319571.25
95-th percentile24852.2
Maximum26235
Range26234
Interquartile range (IQR)13115.25

Descriptive statistics

Standard deviation7569.183
Coefficient of variation (CV)0.57877095
Kurtosis-1.1955439
Mean13078.029
Median Absolute Deviation (MAD)6551
Skewness-0.0030343537
Sum1.3078029 × 108
Variance57292531
MonotonicityNot monotonic
2023-12-12T19:50:42.081670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16321 1
 
< 0.1%
5150 1
 
< 0.1%
12224 1
 
< 0.1%
11570 1
 
< 0.1%
13611 1
 
< 0.1%
19706 1
 
< 0.1%
16149 1
 
< 0.1%
6847 1
 
< 0.1%
16784 1
 
< 0.1%
23426 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
3 1
< 0.1%
5 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
16 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
19 1
< 0.1%
20 1
< 0.1%
ValueCountFrequency (%)
26235 1
< 0.1%
26227 1
< 0.1%
26225 1
< 0.1%
26220 1
< 0.1%
26219 1
< 0.1%
26215 1
< 0.1%
26214 1
< 0.1%
26213 1
< 0.1%
26210 1
< 0.1%
26208 1
< 0.1%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:50:42.548193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowKM0000015376
2nd rowKM0000003983
3rd rowDV0000000195
4th rowKM0000004146
5th rowKM0000018014
ValueCountFrequency (%)
km0000015376 1
 
< 0.1%
km0000005900 1
 
< 0.1%
km0000008790 1
 
< 0.1%
km0000010311 1
 
< 0.1%
km0000011278 1
 
< 0.1%
km0000010624 1
 
< 0.1%
km0000012665 1
 
< 0.1%
km0000018761 1
 
< 0.1%
km0000015204 1
 
< 0.1%
km0000004203 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T19:50:43.170422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 58834
49.0%
M 9672
 
8.1%
K 9625
 
8.0%
1 8013
 
6.7%
2 6233
 
5.2%
4 4146
 
3.5%
3 4058
 
3.4%
5 3915
 
3.3%
8 3757
 
3.1%
6 3701
 
3.1%
Other values (6) 8046
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 58834
58.8%
1 8013
 
8.0%
2 6233
 
6.2%
4 4146
 
4.1%
3 4058
 
4.1%
5 3915
 
3.9%
8 3757
 
3.8%
6 3701
 
3.7%
9 3675
 
3.7%
7 3668
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
M 9672
48.4%
K 9625
48.1%
D 328
 
1.6%
V 298
 
1.5%
E 47
 
0.2%
C 30
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 58834
58.8%
1 8013
 
8.0%
2 6233
 
6.2%
4 4146
 
4.1%
3 4058
 
4.1%
5 3915
 
3.9%
8 3757
 
3.8%
6 3701
 
3.7%
9 3675
 
3.7%
7 3668
 
3.7%
Latin
ValueCountFrequency (%)
M 9672
48.4%
K 9625
48.1%
D 328
 
1.6%
V 298
 
1.5%
E 47
 
0.2%
C 30
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 58834
49.0%
M 9672
 
8.1%
K 9625
 
8.0%
1 8013
 
6.7%
2 6233
 
5.2%
4 4146
 
3.5%
3 4058
 
3.4%
5 3915
 
3.3%
8 3757
 
3.1%
6 3701
 
3.1%
Other values (6) 8046
 
6.7%

서명
Text

Distinct9925
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:50:43.834501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length184
Median length93
Mean length24.2809
Min length1

Characters and Unicode

Total characters242809
Distinct characters1818
Distinct categories17 ?
Distinct scripts7 ?
Distinct blocks16 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9861 ?
Unique (%)98.6%

Sample

1st row에니그마 : 로버트 해리스 장편소설
2nd row인권의 정치학 : 국가권력과 인권
3rd row(EBS 영상위인전)한국인물사. 2
4th row단체법 = Organization : 민법상 사단을 중심으로
5th row(생각 없이 준비 없이 떠나는 초간편)당일치기 총알여행 : '3분카레'처럼 간편한 초간단 여행 레시피
ValueCountFrequency (%)
6574
 
10.5%
장편소설 802
 
1.3%
이야기 388
 
0.6%
1 358
 
0.6%
2 354
 
0.6%
위한 218
 
0.3%
of 171
 
0.3%
역사 165
 
0.3%
161
 
0.3%
the 149
 
0.2%
Other values (24789) 53382
85.1%
2023-12-12T19:50:44.904270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53125
 
21.9%
: 6180
 
2.5%
4799
 
2.0%
3810
 
1.6%
2962
 
1.2%
e 2456
 
1.0%
2384
 
1.0%
2144
 
0.9%
2059
 
0.8%
2055
 
0.8%
Other values (1808) 160835
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 147130
60.6%
Space Separator 53125
 
21.9%
Lowercase Letter 20552
 
8.5%
Other Punctuation 10526
 
4.3%
Decimal Number 4862
 
2.0%
Uppercase Letter 2604
 
1.1%
Close Punctuation 1390
 
0.6%
Open Punctuation 1390
 
0.6%
Math Symbol 1028
 
0.4%
Dash Punctuation 162
 
0.1%
Other values (7) 40
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4799
 
3.3%
3810
 
2.6%
2962
 
2.0%
2384
 
1.6%
2144
 
1.5%
2059
 
1.4%
2055
 
1.4%
1997
 
1.4%
1985
 
1.3%
1806
 
1.2%
Other values (1663) 121129
82.3%
Lowercase Letter
ValueCountFrequency (%)
e 2456
12.0%
o 1813
 
8.8%
a 1742
 
8.5%
i 1604
 
7.8%
n 1526
 
7.4%
r 1502
 
7.3%
t 1422
 
6.9%
s 1289
 
6.3%
l 943
 
4.6%
c 735
 
3.6%
Other values (39) 5520
26.9%
Uppercase Letter
ValueCountFrequency (%)
S 257
 
9.9%
T 250
 
9.6%
E 169
 
6.5%
A 165
 
6.3%
C 160
 
6.1%
B 156
 
6.0%
M 125
 
4.8%
H 115
 
4.4%
I 115
 
4.4%
L 106
 
4.1%
Other values (21) 986
37.9%
Other Punctuation
ValueCountFrequency (%)
: 6180
58.7%
, 1910
 
18.1%
. 1525
 
14.5%
· 296
 
2.8%
! 282
 
2.7%
' 164
 
1.6%
& 65
 
0.6%
/ 31
 
0.3%
% 24
 
0.2%
; 17
 
0.2%
Other values (5) 32
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 1333
27.4%
2 860
17.7%
0 841
17.3%
3 467
 
9.6%
5 324
 
6.7%
4 316
 
6.5%
6 200
 
4.1%
9 194
 
4.0%
7 175
 
3.6%
8 152
 
3.1%
Math Symbol
ValueCountFrequency (%)
= 936
91.1%
~ 46
 
4.5%
+ 14
 
1.4%
> 11
 
1.1%
< 11
 
1.1%
3
 
0.3%
× 3
 
0.3%
| 3
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1344
96.7%
18
 
1.3%
] 17
 
1.2%
6
 
0.4%
3
 
0.2%
1
 
0.1%
} 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1344
96.7%
18
 
1.3%
[ 17
 
1.2%
6
 
0.4%
3
 
0.2%
1
 
0.1%
{ 1
 
0.1%
Letter Number
ValueCountFrequency (%)
10
50.0%
5
25.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
Other Symbol
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
® 1
14.3%
1
14.3%
Space Separator
ValueCountFrequency (%)
53125
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 162
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 146221
60.2%
Common 72503
29.9%
Latin 23035
 
9.5%
Han 901
 
0.4%
Cyrillic 141
 
0.1%
Katakana 4
 
< 0.1%
Hiragana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4799
 
3.3%
3810
 
2.6%
2962
 
2.0%
2384
 
1.6%
2144
 
1.5%
2059
 
1.4%
2055
 
1.4%
1997
 
1.4%
1985
 
1.4%
1806
 
1.2%
Other values (1269) 120220
82.2%
Han
ValueCountFrequency (%)
27
 
3.0%
27
 
3.0%
20
 
2.2%
20
 
2.2%
19
 
2.1%
18
 
2.0%
18
 
2.0%
16
 
1.8%
16
 
1.8%
14
 
1.6%
Other values (377) 706
78.4%
Common
ValueCountFrequency (%)
53125
73.3%
: 6180
 
8.5%
, 1910
 
2.6%
. 1525
 
2.1%
) 1344
 
1.9%
( 1344
 
1.9%
1 1333
 
1.8%
= 936
 
1.3%
2 860
 
1.2%
0 841
 
1.2%
Other values (50) 3105
 
4.3%
Latin
ValueCountFrequency (%)
e 2456
 
10.7%
o 1813
 
7.9%
a 1742
 
7.6%
i 1604
 
7.0%
n 1526
 
6.6%
r 1502
 
6.5%
t 1422
 
6.2%
s 1289
 
5.6%
l 943
 
4.1%
c 735
 
3.2%
Other values (47) 8003
34.7%
Cyrillic
ValueCountFrequency (%)
с 21
14.9%
к 15
10.6%
о 13
 
9.2%
р 12
 
8.5%
и 9
 
6.4%
й 8
 
5.7%
е 8
 
5.7%
у 7
 
5.0%
а 6
 
4.3%
в 6
 
4.3%
Other values (18) 36
25.5%
Hiragana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 146213
60.2%
ASCII 95131
39.2%
CJK 871
 
0.4%
None 364
 
0.1%
Cyrillic 141
 
0.1%
CJK Compat Ideographs 30
 
< 0.1%
Number Forms 20
 
< 0.1%
Punctuation 16
 
< 0.1%
Compat Jamo 8
 
< 0.1%
Katakana 4
 
< 0.1%
Other values (6) 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53125
55.8%
: 6180
 
6.5%
e 2456
 
2.6%
, 1910
 
2.0%
o 1813
 
1.9%
a 1742
 
1.8%
i 1604
 
1.7%
n 1526
 
1.6%
. 1525
 
1.6%
r 1502
 
1.6%
Other values (80) 21748
22.9%
Hangul
ValueCountFrequency (%)
4799
 
3.3%
3810
 
2.6%
2962
 
2.0%
2384
 
1.6%
2144
 
1.5%
2059
 
1.4%
2055
 
1.4%
1997
 
1.4%
1985
 
1.4%
1806
 
1.2%
Other values (1265) 120212
82.2%
None
ValueCountFrequency (%)
· 296
81.3%
18
 
4.9%
18
 
4.9%
6
 
1.6%
6
 
1.6%
4
 
1.1%
3
 
0.8%
3
 
0.8%
3
 
0.8%
× 3
 
0.8%
Other values (4) 4
 
1.1%
CJK
ValueCountFrequency (%)
27
 
3.1%
27
 
3.1%
20
 
2.3%
20
 
2.3%
19
 
2.2%
18
 
2.1%
18
 
2.1%
16
 
1.8%
16
 
1.8%
14
 
1.6%
Other values (355) 676
77.6%
Cyrillic
ValueCountFrequency (%)
с 21
14.9%
к 15
10.6%
о 13
 
9.2%
р 12
 
8.5%
и 9
 
6.4%
й 8
 
5.7%
е 8
 
5.7%
у 7
 
5.0%
а 6
 
4.3%
в 6
 
4.3%
Other values (18) 36
25.5%
Punctuation
ValueCountFrequency (%)
10
62.5%
4
 
25.0%
2
 
12.5%
Number Forms
ValueCountFrequency (%)
10
50.0%
5
25.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
Compat Jamo
ValueCountFrequency (%)
5
62.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
CJK Compat Ideographs
ValueCountFrequency (%)
5
16.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (12) 12
40.0%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Box Drawing
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8552
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:50:45.700708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length279
Median length199
Mean length14.4303
Min length2

Characters and Unicode

Total characters144303
Distinct characters1068
Distinct categories10 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7759 ?
Unique (%)77.6%

Sample

1st row로버트 해리스 지음 ; 조영학 옮김
2nd rowSabine C. Carey ; Mark Gibney ; Steven C. Poe [공]지음 ; 임상순 옮김
3rd rowEBS 기획
4th row宋五植 著
5th row신익수 지음
ValueCountFrequency (%)
지음 7171
 
16.2%
6494
 
14.6%
옮김 3518
 
7.9%
그림 826
 
1.9%
658
 
1.5%
공]지음 470
 
1.1%
314
 
0.7%
엮음 215
 
0.5%
감독 194
 
0.4%
181
 
0.4%
Other values (12858) 24329
54.8%
2023-12-12T19:50:46.906448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34422
23.9%
8431
 
5.8%
7943
 
5.5%
; 6489
 
4.5%
5937
 
4.1%
3747
 
2.6%
3173
 
2.2%
1596
 
1.1%
1410
 
1.0%
1255
 
0.9%
Other values (1058) 69900
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95719
66.3%
Space Separator 34422
 
23.9%
Other Punctuation 7863
 
5.4%
Lowercase Letter 2666
 
1.8%
Close Punctuation 1177
 
0.8%
Uppercase Letter 1177
 
0.8%
Open Punctuation 1176
 
0.8%
Decimal Number 44
 
< 0.1%
Dash Punctuation 37
 
< 0.1%
Math Symbol 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8431
 
8.8%
7943
 
8.3%
5937
 
6.2%
3747
 
3.9%
3173
 
3.3%
1596
 
1.7%
1410
 
1.5%
1255
 
1.3%
1194
 
1.2%
1124
 
1.2%
Other values (967) 59909
62.6%
Lowercase Letter
ValueCountFrequency (%)
e 343
12.9%
a 283
10.6%
r 216
 
8.1%
i 208
 
7.8%
o 199
 
7.5%
n 198
 
7.4%
t 176
 
6.6%
l 132
 
5.0%
d 104
 
3.9%
c 102
 
3.8%
Other values (24) 705
26.4%
Uppercase Letter
ValueCountFrequency (%)
S 151
12.8%
B 133
11.3%
K 91
 
7.7%
E 88
 
7.5%
A 82
 
7.0%
J 74
 
6.3%
R 74
 
6.3%
M 70
 
5.9%
H 57
 
4.8%
C 54
 
4.6%
Other values (17) 303
25.7%
Decimal Number
ValueCountFrequency (%)
3 9
20.5%
1 9
20.5%
0 8
18.2%
2 8
18.2%
7 4
9.1%
4 3
 
6.8%
8 1
 
2.3%
9 1
 
2.3%
6 1
 
2.3%
Other Punctuation
ValueCountFrequency (%)
; 6489
82.5%
. 413
 
5.3%
, 350
 
4.5%
· 311
 
4.0%
: 292
 
3.7%
/ 6
 
0.1%
& 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 1153
98.0%
) 17
 
1.4%
4
 
0.3%
2
 
0.2%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
[ 1152
98.0%
( 17
 
1.4%
4
 
0.3%
2
 
0.2%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
< 11
50.0%
> 11
50.0%
Space Separator
ValueCountFrequency (%)
34422
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95587
66.2%
Common 44741
31.0%
Latin 3831
 
2.7%
Han 130
 
0.1%
Cyrillic 12
 
< 0.1%
Katakana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8431
 
8.8%
7943
 
8.3%
5937
 
6.2%
3747
 
3.9%
3173
 
3.3%
1596
 
1.7%
1410
 
1.5%
1255
 
1.3%
1194
 
1.2%
1124
 
1.2%
Other values (874) 59777
62.5%
Han
ValueCountFrequency (%)
16
 
12.3%
5
 
3.8%
4
 
3.1%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (81) 89
68.5%
Latin
ValueCountFrequency (%)
e 343
 
9.0%
a 283
 
7.4%
r 216
 
5.6%
i 208
 
5.4%
o 199
 
5.2%
n 198
 
5.2%
t 176
 
4.6%
S 151
 
3.9%
B 133
 
3.5%
l 132
 
3.4%
Other values (39) 1792
46.8%
Common
ValueCountFrequency (%)
34422
76.9%
; 6489
 
14.5%
] 1153
 
2.6%
[ 1152
 
2.6%
. 413
 
0.9%
, 350
 
0.8%
· 311
 
0.7%
: 292
 
0.7%
- 37
 
0.1%
( 17
 
< 0.1%
Other values (20) 105
 
0.2%
Cyrillic
ValueCountFrequency (%)
С 1
8.3%
Б 1
8.3%
р 1
8.3%
и 1
8.3%
ч 1
8.3%
е 1
8.3%
н 1
8.3%
к 1
8.3%
о 1
8.3%
в 1
8.3%
Other values (2) 2
16.7%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95586
66.2%
ASCII 48247
33.4%
None 325
 
0.2%
CJK 129
 
0.1%
Cyrillic 12
 
< 0.1%
Katakana 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34422
71.3%
; 6489
 
13.4%
] 1153
 
2.4%
[ 1152
 
2.4%
. 413
 
0.9%
, 350
 
0.7%
e 343
 
0.7%
: 292
 
0.6%
a 283
 
0.6%
r 216
 
0.4%
Other values (62) 3134
 
6.5%
Hangul
ValueCountFrequency (%)
8431
 
8.8%
7943
 
8.3%
5937
 
6.2%
3747
 
3.9%
3173
 
3.3%
1596
 
1.7%
1410
 
1.5%
1255
 
1.3%
1194
 
1.2%
1124
 
1.2%
Other values (873) 59776
62.5%
None
ValueCountFrequency (%)
· 311
95.7%
4
 
1.2%
4
 
1.2%
2
 
0.6%
2
 
0.6%
1
 
0.3%
1
 
0.3%
CJK
ValueCountFrequency (%)
16
 
12.4%
5
 
3.9%
4
 
3.1%
3
 
2.3%
3
 
2.3%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (80) 88
68.2%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Cyrillic
ValueCountFrequency (%)
С 1
8.3%
Б 1
8.3%
р 1
8.3%
и 1
8.3%
ч 1
8.3%
е 1
8.3%
н 1
8.3%
к 1
8.3%
о 1
8.3%
в 1
8.3%
Other values (2) 2
16.7%
Distinct2877
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:50:47.553052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length29
Mean length5.1331
Min length1

Characters and Unicode

Total characters51331
Distinct characters774
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1508 ?
Unique (%)15.1%

Sample

1st rowRHK(알에이치코리아)
2nd row북스힐
3rd rowEBS 미디어센터 [공급]
4th row전남대학교출판부
5th row생각정거장
ValueCountFrequency (%)
문학동네 330
 
3.1%
민음사 231
 
2.2%
제작 124
 
1.2%
창비 98
 
0.9%
rhk(알에이치코리아 95
 
0.9%
지식을만드는지식 73
 
0.7%
서울문화사 70
 
0.7%
시공사 60
 
0.6%
김영사 59
 
0.6%
열린책들 58
 
0.5%
Other values (2935) 9412
88.7%
2023-12-12T19:50:48.541523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1677
 
3.3%
1549
 
3.0%
1279
 
2.5%
1242
 
2.4%
( 1154
 
2.2%
) 1154
 
2.2%
983
 
1.9%
946
 
1.8%
807
 
1.6%
690
 
1.3%
Other values (764) 39850
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43122
84.0%
Lowercase Letter 3043
 
5.9%
Uppercase Letter 1530
 
3.0%
Open Punctuation 1351
 
2.6%
Close Punctuation 1351
 
2.6%
Space Separator 611
 
1.2%
Decimal Number 196
 
0.4%
Other Punctuation 118
 
0.2%
Dash Punctuation 7
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1677
 
3.9%
1549
 
3.6%
1279
 
3.0%
1242
 
2.9%
983
 
2.3%
946
 
2.2%
807
 
1.9%
690
 
1.6%
670
 
1.6%
668
 
1.5%
Other values (688) 32611
75.6%
Uppercase Letter
ValueCountFrequency (%)
B 205
13.4%
K 159
10.4%
H 151
9.9%
S 139
9.1%
R 138
9.0%
E 109
 
7.1%
M 97
 
6.3%
P 84
 
5.5%
C 58
 
3.8%
U 55
 
3.6%
Other values (16) 335
21.9%
Lowercase Letter
ValueCountFrequency (%)
o 500
16.4%
s 345
11.3%
e 261
 
8.6%
i 259
 
8.5%
r 230
 
7.6%
a 194
 
6.4%
n 182
 
6.0%
k 171
 
5.6%
b 134
 
4.4%
t 99
 
3.3%
Other values (15) 668
22.0%
Decimal Number
ValueCountFrequency (%)
2 78
39.8%
1 67
34.2%
0 19
 
9.7%
3 12
 
6.1%
4 6
 
3.1%
5 5
 
2.6%
6 4
 
2.0%
9 4
 
2.0%
8 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
& 41
34.7%
· 23
19.5%
. 19
16.1%
# 18
15.3%
: 8
 
6.8%
/ 4
 
3.4%
, 3
 
2.5%
; 1
 
0.8%
' 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 1154
85.4%
[ 197
 
14.6%
Close Punctuation
ValueCountFrequency (%)
) 1154
85.4%
] 197
 
14.6%
Space Separator
ValueCountFrequency (%)
611
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43026
83.8%
Latin 4573
 
8.9%
Common 3636
 
7.1%
Han 96
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1677
 
3.9%
1549
 
3.6%
1279
 
3.0%
1242
 
2.9%
983
 
2.3%
946
 
2.2%
807
 
1.9%
690
 
1.6%
670
 
1.6%
668
 
1.6%
Other values (645) 32515
75.6%
Latin
ValueCountFrequency (%)
o 500
 
10.9%
s 345
 
7.5%
e 261
 
5.7%
i 259
 
5.7%
r 230
 
5.0%
B 205
 
4.5%
a 194
 
4.2%
n 182
 
4.0%
k 171
 
3.7%
K 159
 
3.5%
Other values (41) 2067
45.2%
Han
ValueCountFrequency (%)
11
 
11.5%
8
 
8.3%
6
 
6.2%
6
 
6.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.2%
3
 
3.1%
3
 
3.1%
Other values (33) 39
40.6%
Common
ValueCountFrequency (%)
( 1154
31.7%
) 1154
31.7%
611
16.8%
[ 197
 
5.4%
] 197
 
5.4%
2 78
 
2.1%
1 67
 
1.8%
& 41
 
1.1%
· 23
 
0.6%
0 19
 
0.5%
Other values (15) 95
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43025
83.8%
ASCII 8186
 
15.9%
CJK 95
 
0.2%
None 23
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1677
 
3.9%
1549
 
3.6%
1279
 
3.0%
1242
 
2.9%
983
 
2.3%
946
 
2.2%
807
 
1.9%
690
 
1.6%
670
 
1.6%
668
 
1.6%
Other values (644) 32514
75.6%
ASCII
ValueCountFrequency (%)
( 1154
 
14.1%
) 1154
 
14.1%
611
 
7.5%
o 500
 
6.1%
s 345
 
4.2%
e 261
 
3.2%
i 259
 
3.2%
r 230
 
2.8%
B 205
 
2.5%
[ 197
 
2.4%
Other values (65) 3270
39.9%
None
ValueCountFrequency (%)
· 23
100.0%
CJK
ValueCountFrequency (%)
11
 
11.6%
8
 
8.4%
6
 
6.3%
6
 
6.3%
6
 
6.3%
5
 
5.3%
5
 
5.3%
4
 
4.2%
3
 
3.2%
3
 
3.2%
Other values (32) 38
40.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행년
Real number (ℝ)

Distinct32
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.978
Minimum1927
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T19:50:48.848629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1927
5-th percentile2010
Q12013
median2014
Q32015
95-th percentile2017
Maximum2023
Range96
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.5569454
Coefficient of variation (CV)0.0012695995
Kurtosis143.18819
Mean2013.978
Median Absolute Deviation (MAD)1
Skewness-4.9685165
Sum20139780
Variance6.5379698
MonotonicityNot monotonic
2023-12-12T19:50:49.096947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
2014 2871
28.7%
2015 2321
23.2%
2013 1988
19.9%
2012 607
 
6.1%
2016 525
 
5.2%
2017 427
 
4.3%
2011 250
 
2.5%
2010 226
 
2.3%
2018 127
 
1.3%
2009 114
 
1.1%
Other values (22) 544
 
5.4%
ValueCountFrequency (%)
1927 1
 
< 0.1%
1981 1
 
< 0.1%
1990 1
 
< 0.1%
1992 2
 
< 0.1%
1994 1
 
< 0.1%
1996 2
 
< 0.1%
1997 3
 
< 0.1%
1998 1
 
< 0.1%
1999 3
 
< 0.1%
2000 13
0.1%
ValueCountFrequency (%)
2023 21
 
0.2%
2022 66
 
0.7%
2021 71
 
0.7%
2020 65
 
0.7%
2019 97
 
1.0%
2018 127
 
1.3%
2017 427
 
4.3%
2016 525
 
5.2%
2015 2321
23.2%
2014 2871
28.7%
Distinct9991
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:50:49.662038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length11.052
Min length7

Characters and Unicode

Total characters110520
Distinct characters592
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9983 ?
Unique (%)99.8%

Sample

1st row843.6-해298아
2nd row342.1-카294ㅇ
3rd rowC991.108-한16ㅎ-2
4th row365-송65ㄷ
5th row981.102-신68ㄷ
ValueCountFrequency (%)
c909-박64ㅂ 3
 
< 0.1%
813.7-박65ㅂ3-v.4 2
 
< 0.1%
c843-그298ㅅ 2
 
< 0.1%
814.7-최68ㄴ 2
 
< 0.1%
c980.24-송25ㅋ 2
 
< 0.1%
813.7-유15ㅇ-v.2 2
 
< 0.1%
811.7-김66ㅇ 2
 
< 0.1%
813.7-정54ㅅ 2
 
< 0.1%
340.265-김14ㄱ 1
 
< 0.1%
392-서64ㅈ 1
 
< 0.1%
Other values (9981) 9981
99.8%
2023-12-12T19:50:51.220883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 12078
 
10.9%
8 10724
 
9.7%
1 9145
 
8.3%
. 8624
 
7.8%
3 7449
 
6.7%
2 7340
 
6.6%
4 6683
 
6.0%
5 6117
 
5.5%
9 5810
 
5.3%
7 5780
 
5.2%
Other values (582) 30770
27.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 67594
61.2%
Other Letter 19837
 
17.9%
Dash Punctuation 12078
 
10.9%
Other Punctuation 8624
 
7.8%
Uppercase Letter 1185
 
1.1%
Lowercase Letter 1136
 
1.0%
Math Symbol 66
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1791
 
9.0%
1168
 
5.9%
1038
 
5.2%
1035
 
5.2%
927
 
4.7%
752
 
3.8%
725
 
3.7%
710
 
3.6%
638
 
3.2%
618
 
3.1%
Other values (532) 10435
52.6%
Lowercase Letter
ValueCountFrequency (%)
v 1069
94.1%
o 25
 
2.2%
s 6
 
0.5%
r 5
 
0.4%
n 5
 
0.4%
p 5
 
0.4%
a 3
 
0.3%
t 3
 
0.3%
i 2
 
0.2%
l 2
 
0.2%
Other values (9) 11
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
C 1119
94.4%
O 23
 
1.9%
R 7
 
0.6%
A 6
 
0.5%
S 5
 
0.4%
M 4
 
0.3%
B 3
 
0.3%
D 3
 
0.3%
W 2
 
0.2%
Y 2
 
0.2%
Other values (8) 11
 
0.9%
Decimal Number
ValueCountFrequency (%)
8 10724
15.9%
1 9145
13.5%
3 7449
11.0%
2 7340
10.9%
4 6683
9.9%
5 6117
9.0%
9 5810
8.6%
7 5780
8.6%
6 5485
8.1%
0 3061
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 12078
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8624
100.0%
Math Symbol
ValueCountFrequency (%)
= 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 88362
80.0%
Hangul 19836
 
17.9%
Latin 2321
 
2.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1791
 
9.0%
1168
 
5.9%
1038
 
5.2%
1035
 
5.2%
927
 
4.7%
752
 
3.8%
725
 
3.7%
710
 
3.6%
638
 
3.2%
618
 
3.1%
Other values (531) 10434
52.6%
Latin
ValueCountFrequency (%)
C 1119
48.2%
v 1069
46.1%
o 25
 
1.1%
O 23
 
1.0%
R 7
 
0.3%
A 6
 
0.3%
s 6
 
0.3%
r 5
 
0.2%
n 5
 
0.2%
S 5
 
0.2%
Other values (27) 51
 
2.2%
Common
ValueCountFrequency (%)
- 12078
13.7%
8 10724
12.1%
1 9145
10.3%
. 8624
9.8%
3 7449
8.4%
2 7340
8.3%
4 6683
7.6%
5 6117
6.9%
9 5810
6.6%
7 5780
6.5%
Other values (3) 8612
9.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90683
82.1%
Hangul 10549
 
9.5%
Compat Jamo 9287
 
8.4%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12078
13.3%
8 10724
11.8%
1 9145
10.1%
. 8624
9.5%
3 7449
8.2%
2 7340
8.1%
4 6683
7.4%
5 6117
6.7%
9 5810
6.4%
7 5780
6.4%
Other values (40) 10933
12.1%
Compat Jamo
ValueCountFrequency (%)
1791
19.3%
1168
12.6%
1035
11.1%
752
8.1%
725
7.8%
710
 
7.6%
638
 
6.9%
618
 
6.7%
511
 
5.5%
351
 
3.8%
Other values (9) 988
10.6%
Hangul
ValueCountFrequency (%)
1038
 
9.8%
927
 
8.8%
388
 
3.7%
267
 
2.5%
260
 
2.5%
258
 
2.4%
217
 
2.1%
194
 
1.8%
182
 
1.7%
174
 
1.6%
Other values (512) 6644
63.0%
CJK
ValueCountFrequency (%)
1
100.0%

비고
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
9672 
DVD
 
299
CD
 
29

Length

Max length3
Median length1
Mean length1.0627
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd rowDVD
4th row
5th row

Common Values

ValueCountFrequency (%)
9672
96.7%
DVD 299
 
3.0%
CD 29
 
0.3%

Length

2023-12-12T19:50:51.572774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:50:51.833142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9672
96.7%
dvd 299
 
3.0%
cd 29
 
0.3%

Interactions

2023-12-12T19:50:40.503787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:50:40.076542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:50:40.748636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:50:40.284255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:50:51.973911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년비고
번호1.0000.1730.541
발행년0.1731.0000.106
비고0.5410.1061.000
2023-12-12T19:50:52.198927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년비고
번호1.0000.2310.385
발행년0.2311.0000.089
비고0.3850.0891.000

Missing values

2023-12-12T19:50:41.071415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:50:41.402488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호등록번호서명저자출판사발행년청구기호비고
1632016321KM0000015376에니그마 : 로버트 해리스 장편소설로버트 해리스 지음 ; 조영학 옮김RHK(알에이치코리아)2014843.6-해298아
49294930KM0000003983인권의 정치학 : 국가권력과 인권Sabine C. Carey ; Mark Gibney ; Steven C. Poe [공]지음 ; 임상순 옮김북스힐2013342.1-카294ㅇ
261262DV0000000195(EBS 영상위인전)한국인물사. 2EBS 기획EBS 미디어센터 [공급]2007C991.108-한16ㅎ-2DVD
50925093KM0000004146단체법 = Organization : 민법상 사단을 중심으로宋五植 著전남대학교출판부2015365-송65ㄷ
1895818959KM0000018014(생각 없이 준비 없이 떠나는 초간편)당일치기 총알여행 : '3분카레'처럼 간편한 초간단 여행 레시피신익수 지음생각정거장2015981.102-신68ㄷ
57255726KM0000004779방사화학 = Radiochemistry지은이: 강보선 ; 노경석 ; 박희곤 ; 송재흥 ; 이삼열 ; 이상훈 ; 이행기 ; 임채평 ; 정봉재 ; 주광태청구문화사2010431.48-강45ㅂ
93749375KM0000008428우리 시대의 우화 : 김명희 시집김명희 지음푸름사2014811.7-김34ㅇ
76397640KM0000006693사랑의 그림 : 명화 속 눈먼 욕망과 연애 유희최정은 지음세미콜론2013609.2-최74ㅅ
1001510016KM0000009069늑대는 눈알부터 자란다 : 김경주 희곡 = オオカミは目玉から育つ김경주 지음 ; 한성례 옮김난다(문학동네)2014812.6-김14ㄴ
113114DV0000000047어바웃 타임리차드 커티스 감독유니버셜2014688.2-커888ㅇDVD
번호등록번호서명저자출판사발행년청구기호비고
53035304KM0000004357조선의 습속조선총독부 편 ; 장두식 ; 김영순 옮김민속원2014380.911-조54자
2177121772KM0000020828보이지 않는 도시들이탈로 칼비노 지음 ; 이현경 옮김민음사2010808-세14마-138
1102911030KM0000010083은어낚시통신 : 윤대녕 소설집윤대녕 지음문학동네2010813.7-윤23ㅇ
2493624937KM0000024003(만화로 쉽게 배우는)전자회로 = Electronic circuit田中賢一 저 ; 高山ヤマ 그림 ; 이도희 역BM성안당2018569.3-다192ㅈ
58515852KM0000004905다윈 이후 : 다윈주의에 대한 오해와 이해를 말하다스티븐 제이 굴드 지음 ; 홍욱희 ; 홍동선 [공]옮김사이언스북스2011476.0162-굴27ㄷ
2508625087KM0000024153내 어머니 이야기 : 김은성 만화. 4김은성 지음애니북스2019818-김67ㄴ-4
76757676KM0000006729레드 : 처음 만나는 프렌치스티치와 크로스스티치 250아녜스 드라주-칼베 ; 안느 소이에-푸르넬 [공]작품 ; 프레데릭 뤼카노 사진 ; 김희정 옮김이끼북스2015636.5-드292ㄹ
1399213993KM0000013047나누고 싶은 이야기. 2 : 팜스코 가족들의 어제와 오늘, 그리고 내일팜스코 사람들 지음프린피아2014818-팜57ㄴ-v.2
2473724738KM0000023804위대한 발명의 실수투성이 역사샬럿 폴츠 존스 지음 ; 원지인 옮김보물창고(푸른책들)2018C507.5-존57ㅇ
2266722668KM0000021725좋은 아빠의 자격 : 아마추어 아빠에서 프로 아빠가 되는 길잡이서진석 지음북라이프2013598.1-서78ㅈ