Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 147
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
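
The derived figures in this block can be checked with a few ratios. The sketch below is only a sanity check, assuming the per-column missing counts reported in the variable sections of this report; the variable names are illustrative and not part of the profiler's output.

```python
# Figures copied from this report; names are illustrative only.
n_variables = 6
n_observations = 10_000
missing_per_column = {
    "번호": 0, "청구기호": 0, "서명": 1,
    "저작자": 134, "발행자": 7, "발행년": 5,
}

# Missing cells are counted over every cell of the table (rows x columns).
missing_cells = sum(missing_per_column.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size is total memory divided by the number of rows.
total_size_bytes = 556.6 * 1024  # 556.6 KiB
avg_record_bytes = total_size_bytes / n_observations

print(missing_cells, round(missing_pct, 1), round(avg_record_bytes, 1))
```

The missing-cell percentage is taken over all 60,000 cells, which is why 147 missing cells amount to only about 0.2%.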

Variable types

Numeric: 1
Text: 5

Dataset

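Description: Book catalog data for the reference library of the Incheon Metropolitan City Human Resources Development Institute (인천광역시 인재개발원 자료실). It provides fields such as call number (청구기호), title (서명), author (저작자), publisher (발행자), and publication year (발행년).
Author: Incheon Metropolitan City (인천광역시)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15052866&srcSe=7661IVAWM27C61E190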

Alerts

저작자 (author) has 134 (1.3%) missing values [Missing]
번호 (number) has unique values [Unique]

Reproduction

Analysis started: 2024-01-28 16:46:07.367073
Analysis finished: 2024-01-28 16:46:10.759141
Duration: 3.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
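
The reported duration is the difference between the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of this report.
started = datetime.fromisoformat("2024-01-28 16:46:07.367073")
finished = datetime.fromisoformat("2024-01-28 16:46:10.759141")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```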

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8081.7713
Minimum: 1
Maximum: 16188
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 820.95
Q1: 4024.75
Median: 8060.5
Q3: 12093.25
95-th percentile: 15357.05
Maximum: 16188
Range: 16187
Interquartile range (IQR): 8068.5

Descriptive statistics

Standard deviation: 4656.9642
Coefficient of variation (CV): 0.57623064
Kurtosis: -1.1904075
Mean: 8081.7713
Median Absolute Deviation (MAD): 4034.5
Skewness: 0.0017899507
Sum: 80817713
Variance: 21687316
Monotonicity: Not monotonic
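
Several of these statistics are derived from others in the same table. The following sketch recomputes them from the reported values as a consistency check; it assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared) and does not reproduce the profiler's own code.

```python
# Values copied from the quantile and descriptive statistics of 번호.
minimum, maximum = 1, 16188
q1, q3 = 4024.75, 12093.25
mean, std = 8081.7713, 4656.9642
n = 10_000

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of values
cv = std / mean                   # dispersion relative to the mean
variance = std ** 2               # squared standard deviation
total = mean * n                  # sum recovered from the mean

print(value_range, iqr, round(cv, 4), round(variance), round(total))
```

Small differences in the last digits are expected because the report shows rounded inputs.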
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
467 | 1 | < 0.1%
14343 | 1 | < 0.1%
10316 | 1 | < 0.1%
14428 | 1 | < 0.1%
4092 | 1 | < 0.1%
7325 | 1 | < 0.1%
4391 | 1 | < 0.1%
4638 | 1 | < 0.1%
8243 | 1 | < 0.1%
309 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
4 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
11 | 1 | < 0.1%
12 | 1 | < 0.1%
13 | 1 | < 0.1%
14 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
16188 | 1 | < 0.1%
16187 | 1 | < 0.1%
16186 | 1 | < 0.1%
16185 | 1 | < 0.1%
16184 | 1 | < 0.1%
16182 | 1 | < 0.1%
16180 | 1 | < 0.1%
16178 | 1 | < 0.1%
16177 | 1 | < 0.1%
16176 | 1 | < 0.1%

청구기호
Text

Distinct: 9534
Distinct (%): 95.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 23
Median length: 22
Mean length: 9.5577
Min length: 1

Characters and Unicode

Total characters: 95577
Distinct characters: 531
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
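
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories for the five sample values of this column using Python's standard `unicodedata` module. It is a simplified stand-in, not the profiler's implementation; note that `unicodedata` exposes the general category but not the script or block of a character.

```python
import unicodedata
from collections import Counter

# The five sample call numbers shown for this variable.
samples = ["982-웨68ㅁ", "408-한16ㅁ", "911-이14ㄴ", "223.59-천56ㄱ", "811-정34ㄷ"]

# Two-letter general category codes: Nd = Decimal Number, Lo = Other Letter,
# Pd = Dash Punctuation, Po = Other Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for code, count in category_counts.most_common():
    print(code, count)
```

Even on this tiny sample, digits dominate, followed by Hangul letters and the dash separator, which matches the ordering of the category table for the full column.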

Unique

Unique: 9219
Unique (%): 92.2%

Sample

1st row: 982-웨68ㅁ
2nd row: 408-한16ㅁ
3rd row: 911-이14ㄴ
4th row: 223.59-천56ㄱ
5th row: 811-정34ㄷ

Value | Count | Frequency (%)
408 | 33 | 0.3%
아동 | 25 | 0.2%
(unrecovered) | 12 | 0.1%
811.32-신14ㅇ | 11 | 0.1%
470.8 | 11 | 0.1%
811.33-김79ㅊ | 10 | 0.1%
608-편78ㅇ | 9 | 0.1%
909-정52ㅌ-1 | 8 | 0.1%
810.9-kㅇ | 7 | 0.1%
912-나15ㅁ | 7 | 0.1%
Other values (9536) | 9925 | 98.7%

Most occurring characters

ValueCountFrequency (%)
- 12111
12.7%
1 8800
 
9.2%
8 8255
 
8.6%
3 7746
 
8.1%
6 5675
 
5.9%
4 5663
 
5.9%
2 5649
 
5.9%
5 5522
 
5.8%
9 4620
 
4.8%
. 4096
 
4.3%
Other values (521) 27440
28.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 59221 | 62.0%
Other Letter | 19464 | 20.4%
Dash Punctuation | 12111 | 12.7%
Other Punctuation | 4115 | 4.3%
Math Symbol | 290 | 0.3%
Uppercase Letter | 285 | 0.3%
Space Separator | 58 | 0.1%
Lowercase Letter | 26 | < 0.1%
Letter Number | 3 | < 0.1%
Open Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1768
 
9.1%
1381
 
7.1%
1075
 
5.5%
1047
 
5.4%
924
 
4.7%
899
 
4.6%
880
 
4.5%
635
 
3.3%
611
 
3.1%
570
 
2.9%
Other values (462) 9674
49.7%
Uppercase Letter
ValueCountFrequency (%)
W 68
23.9%
K 45
15.8%
B 25
 
8.8%
A 19
 
6.7%
E 19
 
6.7%
S 13
 
4.6%
C 13
 
4.6%
H 12
 
4.2%
M 11
 
3.9%
T 9
 
3.2%
Other values (11) 51
17.9%
Lowercase Letter
ValueCountFrequency (%)
v 10
38.5%
c 4
 
15.4%
w 2
 
7.7%
g 2
 
7.7%
d 2
 
7.7%
o 1
 
3.8%
s 1
 
3.8%
n 1
 
3.8%
p 1
 
3.8%
e 1
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 8800
14.9%
8 8255
13.9%
3 7746
13.1%
6 5675
9.6%
4 5663
9.6%
2 5649
9.5%
5 5522
9.3%
9 4620
7.8%
7 3838
6.5%
0 3453
 
5.8%
Other Punctuation
ValueCountFrequency (%)
. 4096
99.5%
: 14
 
0.3%
, 3
 
0.1%
? 1
 
< 0.1%
# 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 288
99.3%
~ 1
 
0.3%
1
 
0.3%
Letter Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 1
50.0%
[ 1
50.0%
Close Punctuation
ValueCountFrequency (%)
] 1
50.0%
) 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 12111
100.0%
Space Separator
ValueCountFrequency (%)
58
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 75799 | 79.3%
Hangul | 19460 | 20.4%
Latin | 314 | 0.3%
Han | 4 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1768
 
9.1%
1381
 
7.1%
1075
 
5.5%
1047
 
5.4%
924
 
4.7%
899
 
4.6%
880
 
4.5%
635
 
3.3%
611
 
3.1%
570
 
2.9%
Other values (459) 9670
49.7%
Latin
ValueCountFrequency (%)
W 68
21.7%
K 45
14.3%
B 25
 
8.0%
A 19
 
6.1%
E 19
 
6.1%
S 13
 
4.1%
C 13
 
4.1%
H 12
 
3.8%
M 11
 
3.5%
v 10
 
3.2%
Other values (25) 79
25.2%
Common
ValueCountFrequency (%)
- 12111
16.0%
1 8800
11.6%
8 8255
10.9%
3 7746
10.2%
6 5675
7.5%
4 5663
7.5%
2 5649
7.5%
5 5522
7.3%
9 4620
 
6.1%
. 4096
 
5.4%
Other values (14) 7662
10.1%
Han
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 76109 | 79.6%
Hangul | 9927 | 10.4%
Compat Jamo | 9533 | 10.0%
CJK | 4 | < 0.1%
Number Forms | 3 | < 0.1%
Math Operators | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12111
15.9%
1 8800
11.6%
8 8255
10.8%
3 7746
10.2%
6 5675
7.5%
4 5663
7.4%
2 5649
7.4%
5 5522
7.3%
9 4620
 
6.1%
. 4096
 
5.4%
Other values (45) 7972
10.5%
Compat Jamo
ValueCountFrequency (%)
1768
18.5%
1381
14.5%
1075
11.3%
899
9.4%
880
9.2%
635
 
6.7%
611
 
6.4%
570
 
6.0%
547
 
5.7%
368
 
3.9%
Other values (9) 799
8.4%
Hangul
ValueCountFrequency (%)
1047
 
10.5%
924
 
9.3%
406
 
4.1%
353
 
3.6%
279
 
2.8%
269
 
2.7%
236
 
2.4%
142
 
1.4%
140
 
1.4%
131
 
1.3%
Other values (440) 6000
60.4%
CJK
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Number Forms
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%

서명
Text

Distinct: 9403
Distinct (%): 94.0%
Missing: 1
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 155
Median length: 105
Mean length: 13.564556
Min length: 1

Characters and Unicode

Total characters: 135632
Distinct characters: 1634
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9133
Unique (%): 91.3%

Sample

1st row: 마크트웨인여행기 / . 상
2nd row: 목숨을 건 도전
3rd row: 노비의딸 조선왕을 낳다
4th row: 거꾸로 읽는부처님 말씀,이 뭐꼬 천수경
5th row: 동지여가슴맞대고 /

Value | Count | Frequency (%)
(unrecovered) | 5925 | 18.0%
1 | 561 | 1.7%
2 | 378 | 1.1%
3 | 156 | 0.5%
4 | 102 | 0.3%
장편소설 | 90 | 0.3%
위한 | 81 | 0.2%
이야기 | 79 | 0.2%
5 | 70 | 0.2%
6 | 63 | 0.2%
Other values (15464) | 25367 | 77.2%

Most occurring characters

ValueCountFrequency (%)
23556
 
17.4%
/ 3828
 
2.8%
2960
 
2.2%
. 2391
 
1.8%
2340
 
1.7%
1722
 
1.3%
1691
 
1.2%
1 1646
 
1.2%
1420
 
1.0%
1319
 
1.0%
Other values (1624) 92759
68.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 91799 | 67.7%
Space Separator | 23556 | 17.4%
Other Punctuation | 8537 | 6.3%
Decimal Number | 5294 | 3.9%
Lowercase Letter | 3601 | 2.7%
Uppercase Letter | 1002 | 0.7%
Open Punctuation | 686 | 0.5%
Close Punctuation | 684 | 0.5%
Dash Punctuation | 310 | 0.2%
Math Symbol | 159 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2960
 
3.2%
2340
 
2.5%
1722
 
1.9%
1691
 
1.8%
1420
 
1.5%
1319
 
1.4%
1309
 
1.4%
1307
 
1.4%
1211
 
1.3%
1163
 
1.3%
Other values (1528) 75357
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 392
10.9%
o 357
 
9.9%
n 311
 
8.6%
i 296
 
8.2%
a 262
 
7.3%
t 257
 
7.1%
s 225
 
6.2%
r 224
 
6.2%
h 161
 
4.5%
l 155
 
4.3%
Other values (16) 961
26.7%
Uppercase Letter
ValueCountFrequency (%)
E 76
 
7.6%
O 72
 
7.2%
S 71
 
7.1%
W 69
 
6.9%
A 67
 
6.7%
C 65
 
6.5%
I 61
 
6.1%
T 54
 
5.4%
H 52
 
5.2%
D 45
 
4.5%
Other values (16) 370
36.9%
Other Punctuation
ValueCountFrequency (%)
/ 3828
44.8%
. 2391
28.0%
: 1165
 
13.6%
, 671
 
7.9%
? 185
 
2.2%
! 128
 
1.5%
· 90
 
1.1%
' 35
 
0.4%
& 19
 
0.2%
% 14
 
0.2%
Other values (6) 11
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 1646
31.1%
2 1044
19.7%
0 744
14.1%
3 440
 
8.3%
4 342
 
6.5%
5 307
 
5.8%
9 268
 
5.1%
6 185
 
3.5%
7 175
 
3.3%
8 143
 
2.7%
Math Symbol
ValueCountFrequency (%)
= 129
81.1%
~ 14
 
8.8%
9
 
5.7%
+ 5
 
3.1%
< 1
 
0.6%
> 1
 
0.6%
Letter Number
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 677
98.7%
[ 7
 
1.0%
2
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 675
98.7%
] 7
 
1.0%
2
 
0.3%
Space Separator
ValueCountFrequency (%)
23556
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 310
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 90313 | 66.6%
Common | 39226 | 28.9%
Latin | 4607 | 3.4%
Han | 1486 | 1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2960
 
3.3%
2340
 
2.6%
1722
 
1.9%
1691
 
1.9%
1420
 
1.6%
1319
 
1.5%
1309
 
1.4%
1307
 
1.4%
1211
 
1.3%
1163
 
1.3%
Other values (1170) 73871
81.8%
Han
ValueCountFrequency (%)
35
 
2.4%
35
 
2.4%
34
 
2.3%
31
 
2.1%
28
 
1.9%
27
 
1.8%
27
 
1.8%
27
 
1.8%
26
 
1.7%
25
 
1.7%
Other values (348) 1191
80.1%
Latin
ValueCountFrequency (%)
e 392
 
8.5%
o 357
 
7.7%
n 311
 
6.8%
i 296
 
6.4%
a 262
 
5.7%
t 257
 
5.6%
s 225
 
4.9%
r 224
 
4.9%
h 161
 
3.5%
l 155
 
3.4%
Other values (46) 1967
42.7%
Common
ValueCountFrequency (%)
23556
60.1%
/ 3828
 
9.8%
. 2391
 
6.1%
1 1646
 
4.2%
: 1165
 
3.0%
2 1044
 
2.7%
0 744
 
1.9%
( 677
 
1.7%
) 675
 
1.7%
, 671
 
1.7%
Other values (30) 2829
 
7.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 90301 | 66.6%
ASCII | 43721 | 32.2%
CJK | 1457 | 1.1%
None | 97 | 0.1%
CJK Compat Ideographs | 29 | < 0.1%
Compat Jamo | 12 | < 0.1%
Math Operators | 9 | < 0.1%
Number Forms | 4 | < 0.1%
Punctuation | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23556
53.9%
/ 3828
 
8.8%
. 2391
 
5.5%
1 1646
 
3.8%
: 1165
 
2.7%
2 1044
 
2.4%
0 744
 
1.7%
( 677
 
1.5%
) 675
 
1.5%
, 671
 
1.5%
Other values (75) 7324
 
16.8%
Hangul
ValueCountFrequency (%)
2960
 
3.3%
2340
 
2.6%
1722
 
1.9%
1691
 
1.9%
1420
 
1.6%
1319
 
1.5%
1309
 
1.4%
1307
 
1.4%
1211
 
1.3%
1163
 
1.3%
Other values (1162) 73859
81.8%
None
ValueCountFrequency (%)
· 90
92.8%
2
 
2.1%
2
 
2.1%
2
 
2.1%
1
 
1.0%
CJK
ValueCountFrequency (%)
35
 
2.4%
35
 
2.4%
34
 
2.3%
31
 
2.1%
28
 
1.9%
27
 
1.9%
27
 
1.9%
27
 
1.9%
26
 
1.8%
25
 
1.7%
Other values (334) 1162
79.8%
Math Operators
ValueCountFrequency (%)
9
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
7
24.1%
4
13.8%
4
13.8%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (4) 4
13.8%
Compat Jamo
ValueCountFrequency (%)
4
33.3%
2
16.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Punctuation
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

저작자
Text

MISSING 

Distinct: 7425
Distinct (%): 75.3%
Missing: 134
Missing (%): 1.3%
Memory size: 156.2 KiB

Length

Max length: 106
Median length: 79
Mean length: 7.5633489
Min length: 1

Characters and Unicode

Total characters: 74620
Distinct characters: 1226
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6423
Unique (%): 65.1%
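
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts values that occur exactly once. For this column the percentages are consistent with being taken over the non-missing rows (10000 − 134 = 9866) rather than over all rows. The sketch below illustrates both points; the helper `distinct_and_unique` and the toy list are hypothetical, and this is not the profiler's own code.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (n_distinct, n_unique, distinct_pct, unique_pct) over non-missing values."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_distinct = len(counts)                                # different values
    n_unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
    n = len(present)
    if n == 0:
        return 0, 0, 0.0, 0.0
    return n_distinct, n_unique, 100 * n_distinct / n, 100 * n_unique / n

# Toy example (hypothetical data): one repeated author and one missing entry.
toy = ["이경민", "천수경", "이경민", None, "정명자"]
print(distinct_and_unique(toy))

# Check against the figures reported for 저작자.
n_present = 10_000 - 134
print(round(100 * 7425 / n_present, 1), round(100 * 6423 / n_present, 1))
```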

Sample

1st row: 마크트 웨인 [지은이].
2nd row: [한국데카르트 편집부 지음]
3rd row: 이경민
4th row: 천수경
5th row: 정명자
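
Value | Count | Frequency (%)
(unrecovered) | 1845 | 9.1%
지음 | 1347 | 6.7%
옮김 | 526 | 2.6%
그림 | 386 | 1.9%
(unrecovered) | 325 | 1.6%
(unrecovered) | 300 | 1.5%
지은이 | 290 | 1.4%
편집부 | 163 | 0.8%
(unrecovered) | 94 | 0.5%
엮음 | 87 | 0.4%
Other values (9022) | 14823 | 73.4%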

Most occurring characters

ValueCountFrequency (%)
10878
 
14.6%
2287
 
3.1%
2116
 
2.8%
1946
 
2.6%
; 1845
 
2.5%
1514
 
2.0%
. 1219
 
1.6%
984
 
1.3%
928
 
1.2%
819
 
1.1%
Other values (1216) 50084
67.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 56869 | 76.2%
Space Separator | 10878 | 14.6%
Other Punctuation | 4243 | 5.7%
Uppercase Letter | 686 | 0.9%
Lowercase Letter | 624 | 0.8%
Open Punctuation | 621 | 0.8%
Close Punctuation | 621 | 0.8%
Decimal Number | 57 | 0.1%
Math Symbol | 12 | < 0.1%
Dash Punctuation | 9 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2287
 
4.0%
2116
 
3.7%
1946
 
3.4%
1514
 
2.7%
984
 
1.7%
928
 
1.6%
819
 
1.4%
788
 
1.4%
658
 
1.2%
592
 
1.0%
Other values (1138) 44237
77.8%
Lowercase Letter
ValueCountFrequency (%)
a 73
11.7%
i 61
 
9.8%
o 56
 
9.0%
e 56
 
9.0%
n 42
 
6.7%
r 40
 
6.4%
s 39
 
6.2%
l 32
 
5.1%
u 30
 
4.8%
t 28
 
4.5%
Other values (14) 167
26.8%
Uppercase Letter
ValueCountFrequency (%)
K 101
14.7%
B 73
10.6%
S 61
 
8.9%
C 59
 
8.6%
J 50
 
7.3%
R 44
 
6.4%
A 43
 
6.3%
E 36
 
5.2%
M 27
 
3.9%
H 24
 
3.5%
Other values (13) 168
24.5%
Other Punctuation
ValueCountFrequency (%)
; 1845
43.5%
. 1219
28.7%
, 788
18.6%
: 263
 
6.2%
· 63
 
1.5%
22
 
0.5%
& 22
 
0.5%
/ 18
 
0.4%
' 2
 
< 0.1%
! 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 13
22.8%
2 12
21.1%
3 6
10.5%
0 5
 
8.8%
5 5
 
8.8%
7 5
 
8.8%
4 4
 
7.0%
8 3
 
5.3%
9 3
 
5.3%
6 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
] 610
98.2%
) 6
 
1.0%
4
 
0.6%
} 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 611
98.4%
( 6
 
1.0%
4
 
0.6%
Math Symbol
ValueCountFrequency (%)
> 6
50.0%
< 6
50.0%
Space Separator
ValueCountFrequency (%)
10878
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 55738 | 74.7%
Common | 16441 | 22.0%
Latin | 1310 | 1.8%
Han | 1131 | 1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2287
 
4.1%
2116
 
3.8%
1946
 
3.5%
1514
 
2.7%
984
 
1.8%
928
 
1.7%
819
 
1.5%
788
 
1.4%
658
 
1.2%
592
 
1.1%
Other values (819) 43106
77.3%
Han
ValueCountFrequency (%)
105
 
9.3%
96
 
8.5%
26
 
2.3%
23
 
2.0%
23
 
2.0%
23
 
2.0%
22
 
1.9%
21
 
1.9%
20
 
1.8%
19
 
1.7%
Other values (309) 753
66.6%
Latin
ValueCountFrequency (%)
K 101
 
7.7%
a 73
 
5.6%
B 73
 
5.6%
i 61
 
4.7%
S 61
 
4.7%
C 59
 
4.5%
o 56
 
4.3%
e 56
 
4.3%
J 50
 
3.8%
R 44
 
3.4%
Other values (37) 676
51.6%
Common
ValueCountFrequency (%)
10878
66.2%
; 1845
 
11.2%
. 1219
 
7.4%
, 788
 
4.8%
[ 611
 
3.7%
] 610
 
3.7%
: 263
 
1.6%
· 63
 
0.4%
22
 
0.1%
& 22
 
0.1%
Other values (21) 120
 
0.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 55736 | 74.7%
ASCII | 17658 | 23.7%
CJK | 1094 | 1.5%
None | 93 | 0.1%
CJK Compat Ideographs | 37 | < 0.1%
Compat Jamo | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10878
61.6%
; 1845
 
10.4%
. 1219
 
6.9%
, 788
 
4.5%
[ 611
 
3.5%
] 610
 
3.5%
: 263
 
1.5%
K 101
 
0.6%
a 73
 
0.4%
B 73
 
0.4%
Other values (64) 1197
 
6.8%
Hangul
ValueCountFrequency (%)
2287
 
4.1%
2116
 
3.8%
1946
 
3.5%
1514
 
2.7%
984
 
1.8%
928
 
1.7%
819
 
1.5%
788
 
1.4%
658
 
1.2%
592
 
1.1%
Other values (818) 43104
77.3%
CJK
ValueCountFrequency (%)
105
 
9.6%
96
 
8.8%
23
 
2.1%
23
 
2.1%
23
 
2.1%
22
 
2.0%
21
 
1.9%
20
 
1.8%
19
 
1.7%
17
 
1.6%
Other values (298) 725
66.3%
None
ValueCountFrequency (%)
· 63
67.7%
22
 
23.7%
4
 
4.3%
4
 
4.3%
CJK Compat Ideographs
ValueCountFrequency (%)
26
70.3%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

발행자
Text

Distinct: 3252
Distinct (%): 32.5%
Missing: 7
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 26
Median length: 22
Mean length: 4.794256
Min length: 1

Characters and Unicode

Total characters: 47909
Distinct characters: 871
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1948
Unique (%): 19.5%

Sample

1st row: 범우사,
2nd row: 한국데카르트
3rd row: 예문
4th row: 현암사
5th row: 풀빛,

Value | Count | Frequency (%)
민음사 | 168 | 1.6%
문학동네 | 147 | 1.4%
김영사 | 133 | 1.3%
21세기북스 | 105 | 1.0%
박영사 | 92 | 0.9%
해냄 | 81 | 0.8%
창비 | 80 | 0.8%
위즈덤하우스 | 79 | 0.8%
웅진출판사 | 76 | 0.7%
예림당 | 73 | 0.7%
Other values (2906) | 9155 | 89.9%

Most occurring characters

ValueCountFrequency (%)
, 4047
 
8.4%
2861
 
6.0%
1267
 
2.6%
1192
 
2.5%
1188
 
2.5%
1185
 
2.5%
801
 
1.7%
699
 
1.5%
685
 
1.4%
659
 
1.4%
Other values (861) 33325
69.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 42466 | 88.6%
Other Punctuation | 4160 | 8.7%
Decimal Number | 330 | 0.7%
Lowercase Letter | 322 | 0.7%
Uppercase Letter | 305 | 0.6%
Space Separator | 197 | 0.4%
Open Punctuation | 63 | 0.1%
Close Punctuation | 62 | 0.1%
Dash Punctuation | 3 | < 0.1%
Modifier Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2861
 
6.7%
1267
 
3.0%
1192
 
2.8%
1188
 
2.8%
1185
 
2.8%
801
 
1.9%
699
 
1.6%
685
 
1.6%
659
 
1.6%
519
 
1.2%
Other values (788) 31410
74.0%
Lowercase Letter
ValueCountFrequency (%)
o 56
17.4%
i 30
9.3%
k 29
9.0%
s 28
 
8.7%
n 24
 
7.5%
e 22
 
6.8%
a 19
 
5.9%
b 19
 
5.9%
r 14
 
4.3%
t 11
 
3.4%
Other values (14) 70
21.7%
Uppercase Letter
ValueCountFrequency (%)
B 83
27.2%
M 57
18.7%
K 25
 
8.2%
S 20
 
6.6%
P 14
 
4.6%
C 13
 
4.3%
R 11
 
3.6%
H 10
 
3.3%
N 10
 
3.3%
I 10
 
3.3%
Other values (13) 52
17.0%
Decimal Number
ValueCountFrequency (%)
2 133
40.3%
1 128
38.8%
0 12
 
3.6%
3 12
 
3.6%
4 11
 
3.3%
8 11
 
3.3%
9 10
 
3.0%
6 5
 
1.5%
7 5
 
1.5%
5 3
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 4047
97.3%
: 55
 
1.3%
& 40
 
1.0%
. 7
 
0.2%
5
 
0.1%
· 2
 
< 0.1%
/ 2
 
< 0.1%
; 1
 
< 0.1%
@ 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 62
98.4%
[ 1
 
1.6%
Close Punctuation
ValueCountFrequency (%)
) 60
96.8%
] 2
 
3.2%
Space Separator
ValueCountFrequency (%)
197
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 41581 | 86.8%
Common | 4816 | 10.1%
Han | 885 | 1.8%
Latin | 627 | 1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2861
 
6.9%
1267
 
3.0%
1192
 
2.9%
1188
 
2.9%
1185
 
2.8%
801
 
1.9%
699
 
1.7%
685
 
1.6%
659
 
1.6%
519
 
1.2%
Other values (621) 30525
73.4%
Han
ValueCountFrequency (%)
127
 
14.4%
41
 
4.6%
40
 
4.5%
30
 
3.4%
25
 
2.8%
20
 
2.3%
20
 
2.3%
19
 
2.1%
18
 
2.0%
17
 
1.9%
Other values (157) 528
59.7%
Latin
ValueCountFrequency (%)
B 83
 
13.2%
M 57
 
9.1%
o 56
 
8.9%
i 30
 
4.8%
k 29
 
4.6%
s 28
 
4.5%
K 25
 
4.0%
n 24
 
3.8%
e 22
 
3.5%
S 20
 
3.2%
Other values (37) 253
40.4%
Common
ValueCountFrequency (%)
, 4047
84.0%
197
 
4.1%
2 133
 
2.8%
1 128
 
2.7%
( 62
 
1.3%
) 60
 
1.2%
: 55
 
1.1%
& 40
 
0.8%
0 12
 
0.2%
3 12
 
0.2%
Other values (16) 70
 
1.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 41579 | 86.8%
ASCII | 5436 | 11.3%
CJK | 882 | 1.8%
None | 7 | < 0.1%
CJK Compat Ideographs | 3 | < 0.1%
Compat Jamo | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 4047
74.4%
197
 
3.6%
2 133
 
2.4%
1 128
 
2.4%
B 83
 
1.5%
( 62
 
1.1%
) 60
 
1.1%
M 57
 
1.0%
o 56
 
1.0%
: 55
 
1.0%
Other values (61) 558
 
10.3%
Hangul
ValueCountFrequency (%)
2861
 
6.9%
1267
 
3.0%
1192
 
2.9%
1188
 
2.9%
1185
 
2.8%
801
 
1.9%
699
 
1.7%
685
 
1.6%
659
 
1.6%
519
 
1.2%
Other values (619) 30523
73.4%
CJK
ValueCountFrequency (%)
127
 
14.4%
41
 
4.6%
40
 
4.5%
30
 
3.4%
25
 
2.8%
20
 
2.3%
20
 
2.3%
19
 
2.2%
18
 
2.0%
17
 
1.9%
Other values (154) 525
59.5%
None
ValueCountFrequency (%)
5
71.4%
· 2
 
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

발행년
Text

Distinct: 88
Distinct (%): 0.9%
Missing: 5
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 4
Mean length: 4.0132066
Min length: 3

Characters and Unicode

Total characters: 40112
Distinct characters: 26
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 0.3%

Sample

1st row: 2000
2nd row: 2003
3rd row: 2010
4th row: 2008
5th row: 1985

Value | Count | Frequency (%)
2007 | 732 | 7.3%
2008 | 696 | 7.0%
2011 | 521 | 5.2%
2012 | 501 | 5.0%
2010 | 499 | 5.0%
2001 | 453 | 4.5%
2009 | 401 | 4.0%
1997 | 398 | 4.0%
2013 | 395 | 4.0%
2000 | 368 | 3.7%
Other values (62) | 5031 | 50.3%

Most occurring characters

ValueCountFrequency (%)
0 11964
29.8%
2 8217
20.5%
1 7027
17.5%
9 6151
15.3%
8 2067
 
5.2%
7 1463
 
3.6%
3 899
 
2.2%
6 861
 
2.1%
4 721
 
1.8%
5 650
 
1.6%
Other values (16) 92
 
0.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 40020 | 99.8%
Close Punctuation | 34 | 0.1%
Open Punctuation | 33 | 0.1%
Other Letter | 15 | < 0.1%
Dash Punctuation | 10 | < 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11964
29.9%
2 8217
20.5%
1 7027
17.6%
9 6151
15.4%
8 2067
 
5.2%
7 1463
 
3.7%
3 899
 
2.2%
6 861
 
2.2%
4 721
 
1.8%
5 650
 
1.6%
Other Letter
ValueCountFrequency (%)
3
20.0%
3
20.0%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
] 28
82.4%
) 5
 
14.7%
1
 
2.9%
Open Punctuation
ValueCountFrequency (%)
[ 28
84.8%
( 5
 
15.2%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
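
The category counts above show that a small number of 발행년 values contain brackets, parentheses, dashes, or letters in addition to digits, which is presumably why the column is profiled as Text rather than as a number. A minimal cleaning sketch follows. The function `extract_year` and the example inputs are hypothetical, since the report does not show the actual non-numeric values; the rule of taking the first four-digit year is an assumption that should be reviewed against the real data.

```python
import re

# Assumption: a publication year is a four-digit number starting with 1 or 20.
YEAR_PATTERN = re.compile(r"(1[0-9]{3}|20[0-9]{2})")

def extract_year(raw):
    """Return the first four-digit year found in raw, or None if there is none."""
    if raw is None:
        return None
    match = YEAR_PATTERN.search(raw)
    return int(match.group(1)) if match else None

# Hypothetical inputs illustrating the punctuation seen in the category counts.
examples = ["2007", "[2001]", "(1998)", "1995-1997", "미상"]
print([extract_year(x) for x in examples])
```

Values that yield `None` would need manual inspection before the column is converted to a numeric type.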

Most occurring scripts

Value | Count | Frequency (%)
Common | 40097 | > 99.9%
Hangul | 13 | < 0.1%
Han | 2 | < 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11964
29.8%
2 8217
20.5%
1 7027
17.5%
9 6151
15.3%
8 2067
 
5.2%
7 1463
 
3.6%
3 899
 
2.2%
6 861
 
2.1%
4 721
 
1.8%
5 650
 
1.6%
Other values (6) 77
 
0.2%
Hangul
ValueCountFrequency (%)
3
23.1%
3
23.1%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 40096 | > 99.9%
Hangul | 13 | < 0.1%
CJK | 2 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11964
29.8%
2 8217
20.5%
1 7027
17.5%
9 6151
15.3%
8 2067
 
5.2%
7 1463
 
3.6%
3 899
 
2.2%
6 861
 
2.1%
4 721
 
1.8%
5 650
 
1.6%
Other values (5) 76
 
0.2%
Hangul
ValueCountFrequency (%)
3
23.1%
3
23.1%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
1
100.0%

Interactions


Correlations

 | 번호 | 발행년
번호 | 1.000 | 0.754
발행년 | 0.754 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

index | 번호 | 청구기호 | 서명 | 저작자 | 발행자 | 발행년
466 | 467 | 982-웨68ㅁ | 마크트웨인여행기 / . 상 | 마크트 웨인 [지은이]. | 범우사, | 2000
1483 | 1484 | 408-한16ㅁ | 목숨을 건 도전 | [한국데카르트 편집부 지음] | 한국데카르트 | 2003
12694 | 12695 | 911-이14ㄴ | 노비의딸 조선왕을 낳다 | 이경민 | 예문 | 2010
10901 | 10902 | 223.59-천56ㄱ | 거꾸로 읽는부처님 말씀,이 뭐꼬 천수경 | 천수경 | 현암사 | 2008
998 | 999 | 811-정34ㄷ | 동지여가슴맞대고 / | 정명자 | 풀빛, | 1985
6828 | 6829 | 843-고66ㅋ-1 | 카인과천사 / | 주디스고울드 | 책마당, | 1996
15354 | 15355 | 401-장92ㄱ | (장하석의)과학, 철학을 만나다 | 장하석 지음 | 지식플러스 | 2018
11677 | 11678 | 330.95-세215ㅊ | 창업국가 | 댄 세노르 | 기운센 | 2010
4325 | 4326 | 813-이78ㄱ | 기억의저편 / | 이진현 | 북박스, | 2002
11799 | 11800 | 325.3-안52ㅎ | 홍크! 위기를 기회로바꾸는 기러기리더십 | 안상헌 | 경향미디어 | 2009

index | 번호 | 청구기호 | 서명 | 저작자 | 발행자 | 발행년
2153 | 2154 | 813-김65ㅍ | 푸른연 / | 김요섭, 이태호 | 대교출판, | 2001
14459 | 14460 | 470.8-한16ㄷ | 돌아온 고추잠자리 | [한국몬테소리 편집부 편] | 한국몬테소리 | 2001
11862 | 11863 | 818-이38ㅅ | 사랑, 고마워요 고마워요 | 이미나 | 걷는나무 | 2009
4959 | 4960 | 517.3-헬57ㅇ | 우리가 미처 몰랐던 건강에 대한 진실 : 아프기 전에 실천해야 할 건강 지식 64가지 | 헬스경향 지음 | 원앤원콘텐츠그룹 | 2013
280 | 281 | 818-유53ㅇ | 오늘의 퀴즈 : 아들, 너랑 노니까 너무 좋다. 진짜! | 유세윤, 유민하 지음 | 미메시스 | 2019
13042 | 13043 | 352.34-이54ㅈ | 정책분석론 이론과 기법 | 이성우 | 조명문화사 | 2008
4928 | 4929 | 813-김73ㄱ-1 | 광야에눕다 / | 김재찬 | 버팀목, | 1994
9482 | 9483 | 234.3-최68ㅎ | 하늘에계신 우리아빠 | 최인호 | 열림원 | 2008
7180 | 7181 | 833.6-아22ㅌ | 통처 | 아다치모토이치 | 맛있는책 | 2011
14935 | 14936 | 170-롤89ㄷ-2 | 독수리의 눈으로. 2 | 롤프 하이먼 | 문학과지성사 | 2007