Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells34
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Text5
Categorical1
Numeric1

Dataset

Description울산항만공사 열린도서관에 소장하고 있는 소장자료 현황입니다. 서명, 저자, 출판사, 출판년도가 포함되어 있습니다.
Author울산항만공사
URLhttps://www.data.go.kr/data/15105664/fileData.do

Alerts

구분 is highly imbalanced (59.3%)Imbalance
출판년도 is highly skewed (γ1 = 57.22192734)Skewed
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 14:38:36.545822
Analysis finished2024-03-14 14:38:40.930515
Duration4.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T23:38:42.091021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters70000
Distinct characters22
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row0004494
2nd row0005370
3rd rowKMI0272
4th row0008370
5th row0005053
ValueCountFrequency (%)
0004494 1
 
< 0.1%
gp00364 1
 
< 0.1%
0002067 1
 
< 0.1%
0007344 1
 
< 0.1%
0000112 1
 
< 0.1%
0001058 1
 
< 0.1%
mof0362 1
 
< 0.1%
0005671 1
 
< 0.1%
kmi0318 1
 
< 0.1%
0001879 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-03-14T23:38:43.558787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 30997
44.3%
1 5670
 
8.1%
2 3880
 
5.5%
3 3835
 
5.5%
4 3748
 
5.4%
8 3689
 
5.3%
7 3555
 
5.1%
5 3553
 
5.1%
6 3408
 
4.9%
9 3350
 
4.8%
Other values (12) 4315
 
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 65685
93.8%
Uppercase Letter 4315
 
6.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P 774
17.9%
G 729
16.9%
M 718
16.6%
K 405
9.4%
I 405
9.4%
C 392
9.1%
O 313
7.3%
F 313
7.3%
R 93
 
2.2%
T 83
 
1.9%
Other values (2) 90
 
2.1%
Decimal Number
ValueCountFrequency (%)
0 30997
47.2%
1 5670
 
8.6%
2 3880
 
5.9%
3 3835
 
5.8%
4 3748
 
5.7%
8 3689
 
5.6%
7 3555
 
5.4%
5 3553
 
5.4%
6 3408
 
5.2%
9 3350
 
5.1%

Most occurring scripts

ValueCountFrequency (%)
Common 65685
93.8%
Latin 4315
 
6.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
P 774
17.9%
G 729
16.9%
M 718
16.6%
K 405
9.4%
I 405
9.4%
C 392
9.1%
O 313
7.3%
F 313
7.3%
R 93
 
2.2%
T 83
 
1.9%
Other values (2) 90
 
2.1%
Common
ValueCountFrequency (%)
0 30997
47.2%
1 5670
 
8.6%
2 3880
 
5.9%
3 3835
 
5.8%
4 3748
 
5.7%
8 3689
 
5.6%
7 3555
 
5.4%
5 3553
 
5.4%
6 3408
 
5.2%
9 3350
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 30997
44.3%
1 5670
 
8.1%
2 3880
 
5.5%
3 3835
 
5.5%
4 3748
 
5.4%
8 3689
 
5.3%
7 3555
 
5.1%
5 3553
 
5.1%
6 3408
 
4.9%
9 3350
 
4.8%
Other values (12) 4315
 
6.2%

구분
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반도서
7940 
정부간행물(GP)
 
729
한국해양수산개발원(KMI)
 
405
아동도서
 
392
해양수산부(MOF)
 
313
Other values (3)
 
221

Length

Max length14
Median length4
Mean length5.0416
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반도서
2nd row일반도서
3rd row한국해양수산개발원(KMI)
4th row일반도서
5th row일반도서

Common Values

ValueCountFrequency (%)
일반도서 7940
79.4%
정부간행물(GP) 729
 
7.3%
한국해양수산개발원(KMI) 405
 
4.0%
아동도서 392
 
3.9%
해양수산부(MOF) 313
 
3.1%
참고자료(R) 93
 
0.9%
학술논문(T) 83
 
0.8%
울산항만공사(UPA) 45
 
0.4%

Length

2024-03-14T23:38:43.796007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:38:43.990604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반도서 7940
79.4%
정부간행물(gp 729
 
7.3%
한국해양수산개발원(kmi 405
 
4.0%
아동도서 392
 
3.9%
해양수산부(mof 313
 
3.1%
참고자료(r 93
 
0.9%
학술논문(t 83
 
0.8%
울산항만공사(upa 45
 
0.4%
Distinct9534
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T23:38:45.206133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length94
Mean length23.7777
Min length1

Characters and Unicode

Total characters237777
Distinct characters1496
Distinct categories14 ?
Distinct scripts5 ?
Distinct blocks14 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9217 ?
Unique (%)92.2%

Sample

1st row(프렌즈) 방콕
2nd rowIMF 견문록 : 세계경제의 중심 IMF 700일간의 기록
3rd row2017 수산·해양환경 통계
4th row우먼 인 윈도: A. J. 핀 장편소설
5th row쉿! 퇴직연금도 모르면서 은퇴설계를 하고 있다고 말하지 마라
ValueCountFrequency (%)
2534
 
4.3%
장편소설 455
 
0.8%
위한 396
 
0.7%
1 335
 
0.6%
2 323
 
0.6%
이야기 290
 
0.5%
연구 270
 
0.5%
233
 
0.4%
3 165
 
0.3%
어떻게 147
 
0.3%
Other values (22015) 53391
91.2%
2024-03-14T23:38:46.816852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48712
 
20.5%
: 5086
 
2.1%
4539
 
1.9%
3261
 
1.4%
2762
 
1.2%
2593
 
1.1%
2457
 
1.0%
2206
 
0.9%
2077
 
0.9%
1 2056
 
0.9%
Other values (1486) 162028
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 154046
64.8%
Space Separator 48712
 
20.5%
Lowercase Letter 10271
 
4.3%
Other Punctuation 9017
 
3.8%
Decimal Number 8555
 
3.6%
Uppercase Letter 3338
 
1.4%
Open Punctuation 1598
 
0.7%
Close Punctuation 1598
 
0.7%
Dash Punctuation 369
 
0.2%
Math Symbol 136
 
0.1%
Other values (4) 137
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4539
 
2.9%
3261
 
2.1%
2762
 
1.8%
2593
 
1.7%
2457
 
1.6%
2206
 
1.4%
2077
 
1.3%
1984
 
1.3%
1826
 
1.2%
1694
 
1.1%
Other values (1372) 128647
83.5%
Lowercase Letter
ValueCountFrequency (%)
e 1109
10.8%
o 941
 
9.2%
a 904
 
8.8%
n 903
 
8.8%
i 898
 
8.7%
t 838
 
8.2%
r 781
 
7.6%
s 670
 
6.5%
l 426
 
4.1%
d 346
 
3.4%
Other values (16) 2455
23.9%
Uppercase Letter
ValueCountFrequency (%)
S 365
 
10.9%
I 264
 
7.9%
P 248
 
7.4%
T 232
 
7.0%
A 231
 
6.9%
C 207
 
6.2%
E 199
 
6.0%
O 186
 
5.6%
R 186
 
5.6%
M 183
 
5.5%
Other values (16) 1037
31.1%
Other Punctuation
ValueCountFrequency (%)
: 5086
56.4%
. 1595
 
17.7%
, 1219
 
13.5%
· 492
 
5.5%
! 217
 
2.4%
? 170
 
1.9%
; 58
 
0.6%
& 45
 
0.5%
/ 39
 
0.4%
' 35
 
0.4%
Other values (8) 61
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 2056
24.0%
0 1916
22.4%
2 1872
21.9%
3 650
 
7.6%
5 466
 
5.4%
4 405
 
4.7%
6 324
 
3.8%
9 314
 
3.7%
7 290
 
3.4%
8 262
 
3.1%
Math Symbol
ValueCountFrequency (%)
~ 80
58.8%
+ 24
 
17.6%
> 12
 
8.8%
< 12
 
8.8%
4
 
2.9%
2
 
1.5%
× 1
 
0.7%
1
 
0.7%
Other Symbol
ValueCountFrequency (%)
12
57.1%
3
 
14.3%
® 3
 
14.3%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 1482
92.7%
[ 107
 
6.7%
5
 
0.3%
3
 
0.2%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1482
92.7%
] 107
 
6.7%
5
 
0.3%
3
 
0.2%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
21
40.4%
17
32.7%
8
 
15.4%
4
 
7.7%
2
 
3.8%
Modifier Symbol
ValueCountFrequency (%)
` 53
85.5%
´ 9
 
14.5%
Space Separator
ValueCountFrequency (%)
48712
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 369
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 153309
64.5%
Common 70070
29.5%
Latin 13661
 
5.7%
Han 735
 
0.3%
Hiragana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4539
 
3.0%
3261
 
2.1%
2762
 
1.8%
2593
 
1.7%
2457
 
1.6%
2206
 
1.4%
2077
 
1.4%
1984
 
1.3%
1826
 
1.2%
1694
 
1.1%
Other values (1178) 127910
83.4%
Han
ValueCountFrequency (%)
34
 
4.6%
30
 
4.1%
29
 
3.9%
29
 
3.9%
29
 
3.9%
29
 
3.9%
25
 
3.4%
24
 
3.3%
17
 
2.3%
15
 
2.0%
Other values (183) 474
64.5%
Common
ValueCountFrequency (%)
48712
69.5%
: 5086
 
7.3%
1 2056
 
2.9%
0 1916
 
2.7%
2 1872
 
2.7%
. 1595
 
2.3%
( 1482
 
2.1%
) 1482
 
2.1%
, 1219
 
1.7%
3 650
 
0.9%
Other values (47) 4000
 
5.7%
Latin
ValueCountFrequency (%)
e 1109
 
8.1%
o 941
 
6.9%
a 904
 
6.6%
n 903
 
6.6%
i 898
 
6.6%
t 838
 
6.1%
r 781
 
5.7%
s 670
 
4.9%
l 426
 
3.1%
S 365
 
2.7%
Other values (47) 5826
42.6%
Hiragana
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 153297
64.5%
ASCII 83086
34.9%
CJK 708
 
0.3%
None 567
 
0.2%
Number Forms 52
 
< 0.1%
CJK Compat Ideographs 27
 
< 0.1%
Enclosed Alphanum 12
 
< 0.1%
Compat Jamo 12
 
< 0.1%
Punctuation 6
 
< 0.1%
Letterlike Symbols 4
 
< 0.1%
Other values (4) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48712
58.6%
: 5086
 
6.1%
1 2056
 
2.5%
0 1916
 
2.3%
2 1872
 
2.3%
. 1595
 
1.9%
( 1482
 
1.8%
) 1482
 
1.8%
, 1219
 
1.5%
e 1109
 
1.3%
Other values (75) 16557
 
19.9%
Hangul
ValueCountFrequency (%)
4539
 
3.0%
3261
 
2.1%
2762
 
1.8%
2593
 
1.7%
2457
 
1.6%
2206
 
1.4%
2077
 
1.4%
1984
 
1.3%
1826
 
1.2%
1694
 
1.1%
Other values (1175) 127898
83.4%
None
ValueCountFrequency (%)
· 492
86.8%
20
 
3.5%
13
 
2.3%
´ 9
 
1.6%
5
 
0.9%
5
 
0.9%
4
 
0.7%
4
 
0.7%
3
 
0.5%
3
 
0.5%
Other values (7) 9
 
1.6%
CJK
ValueCountFrequency (%)
34
 
4.8%
30
 
4.2%
29
 
4.1%
29
 
4.1%
29
 
4.1%
29
 
4.1%
25
 
3.5%
24
 
3.4%
15
 
2.1%
15
 
2.1%
Other values (174) 449
63.4%
Number Forms
ValueCountFrequency (%)
21
40.4%
17
32.7%
8
 
15.4%
4
 
7.7%
2
 
3.8%
CJK Compat Ideographs
ValueCountFrequency (%)
17
63.0%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Enclosed Alphanum
ValueCountFrequency (%)
12
100.0%
Compat Jamo
ValueCountFrequency (%)
9
75.0%
2
 
16.7%
1
 
8.3%
Punctuation
ValueCountFrequency (%)
6
100.0%
Letterlike Symbols
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Hiragana
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct6438
Distinct (%)64.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T23:38:47.800814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length63
Mean length8.0183
Min length2

Characters and Unicode

Total characters80183
Distinct characters978
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5268 ?
Unique (%)52.7%

Sample

1st row안진헌 지음
2nd row최광해 지음
3rd row한국해양수산개발원
4th row핀, A. J. 지음
5th row김현기 지음
ValueCountFrequency (%)
지음 5383
 
25.8%
글·그림 295
 
1.4%
223
 
1.1%
해양수산부 186
 
0.9%
178
 
0.9%
by 142
 
0.7%
한국해양수산개발원 141
 
0.7%
국토해양부 134
 
0.6%
82
 
0.4%
81
 
0.4%
Other values (7359) 14048
67.2%
2024-03-14T23:38:49.078701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10895
 
13.6%
6020
 
7.5%
5589
 
7.0%
, 3194
 
4.0%
1650
 
2.1%
1117
 
1.4%
858
 
1.1%
825
 
1.0%
734
 
0.9%
718
 
0.9%
Other values (968) 48583
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59713
74.5%
Space Separator 10895
 
13.6%
Other Punctuation 4117
 
5.1%
Lowercase Letter 3091
 
3.9%
Uppercase Letter 1384
 
1.7%
Open Punctuation 439
 
0.5%
Close Punctuation 432
 
0.5%
Decimal Number 51
 
0.1%
Math Symbol 34
 
< 0.1%
Dash Punctuation 22
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6020
 
10.1%
5589
 
9.4%
1650
 
2.8%
1117
 
1.9%
858
 
1.4%
825
 
1.4%
734
 
1.2%
718
 
1.2%
699
 
1.2%
699
 
1.2%
Other values (887) 40804
68.3%
Lowercase Letter
ValueCountFrequency (%)
e 390
12.6%
a 285
 
9.2%
n 251
 
8.1%
r 234
 
7.6%
i 234
 
7.6%
o 210
 
6.8%
y 195
 
6.3%
t 180
 
5.8%
b 176
 
5.7%
l 162
 
5.2%
Other values (16) 774
25.0%
Uppercase Letter
ValueCountFrequency (%)
S 136
 
9.8%
M 115
 
8.3%
C 109
 
7.9%
K 108
 
7.8%
A 100
 
7.2%
B 100
 
7.2%
R 78
 
5.6%
J 72
 
5.2%
T 63
 
4.6%
I 51
 
3.7%
Other values (16) 452
32.7%
Decimal Number
ValueCountFrequency (%)
1 11
21.6%
2 10
19.6%
0 7
13.7%
3 6
11.8%
7 4
 
7.8%
6 4
 
7.8%
4 4
 
7.8%
9 3
 
5.9%
8 2
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 3194
77.6%
. 456
 
11.1%
· 432
 
10.5%
; 22
 
0.5%
/ 4
 
0.1%
& 3
 
0.1%
' 3
 
0.1%
: 3
 
0.1%
Open Punctuation
ValueCountFrequency (%)
[ 270
61.5%
( 159
36.2%
10
 
2.3%
Close Punctuation
ValueCountFrequency (%)
] 262
60.6%
) 160
37.0%
10
 
2.3%
Math Symbol
ValueCountFrequency (%)
< 17
50.0%
> 17
50.0%
Space Separator
ValueCountFrequency (%)
10895
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59531
74.2%
Common 15995
 
19.9%
Latin 4475
 
5.6%
Han 182
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6020
 
10.1%
5589
 
9.4%
1650
 
2.8%
1117
 
1.9%
858
 
1.4%
825
 
1.4%
734
 
1.2%
718
 
1.2%
699
 
1.2%
699
 
1.2%
Other values (782) 40622
68.2%
Han
ValueCountFrequency (%)
21
 
11.5%
9
 
4.9%
7
 
3.8%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (95) 121
66.5%
Latin
ValueCountFrequency (%)
e 390
 
8.7%
a 285
 
6.4%
n 251
 
5.6%
r 234
 
5.2%
i 234
 
5.2%
o 210
 
4.7%
y 195
 
4.4%
t 180
 
4.0%
b 176
 
3.9%
l 162
 
3.6%
Other values (42) 2158
48.2%
Common
ValueCountFrequency (%)
10895
68.1%
, 3194
 
20.0%
. 456
 
2.9%
· 432
 
2.7%
[ 270
 
1.7%
] 262
 
1.6%
) 160
 
1.0%
( 159
 
1.0%
- 22
 
0.1%
; 22
 
0.1%
Other values (19) 123
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59531
74.2%
ASCII 20014
 
25.0%
None 452
 
0.6%
CJK 178
 
0.2%
Enclosed Alphanum 4
 
< 0.1%
CJK Compat Ideographs 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10895
54.4%
, 3194
 
16.0%
. 456
 
2.3%
e 390
 
1.9%
a 285
 
1.4%
[ 270
 
1.3%
] 262
 
1.3%
n 251
 
1.3%
r 234
 
1.2%
i 234
 
1.2%
Other values (67) 3543
 
17.7%
Hangul
ValueCountFrequency (%)
6020
 
10.1%
5589
 
9.4%
1650
 
2.8%
1117
 
1.9%
858
 
1.4%
825
 
1.4%
734
 
1.2%
718
 
1.2%
699
 
1.2%
699
 
1.2%
Other values (782) 40622
68.2%
None
ValueCountFrequency (%)
· 432
95.6%
10
 
2.2%
10
 
2.2%
CJK
ValueCountFrequency (%)
21
 
11.8%
9
 
5.1%
7
 
3.9%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (93) 117
65.7%
Enclosed Alphanum
ValueCountFrequency (%)
4
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Distinct2802
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T23:38:50.037454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length32
Mean length5.4428
Min length1

Characters and Unicode

Total characters54428
Distinct characters759
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1574 ?
Unique (%)15.7%

Sample

1st row중앙북스
2nd row21세기북스
3rd row한국해양수산개발원
4th row비채김영사
5th row한스컨텐츠
ValueCountFrequency (%)
한국해양수산개발원 355
 
3.4%
문학동네 188
 
1.8%
해양수산부 151
 
1.4%
민음사 134
 
1.3%
위즈덤하우스 133
 
1.3%
김영사 126
 
1.2%
창비 125
 
1.2%
국토해양부 120
 
1.1%
황금가지 91
 
0.9%
21세기북스 82
 
0.8%
Other values (2881) 8990
85.7%
2024-03-14T23:38:51.279264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1723
 
3.2%
1453
 
2.7%
1342
 
2.5%
1295
 
2.4%
1276
 
2.3%
1128
 
2.1%
1095
 
2.0%
989
 
1.8%
977
 
1.8%
841
 
1.5%
Other values (749) 42309
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47915
88.0%
Lowercase Letter 2796
 
5.1%
Uppercase Letter 1643
 
3.0%
Other Punctuation 590
 
1.1%
Space Separator 528
 
1.0%
Decimal Number 374
 
0.7%
Close Punctuation 288
 
0.5%
Open Punctuation 278
 
0.5%
Dash Punctuation 8
 
< 0.1%
Modifier Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1723
 
3.6%
1453
 
3.0%
1342
 
2.8%
1295
 
2.7%
1276
 
2.7%
1128
 
2.4%
1095
 
2.3%
989
 
2.1%
977
 
2.0%
841
 
1.8%
Other values (668) 35796
74.7%
Uppercase Letter
ValueCountFrequency (%)
B 204
12.4%
K 171
 
10.4%
M 169
 
10.3%
R 108
 
6.6%
H 103
 
6.3%
I 95
 
5.8%
A 88
 
5.4%
S 83
 
5.1%
P 81
 
4.9%
Y 67
 
4.1%
Other values (16) 474
28.8%
Lowercase Letter
ValueCountFrequency (%)
o 392
14.0%
s 269
 
9.6%
e 230
 
8.2%
a 222
 
7.9%
i 212
 
7.6%
r 197
 
7.0%
n 186
 
6.7%
k 143
 
5.1%
t 129
 
4.6%
l 127
 
4.5%
Other values (15) 689
24.6%
Other Punctuation
ValueCountFrequency (%)
; 307
52.0%
: 137
23.2%
, 55
 
9.3%
. 39
 
6.6%
& 28
 
4.7%
15
 
2.5%
· 3
 
0.5%
' 2
 
0.3%
# 2
 
0.3%
1
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 155
41.4%
1 144
38.5%
0 36
 
9.6%
4 11
 
2.9%
3 11
 
2.9%
8 9
 
2.4%
9 5
 
1.3%
7 1
 
0.3%
5 1
 
0.3%
6 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 281
97.6%
] 7
 
2.4%
Open Punctuation
ValueCountFrequency (%)
( 277
99.6%
[ 1
 
0.4%
Modifier Symbol
ValueCountFrequency (%)
` 6
85.7%
´ 1
 
14.3%
Space Separator
ValueCountFrequency (%)
528
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Math Symbol
ValueCountFrequency (%)
× 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47712
87.7%
Latin 4439
 
8.2%
Common 2074
 
3.8%
Han 203
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1723
 
3.6%
1453
 
3.0%
1342
 
2.8%
1295
 
2.7%
1276
 
2.7%
1128
 
2.4%
1095
 
2.3%
989
 
2.1%
977
 
2.0%
841
 
1.8%
Other values (614) 35593
74.6%
Han
ValueCountFrequency (%)
40
19.7%
21
 
10.3%
17
 
8.4%
13
 
6.4%
9
 
4.4%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (44) 77
37.9%
Latin
ValueCountFrequency (%)
o 392
 
8.8%
s 269
 
6.1%
e 230
 
5.2%
a 222
 
5.0%
i 212
 
4.8%
B 204
 
4.6%
r 197
 
4.4%
n 186
 
4.2%
K 171
 
3.9%
M 169
 
3.8%
Other values (41) 2187
49.3%
Common
ValueCountFrequency (%)
528
25.5%
; 307
14.8%
) 281
13.5%
( 277
13.4%
2 155
 
7.5%
1 144
 
6.9%
: 137
 
6.6%
, 55
 
2.7%
. 39
 
1.9%
0 36
 
1.7%
Other values (20) 115
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47711
87.7%
ASCII 6492
 
11.9%
CJK 203
 
0.4%
None 21
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1723
 
3.6%
1453
 
3.0%
1342
 
2.8%
1295
 
2.7%
1276
 
2.7%
1128
 
2.4%
1095
 
2.3%
989
 
2.1%
977
 
2.0%
841
 
1.8%
Other values (613) 35592
74.6%
ASCII
ValueCountFrequency (%)
528
 
8.1%
o 392
 
6.0%
; 307
 
4.7%
) 281
 
4.3%
( 277
 
4.3%
s 269
 
4.1%
e 230
 
3.5%
a 222
 
3.4%
i 212
 
3.3%
B 204
 
3.1%
Other values (66) 3570
55.0%
CJK
ValueCountFrequency (%)
40
19.7%
21
 
10.3%
17
 
8.4%
13
 
6.4%
9
 
4.4%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (44) 77
37.9%
None
ValueCountFrequency (%)
15
71.4%
· 3
 
14.3%
1
 
4.8%
´ 1
 
4.8%
× 1
 
4.8%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

출판년도
Real number (ℝ)

SKEWED 

Distinct41
Distinct (%)0.4%
Missing34
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean2020.3282
Minimum20
Maximum20022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T23:38:51.517521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile2007
Q12012
median2015
Q32018
95-th percentile2022
Maximum20022
Range20002
Interquartile range (IQR)6

Descriptive statistics

Standard deviation313.05455
Coefficient of variation (CV)0.15495232
Kurtosis3290.323
Mean2020.3282
Median Absolute Deviation (MAD)3
Skewness57.221927
Sum20134591
Variance98003.153
MonotonicityNot monotonic
2024-03-14T23:38:51.767451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
2016 921
 
9.2%
2015 882
 
8.8%
2017 849
 
8.5%
2013 802
 
8.0%
2018 748
 
7.5%
2012 741
 
7.4%
2014 701
 
7.0%
2019 626
 
6.3%
2020 618
 
6.2%
2022 486
 
4.9%
Other values (31) 2592
25.9%
ValueCountFrequency (%)
20 1
 
< 0.1%
1980 1
 
< 0.1%
1985 1
 
< 0.1%
1986 1
 
< 0.1%
1989 1
 
< 0.1%
1990 1
 
< 0.1%
1992 10
0.1%
1993 2
 
< 0.1%
1994 4
 
< 0.1%
1995 8
0.1%
ValueCountFrequency (%)
20022 1
 
< 0.1%
20021 2
 
< 0.1%
2046 1
 
< 0.1%
2023 258
 
2.6%
2022 486
4.9%
2021 462
4.6%
2020 618
6.2%
2019 626
6.3%
2018 748
7.5%
2017 849
8.5%
Distinct9811
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T23:38:53.104896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length12.6185
Min length5

Characters and Unicode

Total characters126185
Distinct characters567
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9642 ?
Unique (%)96.4%

Sample

1st row981.44502 안78ㅂ
2nd row327.95 최641I
3rd rowKMI529.05 한644ㅇ v.2017
4th row843.6 핀64ㅇ
5th row325.358 김94ㅅ
ValueCountFrequency (%)
v.2 414
 
1.8%
v.1 401
 
1.7%
818 369
 
1.6%
c.2 310
 
1.3%
813.7 271
 
1.2%
833.6 239
 
1.0%
843 225
 
1.0%
813.6 209
 
0.9%
v.3 207
 
0.9%
v.4 117
 
0.5%
Other values (8826) 20208
88.0%
2024-03-14T23:38:54.625657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12971
 
10.3%
. 10822
 
8.6%
3 10446
 
8.3%
2 9106
 
7.2%
1 8942
 
7.1%
8 8268
 
6.6%
5 7413
 
5.9%
6 7393
 
5.9%
4 6982
 
5.5%
9 6696
 
5.3%
Other values (557) 37146
29.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 74851
59.3%
Other Letter 19495
 
15.4%
Space Separator 12971
 
10.3%
Other Punctuation 10823
 
8.6%
Uppercase Letter 4651
 
3.7%
Lowercase Letter 3096
 
2.5%
Dash Punctuation 253
 
0.2%
Close Punctuation 15
 
< 0.1%
Open Punctuation 15
 
< 0.1%
Math Symbol 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2189
 
11.2%
1137
 
5.8%
1070
 
5.5%
1029
 
5.3%
963
 
4.9%
780
 
4.0%
779
 
4.0%
595
 
3.1%
572
 
2.9%
549
 
2.8%
Other values (492) 9832
50.4%
Uppercase Letter
ValueCountFrequency (%)
P 793
17.1%
G 747
16.1%
M 737
15.8%
K 442
9.5%
I 431
9.3%
C 422
9.1%
O 322
6.9%
F 319
6.9%
R 100
 
2.2%
T 98
 
2.1%
Other values (15) 240
 
5.2%
Lowercase Letter
ValueCountFrequency (%)
v 2538
82.0%
c 442
 
14.3%
y 12
 
0.4%
s 12
 
0.4%
j 12
 
0.4%
p 11
 
0.4%
m 10
 
0.3%
i 8
 
0.3%
f 6
 
0.2%
o 5
 
0.2%
Other values (12) 40
 
1.3%
Decimal Number
ValueCountFrequency (%)
3 10446
14.0%
2 9106
12.2%
1 8942
11.9%
8 8268
11.0%
5 7413
9.9%
6 7393
9.9%
4 6982
9.3%
9 6696
8.9%
7 5369
7.2%
0 4236
5.7%
Other Punctuation
ValueCountFrequency (%)
. 10822
> 99.9%
; 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 12
80.0%
~ 3
 
20.0%
Space Separator
ValueCountFrequency (%)
12971
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 253
100.0%
Close Punctuation
ValueCountFrequency (%)
] 15
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 98943
78.4%
Hangul 19495
 
15.4%
Latin 7747
 
6.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2189
 
11.2%
1137
 
5.8%
1070
 
5.5%
1029
 
5.3%
963
 
4.9%
780
 
4.0%
779
 
4.0%
595
 
3.1%
572
 
2.9%
549
 
2.8%
Other values (492) 9832
50.4%
Latin
ValueCountFrequency (%)
v 2538
32.8%
P 793
 
10.2%
G 747
 
9.6%
M 737
 
9.5%
c 442
 
5.7%
K 442
 
5.7%
I 431
 
5.6%
C 422
 
5.4%
O 322
 
4.2%
F 319
 
4.1%
Other values (37) 554
 
7.2%
Common
ValueCountFrequency (%)
12971
13.1%
. 10822
10.9%
3 10446
10.6%
2 9106
9.2%
1 8942
9.0%
8 8268
8.4%
5 7413
7.5%
6 7393
7.5%
4 6982
7.1%
9 6696
6.8%
Other values (8) 9904
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106690
84.6%
Hangul 9993
 
7.9%
Compat Jamo 9502
 
7.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12971
12.2%
. 10822
10.1%
3 10446
9.8%
2 9106
8.5%
1 8942
8.4%
8 8268
7.7%
5 7413
6.9%
6 7393
6.9%
4 6982
 
6.5%
9 6696
 
6.3%
Other values (55) 17651
16.5%
Compat Jamo
ValueCountFrequency (%)
2189
23.0%
1137
12.0%
1070
11.3%
1029
10.8%
779
 
8.2%
595
 
6.3%
549
 
5.8%
524
 
5.5%
514
 
5.4%
301
 
3.2%
Other values (9) 815
 
8.6%
Hangul
ValueCountFrequency (%)
963
 
9.6%
780
 
7.8%
572
 
5.7%
343
 
3.4%
242
 
2.4%
228
 
2.3%
220
 
2.2%
205
 
2.1%
194
 
1.9%
176
 
1.8%
Other values (473) 6070
60.7%

Interactions

2024-03-14T23:38:39.809332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T23:38:54.782010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분출판년도
구분1.0000.234
출판년도0.2341.000
2024-03-14T23:38:54.917951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년도구분
출판년도1.0000.152
구분0.1521.000

Missing values

2024-03-14T23:38:40.365666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:38:40.749464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호구분자료명저자출판사출판년도청구기호
40810004494일반도서(프렌즈) 방콕안진헌 지음중앙북스2016981.44502 안78ㅂ
49060005370일반도서IMF 견문록 : 세계경제의 중심 IMF 700일간의 기록최광해 지음21세기북스2016327.95 최641I
12286KMI0272한국해양수산개발원(KMI)2017 수산·해양환경 통계한국해양수산개발원한국해양수산개발원2017KMI529.05 한644ㅇ v.2017
77610008370일반도서우먼 인 윈도: A. J. 핀 장편소설핀, A. J. 지음비채김영사2019843.6 핀64ㅇ
45940005053일반도서쉿! 퇴직연금도 모르면서 은퇴설계를 하고 있다고 말하지 마라김현기 지음한스컨텐츠2016325.358 김94ㅅ
79720008581일반도서일의 기쁨과 슬픔: 장류진 소설집장류진 지음창비2019813.7 장296ㅇ
35820003980일반도서(사람이 알아야 할 모든 것) 생각의 역사. 1 : 불에서 프로이트까지왓슨, 피터 지음들녘2013909 왓57ㅅ v.1
80460008655일반도서(1년 만에 교포로 오해받은 김아란의) 영어 정복기김아란 지음시대인2019740.7 김62ㅇ
34990003896일반도서김수영 전집. 2 : 산문김수영 지음민음사2015810.81 김56ㄱ v.2
93630010264일반도서그냥 하지 말라: 당신의 모든 것이 메시지다송길영 지음북스톤2021331.544 송18ㄱ
등록번호구분자료명저자출판사출판년도청구기호
63890006952일반도서(역사저널) 그날. 4: 임진왜란제작팀, KBS 역사저널 그날 지음민음사2016911.05 케68ㄱ v.4
590000062일반도서결정적 순간의 대화외, 케리 패터슨 지음시아출판사2008802.5 케318ㄱ
11608GP01235정부간행물(GP)2010 건설신기술 품셈[일위대가표]한국건설신기술협회서울;한국건설신기술협회2010GP531 한426ㅇ
2270000250일반도서海運·物流用語 大辭典 : 해운, 육송, 항공, 항만, 무역, 해상·적하보험, 조선, 물류기기·창고 등 국내물류, 전자상거래 용어 집대성, 코리아쉬핑가제트코리아쉬핑가제트2006326.3603 코367ㅎ
2300000253일반도서우리나라 삼국지. 2 : 삼국의 정립임동주 지음마야2008813.6 임913ㅇ v.2
11373GP00545정부간행물(GP)2013 공공기관 협업 우수사례집기획재정부한국조세재정연구원2013GP325.36 기982ㅇ v.2013 c.2
73720007981일반도서지식ⓔ and. [10]지식채널ⓔ, EBS 지음북하우스2017001 E16ㅈ v.10
12199KMI0185한국해양수산개발원(KMI)김, 넙치, 전복 수출 확대 방안에 관한 연구옥영수한국해양수산개발원2009KMI529.2 옥847ㄱ
11428GP00764정부간행물(GP)2013 통상백서산업통상지원부산업통상지원부2013GP326.205 산389ㅇ v.2013
29960003361일반도서부러진 귀글·그림, 에르제2015863 에297ㅅ v.6