Overview

Dataset statistics

Number of variables15
Number of observations10000
Missing cells10976
Missing cells (%)7.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 MiB
Average record size in memory129.0 B

Variable types

Text10
Numeric1
Categorical4

Dataset

Description도로교통공단 도서실에서 소장 중인 자료에 대한 장서명,저자,발행사,청구기호,발행년도,자료유형 등 서지정보 데이터
Author도로교통공단
URLhttps://www.data.go.kr/data/15049023/fileData.do

Alerts

별치기호 is highly overall correlated with 자료유형High correlation
자료유형 is highly overall correlated with 별치기호High correlation
복본기호 is highly imbalanced (85.5%)Imbalance
별치기호 is highly imbalanced (61.0%)Imbalance
소장처명 is highly imbalanced (50.2%)Imbalance
청구기호 has 1793 (17.9%) missing valuesMissing
색인청구기호 has 1793 (17.9%) missing valuesMissing
권책기호 has 7350 (73.5%) missing valuesMissing
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:05:42.989388
Analysis finished2023-12-12 05:05:48.053647
Duration5.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:05:48.396717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.2231
Min length1

Characters and Unicode

Total characters62231
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row9000074
2nd row21018
3rd row11205
4th row4270
5th rowAR002732
ValueCountFrequency (%)
9000074 1
 
< 0.1%
ar002419 1
 
< 0.1%
17563 1
 
< 0.1%
9084 1
 
< 0.1%
14000407 1
 
< 0.1%
5123 1
 
< 0.1%
15260 1
 
< 0.1%
11000004 1
 
< 0.1%
17554 1
 
< 0.1%
18000658 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T14:05:48.956817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 17280
27.8%
1 9065
14.6%
2 5387
 
8.7%
3 4219
 
6.8%
4 4207
 
6.8%
6 3835
 
6.2%
5 3797
 
6.1%
8 3668
 
5.9%
7 3636
 
5.8%
9 3457
 
5.6%
Other values (2) 3680
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 58551
94.1%
Uppercase Letter 3680
 
5.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 17280
29.5%
1 9065
15.5%
2 5387
 
9.2%
3 4219
 
7.2%
4 4207
 
7.2%
6 3835
 
6.5%
5 3797
 
6.5%
8 3668
 
6.3%
7 3636
 
6.2%
9 3457
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 1840
50.0%
R 1840
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 58551
94.1%
Latin 3680
 
5.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 17280
29.5%
1 9065
15.5%
2 5387
 
9.2%
3 4219
 
7.2%
4 4207
 
7.2%
6 3835
 
6.5%
5 3797
 
6.5%
8 3668
 
6.3%
7 3636
 
6.2%
9 3457
 
5.9%
Latin
ValueCountFrequency (%)
A 1840
50.0%
R 1840
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62231
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17280
27.8%
1 9065
14.6%
2 5387
 
8.7%
3 4219
 
6.8%
4 4207
 
6.8%
6 3835
 
6.2%
5 3797
 
6.1%
8 3668
 
5.9%
7 3636
 
5.8%
9 3457
 
5.6%
Other values (2) 3680
 
5.9%

서명
Text

Distinct7732
Distinct (%)77.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:05:49.292260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length199
Median length132
Mean length29.3506
Min length2

Characters and Unicode

Total characters293506
Distinct characters1708
Distinct categories17 ?
Distinct scripts6 ?
Distinct blocks14 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6731 ?
Unique (%)67.3%

Sample

1st row오토 CAR 운전테크닉 : 자동변속운전면허 교재
2nd rowTransit 2014
3rd row파워포인트 프레젠테이션 실무활용테크닉 : 1분1초가 아까운 비즈니스맨을 위한
4th row交通安全 : 話の花束
5th row노년층 인구 증가에 대비한 교통안전
ValueCountFrequency (%)
4099
 
7.4%
of 1187
 
2.1%
연구 693
 
1.2%
and 666
 
1.2%
the 602
 
1.1%
591
 
1.1%
관한 464
 
0.8%
위한 373
 
0.7%
in 339
 
0.6%
교통사고 249
 
0.4%
Other values (19080) 46403
83.4%
2023-12-12T14:05:49.843040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46440
 
15.8%
e 9968
 
3.4%
n 8613
 
2.9%
a 8244
 
2.8%
i 8153
 
2.8%
o 7990
 
2.7%
t 7713
 
2.6%
r 6916
 
2.4%
s 5679
 
1.9%
c 3902
 
1.3%
Other values (1698) 179888
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125909
42.9%
Lowercase Letter 92122
31.4%
Space Separator 46440
 
15.8%
Uppercase Letter 12961
 
4.4%
Decimal Number 6058
 
2.1%
Other Punctuation 5368
 
1.8%
Open Punctuation 1400
 
0.5%
Close Punctuation 1398
 
0.5%
Math Symbol 1002
 
0.3%
Dash Punctuation 679
 
0.2%
Other values (7) 169
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3298
 
2.6%
3212
 
2.6%
2865
 
2.3%
2747
 
2.2%
2297
 
1.8%
2267
 
1.8%
1958
 
1.6%
1741
 
1.4%
1738
 
1.4%
1735
 
1.4%
Other values (1585) 102051
81.1%
Lowercase Letter
ValueCountFrequency (%)
e 9968
10.8%
n 8613
9.3%
a 8244
 
8.9%
i 8153
 
8.9%
o 7990
 
8.7%
t 7713
 
8.4%
r 6916
 
7.5%
s 5679
 
6.2%
c 3902
 
4.2%
l 3659
 
4.0%
Other values (16) 21285
23.1%
Uppercase Letter
ValueCountFrequency (%)
S 1335
 
10.3%
T 1236
 
9.5%
A 1136
 
8.8%
C 966
 
7.5%
E 963
 
7.4%
I 820
 
6.3%
P 729
 
5.6%
R 699
 
5.4%
O 596
 
4.6%
D 586
 
4.5%
Other values (16) 3895
30.1%
Other Punctuation
ValueCountFrequency (%)
: 3094
57.6%
, 711
 
13.2%
. 683
 
12.7%
· 241
 
4.5%
' 214
 
4.0%
! 135
 
2.5%
/ 108
 
2.0%
& 91
 
1.7%
" 40
 
0.7%
; 34
 
0.6%
Other values (5) 17
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 1745
28.8%
2 1243
20.5%
1 1193
19.7%
9 418
 
6.9%
3 357
 
5.9%
5 264
 
4.4%
4 247
 
4.1%
8 212
 
3.5%
6 211
 
3.5%
7 168
 
2.8%
Math Symbol
ValueCountFrequency (%)
= 915
91.3%
+ 47
 
4.7%
~ 26
 
2.6%
< 6
 
0.6%
> 6
 
0.6%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1291
92.2%
[ 103
 
7.4%
3
 
0.2%
2
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1289
92.2%
] 103
 
7.4%
3
 
0.2%
2
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
71
48.6%
46
31.5%
21
 
14.4%
5
 
3.4%
3
 
2.1%
Other Number
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
Other Symbol
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
46440
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 679
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122812
41.8%
Latin 105229
35.9%
Common 62368
21.2%
Han 2349
 
0.8%
Hiragana 433
 
0.1%
Katakana 315
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3298
 
2.7%
3212
 
2.6%
2865
 
2.3%
2747
 
2.2%
2297
 
1.9%
2267
 
1.8%
1958
 
1.6%
1741
 
1.4%
1738
 
1.4%
1735
 
1.4%
Other values (1007) 98954
80.6%
Han
ValueCountFrequency (%)
137
 
5.8%
132
 
5.6%
81
 
3.4%
75
 
3.2%
53
 
2.3%
43
 
1.8%
42
 
1.8%
39
 
1.7%
32
 
1.4%
32
 
1.4%
Other values (460) 1683
71.6%
Katakana
ValueCountFrequency (%)
23
 
7.3%
20
 
6.3%
19
 
6.0%
16
 
5.1%
15
 
4.8%
15
 
4.8%
12
 
3.8%
11
 
3.5%
10
 
3.2%
9
 
2.9%
Other values (50) 165
52.4%
Latin
ValueCountFrequency (%)
e 9968
 
9.5%
n 8613
 
8.2%
a 8244
 
7.8%
i 8153
 
7.7%
o 7990
 
7.6%
t 7713
 
7.3%
r 6916
 
6.6%
s 5679
 
5.4%
c 3902
 
3.7%
l 3659
 
3.5%
Other values (47) 34392
32.7%
Common
ValueCountFrequency (%)
46440
74.5%
: 3094
 
5.0%
0 1745
 
2.8%
( 1291
 
2.1%
) 1289
 
2.1%
2 1243
 
2.0%
1 1193
 
1.9%
= 915
 
1.5%
, 711
 
1.1%
. 683
 
1.1%
Other values (46) 3764
 
6.0%
Hiragana
ValueCountFrequency (%)
110
25.4%
38
 
8.8%
37
 
8.5%
37
 
8.5%
20
 
4.6%
14
 
3.2%
14
 
3.2%
11
 
2.5%
11
 
2.5%
10
 
2.3%
Other values (38) 131
30.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 167172
57.0%
Hangul 122805
41.8%
CJK 2324
 
0.8%
Hiragana 433
 
0.1%
Katakana 315
 
0.1%
None 256
 
0.1%
Number Forms 146
 
< 0.1%
CJK Compat Ideographs 25
 
< 0.1%
Punctuation 8
 
< 0.1%
Enclosed Alphanum 8
 
< 0.1%
Other values (4) 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46440
27.8%
e 9968
 
6.0%
n 8613
 
5.2%
a 8244
 
4.9%
i 8153
 
4.9%
o 7990
 
4.8%
t 7713
 
4.6%
r 6916
 
4.1%
s 5679
 
3.4%
c 3902
 
2.3%
Other values (76) 53554
32.0%
Hangul
ValueCountFrequency (%)
3298
 
2.7%
3212
 
2.6%
2865
 
2.3%
2747
 
2.2%
2297
 
1.9%
2267
 
1.8%
1958
 
1.6%
1741
 
1.4%
1738
 
1.4%
1735
 
1.4%
Other values (1005) 98947
80.6%
None
ValueCountFrequency (%)
· 241
94.1%
3
 
1.2%
3
 
1.2%
2
 
0.8%
2
 
0.8%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
CJK
ValueCountFrequency (%)
137
 
5.9%
132
 
5.7%
81
 
3.5%
75
 
3.2%
53
 
2.3%
43
 
1.9%
42
 
1.8%
39
 
1.7%
32
 
1.4%
32
 
1.4%
Other values (447) 1658
71.3%
Hiragana
ValueCountFrequency (%)
110
25.4%
38
 
8.8%
37
 
8.5%
37
 
8.5%
20
 
4.6%
14
 
3.2%
14
 
3.2%
11
 
2.5%
11
 
2.5%
10
 
2.3%
Other values (38) 131
30.3%
Number Forms
ValueCountFrequency (%)
71
48.6%
46
31.5%
21
 
14.4%
5
 
3.4%
3
 
2.1%
Katakana
ValueCountFrequency (%)
23
 
7.3%
20
 
6.3%
19
 
6.0%
16
 
5.1%
15
 
4.8%
15
 
4.8%
12
 
3.8%
11
 
3.5%
10
 
3.2%
9
 
2.9%
Other values (50) 165
52.4%
CJK Compat Ideographs
ValueCountFrequency (%)
7
28.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (3) 3
12.0%
Compat Jamo
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
CJK Compat
ValueCountFrequency (%)
5
100.0%
Punctuation
ValueCountFrequency (%)
4
50.0%
2
25.0%
2
25.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Math Operators
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct7708
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:05:50.318839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length166
Median length117
Mean length23.7922
Min length2

Characters and Unicode

Total characters237922
Distinct characters1090
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6698 ?
Unique (%)67.0%

Sample

1st row오토CAR운전테크닉자동변속운전면허교재
2nd rowTRANSIT2014
3rd row파워포인트프레젠테이션실무활용테크닉1분1초가아까운비즈니스맨을위한
4th row교통안전화노화속
5th row노년층인구증가에대비한교통안전
ValueCountFrequency (%)
대한교통학회지=journalofthetransportationresearchsocietyofkorea 75
 
0.7%
대한민국현행법령집 56
 
0.6%
교수연구논문집 55
 
0.5%
교통안전연구논집 44
 
0.4%
교통사고통계 32
 
0.3%
한국자동차공학회논문집=transactionsofkoreasocietyofengineers 29
 
0.3%
교통기술과정책 24
 
0.2%
도로교통안전백서 22
 
0.2%
신호등 22
 
0.2%
경찰백서 20
 
0.2%
Other values (7710) 9639
96.2%
2023-12-12T14:05:50.922649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 10825
 
4.5%
A 9335
 
3.9%
N 9092
 
3.8%
I 8971
 
3.8%
T 8843
 
3.7%
O 8578
 
3.6%
R 7611
 
3.2%
S 7012
 
2.9%
C 4866
 
2.0%
L 4003
 
1.7%
Other values (1080) 158786
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125903
52.9%
Uppercase Letter 104679
44.0%
Decimal Number 6058
 
2.5%
Math Symbol 964
 
0.4%
Letter Number 146
 
0.1%
Other Punctuation 98
 
< 0.1%
Lowercase Letter 36
 
< 0.1%
Space Separator 22
 
< 0.1%
Other Symbol 7
 
< 0.1%
Other Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3452
 
2.7%
3361
 
2.7%
2942
 
2.3%
2752
 
2.2%
2408
 
1.9%
2302
 
1.8%
1996
 
1.6%
1816
 
1.4%
1779
 
1.4%
1779
 
1.4%
Other values (1007) 101316
80.5%
Uppercase Letter
ValueCountFrequency (%)
E 10825
10.3%
A 9335
 
8.9%
N 9092
 
8.7%
I 8971
 
8.6%
T 8843
 
8.4%
O 8578
 
8.2%
R 7611
 
7.3%
S 7012
 
6.7%
C 4866
 
4.6%
L 4003
 
3.8%
Other values (16) 25543
24.4%
Lowercase Letter
ValueCountFrequency (%)
o 8
22.2%
r 4
11.1%
f 2
 
5.6%
h 2
 
5.6%
d 2
 
5.6%
p 2
 
5.6%
c 2
 
5.6%
m 2
 
5.6%
t 2
 
5.6%
e 2
 
5.6%
Other values (4) 8
22.2%
Decimal Number
ValueCountFrequency (%)
0 1745
28.8%
2 1243
20.5%
1 1193
19.7%
9 418
 
6.9%
3 357
 
5.9%
5 264
 
4.4%
4 247
 
4.1%
8 212
 
3.5%
6 211
 
3.5%
7 168
 
2.8%
Other Punctuation
ValueCountFrequency (%)
91
92.9%
4
 
4.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Letter Number
ValueCountFrequency (%)
71
48.6%
46
31.5%
21
 
14.4%
5
 
3.4%
3
 
2.1%
Other Number
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
Math Symbol
ValueCountFrequency (%)
= 915
94.9%
48
 
5.0%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
22
100.0%
Control
ValueCountFrequency (%)
 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 125902
52.9%
Latin 104861
44.1%
Common 7158
 
3.0%
Katakana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3452
 
2.7%
3361
 
2.7%
2942
 
2.3%
2752
 
2.2%
2408
 
1.9%
2302
 
1.8%
1996
 
1.6%
1816
 
1.4%
1779
 
1.4%
1779
 
1.4%
Other values (1006) 101315
80.5%
Latin
ValueCountFrequency (%)
E 10825
10.3%
A 9335
 
8.9%
N 9092
 
8.7%
I 8971
 
8.6%
T 8843
 
8.4%
O 8578
 
8.2%
R 7611
 
7.3%
S 7012
 
6.7%
C 4866
 
4.6%
L 4003
 
3.8%
Other values (35) 25725
24.5%
Common
ValueCountFrequency (%)
0 1745
24.4%
2 1243
17.4%
1 1193
16.7%
= 915
12.8%
9 418
 
5.8%
3 357
 
5.0%
5 264
 
3.7%
4 247
 
3.5%
8 212
 
3.0%
6 211
 
2.9%
Other values (18) 353
 
4.9%
Katakana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 125901
52.9%
ASCII 111712
47.0%
Number Forms 146
 
0.1%
None 142
 
0.1%
Enclosed Alphanum 8
 
< 0.1%
CJK Compat 5
 
< 0.1%
Punctuation 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Katakana 1
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 10825
 
9.7%
A 9335
 
8.4%
N 9092
 
8.1%
I 8971
 
8.0%
T 8843
 
7.9%
O 8578
 
7.7%
R 7611
 
6.8%
S 7012
 
6.3%
C 4866
 
4.4%
L 4003
 
3.6%
Other values (43) 32576
29.2%
Hangul
ValueCountFrequency (%)
3452
 
2.7%
3361
 
2.7%
2942
 
2.3%
2752
 
2.2%
2408
 
1.9%
2302
 
1.8%
1996
 
1.6%
1816
 
1.4%
1779
 
1.4%
1779
 
1.4%
Other values (1005) 101314
80.5%
None
ValueCountFrequency (%)
91
64.1%
48
33.8%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Number Forms
ValueCountFrequency (%)
71
48.6%
46
31.5%
21
 
14.4%
5
 
3.4%
3
 
2.1%
CJK Compat
ValueCountFrequency (%)
5
100.0%
Punctuation
ValueCountFrequency (%)
4
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct5222
Distinct (%)52.3%
Missing20
Missing (%)0.2%
Memory size156.2 KiB
2023-12-12T14:05:51.281880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length48
Mean length8.0955912
Min length2

Characters and Unicode

Total characters80794
Distinct characters861
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4115 ?
Unique (%)41.2%

Sample

1st row김재윤,조칠호,황만식
2nd rowTransportation Research Board
3rd row공병훈
4th row목내준부
5th row김준식
ValueCountFrequency (%)
도로교통공단 430
 
3.0%
도로교통안전관리공단 388
 
2.7%
research 376
 
2.6%
transportation 374
 
2.6%
board 373
 
2.6%
도로교통안전협회 313
 
2.2%
경찰청 210
 
1.5%
139
 
1.0%
교통사고종합분석센터 126
 
0.9%
대한교통학회 113
 
0.8%
Other values (6132) 11534
80.2%
2023-12-12T14:05:51.793411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4403
 
5.4%
a 2827
 
3.5%
r 2466
 
3.1%
2064
 
2.6%
o 2030
 
2.5%
e 2005
 
2.5%
1924
 
2.4%
n 1821
 
2.3%
. 1781
 
2.2%
1419
 
1.8%
Other values (851) 58054
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47871
59.3%
Lowercase Letter 20305
25.1%
Uppercase Letter 4726
 
5.8%
Space Separator 4403
 
5.4%
Other Punctuation 3234
 
4.0%
Decimal Number 120
 
0.1%
Dash Punctuation 55
 
0.1%
Close Punctuation 39
 
< 0.1%
Open Punctuation 37
 
< 0.1%
Math Symbol 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2064
 
4.3%
1924
 
4.0%
1419
 
3.0%
1309
 
2.7%
1129
 
2.4%
1062
 
2.2%
1004
 
2.1%
973
 
2.0%
943
 
2.0%
892
 
1.9%
Other values (770) 35152
73.4%
Lowercase Letter
ValueCountFrequency (%)
a 2827
13.9%
r 2466
12.1%
o 2030
10.0%
e 2005
9.9%
n 1821
9.0%
i 1330
 
6.6%
t 1316
 
6.5%
s 1259
 
6.2%
h 833
 
4.1%
d 754
 
3.7%
Other values (16) 3664
18.0%
Uppercase Letter
ValueCountFrequency (%)
B 600
12.7%
R 569
12.0%
T 531
 
11.2%
S 381
 
8.1%
M 258
 
5.5%
A 242
 
5.1%
J 220
 
4.7%
C 213
 
4.5%
D 194
 
4.1%
L 177
 
3.7%
Other values (16) 1341
28.4%
Decimal Number
ValueCountFrequency (%)
1 52
43.3%
2 18
 
15.0%
3 16
 
13.3%
5 9
 
7.5%
4 7
 
5.8%
6 5
 
4.2%
9 5
 
4.2%
7 3
 
2.5%
8 3
 
2.5%
0 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 1781
55.1%
, 1404
43.4%
/ 28
 
0.9%
& 9
 
0.3%
· 4
 
0.1%
' 4
 
0.1%
" 2
 
0.1%
: 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 33
84.6%
] 5
 
12.8%
} 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 32
86.5%
[ 5
 
13.5%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
4403
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47391
58.7%
Latin 25032
31.0%
Common 7891
 
9.8%
Han 422
 
0.5%
Katakana 58
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2064
 
4.4%
1924
 
4.1%
1419
 
3.0%
1309
 
2.8%
1129
 
2.4%
1062
 
2.2%
1004
 
2.1%
973
 
2.1%
943
 
2.0%
892
 
1.9%
Other values (649) 34672
73.2%
Han
ValueCountFrequency (%)
42
 
10.0%
42
 
10.0%
19
 
4.5%
16
 
3.8%
16
 
3.8%
15
 
3.6%
15
 
3.6%
15
 
3.6%
12
 
2.8%
12
 
2.8%
Other values (94) 218
51.7%
Latin
ValueCountFrequency (%)
a 2827
 
11.3%
r 2466
 
9.9%
o 2030
 
8.1%
e 2005
 
8.0%
n 1821
 
7.3%
i 1330
 
5.3%
t 1316
 
5.3%
s 1259
 
5.0%
h 833
 
3.3%
d 754
 
3.0%
Other values (43) 8391
33.5%
Common
ValueCountFrequency (%)
4403
55.8%
. 1781
22.6%
, 1404
 
17.8%
- 55
 
0.7%
1 52
 
0.7%
) 33
 
0.4%
( 32
 
0.4%
/ 28
 
0.4%
2 18
 
0.2%
3 16
 
0.2%
Other values (18) 69
 
0.9%
Katakana
ValueCountFrequency (%)
15
25.9%
14
24.1%
14
24.1%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (7) 7
12.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47390
58.7%
ASCII 32917
40.7%
CJK 419
 
0.5%
Katakana 58
 
0.1%
None 4
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Number Forms 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4403
 
13.4%
a 2827
 
8.6%
r 2466
 
7.5%
o 2030
 
6.2%
e 2005
 
6.1%
n 1821
 
5.5%
. 1781
 
5.4%
, 1404
 
4.3%
i 1330
 
4.0%
t 1316
 
4.0%
Other values (68) 11534
35.0%
Hangul
ValueCountFrequency (%)
2064
 
4.4%
1924
 
4.1%
1419
 
3.0%
1309
 
2.8%
1129
 
2.4%
1062
 
2.2%
1004
 
2.1%
973
 
2.1%
943
 
2.0%
892
 
1.9%
Other values (648) 34671
73.2%
CJK
ValueCountFrequency (%)
42
 
10.0%
42
 
10.0%
19
 
4.5%
16
 
3.8%
16
 
3.8%
15
 
3.6%
15
 
3.6%
15
 
3.6%
12
 
2.9%
12
 
2.9%
Other values (91) 215
51.3%
Katakana
ValueCountFrequency (%)
15
25.9%
14
24.1%
14
24.1%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (7) 7
12.1%
None
ValueCountFrequency (%)
· 4
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct5173
Distinct (%)51.8%
Missing20
Missing (%)0.2%
Memory size156.2 KiB
2023-12-12T14:05:52.160450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length42
Mean length7.3181363
Min length2

Characters and Unicode

Total characters73035
Distinct characters701
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4068 ?
Unique (%)40.8%

Sample

1st row김재윤조칠호황만식
2nd rowTRANSPORTATIONRESEARCHBOARD
3rd row공병훈
4th row목내준부
5th row김준식
ValueCountFrequency (%)
transportationresearchboard 369
 
3.7%
도로교통안전관리공단 308
 
3.1%
도로교통안전협회 307
 
3.1%
도로교통공단 197
 
2.0%
경찰청 192
 
1.9%
대한교통학회 113
 
1.1%
건설교통부 101
 
1.0%
도로교통공단교통사고종합분석센터 90
 
0.9%
교통개발연구원 75
 
0.8%
대한토목학회 61
 
0.6%
Other values (5164) 8169
81.8%
2023-12-12T14:05:52.716915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 3069
 
4.2%
R 3035
 
4.2%
E 2158
 
3.0%
2108
 
2.9%
O 2107
 
2.9%
1966
 
2.7%
N 1914
 
2.6%
T 1847
 
2.5%
S 1640
 
2.2%
1422
 
1.9%
Other values (691) 51769
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47871
65.5%
Uppercase Letter 25031
34.3%
Decimal Number 120
 
0.2%
Other Punctuation 9
 
< 0.1%
Space Separator 2
 
< 0.1%
Letter Number 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2108
 
4.4%
1966
 
4.1%
1422
 
3.0%
1313
 
2.7%
1149
 
2.4%
1064
 
2.2%
1016
 
2.1%
974
 
2.0%
945
 
2.0%
911
 
1.9%
Other values (651) 35003
73.1%
Uppercase Letter
ValueCountFrequency (%)
A 3069
12.3%
R 3035
12.1%
E 2158
 
8.6%
O 2107
 
8.4%
N 1914
 
7.6%
T 1847
 
7.4%
S 1640
 
6.6%
I 1410
 
5.6%
H 997
 
4.0%
D 948
 
3.8%
Other values (16) 5906
23.6%
Decimal Number
ValueCountFrequency (%)
1 52
43.3%
2 18
 
15.0%
3 16
 
13.3%
5 9
 
7.5%
4 7
 
5.8%
6 5
 
4.2%
9 5
 
4.2%
7 3
 
2.5%
8 3
 
2.5%
0 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
9
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47871
65.5%
Latin 25032
34.3%
Common 132
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2108
 
4.4%
1966
 
4.1%
1422
 
3.0%
1313
 
2.7%
1149
 
2.4%
1064
 
2.2%
1016
 
2.1%
974
 
2.0%
945
 
2.0%
911
 
1.9%
Other values (651) 35003
73.1%
Latin
ValueCountFrequency (%)
A 3069
12.3%
R 3035
12.1%
E 2158
 
8.6%
O 2107
 
8.4%
N 1914
 
7.6%
T 1847
 
7.4%
S 1640
 
6.6%
I 1410
 
5.6%
H 997
 
4.0%
D 948
 
3.8%
Other values (17) 5907
23.6%
Common
ValueCountFrequency (%)
1 52
39.4%
2 18
 
13.6%
3 16
 
12.1%
9
 
6.8%
5 9
 
6.8%
4 7
 
5.3%
6 5
 
3.8%
9 5
 
3.8%
7 3
 
2.3%
8 3
 
2.3%
Other values (3) 5
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47870
65.5%
ASCII 25153
34.4%
None 9
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Number Forms 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 3069
12.2%
R 3035
12.1%
E 2158
 
8.6%
O 2107
 
8.4%
N 1914
 
7.6%
T 1847
 
7.3%
S 1640
 
6.5%
I 1410
 
5.6%
H 997
 
4.0%
D 948
 
3.8%
Other values (27) 6028
24.0%
Hangul
ValueCountFrequency (%)
2108
 
4.4%
1966
 
4.1%
1422
 
3.0%
1313
 
2.7%
1149
 
2.4%
1064
 
2.2%
1016
 
2.1%
974
 
2.0%
945
 
2.0%
911
 
1.9%
Other values (650) 35002
73.1%
None
ValueCountFrequency (%)
9
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Distinct2311
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:05:53.056027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length49
Mean length8.0281
Min length1

Characters and Unicode

Total characters80281
Distinct characters864
Distinct categories9 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1454 ?
Unique (%)14.5%

Sample

1st row골든벨
2nd rowTRB
3rd row길벗
4th row立花書房
5th row교통안전진흥공단
ValueCountFrequency (%)
도로교통안전관리공단 865
 
6.8%
도로교통공단 630
 
5.0%
도로교통안전협회 525
 
4.2%
research 397
 
3.1%
trb 366
 
2.9%
transportation 366
 
2.9%
board 354
 
2.8%
대한교통학회 258
 
2.0%
경찰청 185
 
1.5%
교통개발연구원 182
 
1.4%
Other values (2466) 8522
67.4%
2023-12-12T14:05:53.534884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3303
 
4.1%
3058
 
3.8%
2652
 
3.3%
2344
 
2.9%
r 2277
 
2.8%
a 2260
 
2.8%
o 2227
 
2.8%
2177
 
2.7%
1892
 
2.4%
1865
 
2.3%
Other values (854) 56226
70.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53663
66.8%
Lowercase Letter 18223
 
22.7%
Uppercase Letter 4779
 
6.0%
Space Separator 2652
 
3.3%
Other Punctuation 494
 
0.6%
Open Punctuation 164
 
0.2%
Close Punctuation 152
 
0.2%
Decimal Number 102
 
0.1%
Dash Punctuation 52
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3303
 
6.2%
3058
 
5.7%
2344
 
4.4%
2177
 
4.1%
1892
 
3.5%
1865
 
3.5%
1738
 
3.2%
1732
 
3.2%
1559
 
2.9%
1319
 
2.5%
Other values (783) 32676
60.9%
Lowercase Letter
ValueCountFrequency (%)
r 2277
12.5%
a 2260
12.4%
o 2227
12.2%
e 1851
10.2%
n 1687
9.3%
s 1362
7.5%
t 1147
6.3%
i 1033
 
5.7%
c 951
 
5.2%
h 690
 
3.8%
Other values (15) 2738
15.0%
Uppercase Letter
ValueCountFrequency (%)
T 978
20.5%
R 864
18.1%
B 819
17.1%
P 438
9.2%
S 340
 
7.1%
A 193
 
4.0%
J 150
 
3.1%
I 144
 
3.0%
O 137
 
2.9%
E 120
 
2.5%
Other values (13) 596
12.5%
Decimal Number
ValueCountFrequency (%)
2 39
38.2%
1 39
38.2%
3 6
 
5.9%
9 6
 
5.9%
5 3
 
2.9%
6 3
 
2.9%
0 3
 
2.9%
8 2
 
2.0%
4 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
: 170
34.4%
. 169
34.2%
· 60
 
12.1%
, 44
 
8.9%
& 42
 
8.5%
' 4
 
0.8%
; 3
 
0.6%
/ 2
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 156
95.1%
[ 8
 
4.9%
Close Punctuation
ValueCountFrequency (%)
) 144
94.7%
] 8
 
5.3%
Space Separator
ValueCountFrequency (%)
2652
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51897
64.6%
Latin 23002
28.7%
Common 3616
 
4.5%
Han 1550
 
1.9%
Katakana 148
 
0.2%
Hiragana 68
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3303
 
6.4%
3058
 
5.9%
2344
 
4.5%
2177
 
4.2%
1892
 
3.6%
1865
 
3.6%
1738
 
3.3%
1732
 
3.3%
1559
 
3.0%
1319
 
2.5%
Other values (544) 30910
59.6%
Han
ValueCountFrequency (%)
74
 
4.8%
66
 
4.3%
66
 
4.3%
61
 
3.9%
59
 
3.8%
48
 
3.1%
46
 
3.0%
42
 
2.7%
42
 
2.7%
40
 
2.6%
Other values (189) 1006
64.9%
Latin
ValueCountFrequency (%)
r 2277
 
9.9%
a 2260
 
9.8%
o 2227
 
9.7%
e 1851
 
8.0%
n 1687
 
7.3%
s 1362
 
5.9%
t 1147
 
5.0%
i 1033
 
4.5%
T 978
 
4.3%
c 951
 
4.1%
Other values (38) 7229
31.4%
Katakana
ValueCountFrequency (%)
33
22.3%
31
20.9%
30
20.3%
5
 
3.4%
5
 
3.4%
4
 
2.7%
4
 
2.7%
3
 
2.0%
3
 
2.0%
2
 
1.4%
Other values (22) 28
18.9%
Common
ValueCountFrequency (%)
2652
73.3%
: 170
 
4.7%
. 169
 
4.7%
( 156
 
4.3%
) 144
 
4.0%
· 60
 
1.7%
- 52
 
1.4%
, 44
 
1.2%
& 42
 
1.2%
2 39
 
1.1%
Other values (13) 88
 
2.4%
Hiragana
ValueCountFrequency (%)
13
19.1%
13
19.1%
13
19.1%
13
19.1%
13
19.1%
1
 
1.5%
1
 
1.5%
1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51896
64.6%
ASCII 26558
33.1%
CJK 1535
 
1.9%
Katakana 148
 
0.2%
Hiragana 68
 
0.1%
None 60
 
0.1%
CJK Compat Ideographs 15
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3303
 
6.4%
3058
 
5.9%
2344
 
4.5%
2177
 
4.2%
1892
 
3.6%
1865
 
3.6%
1738
 
3.3%
1732
 
3.3%
1559
 
3.0%
1319
 
2.5%
Other values (543) 30909
59.6%
ASCII
ValueCountFrequency (%)
2652
 
10.0%
r 2277
 
8.6%
a 2260
 
8.5%
o 2227
 
8.4%
e 1851
 
7.0%
n 1687
 
6.4%
s 1362
 
5.1%
t 1147
 
4.3%
i 1033
 
3.9%
T 978
 
3.7%
Other values (60) 9084
34.2%
CJK
ValueCountFrequency (%)
74
 
4.8%
66
 
4.3%
66
 
4.3%
61
 
4.0%
59
 
3.8%
48
 
3.1%
46
 
3.0%
42
 
2.7%
42
 
2.7%
40
 
2.6%
Other values (184) 991
64.6%
None
ValueCountFrequency (%)
· 60
100.0%
Katakana
ValueCountFrequency (%)
33
22.3%
31
20.9%
30
20.3%
5
 
3.4%
5
 
3.4%
4
 
2.7%
4
 
2.7%
3
 
2.0%
3
 
2.0%
2
 
1.4%
Other values (22) 28
18.9%
Hiragana
ValueCountFrequency (%)
13
19.1%
13
19.1%
13
19.1%
13
19.1%
13
19.1%
1
 
1.5%
1
 
1.5%
1
 
1.5%
CJK Compat Ideographs
ValueCountFrequency (%)
8
53.3%
3
 
20.0%
2
 
13.3%
1
 
6.7%
1
 
6.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct2251
Distinct (%)22.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:05:53.839391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length44
Mean length7.6815
Min length1

Characters and Unicode

Total characters76815
Distinct characters600
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1390 ?
Unique (%)13.9%

Sample

1st row골든벨
2nd rowTRB
3rd row길벗
4th row입화서방
5th row교통안전진흥공단
ValueCountFrequency (%)
도로교통안전관리공단 811
 
8.1%
도로교통공단 620
 
6.2%
도로교통안전협회 515
 
5.1%
trb 366
 
3.7%
transportationresearchboard 354
 
3.5%
대한교통학회 258
 
2.6%
교통개발연구원 182
 
1.8%
경찰청 177
 
1.8%
pergamon 168
 
1.7%
thejournalpress 125
 
1.2%
Other values (2243) 6426
64.2%
2023-12-12T14:05:54.333350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3381
 
4.4%
R 3141
 
4.1%
3124
 
4.1%
A 2453
 
3.2%
2369
 
3.1%
O 2364
 
3.1%
2188
 
2.8%
T 2125
 
2.8%
E 1971
 
2.6%
1901
 
2.5%
Other values (590) 51798
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53663
69.9%
Uppercase Letter 23002
29.9%
Decimal Number 102
 
0.1%
Other Punctuation 42
 
0.1%
Space Separator 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3381
 
6.3%
3124
 
5.8%
2369
 
4.4%
2188
 
4.1%
1901
 
3.5%
1870
 
3.5%
1799
 
3.4%
1770
 
3.3%
1637
 
3.1%
1393
 
2.6%
Other values (552) 32231
60.1%
Uppercase Letter
ValueCountFrequency (%)
R 3141
13.7%
A 2453
10.7%
O 2364
10.3%
T 2125
9.2%
E 1971
8.6%
N 1737
 
7.6%
S 1702
 
7.4%
I 1177
 
5.1%
C 1031
 
4.5%
P 898
 
3.9%
Other values (15) 4403
19.1%
Decimal Number
ValueCountFrequency (%)
2 39
38.2%
1 39
38.2%
9 6
 
5.9%
3 6
 
5.9%
5 3
 
2.9%
6 3
 
2.9%
0 3
 
2.9%
8 2
 
2.0%
4 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
42
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53662
69.9%
Latin 23002
29.9%
Common 150
 
0.2%
Katakana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3381
 
6.3%
3124
 
5.8%
2369
 
4.4%
2188
 
4.1%
1901
 
3.5%
1870
 
3.5%
1799
 
3.4%
1770
 
3.3%
1637
 
3.1%
1393
 
2.6%
Other values (551) 32230
60.1%
Latin
ValueCountFrequency (%)
R 3141
13.7%
A 2453
10.7%
O 2364
10.3%
T 2125
9.2%
E 1971
8.6%
N 1737
 
7.6%
S 1702
 
7.4%
I 1177
 
5.1%
C 1031
 
4.5%
P 898
 
3.9%
Other values (15) 4403
19.1%
Common
ValueCountFrequency (%)
42
28.0%
2 39
26.0%
1 39
26.0%
9 6
 
4.0%
3 6
 
4.0%
5 3
 
2.0%
6 3
 
2.0%
0 3
 
2.0%
2
 
1.3%
( 2
 
1.3%
Other values (3) 5
 
3.3%
Katakana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53661
69.9%
ASCII 23110
30.1%
None 42
 
0.1%
Compat Jamo 1
 
< 0.1%
Katakana 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3381
 
6.3%
3124
 
5.8%
2369
 
4.4%
2188
 
4.1%
1901
 
3.5%
1870
 
3.5%
1799
 
3.4%
1770
 
3.3%
1637
 
3.1%
1393
 
2.6%
Other values (550) 32229
60.1%
ASCII
ValueCountFrequency (%)
R 3141
13.6%
A 2453
10.6%
O 2364
10.2%
T 2125
9.2%
E 1971
8.5%
N 1737
 
7.5%
S 1702
 
7.4%
I 1177
 
5.1%
C 1031
 
4.5%
P 898
 
3.9%
Other values (27) 4511
19.5%
None
ValueCountFrequency (%)
42
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%

출판년도
Real number (ℝ)

Distinct59
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1998.8954
Minimum1900
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:05:54.760517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1900
5-th percentile1981
Q11993
median2001
Q32007
95-th percentile2015
Maximum2021
Range121
Interquartile range (IQR)14

Descriptive statistics

Standard deviation14.080983
Coefficient of variation (CV)0.0070443822
Kurtosis20.6181
Mean1998.8954
Median Absolute Deviation (MAD)7
Skewness-3.2953427
Sum19988954
Variance198.27409
MonotonicityNot monotonic
2023-12-12T14:05:54.920192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2007 485
 
4.9%
2004 478
 
4.8%
1996 401
 
4.0%
2002 396
 
4.0%
2000 387
 
3.9%
2001 371
 
3.7%
2003 358
 
3.6%
2008 337
 
3.4%
1999 331
 
3.3%
2005 304
 
3.0%
Other values (49) 6152
61.5%
ValueCountFrequency (%)
1900 94
0.9%
1950 1
 
< 0.1%
1953 1
 
< 0.1%
1963 1
 
< 0.1%
1966 1
 
< 0.1%
1967 4
 
< 0.1%
1969 22
 
0.2%
1970 4
 
< 0.1%
1971 8
 
0.1%
1972 6
 
0.1%
ValueCountFrequency (%)
2021 22
 
0.2%
2020 51
 
0.5%
2019 76
 
0.8%
2018 94
 
0.9%
2017 99
 
1.0%
2016 151
1.5%
2015 133
1.3%
2014 165
1.7%
2013 230
2.3%
2012 257
2.6%

청구기호
Text

MISSING 

Distinct5110
Distinct (%)62.3%
Missing1793
Missing (%)17.9%
Memory size156.2 KiB
2023-12-12T14:05:55.191168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length13.199951
Min length6

Characters and Unicode

Total characters108332
Distinct characters94
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4206 ?
Unique (%)51.2%

Sample

1st row629.28-ㄱ894ㅇ
2nd row388.06-T783t-ST
3rd row005.5-ㄱ432ㅍ-CD
4th row363.12-ㅁ592ㄱ
5th row363.1257-ㄷ68ㅇ-SR
ValueCountFrequency (%)
363.12565-ㄷ68ㄱ-sr 118
 
1.4%
363.1251-ㄷ68ㄱ-sr 76
 
0.9%
388.072-ㄷ52ㄷ 75
 
0.9%
340.52519-ㅂ754ㄷ 66
 
0.8%
624.072-ㄷ52ㄷ 59
 
0.7%
388.071-ㄷ68ㄱ-sr 55
 
0.7%
363.1251-ㄷ68ㄱ 49
 
0.6%
363.12505-ㄷ68ㄱ-sr 44
 
0.5%
388.06-t783t-st 43
 
0.5%
363.1251-ㄷ68ㅈ 40
 
0.5%
Other values (5101) 7584
92.4%
2023-12-12T14:05:55.894846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 10958
 
10.1%
- 10852
 
10.0%
8 8815
 
8.1%
2 8191
 
7.6%
6 7703
 
7.1%
. 7600
 
7.0%
1 7053
 
6.5%
5 6685
 
6.2%
4 5431
 
5.0%
7 4915
 
4.5%
Other values (84) 30129
27.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 68425
63.2%
Other Letter 14396
 
13.3%
Dash Punctuation 10852
 
10.0%
Other Punctuation 7609
 
7.0%
Uppercase Letter 6205
 
5.7%
Lowercase Letter 834
 
0.8%
Other Symbol 8
 
< 0.1%
Space Separator 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3354
23.3%
2395
16.6%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (22) 708
 
4.9%
Lowercase Letter
ValueCountFrequency (%)
t 160
19.2%
p 77
 
9.2%
s 60
 
7.2%
a 58
 
7.0%
r 49
 
5.9%
i 47
 
5.6%
c 44
 
5.3%
d 38
 
4.6%
h 37
 
4.4%
m 35
 
4.2%
Other values (14) 229
27.5%
Uppercase Letter
ValueCountFrequency (%)
S 2464
39.7%
R 1306
21.0%
K 780
 
12.6%
T 754
 
12.2%
D 204
 
3.3%
P 131
 
2.1%
I 107
 
1.7%
C 90
 
1.5%
E 72
 
1.2%
B 57
 
0.9%
Other values (13) 240
 
3.9%
Decimal Number
ValueCountFrequency (%)
3 10958
16.0%
8 8815
12.9%
2 8191
12.0%
6 7703
11.3%
1 7053
10.3%
5 6685
9.8%
4 5431
7.9%
7 4915
7.2%
0 4487
6.6%
9 4187
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 7600
99.9%
/ 9
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 10852
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 86897
80.2%
Hangul 14396
 
13.3%
Latin 7039
 
6.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 2464
35.0%
R 1306
18.6%
K 780
 
11.1%
T 754
 
10.7%
D 204
 
2.9%
t 160
 
2.3%
P 131
 
1.9%
I 107
 
1.5%
C 90
 
1.3%
p 77
 
1.1%
Other values (37) 966
 
13.7%
Hangul
ValueCountFrequency (%)
3354
23.3%
2395
16.6%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (22) 708
 
4.9%
Common
ValueCountFrequency (%)
3 10958
12.6%
- 10852
12.5%
8 8815
10.1%
2 8191
9.4%
6 7703
8.9%
. 7600
8.7%
1 7053
8.1%
5 6685
7.7%
4 5431
6.2%
7 4915
5.7%
Other values (5) 8694
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 93928
86.7%
Compat Jamo 14381
 
13.3%
Hangul 15
 
< 0.1%
Geometric Shapes 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 10958
11.7%
- 10852
11.6%
8 8815
9.4%
2 8191
8.7%
6 7703
8.2%
. 7600
8.1%
1 7053
7.5%
5 6685
7.1%
4 5431
 
5.8%
7 4915
 
5.2%
Other values (51) 15725
16.7%
Compat Jamo
ValueCountFrequency (%)
3354
23.3%
2395
16.7%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (9) 693
 
4.8%
Geometric Shapes
ValueCountFrequency (%)
8
100.0%
Hangul
ValueCountFrequency (%)
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (3) 3
20.0%

색인청구기호
Text

MISSING 

Distinct5110
Distinct (%)62.3%
Missing1793
Missing (%)17.9%
Memory size156.2 KiB
2023-12-12T14:05:56.306828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length13.198855
Min length5

Characters and Unicode

Total characters108323
Distinct characters93
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4206 ?
Unique (%)51.2%

Sample

1st row629.28 ㄱ894ㅇ
2nd row388.06 T783t ST
3rd row005.5 ㄱ432ㅍ CD
4th row363.12 ㅁ592ㄱ
5th row363.1257 ㄷ68ㅇ SR
ValueCountFrequency (%)
sr 1277
 
6.7%
sk 636
 
3.3%
ㄷ68ㄱ 554
 
2.9%
st 364
 
1.9%
363.1251 347
 
1.8%
388.06 313
 
1.6%
363.12565 238
 
1.2%
363.125 204
 
1.1%
ㄷ68ㅈ 152
 
0.8%
ㄷ52ㄷ 142
 
0.7%
Other values (5683) 14819
77.8%
2023-12-12T14:05:56.811172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 10958
 
10.1%
10846
 
10.0%
8 8815
 
8.1%
2 8191
 
7.6%
6 7703
 
7.1%
. 7600
 
7.0%
1 7053
 
6.5%
5 6685
 
6.2%
4 5431
 
5.0%
7 4915
 
4.5%
Other values (83) 30126
27.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 68425
63.2%
Other Letter 14396
 
13.3%
Space Separator 10846
 
10.0%
Other Punctuation 7609
 
7.0%
Uppercase Letter 6205
 
5.7%
Lowercase Letter 834
 
0.8%
Other Symbol 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3354
23.3%
2395
16.6%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (22) 708
 
4.9%
Lowercase Letter
ValueCountFrequency (%)
t 160
19.2%
p 77
 
9.2%
s 60
 
7.2%
a 58
 
7.0%
r 49
 
5.9%
i 47
 
5.6%
c 44
 
5.3%
d 38
 
4.6%
h 37
 
4.4%
m 35
 
4.2%
Other values (14) 229
27.5%
Uppercase Letter
ValueCountFrequency (%)
S 2464
39.7%
R 1306
21.0%
K 780
 
12.6%
T 754
 
12.2%
D 204
 
3.3%
P 131
 
2.1%
I 107
 
1.7%
C 90
 
1.5%
E 72
 
1.2%
B 57
 
0.9%
Other values (13) 240
 
3.9%
Decimal Number
ValueCountFrequency (%)
3 10958
16.0%
8 8815
12.9%
2 8191
12.0%
6 7703
11.3%
1 7053
10.3%
5 6685
9.8%
4 5431
7.9%
7 4915
7.2%
0 4487
6.6%
9 4187
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 7600
99.9%
/ 9
 
0.1%
Space Separator
ValueCountFrequency (%)
10846
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 86888
80.2%
Hangul 14396
 
13.3%
Latin 7039
 
6.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 2464
35.0%
R 1306
18.6%
K 780
 
11.1%
T 754
 
10.7%
D 204
 
2.9%
t 160
 
2.3%
P 131
 
1.9%
I 107
 
1.5%
C 90
 
1.3%
p 77
 
1.1%
Other values (37) 966
 
13.7%
Hangul
ValueCountFrequency (%)
3354
23.3%
2395
16.6%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (22) 708
 
4.9%
Common
ValueCountFrequency (%)
3 10958
12.6%
10846
12.5%
8 8815
10.1%
2 8191
9.4%
6 7703
8.9%
. 7600
8.7%
1 7053
8.1%
5 6685
7.7%
4 5431
6.3%
7 4915
5.7%
Other values (4) 8691
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 93919
86.7%
Compat Jamo 14381
 
13.3%
Hangul 15
 
< 0.1%
Geometric Shapes 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 10958
11.7%
10846
11.5%
8 8815
9.4%
2 8191
8.7%
6 7703
8.2%
. 7600
8.1%
1 7053
7.5%
5 6685
7.1%
4 5431
 
5.8%
7 4915
 
5.2%
Other values (50) 15722
16.7%
Compat Jamo
ValueCountFrequency (%)
3354
23.3%
2395
16.7%
2129
14.8%
1298
 
9.0%
1261
 
8.8%
1204
 
8.4%
807
 
5.6%
464
 
3.2%
445
 
3.1%
331
 
2.3%
Other values (9) 693
 
4.8%
Geometric Shapes
ValueCountFrequency (%)
8
100.0%
Hangul
ValueCountFrequency (%)
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (3) 3
20.0%

권책기호
Text

MISSING 

Distinct699
Distinct (%)26.4%
Missing7350
Missing (%)73.5%
Memory size156.2 KiB
2023-12-12T14:05:57.174253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length4.6826415
Min length1

Characters and Unicode

Total characters12409
Distinct characters99
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique510 ?
Unique (%)19.2%

Sample

1st row1981/續
2nd rowvol.2
3rd rowv.5
4th row1999,v.1
5th row1996
ValueCountFrequency (%)
v.1 232
 
8.2%
v.2 203
 
7.2%
v.3 105
 
3.7%
v.4 63
 
2.2%
v.5 57
 
2.0%
2003 56
 
2.0%
2004 53
 
1.9%
1996 51
 
1.8%
2005 49
 
1.7%
2002 48
 
1.7%
Other values (580) 1905
67.5%
2023-12-12T14:05:57.742355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 1764
14.2%
v 1593
12.8%
1 1576
12.7%
2 1482
11.9%
0 1453
11.7%
9 1079
8.7%
3 456
 
3.7%
4 368
 
3.0%
8 331
 
2.7%
, 310
 
2.5%
Other values (89) 1997
16.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7607
61.3%
Other Punctuation 2119
 
17.1%
Lowercase Letter 1916
 
15.4%
Space Separator 173
 
1.4%
Dash Punctuation 143
 
1.2%
Open Punctuation 132
 
1.1%
Close Punctuation 132
 
1.1%
Other Letter 116
 
0.9%
Uppercase Letter 69
 
0.6%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
6.9%
7
 
6.0%
6
 
5.2%
6
 
5.2%
6
 
5.2%
4
 
3.4%
4
 
3.4%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (49) 66
56.9%
Lowercase Letter
ValueCountFrequency (%)
v 1593
83.1%
n 171
 
8.9%
o 64
 
3.3%
l 53
 
2.8%
t 13
 
0.7%
h 13
 
0.7%
e 3
 
0.2%
a 2
 
0.1%
b 2
 
0.1%
r 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 1576
20.7%
2 1482
19.5%
0 1453
19.1%
9 1079
14.2%
3 456
 
6.0%
4 368
 
4.8%
8 331
 
4.4%
6 305
 
4.0%
5 288
 
3.8%
7 269
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
C 18
26.1%
B 15
21.7%
A 15
21.7%
D 14
20.3%
F 3
 
4.3%
M 1
 
1.4%
J 1
 
1.4%
R 1
 
1.4%
V 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 1764
83.2%
, 310
 
14.6%
' 39
 
1.8%
/ 5
 
0.2%
& 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
173
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%
Open Punctuation
ValueCountFrequency (%)
( 132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 132
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10308
83.1%
Latin 1985
 
16.0%
Hangul 115
 
0.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
7.0%
7
 
6.1%
6
 
5.2%
6
 
5.2%
6
 
5.2%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (48) 65
56.5%
Common
ValueCountFrequency (%)
. 1764
17.1%
1 1576
15.3%
2 1482
14.4%
0 1453
14.1%
9 1079
10.5%
3 456
 
4.4%
4 368
 
3.6%
8 331
 
3.2%
, 310
 
3.0%
6 305
 
3.0%
Other values (10) 1184
11.5%
Latin
ValueCountFrequency (%)
v 1593
80.3%
n 171
 
8.6%
o 64
 
3.2%
l 53
 
2.7%
C 18
 
0.9%
B 15
 
0.8%
A 15
 
0.8%
D 14
 
0.7%
t 13
 
0.7%
h 13
 
0.7%
Other values (10) 16
 
0.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12293
99.1%
Hangul 115
 
0.9%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1764
14.3%
v 1593
13.0%
1 1576
12.8%
2 1482
12.1%
0 1453
11.8%
9 1079
8.8%
3 456
 
3.7%
4 368
 
3.0%
8 331
 
2.7%
, 310
 
2.5%
Other values (30) 1881
15.3%
Hangul
ValueCountFrequency (%)
8
 
7.0%
7
 
6.1%
6
 
5.2%
6
 
5.2%
6
 
5.2%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (48) 65
56.5%
CJK
ValueCountFrequency (%)
1
100.0%

복본기호
Categorical

IMBALANCE 

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8941 
2
 
814
3
 
217
4
 
8
5
 
4
Other values (12)
 
16

Length

Max length4
Median length4
Mean length3.6838
Min length1

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row2
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8941
89.4%
2 814
 
8.1%
3 217
 
2.2%
4 8
 
0.1%
5 4
 
< 0.1%
7 2
 
< 0.1%
9 2
 
< 0.1%
6 2
 
< 0.1%
1 2
 
< 0.1%
v.6 1
 
< 0.1%
Other values (7) 7
 
0.1%

Length

2023-12-12T14:05:58.016427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 8941
89.4%
2 814
 
8.1%
3 217
 
2.2%
4 8
 
0.1%
5 4
 
< 0.1%
6 2
 
< 0.1%
1 2
 
< 0.1%
9 2
 
< 0.1%
7 2
 
< 0.1%
v.6 1
 
< 0.1%
Other values (7) 7
 
0.1%

별치기호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7153 
SR
1246 
SK
 
703
ST
 
369
KP
 
218
Other values (10)
 
311

Length

Max length4
Median length4
Mean length3.4315
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd rowST
3rd rowCD
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7153
71.5%
SR 1246
 
12.5%
SK 703
 
7.0%
ST 369
 
3.7%
KP 218
 
2.2%
SI 79
 
0.8%
CD 77
 
0.8%
SD 52
 
0.5%
EB 44
 
0.4%
SF 24
 
0.2%
Other values (5) 35
 
0.4%

Length

2023-12-12T14:05:58.169981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 7153
71.5%
sr 1246
 
12.5%
sk 703
 
7.0%
st 369
 
3.7%
kp 218
 
2.2%
si 79
 
0.8%
cd 77
 
0.8%
sd 52
 
0.5%
eb 44
 
0.4%
sf 24
 
0.2%
Other values (5) 35
 
0.4%

소장처명
Categorical

IMBALANCE 

Distinct38
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도서실
6266 
<NA>
 
429
부산교통방송
 
381
대구교통방송
 
295
전북교통방송
 
285
Other values (33)
2344 

Length

Max length7
Median length3
Mean length3.9226
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row전북지부
2nd row도서실
3rd row도서실
4th row도서실
5th row도서실

Common Values

ValueCountFrequency (%)
도서실 6266
62.7%
<NA> 429
 
4.3%
부산교통방송 381
 
3.8%
대구교통방송 295
 
2.9%
전북교통방송 285
 
2.9%
인천교통방송 226
 
2.3%
경찰청 218
 
2.2%
경북지부 203
 
2.0%
서울특별시지부 153
 
1.5%
광주교통방송 148
 
1.5%
Other values (28) 1396
 
14.0%

Length

2023-12-12T14:05:58.340569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
도서실 6266
62.7%
na 429
 
4.3%
부산교통방송 381
 
3.8%
대구교통방송 295
 
2.9%
전북교통방송 285
 
2.9%
인천교통방송 226
 
2.3%
경찰청 218
 
2.2%
경북지부 203
 
2.0%
서울특별시지부 153
 
1.5%
광주교통방송 148
 
1.5%
Other values (28) 1396
 
14.0%

자료유형
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단행본
6264 
기사
1840 
공단연구보고서
1215 
연속간행물
 
488
비도서자료
 
97
Other values (2)
 
96

Length

Max length7
Median length3
Mean length3.4242
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단행본
2nd row단행본
3rd row비도서자료
4th row단행본
5th row기사

Common Values

ValueCountFrequency (%)
단행본 6264
62.6%
기사 1840
 
18.4%
공단연구보고서 1215
 
12.2%
연속간행물 488
 
4.9%
비도서자료 97
 
1.0%
학위논문 52
 
0.5%
전자책 44
 
0.4%

Length

2023-12-12T14:05:58.534902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:05:58.673246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단행본 6264
62.6%
기사 1840
 
18.4%
공단연구보고서 1215
 
12.2%
연속간행물 488
 
4.9%
비도서자료 97
 
1.0%
학위논문 52
 
0.5%
전자책 44
 
0.4%

Interactions

2023-12-12T14:05:47.186775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:05:58.772663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년도복본기호별치기호소장처명자료유형
출판년도1.0000.2030.3240.4030.305
복본기호0.2031.0000.0000.1100.150
별치기호0.3240.0001.0000.6930.981
소장처명0.4030.1100.6931.0000.409
자료유형0.3050.1500.9810.4091.000
2023-12-12T14:05:58.904680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소장처명별치기호복본기호자료유형
소장처명1.0000.2980.0330.180
별치기호0.2981.0000.0000.788
복본기호0.0330.0001.0000.076
자료유형0.1800.7880.0761.000
2023-12-12T14:05:59.031980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년도복본기호별치기호소장처명자료유형
출판년도1.0000.0770.1880.1710.178
복본기호0.0771.0000.0000.0330.076
별치기호0.1880.0001.0000.2980.788
소장처명0.1710.0330.2981.0000.180
자료유형0.1780.0760.7880.1801.000

Missing values

2023-12-12T14:05:47.376328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:05:47.640427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:05:47.907077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

등록번호서명색인서명저자색인저자출판사색인출판사출판년도청구기호색인청구기호권책기호복본기호별치기호소장처명자료유형
282319000074오토 CAR 운전테크닉 : 자동변속운전면허 교재오토CAR운전테크닉자동변속운전면허교재김재윤,조칠호,황만식김재윤조칠호황만식골든벨골든벨1998629.28-ㄱ894ㅇ629.28 ㄱ894ㅇ<NA><NA><NA>전북지부단행본
2095321018Transit 2014TRANSIT2014Transportation Research BoardTRANSPORTATIONRESEARCHBOARDTRBTRB2014388.06-T783t-ST388.06 T783t ST<NA><NA>ST도서실단행본
1117911205파워포인트 프레젠테이션 실무활용테크닉 : 1분1초가 아까운 비즈니스맨을 위한파워포인트프레젠테이션실무활용테크닉1분1초가아까운비즈니스맨을위한공병훈공병훈길벗길벗2004005.5-ㄱ432ㅍ-CD005.5 ㄱ432ㅍ CD<NA><NA>CD도서실비도서자료
42584270交通安全 : 話の花束교통안전화노화속목내준부목내준부立花書房입화서방1979363.12-ㅁ592ㄱ363.12 ㅁ592ㄱ1981/續2<NA>도서실단행본
43651AR002732노년층 인구 증가에 대비한 교통안전노년층인구증가에대비한교통안전김준식김준식교통안전진흥공단교통안전진흥공단1997<NA><NA><NA><NA><NA>도서실기사
577580어린이 도로횡단 프로그램 개발 및 타당성에 관한 연구 : 횡단행도 및 통행실태조사를 중심으로어린이도로횡단프로그램개발및타당성에관한연구횡단행도및통행실태조사를중심으로도로교통안전협회도로교통안전협회도로교통안전협회도로교통안전협회1988363.1257-ㄷ68ㅇ-SR363.1257 ㄷ68ㅇ SR<NA>2SR도서실공단연구보고서
2059920662한양도성 연접지역 실태분석 및 합리적 관리방안 연구 = (A) study on the management system for the region adjacent to Hanyang-Dosung in Seoul한양도성연접지역실태분석및합리적관리방안연구=ASTUDYONTHEMANAGEMENTSYSTEMFORTHEREGIONADJACENTTOHANYANGDOSUNGINSEOUL장남종장남종서울연구원서울연구원2013711.58-ㅈ136ㅎ-SK711.58 ㅈ136ㅎ SK<NA><NA>SK도서실단행본
42797AR001864사업용 자동차 운전자의 사고특성에 관한 연구(Ⅴ)사업용자동차운전자의사고특성에관한연구Ⅴ최완석최완석교통안전진흥공단교통안전진흥공단1988<NA><NA><NA><NA><NA>도서실기사
48096AR011454철도화물운송을 위한 Hub-and-spokes서비스네트워크 디자인모형의 개발철도화물운송을위한HUBANDSPOKES서비스네트워크디자인모형의개발정승주정승주대한교통학회대한교통학회2003388.072388.072<NA><NA><NA><NA>기사
2277222861건축기술지침 = Architectural engineering guide : rev.1 : 기계건축기술지침=ARCHITECTURALENGINEERINGGUIDEREV1기계대한건축학회대한건축학회대한건축학회대한건축학회2018694-ㄷ52694 ㄷ52vol.2<NA><NA>도서실단행본
등록번호서명색인서명저자색인저자출판사색인출판사출판년도청구기호색인청구기호권책기호복본기호별치기호소장처명자료유형
3467016001228강변마을강변마을전경린전경린현대문학현대문학2010811.32-ㅈ264ㄱ811.32 ㅈ264ㄱ<NA><NA><NA>대구교통방송단행본
45206AR007328Forecasting Intermodal Competition in a Multimodal EnvironmentFORECASTINGINTERMODALCOMPETITIONINAMULTIMODALENVIRONMENTNeels, KevinNEELSKEVINTransportation Research BoardTRANSPORTATIONRESEARCHBOARD1987<NA><NA><NA><NA><NA>도서실기사
15821590교통단속용 무인장비 인수 성능시험 보고서교통단속용무인장비인수성능시험보고서도로교통안전협회도로교통안전협회도로교통안전협회도로교통안전협회1998629.04-ㄷ68ㄱ-SR629.04 ㄷ68ㄱ SR<NA>2SR도서실공단연구보고서
1860718667(제2차)여수시 교통안전기본계획제2차여수시교통안전기본계획여수시여수시여수시여수시2011353.98-ㅇ337ㅇ-SR353.98 ㅇ337ㅇ SR2011,v.2<NA>SR도서실공단연구보고서
2369023780시선으로부터 : 정세랑 장편소설시선으로부터정세랑장편소설정세랑정세랑문학동네문학동네2020895.735-ㅈ416ㅅ895.735 ㅈ416ㅅ<NA><NA><NA>도서실단행본
251533000121자동차관리법 및 안전관리자동차관리법및안전관리장상수장상수골든 벨골든벨1990343.094-ㅈ152ㅈ343.094 ㅈ152ㅈ<NA><NA><NA>대구광역시지부단행본
3242314001921세계미래보고서 2055세계미래보고서2055박영숙박영숙비즈니스북스비즈니스북스2017331.544331.544<NA><NA><NA>부산교통방송단행본
276757000418간부가 변하지 않으면 회사는 망한다간부가변하지않으면회사는망한다유지케 코지유지케코지한국생산성본부한국생산성본부1991174-ㅇ597ㄱ174 ㅇ597ㄱ<NA><NA><NA>충북지부단행본
1909019150대한교통학회지 = Journal of the transportation research society of korea대한교통학회지=JOURNALOFTHETRANSPORTATIONRESEARCHSOCIETYOFKOREA대한교통학회대한교통학회대한교통학회대한교통학회1983388.072-ㄷ52ㄷ388.072 ㄷ52ㄷv.126<NA><NA>도서실연속간행물
3887720001477노르웨이의 숲노르웨이의숲무라카미 하루키무라카미하루키민음사민음사2013813.30-ㅁ666ㄴ813.30 ㅁ666ㄴ<NA><NA><NA>전북교통방송단행본