Overview

Dataset statistics

Number of variables34
Number of observations10000
Missing cells30510
Missing cells (%)9.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 MiB
Average record size in memory285.0 B

Variable types

Numeric4
Text13
Unsupported3
Categorical13
DateTime1

Dataset

Description도로교통공단 소장도서 및 자료의 통제정보데이터(분류번호,국가코드,언어코드,자료상태코드,소장처코드 등)
Author도로교통공단
URLhttps://www.data.go.kr/data/15049024/fileData.do

Alerts

별치기호 is highly imbalanced (53.9%)Imbalance
언어코드 is highly imbalanced (76.6%)Imbalance
국가코드 is highly imbalanced (79.3%)Imbalance
자료유형코드 is highly imbalanced (55.0%)Imbalance
자료유형 is highly imbalanced (55.0%)Imbalance
마크유형구분 is highly imbalanced (71.2%)Imbalance
마크유형 is highly imbalanced (71.2%)Imbalance
포함자료 is highly imbalanced (60.6%)Imbalance
국가구분 is highly imbalanced (79.3%)Imbalance
분관정보 is highly imbalanced (65.7%)Imbalance
권책기호 has 6609 (66.1%) missing valuesMissing
복본기호 has 8660 (86.6%) missing valuesMissing
초록 has 9644 (96.4%) missing valuesMissing
국제표준도서번호 has 5342 (53.4%) missing valuesMissing
기관코드 has 207 (2.1%) missing valuesMissing
등록번호 has unique valuesUnique
복본기호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
보안등급 is an unsupported type, check if it needs cleaning or further analysisUnsupported
기관코드 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 21:57:35.812413
Analysis finished2023-12-12 21:57:39.803573
Duration3.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3234163.2
Minimum1
Maximum16001673
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:57:39.879913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1782.9
Q18888
median17554.5
Q34000676
95-th percentile16000004
Maximum16001673
Range16001672
Interquartile range (IQR)3991788

Descriptive statistics

Standard deviation5518517.9
Coefficient of variation (CV)1.7063202
Kurtosis0.2225516
Mean3234163.2
Median Absolute Deviation (MAD)11047
Skewness1.3874197
Sum3.2341632 × 1010
Variance3.045404 × 1013
MonotonicityNot monotonic
2023-12-13T06:57:40.034392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000510 1
 
< 0.1%
11474 1
 
< 0.1%
14000379 1
 
< 0.1%
20351 1
 
< 0.1%
16000138 1
 
< 0.1%
11197 1
 
< 0.1%
1000513 1
 
< 0.1%
13282 1
 
< 0.1%
4000475 1
 
< 0.1%
4798 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
14 1
< 0.1%
21 1
< 0.1%
25 1
< 0.1%
37 1
< 0.1%
39 1
< 0.1%
48 1
< 0.1%
50 1
< 0.1%
ValueCountFrequency (%)
16001673 1
< 0.1%
16001668 1
< 0.1%
16001666 1
< 0.1%
16001665 1
< 0.1%
16001663 1
< 0.1%
16001652 1
< 0.1%
16001648 1
< 0.1%
16001646 1
< 0.1%
16001644 1
< 0.1%
16001643 1
< 0.1%

서명
Text

Distinct6989
Distinct (%)69.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:40.390810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length209
Median length132
Mean length26.7109
Min length1

Characters and Unicode

Total characters267109
Distinct characters1692
Distinct categories16 ?
Distinct scripts7 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5796 ?
Unique (%)58.0%

Sample

1st row2009년판 교통사고통계분석
2nd rowEnjoy 괌 : No plan! No problem!
3rd row(알짜배기 세계여행 시리즈)중국
4th row카인의 후예 : 황순원 작품선
5th row교통공학 : 개정판
ValueCountFrequency (%)
4520
 
8.6%
of 967
 
1.8%
651
 
1.2%
연구 606
 
1.2%
and 563
 
1.1%
the 539
 
1.0%
관한 377
 
0.7%
위한 371
 
0.7%
journal 285
 
0.5%
transportation 276
 
0.5%
Other values (16795) 43234
82.5%
2023-12-13T06:57:40.906200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43153
 
16.2%
e 7104
 
2.7%
n 6635
 
2.5%
a 6544
 
2.4%
o 6357
 
2.4%
i 5896
 
2.2%
t 5654
 
2.1%
r 5352
 
2.0%
s 4010
 
1.5%
3441
 
1.3%
Other values (1682) 172963
64.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128818
48.2%
Lowercase Letter 68481
25.6%
Space Separator 43153
 
16.2%
Uppercase Letter 9450
 
3.5%
Decimal Number 6973
 
2.6%
Other Punctuation 5441
 
2.0%
Open Punctuation 1490
 
0.6%
Close Punctuation 1489
 
0.6%
Math Symbol 1224
 
0.5%
Dash Punctuation 451
 
0.2%
Other values (6) 139
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3441
 
2.7%
3284
 
2.5%
3174
 
2.5%
2531
 
2.0%
2265
 
1.8%
2091
 
1.6%
2087
 
1.6%
1839
 
1.4%
1720
 
1.3%
1717
 
1.3%
Other values (1576) 104669
81.3%
Lowercase Letter
ValueCountFrequency (%)
e 7104
10.4%
n 6635
9.7%
a 6544
9.6%
o 6357
9.3%
i 5896
 
8.6%
t 5654
 
8.3%
r 5352
 
7.8%
s 4010
 
5.9%
c 2872
 
4.2%
l 2705
 
4.0%
Other values (17) 15352
22.4%
Uppercase Letter
ValueCountFrequency (%)
S 1137
12.0%
T 1009
 
10.7%
C 755
 
8.0%
E 660
 
7.0%
A 611
 
6.5%
I 551
 
5.8%
P 540
 
5.7%
D 493
 
5.2%
R 450
 
4.8%
M 428
 
4.5%
Other values (16) 2816
29.8%
Other Punctuation
ValueCountFrequency (%)
: 3189
58.6%
, 774
 
14.2%
. 655
 
12.0%
· 236
 
4.3%
' 198
 
3.6%
! 142
 
2.6%
& 121
 
2.2%
/ 48
 
0.9%
; 31
 
0.6%
" 26
 
0.5%
Other values (4) 21
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 2161
31.0%
2 1514
21.7%
1 1313
18.8%
9 431
 
6.2%
3 407
 
5.8%
4 247
 
3.5%
5 247
 
3.5%
8 243
 
3.5%
6 231
 
3.3%
7 179
 
2.6%
Math Symbol
ValueCountFrequency (%)
= 1147
93.7%
+ 42
 
3.4%
~ 25
 
2.0%
> 4
 
0.3%
< 4
 
0.3%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1363
91.5%
[ 118
 
7.9%
4
 
0.3%
3
 
0.2%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1362
91.5%
] 118
 
7.9%
4
 
0.3%
3
 
0.2%
2
 
0.1%
Letter Number
ValueCountFrequency (%)
61
48.0%
44
34.6%
18
 
14.2%
4
 
3.1%
Other Symbol
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
43153
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 451
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124730
46.7%
Latin 78057
29.2%
Common 60233
22.5%
Han 3131
 
1.2%
Hiragana 575
 
0.2%
Katakana 382
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3441
 
2.8%
3284
 
2.6%
3174
 
2.5%
2531
 
2.0%
2265
 
1.8%
2091
 
1.7%
2087
 
1.7%
1839
 
1.5%
1720
 
1.4%
1717
 
1.4%
Other values (996) 100581
80.6%
Han
ValueCountFrequency (%)
199
 
6.4%
195
 
6.2%
88
 
2.8%
87
 
2.8%
65
 
2.1%
60
 
1.9%
53
 
1.7%
53
 
1.7%
49
 
1.6%
47
 
1.5%
Other values (457) 2235
71.4%
Katakana
ValueCountFrequency (%)
29
 
7.6%
26
 
6.8%
19
 
5.0%
19
 
5.0%
19
 
5.0%
16
 
4.2%
16
 
4.2%
14
 
3.7%
14
 
3.7%
14
 
3.7%
Other values (53) 196
51.3%
Latin
ValueCountFrequency (%)
e 7104
 
9.1%
n 6635
 
8.5%
a 6544
 
8.4%
o 6357
 
8.1%
i 5896
 
7.6%
t 5654
 
7.2%
r 5352
 
6.9%
s 4010
 
5.1%
c 2872
 
3.7%
l 2705
 
3.5%
Other values (46) 24928
31.9%
Hiragana
ValueCountFrequency (%)
172
29.9%
55
 
9.6%
42
 
7.3%
39
 
6.8%
34
 
5.9%
21
 
3.7%
15
 
2.6%
14
 
2.4%
13
 
2.3%
12
 
2.1%
Other values (40) 158
27.5%
Common
ValueCountFrequency (%)
43153
71.6%
: 3189
 
5.3%
0 2161
 
3.6%
2 1514
 
2.5%
( 1363
 
2.3%
) 1362
 
2.3%
1 1313
 
2.2%
= 1147
 
1.9%
, 774
 
1.3%
. 655
 
1.1%
Other values (39) 3602
 
6.0%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 137897
51.6%
Hangul 124716
46.7%
CJK 3079
 
1.2%
Hiragana 575
 
0.2%
Katakana 382
 
0.1%
None 258
 
0.1%
Number Forms 127
 
< 0.1%
CJK Compat Ideographs 52
 
< 0.1%
Compat Jamo 14
 
< 0.1%
CJK Compat 5
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43153
31.3%
e 7104
 
5.2%
n 6635
 
4.8%
a 6544
 
4.7%
o 6357
 
4.6%
i 5896
 
4.3%
t 5654
 
4.1%
r 5352
 
3.9%
s 4010
 
2.9%
: 3189
 
2.3%
Other values (76) 44003
31.9%
Hangul
ValueCountFrequency (%)
3441
 
2.8%
3284
 
2.6%
3174
 
2.5%
2531
 
2.0%
2265
 
1.8%
2091
 
1.7%
2087
 
1.7%
1839
 
1.5%
1720
 
1.4%
1717
 
1.4%
Other values (995) 100567
80.6%
None
ValueCountFrequency (%)
· 236
91.5%
4
 
1.6%
4
 
1.6%
3
 
1.2%
3
 
1.2%
2
 
0.8%
2
 
0.8%
1
 
0.4%
α 1
 
0.4%
1
 
0.4%
CJK
ValueCountFrequency (%)
199
 
6.5%
195
 
6.3%
88
 
2.9%
87
 
2.8%
65
 
2.1%
60
 
1.9%
53
 
1.7%
53
 
1.7%
49
 
1.6%
47
 
1.5%
Other values (437) 2183
70.9%
Hiragana
ValueCountFrequency (%)
172
29.9%
55
 
9.6%
42
 
7.3%
39
 
6.8%
34
 
5.9%
21
 
3.7%
15
 
2.6%
14
 
2.4%
13
 
2.3%
12
 
2.1%
Other values (40) 158
27.5%
Number Forms
ValueCountFrequency (%)
61
48.0%
44
34.6%
18
 
14.2%
4
 
3.1%
Katakana
ValueCountFrequency (%)
29
 
7.6%
26
 
6.8%
19
 
5.0%
19
 
5.0%
19
 
5.0%
16
 
4.2%
16
 
4.2%
14
 
3.7%
14
 
3.7%
14
 
3.7%
Other values (53) 196
51.3%
Compat Jamo
ValueCountFrequency (%)
14
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
10
19.2%
7
13.5%
5
9.6%
4
 
7.7%
4
 
7.7%
4
 
7.7%
3
 
5.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
Other values (10) 10
19.2%
CJK Compat
ValueCountFrequency (%)
5
100.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct6971
Distinct (%)69.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:41.350624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length173
Median length117
Mean length21.5002
Min length1

Characters and Unicode

Total characters215002
Distinct characters1061
Distinct categories8 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5777 ?
Unique (%)57.8%

Sample

1st row2009년판교통사고통계분석
2nd rowENJOY괌NOPLANNOPROBLEM
3rd row알짜배기세계여행시리즈중국
4th row카인의후예황순원작품선
5th row교통공학개정판
ValueCountFrequency (%)
대한교통학회지=journalofthetransportationresearchsocietyofkorea 107
 
1.1%
교수연구논문집 81
 
0.8%
교통안전연구논집 68
 
0.7%
교통사고통계 48
 
0.5%
대한민국현행법령집 47
 
0.5%
신호등 45
 
0.4%
교통기술과정책 39
 
0.4%
도로교통안전백서 33
 
0.3%
대한토목학회논문집a호=journalofthekoreansocietyofcivilengineers 28
 
0.3%
한국자동차공학회논문집=transactionsofkoreasocietyofengineers 26
 
0.3%
Other values (6973) 9491
94.8%
2023-12-13T06:57:42.034698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 7712
 
3.6%
A 7143
 
3.3%
N 6903
 
3.2%
O 6668
 
3.1%
T 6612
 
3.1%
I 6447
 
3.0%
R 5802
 
2.7%
S 5147
 
2.4%
3679
 
1.7%
C 3627
 
1.7%
Other values (1051) 155262
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128804
59.9%
Uppercase Letter 77762
36.2%
Decimal Number 6973
 
3.2%
Math Symbol 1191
 
0.6%
Letter Number 127
 
0.1%
Other Punctuation 124
 
0.1%
Space Separator 15
 
< 0.1%
Other Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3679
 
2.9%
3498
 
2.7%
3281
 
2.5%
2535
 
2.0%
2269
 
1.8%
2240
 
1.7%
2157
 
1.7%
1881
 
1.5%
1817
 
1.4%
1773
 
1.4%
Other values (999) 103674
80.5%
Uppercase Letter
ValueCountFrequency (%)
E 7712
9.9%
A 7143
 
9.2%
N 6903
 
8.9%
O 6668
 
8.6%
T 6612
 
8.5%
I 6447
 
8.3%
R 5802
 
7.5%
S 5147
 
6.6%
C 3627
 
4.7%
L 2927
 
3.8%
Other values (17) 18774
24.1%
Decimal Number
ValueCountFrequency (%)
0 2161
31.0%
2 1514
21.7%
1 1313
18.8%
9 431
 
6.2%
3 407
 
5.8%
4 247
 
3.5%
5 247
 
3.5%
8 243
 
3.5%
6 231
 
3.3%
7 179
 
2.6%
Math Symbol
ValueCountFrequency (%)
= 1147
96.3%
42
 
3.5%
1
 
0.1%
1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
121
97.6%
1
 
0.8%
1
 
0.8%
1
 
0.8%
Letter Number
ValueCountFrequency (%)
61
48.0%
44
34.6%
18
 
14.2%
4
 
3.1%
Other Symbol
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 128802
59.9%
Latin 77888
36.2%
Common 8309
 
3.9%
Katakana 1
 
< 0.1%
Greek 1
 
< 0.1%
Hiragana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3679
 
2.9%
3498
 
2.7%
3281
 
2.5%
2535
 
2.0%
2269
 
1.8%
2240
 
1.7%
2157
 
1.7%
1881
 
1.5%
1817
 
1.4%
1773
 
1.4%
Other values (997) 103672
80.5%
Latin
ValueCountFrequency (%)
E 7712
9.9%
A 7143
 
9.2%
N 6903
 
8.9%
O 6668
 
8.6%
T 6612
 
8.5%
I 6447
 
8.3%
R 5802
 
7.4%
S 5147
 
6.6%
C 3627
 
4.7%
L 2927
 
3.8%
Other values (20) 18900
24.3%
Common
ValueCountFrequency (%)
0 2161
26.0%
2 1514
18.2%
1 1313
15.8%
= 1147
13.8%
9 431
 
5.2%
3 407
 
4.9%
4 247
 
3.0%
5 247
 
3.0%
8 243
 
2.9%
6 231
 
2.8%
Other values (11) 368
 
4.4%
Katakana
ValueCountFrequency (%)
1
100.0%
Greek
ValueCountFrequency (%)
Α 1
100.0%
Hiragana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 128802
59.9%
ASCII 85896
40.0%
None 168
 
0.1%
Number Forms 127
 
0.1%
CJK Compat 5
 
< 0.1%
Geometric Shapes 1
 
< 0.1%
Katakana 1
 
< 0.1%
Punctuation 1
 
< 0.1%
Hiragana 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 7712
 
9.0%
A 7143
 
8.3%
N 6903
 
8.0%
O 6668
 
7.8%
T 6612
 
7.7%
I 6447
 
7.5%
R 5802
 
6.8%
S 5147
 
6.0%
C 3627
 
4.2%
L 2927
 
3.4%
Other values (28) 26908
31.3%
Hangul
ValueCountFrequency (%)
3679
 
2.9%
3498
 
2.7%
3281
 
2.5%
2535
 
2.0%
2269
 
1.8%
2240
 
1.7%
2157
 
1.7%
1881
 
1.5%
1817
 
1.4%
1773
 
1.4%
Other values (997) 103672
80.5%
None
ValueCountFrequency (%)
121
72.0%
42
 
25.0%
1
 
0.6%
1
 
0.6%
1
 
0.6%
Α 1
 
0.6%
1
 
0.6%
Number Forms
ValueCountFrequency (%)
61
48.0%
44
34.6%
18
 
14.2%
4
 
3.1%
CJK Compat
ValueCountFrequency (%)
5
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct4495
Distinct (%)45.0%
Missing13
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T06:57:42.371999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length47
Mean length8.2680485
Min length2

Characters and Unicode

Total characters82573
Distinct characters857
Distinct categories11 ?
Distinct scripts5 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3406 ?
Unique (%)34.1%

Sample

1st row도로교통공단
2nd row민보영
3rd row조창완
4th row황순원
5th row원제무.최재성
ValueCountFrequency (%)
도로교통공단 620
 
4.4%
research 522
 
3.7%
transportation 521
 
3.7%
board 518
 
3.7%
도로교통안전관리공단 442
 
3.2%
도로교통안전협회 384
 
2.7%
경찰청 205
 
1.5%
교통사고종합분석센터 169
 
1.2%
대한교통학회 156
 
1.1%
건설교통부 145
 
1.0%
Other values (5255) 10317
73.7%
2023-12-13T06:57:42.907607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4014
 
4.9%
a 2701
 
3.3%
2684
 
3.3%
r 2600
 
3.1%
2520
 
3.1%
o 2119
 
2.6%
1797
 
2.2%
e 1747
 
2.1%
1726
 
2.1%
n 1613
 
2.0%
Other values (847) 59052
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53886
65.3%
Lowercase Letter 18677
 
22.6%
Space Separator 4014
 
4.9%
Uppercase Letter 3590
 
4.3%
Other Punctuation 2189
 
2.7%
Decimal Number 103
 
0.1%
Close Punctuation 42
 
0.1%
Open Punctuation 42
 
0.1%
Dash Punctuation 23
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2684
 
5.0%
2520
 
4.7%
1797
 
3.3%
1726
 
3.2%
1401
 
2.6%
1316
 
2.4%
1233
 
2.3%
1198
 
2.2%
1151
 
2.1%
1027
 
1.9%
Other values (772) 37833
70.2%
Lowercase Letter
ValueCountFrequency (%)
a 2701
14.5%
r 2600
13.9%
o 2119
11.3%
e 1747
9.4%
n 1613
8.6%
t 1434
7.7%
s 1336
7.2%
i 1087
5.8%
c 790
 
4.2%
d 698
 
3.7%
Other values (16) 2552
13.7%
Uppercase Letter
ValueCountFrequency (%)
T 673
18.7%
B 621
17.3%
R 614
17.1%
S 229
 
6.4%
C 165
 
4.6%
E 149
 
4.2%
M 128
 
3.6%
A 126
 
3.5%
D 111
 
3.1%
K 87
 
2.4%
Other values (14) 687
19.1%
Decimal Number
ValueCountFrequency (%)
1 22
21.4%
2 19
18.4%
3 17
16.5%
6 12
11.7%
0 8
 
7.8%
4 8
 
7.8%
9 7
 
6.8%
5 6
 
5.8%
7 4
 
3.9%
Other Punctuation
ValueCountFrequency (%)
. 1333
60.9%
, 836
38.2%
& 11
 
0.5%
: 3
 
0.1%
' 3
 
0.1%
· 2
 
0.1%
/ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 36
85.7%
] 6
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 36
85.7%
[ 6
 
14.3%
Math Symbol
ValueCountFrequency (%)
< 3
50.0%
> 3
50.0%
Space Separator
ValueCountFrequency (%)
4014
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53258
64.5%
Latin 22268
27.0%
Common 6419
 
7.8%
Han 580
 
0.7%
Katakana 48
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2684
 
5.0%
2520
 
4.7%
1797
 
3.4%
1726
 
3.2%
1401
 
2.6%
1316
 
2.5%
1233
 
2.3%
1198
 
2.2%
1151
 
2.2%
1027
 
1.9%
Other values (681) 37205
69.9%
Han
ValueCountFrequency (%)
59
 
10.2%
59
 
10.2%
34
 
5.9%
28
 
4.8%
24
 
4.1%
22
 
3.8%
22
 
3.8%
21
 
3.6%
21
 
3.6%
21
 
3.6%
Other values (78) 269
46.4%
Latin
ValueCountFrequency (%)
a 2701
12.1%
r 2600
11.7%
o 2119
 
9.5%
e 1747
 
7.8%
n 1613
 
7.2%
t 1434
 
6.4%
s 1336
 
6.0%
i 1087
 
4.9%
c 790
 
3.5%
d 698
 
3.1%
Other values (41) 6143
27.6%
Common
ValueCountFrequency (%)
4014
62.5%
. 1333
 
20.8%
, 836
 
13.0%
) 36
 
0.6%
( 36
 
0.6%
- 23
 
0.4%
1 22
 
0.3%
2 19
 
0.3%
3 17
 
0.3%
6 12
 
0.2%
Other values (14) 71
 
1.1%
Katakana
ValueCountFrequency (%)
16
33.3%
16
33.3%
16
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53257
64.5%
ASCII 28684
34.7%
CJK 575
 
0.7%
Katakana 48
 
0.1%
CJK Compat Ideographs 5
 
< 0.1%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4014
14.0%
a 2701
 
9.4%
r 2600
 
9.1%
o 2119
 
7.4%
e 1747
 
6.1%
n 1613
 
5.6%
t 1434
 
5.0%
s 1336
 
4.7%
. 1333
 
4.6%
i 1087
 
3.8%
Other values (63) 8700
30.3%
Hangul
ValueCountFrequency (%)
2684
 
5.0%
2520
 
4.7%
1797
 
3.4%
1726
 
3.2%
1401
 
2.6%
1316
 
2.5%
1233
 
2.3%
1198
 
2.2%
1151
 
2.2%
1027
 
1.9%
Other values (680) 37204
69.9%
CJK
ValueCountFrequency (%)
59
 
10.3%
59
 
10.3%
34
 
5.9%
28
 
4.9%
24
 
4.2%
22
 
3.8%
22
 
3.8%
21
 
3.7%
21
 
3.7%
21
 
3.7%
Other values (75) 264
45.9%
Katakana
ValueCountFrequency (%)
16
33.3%
16
33.3%
16
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct4449
Distinct (%)44.5%
Missing13
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T06:57:43.338682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length42
Mean length7.6368279
Min length2

Characters and Unicode

Total characters76269
Distinct characters730
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3360 ?
Unique (%)33.6%

Sample

1st row도로교통공단
2nd row민보영
3rd row조창완
4th row황순원
5th row원제무최재성
ValueCountFrequency (%)
transportationresearchboard 516
 
5.2%
도로교통안전협회 382
 
3.8%
도로교통안전관리공단 350
 
3.5%
도로교통공단 293
 
2.9%
경찰청 195
 
2.0%
대한교통학회 156
 
1.6%
건설교통부 142
 
1.4%
도로교통공단교통사고종합분석센터 127
 
1.3%
대한토목학회 91
 
0.9%
교통개발연구원 68
 
0.7%
Other values (4440) 7668
76.8%
2023-12-13T06:57:43.912314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
R 3214
 
4.2%
A 2827
 
3.7%
2752
 
3.6%
2579
 
3.4%
O 2202
 
2.9%
T 2107
 
2.8%
E 1896
 
2.5%
1803
 
2.4%
1730
 
2.3%
N 1664
 
2.2%
Other values (720) 53495
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53886
70.7%
Uppercase Letter 22267
29.2%
Decimal Number 103
 
0.1%
Other Punctuation 11
 
< 0.1%
Space Separator 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2752
 
5.1%
2579
 
4.8%
1803
 
3.3%
1730
 
3.2%
1402
 
2.6%
1351
 
2.5%
1234
 
2.3%
1220
 
2.3%
1179
 
2.2%
1027
 
1.9%
Other values (682) 37609
69.8%
Uppercase Letter
ValueCountFrequency (%)
R 3214
14.4%
A 2827
12.7%
O 2202
9.9%
T 2107
9.5%
E 1896
8.5%
N 1664
7.5%
S 1565
7.0%
I 1173
 
5.3%
C 955
 
4.3%
D 809
 
3.6%
Other values (16) 3855
17.3%
Decimal Number
ValueCountFrequency (%)
1 22
21.4%
2 19
18.4%
3 17
16.5%
6 12
11.7%
0 8
 
7.8%
4 8
 
7.8%
9 7
 
6.8%
5 6
 
5.8%
7 4
 
3.9%
Other Punctuation
ValueCountFrequency (%)
11
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53886
70.7%
Latin 22268
29.2%
Common 115
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2752
 
5.1%
2579
 
4.8%
1803
 
3.3%
1730
 
3.2%
1402
 
2.6%
1351
 
2.5%
1234
 
2.3%
1220
 
2.3%
1179
 
2.2%
1027
 
1.9%
Other values (682) 37609
69.8%
Latin
ValueCountFrequency (%)
R 3214
14.4%
A 2827
12.7%
O 2202
9.9%
T 2107
9.5%
E 1896
8.5%
N 1664
7.5%
S 1565
7.0%
I 1173
 
5.3%
C 955
 
4.3%
D 809
 
3.6%
Other values (17) 3856
17.3%
Common
ValueCountFrequency (%)
1 22
19.1%
2 19
16.5%
3 17
14.8%
6 12
10.4%
11
9.6%
0 8
 
7.0%
4 8
 
7.0%
9 7
 
6.1%
5 6
 
5.2%
7 4
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53885
70.7%
ASCII 22371
29.3%
None 11
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 3214
14.4%
A 2827
12.6%
O 2202
9.8%
T 2107
9.4%
E 1896
8.5%
N 1664
7.4%
S 1565
 
7.0%
I 1173
 
5.2%
C 955
 
4.3%
D 809
 
3.6%
Other values (26) 3959
17.7%
Hangul
ValueCountFrequency (%)
2752
 
5.1%
2579
 
4.8%
1803
 
3.3%
1730
 
3.2%
1402
 
2.6%
1351
 
2.5%
1234
 
2.3%
1220
 
2.3%
1179
 
2.2%
1027
 
1.9%
Other values (681) 37608
69.8%
None
ValueCountFrequency (%)
11
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct2516
Distinct (%)25.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:44.231536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length48
Mean length6.6222
Min length1

Characters and Unicode

Total characters66222
Distinct characters921
Distinct categories9 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1539 ?
Unique (%)15.4%

Sample

1st row도로교통공단
2nd row넥서스BOOKS
3rd row성하출판
4th row현대문학
5th row박영사
ValueCountFrequency (%)
도로교통공단 894
 
7.7%
도로교통안전관리공단 645
 
5.5%
trb 519
 
4.5%
도로교통안전협회 419
 
3.6%
경찰청 190
 
1.6%
대한교통학회 156
 
1.3%
서울시정개발연구원 145
 
1.2%
건설교통부 142
 
1.2%
사단법인 141
 
1.2%
교통개발연구원 123
 
1.1%
Other values (2669) 8250
71.0%
2023-12-13T06:57:44.639474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3055
 
4.6%
2781
 
4.2%
2280
 
3.4%
2139
 
3.2%
1912
 
2.9%
1891
 
2.9%
1837
 
2.8%
1628
 
2.5%
1302
 
2.0%
1274
 
1.9%
Other values (911) 46123
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54195
81.8%
Lowercase Letter 5800
 
8.8%
Uppercase Letter 3609
 
5.4%
Space Separator 1628
 
2.5%
Other Punctuation 493
 
0.7%
Open Punctuation 154
 
0.2%
Decimal Number 137
 
0.2%
Close Punctuation 133
 
0.2%
Dash Punctuation 73
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3055
 
5.6%
2781
 
5.1%
2280
 
4.2%
2139
 
3.9%
1912
 
3.5%
1891
 
3.5%
1837
 
3.4%
1302
 
2.4%
1274
 
2.4%
1257
 
2.3%
Other values (836) 34467
63.6%
Lowercase Letter
ValueCountFrequency (%)
e 599
10.3%
n 560
9.7%
i 541
9.3%
o 533
9.2%
a 478
 
8.2%
r 455
 
7.8%
t 419
 
7.2%
s 389
 
6.7%
c 301
 
5.2%
l 258
 
4.4%
Other values (16) 1267
21.8%
Uppercase Letter
ValueCountFrequency (%)
T 695
19.3%
B 654
18.1%
R 579
16.0%
S 262
 
7.3%
P 167
 
4.6%
I 155
 
4.3%
C 152
 
4.2%
A 146
 
4.0%
M 146
 
4.0%
E 104
 
2.9%
Other values (14) 549
15.2%
Decimal Number
ValueCountFrequency (%)
2 55
40.1%
1 55
40.1%
3 6
 
4.4%
0 6
 
4.4%
9 5
 
3.6%
6 4
 
2.9%
8 2
 
1.5%
5 2
 
1.5%
4 1
 
0.7%
7 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 206
41.8%
: 162
32.9%
, 60
 
12.2%
& 46
 
9.3%
· 11
 
2.2%
' 5
 
1.0%
; 1
 
0.2%
/ 1
 
0.2%
@ 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 151
98.1%
[ 3
 
1.9%
Close Punctuation
ValueCountFrequency (%)
) 130
97.7%
] 3
 
2.3%
Space Separator
ValueCountFrequency (%)
1628
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51572
77.9%
Latin 9409
 
14.2%
Common 2618
 
4.0%
Han 2377
 
3.6%
Katakana 181
 
0.3%
Hiragana 65
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3055
 
5.9%
2781
 
5.4%
2280
 
4.4%
2139
 
4.1%
1912
 
3.7%
1891
 
3.7%
1837
 
3.6%
1302
 
2.5%
1274
 
2.5%
1257
 
2.4%
Other values (556) 31844
61.7%
Han
ValueCountFrequency (%)
111
 
4.7%
95
 
4.0%
95
 
4.0%
91
 
3.8%
91
 
3.8%
73
 
3.1%
71
 
3.0%
67
 
2.8%
67
 
2.8%
63
 
2.7%
Other values (220) 1553
65.3%
Latin
ValueCountFrequency (%)
T 695
 
7.4%
B 654
 
7.0%
e 599
 
6.4%
R 579
 
6.2%
n 560
 
6.0%
i 541
 
5.7%
o 533
 
5.7%
a 478
 
5.1%
r 455
 
4.8%
t 419
 
4.5%
Other values (40) 3896
41.4%
Katakana
ValueCountFrequency (%)
45
24.9%
36
19.9%
35
19.3%
5
 
2.8%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (25) 40
22.1%
Common
ValueCountFrequency (%)
1628
62.2%
. 206
 
7.9%
: 162
 
6.2%
( 151
 
5.8%
) 130
 
5.0%
- 73
 
2.8%
, 60
 
2.3%
2 55
 
2.1%
1 55
 
2.1%
& 46
 
1.8%
Other values (15) 52
 
2.0%
Hiragana
ValueCountFrequency (%)
11
16.9%
11
16.9%
11
16.9%
11
16.9%
11
16.9%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (5) 5
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51571
77.9%
ASCII 12016
 
18.1%
CJK 2357
 
3.6%
Katakana 181
 
0.3%
Hiragana 65
 
0.1%
CJK Compat Ideographs 20
 
< 0.1%
None 11
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3055
 
5.9%
2781
 
5.4%
2280
 
4.4%
2139
 
4.1%
1912
 
3.7%
1891
 
3.7%
1837
 
3.6%
1302
 
2.5%
1274
 
2.5%
1257
 
2.4%
Other values (555) 31843
61.7%
ASCII
ValueCountFrequency (%)
1628
 
13.5%
T 695
 
5.8%
B 654
 
5.4%
e 599
 
5.0%
R 579
 
4.8%
n 560
 
4.7%
i 541
 
4.5%
o 533
 
4.4%
a 478
 
4.0%
r 455
 
3.8%
Other values (64) 5294
44.1%
CJK
ValueCountFrequency (%)
111
 
4.7%
95
 
4.0%
95
 
4.0%
91
 
3.9%
91
 
3.9%
73
 
3.1%
71
 
3.0%
67
 
2.8%
67
 
2.8%
63
 
2.7%
Other values (212) 1533
65.0%
Katakana
ValueCountFrequency (%)
45
24.9%
36
19.9%
35
19.3%
5
 
2.8%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (25) 40
22.1%
None
ValueCountFrequency (%)
· 11
100.0%
Hiragana
ValueCountFrequency (%)
11
16.9%
11
16.9%
11
16.9%
11
16.9%
11
16.9%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (5) 5
7.7%
CJK Compat Ideographs
ValueCountFrequency (%)
5
25.0%
3
15.0%
3
15.0%
3
15.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct2440
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:44.920646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length44
Mean length6.3789
Min length1

Characters and Unicode

Total characters63789
Distinct characters612
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1466 ?
Unique (%)14.7%

Sample

1st row도로교통공단
2nd row넥서스BOOKS
3rd row성하출판
4th row현대문학
5th row박영사
ValueCountFrequency (%)
도로교통공단 877
 
8.8%
도로교통안전관리공단 591
 
5.9%
trb 519
 
5.2%
도로교통안전협회 415
 
4.2%
경찰청 187
 
1.9%
대한교통학회 156
 
1.6%
서울시정개발연구원 144
 
1.4%
건설교통부 130
 
1.3%
교통개발연구원 123
 
1.2%
사단법인대한토목학회 110
 
1.1%
Other values (2430) 6748
67.5%
2023-12-13T06:57:45.368791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3175
 
5.0%
2876
 
4.5%
2310
 
3.6%
2151
 
3.4%
2023
 
3.2%
1904
 
3.0%
1854
 
2.9%
1395
 
2.2%
1326
 
2.1%
1257
 
2.0%
Other values (602) 43518
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54195
85.0%
Uppercase Letter 9409
 
14.8%
Decimal Number 137
 
0.2%
Other Punctuation 46
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3175
 
5.9%
2876
 
5.3%
2310
 
4.3%
2151
 
4.0%
2023
 
3.7%
1904
 
3.5%
1854
 
3.4%
1395
 
2.6%
1326
 
2.4%
1257
 
2.3%
Other values (563) 33924
62.6%
Uppercase Letter
ValueCountFrequency (%)
T 1114
11.8%
R 1034
11.0%
B 721
 
7.7%
E 703
 
7.5%
I 696
 
7.4%
S 651
 
6.9%
A 624
 
6.6%
N 620
 
6.6%
O 610
 
6.5%
C 453
 
4.8%
Other values (16) 2183
23.2%
Decimal Number
ValueCountFrequency (%)
1 55
40.1%
2 55
40.1%
0 6
 
4.4%
3 6
 
4.4%
9 5
 
3.6%
6 4
 
2.9%
8 2
 
1.5%
5 2
 
1.5%
4 1
 
0.7%
7 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54195
85.0%
Latin 9409
 
14.8%
Common 185
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3175
 
5.9%
2876
 
5.3%
2310
 
4.3%
2151
 
4.0%
2023
 
3.7%
1904
 
3.5%
1854
 
3.4%
1395
 
2.6%
1326
 
2.4%
1257
 
2.3%
Other values (563) 33924
62.6%
Latin
ValueCountFrequency (%)
T 1114
11.8%
R 1034
11.0%
B 721
 
7.7%
E 703
 
7.5%
I 696
 
7.4%
S 651
 
6.9%
A 624
 
6.6%
N 620
 
6.6%
O 610
 
6.5%
C 453
 
4.8%
Other values (16) 2183
23.2%
Common
ValueCountFrequency (%)
1 55
29.7%
2 55
29.7%
46
24.9%
0 6
 
3.2%
3 6
 
3.2%
9 5
 
2.7%
6 4
 
2.2%
8 2
 
1.1%
5 2
 
1.1%
4 1
 
0.5%
Other values (3) 3
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54194
85.0%
ASCII 9548
 
15.0%
None 46
 
0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3175
 
5.9%
2876
 
5.3%
2310
 
4.3%
2151
 
4.0%
2023
 
3.7%
1904
 
3.5%
1854
 
3.4%
1395
 
2.6%
1326
 
2.4%
1257
 
2.3%
Other values (562) 33923
62.6%
ASCII
ValueCountFrequency (%)
T 1114
11.7%
R 1034
10.8%
B 721
 
7.6%
E 703
 
7.4%
I 696
 
7.3%
S 651
 
6.8%
A 624
 
6.5%
N 620
 
6.5%
O 610
 
6.4%
C 453
 
4.7%
Other values (28) 2322
24.3%
None
ValueCountFrequency (%)
46
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

출판년도
Real number (ℝ)

Distinct59
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1999.7375
Minimum1900
Maximum2107
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:57:45.507164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1900
5-th percentile1981
Q11994
median2002
Q32008
95-th percentile2016
Maximum2107
Range207
Interquartile range (IQR)14

Descriptive statistics

Standard deviation14.774052
Coefficient of variation (CV)0.0073879958
Kurtosis20.195949
Mean1999.7375
Median Absolute Deviation (MAD)7
Skewness-3.3350311
Sum19997375
Variance218.27262
MonotonicityNot monotonic
2023-12-13T06:57:45.639447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2007 556
 
5.6%
2008 470
 
4.7%
1996 428
 
4.3%
2004 416
 
4.2%
2005 394
 
3.9%
2002 378
 
3.8%
2009 365
 
3.6%
2006 353
 
3.5%
2010 326
 
3.3%
2003 323
 
3.2%
Other values (49) 5991
59.9%
ValueCountFrequency (%)
1900 107
1.1%
1950 1
 
< 0.1%
1960 2
 
< 0.1%
1964 1
 
< 0.1%
1968 3
 
< 0.1%
1969 47
0.5%
1970 4
 
< 0.1%
1971 10
 
0.1%
1972 2
 
< 0.1%
1973 26
 
0.3%
ValueCountFrequency (%)
2107 1
 
< 0.1%
2021 29
 
0.3%
2020 90
 
0.9%
2019 89
 
0.9%
2018 117
1.2%
2017 121
1.2%
2016 159
1.6%
2015 138
1.4%
2014 168
1.7%
2013 257
2.6%
Distinct5894
Distinct (%)58.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:45.861593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length13.2836
Min length6

Characters and Unicode

Total characters132836
Distinct characters88
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4819 ?
Unique (%)48.2%

Sample

1st row363.1251-ㄷ68ㄱ
2nd row919.67-ㅁ984e
3rd row915.2-ㅈ688ㅈ
4th row811.37-ㅎ761ㅋ
5th row629.04-ㅇ553ㄱ
ValueCountFrequency (%)
363.12565-ㄷ68ㄱ-sr 138
 
1.4%
363.1251-ㄷ68ㄱ-sr 117
 
1.2%
388.072-ㄷ52ㄷ 107
 
1.1%
624.072-ㄷ52ㄷ 89
 
0.9%
388.071-ㄷ68ㄱ-sr 81
 
0.8%
363.12505-ㄷ68ㄱ-sr 68
 
0.7%
363.1251-ㄷ68ㄱ 60
 
0.6%
363.1251-ㄱ313ㄱ 60
 
0.6%
388.06-t783t-st 59
 
0.6%
340.52519-ㅂ754ㄷ 53
 
0.5%
Other values (5888) 9172
91.7%
2023-12-13T06:57:46.258978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 13522
 
10.2%
- 13413
 
10.1%
8 11116
 
8.4%
2 10001
 
7.5%
6 9471
 
7.1%
. 9310
 
7.0%
1 8474
 
6.4%
5 8113
 
6.1%
4 6323
 
4.8%
7 6020
 
4.5%
Other values (78) 37073
27.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83575
62.9%
Other Letter 17451
 
13.1%
Dash Punctuation 13413
 
10.1%
Other Punctuation 9311
 
7.0%
Uppercase Letter 7929
 
6.0%
Lowercase Letter 1149
 
0.9%
Other Symbol 4
 
< 0.1%
Space Separator 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (15) 829
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
t 212
18.5%
p 106
 
9.2%
s 88
 
7.7%
a 77
 
6.7%
r 73
 
6.4%
c 69
 
6.0%
h 62
 
5.4%
i 54
 
4.7%
d 47
 
4.1%
e 43
 
3.7%
Other values (15) 318
27.7%
Uppercase Letter
ValueCountFrequency (%)
S 3230
40.7%
R 1648
20.8%
T 1066
 
13.4%
K 875
 
11.0%
D 277
 
3.5%
I 150
 
1.9%
C 125
 
1.6%
E 97
 
1.2%
B 78
 
1.0%
F 64
 
0.8%
Other values (13) 319
 
4.0%
Decimal Number
ValueCountFrequency (%)
3 13522
16.2%
8 11116
13.3%
2 10001
12.0%
6 9471
11.3%
1 8474
10.1%
5 8113
9.7%
4 6323
7.6%
7 6020
7.2%
0 5621
6.7%
9 4914
 
5.9%
Other Punctuation
ValueCountFrequency (%)
. 9310
> 99.9%
/ 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 13413
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 106307
80.0%
Hangul 17451
 
13.1%
Latin 9078
 
6.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 3230
35.6%
R 1648
18.2%
T 1066
 
11.7%
K 875
 
9.6%
D 277
 
3.1%
t 212
 
2.3%
I 150
 
1.7%
C 125
 
1.4%
p 106
 
1.2%
E 97
 
1.1%
Other values (38) 1292
 
14.2%
Hangul
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (15) 829
 
4.8%
Common
ValueCountFrequency (%)
3 13522
12.7%
- 13413
12.6%
8 11116
10.5%
2 10001
9.4%
6 9471
8.9%
. 9310
8.8%
1 8474
8.0%
5 8113
7.6%
4 6323
5.9%
7 6020
5.7%
Other values (5) 10544
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 115381
86.9%
Compat Jamo 17440
 
13.1%
Hangul 11
 
< 0.1%
Geometric Shapes 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 13522
11.7%
- 13413
11.6%
8 11116
9.6%
2 10001
8.7%
6 9471
8.2%
. 9310
8.1%
1 8474
7.3%
5 8113
7.0%
4 6323
 
5.5%
7 6020
 
5.2%
Other values (52) 19618
17.0%
Compat Jamo
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (10) 818
 
4.7%
Geometric Shapes
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
3
27.3%
3
27.3%
3
27.3%
1
 
9.1%
1
 
9.1%
Distinct5894
Distinct (%)58.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:46.613620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length13.2828
Min length5

Characters and Unicode

Total characters132828
Distinct characters87
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4819 ?
Unique (%)48.2%

Sample

1st row363.1251 ㄷ68ㄱ
2nd row919.67 ㅁ984e
3rd row915.2 ㅈ688ㅈ
4th row811.37 ㅎ761ㅋ
5th row629.04 ㅇ553ㄱ
ValueCountFrequency (%)
sr 1618
 
6.9%
sk 817
 
3.5%
ㄷ68ㄱ 749
 
3.2%
st 515
 
2.2%
363.1251 472
 
2.0%
388.06 434
 
1.9%
363.12565 301
 
1.3%
363.125 235
 
1.0%
ㄷ52ㄷ 201
 
0.9%
ㄷ68ㅈ 182
 
0.8%
Other values (6350) 17877
76.4%
2023-12-13T06:57:47.143531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 13522
 
10.2%
13410
 
10.1%
8 11116
 
8.4%
2 10001
 
7.5%
6 9471
 
7.1%
. 9310
 
7.0%
1 8474
 
6.4%
5 8113
 
6.1%
4 6323
 
4.8%
7 6020
 
4.5%
Other values (77) 37068
27.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83574
62.9%
Other Letter 17451
 
13.1%
Space Separator 13410
 
10.1%
Other Punctuation 9311
 
7.0%
Uppercase Letter 7929
 
6.0%
Lowercase Letter 1149
 
0.9%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (15) 829
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
t 212
18.5%
p 106
 
9.2%
s 88
 
7.7%
a 77
 
6.7%
r 73
 
6.4%
c 69
 
6.0%
h 62
 
5.4%
i 54
 
4.7%
d 47
 
4.1%
e 43
 
3.7%
Other values (15) 318
27.7%
Uppercase Letter
ValueCountFrequency (%)
S 3230
40.7%
R 1648
20.8%
T 1066
 
13.4%
K 875
 
11.0%
D 277
 
3.5%
I 150
 
1.9%
C 125
 
1.6%
E 97
 
1.2%
B 78
 
1.0%
F 64
 
0.8%
Other values (13) 319
 
4.0%
Decimal Number
ValueCountFrequency (%)
3 13522
16.2%
8 11116
13.3%
2 10001
12.0%
6 9471
11.3%
1 8474
10.1%
5 8113
9.7%
4 6323
7.6%
7 6020
7.2%
0 5620
6.7%
9 4914
 
5.9%
Other Punctuation
ValueCountFrequency (%)
. 9310
> 99.9%
/ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
13410
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 106299
80.0%
Hangul 17451
 
13.1%
Latin 9078
 
6.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 3230
35.6%
R 1648
18.2%
T 1066
 
11.7%
K 875
 
9.6%
D 277
 
3.1%
t 212
 
2.3%
I 150
 
1.7%
C 125
 
1.4%
p 106
 
1.2%
E 97
 
1.1%
Other values (38) 1292
 
14.2%
Hangul
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (15) 829
 
4.8%
Common
ValueCountFrequency (%)
3 13522
12.7%
13410
12.6%
8 11116
10.5%
2 10001
9.4%
6 9471
8.9%
. 9310
8.8%
1 8474
8.0%
5 8113
7.6%
4 6323
5.9%
7 6020
5.7%
Other values (4) 10539
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 115373
86.9%
Compat Jamo 17440
 
13.1%
Hangul 11
 
< 0.1%
Geometric Shapes 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 13522
11.7%
13410
11.6%
8 11116
9.6%
2 10001
8.7%
6 9471
8.2%
. 9310
8.1%
1 8474
7.3%
5 8113
7.0%
4 6323
 
5.5%
7 6020
 
5.2%
Other values (51) 19613
17.0%
Compat Jamo
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (10) 818
 
4.7%
Geometric Shapes
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
3
27.3%
3
27.3%
3
27.3%
1
 
9.1%
1
 
9.1%

권책기호
Text

MISSING 

Distinct833
Distinct (%)24.6%
Missing6609
Missing (%)66.1%
Memory size156.2 KiB
2023-12-13T06:57:47.444104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length4.7891477
Min length1

Characters and Unicode

Total characters16240
Distinct characters76
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique587 ?
Unique (%)17.3%

Sample

1st rowv.1
2nd rowv.251
3rd row2003
4th rowv.31
5th row2004
ValueCountFrequency (%)
v.1 280
 
7.7%
v.2 266
 
7.3%
v.3 126
 
3.5%
v.4 87
 
2.4%
1996 69
 
1.9%
v.5 68
 
1.9%
2004 65
 
1.8%
2008 61
 
1.7%
2003 59
 
1.6%
2005 53
 
1.5%
Other values (659) 2503
68.8%
2023-12-13T06:57:47.907538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 2305
14.2%
v 2066
12.7%
1 2053
12.6%
2 1924
11.8%
0 1909
11.8%
9 1392
8.6%
3 601
 
3.7%
4 482
 
3.0%
, 446
 
2.7%
8 445
 
2.7%
Other values (66) 2617
16.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9972
61.4%
Other Punctuation 2829
 
17.4%
Lowercase Letter 2452
 
15.1%
Space Separator 246
 
1.5%
Open Punctuation 191
 
1.2%
Close Punctuation 191
 
1.2%
Dash Punctuation 188
 
1.2%
Uppercase Letter 112
 
0.7%
Other Letter 57
 
0.4%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
8.8%
4
 
7.0%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
2
 
3.5%
Other values (20) 25
43.9%
Lowercase Letter
ValueCountFrequency (%)
v 2066
84.3%
n 235
 
9.6%
o 70
 
2.9%
l 59
 
2.4%
a 6
 
0.2%
t 4
 
0.2%
h 3
 
0.1%
b 2
 
0.1%
e 2
 
0.1%
r 2
 
0.1%
Other values (3) 3
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
A 35
31.2%
C 24
21.4%
B 17
15.2%
D 16
14.3%
J 5
 
4.5%
F 5
 
4.5%
V 4
 
3.6%
G 1
 
0.9%
E 1
 
0.9%
M 1
 
0.9%
Other values (3) 3
 
2.7%
Decimal Number
ValueCountFrequency (%)
1 2053
20.6%
2 1924
19.3%
0 1909
19.1%
9 1392
14.0%
3 601
 
6.0%
4 482
 
4.8%
8 445
 
4.5%
6 416
 
4.2%
7 380
 
3.8%
5 370
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 2305
81.5%
, 446
 
15.8%
' 69
 
2.4%
/ 9
 
0.3%
Space Separator
ValueCountFrequency (%)
246
100.0%
Open Punctuation
ValueCountFrequency (%)
( 191
100.0%
Close Punctuation
ValueCountFrequency (%)
) 191
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 188
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13618
83.9%
Latin 2565
 
15.8%
Hangul 57
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
8.8%
4
 
7.0%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
2
 
3.5%
Other values (20) 25
43.9%
Latin
ValueCountFrequency (%)
v 2066
80.5%
n 235
 
9.2%
o 70
 
2.7%
l 59
 
2.3%
A 35
 
1.4%
C 24
 
0.9%
B 17
 
0.7%
D 16
 
0.6%
a 6
 
0.2%
J 5
 
0.2%
Other values (17) 32
 
1.2%
Common
ValueCountFrequency (%)
. 2305
16.9%
1 2053
15.1%
2 1924
14.1%
0 1909
14.0%
9 1392
10.2%
3 601
 
4.4%
4 482
 
3.5%
, 446
 
3.3%
8 445
 
3.3%
6 416
 
3.1%
Other values (9) 1645
12.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16182
99.6%
Hangul 57
 
0.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 2305
14.2%
v 2066
12.8%
1 2053
12.7%
2 1924
11.9%
0 1909
11.8%
9 1392
8.6%
3 601
 
3.7%
4 482
 
3.0%
, 446
 
2.8%
8 445
 
2.7%
Other values (35) 2559
15.8%
Hangul
ValueCountFrequency (%)
5
 
8.8%
4
 
7.0%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
2
 
3.5%
Other values (20) 25
43.9%
Number Forms
ValueCountFrequency (%)
1
100.0%

복본기호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing8660
Missing (%)86.6%
Memory size156.2 KiB

별치기호
Categorical

IMBALANCE 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6599 
SR
1622 
SK
817 
ST
 
515
SI
 
120
Other values (7)
 
327

Length

Max length4
Median length4
Mean length3.321
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6599
66.0%
SR 1622
 
16.2%
SK 817
 
8.2%
ST 515
 
5.1%
SI 120
 
1.2%
CD 93
 
0.9%
SD 88
 
0.9%
EB 54
 
0.5%
SF 47
 
0.5%
VD 29
 
0.3%
Other values (2) 16
 
0.2%

Length

2023-12-13T06:57:48.095529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 6599
66.0%
sr 1622
 
16.2%
sk 817
 
8.2%
st 515
 
5.1%
si 120
 
1.2%
cd 93
 
0.9%
sd 88
 
0.9%
eb 54
 
0.5%
sf 47
 
0.5%
vd 29
 
0.3%
Other values (2) 16
 
0.2%

자료상태
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
6644 
5
3342 
7
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 6644
66.4%
5 3342
33.4%
7 14
 
0.1%

Length

2023-12-13T06:57:48.251961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:48.385360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6644
66.4%
5 3342
33.4%
7 14
 
0.1%

소장처코드
Real number (ℝ)

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.9059
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:57:48.483463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile16
Maximum16
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation5.1308555
Coefficient of variation (CV)1.3136167
Kurtosis0.37085333
Mean3.9059
Median Absolute Deviation (MAD)0
Skewness1.4432037
Sum39059
Variance26.325678
MonotonicityNot monotonic
2023-12-13T06:57:48.620637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1 7017
70.2%
14 559
 
5.6%
16 502
 
5.0%
11 274
 
2.7%
15 256
 
2.6%
4 196
 
2.0%
13 195
 
1.9%
3 187
 
1.9%
5 177
 
1.8%
7 123
 
1.2%
Other values (6) 514
 
5.1%
ValueCountFrequency (%)
1 7017
70.2%
2 106
 
1.1%
3 187
 
1.9%
4 196
 
2.0%
5 177
 
1.8%
6 100
 
1.0%
7 123
 
1.2%
8 116
 
1.2%
9 34
 
0.3%
10 119
 
1.2%
ValueCountFrequency (%)
16 502
5.0%
15 256
2.6%
14 559
5.6%
13 195
 
1.9%
12 39
 
0.4%
11 274
2.7%
10 119
 
1.2%
9 34
 
0.3%
8 116
 
1.2%
7 123
 
1.2%

소장처명
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도서실
6814 
부산교통방송
 
559
대구교통방송
 
502
경북지부
 
274
광주교통방송
 
256
Other values (12)
1595 

Length

Max length7
Median length3
Mean length3.8718
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천광역시지부
2nd row도서실
3rd row도서실
4th row도서실
5th row도서실

Common Values

ValueCountFrequency (%)
도서실 6814
68.1%
부산교통방송 559
 
5.6%
대구교통방송 502
 
5.0%
경북지부 274
 
2.7%
광주교통방송 256
 
2.6%
서울특별시지부 203
 
2.0%
인천광역시지부 196
 
2.0%
제주지부 195
 
1.9%
대구광역시지부 187
 
1.9%
경기지부 177
 
1.8%
Other values (7) 637
 
6.4%

Length

2023-12-13T06:57:48.784718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
도서실 6814
68.1%
부산교통방송 559
 
5.6%
대구교통방송 502
 
5.0%
경북지부 274
 
2.7%
광주교통방송 256
 
2.6%
서울특별시지부 203
 
2.0%
인천광역시지부 196
 
2.0%
제주지부 195
 
1.9%
대구광역시지부 187
 
1.9%
경기지부 177
 
1.8%
Other values (7) 637
 
6.4%

언어코드
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KOR
8637 
ENG
983 
JPN
 
368
FRE
 
6
CHI
 
3
Other values (3)
 
3

Length

Max length4
Median length3
Mean length3
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowKOR
2nd rowKOR
3rd rowKOR
4th rowKOR
5th rowKOR

Common Values

ValueCountFrequency (%)
KOR 8637
86.4%
ENG 983
 
9.8%
JPN 368
 
3.7%
FRE 6
 
0.1%
CHI 3
 
< 0.1%
<NA> 1
 
< 0.1%
JAN 1
 
< 0.1%
OR 1
 
< 0.1%

Length

2023-12-13T06:57:48.942384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:49.066445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kor 8637
86.4%
eng 983
 
9.8%
jpn 368
 
3.7%
fre 6
 
0.1%
chi 3
 
< 0.1%
na 1
 
< 0.1%
jan 1
 
< 0.1%
or 1
 
< 0.1%

국가코드
Categorical

IMBALANCE 

Distinct35
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
ULK
8243 
US
 
786
GGK
 
391
JA
 
378
UK
 
72
Other values (30)
 
130

Length

Max length4
Median length3
Mean length2.8686
Min length2

Unique

Unique9 ?
Unique (%)0.1%

Sample

1st rowULK
2nd rowULK
3rd rowULK
4th rowULK
5th rowULK

Common Values

ValueCountFrequency (%)
ULK 8243
82.4%
US 786
 
7.9%
GGK 391
 
3.9%
JA 378
 
3.8%
UK 72
 
0.7%
FR 43
 
0.4%
BNK 10
 
0.1%
SZ 6
 
0.1%
HK 6
 
0.1%
TGK 5
 
0.1%
Other values (25) 60
 
0.6%

Length

2023-12-13T06:57:49.208727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ulk 8243
82.4%
us 786
 
7.9%
ggk 391
 
3.9%
ja 378
 
3.8%
uk 72
 
0.7%
fr 43
 
0.4%
bnk 10
 
0.1%
hk 6
 
0.1%
sz 6
 
0.1%
tgk 5
 
< 0.1%
Other values (25) 60
 
0.6%
Distinct2090
Distinct (%)20.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:57:49.592325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length6.4657
Min length1

Characters and Unicode

Total characters64657
Distinct characters18
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1169 ?
Unique (%)11.7%

Sample

1st row363.1251
2nd row919.67
3rd row915.2
4th row811.37
5th row629.04
ValueCountFrequency (%)
363.1251 472
 
4.7%
388.06 434
 
4.3%
363.12565 301
 
3.0%
363.125 235
 
2.4%
363.1257 162
 
1.6%
388.072 156
 
1.6%
629.04 136
 
1.4%
625.794 130
 
1.3%
624.072 117
 
1.2%
388.068 109
 
1.1%
Other values (2080) 7748
77.5%
2023-12-13T06:57:50.131087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 10260
15.9%
. 9310
14.4%
8 6780
10.5%
2 6613
10.2%
1 6369
9.9%
6 5574
8.6%
5 5475
8.5%
0 4809
7.4%
4 3459
 
5.3%
7 3005
 
4.6%
Other values (8) 3003
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 55297
85.5%
Other Punctuation 9311
 
14.4%
Uppercase Letter 27
 
< 0.1%
Dash Punctuation 12
 
< 0.1%
Other Letter 6
 
< 0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 10260
18.6%
8 6780
12.3%
2 6613
12.0%
1 6369
11.5%
6 5574
10.1%
5 5475
9.9%
0 4809
8.7%
4 3459
 
6.3%
7 3005
 
5.4%
9 2953
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 9310
> 99.9%
/ 1
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
D 18
66.7%
V 9
33.3%
Other Letter
ValueCountFrequency (%)
3
50.0%
3
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 64624
99.9%
Latin 27
 
< 0.1%
Hangul 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
3 10260
15.9%
. 9310
14.4%
8 6780
10.5%
2 6613
10.2%
1 6369
9.9%
6 5574
8.6%
5 5475
8.5%
0 4809
7.4%
4 3459
 
5.4%
7 3005
 
4.6%
Other values (4) 2970
 
4.6%
Latin
ValueCountFrequency (%)
D 18
66.7%
V 9
33.3%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 64647
> 99.9%
Hangul 6
 
< 0.1%
Geometric Shapes 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 10260
15.9%
. 9310
14.4%
8 6780
10.5%
2 6613
10.2%
1 6369
9.9%
6 5574
8.6%
5 5475
8.5%
0 4809
7.4%
4 3459
 
5.4%
7 3005
 
4.6%
Other values (5) 2993
 
4.6%
Geometric Shapes
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%
Distinct4250
Distinct (%)42.6%
Missing22
Missing (%)0.2%
Memory size156.2 KiB
2023-12-13T06:57:50.486589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.7245941
Min length3

Characters and Unicode

Total characters47142
Distinct characters79
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3118 ?
Unique (%)31.2%

Sample

1st rowㄷ68ㄱ
2nd rowㅁ984e
3rd rowㅈ688ㅈ
4th rowㅎ761ㅋ
5th rowㅇ553ㄱ
ValueCountFrequency (%)
ㄷ68ㄱ 749
 
7.5%
ㄷ52ㄷ 201
 
2.0%
ㄷ68ㅈ 182
 
1.8%
ㄷ68ㄷ 147
 
1.5%
ㄱ313ㄱ 144
 
1.4%
ㄷ68ㅇ 130
 
1.3%
ㄱ443ㄱ 127
 
1.3%
ㄷ68ㅅ 123
 
1.2%
ㄱ271ㄷ 97
 
1.0%
t783t 90
 
0.9%
Other values (4239) 7989
80.1%
2023-12-13T06:57:51.321454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 4336
 
9.2%
4198
 
8.9%
6 3897
 
8.3%
2 3386
 
7.2%
3 3260
 
6.9%
3053
 
6.5%
7 3015
 
6.4%
4 2864
 
6.1%
5 2638
 
5.6%
2547
 
5.4%
Other values (69) 13948
29.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 27465
58.3%
Other Letter 17440
37.0%
Lowercase Letter 1147
 
2.4%
Uppercase Letter 1089
 
2.3%
Space Separator 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 212
18.5%
p 106
 
9.2%
s 88
 
7.7%
a 77
 
6.7%
r 73
 
6.4%
c 69
 
6.0%
h 62
 
5.4%
i 52
 
4.5%
d 47
 
4.1%
e 43
 
3.7%
Other values (15) 318
27.7%
Uppercase Letter
ValueCountFrequency (%)
T 551
50.6%
K 50
 
4.6%
E 43
 
3.9%
N 34
 
3.1%
C 34
 
3.1%
I 32
 
2.9%
R 30
 
2.8%
P 28
 
2.6%
O 28
 
2.6%
L 25
 
2.3%
Other values (13) 234
21.5%
Other Letter
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (10) 818
 
4.7%
Decimal Number
ValueCountFrequency (%)
8 4336
15.8%
6 3897
14.2%
2 3386
12.3%
3 3260
11.9%
7 3015
11.0%
4 2864
10.4%
5 2638
9.6%
1 2104
7.7%
9 1960
7.1%
0 5
 
< 0.1%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27466
58.3%
Hangul 17440
37.0%
Latin 2236
 
4.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 551
24.6%
t 212
 
9.5%
p 106
 
4.7%
s 88
 
3.9%
a 77
 
3.4%
r 73
 
3.3%
c 69
 
3.1%
h 62
 
2.8%
i 52
 
2.3%
K 50
 
2.2%
Other values (38) 896
40.1%
Hangul
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (10) 818
 
4.7%
Common
ValueCountFrequency (%)
8 4336
15.8%
6 3897
14.2%
2 3386
12.3%
3 3260
11.9%
7 3015
11.0%
4 2864
10.4%
5 2638
9.6%
1 2104
7.7%
9 1960
7.1%
0 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29702
63.0%
Compat Jamo 17440
37.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 4336
14.6%
6 3897
13.1%
2 3386
11.4%
3 3260
11.0%
7 3015
10.2%
4 2864
9.6%
5 2638
8.9%
1 2104
7.1%
9 1960
6.6%
T 551
 
1.9%
Other values (49) 1691
 
5.7%
Compat Jamo
ValueCountFrequency (%)
4198
24.1%
3053
17.5%
2547
14.6%
1569
 
9.0%
1512
 
8.7%
1486
 
8.5%
875
 
5.0%
521
 
3.0%
495
 
2.8%
366
 
2.1%
Other values (10) 818
 
4.7%

자료유형코드
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
MM
7546 
EE
1478 
SS
 
711
NN
 
124
DD
 
87

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEE
2nd rowMM
3rd rowMM
4th rowMM
5th rowMM

Common Values

ValueCountFrequency (%)
MM 7546
75.5%
EE 1478
 
14.8%
SS 711
 
7.1%
NN 124
 
1.2%
DD 87
 
0.9%
EB 54
 
0.5%

Length

2023-12-13T06:57:51.466457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:51.594318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
mm 7546
75.5%
ee 1478
 
14.8%
ss 711
 
7.1%
nn 124
 
1.2%
dd 87
 
0.9%
eb 54
 
0.5%

자료유형
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단행본
7546 
공단연구보고서
1478 
연속간행물
 
711
비도서자료
 
124
학위논문
 
87

Length

Max length7
Median length3
Mean length3.7669
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공단연구보고서
2nd row단행본
3rd row단행본
4th row단행본
5th row단행본

Common Values

ValueCountFrequency (%)
단행본 7546
75.5%
공단연구보고서 1478
 
14.8%
연속간행물 711
 
7.1%
비도서자료 124
 
1.2%
학위논문 87
 
0.9%
전자책 54
 
0.5%

Length

2023-12-13T06:57:51.743277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:51.951011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단행본 7546
75.5%
공단연구보고서 1478
 
14.8%
연속간행물 711
 
7.1%
비도서자료 124
 
1.2%
학위논문 87
 
0.9%
전자책 54
 
0.5%

마크유형구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KK
9048 
UU
949 
X
 
3

Length

Max length2
Median length2
Mean length1.9997
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKK
2nd rowKK
3rd rowKK
4th rowKK
5th rowKK

Common Values

ValueCountFrequency (%)
KK 9048
90.5%
UU 949
 
9.5%
X 3
 
< 0.1%

Length

2023-12-13T06:57:52.066208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:52.173898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kk 9048
90.5%
uu 949
 
9.5%
x 3
 
< 0.1%

마크유형
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
kormarc21
9048 
marc21(us)
949 
<NA>
 
3

Length

Max length10
Median length9
Mean length9.0934
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkormarc21
2nd rowkormarc21
3rd rowkormarc21
4th rowkormarc21
5th rowkormarc21

Common Values

ValueCountFrequency (%)
kormarc21 9048
90.5%
marc21(us) 949
 
9.5%
<NA> 3
 
< 0.1%

Length

2023-12-13T06:57:52.297077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:52.416912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kormarc21 9048
90.5%
marc21(us 949
 
9.5%
na 3
 
< 0.1%

입수유형
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
구입
3454 
기증
2484 
<NA>
2013 
출판기증
1796 
구입기증
 
250

Length

Max length4
Median length2
Mean length2.8118
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판기증
2nd row구입
3rd row기증
4th row기증
5th row구입

Common Values

ValueCountFrequency (%)
구입 3454
34.5%
기증 2484
24.8%
<NA> 2013
20.1%
출판기증 1796
18.0%
구입기증 250
 
2.5%
교환 3
 
< 0.1%

Length

2023-12-13T06:57:52.536052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:52.679127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구입 3454
34.5%
기증 2484
24.8%
na 2013
20.1%
출판기증 1796
18.0%
구입기증 250
 
2.5%
교환 3
 
< 0.1%

포함자료
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7755 
IMG
1310 
URL
 
635
OTH IMG
 
218
856
 
60
Other values (2)
 
22

Length

Max length8
Median length4
Mean length4.0876
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row IMG
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7755
77.5%
IMG 1310
 
13.1%
URL 635
 
6.3%
OTH IMG 218
 
2.2%
856 60
 
0.6%
IMG URL 16
 
0.2%
OTH 6
 
0.1%

Length

2023-12-13T06:57:52.800411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:52.899269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7755
75.8%
img 1544
 
15.1%
url 651
 
6.4%
oth 224
 
2.2%
856 60
 
0.6%

보안등급
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

초록
Text

MISSING 

Distinct119
Distinct (%)33.4%
Missing9644
Missing (%)96.4%
Memory size156.2 KiB
2023-12-13T06:57:53.237031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length42
Mean length32.620787
Min length28

Characters and Unicode

Total characters11613
Distinct characters392
Distinct categories11 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)10.4%

Sample

1st row1t의협의 나날 (Uihyop ui nanal). -n2t활을 든...
2nd row1.t네덜란드 = Netherlandg(228 p.) --n2.t프...
3rd row동화상 광고물은 동화상을 표현함으로써 이전의 옥...
4th row장애인특수차량 산업의 현황 및 향후추이(Aibe Ts...
5th rowv.1, 헌법·- v.1-2,국회. - v.2, 선거·정당. - ...
ValueCountFrequency (%)
295
 
12.5%
v.1 94
 
4.0%
82
 
3.5%
v.2 80
 
3.4%
선거·정당 59
 
2.5%
연구는 51
 
2.2%
헌법· 47
 
2.0%
v.1-2,국회 47
 
2.0%
38
 
1.6%
교통사고 29
 
1.2%
Other values (523) 1532
65.1%
2023-12-13T06:57:53.732255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1945
 
16.7%
. 1554
 
13.4%
- 330
 
2.8%
, 284
 
2.4%
v 245
 
2.1%
1 228
 
2.0%
2 193
 
1.7%
171
 
1.5%
148
 
1.3%
140
 
1.2%
Other values (382) 6375
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5826
50.2%
Other Punctuation 2004
 
17.3%
Space Separator 1945
 
16.7%
Decimal Number 664
 
5.7%
Lowercase Letter 614
 
5.3%
Dash Punctuation 330
 
2.8%
Uppercase Letter 90
 
0.8%
Control 54
 
0.5%
Open Punctuation 37
 
0.3%
Close Punctuation 36
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
2.9%
148
 
2.5%
140
 
2.4%
139
 
2.4%
120
 
2.1%
112
 
1.9%
109
 
1.9%
106
 
1.8%
103
 
1.8%
103
 
1.8%
Other values (323) 4575
78.5%
Lowercase Letter
ValueCountFrequency (%)
v 245
39.9%
t 57
 
9.3%
e 40
 
6.5%
n 38
 
6.2%
i 28
 
4.6%
a 26
 
4.2%
p 25
 
4.1%
r 25
 
4.1%
c 21
 
3.4%
h 20
 
3.3%
Other values (9) 89
 
14.5%
Uppercase Letter
ValueCountFrequency (%)
T 14
15.6%
N 11
12.2%
O 10
11.1%
H 10
11.1%
W 10
11.1%
A 6
6.7%
I 6
6.7%
C 5
 
5.6%
M 5
 
5.6%
P 5
 
5.6%
Other values (4) 8
8.9%
Decimal Number
ValueCountFrequency (%)
1 228
34.3%
2 193
29.1%
9 71
 
10.7%
8 52
 
7.8%
0 46
 
6.9%
3 27
 
4.1%
4 17
 
2.6%
6 15
 
2.3%
7 9
 
1.4%
5 6
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 1554
77.5%
, 284
 
14.2%
· 124
 
6.2%
' 28
 
1.4%
/ 7
 
0.3%
: 5
 
0.2%
1
 
< 0.1%
! 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 11
84.6%
< 1
 
7.7%
> 1
 
7.7%
Space Separator
ValueCountFrequency (%)
1945
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 330
100.0%
Control
ValueCountFrequency (%)
 54
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5690
49.0%
Common 5083
43.8%
Latin 704
 
6.1%
Han 108
 
0.9%
Hiragana 28
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
3.0%
148
 
2.6%
140
 
2.5%
139
 
2.4%
120
 
2.1%
112
 
2.0%
109
 
1.9%
106
 
1.9%
103
 
1.8%
103
 
1.8%
Other values (300) 4439
78.0%
Latin
ValueCountFrequency (%)
v 245
34.8%
t 57
 
8.1%
e 40
 
5.7%
n 38
 
5.4%
i 28
 
4.0%
a 26
 
3.7%
p 25
 
3.6%
r 25
 
3.6%
c 21
 
3.0%
h 20
 
2.8%
Other values (23) 179
25.4%
Common
ValueCountFrequency (%)
1945
38.3%
. 1554
30.6%
- 330
 
6.5%
, 284
 
5.6%
1 228
 
4.5%
2 193
 
3.8%
· 124
 
2.4%
9 71
 
1.4%
 54
 
1.1%
8 52
 
1.0%
Other values (16) 248
 
4.9%
Han
ValueCountFrequency (%)
16
14.8%
16
14.8%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
Other values (8) 12
11.1%
Hiragana
ValueCountFrequency (%)
17
60.7%
8
28.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5670
48.8%
ASCII 5662
48.8%
None 124
 
1.1%
CJK 108
 
0.9%
Hiragana 28
 
0.2%
Compat Jamo 20
 
0.2%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1945
34.4%
. 1554
27.4%
- 330
 
5.8%
, 284
 
5.0%
v 245
 
4.3%
1 228
 
4.0%
2 193
 
3.4%
9 71
 
1.3%
t 57
 
1.0%
 54
 
1.0%
Other values (47) 701
 
12.4%
Hangul
ValueCountFrequency (%)
171
 
3.0%
148
 
2.6%
140
 
2.5%
139
 
2.5%
120
 
2.1%
112
 
2.0%
109
 
1.9%
106
 
1.9%
103
 
1.8%
103
 
1.8%
Other values (299) 4419
77.9%
None
ValueCountFrequency (%)
· 124
100.0%
Compat Jamo
ValueCountFrequency (%)
20
100.0%
Hiragana
ValueCountFrequency (%)
17
60.7%
8
28.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
CJK
ValueCountFrequency (%)
16
14.8%
16
14.8%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
8
7.4%
Other values (8) 12
11.1%
Punctuation
ValueCountFrequency (%)
1
100.0%

제어번호
Real number (ℝ)

Distinct7174
Distinct (%)71.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32071.644
Minimum4
Maximum83362
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:57:53.886755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile856
Q15852
median34903.5
Q352566.25
95-th percentile80819.6
Maximum83362
Range83358
Interquartile range (IQR)46714.25

Descriptive statistics

Standard deviation26638.933
Coefficient of variation (CV)0.83060704
Kurtosis-1.1896668
Mean32071.644
Median Absolute Deviation (MAD)25641.5
Skewness0.3821952
Sum3.2071644 × 108
Variance7.0963277 × 108
MonotonicityNot monotonic
2023-12-13T06:57:54.036350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1679 107
 
1.1%
2881 81
 
0.8%
1527 68
 
0.7%
1827 48
 
0.5%
3242 47
 
0.5%
5852 45
 
0.4%
21609 39
 
0.4%
5470 33
 
0.3%
11766 28
 
0.3%
7403 26
 
0.3%
Other values (7164) 9478
94.8%
ValueCountFrequency (%)
4 1
< 0.1%
5 1
< 0.1%
10 1
< 0.1%
14 1
< 0.1%
16 1
< 0.1%
20 1
< 0.1%
27 1
< 0.1%
35 2
< 0.1%
37 1
< 0.1%
41 1
< 0.1%
ValueCountFrequency (%)
83362 1
< 0.1%
83356 1
< 0.1%
83352 1
< 0.1%
83351 1
< 0.1%
83345 1
< 0.1%
83344 1
< 0.1%
83342 1
< 0.1%
83341 1
< 0.1%
83337 1
< 0.1%
83335 1
< 0.1%
Distinct1862
Distinct (%)40.0%
Missing5342
Missing (%)53.4%
Memory size156.2 KiB
2023-12-13T06:57:54.345465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length11.872263
Min length6

Characters and Unicode

Total characters55301
Distinct characters17
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1620 ?
Unique (%)34.8%

Sample

1st row9790000000000
2nd row8959480010
3rd row9790000000000
4th row891075012X
5th row9790000000000
ValueCountFrequency (%)
9790000000000 1734
37.2%
9780000000000 181
 
3.9%
12291366 107
 
2.3%
10156348 87
 
1.9%
12258741 68
 
1.5%
17383269 39
 
0.8%
12256382 26
 
0.6%
12267988 23
 
0.5%
809091775 21
 
0.5%
12278459 18
 
0.4%
Other values (1852) 2354
50.5%
2023-12-13T06:57:54.822744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 22745
41.1%
9 7206
 
13.0%
4386
 
7.9%
7 3973
 
7.2%
8 3812
 
6.9%
1 2595
 
4.7%
3 2415
 
4.4%
2 2326
 
4.2%
5 2007
 
3.6%
4 1798
 
3.3%
Other values (7) 2038
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50640
91.6%
Space Separator 4386
 
7.9%
Uppercase Letter 161
 
0.3%
Lowercase Letter 110
 
0.2%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 22745
44.9%
9 7206
 
14.2%
7 3973
 
7.8%
8 3812
 
7.5%
1 2595
 
5.1%
3 2415
 
4.8%
2 2326
 
4.6%
5 2007
 
4.0%
4 1798
 
3.6%
6 1763
 
3.5%
Lowercase Letter
ValueCountFrequency (%)
x 105
95.5%
a 4
 
3.6%
c 1
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
X 153
95.0%
I 8
 
5.0%
Space Separator
ValueCountFrequency (%)
4386
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 55030
99.5%
Latin 271
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 22745
41.3%
9 7206
 
13.1%
4386
 
8.0%
7 3973
 
7.2%
8 3812
 
6.9%
1 2595
 
4.7%
3 2415
 
4.4%
2 2326
 
4.2%
5 2007
 
3.6%
4 1798
 
3.3%
Other values (2) 1767
 
3.2%
Latin
ValueCountFrequency (%)
X 153
56.5%
x 105
38.7%
I 8
 
3.0%
a 4
 
1.5%
c 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 55301
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22745
41.1%
9 7206
 
13.0%
4386
 
7.9%
7 3973
 
7.2%
8 3812
 
6.9%
1 2595
 
4.7%
3 2415
 
4.4%
2 2326
 
4.2%
5 2007
 
3.6%
4 1798
 
3.3%
Other values (7) 2038
 
3.7%

국가구분
Categorical

IMBALANCE 

Distinct35
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
ULK
8243 
US
 
786
GGK
 
391
JA
 
378
UK
 
72
Other values (30)
 
130

Length

Max length4
Median length3
Mean length2.8686
Min length2

Unique

Unique9 ?
Unique (%)0.1%

Sample

1st rowULK
2nd rowULK
3rd rowULK
4th rowULK
5th rowULK

Common Values

ValueCountFrequency (%)
ULK 8243
82.4%
US 786
 
7.9%
GGK 391
 
3.9%
JA 378
 
3.8%
UK 72
 
0.7%
FR 43
 
0.4%
BNK 10
 
0.1%
SZ 6
 
0.1%
HK 6
 
0.1%
TGK 5
 
0.1%
Other values (25) 60
 
0.6%

Length

2023-12-13T06:57:55.012296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ulk 8243
82.4%
us 786
 
7.9%
ggk 391
 
3.9%
ja 378
 
3.8%
uk 72
 
0.7%
fr 43
 
0.4%
bnk 10
 
0.1%
hk 6
 
0.1%
sz 6
 
0.1%
tgk 5
 
< 0.1%
Other values (25) 60
 
0.6%
Distinct1574
Distinct (%)15.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-10-21 00:00:00
Maximum2021-06-11 00:00:00
2023-12-13T06:57:55.176813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:57:55.346104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

분관정보
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
AA
9360 
<NA>
 
640

Length

Max length4
Median length2
Mean length2.128
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAA
2nd rowAA
3rd rowAA
4th rowAA
5th rowAA

Common Values

ValueCountFrequency (%)
AA 9360
93.6%
<NA> 640
 
6.4%

Length

2023-12-13T06:57:55.533785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:55.634678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
aa 9360
93.6%
na 640
 
6.4%

기관코드
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing207
Missing (%)2.1%
Memory size156.2 KiB

Sample

등록번호서명색인서명저자색인저자출판사색인출판사출판년도청구기호색인청구기호권책기호복본기호별치기호자료상태소장처코드소장처명언어코드국가코드분류번호저자기호자료유형코드자료유형마크유형구분마크유형입수유형포함자료보안등급초록제어번호국제표준도서번호국가구분입력일분관정보기관코드
2618240005102009년판 교통사고통계분석2009년판교통사고통계분석도로교통공단도로교통공단도로교통공단도로교통공단2009363.1251-ㄷ68ㄱ363.1251 ㄷ68ㄱ<NA>NaN<NA>54인천광역시지부KORULK363.1251ㄷ68ㄱEE공단연구보고서KKkormarc21출판기증IMG0<NA>52296<NA>ULK2009-10-12AA0
2160121667Enjoy 괌 : No plan! No problem!ENJOY괌NOPLANNOPROBLEM민보영민보영넥서스BOOKS넥서스BOOKS2016919.67-ㅁ984e919.67 ㅁ984e<NA>NaN<NA>11도서실KORULK919.67ㅁ984eMM단행본KKkormarc21구입<NA>0<NA>764319790000000000ULK2016-09-22AAAA
1398614041(알짜배기 세계여행 시리즈)중국알짜배기세계여행시리즈중국조창완조창완성하출판성하출판2005915.2-ㅈ688ㅈ915.2 ㅈ688ㅈ<NA>NaN<NA>11도서실KORULK915.2ㅈ688ㅈMM단행본KKkormarc21기증<NA>0<NA>441638959480010ULK2007-06-28AA0
1982119881카인의 후예 : 황순원 작품선카인의후예황순원작품선황순원황순원현대문학현대문학2011811.37-ㅎ761ㅋ811.37 ㅎ761ㅋ<NA>NaN<NA>11도서실KORULK811.37ㅎ761ㅋMM단행본KKkormarc21기증<NA>0<NA>655369790000000000ULK2013-05-28AANaN
1007210098교통공학 : 개정판교통공학개정판원제무.최재성원제무최재성박영사박영사2003629.04-ㅇ553ㄱ629.04 ㅇ553ㄱ<NA>NaN<NA>11도서실KORULK629.04ㅇ553ㄱMM단행본KKkormarc21구입<NA>0<NA>34136891075012XULK2005-08-23AA0
2859510000307성냥팔이 소녀는 누가 죽였을까 : 세상에서 가장 기묘한 22가지 재판 이야기성냥팔이소녀는누가죽였을까세상에서가장기묘한22가지재판이야기도진기.도진기추수밭추수밭2013340-ㄷ82ㅅ340 ㄷ82ㅅ<NA>NaN<NA>510광주.전남지부KORULK340ㄷ82ㅅMM단행본KKkormarc21구입<NA>0<NA>734199790000000000ULK2015-06-04AAAA
3171514001123규장각 각신들의 나날 : 정은궐 장편소설규장각각신들의나날정은궐장편소설정은궐정은궐파란미디어파란미디어2009811.32-ㅈ456ㄱ811.32 ㅈ456ㄱv.1NaN<NA>514부산교통방송KORULK811.32ㅈ456ㄱMM단행본KKkormarc21기증<NA>0<NA>553279790000000000ULK2010-09-16AA0
99469971경찰50년사경찰50년사경찰청경찰청경찰청 경찰사편찬위원회경찰청경찰사편찬위원회1995363.209-ㄱ313ㄱ363.209 ㄱ313ㄱ<NA>NaN<NA>11도서실KORULK363.209ㄱ313ㄱMM단행본KKkormarc21기증<NA>0<NA>12535<NA>ULK2004-02-05AA0
1427014325교통재현연수 해외출장 결과보고서교통재현연수해외출장결과보고서박원규.정호교 외1인박원규정호교외1인도로교통안전관리공단도로교통안전관리공단2007388.06-ㅂ426ㄱ-SI388.06 ㅂ426ㄱ SIv.251NaNSI11도서실KORULK388.06ㅂ426ㄱMM단행본KKkormarc21출판기증IMG0<NA>44858<NA>ULK2007-09-28AA0
74897506Asphalt Nation : How the automobile took over America and how we take it back.ASPHALTNATIONHOWTHEAUTOMOBILETOOKOVERAMERICAANDHOWWETAKEITBACKKay, Jane holtz.KAYJANEHOLTZCalifornia Press.CALIFORNIAPRESS1997303.48-K23a303.48 K23a<NA>NaN<NA>11도서실ENGUS303.48K23aMM단행본UUmarc21(us)기증<NA>0<NA>1072152021620US2002-05-08AA0
등록번호서명색인서명저자색인저자출판사색인출판사출판년도청구기호색인청구기호권책기호복본기호별치기호자료상태소장처코드소장처명언어코드국가코드분류번호저자기호자료유형코드자료유형마크유형구분마크유형입수유형포함자료보안등급초록제어번호국제표준도서번호국가구분입력일분관정보기관코드
54785491한글 윈도우 NT Workstation 4.0 : 한눈에 알 수 있는한글윈도우NTWORKSTATION40한눈에알수있는Joyce, JerryJOYCEJERRY영진영진1997004.6-J89ㅎ004.6 J89ㅎ<NA>NaN<NA>11도서실KORULK4.6J89ㅎMM단행본KKkormarc21<NA><NA>0<NA>66848931407963ULK1998-01-20AA0
262704000598(2011년)교통사고 잦은곳 기본개선계획 및 효과분석2011년교통사고잦은곳기본개선계획및효과분석도로교통공단 교통사고종합분석센터도로교통공단교통사고종합분석센터도로교통공단도로교통공단2011363.12565-ㄷ68ㄱ-SR363.12565 ㄷ68ㄱ SR2011, v.1NaNSR54인천광역시지부KORULK363.12565ㄷ68ㄱEE공단연구보고서KKkormarc21출판기증IMG0<NA>60254<NA>ULK2012-05-08AA0
271406000224교통사고조사실무편람교통사고조사실무편람이상두이상두맨투맨맨투맨1994363.1251-ㅇ749ㄱ363.1251 ㅇ749ㄱv.1NaN<NA>56강원지부KORULK363.1251ㅇ749ㄱMM단행본KKkormarc21구입기증<NA>0<NA>389<NA>ULK1996-11-05AA0
3038113000514교통수요관리론교통수요관리론황기연 외1황기연외1청문각청문각2001388.401-ㅎ733ㄱ388.401 ㅎ733ㄱ<NA>NaN<NA>513제주지부KORULK388.401ㅎ733ㄱMM단행본KKkormarc21구입<NA>0<NA>35405<NA>ULK2005-11-14AA0
278638000101환경친화적 도로건설요령환경친화적도로건설요령건설교통부건설교통부건설교통부건설교통부1999625.7-ㅊ362ㅎ-SK625.7 ㅊ362ㅎ SK<NA>NaNSK58대전.충남지부KORULK625.7ㅊ362ㅎMM단행본KKkormarc21구입<NA>0<NA>34789<NA>ULK2005-09-27AA0
20882098화물자동차운수산업 효율화를 위한 정책방안 = Research on Improving Efficiency of Trucking Industry in Korea화물자동차운수산업효율화를위한정책방안=RESEARCHONIMPROVINGEFFICIENCYOFTRUCKINGINDUSTRYINKOREA교통개발연구원교통개발연구원교통개발연구원교통개발연구원1998388.324-ㄱ443ㅎ-SK388.324 ㄱ443ㅎ SK<NA>NaNSK11도서실KORULK388.324ㄱ443ㅎMM단행본KKkormarc21<NA><NA>0<NA>73928987730433ULK1999-03-11AA0
3483216001390(외우지 않고 통으로 이해하는)통세계사외우지않고통으로이해하는통세계사김상훈김상훈다산북스다산북스2009909-ㄱ766ㅌ909 ㄱ766ㅌv.4NaN<NA>516대구교통방송KORULK909ㄱ766ㅌMM단행본KKkormarc21구입<NA>0<NA>521509790000000000ULK2009-09-09AA0
949954광고학개론광고학개론김원수김원수경문사경문사1988659.1-ㄱ863ㄱ659.1 ㄱ863ㄱ<NA>NaN<NA>11도서실KORULK659.1ㄱ863ㄱMM단행본KKkormarc21<NA><NA>0<NA>260<NA>ULK1996-10-26AA0
2369623786(양재호의) 교통기사 필기 기출편 : 2021 최신판양재호의교통기사필기기출편2021최신판양재호양재호TranBooksTRANBOOKS2021388.076-ㅇ289ㄱ388.076 ㅇ289ㄱ<NA>NaN<NA>11도서실KORICK388.076ㅇ289ㄱMM단행본KKkormarc21구입<NA>0<NA>83197<NA>ICK2021-01-25AAAA
95209543도시계량분석 : EXCEL을 이용한도시계량분석EXCEL을이용한원제무원제무박영사박영사2001519.5-ㅇ553ㄷ519.5 ㅇ553ㄷ<NA>NaN<NA>11도서실KORULK519.5ㅇ553ㄷMM단행본KKkormarc21구입<NA>0<NA>11507891030457ULK2003-05-26AA0