Overview

Dataset statistics

Number of variables31
Number of observations10000
Missing cells106912
Missing cells (%)34.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 MiB
Average record size in memory262.0 B

Variable types

Text11
Categorical12
Unsupported5
DateTime3

Dataset

Description역사관련 사이트 메타데이터 기반 통합 검색을 위하여 한국역사정보통합시스템이 제공 중인 역사 자료 메타데이터 중 유물유적 자료
Author교육부 국사편찬위원회
URLhttps://www.data.go.kr/data/15051039/fileData.do

Alerts

SUBJECT_KHON1 has constant value ""Constant
EDITOR is highly imbalanced (99.3%)Imbalance
UNIT is highly imbalanced (97.2%)Imbalance
PUBLISHER is highly imbalanced (98.8%)Imbalance
MAINTITLE has 3621 (36.2%) missing valuesMissing
ALTERNATIVE has 7967 (79.7%) missing valuesMissing
DOCSENDER has 10000 (100.0%) missing valuesMissing
AUTHOR has 9052 (90.5%) missing valuesMissing
TYPE has 10000 (100.0%) missing valuesMissing
TABLEOFCONTENTS has 10000 (100.0%) missing valuesMissing
ABSTRACT has 6481 (64.8%) missing valuesMissing
REQUIRES has 10000 (100.0%) missing valuesMissing
DATEEVENT has 10000 (100.0%) missing valuesMissing
DOCCREATED has 9462 (94.6%) missing valuesMissing
DOCISSUED has 9090 (90.9%) missing valuesMissing
CREATORSORT has 2683 (26.8%) missing valuesMissing
DATESORT has 8556 (85.6%) missing valuesMissing
URI_KHON has unique valuesUnique
URI_KHDP has unique valuesUnique
URL has unique valuesUnique
DOCSENDER is an unsupported type, check if it needs cleaning or further analysisUnsupported
TYPE is an unsupported type, check if it needs cleaning or further analysisUnsupported
TABLEOFCONTENTS is an unsupported type, check if it needs cleaning or further analysisUnsupported
REQUIRES is an unsupported type, check if it needs cleaning or further analysisUnsupported
DATEEVENT is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 14:36:03.387031
Analysis finished2023-12-12 14:36:08.060866
Duration4.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

URI_KHON
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:36:08.209002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length29
Mean length24.2088
Min length12

Characters and Unicode

Total characters242088
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowKH.NAHF.ag_003_0030_0020_0010
2nd rowKH.NAHF.ag_003_0070_0070_0050
3rd rowKH.NAHF.cr_005_0010_0020_0010_0020
4th rowKH.KSAC.4030
5th rowKH.NAHF.ag_002_0020_0040
ValueCountFrequency (%)
kh.nahf.ag_003_0030_0020_0010 1
 
< 0.1%
kh.nahf.ku_001_0010_0040_0030 1
 
< 0.1%
kh.ksac.1064 1
 
< 0.1%
kh.nahf.ku_003_0030_0040 1
 
< 0.1%
kh.nahf.ku_001_0020_0030_2170_0460 1
 
< 0.1%
kh.ksac.ands017 1
 
< 0.1%
kh.nahf.ku_002_0080_0010_0010 1
 
< 0.1%
kh.ksac.4918 1
 
< 0.1%
kh.nahf.ag_004_0020_0010_0410 1
 
< 0.1%
kh.ksac.5216 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T23:36:08.627109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 72933
30.1%
_ 27542
 
11.4%
. 20000
 
8.3%
H 16498
 
6.8%
K 12368
 
5.1%
1 12139
 
5.0%
2 9383
 
3.9%
A 8868
 
3.7%
N 6457
 
2.7%
F 6441
 
2.7%
Other values (40) 49459
20.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 118954
49.1%
Uppercase Letter 60134
24.8%
Connector Punctuation 27542
 
11.4%
Other Punctuation 20000
 
8.3%
Lowercase Letter 15458
 
6.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
H 16498
27.4%
K 12368
20.6%
A 8868
14.7%
N 6457
 
10.7%
F 6441
 
10.7%
C 2736
 
4.5%
S 2455
 
4.1%
P 1259
 
2.1%
D 1227
 
2.0%
I 1226
 
2.0%
Other values (9) 599
 
1.0%
Lowercase Letter
ValueCountFrequency (%)
k 5124
33.1%
u 3624
23.4%
a 1567
 
10.1%
g 1433
 
9.3%
r 873
 
5.6%
s 717
 
4.6%
h 578
 
3.7%
c 543
 
3.5%
n 270
 
1.7%
y 177
 
1.1%
Other values (9) 552
 
3.6%
Decimal Number
ValueCountFrequency (%)
0 72933
61.3%
1 12139
 
10.2%
2 9383
 
7.9%
3 6310
 
5.3%
4 5618
 
4.7%
6 3536
 
3.0%
5 2983
 
2.5%
7 2412
 
2.0%
8 2124
 
1.8%
9 1516
 
1.3%
Connector Punctuation
ValueCountFrequency (%)
_ 27542
100.0%
Other Punctuation
ValueCountFrequency (%)
. 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 166496
68.8%
Latin 75592
31.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
H 16498
21.8%
K 12368
16.4%
A 8868
11.7%
N 6457
 
8.5%
F 6441
 
8.5%
k 5124
 
6.8%
u 3624
 
4.8%
C 2736
 
3.6%
S 2455
 
3.2%
a 1567
 
2.1%
Other values (28) 9454
12.5%
Common
ValueCountFrequency (%)
0 72933
43.8%
_ 27542
 
16.5%
. 20000
 
12.0%
1 12139
 
7.3%
2 9383
 
5.6%
3 6310
 
3.8%
4 5618
 
3.4%
6 3536
 
2.1%
5 2983
 
1.8%
7 2412
 
1.4%
Other values (2) 3640
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 242088
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 72933
30.1%
_ 27542
 
11.4%
. 20000
 
8.3%
H 16498
 
6.8%
K 12368
 
5.1%
1 12139
 
5.0%
2 9383
 
3.9%
A 8868
 
3.7%
N 6457
 
2.7%
F 6441
 
2.7%
Other values (40) 49459
20.4%

MDCENTER
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
NAHF
6441 
KSAC
2353 
IDP
1197 
GASA
 
9

Length

Max length4
Median length4
Mean length3.8803
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNAHF
2nd rowNAHF
3rd rowNAHF
4th rowKSAC
5th rowNAHF

Common Values

ValueCountFrequency (%)
NAHF 6441
64.4%
KSAC 2353
 
23.5%
IDP 1197
 
12.0%
GASA 9
 
0.1%

Length

2023-12-12T23:36:08.815677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:08.928120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
nahf 6441
64.4%
ksac 2353
 
23.5%
idp 1197
 
12.0%
gasa 9
 
0.1%

SUBJECT_KHON
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KH.14.08.000
3624 
KH.14.02.000
1977 
KH.14.06.000
1422 
KH.14.03.000
1197 
KH.14.07.000
522 
Other values (4)
1258 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKH.14.06.000
2nd rowKH.14.06.000
3rd rowKH.14.10.000
4th rowKH.14.02.000
5th rowKH.14.06.000

Common Values

ValueCountFrequency (%)
KH.14.08.000 3624
36.2%
KH.14.02.000 1977
19.8%
KH.14.06.000 1422
 
14.2%
KH.14.03.000 1197
 
12.0%
KH.14.07.000 522
 
5.2%
KH.14.10.000 461
 
4.6%
KH.14.09.000 412
 
4.1%
KH.14.01.000 376
 
3.8%
KH.14.05.000 9
 
0.1%

Length

2023-12-12T23:36:09.048657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:09.184951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kh.14.08.000 3624
36.2%
kh.14.02.000 1977
19.8%
kh.14.06.000 1422
 
14.2%
kh.14.03.000 1197
 
12.0%
kh.14.07.000 522
 
5.2%
kh.14.10.000 461
 
4.6%
kh.14.09.000 412
 
4.1%
kh.14.01.000 376
 
3.8%
kh.14.05.000 9
 
0.1%

DBINFO
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고구려문화유산자료
3624 
영남유학유물
1977 
암각화자료
1422 
국외독립운동유적지
661 
국내독립운동유적지
536 
Other values (5)
1780 

Length

Max length9
Median length8
Mean length7.4387
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row암각화자료
2nd row암각화자료
3rd row도록·보고서
4th row영남유학유물
5th row암각화자료

Common Values

ValueCountFrequency (%)
고구려문화유산자료 3624
36.2%
영남유학유물 1977
19.8%
암각화자료 1422
 
14.2%
국외독립운동유적지 661
 
6.6%
국내독립운동유적지 536
 
5.4%
고구려고분벽화 522
 
5.2%
도록·보고서 461
 
4.6%
크라스키노발해성 412
 
4.1%
영남유교유적 376
 
3.8%
한국가사문학 9
 
0.1%

Length

2023-12-12T23:36:09.364283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:09.528493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고구려문화유산자료 3624
36.2%
영남유학유물 1977
19.8%
암각화자료 1422
 
14.2%
국외독립운동유적지 661
 
6.6%
국내독립운동유적지 536
 
5.4%
고구려고분벽화 522
 
5.2%
도록·보고서 461
 
4.6%
크라스키노발해성 412
 
4.1%
영남유교유적 376
 
3.8%
한국가사문학 9
 
0.1%

URI_KHDP
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:36:10.024379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length21
Mean length16.3285
Min length4

Characters and Unicode

Total characters163285
Distinct characters47
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowag_003_0030_0020_0010
2nd rowag_003_0070_0070_0050
3rd rowcr_005_0010_0020_0010_0020
4th row4030
5th rowag_002_0020_0040
ValueCountFrequency (%)
ag_003_0030_0020_0010 1
 
< 0.1%
ku_001_0010_0040_0030 1
 
< 0.1%
1064 1
 
< 0.1%
ku_003_0030_0040 1
 
< 0.1%
ku_001_0020_0030_2170_0460 1
 
< 0.1%
ands017 1
 
< 0.1%
ku_002_0080_0010_0010 1
 
< 0.1%
4918 1
 
< 0.1%
ag_004_0020_0010_0410 1
 
< 0.1%
5216 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T23:36:10.383805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 72933
44.7%
_ 27542
 
16.9%
1 12139
 
7.4%
2 9383
 
5.7%
3 6310
 
3.9%
4 5618
 
3.4%
k 5124
 
3.1%
u 3624
 
2.2%
6 3536
 
2.2%
5 2983
 
1.8%
Other values (37) 14093
 
8.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 118954
72.9%
Connector Punctuation 27542
 
16.9%
Lowercase Letter 15458
 
9.5%
Uppercase Letter 1331
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
k 5124
33.1%
u 3624
23.4%
a 1567
 
10.1%
g 1433
 
9.3%
r 873
 
5.6%
s 717
 
4.6%
h 578
 
3.7%
c 543
 
3.5%
n 270
 
1.7%
y 177
 
1.1%
Other values (9) 552
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
C 383
28.8%
U 224
16.8%
R 115
 
8.6%
S 93
 
7.0%
M 88
 
6.6%
B 69
 
5.2%
P 62
 
4.7%
H 57
 
4.3%
A 56
 
4.2%
J 50
 
3.8%
Other values (7) 134
 
10.1%
Decimal Number
ValueCountFrequency (%)
0 72933
61.3%
1 12139
 
10.2%
2 9383
 
7.9%
3 6310
 
5.3%
4 5618
 
4.7%
6 3536
 
3.0%
5 2983
 
2.5%
7 2412
 
2.0%
8 2124
 
1.8%
9 1516
 
1.3%
Connector Punctuation
ValueCountFrequency (%)
_ 27542
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 146496
89.7%
Latin 16789
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
k 5124
30.5%
u 3624
21.6%
a 1567
 
9.3%
g 1433
 
8.5%
r 873
 
5.2%
s 717
 
4.3%
h 578
 
3.4%
c 543
 
3.2%
C 383
 
2.3%
n 270
 
1.6%
Other values (26) 1677
 
10.0%
Common
ValueCountFrequency (%)
0 72933
49.8%
_ 27542
 
18.8%
1 12139
 
8.3%
2 9383
 
6.4%
3 6310
 
4.3%
4 5618
 
3.8%
6 3536
 
2.4%
5 2983
 
2.0%
7 2412
 
1.6%
8 2124
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 163285
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 72933
44.7%
_ 27542
 
16.9%
1 12139
 
7.4%
2 9383
 
5.7%
3 6310
 
3.9%
4 5618
 
3.4%
k 5124
 
3.1%
u 3624
 
2.2%
6 3536
 
2.2%
5 2983
 
1.8%
Other values (37) 14093
 
8.6%

MAINTITLE
Text

MISSING 

Distinct5547
Distinct (%)87.0%
Missing3621
Missing (%)36.2%
Memory size156.2 KiB
2023-12-12T23:36:10.757395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length38
Mean length12.3847
Min length1

Characters and Unicode

Total characters79002
Distinct characters2254
Distinct categories15 ?
Distinct scripts5 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5182 ?
Unique (%)81.2%

Sample

1st row서쪽 부분(동물들)
2nd row사람
3rd row하고성자성(下古城子城)
4th row명문(전답매매문기)(明文(田畓賣買文記))
5th row조라그트 하드 2암면
ValueCountFrequency (%)
부분 283
 
2.2%
180
 
1.4%
우주르 179
 
1.4%
하단 177
 
1.4%
산양 111
 
0.9%
사슴 103
 
0.8%
암면 94
 
0.7%
전경 80
 
0.6%
동물들 78
 
0.6%
안학궁 65
 
0.5%
Other values (6296) 11419
89.4%
2023-12-12T23:36:11.233956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6392
 
8.1%
) 4060
 
5.1%
( 4059
 
5.1%
1096
 
1.4%
1 926
 
1.2%
865
 
1.1%
848
 
1.1%
819
 
1.0%
_ 718
 
0.9%
674
 
0.9%
Other values (2244) 58545
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58124
73.6%
Space Separator 6392
 
8.1%
Close Punctuation 4065
 
5.1%
Open Punctuation 4064
 
5.1%
Decimal Number 3687
 
4.7%
Lowercase Letter 813
 
1.0%
Connector Punctuation 718
 
0.9%
Dash Punctuation 439
 
0.6%
Other Punctuation 376
 
0.5%
Uppercase Letter 257
 
0.3%
Other values (5) 67
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1096
 
1.9%
865
 
1.5%
848
 
1.5%
819
 
1.4%
674
 
1.2%
673
 
1.2%
641
 
1.1%
639
 
1.1%
553
 
1.0%
551
 
0.9%
Other values (2113) 50765
87.3%
Lowercase Letter
ValueCountFrequency (%)
a 116
 
14.3%
n 73
 
9.0%
o 53
 
6.5%
i 52
 
6.4%
e 46
 
5.7%
l 35
 
4.3%
m 30
 
3.7%
u 28
 
3.4%
t 28
 
3.4%
c 26
 
3.2%
Other values (42) 326
40.1%
Uppercase Letter
ValueCountFrequency (%)
X 43
16.7%
I 35
13.6%
S 26
10.1%
C 20
 
7.8%
M 18
 
7.0%
V 12
 
4.7%
P 11
 
4.3%
A 11
 
4.3%
L 10
 
3.9%
H 7
 
2.7%
Other values (23) 64
24.9%
Other Punctuation
ValueCountFrequency (%)
· 136
36.2%
, 89
23.7%
. 48
 
12.8%
; 32
 
8.5%
: 23
 
6.1%
& 14
 
3.7%
# 14
 
3.7%
" 8
 
2.1%
? 6
 
1.6%
/ 4
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 926
25.1%
2 574
15.6%
3 514
13.9%
0 414
11.2%
4 301
 
8.2%
8 218
 
5.9%
5 199
 
5.4%
7 195
 
5.3%
6 186
 
5.0%
9 160
 
4.3%
Math Symbol
ValueCountFrequency (%)
~ 21
60.0%
5
 
14.3%
5
 
14.3%
× 3
 
8.6%
1
 
2.9%
Letter Number
ValueCountFrequency (%)
5
26.3%
4
21.1%
4
21.1%
3
15.8%
3
15.8%
Close Punctuation
ValueCountFrequency (%)
) 4060
99.9%
2
 
< 0.1%
] 2
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4059
99.9%
2
 
< 0.1%
[ 2
 
< 0.1%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
6392
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 718
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 439
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43660
55.3%
Common 19789
25.0%
Han 14464
 
18.3%
Latin 901
 
1.1%
Cyrillic 188
 
0.2%

Most frequent character per script

Han
ValueCountFrequency (%)
403
 
2.8%
381
 
2.6%
367
 
2.5%
305
 
2.1%
269
 
1.9%
207
 
1.4%
203
 
1.4%
183
 
1.3%
169
 
1.2%
158
 
1.1%
Other values (1449) 11819
81.7%
Hangul
ValueCountFrequency (%)
1096
 
2.5%
865
 
2.0%
848
 
1.9%
819
 
1.9%
674
 
1.5%
673
 
1.5%
641
 
1.5%
639
 
1.5%
553
 
1.3%
551
 
1.3%
Other values (654) 36301
83.1%
Latin
ValueCountFrequency (%)
a 116
 
12.9%
n 73
 
8.1%
o 53
 
5.9%
i 52
 
5.8%
e 46
 
5.1%
X 43
 
4.8%
l 35
 
3.9%
I 35
 
3.9%
m 30
 
3.3%
u 28
 
3.1%
Other values (45) 390
43.3%
Common
ValueCountFrequency (%)
6392
32.3%
) 4060
20.5%
( 4059
20.5%
1 926
 
4.7%
_ 718
 
3.6%
2 574
 
2.9%
3 514
 
2.6%
- 439
 
2.2%
0 414
 
2.1%
4 301
 
1.5%
Other values (31) 1392
 
7.0%
Cyrillic
ValueCountFrequency (%)
а 19
 
10.1%
о 18
 
9.6%
и 18
 
9.6%
с 13
 
6.9%
р 13
 
6.9%
е 12
 
6.4%
к 9
 
4.8%
т 8
 
4.3%
н 8
 
4.3%
д 7
 
3.7%
Other values (25) 63
33.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43657
55.3%
ASCII 20502
26.0%
CJK 14463
 
18.3%
Cyrillic 188
 
0.2%
None 145
 
0.2%
Number Forms 19
 
< 0.1%
Math Operators 11
 
< 0.1%
Punctuation 8
 
< 0.1%
Specials 3
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6392
31.2%
) 4060
19.8%
( 4059
19.8%
1 926
 
4.5%
_ 718
 
3.5%
2 574
 
2.8%
3 514
 
2.5%
- 439
 
2.1%
0 414
 
2.0%
4 301
 
1.5%
Other values (68) 2105
 
10.3%
Hangul
ValueCountFrequency (%)
1096
 
2.5%
865
 
2.0%
848
 
1.9%
819
 
1.9%
674
 
1.5%
673
 
1.5%
641
 
1.5%
639
 
1.5%
553
 
1.3%
551
 
1.3%
Other values (653) 36298
83.1%
CJK
ValueCountFrequency (%)
403
 
2.8%
381
 
2.6%
367
 
2.5%
305
 
2.1%
269
 
1.9%
207
 
1.4%
203
 
1.4%
183
 
1.3%
169
 
1.2%
158
 
1.1%
Other values (1448) 11818
81.7%
None
ValueCountFrequency (%)
· 136
93.8%
× 3
 
2.1%
2
 
1.4%
2
 
1.4%
1
 
0.7%
1
 
0.7%
Cyrillic
ValueCountFrequency (%)
а 19
 
10.1%
о 18
 
9.6%
и 18
 
9.6%
с 13
 
6.9%
р 13
 
6.9%
е 12
 
6.4%
к 9
 
4.8%
т 8
 
4.3%
н 8
 
4.3%
д 7
 
3.7%
Other values (25) 63
33.5%
Number Forms
ValueCountFrequency (%)
5
26.3%
4
21.1%
4
21.1%
3
15.8%
3
15.8%
Math Operators
ValueCountFrequency (%)
5
45.5%
5
45.5%
1
 
9.1%
Punctuation
ValueCountFrequency (%)
4
50.0%
4
50.0%
Specials
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
CJK Ext A
ValueCountFrequency (%)
1
100.0%

ALTERNATIVE
Text

MISSING 

Distinct56
Distinct (%)2.8%
Missing7967
Missing (%)79.7%
Memory size156.2 KiB
2023-12-12T23:36:11.536468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length8
Mean length8.1765863
Min length2

Characters and Unicode

Total characters16623
Distinct characters271
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)2.7%

Sample

1st row영남유교문화유물
2nd row영남유교문화유물
3rd row영남유교문화유물
4th row영남유교문화유물
5th row영남유교문화유물
ValueCountFrequency (%)
영남유교문화유물 1977
92.3%
번째 8
 
0.4%
도성 7
 
0.3%
발해의 5
 
0.2%
도면 5
 
0.2%
중심으로 3
 
0.1%
사진 3
 
0.1%
고분군 3
 
0.1%
고구려의 3
 
0.1%
도판 3
 
0.1%
Other values (114) 124
 
5.8%
2023-12-12T23:36:12.005330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3955
23.8%
1978
11.9%
1978
11.9%
1977
11.9%
1977
11.9%
1977
11.9%
1977
11.9%
108
 
0.6%
18
 
0.1%
( 17
 
0.1%
Other values (261) 661
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16259
97.8%
Space Separator 108
 
0.6%
Lowercase Letter 92
 
0.6%
Decimal Number 60
 
0.4%
Uppercase Letter 46
 
0.3%
Open Punctuation 18
 
0.1%
Close Punctuation 18
 
0.1%
Dash Punctuation 13
 
0.1%
Other Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3955
24.3%
1978
12.2%
1978
12.2%
1977
12.2%
1977
12.2%
1977
12.2%
1977
12.2%
18
 
0.1%
14
 
0.1%
12
 
0.1%
Other values (202) 396
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
a 10
10.9%
o 10
10.9%
i 10
10.9%
r 9
9.8%
s 7
 
7.6%
e 7
 
7.6%
n 5
 
5.4%
y 4
 
4.3%
u 4
 
4.3%
t 4
 
4.3%
Other values (12) 22
23.9%
Uppercase Letter
ValueCountFrequency (%)
K 5
 
10.9%
E 4
 
8.7%
R 4
 
8.7%
S 4
 
8.7%
O 3
 
6.5%
A 3
 
6.5%
G 3
 
6.5%
P 3
 
6.5%
H 2
 
4.3%
Y 2
 
4.3%
Other values (9) 13
28.3%
Decimal Number
ValueCountFrequency (%)
1 15
25.0%
4 8
13.3%
7 6
 
10.0%
9 6
 
10.0%
0 6
 
10.0%
6 5
 
8.3%
2 5
 
8.3%
3 4
 
6.7%
8 3
 
5.0%
5 2
 
3.3%
Open Punctuation
ValueCountFrequency (%)
( 17
94.4%
1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 17
94.4%
1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
; 1
 
11.1%
Space Separator
ValueCountFrequency (%)
108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16112
96.9%
Common 226
 
1.4%
Han 144
 
0.9%
Latin 138
 
0.8%
Hiragana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3955
24.5%
1978
12.3%
1978
12.3%
1977
12.3%
1977
12.3%
1977
12.3%
1977
12.3%
18
 
0.1%
14
 
0.1%
12
 
0.1%
Other values (109) 249
 
1.5%
Han
ValueCountFrequency (%)
5
 
3.5%
5
 
3.5%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
Other values (80) 102
70.8%
Latin
ValueCountFrequency (%)
a 10
 
7.2%
o 10
 
7.2%
i 10
 
7.2%
r 9
 
6.5%
s 7
 
5.1%
e 7
 
5.1%
K 5
 
3.6%
n 5
 
3.6%
E 4
 
2.9%
y 4
 
2.9%
Other values (31) 67
48.6%
Common
ValueCountFrequency (%)
108
47.8%
( 17
 
7.5%
) 17
 
7.5%
1 15
 
6.6%
- 13
 
5.8%
, 8
 
3.5%
4 8
 
3.5%
7 6
 
2.7%
9 6
 
2.7%
0 6
 
2.7%
Other values (8) 22
 
9.7%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16112
96.9%
ASCII 362
 
2.2%
CJK 144
 
0.9%
Hiragana 3
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3955
24.5%
1978
12.3%
1978
12.3%
1977
12.3%
1977
12.3%
1977
12.3%
1977
12.3%
18
 
0.1%
14
 
0.1%
12
 
0.1%
Other values (109) 249
 
1.5%
ASCII
ValueCountFrequency (%)
108
29.8%
( 17
 
4.7%
) 17
 
4.7%
1 15
 
4.1%
- 13
 
3.6%
a 10
 
2.8%
o 10
 
2.8%
i 10
 
2.8%
r 9
 
2.5%
, 8
 
2.2%
Other values (47) 145
40.1%
CJK
ValueCountFrequency (%)
5
 
3.5%
5
 
3.5%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
Other values (80) 102
70.8%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

DOCSENDER
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

EDITOR
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9991 
미상
 
8
송강유족보존회
 
1

Length

Max length7
Median length4
Mean length3.9987
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9991
99.9%
미상 8
 
0.1%
송강유족보존회 1
 
< 0.1%

Length

2023-12-12T23:36:12.184196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:12.403820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9991
99.9%
미상 8
 
0.1%
송강유족보존회 1
 
< 0.1%

AUTHOR
Text

MISSING 

Distinct677
Distinct (%)71.4%
Missing9052
Missing (%)90.5%
Memory size156.2 KiB
2023-12-12T23:36:12.716200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length38
Mean length9.0559072
Min length2

Characters and Unicode

Total characters8585
Distinct characters629
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique578 ?
Unique (%)61.0%

Sample

1st row이휘준(李彙濬)
2nd row고흥유림 및 면민
3rd row항일독립운동기념탑건립추진위원회
4th row권주(權柱)
5th row권정하(權靖夏)
ValueCountFrequency (%)
미상 52
 
3.5%
이상정(李象靖 27
 
1.8%
건립위원회 26
 
1.7%
건립추진위원회 25
 
1.7%
기념사업회 20
 
1.3%
유경시(柳敬時 17
 
1.1%
유림 16
 
1.1%
15
 
1.0%
14
 
0.9%
이조(吏曹)/이현보(李賢輔 14
 
0.9%
Other values (904) 1274
84.9%
2023-12-12T23:36:13.279647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
553
 
6.4%
) 331
 
3.9%
( 330
 
3.8%
285
 
3.3%
192
 
2.2%
166
 
1.9%
161
 
1.9%
140
 
1.6%
139
 
1.6%
137
 
1.6%
Other values (619) 6151
71.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6664
77.6%
Space Separator 553
 
6.4%
Close Punctuation 335
 
3.9%
Open Punctuation 334
 
3.9%
Other Punctuation 297
 
3.5%
Lowercase Letter 141
 
1.6%
Uppercase Letter 129
 
1.5%
Decimal Number 117
 
1.4%
Math Symbol 15
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
285
 
4.3%
192
 
2.9%
166
 
2.5%
161
 
2.4%
140
 
2.1%
139
 
2.1%
137
 
2.1%
119
 
1.8%
119
 
1.8%
118
 
1.8%
Other values (543) 5088
76.4%
Uppercase Letter
ValueCountFrequency (%)
И 16
12.4%
Е 13
 
10.1%
В 13
 
10.1%
А 11
 
8.5%
I 10
 
7.8%
V 9
 
7.0%
A 7
 
5.4%
Л 6
 
4.7%
Б 6
 
4.7%
E 5
 
3.9%
Other values (18) 33
25.6%
Lowercase Letter
ValueCountFrequency (%)
е 17
12.1%
н 15
10.6%
а 15
10.6%
и 12
8.5%
в 12
8.5%
о 11
 
7.8%
л 11
 
7.8%
к 8
 
5.7%
с 7
 
5.0%
т 4
 
2.8%
Other values (14) 29
20.6%
Decimal Number
ValueCountFrequency (%)
1 32
27.4%
3 30
25.6%
4 16
13.7%
5 11
 
9.4%
0 8
 
6.8%
6 6
 
5.1%
8 6
 
5.1%
9 3
 
2.6%
7 3
 
2.6%
2 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 99
33.3%
/ 71
23.9%
, 70
23.6%
· 36
 
12.1%
? 15
 
5.1%
; 2
 
0.7%
& 2
 
0.7%
2
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 331
98.8%
] 4
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 330
98.8%
[ 4
 
1.2%
Space Separator
ValueCountFrequency (%)
553
100.0%
Math Symbol
ValueCountFrequency (%)
+ 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5732
66.8%
Common 1651
 
19.2%
Han 932
 
10.9%
Cyrillic 211
 
2.5%
Latin 59
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
285
 
5.0%
192
 
3.3%
166
 
2.9%
161
 
2.8%
140
 
2.4%
139
 
2.4%
137
 
2.4%
119
 
2.1%
119
 
2.1%
118
 
2.1%
Other values (313) 4156
72.5%
Han
ValueCountFrequency (%)
88
 
9.4%
55
 
5.9%
41
 
4.4%
34
 
3.6%
30
 
3.2%
29
 
3.1%
28
 
3.0%
28
 
3.0%
28
 
3.0%
25
 
2.7%
Other values (220) 546
58.6%
Cyrillic
ValueCountFrequency (%)
е 17
 
8.1%
И 16
 
7.6%
н 15
 
7.1%
а 15
 
7.1%
Е 13
 
6.2%
В 13
 
6.2%
и 12
 
5.7%
в 12
 
5.7%
о 11
 
5.2%
л 11
 
5.2%
Other values (20) 76
36.0%
Common
ValueCountFrequency (%)
553
33.5%
) 331
20.0%
( 330
20.0%
. 99
 
6.0%
/ 71
 
4.3%
, 70
 
4.2%
· 36
 
2.2%
1 32
 
1.9%
3 30
 
1.8%
4 16
 
1.0%
Other values (14) 83
 
5.0%
Latin
ValueCountFrequency (%)
I 10
16.9%
V 9
15.3%
A 7
11.9%
E 5
 
8.5%
S 3
 
5.1%
L 2
 
3.4%
t 2
 
3.4%
O 2
 
3.4%
T 2
 
3.4%
B 2
 
3.4%
Other values (12) 15
25.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5732
66.8%
ASCII 1672
 
19.5%
CJK 930
 
10.8%
Cyrillic 211
 
2.5%
None 38
 
0.4%
CJK Ext A 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
553
33.1%
) 331
19.8%
( 330
19.7%
. 99
 
5.9%
/ 71
 
4.2%
, 70
 
4.2%
1 32
 
1.9%
3 30
 
1.8%
4 16
 
1.0%
+ 15
 
0.9%
Other values (34) 125
 
7.5%
Hangul
ValueCountFrequency (%)
285
 
5.0%
192
 
3.3%
166
 
2.9%
161
 
2.8%
140
 
2.4%
139
 
2.4%
137
 
2.4%
119
 
2.1%
119
 
2.1%
118
 
2.1%
Other values (313) 4156
72.5%
CJK
ValueCountFrequency (%)
88
 
9.5%
55
 
5.9%
41
 
4.4%
34
 
3.7%
30
 
3.2%
29
 
3.1%
28
 
3.0%
28
 
3.0%
28
 
3.0%
25
 
2.7%
Other values (218) 544
58.5%
None
ValueCountFrequency (%)
· 36
94.7%
2
 
5.3%
Cyrillic
ValueCountFrequency (%)
е 17
 
8.1%
И 16
 
7.6%
н 15
 
7.1%
а 15
 
7.1%
Е 13
 
6.2%
В 13
 
6.2%
и 12
 
5.7%
в 12
 
5.7%
о 11
 
5.2%
л 11
 
5.2%
Other values (20) 76
36.0%
CJK Ext A
ValueCountFrequency (%)
1
50.0%
1
50.0%

SUBJECT_KHON1
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KH.14
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKH.14
2nd rowKH.14
3rd rowKH.14
4th rowKH.14
5th rowKH.14

Common Values

ValueCountFrequency (%)
KH.14 10000
100.0%

Length

2023-12-12T23:36:13.474277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:13.569597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kh.14 10000
100.0%

SUBJECT_KHON2
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KH.14.08
3624 
KH.14.02
1977 
KH.14.06
1422 
KH.14.03
1197 
KH.14.07
522 
Other values (4)
1258 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKH.14.06
2nd rowKH.14.06
3rd rowKH.14.10
4th rowKH.14.02
5th rowKH.14.06

Common Values

ValueCountFrequency (%)
KH.14.08 3624
36.2%
KH.14.02 1977
19.8%
KH.14.06 1422
 
14.2%
KH.14.03 1197
 
12.0%
KH.14.07 522
 
5.2%
KH.14.10 461
 
4.6%
KH.14.09 412
 
4.1%
KH.14.01 376
 
3.8%
KH.14.05 9
 
0.1%

Length

2023-12-12T23:36:13.671557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:13.800100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kh.14.08 3624
36.2%
kh.14.02 1977
19.8%
kh.14.06 1422
 
14.2%
kh.14.03 1197
 
12.0%
kh.14.07 522
 
5.2%
kh.14.10 461
 
4.6%
kh.14.09 412
 
4.1%
kh.14.01 376
 
3.8%
kh.14.05 9
 
0.1%

SUBJECT_KHDP
Categorical

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
ku
3624 
YCON03
1977 
ag
1422 
IDP-RU-002
661 
IDP-RU-001
536 
Other values (6)
1780 

Length

Max length10
Median length2
Mean length3.7878
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowag
2nd rowag
3rd rowcr
4th rowYCON03
5th rowag

Common Values

ValueCountFrequency (%)
ku 3624
36.2%
YCON03 1977
19.8%
ag 1422
 
14.2%
IDP-RU-002 661
 
6.6%
IDP-RU-001 536
 
5.4%
kk 522
 
5.2%
cr 461
 
4.6%
kr 412
 
4.1%
RIN 376
 
3.8%
EF02 6
 
0.1%

Length

2023-12-12T23:36:13.971331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ku 3624
36.2%
ycon03 1977
19.8%
ag 1422
 
14.2%
idp-ru-002 661
 
6.6%
idp-ru-001 536
 
5.4%
kk 522
 
5.2%
cr 461
 
4.6%
kr 412
 
4.1%
rin 376
 
3.8%
ef02 6
 
0.1%

TYPE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

UNIT
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
9972 
1
 
28

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 9972
99.7%
1 28
 
0.3%

Length

2023-12-12T23:36:14.188998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:14.298917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 9972
99.7%
1 28
 
0.3%

PUBLISHER
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9966 
동북아역사재단
 
19
고구려연구재단
 
5
동북아역사재단, 러시아과학원 극동분소 역사고고민속학연구소
 
3
동북아역사재단·몽골과학아카데미 고고학연구소
 
2
Other values (5)
 
5

Length

Max length36
Median length4
Mean length4.0262
Min length3

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9966
99.7%
동북아역사재단 19
 
0.2%
고구려연구재단 5
 
0.1%
동북아역사재단, 러시아과학원 극동분소 역사고고민속학연구소 3
 
< 0.1%
동북아역사재단·몽골과학아카데미 고고학연구소 2
 
< 0.1%
동북아역사재단/카자흐스탄 교육과학부 고고학연구소 1
 
< 0.1%
김학준 1
 
< 0.1%
조선총독부박물관 1
 
< 0.1%
동북아역사재단 / 키르기스스탄국립대학교 역사-지역학부·박물관연구소 1
 
< 0.1%
서울대학교 박물관, 동북아역사재단 1
 
< 0.1%

Length

2023-12-12T23:36:14.432747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:14.588361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9966
99.5%
동북아역사재단 24
 
0.2%
고구려연구재단 5
 
< 0.1%
러시아과학원 3
 
< 0.1%
극동분소 3
 
< 0.1%
역사고고민속학연구소 3
 
< 0.1%
고고학연구소 3
 
< 0.1%
동북아역사재단·몽골과학아카데미 2
 
< 0.1%
1
 
< 0.1%
서울대학교 1
 
< 0.1%
Other values (7) 7
 
0.1%

FORMAT_MEDIUM
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
text/xml
6441 
text/xml|image/jpeg
1977 
text/xml,image/jpeg
1197 
image/jpeg
 
376
text/html
 
9

Length

Max length19
Median length8
Mean length11.5675
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowtext/xml
2nd rowtext/xml
3rd rowtext/xml
4th rowtext/xml|image/jpeg
5th rowtext/xml

Common Values

ValueCountFrequency (%)
text/xml 6441
64.4%
text/xml|image/jpeg 1977
 
19.8%
text/xml,image/jpeg 1197
 
12.0%
image/jpeg 376
 
3.8%
text/html 9
 
0.1%

Length

2023-12-12T23:36:14.770394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:36:14.913249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
text/xml 6441
64.4%
text/xml|image/jpeg 1977
 
19.8%
text/xml,image/jpeg 1197
 
12.0%
image/jpeg 376
 
3.8%
text/html 9
 
0.1%

TABLEOFCONTENTS
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

ABSTRACT
Text

MISSING 

Distinct3416
Distinct (%)97.1%
Missing6481
Missing (%)64.8%
Memory size156.2 KiB
2023-12-12T23:36:15.383125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length636
Mean length239.30009
Min length12

Characters and Unicode

Total characters842097
Distinct characters3922
Distinct categories19 ?
Distinct scripts7 ?
Distinct blocks18 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3356 ?
Unique (%)95.4%

Sample

1st row이는 고성이씨 탑동종가 소장자료의 하나로 전답매매문기이다. 이 문기에 의하면, 1838년 12월 23일 전주(田主)인 이득삼(李得三)이 탑동댁공소(塔洞宅公所)로 낭자전(廊字田) 6복(卜) 6속(束) 4두락지(斗落只)를 20냥에 매매한다는 것이다. 이러한 토지매매문서를 명문(明文)이라 하는데, 여기서 명문을 받는 이는 보통 양반의 경우 자신의 가노(家奴)를 시켜 매매행위를 하는 것이 일반적이기 때문에 가노 앞으로 명문을 발급하는 것이 일반적이다. 그런데 이 문서에서는 직접 '탑동댁공소'로 명기한 것이 특이한 경우에 해당한다. 매매문서는 '문서의 제목, 매매 내용(토지의 소재지, 지번, 매매토지 규모, 매매액, 분쟁시의 조치사항)'으로 구성되며, 뒤에는 '매주(賣主)와 증인, 필집(筆執)'등의 순으로 서명하는 것이 일반적인 관행이었다.
2nd row&#xD; 소재지 :
3rd row『성학십도(聖學十圖)』는 본래 조선 중기의 학자 이황(李滉:1501~1570)이 1568년(선조 1) 12월에 성학(聖學)의 개요를 국왕에게 올린 상소문 형식의 글이다. 그후 이황이 경연(經筵)에 입시하였을 때 선조가 성군이 되기를 바라면서 성학의 대강을 강의하고 심법(心法)의 요점을 설명하기 위하여 여러 성리학자들의 도설(圖說)에서 골라 책을 엮고, 각 도식 아래 자신의 의견을 서술하여 왕에게 강론한 이래, 1681년(숙종 7) 오도일(吳道一)이 간행하였으며, 1741년(영조 17) 중간되었다. 성학(聖學)이란 제왕학(帝王學)을 말하는데, 조선중기 이황 단계에서는 군주(君主)의 마음을 바로 잡는다면 국가의 질서를 바로 잡을 수 있다고 파악한 것이다. 십도(十圖)란 태극도(太極圖)·서명도(西銘圖)·소학도(小學圖)·대학도(大學圖)·백록동규도(白鹿洞規圖)·심통성정도(心統性情圖)·인설도(仁說圖)·심학도(心學圖)·경재잠도(敬齋箴圖)·숙흥야매잠도(夙興夜寐箴圖)의 10가지이다.
4th row신랑이 신부의 집에 가서 신부를 맞이는 의식의 하나인 친영(親迎 : 신랑이 신부 집에 가서 신부를 맞이하는 의식)의 초례(醮禮)를 끝낸 신랑이 신부 집에서 3일을 지낸 후에 친가(親家)로 귀가할 때에 신부 모친인 권씨가 사돈되는 신랑 모친에게 퇴상(退床)의 음식과 함께 상장(上狀)이라 하여 보내는 편지이다. 안부 인사와 함께 사돈의 넓은 도량으로 친딸같이 가르쳐주기를 바라며, 가난한 탓에 분황과 초를 보내게 되어 무안하니 용서하시기를 빈다는 편지이다.
5th row정호선(丁好善: 1571∼1633)이 선몽대에서 지은 시를 새긴 시판이다. 정호선이 선몽대를 방문하였을 때 벽에 걸린 선몽대 주인의 시를 보고, 그것에 차운하여 이 시를 지었다. 이 시는 7언 절구 2수이다. 첫째 수에서는 선몽대의 경치에 찬탄하고 주인과 담소하며 우정을 나눈 행복한 시간을 읊고 있다. 둘째 수에서는 복잡하고 어지러운 세상사에서 떠나 이 곳 선몽대에서 유유자적하며 살고 싶은 심정을 노래하고 있다.시판의 왼쪽 부분에는 이 시판이 만들어진 경위가 기록되어 있다. 이 시판은 정호선의 6대손인 정재원(丁載遠)이 선몽대를 방문했을 때, 선몽대 주인이 정호선의 시 2수를 보여주었다. 이에 정재원은 아들 정약용(丁若鏞: 1762~1836)에게 시켜서 이 시판을 새기게 하였다.
ValueCountFrequency (%)
있다 1807
 
1.1%
1713
 
1.0%
1548
 
0.9%
소재지 1204
 
0.7%
있는 695
 
0.4%
683
 
0.4%
xd 661
 
0.4%
것이다 619
 
0.4%
호는 589
 
0.4%
자는 511
 
0.3%
Other values (52948) 155042
93.9%
2023-12-12T23:36:16.034111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164396
 
19.5%
( 20793
 
2.5%
) 20782
 
2.5%
20182
 
2.4%
14609
 
1.7%
13308
 
1.6%
. 13180
 
1.6%
, 12769
 
1.5%
12519
 
1.5%
1 10725
 
1.3%
Other values (3912) 538834
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 545662
64.8%
Space Separator 164693
 
19.6%
Decimal Number 39095
 
4.6%
Other Punctuation 34825
 
4.1%
Open Punctuation 23096
 
2.7%
Close Punctuation 23069
 
2.7%
Lowercase Letter 6488
 
0.8%
Uppercase Letter 2030
 
0.2%
Math Symbol 1318
 
0.2%
Final Punctuation 518
 
0.1%
Other values (9) 1303
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20182
 
3.7%
14609
 
2.7%
13308
 
2.4%
12519
 
2.3%
10365
 
1.9%
9901
 
1.8%
8292
 
1.5%
7414
 
1.4%
7293
 
1.3%
6977
 
1.3%
Other values (3743) 434802
79.7%
Lowercase Letter
ValueCountFrequency (%)
a 832
12.8%
x 703
10.8%
e 632
 
9.7%
o 483
 
7.4%
n 419
 
6.5%
i 415
 
6.4%
r 404
 
6.2%
t 394
 
6.1%
l 324
 
5.0%
u 217
 
3.3%
Other values (36) 1665
25.7%
Uppercase Letter
ValueCountFrequency (%)
D 731
36.0%
C 160
 
7.9%
S 145
 
7.1%
A 114
 
5.6%
M 93
 
4.6%
K 75
 
3.7%
P 75
 
3.7%
B 65
 
3.2%
N 64
 
3.2%
R 50
 
2.5%
Other values (31) 458
22.6%
Other Punctuation
ValueCountFrequency (%)
. 13180
37.8%
, 12769
36.7%
· 2691
 
7.7%
: 2543
 
7.3%
; 1119
 
3.2%
# 759
 
2.2%
& 756
 
2.2%
" 670
 
1.9%
' 170
 
0.5%
86
 
0.2%
Other values (9) 82
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 592
44.9%
516
39.2%
82
 
6.2%
53
 
4.0%
45
 
3.4%
× 9
 
0.7%
+ 6
 
0.5%
4
 
0.3%
4
 
0.3%
= 3
 
0.2%
Other values (2) 4
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 10725
27.4%
2 4306
11.0%
5 3524
 
9.0%
3 3361
 
8.6%
4 3157
 
8.1%
7 3085
 
7.9%
6 2871
 
7.3%
9 2812
 
7.2%
8 2716
 
6.9%
0 2538
 
6.5%
Open Punctuation
ValueCountFrequency (%)
( 20793
90.0%
1286
 
5.6%
323
 
1.4%
256
 
1.1%
234
 
1.0%
[ 106
 
0.5%
81
 
0.4%
17
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 20782
90.1%
1288
 
5.6%
322
 
1.4%
256
 
1.1%
234
 
1.0%
] 106
 
0.5%
81
 
0.4%
Other Symbol
ValueCountFrequency (%)
18
42.9%
10
23.8%
7
 
16.7%
3
 
7.1%
3
 
7.1%
1
 
2.4%
Other Number
ValueCountFrequency (%)
7
24.1%
7
24.1%
7
24.1%
½ 4
13.8%
3
10.3%
1
 
3.4%
Space Separator
ValueCountFrequency (%)
164396
99.8%
  297
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 512
99.4%
3
 
0.6%
Final Punctuation
ValueCountFrequency (%)
419
80.9%
99
 
19.1%
Initial Punctuation
ValueCountFrequency (%)
417
80.8%
99
 
19.2%
Control
ValueCountFrequency (%)
7
77.8%
2
 
22.2%
Modifier Symbol
ValueCountFrequency (%)
` 177
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%
Format
ValueCountFrequency (%)
­ 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 482785
57.3%
Common 287916
34.2%
Han 62872
 
7.5%
Latin 8418
 
1.0%
Cyrillic 101
 
< 0.1%
Katakana 4
 
< 0.1%
Hiragana 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1275
 
2.0%
1072
 
1.7%
882
 
1.4%
840
 
1.3%
581
 
0.9%
533
 
0.8%
502
 
0.8%
471
 
0.7%
455
 
0.7%
432
 
0.7%
Other values (2650) 55829
88.8%
Hangul
ValueCountFrequency (%)
20182
 
4.2%
14609
 
3.0%
13308
 
2.8%
12519
 
2.6%
10365
 
2.1%
9901
 
2.1%
8292
 
1.7%
7414
 
1.5%
7293
 
1.5%
6977
 
1.4%
Other values (1079) 371925
77.0%
Common
ValueCountFrequency (%)
164396
57.1%
( 20793
 
7.2%
) 20782
 
7.2%
. 13180
 
4.6%
, 12769
 
4.4%
1 10725
 
3.7%
2 4306
 
1.5%
5 3524
 
1.2%
3 3361
 
1.2%
4 3157
 
1.1%
Other values (71) 30923
 
10.7%
Latin
ValueCountFrequency (%)
a 832
 
9.9%
D 731
 
8.7%
x 703
 
8.4%
e 632
 
7.5%
o 483
 
5.7%
n 419
 
5.0%
i 415
 
4.9%
r 404
 
4.8%
t 394
 
4.7%
l 324
 
3.8%
Other values (43) 3081
36.6%
Cyrillic
ValueCountFrequency (%)
е 10
 
9.9%
а 9
 
8.9%
в 8
 
7.9%
н 6
 
5.9%
о 6
 
5.9%
с 5
 
5.0%
к 5
 
5.0%
А 4
 
4.0%
Е 4
 
4.0%
л 4
 
4.0%
Other values (25) 40
39.6%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Hiragana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 482634
57.3%
ASCII 287036
34.1%
CJK 62859
 
7.5%
None 7426
 
0.9%
Punctuation 1145
 
0.1%
Math Operators 606
 
0.1%
Compat Jamo 151
 
< 0.1%
Cyrillic 101
 
< 0.1%
Arrows 45
 
< 0.1%
Enclosed Alphanum 25
 
< 0.1%
Other values (8) 69
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164396
57.3%
( 20793
 
7.2%
) 20782
 
7.2%
. 13180
 
4.6%
, 12769
 
4.4%
1 10725
 
3.7%
2 4306
 
1.5%
5 3524
 
1.2%
3 3361
 
1.2%
4 3157
 
1.1%
Other values (77) 30043
 
10.5%
Hangul
ValueCountFrequency (%)
20182
 
4.2%
14609
 
3.0%
13308
 
2.8%
12519
 
2.6%
10365
 
2.1%
9901
 
2.1%
8292
 
1.7%
7414
 
1.5%
7293
 
1.5%
6977
 
1.4%
Other values (1076) 371774
77.0%
None
ValueCountFrequency (%)
· 2691
36.2%
1288
17.3%
1286
17.3%
323
 
4.3%
322
 
4.3%
  297
 
4.0%
256
 
3.4%
256
 
3.4%
234
 
3.2%
234
 
3.2%
Other values (10) 239
 
3.2%
CJK
ValueCountFrequency (%)
1275
 
2.0%
1072
 
1.7%
882
 
1.4%
840
 
1.3%
581
 
0.9%
533
 
0.8%
502
 
0.8%
471
 
0.7%
455
 
0.7%
432
 
0.7%
Other values (2641) 55816
88.8%
Math Operators
ValueCountFrequency (%)
516
85.1%
82
 
13.5%
4
 
0.7%
4
 
0.7%
Punctuation
ValueCountFrequency (%)
419
36.6%
417
36.4%
99
 
8.6%
99
 
8.6%
86
 
7.5%
17
 
1.5%
4
 
0.3%
3
 
0.3%
1
 
0.1%
Compat Jamo
ValueCountFrequency (%)
142
94.0%
5
 
3.3%
4
 
2.6%
Arrows
ValueCountFrequency (%)
45
100.0%
CJK Compat
ValueCountFrequency (%)
18
85.7%
3
 
14.3%
Geometric Shapes
ValueCountFrequency (%)
10
55.6%
7
38.9%
1
 
5.6%
Cyrillic
ValueCountFrequency (%)
е 10
 
9.9%
а 9
 
8.9%
в 8
 
7.9%
н 6
 
5.9%
о 6
 
5.9%
с 5
 
5.0%
к 5
 
5.0%
А 4
 
4.0%
Е 4
 
4.0%
л 4
 
4.0%
Other values (25) 40
39.6%
Small Forms
ValueCountFrequency (%)
8
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
7
28.0%
7
28.0%
7
28.0%
3
12.0%
1
 
4.0%
Specials
ValueCountFrequency (%)
3
100.0%
CJK Ext A
ValueCountFrequency (%)
3
23.1%
2
15.4%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
100.0%

ISPARTOF_ID
Categorical

Distinct30
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
3587 
KH.NAHF.ku_001
2825 
KH.NAHF.ku_002
644 
KH.NAHF.ag_004
387 
KH.NAHF.ag_002
 
351
Other values (25)
2206 

Length

Max length14
Median length14
Mean length10.413
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKH.NAHF.ag_003
2nd rowKH.NAHF.ag_003
3rd rowKH.NAHF.cr_005
4th row<NA>
5th rowKH.NAHF.ag_002

Common Values

ValueCountFrequency (%)
<NA> 3587
35.9%
KH.NAHF.ku_001 2825
28.2%
KH.NAHF.ku_002 644
 
6.4%
KH.NAHF.ag_004 387
 
3.9%
KH.NAHF.ag_002 351
 
3.5%
KH.NAHF.ag_001 333
 
3.3%
KH.NAHF.ag_003 296
 
3.0%
KH.NAHF.kk_002 226
 
2.3%
KH.NAHF.ku_003 152
 
1.5%
KH.NAHF.cr_006 140
 
1.4%
Other values (20) 1059
 
10.6%

Length

2023-12-12T23:36:16.205398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 3587
35.9%
kh.nahf.ku_001 2825
28.2%
kh.nahf.ku_002 644
 
6.4%
kh.nahf.ag_004 387
 
3.9%
kh.nahf.ag_002 351
 
3.5%
kh.nahf.ag_001 333
 
3.3%
kh.nahf.ag_003 296
 
3.0%
kh.nahf.kk_002 226
 
2.3%
kh.nahf.ku_003 152
 
1.5%
kh.nahf.cr_006 140
 
1.4%
Other values (20) 1059
 
10.6%

ISPARTOF
Categorical

Distinct30
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
3587 
고구려문화유산자료/중국
2825 
고구려문화유산자료/일본
644 
키르기스스탄 중·동부지역의 암각화
387 
몽골서북부 지역의 암각화
 
351
Other values (25)
2206 

Length

Max length34
Median length33
Mean length10.1222
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙아시아의 바위그림
2nd row중앙아시아의 바위그림
3rd row환인·집안 지역 고구려 유적 지질조사 보고서
4th row<NA>
5th row몽골서북부 지역의 암각화

Common Values

ValueCountFrequency (%)
<NA> 3587
35.9%
고구려문화유산자료/중국 2825
28.2%
고구려문화유산자료/일본 644
 
6.4%
키르기스스탄 중·동부지역의 암각화 387
 
3.9%
몽골서북부 지역의 암각화 351
 
3.5%
몽골고비알타이의 바위그림 333
 
3.3%
중앙아시아의 바위그림 296
 
3.0%
덕흥리 고분벽화 226
 
2.3%
고구려문화유산자료/논문편 152
 
1.5%
고구려 안학궁 조사 보고서 2006 140
 
1.4%
Other values (20) 1059
 
10.6%

Length

2023-12-12T23:36:16.339717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 3587
21.9%
고구려문화유산자료/중국 2825
17.2%
암각화 763
 
4.6%
바위그림 653
 
4.0%
고구려문화유산자료/일본 644
 
3.9%
키르기스스탄 412
 
2.5%
보고서 409
 
2.5%
연해주 403
 
2.5%
중·동부지역의 387
 
2.4%
지역의 376
 
2.3%
Other values (55) 5955
36.3%

REQUIRES
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

DATEEVENT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

DOCCREATED
Text

MISSING 

Distinct444
Distinct (%)82.5%
Missing9462
Missing (%)94.6%
Memory size156.2 KiB
2023-12-12T23:36:16.628792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length42
Mean length11.250929
Min length2

Characters and Unicode

Total characters6053
Distinct characters133
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique389 ?
Unique (%)72.3%

Sample

1st row1953. 3. 1 (1997. 6. 5 이전)
2nd row1991. 8. 150
3rd row1985
4th row1981. 3
5th row1979. 3. 1
ValueCountFrequency (%)
8 84
 
5.8%
3 71
 
4.9%
1 57
 
3.9%
150 55
 
3.8%
10 53
 
3.7%
11 50
 
3.5%
4 48
 
3.3%
5 42
 
2.9%
12 39
 
2.7%
6 30
 
2.1%
Other values (273) 917
63.4%
2023-12-12T23:36:17.103785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1025
16.9%
9 917
15.1%
910
15.0%
. 807
13.3%
0 367
 
6.1%
8 298
 
4.9%
5 237
 
3.9%
2 220
 
3.6%
7 199
 
3.3%
3 187
 
3.1%
Other values (123) 886
14.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3770
62.3%
Space Separator 910
 
15.0%
Other Punctuation 822
 
13.6%
Other Letter 363
 
6.0%
Dash Punctuation 87
 
1.4%
Open Punctuation 45
 
0.7%
Close Punctuation 44
 
0.7%
Math Symbol 5
 
0.1%
Modifier Symbol 5
 
0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
12.1%
23
 
6.3%
13
 
3.6%
13
 
3.6%
13
 
3.6%
13
 
3.6%
12
 
3.3%
12
 
3.3%
9
 
2.5%
9
 
2.5%
Other values (102) 202
55.6%
Decimal Number
ValueCountFrequency (%)
1 1025
27.2%
9 917
24.3%
0 367
 
9.7%
8 298
 
7.9%
5 237
 
6.3%
2 220
 
5.8%
7 199
 
5.3%
3 187
 
5.0%
6 187
 
5.0%
4 133
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 807
98.2%
, 14
 
1.7%
/ 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
D 1
50.0%
Space Separator
ValueCountFrequency (%)
910
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5688
94.0%
Hangul 363
 
6.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
12.1%
23
 
6.3%
13
 
3.6%
13
 
3.6%
13
 
3.6%
13
 
3.6%
12
 
3.3%
12
 
3.3%
9
 
2.5%
9
 
2.5%
Other values (102) 202
55.6%
Common
ValueCountFrequency (%)
1 1025
18.0%
9 917
16.1%
910
16.0%
. 807
14.2%
0 367
 
6.5%
8 298
 
5.2%
5 237
 
4.2%
2 220
 
3.9%
7 199
 
3.5%
3 187
 
3.3%
Other values (9) 521
9.2%
Latin
ValueCountFrequency (%)
A 1
50.0%
D 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5690
94.0%
Hangul 363
 
6.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1025
18.0%
9 917
16.1%
910
16.0%
. 807
14.2%
0 367
 
6.4%
8 298
 
5.2%
5 237
 
4.2%
2 220
 
3.9%
7 199
 
3.5%
3 187
 
3.3%
Other values (11) 523
9.2%
Hangul
ValueCountFrequency (%)
44
 
12.1%
23
 
6.3%
13
 
3.6%
13
 
3.6%
13
 
3.6%
13
 
3.6%
12
 
3.3%
12
 
3.3%
9
 
2.5%
9
 
2.5%
Other values (102) 202
55.6%

DOCISSUED
Text

MISSING 

Distinct556
Distinct (%)61.1%
Missing9090
Missing (%)90.9%
Memory size156.2 KiB
2023-12-12T23:36:17.478692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length6.0571429
Min length2

Characters and Unicode

Total characters5512
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique403 ?
Unique (%)44.3%

Sample

1st row1838년
2nd row丙子 1-1月-99
3rd row을해년
4th row1862년/1861년0
5th row을사(乙-巳)-99
ValueCountFrequency (%)
조선시대 66
 
6.6%
조선후기 41
 
4.1%
조선말기 8
 
0.8%
16세기 8
 
0.8%
辛丑 7
 
0.7%
을해년 7
 
0.7%
18세기경 7
 
0.7%
1716년 6
 
0.6%
18세기 6
 
0.6%
12월 5
 
0.5%
Other values (577) 841
83.9%
2023-12-12T23:36:18.010307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 693
 
12.6%
9 648
 
11.8%
436
 
7.9%
- 376
 
6.8%
7 256
 
4.6%
8 252
 
4.6%
0 203
 
3.7%
2 182
 
3.3%
5 156
 
2.8%
6 155
 
2.8%
Other values (110) 2155
39.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2783
50.5%
Other Letter 1991
36.1%
Dash Punctuation 376
 
6.8%
Close Punctuation 123
 
2.2%
Open Punctuation 123
 
2.2%
Space Separator 94
 
1.7%
Math Symbol 16
 
0.3%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
436
21.9%
135
 
6.8%
133
 
6.7%
129
 
6.5%
77
 
3.9%
68
 
3.4%
57
 
2.9%
46
 
2.3%
43
 
2.2%
41
 
2.1%
Other values (90) 826
41.5%
Decimal Number
ValueCountFrequency (%)
1 693
24.9%
9 648
23.3%
7 256
 
9.2%
8 252
 
9.1%
0 203
 
7.3%
2 182
 
6.5%
5 156
 
5.6%
6 155
 
5.6%
3 144
 
5.2%
4 94
 
3.4%
Other Punctuation
ValueCountFrequency (%)
? 2
33.3%
. 2
33.3%
, 1
16.7%
/ 1
16.7%
Math Symbol
ValueCountFrequency (%)
11
68.8%
~ 5
31.2%
Dash Punctuation
ValueCountFrequency (%)
- 376
100.0%
Close Punctuation
ValueCountFrequency (%)
) 123
100.0%
Open Punctuation
ValueCountFrequency (%)
( 123
100.0%
Space Separator
ValueCountFrequency (%)
94
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3521
63.9%
Hangul 1578
28.6%
Han 413
 
7.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
436
27.6%
135
 
8.6%
133
 
8.4%
129
 
8.2%
77
 
4.9%
68
 
4.3%
57
 
3.6%
43
 
2.7%
41
 
2.6%
31
 
2.0%
Other values (43) 428
27.1%
Han
ValueCountFrequency (%)
46
 
11.1%
23
 
5.6%
21
 
5.1%
17
 
4.1%
16
 
3.9%
15
 
3.6%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
Other values (37) 225
54.5%
Common
ValueCountFrequency (%)
1 693
19.7%
9 648
18.4%
- 376
10.7%
7 256
 
7.3%
8 252
 
7.2%
0 203
 
5.8%
2 182
 
5.2%
5 156
 
4.4%
6 155
 
4.4%
3 144
 
4.1%
Other values (10) 456
13.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3510
63.7%
Hangul 1578
28.6%
CJK 413
 
7.5%
Math Operators 11
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 693
19.7%
9 648
18.5%
- 376
10.7%
7 256
 
7.3%
8 252
 
7.2%
0 203
 
5.8%
2 182
 
5.2%
5 156
 
4.4%
6 155
 
4.4%
3 144
 
4.1%
Other values (9) 445
12.7%
Hangul
ValueCountFrequency (%)
436
27.6%
135
 
8.6%
133
 
8.4%
129
 
8.2%
77
 
4.9%
68
 
4.3%
57
 
3.6%
43
 
2.7%
41
 
2.6%
31
 
2.0%
Other values (43) 428
27.1%
CJK
ValueCountFrequency (%)
46
 
11.1%
23
 
5.6%
21
 
5.1%
17
 
4.1%
16
 
3.9%
15
 
3.6%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
Other values (37) 225
54.5%
Math Operators
ValueCountFrequency (%)
11
100.0%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1900-01-01 00:00:00
Maximum2015-02-02 00:00:00
2023-12-12T23:36:18.147257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:36:18.254354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1900-01-01 00:00:00
Maximum2007-10-30 00:00:00
2023-12-12T23:36:18.366798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:36:18.481920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=4)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2008-04-21 00:00:00
Maximum2015-10-06 00:00:00
2023-12-12T23:36:18.588329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:36:18.708719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)

URL
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:36:18.987329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length183
Median length139
Mean length96.4657
Min length63

Characters and Unicode

Total characters964657
Distinct characters66
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row<url><get>http://contents.nahf.or.kr/id/NAHF.ag_003_0030_0020_0010</get></url>
2nd row<url><get>http://contents.nahf.or.kr/id/NAHF.ag_003_0070_0070_0050</get></url>
3rd row<url><get>http://contents.nahf.or.kr/id/NAHF.cr_005_0010_0020_0010_0020</get></url>
4th row<url> <get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=2&amp;lvl2=4030</get> </url>
5th row<url><get>http://contents.nahf.or.kr/id/NAHF.ag_002_0020_0040</get></url>
ValueCountFrequency (%)
url 7118
41.6%
url><get>http://contents.nahf.or.kr/id/nahf.ag_003_0030_0020_0010</get></url 1
 
< 0.1%
url><get>http://contents.nahf.or.kr/id/nahf.ku_001_0010_0040_0030</get></url 1
 
< 0.1%
get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=1&amp;lvl2=1064</get 1
 
< 0.1%
url><get>http://contents.nahf.or.kr/id/nahf.ku_003_0030_0040</get></url 1
 
< 0.1%
url><get>http://contents.nahf.or.kr/id/nahf.ku_001_0020_0030_2170_0460</get></url 1
 
< 0.1%
get>http://www.ugyo.net/resolver.jsp?cat=rin&amp;lvl1=ands017</get 1
 
< 0.1%
url><get>http://contents.nahf.or.kr/id/nahf.ku_002_0080_0010_0010</get></url 1
 
< 0.1%
get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=5&amp;lvl2=4918</get 1
 
< 0.1%
url><get>http://contents.nahf.or.kr/id/nahf.ag_004_0020_0010_0410</get></url 1
 
< 0.1%
Other values (9991) 9991
58.4%
2023-12-12T23:36:19.418169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
128124
 
13.3%
0 75345
 
7.8%
t 58803
 
6.1%
/ 57665
 
6.0%
r 46826
 
4.9%
< 40000
 
4.1%
> 40000
 
4.1%
e 38428
 
4.0%
. 37647
 
3.9%
l 33026
 
3.4%
Other values (56) 408793
42.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 425790
44.1%
Decimal Number 133676
 
13.9%
Space Separator 128124
 
13.3%
Other Punctuation 119979
 
12.4%
Math Symbol 89113
 
9.2%
Uppercase Letter 38039
 
3.9%
Connector Punctuation 27542
 
2.9%
Dash Punctuation 2394
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 58803
13.8%
r 46826
11.0%
e 38428
 
9.0%
l 33026
 
7.8%
u 27192
 
6.4%
n 25922
 
6.1%
g 23804
 
5.6%
o 21285
 
5.0%
h 19413
 
4.6%
p 19177
 
4.5%
Other values (14) 111914
26.3%
Uppercase Letter
ValueCountFrequency (%)
H 7695
20.2%
A 6497
17.1%
N 6466
17.0%
F 6441
16.9%
C 1580
 
4.2%
U 1421
 
3.7%
R 1330
 
3.5%
S 1308
 
3.4%
P 1277
 
3.4%
D 1272
 
3.3%
Other values (10) 2752
 
7.2%
Decimal Number
ValueCountFrequency (%)
0 75345
56.4%
1 16629
 
12.4%
2 13893
 
10.4%
3 6582
 
4.9%
4 5979
 
4.5%
5 4329
 
3.2%
6 3626
 
2.7%
8 3351
 
2.5%
7 2426
 
1.8%
9 1516
 
1.1%
Other Punctuation
ValueCountFrequency (%)
/ 57665
48.1%
. 37647
31.4%
: 10000
 
8.3%
; 5554
 
4.6%
& 5554
 
4.6%
? 3559
 
3.0%
Math Symbol
ValueCountFrequency (%)
< 40000
44.9%
> 40000
44.9%
= 9113
 
10.2%
Space Separator
ValueCountFrequency (%)
128124
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 27542
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2394
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 500828
51.9%
Latin 463829
48.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 58803
 
12.7%
r 46826
 
10.1%
e 38428
 
8.3%
l 33026
 
7.1%
u 27192
 
5.9%
n 25922
 
5.6%
g 23804
 
5.1%
o 21285
 
4.6%
h 19413
 
4.2%
p 19177
 
4.1%
Other values (34) 149953
32.3%
Common
ValueCountFrequency (%)
128124
25.6%
0 75345
15.0%
/ 57665
11.5%
< 40000
 
8.0%
> 40000
 
8.0%
. 37647
 
7.5%
_ 27542
 
5.5%
1 16629
 
3.3%
2 13893
 
2.8%
: 10000
 
2.0%
Other values (12) 53983
10.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 964657
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
128124
 
13.3%
0 75345
 
7.8%
t 58803
 
6.1%
/ 57665
 
6.0%
r 46826
 
4.9%
< 40000
 
4.1%
> 40000
 
4.1%
e 38428
 
4.0%
. 37647
 
3.9%
l 33026
 
3.4%
Other values (56) 408793
42.4%

CREATORSORT
Text

MISSING 

Distinct678
Distinct (%)9.3%
Missing2683
Missing (%)26.8%
Memory size156.2 KiB
2023-12-12T23:36:19.768054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length2
Mean length2.9327593
Min length2

Characters and Unicode

Total characters21459
Distinct characters627
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique578 ?
Unique (%)7.9%

Sample

1st row
2nd row
3rd row
4th row
5th row
ValueCountFrequency (%)
미상 52
 
3.5%
이상정(李象靖 27
 
1.8%
건립위원회 26
 
1.7%
건립추진위원회 25
 
1.7%
기념사업회 20
 
1.3%
유경시(柳敬時 17
 
1.1%
유림 16
 
1.1%
15
 
1.0%
이조(吏曹)/이현보(李賢輔 14
 
0.9%
14
 
0.9%
Other values (904) 1274
84.9%
2023-12-12T23:36:20.267057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13433
62.6%
) 331
 
1.5%
( 330
 
1.5%
285
 
1.3%
192
 
0.9%
166
 
0.8%
161
 
0.8%
140
 
0.7%
139
 
0.6%
137
 
0.6%
Other values (617) 6145
28.6%

Most occurring categories

ValueCountFrequency (%)
Space Separator 13433
62.6%
Other Letter 6664
31.1%
Close Punctuation 335
 
1.6%
Open Punctuation 334
 
1.6%
Other Punctuation 293
 
1.4%
Lowercase Letter 137
 
0.6%
Uppercase Letter 129
 
0.6%
Decimal Number 117
 
0.5%
Math Symbol 17
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
285
 
4.3%
192
 
2.9%
166
 
2.5%
161
 
2.4%
140
 
2.1%
139
 
2.1%
137
 
2.1%
119
 
1.8%
119
 
1.8%
118
 
1.8%
Other values (543) 5088
76.4%
Uppercase Letter
ValueCountFrequency (%)
И 16
12.4%
В 13
 
10.1%
Е 13
 
10.1%
А 11
 
8.5%
I 10
 
7.8%
V 9
 
7.0%
A 7
 
5.4%
Б 6
 
4.7%
Л 6
 
4.7%
Г 5
 
3.9%
Other values (18) 33
25.6%
Lowercase Letter
ValueCountFrequency (%)
е 17
12.4%
н 15
10.9%
а 15
10.9%
и 12
8.8%
в 12
8.8%
л 11
8.0%
о 11
8.0%
к 8
 
5.8%
с 7
 
5.1%
т 4
 
2.9%
Other values (12) 25
18.2%
Decimal Number
ValueCountFrequency (%)
1 32
27.4%
3 30
25.6%
4 16
13.7%
5 11
 
9.4%
0 8
 
6.8%
8 6
 
5.1%
6 6
 
5.1%
9 3
 
2.6%
7 3
 
2.6%
2 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 99
33.8%
/ 71
24.2%
, 70
23.9%
· 36
 
12.3%
? 15
 
5.1%
2
 
0.7%
Math Symbol
ValueCountFrequency (%)
+ 15
88.2%
< 1
 
5.9%
> 1
 
5.9%
Close Punctuation
ValueCountFrequency (%)
) 331
98.8%
] 4
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 330
98.8%
[ 4
 
1.2%
Space Separator
ValueCountFrequency (%)
13433
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14529
67.7%
Hangul 5732
 
26.7%
Han 932
 
4.3%
Cyrillic 211
 
1.0%
Latin 55
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
285
 
5.0%
192
 
3.3%
166
 
2.9%
161
 
2.8%
140
 
2.4%
139
 
2.4%
137
 
2.4%
119
 
2.1%
119
 
2.1%
118
 
2.1%
Other values (313) 4156
72.5%
Han
ValueCountFrequency (%)
88
 
9.4%
55
 
5.9%
41
 
4.4%
34
 
3.6%
30
 
3.2%
29
 
3.1%
28
 
3.0%
28
 
3.0%
28
 
3.0%
25
 
2.7%
Other values (220) 546
58.6%
Cyrillic
ValueCountFrequency (%)
е 17
 
8.1%
И 16
 
7.6%
н 15
 
7.1%
а 15
 
7.1%
В 13
 
6.2%
Е 13
 
6.2%
и 12
 
5.7%
в 12
 
5.7%
А 11
 
5.2%
л 11
 
5.2%
Other values (20) 76
36.0%
Common
ValueCountFrequency (%)
13433
92.5%
) 331
 
2.3%
( 330
 
2.3%
. 99
 
0.7%
/ 71
 
0.5%
, 70
 
0.5%
· 36
 
0.2%
1 32
 
0.2%
3 30
 
0.2%
4 16
 
0.1%
Other values (14) 81
 
0.6%
Latin
ValueCountFrequency (%)
I 10
18.2%
V 9
16.4%
A 7
12.7%
E 5
9.1%
S 3
 
5.5%
O 2
 
3.6%
L 2
 
3.6%
B 2
 
3.6%
N 2
 
3.6%
M 2
 
3.6%
Other values (10) 11
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14546
67.8%
Hangul 5732
 
26.7%
CJK 930
 
4.3%
Cyrillic 211
 
1.0%
None 38
 
0.2%
CJK Ext A 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13433
92.3%
) 331
 
2.3%
( 330
 
2.3%
. 99
 
0.7%
/ 71
 
0.5%
, 70
 
0.5%
1 32
 
0.2%
3 30
 
0.2%
4 16
 
0.1%
? 15
 
0.1%
Other values (32) 119
 
0.8%
Hangul
ValueCountFrequency (%)
285
 
5.0%
192
 
3.3%
166
 
2.9%
161
 
2.8%
140
 
2.4%
139
 
2.4%
137
 
2.4%
119
 
2.1%
119
 
2.1%
118
 
2.1%
Other values (313) 4156
72.5%
CJK
ValueCountFrequency (%)
88
 
9.5%
55
 
5.9%
41
 
4.4%
34
 
3.7%
30
 
3.2%
29
 
3.1%
28
 
3.0%
28
 
3.0%
28
 
3.0%
25
 
2.7%
Other values (218) 544
58.5%
None
ValueCountFrequency (%)
· 36
94.7%
2
 
5.3%
Cyrillic
ValueCountFrequency (%)
е 17
 
8.1%
И 16
 
7.6%
н 15
 
7.1%
а 15
 
7.1%
В 13
 
6.2%
Е 13
 
6.2%
и 12
 
5.7%
в 12
 
5.7%
А 11
 
5.2%
л 11
 
5.2%
Other values (20) 76
36.0%
CJK Ext A
ValueCountFrequency (%)
1
50.0%
1
50.0%

DATESORT
Text

MISSING 

Distinct995
Distinct (%)68.9%
Missing8556
Missing (%)85.6%
Memory size156.2 KiB
2023-12-12T23:36:20.615545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length35
Mean length8.0186981
Min length2

Characters and Unicode

Total characters11579
Distinct characters201
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique786 ?
Unique (%)54.4%

Sample

1st row1838년
2nd row丙子 1-1月-99
3rd row을해년
4th row1862년/1861년0
5th row1953. 3. 1 (1997. 6. 5 이전)
ValueCountFrequency (%)
8 84
 
3.4%
3 71
 
2.9%
조선시대 66
 
2.7%
1 57
 
2.3%
150 55
 
2.3%
10 53
 
2.2%
11 50
 
2.0%
4 48
 
2.0%
5 42
 
1.7%
조선후기 41
 
1.7%
Other values (846) 1877
76.8%
2023-12-12T23:36:21.136241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1711
14.8%
9 1583
13.7%
1004
 
8.7%
. 809
 
7.0%
0 562
 
4.9%
8 546
 
4.7%
489
 
4.2%
- 473
 
4.1%
7 455
 
3.9%
5 393
 
3.4%
Other values (191) 3554
30.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6539
56.5%
Other Letter 2372
 
20.5%
Space Separator 1006
 
8.7%
Other Punctuation 826
 
7.1%
Dash Punctuation 473
 
4.1%
Open Punctuation 168
 
1.5%
Close Punctuation 167
 
1.4%
Math Symbol 21
 
0.2%
Modifier Symbol 5
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
489
20.6%
142
 
6.0%
138
 
5.8%
136
 
5.7%
83
 
3.5%
71
 
3.0%
59
 
2.5%
46
 
1.9%
45
 
1.9%
42
 
1.8%
Other values (168) 1121
47.3%
Decimal Number
ValueCountFrequency (%)
1 1711
26.2%
9 1583
24.2%
0 562
 
8.6%
8 546
 
8.3%
7 455
 
7.0%
5 393
 
6.0%
2 393
 
6.0%
6 339
 
5.2%
3 330
 
5.0%
4 227
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 809
97.9%
, 15
 
1.8%
/ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
1004
99.8%
  2
 
0.2%
Math Symbol
ValueCountFrequency (%)
11
52.4%
~ 10
47.6%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
A 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 473
100.0%
Open Punctuation
ValueCountFrequency (%)
( 168
100.0%
Close Punctuation
ValueCountFrequency (%)
) 167
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9205
79.5%
Hangul 1959
 
16.9%
Han 413
 
3.6%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
489
25.0%
142
 
7.2%
138
 
7.0%
136
 
6.9%
83
 
4.2%
71
 
3.6%
59
 
3.0%
45
 
2.3%
42
 
2.1%
37
 
1.9%
Other values (121) 717
36.6%
Han
ValueCountFrequency (%)
46
 
11.1%
23
 
5.6%
21
 
5.1%
17
 
4.1%
16
 
3.9%
15
 
3.6%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
Other values (37) 225
54.5%
Common
ValueCountFrequency (%)
1 1711
18.6%
9 1583
17.2%
1004
10.9%
. 809
8.8%
0 562
 
6.1%
8 546
 
5.9%
- 473
 
5.1%
7 455
 
4.9%
5 393
 
4.3%
2 393
 
4.3%
Other values (11) 1276
13.9%
Latin
ValueCountFrequency (%)
D 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9194
79.4%
Hangul 1959
 
16.9%
CJK 413
 
3.6%
Math Operators 11
 
0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1711
18.6%
9 1583
17.2%
1004
10.9%
. 809
8.8%
0 562
 
6.1%
8 546
 
5.9%
- 473
 
5.1%
7 455
 
4.9%
5 393
 
4.3%
2 393
 
4.3%
Other values (11) 1265
13.8%
Hangul
ValueCountFrequency (%)
489
25.0%
142
 
7.2%
138
 
7.0%
136
 
6.9%
83
 
4.2%
71
 
3.6%
59
 
3.0%
45
 
2.3%
42
 
2.1%
37
 
1.9%
Other values (121) 717
36.6%
CJK
ValueCountFrequency (%)
46
 
11.1%
23
 
5.6%
21
 
5.1%
17
 
4.1%
16
 
3.9%
15
 
3.6%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
Other values (37) 225
54.5%
Math Operators
ValueCountFrequency (%)
11
100.0%
None
ValueCountFrequency (%)
  2
100.0%

Sample

URI_KHONMDCENTERSUBJECT_KHONDBINFOURI_KHDPMAINTITLEALTERNATIVEDOCSENDEREDITORAUTHORSUBJECT_KHON1SUBJECT_KHON2SUBJECT_KHDPTYPEUNITPUBLISHERFORMAT_MEDIUMTABLEOFCONTENTSABSTRACTISPARTOF_IDISPARTOFREQUIRESDATEEVENTDOCCREATEDDOCISSUEDDATE_ISSUEDDATE_CREATEDDATE_MODIFIEDURLCREATORSORTDATESORT
4332KH.NAHF.ag_003_0030_0020_0010NAHFKH.14.06.000암각화자료ag_003_0030_0020_0010서쪽 부분(동물들)<NA><NA><NA><NA>KH.14KH.14.06ag<NA>2<NA>text/xml<NA><NA>KH.NAHF.ag_003중앙아시아의 바위그림<NA><NA><NA><NA>2013-12-06 00:00:001900-01-01 00:00:002015-10-06 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ag_003_0030_0020_0010</get></url><NA>
4389KH.NAHF.ag_003_0070_0070_0050NAHFKH.14.06.000암각화자료ag_003_0070_0070_0050사람<NA><NA><NA><NA>KH.14KH.14.06ag<NA>2<NA>text/xml<NA><NA>KH.NAHF.ag_003중앙아시아의 바위그림<NA><NA><NA><NA>2013-12-06 00:00:001900-01-01 00:00:002015-10-06 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ag_003_0070_0070_0050</get></url><NA>
5250KH.NAHF.cr_005_0010_0020_0010_0020NAHFKH.14.10.000도록·보고서cr_005_0010_0020_0010_0020하고성자성(下古城子城)<NA><NA><NA><NA>KH.14KH.14.10cr<NA>2<NA>text/xml<NA><NA>KH.NAHF.cr_005환인·집안 지역 고구려 유적 지질조사 보고서<NA><NA><NA><NA>2013-12-05 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.cr_005_0010_0020_0010_0020</get></url><NA>
2408KH.KSAC.4030KSACKH.14.02.000영남유학유물4030명문(전답매매문기)(明文(田畓賣買文記))영남유교문화유물<NA><NA><NA>KH.14KH.14.02YCON03<NA>2<NA>text/xml|image/jpeg<NA>이는 고성이씨 탑동종가 소장자료의 하나로 전답매매문기이다. 이 문기에 의하면, 1838년 12월 23일 전주(田主)인 이득삼(李得三)이 탑동댁공소(塔洞宅公所)로 낭자전(廊字田) 6복(卜) 6속(束) 4두락지(斗落只)를 20냥에 매매한다는 것이다. 이러한 토지매매문서를 명문(明文)이라 하는데, 여기서 명문을 받는 이는 보통 양반의 경우 자신의 가노(家奴)를 시켜 매매행위를 하는 것이 일반적이기 때문에 가노 앞으로 명문을 발급하는 것이 일반적이다. 그런데 이 문서에서는 직접 '탑동댁공소'로 명기한 것이 특이한 경우에 해당한다. 매매문서는 '문서의 제목, 매매 내용(토지의 소재지, 지번, 매매토지 규모, 매매액, 분쟁시의 조치사항)'으로 구성되며, 뒤에는 '매주(賣主)와 증인, 필집(筆執)'등의 순으로 서명하는 것이 일반적인 관행이었다.<NA><NA><NA><NA><NA>1838년1900-01-01 00:00:001900-01-01 00:00:002008-04-21 00:00:00<url> <get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=2&amp;lvl2=4030</get> </url><NA>1838년
3972KH.NAHF.ag_002_0020_0040NAHFKH.14.06.000암각화자료ag_002_0020_0040조라그트 하드 2암면<NA><NA><NA><NA>KH.14KH.14.06ag<NA>2<NA>text/xml<NA><NA>KH.NAHF.ag_002몽골서북부 지역의 암각화<NA><NA><NA><NA>2013-12-06 00:00:001900-01-01 00:00:002015-10-06 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ag_002_0020_0040</get></url><NA>
9746KH.NAHF.ku_002_0060_0010_0020_0030NAHFKH.14.08.000고구려문화유산자료ku_002_0060_0010_0020_0030<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_002고구려문화유산자료/일본<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_002_0060_0010_0020_0030</get></url><NA>
9868KH.NAHF.ku_002_0070_0020_0030_0710NAHFKH.14.08.000고구려문화유산자료ku_002_0070_0020_0030_0710<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_002고구려문화유산자료/일본<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_002_0070_0020_0030_0710</get></url><NA>
8378KH.NAHF.ku_001_0030_0010_0010_0140NAHFKH.14.08.000고구려문화유산자료ku_001_0030_0010_0010_0140<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_001고구려문화유산자료/중국<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_001_0030_0010_0010_0140</get></url><NA>
8341KH.NAHF.ku_001_0020_0040_0300NAHFKH.14.08.000고구려문화유산자료ku_001_0020_0040_0300<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_001고구려문화유산자료/중국<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_001_0020_0040_0300</get></url><NA>
9541KH.NAHF.ku_002_0010_0050_0010_0270NAHFKH.14.08.000고구려문화유산자료ku_002_0010_0050_0010_0270<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_002고구려문화유산자료/일본<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_002_0010_0050_0010_0270</get></url><NA>
URI_KHONMDCENTERSUBJECT_KHONDBINFOURI_KHDPMAINTITLEALTERNATIVEDOCSENDEREDITORAUTHORSUBJECT_KHON1SUBJECT_KHON2SUBJECT_KHDPTYPEUNITPUBLISHERFORMAT_MEDIUMTABLEOFCONTENTSABSTRACTISPARTOF_IDISPARTOFREQUIRESDATEEVENTDOCCREATEDDOCISSUEDDATE_ISSUEDDATE_CREATEDDATE_MODIFIEDURLCREATORSORTDATESORT
661KH.IDP.s0084hIDPKH.14.03.000국내독립운동유적지s0084h송촌3·1의거애국선열추념탑(松村三一義擧愛國先烈追念塔)<NA><NA><NA>탑건립추진위원장KH.14KH.14.03IDP-RU-001<NA>2<NA>text/xml,image/jpeg<NA>소재지 : 경기도 남양주시 조안면 송촌1리 715-1 (용진교회 내)<NA><NA><NA><NA>1994. 8. 150<NA>1900-01-01 00:00:001900-01-01 00:00:002013-10-21 00:00:00<url> <get>http://search2.i815.or.kr/Search/HistoryCon.jsp?menu=IDP-RU-001&amp;nKey=s0084h</get> </url>탑건립추진위원장1994. 8. 150
5432KH.NAHF.cr_006_0030_0580NAHFKH.14.10.000도록·보고서cr_006_0030_0580북궁 3호 건물 터와 돌확<NA><NA><NA><NA>KH.14KH.14.10cr<NA>2<NA>text/xml<NA><NA>KH.NAHF.cr_006고구려 안학궁 조사 보고서 2006<NA><NA><NA><NA>2013-12-05 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.cr_006_0030_0580</get></url><NA>
1198KH.IDP.US344354IDPKH.14.03.000국외독립운동유적지US344354장인환·전명운의거 장소<NA><NA><NA><NA>KH.14KH.14.03IDP-RU-002<NA>2<NA>text/xml,image/jpeg<NA>1908년 3월 23일 오전 9시 10분경 張仁煥과 田明雲이 당시 오클랜드 역으로 가기 위해 페리부두에 와 있던 친일 미국인 스티븐스(Durham W.Stevens)를 처단했던 장소이다. 두 의사의 의거는 민족독립을 위한 의열투쟁의 효시가 되었을 뿐만 아니라 미주한인들의 민족적 단결과 독립운동 전개에 하나의 원동력이 되었다. &#xD; 소재지 : Embarcadeo St.<NA><NA><NA><NA><NA><NA>1900-01-01 00:00:001900-01-01 00:00:002013-10-21 00:00:00<url> <get>http://search2.i815.or.kr/Search/HistoryCon.jsp?menu=IDP-RU-002&amp;nKey=US344354</get> </url><NA><NA>
2570KH.KSAC.4236KSACKH.14.02.000영남유학유물4236창계집(滄溪集)영남유교문화유물<NA><NA><NA>KH.14KH.14.02YCON03<NA>2<NA>text/xml|image/jpeg<NA>『창계집(滄溪集)』은 문경동(文敬仝, 1457-1521)의 문집으로 4권2책이다. 문경동은 자가 흠지(欽之), 호는 창계(滄溪), 본관은 안동(安東)이다. 속명(續命)의 아들이며 영주(榮州)사람이다. 1495년(연산군 1) 문과에 급제하여 내외직을 두루 역임하고, 1510년 삼포왜란이일어났을때 왜구 격퇴한 공로로 성균관사성(成均館司成)을 거쳐 예천군수(醴泉郡守)와 청풍군수(淸風郡守)를 지냈다.첫머리에는 金若鍊의 서문이 있고, 이어서 滄溪先生 世系圖와 滄溪先生 外裔錄이 실려있다. 권1-4에 賦 10수, 辭 1수, 跋文 1편을 제외하고는 모두 詩가 실려있다. 시는 차운시가 많이 보인다. 賦 중 「次枕流亭賦」는 예안현에 사는 金萬鈞이 枕流亭과 枕流亭賦를 짓고 문경동에게 화답할 것을 청하여 차운한 부이다. 여기에서 자연의 아름다움과 천지간의 이치에 대하여 읊었다. 문집 부록으로 문경동에 대한 墓碣銘이 실려있고, 문집 끝에는 權斗經의 발문이 있다.<NA><NA><NA><NA><NA>丙午1900-01-01 00:00:001900-01-01 00:00:002008-04-21 00:00:00<url> <get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=1&amp;lvl2=4236</get> </url><NA>丙午
5725KH.NAHF.kk_002_0080_0070_0030NAHFKH.14.07.000고구려고분벽화kk_002_0080_0070_0030연화문<NA><NA><NA><NA>KH.14KH.14.07kk<NA>2<NA>text/xml<NA><NA>KH.NAHF.kk_002덕흥리 고분벽화<NA><NA><NA><NA>2013-12-06 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.kk_002_0080_0070_0030</get></url><NA>
3739KH.NAHF.ag_001_0030_0090_0010NAHFKH.14.06.000암각화자료ag_001_0030_0090_0010부분<NA><NA><NA><NA>KH.14KH.14.06ag<NA>2<NA>text/xml<NA><NA>KH.NAHF.ag_001몽골고비알타이의 바위그림<NA><NA><NA><NA>2013-12-06 00:00:001900-01-01 00:00:002015-10-06 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ag_001_0030_0090_0010</get></url><NA>
2271KH.KSAC.3252KSACKH.14.02.000영남유학유물3252단자(예장서-이태화)(單子(禮狀書-李泰和))영남유교문화유물<NA><NA>이태화(李泰和)KH.14KH.14.02YCON03<NA>2<NA>text/xml|image/jpeg<NA>이태화(李泰和)의 예장서(禮狀書)인 이 문서는 한산이씨 대산종가의 단자로, 정미년 12월 20일에 이태화가 작성하였다. 이태화가 그의 셋째 아들 이상정(李象靖)과 결혼하게 된 집에 길일을 잡았다는 것과 예물을 드리는 글이다. 예장서란 신랑집에서 예단에 동봉하여 신부집으로 보내는 서간이다.단자의 명칭이 붙은 문서로는 단자(單子)ㆍ선원록세계단자(璿源錄世系單子)ㆍ돈녕단자(敦寧單子)ㆍ공신자손세계단자(功臣子孫世系單子)ㆍ호구단자(戶口單子)ㆍ천단자(薦單子)ㆍ포폄단자(褒貶單子)ㆍ진상단자(進上單子)ㆍ하직단자(下直單子)ㆍ사은단자(謝恩單子)ㆍ육행단자(六行單子)ㆍ문안단자(問安單子)ㆍ지수단자(祗受單子)ㆍ처녀단자(處女單子)ㆍ서경단자(署經單子) 등이 있다.<NA><NA><NA><NA><NA>정미(丁-未)-991900-01-01 00:00:001900-01-01 00:00:002008-04-21 00:00:00<url> <get>http://www.ugyo.net/resolver.jsp?cat=rlc&amp;lvl1=2&amp;lvl2=3252</get> </url>이태화(李泰和)정미(丁-未)-99
537KH.IDP.RU098138IDPKH.14.03.000국외독립운동유적지RU098138코민테른 집행위원회·동양비서부 조선위원회<NA><NA><NA><NA>KH.14KH.14.03IDP-RU-002<NA>2<NA>text/xml,image/jpeg<NA>&#xD; 소재지 : 노��이 아르바트(Novyi Arbat) 1번지<NA><NA><NA><NA><NA><NA>1900-01-01 00:00:001900-01-01 00:00:002013-10-21 00:00:00<url> <get>http://search2.i815.or.kr/Search/HistoryCon.jsp?menu=IDP-RU-002&amp;nKey=RU098138</get> </url><NA><NA>
9725KH.NAHF.ku_002_0050_0020_0010_0040NAHFKH.14.08.000고구려문화유산자료ku_002_0050_0020_0010_0040<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_002고구려문화유산자료/일본<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_002_0050_0020_0010_0040</get></url><NA>
7600KH.NAHF.ku_001_0020_0030_1830_0210NAHFKH.14.08.000고구려문화유산자료ku_001_0020_0030_1830_0210<NA><NA><NA><NA><NA>KH.14KH.14.08ku<NA>2<NA>text/xml<NA><NA>KH.NAHF.ku_001고구려문화유산자료/중국<NA><NA><NA><NA>2015-02-02 00:00:001900-01-01 00:00:002015-09-18 00:00:00<url><get>http://contents.nahf.or.kr/id/NAHF.ku_001_0020_0030_1830_0210</get></url><NA>