Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 9109
Missing cells (%): 10.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 781.2 KiB
Average record size in memory: 80.0 B
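
The percentage and per-record figures above follow from the raw counts. As a quick check, the sketch below recomputes them and confirms that the per-column missing counts listed under Alerts add up to the reported total. The function and variable names are illustrative only and are not part of ydata-profiling.

```python
def overview_ratios(n_vars, n_obs, missing_cells, total_kib):
    """Derive the missing-cell percentage and the average record size in bytes."""
    total_cells = n_vars * n_obs
    missing_pct = 100.0 * missing_cells / total_cells
    avg_record_bytes = total_kib * 1024 / n_obs
    return missing_pct, avg_record_bytes


# Raw figures copied from the report.
missing_pct, avg_bytes = overview_ratios(9, 10000, 9109, 781.2)

# Per-column missing counts as listed under Alerts.
missing_by_column = {
    "다른명칭": 2635,
    "재질(중)": 1437,
    "분류(중)": 417,
    "분류(소)": 954,
    "분류(세)": 3666,
}
total_missing = sum(missing_by_column.values())

print(f"{missing_pct:.1f}% missing, {avg_bytes:.1f} B per record, {total_missing} missing cells")
```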

Variable types

Text: 7
Categorical: 2

Dataset

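Description: Provides a listing and detailed records (holding institution, name, alternative names, material, classification, and so on) for roughly 290,000 artifacts held by 104 partner museums, including the National Museum of Korea.
Author: Ministry of Culture, Sports and Tourism, National Museum of Korea
URL: https://www.data.go.kr/data/15083246/fileData.do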

Alerts

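다른명칭 (alternative name) has 2635 (26.4%) missing values [Missing]
재질(중) (material, mid level) has 1437 (14.4%) missing values [Missing]
분류(중) (classification, mid level) has 417 (4.2%) missing values [Missing]
분류(소) (classification, minor level) has 954 (9.5%) missing values [Missing]
분류(세) (classification, detailed level) has 3666 (36.7%) missing values [Missing]
소장품고유아이디 (unique collection item ID) has unique values [Unique]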

Reproduction

Analysis started: 2023-12-12 10:42:24.772325
Analysis finished: 2023-12-12 10:42:27.714361
Duration: 2.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
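
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch, not the exact invocation behind this report: the file paths are placeholders, the `cp949` encoding is a guess for a CSV downloaded from data.go.kr, and the title is arbitrary. The imports sit inside the function so that it can be defined without pandas or ydata-profiling installed.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV export and write the HTML report (illustrative sketch)."""
    # Third-party imports are deferred until the function is actually called.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV; adjust the encoding to match the download.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Build the profile and save it as a standalone HTML file.
    report = ProfileReport(df, title="Overview")
    report.to_file(html_path)
    return report
```

Calling `build_report("museum_items.csv", "report.html")` would write the report to `report.html`; both file names here are hypothetical.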

Variables

소장품고유아이디
Text

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 24
Median length: 24
Mean length: 24
Min length: 24

Characters and Unicode

Total characters: 240000
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
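
As an illustration of these character properties, the sketch below counts Unicode general categories for one identifier from this column using Python's standard `unicodedata` module. The module exposes the general category (`Lu` for Uppercase Letter, `Nd` for Decimal Number) but not script or block, so those two breakdowns are not reproduced here. The helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value of the identifier column.
sample_id = "PS0100100500105708900000"
counts = category_counts(sample_id)

print(len(sample_id), dict(counts))
```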

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: PS0100100500105708900000
2nd row: PS0100100100701311500000
3rd row: PS0100100101103347900000
4th row: PS0100100101101242800000
5th row: PS0100100101100215000000

Value | Count | Frequency (%)
ps0100100500105708900000 | 1 | < 0.1%
ps0100100100400178100000 | 1 | < 0.1%
ps0100100101100090100000 | 1 | < 0.1%
ps0100100102002264900000 | 1 | < 0.1%
ps0100100101700414800000 | 1 | < 0.1%
ps0100100102101432300000 | 1 | < 0.1%
ps0100100101900071700000 | 1 | < 0.1%
ps0100100100400279300000 | 1 | < 0.1%
ps0100100102001589900000 | 1 | < 0.1%
ps0100100102101351900000 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 134312 | 56.0%
1 | 44110 | 18.4%
P | 10000 | 4.2%
S | 10000 | 4.2%
2 | 9040 | 3.8%
7 | 5594 | 2.3%
5 | 5373 | 2.2%
3 | 5045 | 2.1%
4 | 4393 | 1.8%
6 | 4368 | 1.8%
Other values (2) | 7765 | 3.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 220000 | 91.7%
Uppercase Letter | 20000 | 8.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 134312 | 61.1%
1 | 44110 | 20.1%
2 | 9040 | 4.1%
7 | 5594 | 2.5%
5 | 5373 | 2.4%
3 | 5045 | 2.3%
4 | 4393 | 2.0%
6 | 4368 | 2.0%
9 | 3959 | 1.8%
8 | 3806 | 1.7%
Uppercase Letter
Value | Count | Frequency (%)
P | 10000 | 50.0%
S | 10000 | 50.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 220000 | 91.7%
Latin | 20000 | 8.3%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 134312 | 61.1%
1 | 44110 | 20.1%
2 | 9040 | 4.1%
7 | 5594 | 2.5%
5 | 5373 | 2.4%
3 | 5045 | 2.3%
4 | 4393 | 2.0%
6 | 4368 | 2.0%
9 | 3959 | 1.8%
8 | 3806 | 1.7%
Latin
Value | Count | Frequency (%)
P | 10000 | 50.0%
S | 10000 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 240000 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 134312 | 56.0%
1 | 44110 | 18.4%
P | 10000 | 4.2%
S | 10000 | 4.2%
2 | 9040 | 3.8%
7 | 5594 | 2.3%
5 | 5373 | 2.2%
3 | 5045 | 2.1%
4 | 4393 | 1.8%
6 | 4368 | 1.8%
Other values (2) | 7765 | 3.2%

소장기관명
Categorical

Distinct: 27
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

국립1-국립중앙박물관-신수: 2424
국립1-국립중앙박물관-건판: 2040
국립1-국립중앙박물관-신안: 1221
국립1-국립중앙박물관-고적: 1147
국립1-국립광주박물관-광주: 643
Other values (22): 2525

Length

Max length: 15
Median length: 14
Mean length: 13.901
Min length: 13

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 국립1-국립광주박물관-광주
2nd row: 국립1-국립중앙박물관-신안
3rd row: 국립1-국립중앙박물관-신수
4th row: 국립1-국립중앙박물관-신수
5th row: 국립1-국립중앙박물관-신수

Common Values

Value | Count | Frequency (%)
국립1-국립중앙박물관-신수 | 2424 | 24.2%
국립1-국립중앙박물관-건판 | 2040 | 20.4%
국립1-국립중앙박물관-신안 | 1221 | 12.2%
국립1-국립중앙박물관-고적 | 1147 | 11.5%
국립1-국립광주박물관-광주 | 643 | 6.4%
국립1-국립중앙박물관-구 | 514 | 5.1%
국립1-국립중앙박물관-증 | 482 | 4.8%
국립1-국립중앙박물관-본관 | 406 | 4.1%
국립1-국립중앙박물관-덕수 | 350 | 3.5%
국립1-국립중앙박물관-동원 | 254 | 2.5%
Other values (17) | 519 | 5.2%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
국립1-국립중앙박물관-신수 | 2424 | 24.2%
국립1-국립중앙박물관-건판 | 2040 | 20.4%
국립1-국립중앙박물관-신안 | 1221 | 12.2%
국립1-국립중앙박물관-고적 | 1147 | 11.5%
국립1-국립광주박물관-광주 | 643 | 6.4%
국립1-국립중앙박물관-구 | 514 | 5.1%
국립1-국립중앙박물관-증 | 482 | 4.8%
국립1-국립중앙박물관-본관 | 406 | 4.1%
국립1-국립중앙박물관-덕수 | 350 | 3.5%
국립1-국립중앙박물관-동원 | 254 | 2.5%
Other values (17) | 519 | 5.2%

명칭
Text

Distinct: 5403
Distinct (%): 54.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 46
Median length: 30
Mean length: 7.9257
Min length: 1

Characters and Unicode

Total characters: 79257
Distinct characters: 1220
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4515
Unique (%): 45.1%
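
The report distinguishes "Distinct" from "Unique". Judging by the figures in this report, Distinct counts the different non-missing values, Unique counts the values that occur exactly once, and both percentages are taken over the non-missing rows. The sketch below reproduces that reading on a toy list; the function name and the toy data are illustrative and do not come from the dataset's actual rows.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (n_distinct, n_unique, n_present), ignoring missing (None) entries."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_distinct = len(counts)                              # different values
    n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
    return n_distinct, n_unique, len(present)


# Toy column: two repeated names, two singletons, one missing entry.
names = ["토기", "백자", "토기", "청자", None, "백자", "항아리"]
n_distinct, n_unique, n_present = distinct_and_unique(names)

print(n_distinct, n_unique, n_present)
```

Under this reading, a column where every value is distinct, such as 소장품고유아이디, has equal Distinct and Unique counts.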

Sample

1st row: 발형토기저부편
2nd row: 흑유도호
3rd row: 토기 바닥 조각
4th row: 십장생 열쇠패
5th row: 뚜껑 있는 굽다리 접시

Value | Count | Frequency (%)
토기 | 623 | 3.1%
조각 | 598 | 3.0%
경북경주 | 260 | 1.3%
백자 | 253 | 1.3%
출토 | 249 | 1.3%
청자접시 | 192 | 1.0%
평남대동 | 190 | 1.0%
항아리 | 167 | 0.8%
청자 | 158 | 0.8%
접시 | 155 | 0.8%
Other values (6098) | 16935 | 85.6%

Most occurring characters

ValueCountFrequency (%)
9902
 
12.5%
2887
 
3.6%
2135
 
2.7%
1792
 
2.3%
1393
 
1.8%
1365
 
1.7%
1307
 
1.6%
1281
 
1.6%
1222
 
1.5%
1218
 
1.5%
Other values (1210) 54755
69.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 67980 | 85.8%
Space Separator | 9902 | 12.5%
Decimal Number | 924 | 1.2%
Close Punctuation | 164 | 0.2%
Open Punctuation | 164 | 0.2%
Other Punctuation | 74 | 0.1%
Uppercase Letter | 21 | < 0.1%
Lowercase Letter | 14 | < 0.1%
Dash Punctuation | 11 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2887
 
4.2%
2135
 
3.1%
1792
 
2.6%
1393
 
2.0%
1365
 
2.0%
1307
 
1.9%
1281
 
1.9%
1222
 
1.8%
1218
 
1.8%
1204
 
1.8%
Other values (1160) 52176
76.8%
Uppercase Letter
ValueCountFrequency (%)
E 4
19.0%
B 4
19.0%
A 3
14.3%
D 2
9.5%
H 2
9.5%
G 1
 
4.8%
X 1
 
4.8%
C 1
 
4.8%
T 1
 
4.8%
V 1
 
4.8%
Decimal Number
ValueCountFrequency (%)
1 262
28.4%
2 151
16.3%
3 96
 
10.4%
5 89
 
9.6%
9 86
 
9.3%
4 64
 
6.9%
0 51
 
5.5%
7 48
 
5.2%
6 47
 
5.1%
8 30
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
e 3
21.4%
a 3
21.4%
n 2
14.3%
x 1
 
7.1%
d 1
 
7.1%
l 1
 
7.1%
u 1
 
7.1%
h 1
 
7.1%
c 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 114
69.5%
23
 
14.0%
20
 
12.2%
] 4
 
2.4%
3
 
1.8%
Open Punctuation
ValueCountFrequency (%)
( 114
69.5%
24
 
14.6%
19
 
11.6%
[ 4
 
2.4%
3
 
1.8%
Other Punctuation
ValueCountFrequency (%)
' 39
52.7%
, 18
24.3%
· 12
 
16.2%
" 4
 
5.4%
. 1
 
1.4%
Math Symbol
ValueCountFrequency (%)
1
50.0%
~ 1
50.0%
Space Separator
ValueCountFrequency (%)
9902
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 65001 | 82.0%
Common | 11241 | 14.2%
Han | 2979 | 3.8%
Latin | 36 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2887
 
4.4%
2135
 
3.3%
1792
 
2.8%
1393
 
2.1%
1365
 
2.1%
1307
 
2.0%
1281
 
2.0%
1222
 
1.9%
1218
 
1.9%
1204
 
1.9%
Other values (602) 49197
75.7%
Han
ValueCountFrequency (%)
119
 
4.0%
105
 
3.5%
101
 
3.4%
93
 
3.1%
85
 
2.9%
85
 
2.9%
77
 
2.6%
69
 
2.3%
64
 
2.1%
58
 
1.9%
Other values (548) 2123
71.3%
Common
ValueCountFrequency (%)
9902
88.1%
1 262
 
2.3%
2 151
 
1.3%
) 114
 
1.0%
( 114
 
1.0%
3 96
 
0.9%
5 89
 
0.8%
9 86
 
0.8%
4 64
 
0.6%
0 51
 
0.5%
Other values (19) 312
 
2.8%
Latin
ValueCountFrequency (%)
E 4
 
11.1%
B 4
 
11.1%
e 3
 
8.3%
A 3
 
8.3%
a 3
 
8.3%
D 2
 
5.6%
H 2
 
5.6%
n 2
 
5.6%
G 1
 
2.8%
X 1
 
2.8%
Other values (11) 11
30.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 64991 | 82.0%
ASCII | 11171 | 14.1%
CJK | 2901 | 3.7%
None | 104 | 0.1%
CJK Compat Ideographs | 78 | 0.1%
Compat Jamo | 10 | < 0.1%
Number Forms | 1 | < 0.1%
Math Operators | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9902
88.6%
1 262
 
2.3%
2 151
 
1.4%
) 114
 
1.0%
( 114
 
1.0%
3 96
 
0.9%
5 89
 
0.8%
9 86
 
0.8%
4 64
 
0.6%
0 51
 
0.5%
Other values (31) 242
 
2.2%
Hangul
ValueCountFrequency (%)
2887
 
4.4%
2135
 
3.3%
1792
 
2.8%
1393
 
2.1%
1365
 
2.1%
1307
 
2.0%
1281
 
2.0%
1222
 
1.9%
1218
 
1.9%
1204
 
1.9%
Other values (600) 49187
75.7%
CJK
ValueCountFrequency (%)
119
 
4.1%
105
 
3.6%
101
 
3.5%
93
 
3.2%
85
 
2.9%
85
 
2.9%
77
 
2.7%
69
 
2.4%
64
 
2.2%
58
 
2.0%
Other values (525) 2045
70.5%
CJK Compat Ideographs
ValueCountFrequency (%)
35
44.9%
6
 
7.7%
5
 
6.4%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
2
 
2.6%
2
 
2.6%
1
 
1.3%
Other values (13) 13
 
16.7%
None
ValueCountFrequency (%)
24
23.1%
23
22.1%
20
19.2%
19
18.3%
· 12
11.5%
3
 
2.9%
3
 
2.9%
Compat Jamo
ValueCountFrequency (%)
9
90.0%
1
 
10.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

다른명칭
Text

MISSING 

Distinct: 3801
Distinct (%): 51.6%
Missing: 2635
Missing (%): 26.4%
Memory size: 156.2 KiB

Length

Max length: 78
Median length: 67
Mean length: 9.2025798
Min length: 1

Characters and Unicode

Total characters: 67777
Distinct characters: 1953
Distinct categories: 14
Distinct scripts: 5
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3020
Unique (%): 41.0%

Sample

1st row: 鉢形土器底部片
2nd row: 黑釉陶壺
3rd row: 土器底部片, 토기저부편, 토기 저부편
4th row: 十長生 열쇠패, 열쇠패
5th row: 高杯, 有蓋高杯, 고배, 유개고배

Value | Count | Frequency (%)
土器片 | 235 | 1.8%
토기 | 188 | 1.4%
백자 | 179 | 1.3%
靑磁접시 | 168 | 1.3%
토기편 | 122 | 0.9%
청자 | 112 | 0.8%
雜釉陶壺 | 110 | 0.8%
土器底部片 | 91 | 0.7%
無文土器 | 87 | 0.6%
토기저부편 | 87 | 0.6%
Other values (4720) | 12006 | 89.7%

Most occurring characters

ValueCountFrequency (%)
6129
 
9.0%
, 4630
 
6.8%
2043
 
3.0%
1869
 
2.8%
1794
 
2.6%
1583
 
2.3%
1492
 
2.2%
1470
 
2.2%
1375
 
2.0%
1301
 
1.9%
Other values (1943) 44091
65.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 56198 | 82.9%
Space Separator | 6129 | 9.0%
Other Punctuation | 4775 | 7.0%
Close Punctuation | 228 | 0.3%
Open Punctuation | 228 | 0.3%
Lowercase Letter | 181 | 0.3%
Decimal Number | 14 | < 0.1%
Uppercase Letter | 10 | < 0.1%
Dash Punctuation | 5 | < 0.1%
Math Symbol | 4 | < 0.1%
Other values (4) | 5 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2043
 
3.6%
1869
 
3.3%
1794
 
3.2%
1583
 
2.8%
1492
 
2.7%
1470
 
2.6%
1375
 
2.4%
1301
 
2.3%
1230
 
2.2%
1000
 
1.8%
Other values (1888) 41041
73.0%
Lowercase Letter
ValueCountFrequency (%)
e 26
14.4%
r 21
11.6%
a 18
9.9%
n 14
 
7.7%
o 14
 
7.7%
c 11
 
6.1%
l 11
 
6.1%
i 11
 
6.1%
s 9
 
5.0%
d 9
 
5.0%
Other values (9) 37
20.4%
Uppercase Letter
ValueCountFrequency (%)
F 2
20.0%
T 2
20.0%
I 1
10.0%
G 1
10.0%
S 1
10.0%
H 1
10.0%
A 1
10.0%
O 1
10.0%
Other Punctuation
ValueCountFrequency (%)
, 4630
97.0%
' 90
 
1.9%
· 49
 
1.0%
. 3
 
0.1%
" 2
 
< 0.1%
/ 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 8
57.1%
1 3
 
21.4%
7 1
 
7.1%
8 1
 
7.1%
6 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 181
79.4%
24
 
10.5%
19
 
8.3%
4
 
1.8%
Open Punctuation
ValueCountFrequency (%)
( 181
79.4%
25
 
11.0%
18
 
7.9%
4
 
1.8%
Math Symbol
ValueCountFrequency (%)
+ 2
50.0%
1
25.0%
~ 1
25.0%
Space Separator
ValueCountFrequency (%)
6129
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Han | 33323 | 49.2%
Hangul | 22874 | 33.7%
Common | 11387 | 16.8%
Latin | 192 | 0.3%
Hiragana | 1 | < 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
2043
 
6.1%
1869
 
5.6%
1794
 
5.4%
1583
 
4.8%
1470
 
4.4%
1230
 
3.7%
854
 
2.6%
704
 
2.1%
574
 
1.7%
556
 
1.7%
Other values (1428) 20646
62.0%
Hangul
ValueCountFrequency (%)
1492
 
6.5%
1375
 
6.0%
1301
 
5.7%
1000
 
4.4%
950
 
4.2%
626
 
2.7%
615
 
2.7%
482
 
2.1%
461
 
2.0%
447
 
2.0%
Other values (449) 14125
61.8%
Latin
ValueCountFrequency (%)
e 26
13.5%
r 21
10.9%
a 18
 
9.4%
n 14
 
7.3%
o 14
 
7.3%
c 11
 
5.7%
l 11
 
5.7%
i 11
 
5.7%
s 9
 
4.7%
d 9
 
4.7%
Other values (18) 48
25.0%
Common
ValueCountFrequency (%)
6129
53.8%
, 4630
40.7%
) 181
 
1.6%
( 181
 
1.6%
' 90
 
0.8%
· 49
 
0.4%
25
 
0.2%
24
 
0.2%
19
 
0.2%
18
 
0.2%
Other values (17) 41
 
0.4%
Hiragana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
CJK | 32494 | 47.9%
Hangul | 22865 | 33.7%
ASCII | 11432 | 16.9%
CJK Compat Ideographs | 829 | 1.2%
None | 143 | 0.2%
Compat Jamo | 9 | < 0.1%
Punctuation | 2 | < 0.1%
Hiragana | 1 | < 0.1%
Math Operators | 1 | < 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6129
53.6%
, 4630
40.5%
) 181
 
1.6%
( 181
 
1.6%
' 90
 
0.8%
e 26
 
0.2%
r 21
 
0.2%
a 18
 
0.2%
n 14
 
0.1%
o 14
 
0.1%
Other values (34) 128
 
1.1%
CJK
ValueCountFrequency (%)
2043
 
6.3%
1869
 
5.8%
1794
 
5.5%
1583
 
4.9%
1470
 
4.5%
1230
 
3.8%
854
 
2.6%
704
 
2.2%
574
 
1.8%
556
 
1.7%
Other values (1350) 19817
61.0%
Hangul
ValueCountFrequency (%)
1492
 
6.5%
1375
 
6.0%
1301
 
5.7%
1000
 
4.4%
950
 
4.2%
626
 
2.7%
615
 
2.7%
482
 
2.1%
461
 
2.0%
447
 
2.0%
Other values (447) 14116
61.7%
CJK Compat Ideographs
ValueCountFrequency (%)
349
42.1%
110
 
13.3%
44
 
5.3%
43
 
5.2%
38
 
4.6%
25
 
3.0%
24
 
2.9%
15
 
1.8%
14
 
1.7%
12
 
1.4%
Other values (68) 155
18.7%
None
ValueCountFrequency (%)
· 49
34.3%
25
17.5%
24
16.8%
19
 
13.3%
18
 
12.6%
4
 
2.8%
4
 
2.8%
Compat Jamo
ValueCountFrequency (%)
8
88.9%
1
 
11.1%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Hiragana
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

재질(중)
Text

MISSING 

Distinct: 72
Distinct (%): 0.8%
Missing: 1437
Missing (%): 14.4%
Memory size: 156.2 KiB

Length

Max length: 5
Median length: 2
Mean length: 2.046479
Min length: 1

Characters and Unicode

Total characters: 17524
Distinct characters: 84
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 16
Unique (%): 0.2%

Sample

1st row: 연질
2nd row: 흑유
3rd row: 연질
4th row: 동합금
5th row: 경질

Value | Count | Frequency (%)
유리 | 2051 | 24.0%
청자 | 1323 | 15.5%
경질 | 944 | 11.0%
연질 | 940 | 11.0%
백자 | 824 | 9.6%
동합금 | 416 | 4.9%
기타 | 362 | 4.2%
분청 | 221 | 2.6%
 | 220 | 2.6%
청백자 | 177 | 2.1%
Other values (62) | 1085 | 12.7%

Most occurring characters

ValueCountFrequency (%)
2329
13.3%
2294
13.1%
2056
11.7%
2053
11.7%
1835
10.5%
1015
 
5.8%
947
 
5.4%
944
 
5.4%
577
 
3.3%
474
 
2.7%
Other values (74) 3000
17.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 17524 | 100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2329
13.3%
2294
13.1%
2056
11.7%
2053
11.7%
1835
10.5%
1015
 
5.8%
947
 
5.4%
944
 
5.4%
577
 
3.3%
474
 
2.7%
Other values (74) 3000
17.1%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 17524 | 100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2329
13.3%
2294
13.1%
2056
11.7%
2053
11.7%
1835
10.5%
1015
 
5.8%
947
 
5.4%
944
 
5.4%
577
 
3.3%
474
 
2.7%
Other values (74) 3000
17.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 17524 | 100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2329
13.3%
2294
13.1%
2056
11.7%
2053
11.7%
1835
10.5%
1015
 
5.8%
947
 
5.4%
944
 
5.4%
577
 
3.3%
474
 
2.7%
Other values (74) 3000
17.1%

분류(대)
Categorical

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

식생활: 4307
미디어: 2081
주생활: 898
산업/생업: 654
사회생활: 503
Other values (9): 1557

Length

Max length: 5
Median length: 3
Mean length: 3.2938
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 식생활
2nd row: 식생활
3rd row: 식생활
4th row: 주생활
5th row: 사회생활

Common Values

Value | Count | Frequency (%)
식생활 | 4307 | 43.1%
미디어 | 2081 | 20.8%
주생활 | 898 | 9.0%
산업/생업 | 654 | 6.5%
사회생활 | 503 | 5.0%
문화예술 | 481 | 4.8%
<NA> | 410 | 4.1%
의생활 | 189 | 1.9%
기타자료 | 143 | 1.4%
군사 | 133 | 1.3%
Other values (4) | 201 | 2.0%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
식생활 | 4307 | 43.1%
미디어 | 2081 | 20.8%
주생활 | 898 | 9.0%
산업/생업 | 654 | 6.5%
사회생활 | 503 | 5.0%
문화예술 | 481 | 4.8%
na | 410 | 4.1%
의생활 | 189 | 1.9%
기타자료 | 143 | 1.4%
군사 | 133 | 1.3%
Other values (4) | 201 | 2.0%

분류(중)
Text

MISSING 

Distinct: 53
Distinct (%): 0.6%
Missing: 417
Missing (%): 4.2%
Memory size: 156.2 KiB

Length

Max length: 7
Median length: 3
Mean length: 3.2163206
Min length: 2

Characters and Unicode

Total characters: 30822
Distinct characters: 86
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6
Unique (%): 0.1%

Sample

1st row: 음식기
2nd row: 음식기
3rd row: 음식기
4th row: 생활용품/가전
5th row: 의례생활

Value | Count | Frequency (%)
음식기 | 4230 | 44.1%
기록물 | 2059 | 21.5%
건축부재 | 549 | 5.7%
생활용품/가전 | 349 | 3.6%
서화 | 285 | 3.0%
의례생활 | 266 | 2.8%
선사생활 | 251 | 2.6%
사회제도 | 218 | 2.3%
문헌 | 164 | 1.7%
장신구 | 141 | 1.5%
Other values (43) | 1071 | 11.2%

Most occurring characters

ValueCountFrequency (%)
6452
20.9%
4237
13.7%
4231
13.7%
2059
 
6.7%
2059
 
6.7%
866
 
2.8%
866
 
2.8%
682
 
2.2%
549
 
1.8%
549
 
1.8%
Other values (76) 8272
26.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 30428 | 98.7%
Other Punctuation | 394 | 1.3%
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6452
21.2%
4237
13.9%
4231
13.9%
2059
 
6.8%
2059
 
6.8%
866
 
2.8%
866
 
2.8%
682
 
2.2%
549
 
1.8%
549
 
1.8%
Other values (75) 7878
25.9%
Other Punctuation
ValueCountFrequency (%)
/ 394
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 30428 | 98.7%
Common | 394 | 1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6452
21.2%
4237
13.9%
4231
13.9%
2059
 
6.8%
2059
 
6.8%
866
 
2.8%
866
 
2.8%
682
 
2.2%
549
 
1.8%
549
 
1.8%
Other values (75) 7878
25.9%
Common
ValueCountFrequency (%)
/ 394
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 30428 | 98.7%
ASCII | 394 | 1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6452
21.2%
4237
13.9%
4231
13.9%
2059
 
6.8%
2059
 
6.8%
866
 
2.8%
866
 
2.8%
682
 
2.2%
549
 
1.8%
549
 
1.8%
Other values (75) 7878
25.9%
ASCII
ValueCountFrequency (%)
/ 394
100.0%

분류(소)
Text

MISSING 

Distinct: 116
Distinct (%): 1.3%
Missing: 954
Missing (%): 9.5%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 2
Mean length: 2.4660623
Min length: 1

Characters and Unicode

Total characters: 22308
Distinct characters: 154
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 0.2%

Sample

1st row: 음식
2nd row: 저장운반
3rd row: 기타
4th row: 제례
5th row: 회화

Value | Count | Frequency (%)
음식 | 2842 | 31.4%
필름 | 2054 | 22.7%
저장운반 | 748 | 8.3%
지붕재 | 487 | 5.4%
기타 | 372 | 4.1%
회화 | 195 | 2.2%
문서 | 190 | 2.1%
생활구일체 | 156 | 1.7%
상장 | 148 | 1.6%
화장구 | 127 | 1.4%
Other values (106) | 1727 | 19.1%

Most occurring characters

ValueCountFrequency (%)
3001
 
13.5%
2842
 
12.7%
2054
 
9.2%
2054
 
9.2%
1215
 
5.4%
779
 
3.5%
755
 
3.4%
748
 
3.4%
602
 
2.7%
581
 
2.6%
Other values (144) 7677
34.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 22222 | 99.6%
Other Punctuation | 84 | 0.4%
Close Punctuation | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3001
 
13.5%
2842
 
12.8%
2054
 
9.2%
2054
 
9.2%
1215
 
5.5%
779
 
3.5%
755
 
3.4%
748
 
3.4%
602
 
2.7%
581
 
2.6%
Other values (141) 7591
34.2%
Other Punctuation
ValueCountFrequency (%)
/ 84
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 22222 | 99.6%
Common | 86 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3001
 
13.5%
2842
 
12.8%
2054
 
9.2%
2054
 
9.2%
1215
 
5.5%
779
 
3.5%
755
 
3.4%
748
 
3.4%
602
 
2.7%
581
 
2.6%
Other values (141) 7591
34.2%
Common
ValueCountFrequency (%)
/ 84
97.7%
) 1
 
1.2%
( 1
 
1.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 22222 | 99.6%
ASCII | 86 | 0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3001
 
13.5%
2842
 
12.8%
2054
 
9.2%
2054
 
9.2%
1215
 
5.5%
779
 
3.5%
755
 
3.4%
748
 
3.4%
602
 
2.7%
581
 
2.6%
Other values (141) 7591
34.2%
ASCII
ValueCountFrequency (%)
/ 84
97.7%
) 1
 
1.2%
( 1
 
1.2%

분류(세)
Text

MISSING 

Distinct: 280
Distinct (%): 4.4%
Missing: 3666
Missing (%): 36.7%
Memory size: 156.2 KiB

Length

Max length: 7
Median length: 6
Mean length: 2.2192927
Min length: 1

Characters and Unicode

Total characters: 14057
Distinct characters: 241
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 89
Unique (%): 1.4%

Sample

1st row:
2nd row: 항아리
3rd row: 열쇠패
4th row: 제기
5th row: 일반회화

Value | Count | Frequency (%)
접시 | 1023 | 16.2%
항아리 | 488 | 7.7%
기타 | 468 | 7.4%
 | 304 | 4.8%
 | 272 | 4.3%
 | 235 | 3.7%
대접 | 233 | 3.7%
일반회화 | 173 | 2.7%
수막새 | 172 | 2.7%
 | 134 | 2.1%
Other values (270) | 2832 | 44.7%

Most occurring characters

ValueCountFrequency (%)
1257
 
8.9%
1129
 
8.0%
567
 
4.0%
516
 
3.7%
501
 
3.6%
489
 
3.5%
473
 
3.4%
328
 
2.3%
312
 
2.2%
306
 
2.2%
Other values (231) 8179
58.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 13750 | 97.8%
Open Punctuation | 151 | 1.1%
Close Punctuation | 151 | 1.1%
Other Punctuation | 5 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1257
 
9.1%
1129
 
8.2%
567
 
4.1%
516
 
3.8%
501
 
3.6%
489
 
3.6%
473
 
3.4%
328
 
2.4%
312
 
2.3%
306
 
2.2%
Other values (228) 7872
57.3%
Open Punctuation
ValueCountFrequency (%)
( 151
100.0%
Close Punctuation
ValueCountFrequency (%)
) 151
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 13750 | 97.8%
Common | 307 | 2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1257
 
9.1%
1129
 
8.2%
567
 
4.1%
516
 
3.8%
501
 
3.6%
489
 
3.6%
473
 
3.4%
328
 
2.4%
312
 
2.3%
306
 
2.2%
Other values (228) 7872
57.3%
Common
ValueCountFrequency (%)
( 151
49.2%
) 151
49.2%
/ 5
 
1.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 13750 | 97.8%
ASCII | 307 | 2.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1257
 
9.1%
1129
 
8.2%
567
 
4.1%
516
 
3.8%
501
 
3.6%
489
 
3.6%
473
 
3.4%
328
 
2.4%
312
 
2.3%
306
 
2.2%
Other values (228) 7872
57.3%
ASCII
ValueCountFrequency (%)
( 151
49.2%
) 151
49.2%
/ 5
 
1.6%

Correlations

Variable | 소장기관명 | 재질(중) | 분류(대) | 분류(중)
소장기관명 | 1.000 | 0.838 | 0.783 | 0.844
재질(중) | 0.838 | 1.000 | 0.887 | 0.918
분류(대) | 0.783 | 0.887 | 1.000 | 1.000
분류(중) | 0.844 | 0.918 | 1.000 | 1.000

Variable | 소장기관명 | 분류(대)
소장기관명 | 1.000 | 0.379
분류(대) | 0.379 | 1.000

Variable | 소장기관명 | 분류(대)
소장기관명 | 1.000 | 0.379
분류(대) | 0.379 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
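
To make the idea of nullity correlation concrete, the sketch below turns each column into a present/absent indicator and computes the Pearson correlation between two indicators. This is one plausible reading of the measure described above, written with the standard library only; the function name and the toy columns are illustrative and are not taken from the report's internals.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    Returns None when either column is fully present or fully missing,
    because the indicator then has zero variance.
    """
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Toy hierarchy: the finer level is missing whenever the coarser one is.
mid_level = ["음식기", None, "기록물", "음식기", None, "서화"]
fine_level = ["음식", None, "필름", None, None, "회화"]
complete = ["a", "b", "c", "d", "e", "f"]

r = nullity_correlation(mid_level, fine_level)
print(round(r, 4))
```

A value near 1 means the two columns tend to be present or missing together; a value near 0 means the missingness of one says little about the other.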

Sample

Index | 소장품고유아이디 | 소장기관명 | 명칭 | 다른명칭 | 재질(중) | 분류(대) | 분류(중) | 분류(소) | 분류(세)
23745 | PS0100100500105708900000 | 국립1-국립광주박물관-광주 | 발형토기저부편 | 鉢形土器底部片 | 연질 | 식생활 | 음식기 | 음식 |
40889 | PS0100100100701311500000 | 국립1-국립중앙박물관-신안 | 흑유도호 | 黑釉陶壺 | 흑유 | 식생활 | 음식기 | 저장운반 | 항아리
58057 | PS0100100101103347900000 | 국립1-국립중앙박물관-신수 | 토기 바닥 조각 | 土器底部片, 토기저부편, 토기 저부편 | 연질 | 식생활 | 음식기 | <NA> | <NA>
77500 | PS0100100101101242800000 | 국립1-국립중앙박물관-신수 | 십장생 열쇠패 | 十長生 열쇠패, 열쇠패 | 동합금 | 주생활 | 생활용품/가전 | 기타 | 열쇠패
80868 | PS0100100101100215000000 | 국립1-국립중앙박물관-신수 | 뚜껑 있는 굽다리 접시 | 高杯, 有蓋高杯, 고배, 유개고배 | 경질 | 사회생활 | 의례생활 | 제례 | 제기
51898 | PS0100100100900259000000 | 국립1-국립중앙박물관-동원 | 고종 어진 | 高宗 御眞 | | 문화예술 | 서화 | 회화 | 일반회화
31586 | PS0100100101100523300000 | 국립1-국립중앙박물관-신수 | 각진 병 | 扁甁, 편병, 광구편병, 廣口扁甁 | <NA> | 식생활 | 음식기 | 저장운반 |
544 | PS0100100101600980600000 | 국립1-국립중앙박물관-구 | 대한제국 관련 기사가 실린 영국신문 | <NA> | 기타 | 미디어 | 신문/방송 | 신문 | <NA>
76645 | PS0100100101700369300000 | 국립1-국립중앙박물관-증 | 박자 | 拍子, 두들개 | <NA> | <NA> | <NA> | <NA> | <NA>
88934 | PS0100100100700874200000 | 국립1-국립중앙박물관-신안 | 청자쌍어문전접시 | 靑磁雙魚文전접시 | 청자 | 식생활 | 음식기 | 음식 | 접시

Index | 소장품고유아이디 | 소장기관명 | 명칭 | 다른명칭 | 재질(중) | 분류(대) | 분류(중) | 분류(소) | 분류(세)
23013 | PS0100100102102372700000 | 국립1-국립중앙박물관-고적 | 말띠꾸미개편 | 雲珠片 | 청동 | 교통/통신 | 마구 | 장식 | 운주
87281 | PS0100100102002730600000 | 국립1-국립중앙박물관-건판 | 경기개성 천마산 관음사 석조관세음보살반가상 | <NA> | 유리 | 미디어 | 기록물 | 필름 | <NA>
45448 | PS0100100102100429000000 | 국립1-국립중앙박물관-고적 | 백자편 | 白磁片 | 백자 | 식생활 | 음식기 | 음식 | <NA>
30876 | PS0100100101102391400000 | 국립1-국립중앙박물관-신수 | 그물추 | 土製漁網錘, 토제어망추, 고기잡이그물추 | 연질 | 산업/생업 | 어업 | 어로 | 어망추
4817 | PS0100100101102085000000 | 국립1-국립중앙박물관-신수 | 귀걸이 | 金銅耳飾, 금동이식, 금동귀걸이 | 금동 | 의생활 | 장신구 | 신체장식 | 이식(귀걸이)
67395 | PS0100100102003456000000 | 국립1-국립중앙박물관-건판 | 서울 서대문 홍제동 오층석탑 | <NA> | 유리 | 미디어 | 기록물 | 필름 | <NA>
50711 | PS0100100101700593200000 | 국립1-국립중앙박물관-증 | 백자대접 | 백자 대접 | 백자 | 식생활 | 음식기 | 음식 | 대접
39775 | PS0100100102002763200000 | 국립1-국립중앙박물관-건판 | 경북경주 호우총 출토 각종 행엽 | <NA> | 유리 | 미디어 | 기록물 | 필름 | <NA>
54632 | PS0100100101101972500000 | 국립1-국립중앙박물관-신수 | 항아리 | 圓底短頸壺, 원저단경호, 둥근밑항아리, 둥근바닥항아리 | 연질 | 식생활 | 음식기 | 저장운반 | 항아리
9929 | PS0100100101104016300000 | 국립1-국립중앙박물관-신수 | 수키와 | 圓瓦, 원와 | 경질 | 주생활 | 건축부재 | 지붕재 | 수키와