Overview

Dataset statistics

Number of variables7
Number of observations3274
Missing cells13
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory179.2 KiB
Average record size in memory56.0 B

Variable types

Text5
Categorical1
DateTime1

Dataset

Description국립중앙과학관은 기초과학, 응용과학, 산업기술, 과학기술사, 자연사 등 분야에서 수집해온 과학기술자료들을 DB화하여 전시, 교육, 연구의 자원으로 활용 가능한 데이터입니다. 3D 프린팅 데이터와 소장자료에 대한 이름, 취득방법, 사진 등의 정보를 바탕으로 대여열람을 위한 기초자료로서 유관기관의 활용에 도움이 될것으로 기대됩니다.
Author과학기술정보통신부 국립중앙과학관
URLhttps://www.data.go.kr/data/15048431/fileData.do

Alerts

메타 아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:54:44.704624
Analysis finished2023-12-12 17:54:46.456203
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

메타 아이디
Text

UNIQUE 

Distinct3274
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
2023-12-13T02:54:46.664636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters45836
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3274 ?
Unique (%)100.0%

Sample

1st rowmeta_000002961
2nd rowmeta_000003212
3rd rowmeta_000002854
4th rowmeta_000003146
5th rowmeta_000003905
ValueCountFrequency (%)
meta_000002961 1
 
< 0.1%
meta_000000620 1
 
< 0.1%
meta_000000051 1
 
< 0.1%
meta_000000380 1
 
< 0.1%
meta_000003683 1
 
< 0.1%
meta_000000768 1
 
< 0.1%
meta_000003795 1
 
< 0.1%
meta_000000389 1
 
< 0.1%
meta_000000421 1
 
< 0.1%
meta_000003226 1
 
< 0.1%
Other values (3264) 3264
99.7%
2023-12-13T02:54:47.142709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 18422
40.2%
m 3274
 
7.1%
e 3274
 
7.1%
t 3274
 
7.1%
a 3274
 
7.1%
_ 3274
 
7.1%
2 2054
 
4.5%
3 1942
 
4.2%
1 1430
 
3.1%
5 958
 
2.1%
Other values (5) 4660
 
10.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29466
64.3%
Lowercase Letter 13096
28.6%
Connector Punctuation 3274
 
7.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 18422
62.5%
2 2054
 
7.0%
3 1942
 
6.6%
1 1430
 
4.9%
5 958
 
3.3%
4 956
 
3.2%
6 954
 
3.2%
7 947
 
3.2%
8 943
 
3.2%
9 860
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
m 3274
25.0%
e 3274
25.0%
t 3274
25.0%
a 3274
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3274
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32740
71.4%
Latin 13096
 
28.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 18422
56.3%
_ 3274
 
10.0%
2 2054
 
6.3%
3 1942
 
5.9%
1 1430
 
4.4%
5 958
 
2.9%
4 956
 
2.9%
6 954
 
2.9%
7 947
 
2.9%
8 943
 
2.9%
Latin
ValueCountFrequency (%)
m 3274
25.0%
e 3274
25.0%
t 3274
25.0%
a 3274
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45836
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 18422
40.2%
m 3274
 
7.1%
e 3274
 
7.1%
t 3274
 
7.1%
a 3274
 
7.1%
_ 3274
 
7.1%
2 2054
 
4.5%
3 1942
 
4.2%
1 1430
 
3.1%
5 958
 
2.1%
Other values (5) 4660
 
10.2%

대분류
Categorical

Distinct36
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
해양생물(패류)
360 
해양생물(국내패류)
352 
곤충류
315 
조류
225 
인쇄
211 
Other values (31)
1811 

Length

Max length10
Median length9
Mean length4.3478925
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row생활
2nd row생활
3rd row생활용품
4th row생활용품
5th row조립가능3D

Common Values

ValueCountFrequency (%)
해양생물(패류) 360
 
11.0%
해양생물(국내패류) 352
 
10.8%
곤충류 315
 
9.6%
조류 225
 
6.9%
인쇄 211
 
6.4%
어류 198
 
6.0%
생활 194
 
5.9%
거미류 193
 
5.9%
건축 156
 
4.8%
암석/화석 106
 
3.2%
Other values (26) 964
29.4%

Length

2023-12-13T02:54:47.357243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해양생물(패류 360
 
11.0%
해양생물(국내패류 352
 
10.8%
곤충류 315
 
9.6%
조류 225
 
6.9%
인쇄 211
 
6.4%
어류 198
 
6.0%
생활 196
 
6.0%
거미류 193
 
5.9%
건축 156
 
4.8%
암석/화석 106
 
3.2%
Other values (24) 962
29.4%
Distinct323
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
2023-12-13T02:54:47.693584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length40
Mean length4.5296274
Min length2

Characters and Unicode

Total characters14830
Distinct characters341
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique177 ?
Unique (%)5.4%

Sample

1st row주생활
2nd row의례
3rd row가전
4th row기계
5th row기타
ValueCountFrequency (%)
흡강목 309
 
9.4%
거미목 193
 
5.9%
딱정벌레목 182
 
5.6%
연활자 151
 
4.6%
백합목 108
 
3.3%
무기 93
 
2.8%
참새목 86
 
2.6%
연장 85
 
2.6%
전기_전자 85
 
2.6%
나비목 77
 
2.4%
Other values (311) 1905
58.2%
2023-12-13T02:54:48.202727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2009
 
13.5%
498
 
3.4%
a 440
 
3.0%
310
 
2.1%
309
 
2.1%
i 288
 
1.9%
o 287
 
1.9%
274
 
1.8%
270
 
1.8%
r 269
 
1.8%
Other values (331) 9876
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11108
74.9%
Lowercase Letter 2935
 
19.8%
Uppercase Letter 280
 
1.9%
Open Punctuation 192
 
1.3%
Close Punctuation 192
 
1.3%
Connector Punctuation 97
 
0.7%
Other Punctuation 10
 
0.1%
Space Separator 8
 
0.1%
Decimal Number 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2009
 
18.1%
498
 
4.5%
310
 
2.8%
309
 
2.8%
274
 
2.5%
270
 
2.4%
244
 
2.2%
237
 
2.1%
233
 
2.1%
226
 
2.0%
Other values (274) 6498
58.5%
Lowercase Letter
ValueCountFrequency (%)
a 440
15.0%
i 288
9.8%
o 287
9.8%
r 269
9.2%
s 216
 
7.4%
u 189
 
6.4%
t 185
 
6.3%
d 158
 
5.4%
e 156
 
5.3%
n 124
 
4.2%
Other values (16) 623
21.2%
Uppercase Letter
ValueCountFrequency (%)
P 57
20.4%
S 41
14.6%
A 37
13.2%
C 20
 
7.1%
T 19
 
6.8%
O 18
 
6.4%
M 15
 
5.4%
R 13
 
4.6%
L 13
 
4.6%
E 8
 
2.9%
Other values (11) 39
13.9%
Decimal Number
ValueCountFrequency (%)
4 2
25.0%
7 2
25.0%
8 2
25.0%
1 2
25.0%
Other Punctuation
ValueCountFrequency (%)
/ 8
80.0%
, 2
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 192
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 97
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11108
74.9%
Latin 3215
 
21.7%
Common 507
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2009
 
18.1%
498
 
4.5%
310
 
2.8%
309
 
2.8%
274
 
2.5%
270
 
2.4%
244
 
2.2%
237
 
2.1%
233
 
2.1%
226
 
2.0%
Other values (274) 6498
58.5%
Latin
ValueCountFrequency (%)
a 440
13.7%
i 288
 
9.0%
o 287
 
8.9%
r 269
 
8.4%
s 216
 
6.7%
u 189
 
5.9%
t 185
 
5.8%
d 158
 
4.9%
e 156
 
4.9%
n 124
 
3.9%
Other values (37) 903
28.1%
Common
ValueCountFrequency (%)
( 192
37.9%
) 192
37.9%
_ 97
19.1%
8
 
1.6%
/ 8
 
1.6%
4 2
 
0.4%
7 2
 
0.4%
8 2
 
0.4%
1 2
 
0.4%
, 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11108
74.9%
ASCII 3722
 
25.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2009
 
18.1%
498
 
4.5%
310
 
2.8%
309
 
2.8%
274
 
2.5%
270
 
2.4%
244
 
2.2%
237
 
2.1%
233
 
2.1%
226
 
2.0%
Other values (274) 6498
58.5%
ASCII
ValueCountFrequency (%)
a 440
 
11.8%
i 288
 
7.7%
o 287
 
7.7%
r 269
 
7.2%
s 216
 
5.8%
( 192
 
5.2%
) 192
 
5.2%
u 189
 
5.1%
t 185
 
5.0%
d 158
 
4.2%
Other values (47) 1306
35.1%
Distinct741
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
2023-12-13T02:54:48.484904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length38
Mean length5.0250458
Min length1

Characters and Unicode

Total characters16452
Distinct characters488
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique365 ?
Unique (%)11.1%

Sample

1st row자물쇠
2nd row동경
3rd row가전
4th row기계
5th row기타
ValueCountFrequency (%)
한글바탕체 117
 
3.6%
전기 75
 
2.3%
물레고둥과 59
 
1.8%
휴대전화 53
 
1.6%
맞춤 46
 
1.4%
화살제작 43
 
1.3%
하늘소과 42
 
1.3%
발우제작 42
 
1.3%
서계문집 42
 
1.3%
수레제작 41
 
1.3%
Other values (730) 2714
82.9%
2023-12-13T02:54:48.960898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1850
 
11.2%
a 461
 
2.8%
e 404
 
2.5%
367
 
2.2%
335
 
2.0%
i 310
 
1.9%
286
 
1.7%
264
 
1.6%
261
 
1.6%
253
 
1.5%
Other values (478) 11661
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13334
81.0%
Lowercase Letter 2596
 
15.8%
Uppercase Letter 247
 
1.5%
Other Punctuation 96
 
0.6%
Open Punctuation 89
 
0.5%
Close Punctuation 89
 
0.5%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1850
 
13.9%
367
 
2.8%
335
 
2.5%
286
 
2.1%
264
 
2.0%
261
 
2.0%
253
 
1.9%
233
 
1.7%
231
 
1.7%
223
 
1.7%
Other values (430) 9031
67.7%
Lowercase Letter
ValueCountFrequency (%)
a 461
17.8%
e 404
15.6%
i 310
11.9%
d 224
8.6%
r 150
 
5.8%
c 133
 
5.1%
t 133
 
5.1%
o 132
 
5.1%
l 128
 
4.9%
s 97
 
3.7%
Other values (12) 424
16.3%
Uppercase Letter
ValueCountFrequency (%)
A 40
16.2%
C 34
13.8%
P 29
11.7%
S 17
 
6.9%
M 16
 
6.5%
H 15
 
6.1%
T 13
 
5.3%
D 13
 
5.3%
L 12
 
4.9%
N 11
 
4.5%
Other values (11) 47
19.0%
Other Punctuation
ValueCountFrequency (%)
, 88
91.7%
/ 8
 
8.3%
Open Punctuation
ValueCountFrequency (%)
( 89
100.0%
Close Punctuation
ValueCountFrequency (%)
) 89
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13334
81.0%
Latin 2843
 
17.3%
Common 275
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1850
 
13.9%
367
 
2.8%
335
 
2.5%
286
 
2.1%
264
 
2.0%
261
 
2.0%
253
 
1.9%
233
 
1.7%
231
 
1.7%
223
 
1.7%
Other values (430) 9031
67.7%
Latin
ValueCountFrequency (%)
a 461
16.2%
e 404
14.2%
i 310
10.9%
d 224
 
7.9%
r 150
 
5.3%
c 133
 
4.7%
t 133
 
4.7%
o 132
 
4.6%
l 128
 
4.5%
s 97
 
3.4%
Other values (33) 671
23.6%
Common
ValueCountFrequency (%)
( 89
32.4%
) 89
32.4%
, 88
32.0%
/ 8
 
2.9%
1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13334
81.0%
ASCII 3118
 
19.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1850
 
13.9%
367
 
2.8%
335
 
2.5%
286
 
2.1%
264
 
2.0%
261
 
2.0%
253
 
1.9%
233
 
1.7%
231
 
1.7%
223
 
1.7%
Other values (430) 9031
67.7%
ASCII
ValueCountFrequency (%)
a 461
14.8%
e 404
13.0%
i 310
 
9.9%
d 224
 
7.2%
r 150
 
4.8%
c 133
 
4.3%
t 133
 
4.3%
o 132
 
4.2%
l 128
 
4.1%
s 97
 
3.1%
Other values (38) 946
30.3%
Distinct3254
Distinct (%)99.4%
Missing1
Missing (%)< 0.1%
Memory size25.7 KiB
2023-12-13T02:54:49.225989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length35
Mean length9.0106936
Min length1

Characters and Unicode

Total characters29492
Distinct characters1733
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3235 ?
Unique (%)98.8%

Sample

1st row '五子三元'명자물통
2nd row15세기동경
3rd row1950년대선풍기
4th row1960년대아이스크림제조기
5th row2D퍼즐동물세트강아지
ValueCountFrequency (%)
숫자 3
 
0.1%
숫자연활자10개모음(원 3
 
0.1%
혹줄돼지고둥 2
 
0.1%
둥근전복 2
 
0.1%
달팽이 2
 
0.1%
돼지고둥 2
 
0.1%
검은줄좁쌀무늬고둥 2
 
0.1%
파라사우롤로푸스 2
 
0.1%
dubia 2
 
0.1%
우럭 2
 
0.1%
Other values (3249) 3261
99.3%
2023-12-13T02:54:49.653340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 1293
 
4.4%
( 731
 
2.5%
) 731
 
2.5%
450
 
1.5%
435
 
1.5%
432
 
1.5%
413
 
1.4%
- 375
 
1.3%
0 374
 
1.3%
364
 
1.2%
Other values (1723) 23894
81.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22601
76.6%
Lowercase Letter 1587
 
5.4%
Other Punctuation 1342
 
4.6%
Decimal Number 1203
 
4.1%
Uppercase Letter 863
 
2.9%
Open Punctuation 738
 
2.5%
Close Punctuation 737
 
2.5%
Dash Punctuation 375
 
1.3%
Letter Number 13
 
< 0.1%
Space Separator 12
 
< 0.1%
Other values (2) 21
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
450
 
2.0%
435
 
1.9%
432
 
1.9%
413
 
1.8%
364
 
1.6%
310
 
1.4%
307
 
1.4%
289
 
1.3%
277
 
1.2%
256
 
1.1%
Other values (1630) 19068
84.4%
Uppercase Letter
ValueCountFrequency (%)
S 113
13.1%
M 86
 
10.0%
C 69
 
8.0%
H 68
 
7.9%
A 65
 
7.5%
P 52
 
6.0%
R 52
 
6.0%
T 47
 
5.4%
D 40
 
4.6%
G 31
 
3.6%
Other values (16) 240
27.8%
Lowercase Letter
ValueCountFrequency (%)
a 182
11.5%
i 143
 
9.0%
o 143
 
9.0%
n 126
 
7.9%
e 125
 
7.9%
t 111
 
7.0%
u 102
 
6.4%
r 101
 
6.4%
s 97
 
6.1%
l 95
 
6.0%
Other values (15) 362
22.8%
Decimal Number
ValueCountFrequency (%)
0 374
31.1%
1 281
23.4%
2 130
 
10.8%
3 105
 
8.7%
5 77
 
6.4%
4 59
 
4.9%
8 56
 
4.7%
6 47
 
3.9%
9 43
 
3.6%
7 31
 
2.6%
Letter Number
ValueCountFrequency (%)
4
30.8%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 1293
96.3%
. 30
 
2.2%
& 8
 
0.6%
' 5
 
0.4%
: 2
 
0.1%
! 2
 
0.1%
? 1
 
0.1%
/ 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 6
54.5%
= 2
 
18.2%
1
 
9.1%
1
 
9.1%
+ 1
 
9.1%
Other Symbol
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
1
 
10.0%
1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 731
99.1%
[ 7
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 731
99.2%
] 6
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 375
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21495
72.9%
Common 4428
 
15.0%
Latin 2462
 
8.3%
Han 1106
 
3.8%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
450
 
2.1%
435
 
2.0%
432
 
2.0%
413
 
1.9%
364
 
1.7%
310
 
1.4%
307
 
1.4%
289
 
1.3%
277
 
1.3%
256
 
1.2%
Other values (1485) 17962
83.6%
Han
ValueCountFrequency (%)
151
13.7%
147
13.3%
147
13.3%
141
12.7%
139
12.6%
139
12.6%
11
 
1.0%
9
 
0.8%
8
 
0.7%
8
 
0.7%
Other values (135) 206
18.6%
Latin
ValueCountFrequency (%)
a 182
 
7.4%
i 143
 
5.8%
o 143
 
5.8%
n 126
 
5.1%
e 125
 
5.1%
S 113
 
4.6%
t 111
 
4.5%
u 102
 
4.1%
r 101
 
4.1%
s 97
 
3.9%
Other values (49) 1219
49.5%
Common
ValueCountFrequency (%)
, 1293
29.2%
( 731
16.5%
) 731
16.5%
- 375
 
8.5%
0 374
 
8.4%
1 281
 
6.3%
2 130
 
2.9%
3 105
 
2.4%
5 77
 
1.7%
4 59
 
1.3%
Other values (23) 272
 
6.1%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21471
72.8%
ASCII 6865
 
23.3%
CJK 1103
 
3.7%
Compat Jamo 24
 
0.1%
Number Forms 13
 
< 0.1%
Box Drawing 6
 
< 0.1%
Geometric Shapes 4
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Math Operators 1
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 1293
18.8%
( 731
 
10.6%
) 731
 
10.6%
- 375
 
5.5%
0 374
 
5.4%
1 281
 
4.1%
a 182
 
2.7%
i 143
 
2.1%
o 143
 
2.1%
2 130
 
1.9%
Other values (67) 2482
36.2%
Hangul
ValueCountFrequency (%)
450
 
2.1%
435
 
2.0%
432
 
2.0%
413
 
1.9%
364
 
1.7%
310
 
1.4%
307
 
1.4%
289
 
1.3%
277
 
1.3%
256
 
1.2%
Other values (1472) 17938
83.5%
CJK
ValueCountFrequency (%)
151
13.7%
147
13.3%
147
13.3%
141
12.8%
139
12.6%
139
12.6%
11
 
1.0%
9
 
0.8%
8
 
0.7%
8
 
0.7%
Other values (132) 203
18.4%
Box Drawing
ValueCountFrequency (%)
6
100.0%
Compat Jamo
ValueCountFrequency (%)
5
20.8%
3
12.5%
3
12.5%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (3) 3
12.5%
Number Forms
ValueCountFrequency (%)
4
30.8%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Geometric Shapes
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
ValueCountFrequency (%)
α 1
100.0%
Distinct2665
Distinct (%)81.7%
Missing12
Missing (%)0.4%
Memory size25.7 KiB
2023-12-13T02:54:49.972381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length48
Mean length21.690067
Min length2

Characters and Unicode

Total characters70753
Distinct characters76
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2496 ?
Unique (%)76.5%

Sample

1st rowLock
2nd rowBronze Mirror
3rd row1950's Electric Fan
4th row1960's Ice Cream Maker
5th rowCart
ValueCountFrequency (%)
pieces 150
 
1.7%
types 149
 
1.7%
korean 140
 
1.6%
modern 139
 
1.6%
lead 139
 
1.6%
10 138
 
1.6%
samsung 55
 
0.6%
phone 53
 
0.6%
cellular 52
 
0.6%
a 51
 
0.6%
Other values (4367) 7765
87.9%
2023-12-13T02:54:50.490668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 6542
 
9.2%
6103
 
8.6%
e 5369
 
7.6%
i 4596
 
6.5%
o 4276
 
6.0%
s 4157
 
5.9%
r 3789
 
5.4%
n 3622
 
5.1%
l 3143
 
4.4%
t 3056
 
4.3%
Other values (66) 26100
36.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 54479
77.0%
Space Separator 6103
 
8.6%
Uppercase Letter 5716
 
8.1%
Decimal Number 2629
 
3.7%
Other Punctuation 557
 
0.8%
Close Punctuation 519
 
0.7%
Open Punctuation 519
 
0.7%
Dash Punctuation 220
 
0.3%
Math Symbol 6
 
< 0.1%
Letter Number 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 6542
12.0%
e 5369
9.9%
i 4596
 
8.4%
o 4276
 
7.8%
s 4157
 
7.6%
r 3789
 
7.0%
n 3622
 
6.6%
l 3143
 
5.8%
t 3056
 
5.6%
u 2921
 
5.4%
Other values (17) 13008
23.9%
Uppercase Letter
ValueCountFrequency (%)
S 670
11.7%
C 610
 
10.7%
P 599
 
10.5%
M 457
 
8.0%
A 421
 
7.4%
H 284
 
5.0%
L 272
 
4.8%
B 269
 
4.7%
K 266
 
4.7%
G 256
 
4.5%
Other values (16) 1612
28.2%
Decimal Number
ValueCountFrequency (%)
1 682
25.9%
0 456
17.3%
8 387
14.7%
9 242
 
9.2%
7 178
 
6.8%
6 169
 
6.4%
5 163
 
6.2%
2 131
 
5.0%
3 128
 
4.9%
4 93
 
3.5%
Other Punctuation
ValueCountFrequency (%)
, 412
74.0%
. 108
 
19.4%
& 33
 
5.9%
' 3
 
0.5%
/ 1
 
0.2%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
6103
100.0%
Close Punctuation
ValueCountFrequency (%)
) 519
100.0%
Open Punctuation
ValueCountFrequency (%)
( 519
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 220
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 60198
85.1%
Common 10553
 
14.9%
Han 1
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 6542
 
10.9%
e 5369
 
8.9%
i 4596
 
7.6%
o 4276
 
7.1%
s 4157
 
6.9%
r 3789
 
6.3%
n 3622
 
6.0%
l 3143
 
5.2%
t 3056
 
5.1%
u 2921
 
4.9%
Other values (44) 18727
31.1%
Common
ValueCountFrequency (%)
6103
57.8%
1 682
 
6.5%
) 519
 
4.9%
( 519
 
4.9%
0 456
 
4.3%
, 412
 
3.9%
8 387
 
3.7%
9 242
 
2.3%
- 220
 
2.1%
7 178
 
1.7%
Other values (10) 835
 
7.9%
Han
ValueCountFrequency (%)
1
100.0%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70747
> 99.9%
Number Forms 4
 
< 0.1%
CJK 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 6542
 
9.2%
6103
 
8.6%
e 5369
 
7.6%
i 4596
 
6.5%
o 4276
 
6.0%
s 4157
 
5.9%
r 3789
 
5.4%
n 3622
 
5.1%
l 3143
 
4.4%
t 3056
 
4.3%
Other values (62) 26094
36.9%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
α 1
100.0%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
Minimum2015-03-04 00:00:00
Maximum2016-09-07 00:00:00
2023-12-13T02:54:50.616695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:54:50.722485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

Correlations

2023-12-13T02:54:50.788775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류등록일
대분류1.0000.998
등록일0.9981.000

Missing values

2023-12-13T02:54:46.129845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:54:46.264713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:54:46.387643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

메타 아이디대분류중분류소분류한글명영문명등록일
0meta_000002961생활주생활자물쇠'五子三元'명자물통Lock2016-09-07
1meta_000003212생활의례동경15세기동경Bronze Mirror2016-09-07
2meta_000002854생활용품가전가전1950년대선풍기1950's Electric Fan2016-09-07
3meta_000003146생활용품기계기계1960년대아이스크림제조기1960's Ice Cream Maker2016-09-07
4meta_000003905조립가능3D기타기타2D퍼즐동물세트강아지<NA>2016-09-07
5meta_000003906조립가능3D기타기타2D퍼즐동물세트토끼<NA>2016-09-07
6meta_000003903조립가능3D기타기타2D퍼즐무리쉬아이돌<NA>2016-09-07
7meta_000003904조립가능3D기타기타2D퍼즐열대어<NA>2016-09-07
8meta_000002872생활운송수레제작2바퀴수레Cart2016-09-07
9meta_000003221기계시계탁상시계5분모래시계5minute Sandglass2016-09-07
메타 아이디대분류중분류소분류한글명영문명등록일
3264meta_000000647곤충류나비목밤나방과흰줄뒷날개나방Catocala lara2015-03-04
3265meta_000000340해양생물(갑각류)완흉목따개비과흰줄따개비Balanus albicostatus Pilsbry2015-03-04
3266meta_000000844거미류거미목깡충거미과흰줄무늬깡충거미(암)Sitticus albolineatus2015-03-04
3267meta_000000235조류기러기목오리과흰줄박이오리(수)Histrionicus histrionicus2015-03-04
3268meta_000000234조류기러기목오리과흰줄박이오리(암)Histrionicus histrionicus2015-03-04
3269meta_000000440곤충류나비목네발나비과흰줄표범나비Argyronome laodice (Pallas)2015-03-04
3270meta_000003525해양생물(국내패류)고복족목소라과흰팥알고둥Collonista amakusaensis2016-09-07
3271meta_000000966해양생물(패류)신복족목대추고둥과흰혹밤색줄고둥Amalda rubiginosa albocallosa (Lischke, 1873)2015-03-04
3272meta_000003258공룡힙실로포돈(Hypsilophodon)조각류,진조각류,힙실로포돈과힙실로포돈Hypsilophodon2016-09-07
3273meta_000000119조류참새목참새과힝둥새Anthus hodgsoni2015-03-04