Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 3274 |
Missing cells | 13 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 179.2 KiB |
Average record size in memory | 56.0 B |
Variable types
Text | 5 |
---|---|
Categorical | 1 |
DateTime | 1 |
Dataset
Description | 국립중앙과학관은 기초과학, 응용과학, 산업기술, 과학기술사, 자연사 등 분야에서 수집해온 과학기술자료들을 DB화하여 전시, 교육, 연구의 자원으로 활용 가능한 데이터입니다. 3D 프린팅 데이터와 소장자료에 대한 이름, 취득방법, 사진 등의 정보를 바탕으로 대여열람을 위한 기초자료로서 유관기관의 활용에 도움이 될것으로 기대됩니다. |
---|---|
Author | 과학기술정보통신부 국립중앙과학관 |
URL | https://www.data.go.kr/data/15048431/fileData.do |
메타 아이디 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 17:54:44.704624 |
---|---|
Analysis finished | 2023-12-12 17:54:46.456203 |
Duration | 1.75 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
메타 아이디
Text
UNIQUE
 
Distinct | 3274 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.7 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 45836 |
---|---|
Distinct characters | 15 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 3274 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | meta_000002961 |
---|---|
2nd row | meta_000003212 |
3rd row | meta_000002854 |
4th row | meta_000003146 |
5th row | meta_000003905 |
Value | Count | Frequency (%) |
meta_000002961 | 1 | < 0.1% |
meta_000000620 | 1 | < 0.1% |
meta_000000051 | 1 | < 0.1% |
meta_000000380 | 1 | < 0.1% |
meta_000003683 | 1 | < 0.1% |
meta_000000768 | 1 | < 0.1% |
meta_000003795 | 1 | < 0.1% |
meta_000000389 | 1 | < 0.1% |
meta_000000421 | 1 | < 0.1% |
meta_000003226 | 1 | < 0.1% |
Other values (3264) | 3264 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18422 | |
m | 3274 | 7.1% |
e | 3274 | 7.1% |
t | 3274 | 7.1% |
a | 3274 | 7.1% |
_ | 3274 | 7.1% |
2 | 2054 | 4.5% |
3 | 1942 | 4.2% |
1 | 1430 | 3.1% |
5 | 958 | 2.1% |
Other values (5) | 4660 | 10.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 29466 | |
Lowercase Letter | 13096 | |
Connector Punctuation | 3274 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18422 | |
2 | 2054 | 7.0% |
3 | 1942 | 6.6% |
1 | 1430 | 4.9% |
5 | 958 | 3.3% |
4 | 956 | 3.2% |
6 | 954 | 3.2% |
7 | 947 | 3.2% |
8 | 943 | 3.2% |
9 | 860 | 2.9% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 3274 | |
e | 3274 | |
t | 3274 | |
a | 3274 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3274 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 32740 | |
Latin | 13096 | 28.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18422 | |
_ | 3274 | 10.0% |
2 | 2054 | 6.3% |
3 | 1942 | 5.9% |
1 | 1430 | 4.4% |
5 | 958 | 2.9% |
4 | 956 | 2.9% |
6 | 954 | 2.9% |
7 | 947 | 2.9% |
8 | 943 | 2.9% |
Latin
Value | Count | Frequency (%) |
m | 3274 | |
e | 3274 | |
t | 3274 | |
a | 3274 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 45836 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18422 | |
m | 3274 | 7.1% |
e | 3274 | 7.1% |
t | 3274 | 7.1% |
a | 3274 | 7.1% |
_ | 3274 | 7.1% |
2 | 2054 | 4.5% |
3 | 1942 | 4.2% |
1 | 1430 | 3.1% |
5 | 958 | 2.1% |
Other values (5) | 4660 | 10.2% |
대분류
Categorical
Distinct | 36 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.7 KiB |
해양생물(패류) | |
---|---|
해양생물(국내패류) | |
곤충류 | |
조류 | |
인쇄 | |
Other values (31) |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 4.3478925 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 생활 |
---|---|
2nd row | 생활 |
3rd row | 생활용품 |
4th row | 생활용품 |
5th row | 조립가능3D |
Common Values
Value | Count | Frequency (%) |
해양생물(패류) | 360 | 11.0% |
해양생물(국내패류) | 352 | 10.8% |
곤충류 | 315 | 9.6% |
조류 | 225 | 6.9% |
인쇄 | 211 | 6.4% |
어류 | 198 | 6.0% |
생활 | 194 | 5.9% |
거미류 | 193 | 5.9% |
건축 | 156 | 4.8% |
암석/화석 | 106 | 3.2% |
Other values (26) | 964 |
Length
Value | Count | Frequency (%) |
해양생물(패류 | 360 | 11.0% |
해양생물(국내패류 | 352 | 10.8% |
곤충류 | 315 | 9.6% |
조류 | 225 | 6.9% |
인쇄 | 211 | 6.4% |
어류 | 198 | 6.0% |
생활 | 196 | 6.0% |
거미류 | 193 | 5.9% |
건축 | 156 | 4.8% |
암석/화석 | 106 | 3.2% |
Other values (24) | 962 |
중분류
Text
Distinct | 323 |
---|---|
Distinct (%) | 9.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.7 KiB |
Value | Count | Frequency (%) |
흡강목 | 309 | 9.4% |
거미목 | 193 | 5.9% |
딱정벌레목 | 182 | 5.6% |
연활자 | 151 | 4.6% |
백합목 | 108 | 3.3% |
무기 | 93 | 2.8% |
참새목 | 86 | 2.6% |
연장 | 85 | 2.6% |
전기_전자 | 85 | 2.6% |
나비목 | 77 | 2.4% |
Other values (311) | 1905 |
Most occurring characters
Value | Count | Frequency (%) |
목 | 2009 | 13.5% |
기 | 498 | 3.4% |
a | 440 | 3.0% |
강 | 310 | 2.1% |
흡 | 309 | 2.1% |
i | 288 | 1.9% |
o | 287 | 1.9% |
활 | 274 | 1.8% |
자 | 270 | 1.8% |
r | 269 | 1.8% |
Other values (331) | 9876 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 11108 | |
Lowercase Letter | 2935 | 19.8% |
Uppercase Letter | 280 | 1.9% |
Open Punctuation | 192 | 1.3% |
Close Punctuation | 192 | 1.3% |
Connector Punctuation | 97 | 0.7% |
Other Punctuation | 10 | 0.1% |
Space Separator | 8 | 0.1% |
Decimal Number | 8 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
목 | 2009 | 18.1% |
기 | 498 | 4.5% |
강 | 310 | 2.8% |
흡 | 309 | 2.8% |
활 | 274 | 2.5% |
자 | 270 | 2.4% |
연 | 244 | 2.2% |
미 | 237 | 2.1% |
전 | 233 | 2.1% |
레 | 226 | 2.0% |
Other values (274) | 6498 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 440 | |
i | 288 | |
o | 287 | |
r | 269 | |
s | 216 | 7.4% |
u | 189 | 6.4% |
t | 185 | 6.3% |
d | 158 | 5.4% |
e | 156 | 5.3% |
n | 124 | 4.2% |
Other values (16) | 623 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 57 | |
S | 41 | |
A | 37 | |
C | 20 | 7.1% |
T | 19 | 6.8% |
O | 18 | 6.4% |
M | 15 | 5.4% |
R | 13 | 4.6% |
L | 13 | 4.6% |
E | 8 | 2.9% |
Other values (11) | 39 |
Decimal Number
Value | Count | Frequency (%) |
4 | 2 | |
7 | 2 | |
8 | 2 | |
1 | 2 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 8 | |
, | 2 | 20.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 192 |
Close Punctuation
Value | Count | Frequency (%) |
) | 192 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 97 |
Space Separator
Value | Count | Frequency (%) |
8 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 11108 | |
Latin | 3215 | 21.7% |
Common | 507 | 3.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
목 | 2009 | 18.1% |
기 | 498 | 4.5% |
강 | 310 | 2.8% |
흡 | 309 | 2.8% |
활 | 274 | 2.5% |
자 | 270 | 2.4% |
연 | 244 | 2.2% |
미 | 237 | 2.1% |
전 | 233 | 2.1% |
레 | 226 | 2.0% |
Other values (274) | 6498 |
Latin
Value | Count | Frequency (%) |
a | 440 | |
i | 288 | 9.0% |
o | 287 | 8.9% |
r | 269 | 8.4% |
s | 216 | 6.7% |
u | 189 | 5.9% |
t | 185 | 5.8% |
d | 158 | 4.9% |
e | 156 | 4.9% |
n | 124 | 3.9% |
Other values (37) | 903 |
Common
Value | Count | Frequency (%) |
( | 192 | |
) | 192 | |
_ | 97 | |
8 | 1.6% | |
/ | 8 | 1.6% |
4 | 2 | 0.4% |
7 | 2 | 0.4% |
8 | 2 | 0.4% |
1 | 2 | 0.4% |
, | 2 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 11108 | |
ASCII | 3722 | 25.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
목 | 2009 | 18.1% |
기 | 498 | 4.5% |
강 | 310 | 2.8% |
흡 | 309 | 2.8% |
활 | 274 | 2.5% |
자 | 270 | 2.4% |
연 | 244 | 2.2% |
미 | 237 | 2.1% |
전 | 233 | 2.1% |
레 | 226 | 2.0% |
Other values (274) | 6498 |
ASCII
Value | Count | Frequency (%) |
a | 440 | 11.8% |
i | 288 | 7.7% |
o | 287 | 7.7% |
r | 269 | 7.2% |
s | 216 | 5.8% |
( | 192 | 5.2% |
) | 192 | 5.2% |
u | 189 | 5.1% |
t | 185 | 5.0% |
d | 158 | 4.2% |
Other values (47) | 1306 |
소분류
Text
Distinct | 741 |
---|---|
Distinct (%) | 22.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.7 KiB |
Value | Count | Frequency (%) |
한글바탕체 | 117 | 3.6% |
전기 | 75 | 2.3% |
물레고둥과 | 59 | 1.8% |
휴대전화 | 53 | 1.6% |
맞춤 | 46 | 1.4% |
화살제작 | 43 | 1.3% |
하늘소과 | 42 | 1.3% |
발우제작 | 42 | 1.3% |
서계문집 | 42 | 1.3% |
수레제작 | 41 | 1.3% |
Other values (730) | 2714 |
Most occurring characters
Value | Count | Frequency (%) |
과 | 1850 | 11.2% |
a | 461 | 2.8% |
e | 404 | 2.5% |
고 | 367 | 2.2% |
둥 | 335 | 2.0% |
i | 310 | 1.9% |
기 | 286 | 1.7% |
제 | 264 | 1.6% |
작 | 261 | 1.6% |
미 | 253 | 1.5% |
Other values (478) | 11661 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 13334 | |
Lowercase Letter | 2596 | 15.8% |
Uppercase Letter | 247 | 1.5% |
Other Punctuation | 96 | 0.6% |
Open Punctuation | 89 | 0.5% |
Close Punctuation | 89 | 0.5% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
과 | 1850 | 13.9% |
고 | 367 | 2.8% |
둥 | 335 | 2.5% |
기 | 286 | 2.1% |
제 | 264 | 2.0% |
작 | 261 | 2.0% |
미 | 253 | 1.9% |
리 | 233 | 1.7% |
이 | 231 | 1.7% |
레 | 223 | 1.7% |
Other values (430) | 9031 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 461 | |
e | 404 | |
i | 310 | |
d | 224 | |
r | 150 | 5.8% |
c | 133 | 5.1% |
t | 133 | 5.1% |
o | 132 | 5.1% |
l | 128 | 4.9% |
s | 97 | 3.7% |
Other values (12) | 424 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 40 | |
C | 34 | |
P | 29 | |
S | 17 | 6.9% |
M | 16 | 6.5% |
H | 15 | 6.1% |
T | 13 | 5.3% |
D | 13 | 5.3% |
L | 12 | 4.9% |
N | 11 | 4.5% |
Other values (11) | 47 |
Other Punctuation
Value | Count | Frequency (%) |
, | 88 | |
/ | 8 | 8.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 89 |
Close Punctuation
Value | Count | Frequency (%) |
) | 89 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 13334 | |
Latin | 2843 | 17.3% |
Common | 275 | 1.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
과 | 1850 | 13.9% |
고 | 367 | 2.8% |
둥 | 335 | 2.5% |
기 | 286 | 2.1% |
제 | 264 | 2.0% |
작 | 261 | 2.0% |
미 | 253 | 1.9% |
리 | 233 | 1.7% |
이 | 231 | 1.7% |
레 | 223 | 1.7% |
Other values (430) | 9031 |
Latin
Value | Count | Frequency (%) |
a | 461 | |
e | 404 | |
i | 310 | |
d | 224 | 7.9% |
r | 150 | 5.3% |
c | 133 | 4.7% |
t | 133 | 4.7% |
o | 132 | 4.6% |
l | 128 | 4.5% |
s | 97 | 3.4% |
Other values (33) | 671 |
Common
Value | Count | Frequency (%) |
( | 89 | |
) | 89 | |
, | 88 | |
/ | 8 | 2.9% |
1 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 13334 | |
ASCII | 3118 | 19.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
과 | 1850 | 13.9% |
고 | 367 | 2.8% |
둥 | 335 | 2.5% |
기 | 286 | 2.1% |
제 | 264 | 2.0% |
작 | 261 | 2.0% |
미 | 253 | 1.9% |
리 | 233 | 1.7% |
이 | 231 | 1.7% |
레 | 223 | 1.7% |
Other values (430) | 9031 |
ASCII
Value | Count | Frequency (%) |
a | 461 | |
e | 404 | |
i | 310 | 9.9% |
d | 224 | 7.2% |
r | 150 | 4.8% |
c | 133 | 4.3% |
t | 133 | 4.3% |
o | 132 | 4.2% |
l | 128 | 4.1% |
s | 97 | 3.1% |
Other values (38) | 946 |
한글명
Text
Distinct | 3254 |
---|---|
Distinct (%) | 99.4% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 25.7 KiB |
Length
Max length | 44 |
---|---|
Median length | 35 |
Mean length | 9.0106936 |
Min length | 1 |
Characters and Unicode
Total characters | 29492 |
---|---|
Distinct characters | 1733 |
Distinct categories | 12 ? |
Distinct scripts | 5 ? |
Distinct blocks | 11 ? |
Unique
Unique | 3235 ? |
---|---|
Unique (%) | 98.8% |
Sample
1st row | '五子三元'명자물통 |
---|---|
2nd row | 15세기동경 |
3rd row | 1950년대선풍기 |
4th row | 1960년대아이스크림제조기 |
5th row | 2D퍼즐동물세트강아지 |
Value | Count | Frequency (%) |
숫자 | 3 | 0.1% |
숫자연활자10개모음(원 | 3 | 0.1% |
혹줄돼지고둥 | 2 | 0.1% |
둥근전복 | 2 | 0.1% |
달팽이 | 2 | 0.1% |
돼지고둥 | 2 | 0.1% |
검은줄좁쌀무늬고둥 | 2 | 0.1% |
파라사우롤로푸스 | 2 | 0.1% |
dubia | 2 | 0.1% |
우럭 | 2 | 0.1% |
Other values (3249) | 3261 |
Most occurring characters
Value | Count | Frequency (%) |
, | 1293 | 4.4% |
( | 731 | 2.5% |
) | 731 | 2.5% |
이 | 450 | 1.5% |
개 | 435 | 1.5% |
리 | 432 | 1.5% |
고 | 413 | 1.4% |
- | 375 | 1.3% |
0 | 374 | 1.3% |
둥 | 364 | 1.2% |
Other values (1723) | 23894 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 22601 | |
Lowercase Letter | 1587 | 5.4% |
Other Punctuation | 1342 | 4.6% |
Decimal Number | 1203 | 4.1% |
Uppercase Letter | 863 | 2.9% |
Open Punctuation | 738 | 2.5% |
Close Punctuation | 737 | 2.5% |
Dash Punctuation | 375 | 1.3% |
Letter Number | 13 | < 0.1% |
Space Separator | 12 | < 0.1% |
Other values (2) | 21 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 450 | 2.0% |
개 | 435 | 1.9% |
리 | 432 | 1.9% |
고 | 413 | 1.8% |
둥 | 364 | 1.6% |
대 | 310 | 1.4% |
자 | 307 | 1.4% |
미 | 289 | 1.3% |
기 | 277 | 1.2% |
제 | 256 | 1.1% |
Other values (1630) | 19068 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 113 | |
M | 86 | 10.0% |
C | 69 | 8.0% |
H | 68 | 7.9% |
A | 65 | 7.5% |
P | 52 | 6.0% |
R | 52 | 6.0% |
T | 47 | 5.4% |
D | 40 | 4.6% |
G | 31 | 3.6% |
Other values (16) | 240 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 182 | |
i | 143 | 9.0% |
o | 143 | 9.0% |
n | 126 | 7.9% |
e | 125 | 7.9% |
t | 111 | 7.0% |
u | 102 | 6.4% |
r | 101 | 6.4% |
s | 97 | 6.1% |
l | 95 | 6.0% |
Other values (15) | 362 |
Decimal Number
Value | Count | Frequency (%) |
0 | 374 | |
1 | 281 | |
2 | 130 | 10.8% |
3 | 105 | 8.7% |
5 | 77 | 6.4% |
4 | 59 | 4.9% |
8 | 56 | 4.7% |
6 | 47 | 3.9% |
9 | 43 | 3.6% |
7 | 31 | 2.6% |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 4 | |
Ⅱ | 2 | |
Ⅰ | 1 | 7.7% |
Ⅸ | 1 | 7.7% |
Ⅷ | 1 | 7.7% |
Ⅶ | 1 | 7.7% |
Ⅵ | 1 | 7.7% |
Ⅴ | 1 | 7.7% |
Ⅳ | 1 | 7.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1293 | |
. | 30 | 2.2% |
& | 8 | 0.6% |
' | 5 | 0.4% |
: | 2 | 0.1% |
! | 2 | 0.1% |
? | 1 | 0.1% |
/ | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
~ | 6 | |
= | 2 | 18.2% |
⊙ | 1 | 9.1% |
→ | 1 | 9.1% |
+ | 1 | 9.1% |
Other Symbol
Value | Count | Frequency (%) |
─ | 6 | |
△ | 2 | 20.0% |
▣ | 1 | 10.0% |
○ | 1 | 10.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 731 | |
[ | 7 | 0.9% |
Close Punctuation
Value | Count | Frequency (%) |
) | 731 | |
] | 6 | 0.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 375 |
Space Separator
Value | Count | Frequency (%) |
12 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 21495 | |
Common | 4428 | 15.0% |
Latin | 2462 | 8.3% |
Han | 1106 | 3.8% |
Greek | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 450 | 2.1% |
개 | 435 | 2.0% |
리 | 432 | 2.0% |
고 | 413 | 1.9% |
둥 | 364 | 1.7% |
대 | 310 | 1.4% |
자 | 307 | 1.4% |
미 | 289 | 1.3% |
기 | 277 | 1.3% |
제 | 256 | 1.2% |
Other values (1485) | 17962 |
Han
Value | Count | Frequency (%) |
字 | 151 | |
版 | 147 | |
活 | 147 | |
新 | 141 | |
式 | 139 | |
鉛 | 139 | |
箭 | 11 | 1.0% |
筒 | 9 | 0.8% |
銃 | 8 | 0.7% |
藥 | 8 | 0.7% |
Other values (135) | 206 |
Latin
Value | Count | Frequency (%) |
a | 182 | 7.4% |
i | 143 | 5.8% |
o | 143 | 5.8% |
n | 126 | 5.1% |
e | 125 | 5.1% |
S | 113 | 4.6% |
t | 111 | 4.5% |
u | 102 | 4.1% |
r | 101 | 4.1% |
s | 97 | 3.9% |
Other values (49) | 1219 |
Common
Value | Count | Frequency (%) |
, | 1293 | |
( | 731 | |
) | 731 | |
- | 375 | 8.5% |
0 | 374 | 8.4% |
1 | 281 | 6.3% |
2 | 130 | 2.9% |
3 | 105 | 2.4% |
5 | 77 | 1.7% |
4 | 59 | 1.3% |
Other values (23) | 272 | 6.1% |
Greek
Value | Count | Frequency (%) |
α | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 21471 | |
ASCII | 6865 | 23.3% |
CJK | 1103 | 3.7% |
Compat Jamo | 24 | 0.1% |
Number Forms | 13 | < 0.1% |
Box Drawing | 6 | < 0.1% |
Geometric Shapes | 4 | < 0.1% |
CJK Compat Ideographs | 3 | < 0.1% |
Math Operators | 1 | < 0.1% |
Arrows | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 1293 | |
( | 731 | 10.6% |
) | 731 | 10.6% |
- | 375 | 5.5% |
0 | 374 | 5.4% |
1 | 281 | 4.1% |
a | 182 | 2.7% |
i | 143 | 2.1% |
o | 143 | 2.1% |
2 | 130 | 1.9% |
Other values (67) | 2482 |
Hangul
Value | Count | Frequency (%) |
이 | 450 | 2.1% |
개 | 435 | 2.0% |
리 | 432 | 2.0% |
고 | 413 | 1.9% |
둥 | 364 | 1.7% |
대 | 310 | 1.4% |
자 | 307 | 1.4% |
미 | 289 | 1.3% |
기 | 277 | 1.3% |
제 | 256 | 1.2% |
Other values (1472) | 17938 |
CJK
Value | Count | Frequency (%) |
字 | 151 | |
版 | 147 | |
活 | 147 | |
新 | 141 | |
式 | 139 | |
鉛 | 139 | |
箭 | 11 | 1.0% |
筒 | 9 | 0.8% |
銃 | 8 | 0.7% |
藥 | 8 | 0.7% |
Other values (132) | 203 |
Box Drawing
Value | Count | Frequency (%) |
─ | 6 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 5 | |
ㄴ | 3 | |
ㄷ | 3 | |
ㅍ | 2 | 8.3% |
ㅅ | 2 | 8.3% |
ㄹ | 2 | 8.3% |
ㅁ | 1 | 4.2% |
ㅂ | 1 | 4.2% |
ㅇ | 1 | 4.2% |
ㅈ | 1 | 4.2% |
Other values (3) | 3 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 4 | |
Ⅱ | 2 | |
Ⅰ | 1 | 7.7% |
Ⅸ | 1 | 7.7% |
Ⅷ | 1 | 7.7% |
Ⅶ | 1 | 7.7% |
Ⅵ | 1 | 7.7% |
Ⅴ | 1 | 7.7% |
Ⅳ | 1 | 7.7% |
Geometric Shapes
Value | Count | Frequency (%) |
△ | 2 | |
▣ | 1 | |
○ | 1 |
Math Operators
Value | Count | Frequency (%) |
⊙ | 1 |
Arrows
Value | Count | Frequency (%) |
→ | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
量 | 1 | |
令 | 1 | |
兩 | 1 |
None
Value | Count | Frequency (%) |
α | 1 |
영문명
Text
Distinct | 2665 |
---|---|
Distinct (%) | 81.7% |
Missing | 12 |
Missing (%) | 0.4% |
Memory size | 25.7 KiB |
Length
Max length | 63 |
---|---|
Median length | 48 |
Mean length | 21.690067 |
Min length | 2 |
Characters and Unicode
Total characters | 70753 |
---|---|
Distinct characters | 76 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 2496 ? |
---|---|
Unique (%) | 76.5% |
Sample
1st row | Lock |
---|---|
2nd row | Bronze Mirror |
3rd row | 1950's Electric Fan |
4th row | 1960's Ice Cream Maker |
5th row | Cart |
Value | Count | Frequency (%) |
pieces | 150 | 1.7% |
types | 149 | 1.7% |
korean | 140 | 1.6% |
modern | 139 | 1.6% |
lead | 139 | 1.6% |
10 | 138 | 1.6% |
samsung | 55 | 0.6% |
phone | 53 | 0.6% |
cellular | 52 | 0.6% |
a | 51 | 0.6% |
Other values (4367) | 7765 |
Most occurring characters
Value | Count | Frequency (%) |
a | 6542 | 9.2% |
6103 | 8.6% | |
e | 5369 | 7.6% |
i | 4596 | 6.5% |
o | 4276 | 6.0% |
s | 4157 | 5.9% |
r | 3789 | 5.4% |
n | 3622 | 5.1% |
l | 3143 | 4.4% |
t | 3056 | 4.3% |
Other values (66) | 26100 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 54479 | |
Space Separator | 6103 | 8.6% |
Uppercase Letter | 5716 | 8.1% |
Decimal Number | 2629 | 3.7% |
Other Punctuation | 557 | 0.8% |
Close Punctuation | 519 | 0.7% |
Open Punctuation | 519 | 0.7% |
Dash Punctuation | 220 | 0.3% |
Math Symbol | 6 | < 0.1% |
Letter Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 6542 | |
e | 5369 | |
i | 4596 | 8.4% |
o | 4276 | 7.8% |
s | 4157 | 7.6% |
r | 3789 | 7.0% |
n | 3622 | 6.6% |
l | 3143 | 5.8% |
t | 3056 | 5.6% |
u | 2921 | 5.4% |
Other values (17) | 13008 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 670 | |
C | 610 | 10.7% |
P | 599 | 10.5% |
M | 457 | 8.0% |
A | 421 | 7.4% |
H | 284 | 5.0% |
L | 272 | 4.8% |
B | 269 | 4.7% |
K | 266 | 4.7% |
G | 256 | 4.5% |
Other values (16) | 1612 |
Decimal Number
Value | Count | Frequency (%) |
1 | 682 | |
0 | 456 | |
8 | 387 | |
9 | 242 | 9.2% |
7 | 178 | 6.8% |
6 | 169 | 6.4% |
5 | 163 | 6.2% |
2 | 131 | 5.0% |
3 | 128 | 4.9% |
4 | 93 | 3.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 412 | |
. | 108 | 19.4% |
& | 33 | 5.9% |
' | 3 | 0.5% |
/ | 1 | 0.2% |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 3 | |
Ⅱ | 1 | 25.0% |
Space Separator
Value | Count | Frequency (%) |
6103 |
Close Punctuation
Value | Count | Frequency (%) |
) | 519 |
Open Punctuation
Value | Count | Frequency (%) |
( | 519 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 220 |
Math Symbol
Value | Count | Frequency (%) |
~ | 6 |
Other Letter
Value | Count | Frequency (%) |
卍 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 60198 | |
Common | 10553 | 14.9% |
Han | 1 | < 0.1% |
Greek | 1 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 6542 | 10.9% |
e | 5369 | 8.9% |
i | 4596 | 7.6% |
o | 4276 | 7.1% |
s | 4157 | 6.9% |
r | 3789 | 6.3% |
n | 3622 | 6.0% |
l | 3143 | 5.2% |
t | 3056 | 5.1% |
u | 2921 | 4.9% |
Other values (44) | 18727 |
Common
Value | Count | Frequency (%) |
6103 | ||
1 | 682 | 6.5% |
) | 519 | 4.9% |
( | 519 | 4.9% |
0 | 456 | 4.3% |
, | 412 | 3.9% |
8 | 387 | 3.7% |
9 | 242 | 2.3% |
- | 220 | 2.1% |
7 | 178 | 1.7% |
Other values (10) | 835 | 7.9% |
Han
Value | Count | Frequency (%) |
卍 | 1 |
Greek
Value | Count | Frequency (%) |
α | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 70747 | |
Number Forms | 4 | < 0.1% |
CJK | 1 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 6542 | 9.2% |
6103 | 8.6% | |
e | 5369 | 7.6% |
i | 4596 | 6.5% |
o | 4276 | 6.0% |
s | 4157 | 5.9% |
r | 3789 | 5.4% |
n | 3622 | 5.1% |
l | 3143 | 4.4% |
t | 3056 | 4.3% |
Other values (62) | 26094 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 3 | |
Ⅱ | 1 | 25.0% |
CJK
Value | Count | Frequency (%) |
卍 | 1 |
None
Value | Count | Frequency (%) |
α | 1 |
등록일
Date
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.7 KiB |
Minimum | 2015-03-04 00:00:00 |
---|---|
Maximum | 2016-09-07 00:00:00 |
대분류 | 등록일 | |
---|---|---|
대분류 | 1.000 | 0.998 |
등록일 | 0.998 | 1.000 |
메타 아이디 | 대분류 | 중분류 | 소분류 | 한글명 | 영문명 | 등록일 | |
---|---|---|---|---|---|---|---|
0 | meta_000002961 | 생활 | 주생활 | 자물쇠 | '五子三元'명자물통 | Lock | 2016-09-07 |
1 | meta_000003212 | 생활 | 의례 | 동경 | 15세기동경 | Bronze Mirror | 2016-09-07 |
2 | meta_000002854 | 생활용품 | 가전 | 가전 | 1950년대선풍기 | 1950's Electric Fan | 2016-09-07 |
3 | meta_000003146 | 생활용품 | 기계 | 기계 | 1960년대아이스크림제조기 | 1960's Ice Cream Maker | 2016-09-07 |
4 | meta_000003905 | 조립가능3D | 기타 | 기타 | 2D퍼즐동물세트강아지 | <NA> | 2016-09-07 |
5 | meta_000003906 | 조립가능3D | 기타 | 기타 | 2D퍼즐동물세트토끼 | <NA> | 2016-09-07 |
6 | meta_000003903 | 조립가능3D | 기타 | 기타 | 2D퍼즐무리쉬아이돌 | <NA> | 2016-09-07 |
7 | meta_000003904 | 조립가능3D | 기타 | 기타 | 2D퍼즐열대어 | <NA> | 2016-09-07 |
8 | meta_000002872 | 생활 | 운송 | 수레제작 | 2바퀴수레 | Cart | 2016-09-07 |
9 | meta_000003221 | 기계 | 시계 | 탁상시계 | 5분모래시계 | 5minute Sandglass | 2016-09-07 |
메타 아이디 | 대분류 | 중분류 | 소분류 | 한글명 | 영문명 | 등록일 | |
---|---|---|---|---|---|---|---|
3264 | meta_000000647 | 곤충류 | 나비목 | 밤나방과 | 흰줄뒷날개나방 | Catocala lara | 2015-03-04 |
3265 | meta_000000340 | 해양생물(갑각류) | 완흉목 | 따개비과 | 흰줄따개비 | Balanus albicostatus Pilsbry | 2015-03-04 |
3266 | meta_000000844 | 거미류 | 거미목 | 깡충거미과 | 흰줄무늬깡충거미(암) | Sitticus albolineatus | 2015-03-04 |
3267 | meta_000000235 | 조류 | 기러기목 | 오리과 | 흰줄박이오리(수) | Histrionicus histrionicus | 2015-03-04 |
3268 | meta_000000234 | 조류 | 기러기목 | 오리과 | 흰줄박이오리(암) | Histrionicus histrionicus | 2015-03-04 |
3269 | meta_000000440 | 곤충류 | 나비목 | 네발나비과 | 흰줄표범나비 | Argyronome laodice (Pallas) | 2015-03-04 |
3270 | meta_000003525 | 해양생물(국내패류) | 고복족목 | 소라과 | 흰팥알고둥 | Collonista amakusaensis | 2016-09-07 |
3271 | meta_000000966 | 해양생물(패류) | 신복족목 | 대추고둥과 | 흰혹밤색줄고둥 | Amalda rubiginosa albocallosa (Lischke, 1873) | 2015-03-04 |
3272 | meta_000003258 | 공룡 | 힙실로포돈(Hypsilophodon) | 조각류,진조각류,힙실로포돈과 | 힙실로포돈 | Hypsilophodon | 2016-09-07 |
3273 | meta_000000119 | 조류 | 참새목 | 참새과 | 힝둥새 | Anthus hodgsoni | 2015-03-04 |