Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 9978 |
Missing cells | 410 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 545.8 KiB |
Average record size in memory | 56.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 3 |
Boolean | 1 |
Dataset
Description | 국립중앙과학관 홈페이지에 있는 과학학습콘텐츠의 전시보유품 상세항목입니다. |
---|---|
Author | 과학기술정보통신부 국립중앙과학관 |
URL | https://www.data.go.kr/data/15067823/fileData.do |
공개여부 has constant value "" | Constant |
이름 is highly overall correlated with 전시타입 | High correlation |
전시타입 is highly overall correlated with 이름 and 1 other fields | High correlation |
등록자 아이디 is highly overall correlated with 전시타입 | High correlation |
전시타입 is highly imbalanced (99.7%) | Imbalance |
등록자 아이디 is highly imbalanced (67.5%) | Imbalance |
공개여부 has 410 (4.1%) missing values | Missing |
상세 아이디 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 00:54:54.303321 |
---|---|
Analysis finished | 2023-12-12 00:54:55.297800 |
Duration | 0.99 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
상세 아이디
Text
UNIQUE
 
Distinct | 9978 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
Value | Count | Frequency (%) |
14 | 1 | < 0.1% |
7,041 | 1 | < 0.1% |
7,029 | 1 | < 0.1% |
7,030 | 1 | < 0.1% |
7,080 | 1 | < 0.1% |
7,037 | 1 | < 0.1% |
7,038 | 1 | < 0.1% |
7,039 | 1 | < 0.1% |
7,040 | 1 | < 0.1% |
7,042 | 1 | < 0.1% |
Other values (9968) | 9968 |
Most occurring characters
Value | Count | Frequency (%) |
, | 9150 | |
1 | 7518 | |
4 | 4971 | |
5 | 4839 | |
3 | 4588 | |
6 | 3989 | |
2 | 3948 | |
8 | 3336 | 6.4% |
7 | 3270 | 6.3% |
9 | 3104 | 6.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 42607 | |
Other Punctuation | 9150 | 17.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 7518 | |
4 | 4971 | |
5 | 4839 | |
3 | 4588 | |
6 | 3989 | |
2 | 3948 | |
8 | 3336 | |
7 | 3270 | |
9 | 3104 | |
0 | 3044 |
Other Punctuation
Value | Count | Frequency (%) |
, | 9150 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 51757 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
, | 9150 | |
1 | 7518 | |
4 | 4971 | |
5 | 4839 | |
3 | 4588 | |
6 | 3989 | |
2 | 3948 | |
8 | 3336 | 6.4% |
7 | 3270 | 6.3% |
9 | 3104 | 6.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 51757 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 9150 | |
1 | 7518 | |
4 | 4971 | |
5 | 4839 | |
3 | 4588 | |
6 | 3989 | |
2 | 3948 | |
8 | 3336 | 6.4% |
7 | 3270 | 6.3% |
9 | 3104 | 6.0% |
전시품 아이디
Text
Distinct | 2564 |
---|---|
Distinct (%) | 25.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
Value | Count | Frequency (%) |
1,583 | 10 | 0.1% |
1,582 | 10 | 0.1% |
1,588 | 9 | 0.1% |
1,587 | 9 | 0.1% |
428 | 8 | 0.1% |
429 | 8 | 0.1% |
1,497 | 7 | 0.1% |
588 | 7 | 0.1% |
1,507 | 7 | 0.1% |
1,493 | 7 | 0.1% |
Other values (2554) | 9896 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 6848 | |
, | 6059 | |
2 | 4101 | |
4 | 3736 | |
3 | 3549 | |
5 | 3435 | |
8 | 3046 | |
9 | 2841 | |
7 | 2815 | |
0 | 2744 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 35758 | |
Other Punctuation | 6059 | 14.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 6848 | |
2 | 4101 | |
4 | 3736 | |
3 | 3549 | |
5 | 3435 | |
8 | 3046 | |
9 | 2841 | |
7 | 2815 | |
0 | 2744 | |
6 | 2643 | 7.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 6059 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 41817 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 6848 | |
, | 6059 | |
2 | 4101 | |
4 | 3736 | |
3 | 3549 | |
5 | 3435 | |
8 | 3046 | |
9 | 2841 | |
7 | 2815 | |
0 | 2744 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 41817 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 6848 | |
, | 6059 | |
2 | 4101 | |
4 | 3736 | |
3 | 3549 | |
5 | 3435 | |
8 | 3046 | |
9 | 2841 | |
7 | 2815 | |
0 | 2744 |
전시타입
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
E | |
---|---|
L | 2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | E |
---|---|
2nd row | E |
3rd row | E |
4th row | E |
5th row | E |
Common Values
Value | Count | Frequency (%) |
E | 9976 | |
L | 2 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
e | 9976 | |
l | 2 | < 0.1% |
이름
Categorical
HIGH CORRELATION
 
Distinct | 42 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
크기 | |
---|---|
국적 | |
제조연대 | |
학명 | |
제조사 | |
Other values (37) |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.6309882 |
Min length | 1 |
Unique
Unique | 20 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 국적 |
---|---|
2nd row | 재질 |
3rd row | 국적 |
4th row | 재질 |
5th row | 국적 |
Common Values
Value | Count | Frequency (%) |
크기 | 1338 | |
국적 | 1177 | |
제조연대 | 1127 | |
학명 | 932 | |
제조사 | 927 | |
재질 | 855 | |
용도/기능 | 733 | |
과명 | 681 | |
목명 | 653 | |
영명 | 579 | |
Other values (32) | 976 |
Length
Value | Count | Frequency (%) |
크기 | 1338 | |
국적 | 1177 | |
제조연대 | 1127 | |
학명 | 932 | |
제조사 | 927 | |
재질 | 855 | |
용도/기능 | 733 | |
과명 | 681 | |
목명 | 653 | |
영명 | 579 | |
Other values (31) | 976 |
항목값
Text
Distinct | 3799 |
---|---|
Distinct (%) | 38.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
Value | Count | Frequency (%) |
한국 | 976 | 6.2% |
2011년 | 206 | 1.3% |
141230 | 203 | 1.3% |
아미랜드 | 171 | 1.1% |
㈜전시과학 | 148 | 0.9% |
등 | 146 | 0.9% |
콩과 | 141 | 0.9% |
콩목 | 141 | 0.9% |
2009년 | 138 | 0.9% |
0.2kg | 121 | 0.8% |
Other values (6097) | 13467 |
Most occurring characters
Value | Count | Frequency (%) |
6226 | 7.4% | |
0 | 4099 | 4.9% |
1 | 2758 | 3.3% |
2 | 2737 | 3.3% |
a | 2587 | 3.1% |
e | 2127 | 2.5% |
i | 1918 | 2.3% |
s | 1747 | 2.1% |
n | 1530 | 1.8% |
o | 1510 | 1.8% |
Other values (772) | 56944 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 31548 | |
Lowercase Letter | 23023 | |
Decimal Number | 14785 | |
Space Separator | 6226 | 7.4% |
Uppercase Letter | 2313 | 2.7% |
Other Symbol | 2105 | 2.5% |
Other Punctuation | 2049 | 2.4% |
Math Symbol | 1457 | 1.7% |
Close Punctuation | 299 | 0.4% |
Open Punctuation | 298 | 0.4% |
Other values (2) | 80 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
년 | 1299 | 4.1% |
한 | 1093 | 3.5% |
국 | 1072 | 3.4% |
과 | 884 | 2.8% |
목 | 676 | 2.1% |
시 | 625 | 2.0% |
아 | 606 | 1.9% |
이 | 519 | 1.6% |
전 | 513 | 1.6% |
리 | 488 | 1.5% |
Other values (686) | 23773 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 201 | 8.7% |
C | 199 | 8.6% |
G | 197 | 8.5% |
T | 185 | 8.0% |
S | 181 | 7.8% |
A | 178 | 7.7% |
P | 166 | 7.2% |
D | 130 | 5.6% |
B | 114 | 4.9% |
L | 102 | 4.4% |
Other values (17) | 660 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2587 | 11.2% |
e | 2127 | 9.2% |
i | 1918 | 8.3% |
s | 1747 | 7.6% |
n | 1530 | 6.6% |
o | 1510 | 6.6% |
r | 1466 | 6.4% |
l | 1220 | 5.3% |
m | 1188 | 5.2% |
t | 1164 | 5.1% |
Other values (16) | 6566 |
Decimal Number
Value | Count | Frequency (%) |
0 | 4099 | |
1 | 2758 | |
2 | 2737 | |
3 | 1073 | 7.3% |
4 | 983 | 6.6% |
5 | 873 | 5.9% |
8 | 650 | 4.4% |
9 | 634 | 4.3% |
6 | 495 | 3.3% |
7 | 483 | 3.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1055 | |
. | 659 | |
/ | 160 | 7.8% |
* | 127 | 6.2% |
: | 21 | 1.0% |
· | 14 | 0.7% |
& | 7 | 0.3% |
' | 6 | 0.3% |
Math Symbol
Value | Count | Frequency (%) |
× | 1072 | |
~ | 327 | 22.4% |
+ | 28 | 1.9% |
> | 22 | 1.5% |
= | 5 | 0.3% |
< | 3 | 0.2% |
Other Symbol
Value | Count | Frequency (%) |
㎝ | 1068 | |
㎜ | 761 | |
㈜ | 275 | 13.1% |
㎡ | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
6226 |
Close Punctuation
Value | Count | Frequency (%) |
) | 299 |
Open Punctuation
Value | Count | Frequency (%) |
( | 298 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 77 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 31823 | |
Common | 27024 | |
Latin | 25334 | |
Greek | 2 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
년 | 1299 | 4.1% |
한 | 1093 | 3.4% |
국 | 1072 | 3.4% |
과 | 884 | 2.8% |
목 | 676 | 2.1% |
시 | 625 | 2.0% |
아 | 606 | 1.9% |
이 | 519 | 1.6% |
전 | 513 | 1.6% |
리 | 488 | 1.5% |
Other values (687) | 24048 |
Latin
Value | Count | Frequency (%) |
a | 2587 | 10.2% |
e | 2127 | 8.4% |
i | 1918 | 7.6% |
s | 1747 | 6.9% |
n | 1530 | 6.0% |
o | 1510 | 6.0% |
r | 1466 | 5.8% |
l | 1220 | 4.8% |
m | 1188 | 4.7% |
t | 1164 | 4.6% |
Other values (42) | 8877 |
Common
Value | Count | Frequency (%) |
6226 | ||
0 | 4099 | |
1 | 2758 | |
2 | 2737 | |
3 | 1073 | 4.0% |
× | 1072 | 4.0% |
㎝ | 1068 | 4.0% |
, | 1055 | 3.9% |
4 | 983 | 3.6% |
5 | 873 | 3.2% |
Other values (22) | 5080 |
Greek
Value | Count | Frequency (%) |
Φ | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49442 | |
Hangul | 31542 | |
CJK Compat | 1830 | 2.2% |
None | 1363 | 1.6% |
Compat Jamo | 6 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6226 | 12.6% | |
0 | 4099 | 8.3% |
1 | 2758 | 5.6% |
2 | 2737 | 5.5% |
a | 2587 | 5.2% |
e | 2127 | 4.3% |
i | 1918 | 3.9% |
s | 1747 | 3.5% |
n | 1530 | 3.1% |
o | 1510 | 3.1% |
Other values (69) | 22203 |
Hangul
Value | Count | Frequency (%) |
년 | 1299 | 4.1% |
한 | 1093 | 3.5% |
국 | 1072 | 3.4% |
과 | 884 | 2.8% |
목 | 676 | 2.1% |
시 | 625 | 2.0% |
아 | 606 | 1.9% |
이 | 519 | 1.6% |
전 | 513 | 1.6% |
리 | 488 | 1.5% |
Other values (682) | 23767 |
None
Value | Count | Frequency (%) |
× | 1072 | |
㈜ | 275 | 20.2% |
· | 14 | 1.0% |
Φ | 2 | 0.1% |
CJK Compat
Value | Count | Frequency (%) |
㎝ | 1068 | |
㎜ | 761 | |
㎡ | 1 | 0.1% |
Compat Jamo
Value | Count | Frequency (%) |
ㅇ | 3 | |
ㄹ | 1 | 16.7% |
ㄻ | 1 | 16.7% |
ㅍ | 1 | 16.7% |
공개여부
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 410 |
Missing (%) | 4.1% |
Memory size | 19.6 KiB |
True | |
---|---|
(Missing) | 410 |
Value | Count | Frequency (%) |
True | 9568 | |
(Missing) | 410 | 4.1% |
등록자 아이디
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.1 KiB |
superadmin | |
---|---|
scicenter | |
jnse | 156 |
child | 75 |
gisegen | 49 |
Other values (8) | 71 |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 9.5026057 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | superadmin |
---|---|
2nd row | superadmin |
3rd row | superadmin |
4th row | superadmin |
5th row | superadmin |
Common Values
Value | Count | Frequency (%) |
superadmin | 6261 | |
scicenter | 3366 | |
jnse | 156 | 1.6% |
child | 75 | 0.8% |
gisegen | 49 | 0.5% |
cheongyang | 27 | 0.3% |
gogostar | 18 | 0.2% |
jejusi | 8 | 0.1% |
biocp | 7 | 0.1% |
cheonan | 5 | 0.1% |
Other values (3) | 6 | 0.1% |
Length
Value | Count | Frequency (%) |
superadmin | 6261 | |
scicenter | 3366 | |
jnse | 156 | 1.6% |
child | 75 | 0.8% |
gisegen | 49 | 0.5% |
cheongyang | 27 | 0.3% |
gogostar | 18 | 0.2% |
jejusi | 8 | 0.1% |
biocp | 7 | 0.1% |
cheonan | 5 | 0.1% |
Other values (3) | 6 | 0.1% |
전시타입 | 이름 | 등록자 아이디 | |
---|---|---|---|
전시타입 | 1.000 | 1.000 | 0.740 |
이름 | 1.000 | 1.000 | 0.869 |
등록자 아이디 | 0.740 | 0.869 | 1.000 |
등록자 아이디 | 이름 | 전시타입 | |
---|---|---|---|
등록자 아이디 | 1.000 | 0.481 | 0.706 |
이름 | 0.481 | 1.000 | 0.998 |
전시타입 | 0.706 | 0.998 | 1.000 |
전시타입 | 이름 | 등록자 아이디 | |
---|---|---|---|
전시타입 | 1.000 | 0.998 | 0.706 |
이름 | 0.998 | 1.000 | 0.481 |
등록자 아이디 | 0.706 | 0.481 | 1.000 |
상세 아이디 | 전시품 아이디 | 전시타입 | 이름 | 항목값 | 공개여부 | 등록자 아이디 | |
---|---|---|---|---|---|---|---|
0 | 14 | 5 | E | 국적 | 한국 | Y | superadmin |
1 | 15 | 5 | E | 재질 | 기타 | Y | superadmin |
2 | 24 | 8 | E | 국적 | 한국 | Y | superadmin |
3 | 25 | 8 | E | 재질 | 기타 | Y | superadmin |
4 | 28 | 10 | E | 국적 | 이집트 | Y | superadmin |
5 | 29 | 10 | E | 재질 | 목제 | Y | superadmin |
6 | 34 | 13 | E | 재질 | 목제 | Y | superadmin |
7 | 36 | 15 | E | 재질 | 목제 | Y | superadmin |
8 | 39 | 18 | E | 재질 | 목제 | Y | superadmin |
9 | 44 | 20 | E | 국적 | 스웨덴 | Y | superadmin |
상세 아이디 | 전시품 아이디 | 전시타입 | 이름 | 항목값 | 공개여부 | 등록자 아이디 | |
---|---|---|---|---|---|---|---|
9968 | 17,503 | 2,551 | E | 방언 | 감생이 | Y | scicenter |
9969 | 17,512 | 2,549 | E | 분포 | 우리 나라 전 연안, 일본, 대만, 동중국해 | Y | scicenter |
9970 | 17,525 | 2,546 | E | 학명 | Sebastes schlegeli Hilgendorf | Y | scicenter |
9971 | 17,526 | 2,546 | E | 영명 | Black rockfish | Y | scicenter |
9972 | 17,527 | 2,546 | E | 방언 | 우럭, 감펭이 | Y | scicenter |
9973 | 17,528 | 2,546 | E | 분포 | 국내 전 연해, 동중국해, 일본 | Y | scicenter |
9974 | 17,282 | 2,512 | E | 국적 | 한국 | Y | scicenter |
9975 | 17,283 | 2,512 | E | 크기 | 1100*4100*310 | Y | scicenter |
9976 | 17,284 | 2,512 | E | 용도/기능 | 어린이체험 | Y | scicenter |
9977 | 17,285 | 2,512 | E | 제조연대 | 2011 | Y | scicenter |