Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 106 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.5 KiB |
Average record size in memory | 34.2 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | Management list of detailed code information for the science learning content on the National Science Museum (국립중앙과학관) website. |
---|---|
Author | Ministry of Science and ICT, National Science Museum (과학기술정보통신부 국립중앙과학관) |
URL | https://www.data.go.kr/data/15067827/fileData.do |
Reproduction
Analysis started | 2023-12-12 22:05:05.371037 |
---|---|
Analysis finished | 2023-12-12 22:05:05.730121 |
Duration | 0.36 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
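The reported duration follows from the two timestamps in the table above. A minimal sketch using only the Python standard library; the variable names are illustrative and are not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 22:05:05.371037", FMT)
finished = datetime.strptime("2023-12-12 22:05:05.730121", FMT)

# Elapsed wall-clock time in seconds, rounded to two decimals as in the report.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```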
코드번호 (code number)
Categorical
Distinct | 14 |
---|---|
Distinct (%) | 13.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 980.0 B |
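The percentage columns in this report appear to be a count divided by the 106 observations, rounded to one decimal place. A small sketch under that assumption; `percent` is a hypothetical helper, not a ydata-profiling function:

```python
def percent(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


N_OBSERVATIONS = 106  # "Number of observations" in Dataset statistics

print(percent(14, N_OBSERVATIONS))  # distinct values of 코드번호
print(percent(8, N_OBSERVATIONS))   # rows holding the value I002
```

The same helper reproduces the other "Distinct (%)" and "Frequency (%)" figures when given the matching count.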
I001 | 24 |
---|---|
4 | 17 |
I003 | 10 |
I002 | 8 |
1 | 8 |
Other values (9) | 39 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.1226415 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | I002 |
---|---|
2nd row | I002 |
3rd row | I002 |
4th row | I002 |
5th row | I003 |
Common Values
Value | Count | Frequency (%) |
I001 | 24 | 22.6% |
4 | 17 | 16.0% |
I003 | 10 | 9.4% |
I002 | 8 | 7.5% |
1 | 8 | 7.5% |
M001 | 6 | 5.7% |
M002 | 6 | 5.7% |
M003 | 6 | 5.7% |
M004 | 6 | 5.7% |
C001 | 4 | 3.8% |
Other values (4) | 11 | 10.4% |
코드상세번호 (code detail number)
Categorical
HIGH CORRELATION
 
Distinct | 36 |
---|---|
Distinct (%) | 34.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 980.0 B |
1 | 12 |
---|---|
2 | 12 |
3 | 10 |
4 | 9 |
5 | 8 |
Other values (31) | 55 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.5660377 |
Min length | 1 |
Unique
Unique | 19 |
---|---|
Unique (%) | 17.9% |
Sample
1st row | 5 |
---|---|
2nd row | 6 |
3rd row | 7 |
4th row | 8 |
5th row | 6 |
Common Values
Value | Count | Frequency (%) |
1 | 12 | 11.3% |
2 | 12 | 11.3% |
3 | 10 | 9.4% |
4 | 9 | 8.5% |
5 | 8 | 7.5% |
6 | 8 | 7.5% |
7 | 4 | 3.8% |
8 | 4 | 3.8% |
9 | 3 | 2.8% |
10 | 3 | 2.8% |
Other values (26) | 33 | 31.1% |
코드명 (code name)
Text
Distinct | 104 |
---|---|
Distinct (%) | 98.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 980.0 B |
Value | Count | Frequency (%) |
서울특별시 | 4 | 3.6% |
해외과학관 | 2 | 1.8% |
로봇 | 2 | 1.8% |
우리나라 | 2 | 1.8% |
어린이 | 1 | 0.9% |
활자아이콘5 | 1 | 0.9% |
음악 | 1 | 0.9% |
농업/산림 | 1 | 0.9% |
천문/지질 | 1 | 0.9% |
우주 | 1 | 0.9% |
Other values (96) | 96 | 85.7% |
Most occurring characters
Value | Count | Frequency (%) |
이 | 43 | 7.2% |
아 | 42 | 7.0% |
콘 | 42 | 7.0% |
식 | 31 | 5.2% |
공 | 30 | 5.0% |
룡 | 29 | 4.9% |
육 | 26 | 4.4% |
1 | 16 | 2.7% |
자 | 15 | 2.5% |
로 | 13 | 2.2% |
Other values (112) | 309 | 51.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 504 | 84.6% |
Decimal Number | 58 | 9.7% |
Open Punctuation | 9 | 1.5% |
Close Punctuation | 9 | 1.5% |
Space Separator | 6 | 1.0% |
Other Punctuation | 6 | 1.0% |
Uppercase Letter | 4 | 0.7% |
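The category names above match Unicode general categories. A minimal sketch of how such a tally can be produced with Python's standard `unicodedata` module, using four strings that appear in this report's 코드명 listings; this illustrates the idea and is not necessarily how ydata-profiling computes it:

```python
import unicodedata
from collections import Counter

# Strings taken from the 코드명 values shown in this report.
names = ["활자아이콘5", "경상북도(대구)", "우리나라 텃새", "농업/산림"]

# unicodedata.category returns two-letter codes: Lo = Other Letter,
# Nd = Decimal Number, Ps/Pe = Open/Close Punctuation,
# Zs = Space Separator, Po = Other Punctuation, Lu = Uppercase Letter.
# NFC normalization keeps each Hangul syllable as a single code point.
category_counts = Counter(
    unicodedata.category(ch)
    for name in names
    for ch in unicodedata.normalize("NFC", name)
)

for code, count in category_counts.most_common():
    print(code, count)
```

On this four-string sample, Hangul syllables (category `Lo`) dominate, as they do in the full column.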
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 43 | 8.5% |
아 | 42 | 8.3% |
콘 | 42 | 8.3% |
식 | 31 | 6.2% |
공 | 30 | 6.0% |
룡 | 29 | 5.8% |
육 | 26 | 5.2% |
자 | 15 | 3.0% |
로 | 13 | 2.6% |
봇 | 13 | 2.6% |
Other values (94) | 220 | 43.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 16 | 27.6% |
2 | 10 | 17.2% |
4 | 5 | 8.6% |
3 | 5 | 8.6% |
5 | 4 | 6.9% |
6 | 4 | 6.9% |
8 | 4 | 6.9% |
7 | 4 | 6.9% |
0 | 3 | 5.2% |
9 | 3 | 5.2% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1 | 25.0% |
D | 1 | 25.0% |
B | 1 | 25.0% |
C | 1 | 25.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 9 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 9 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 6 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 6 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 504 | 84.6% |
Common | 88 | 14.8% |
Latin | 4 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 43 | 8.5% |
아 | 42 | 8.3% |
콘 | 42 | 8.3% |
식 | 31 | 6.2% |
공 | 30 | 6.0% |
룡 | 29 | 5.8% |
육 | 26 | 5.2% |
자 | 15 | 3.0% |
로 | 13 | 2.6% |
봇 | 13 | 2.6% |
Other values (94) | 220 | 43.7% |
Common
Value | Count | Frequency (%) |
1 | 16 | 18.2% |
2 | 10 | 11.4% |
( | 9 | 10.2% |
) | 9 | 10.2% |
(space) | 6 | 6.8% |
/ | 6 | 6.8% |
4 | 5 | 5.7% |
3 | 5 | 5.7% |
5 | 4 | 4.5% |
6 | 4 | 4.5% |
Other values (4) | 14 | 15.9% |
Latin
Value | Count | Frequency (%) |
A | 1 | 25.0% |
D | 1 | 25.0% |
B | 1 | 25.0% |
C | 1 | 25.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 504 | 84.6% |
ASCII | 92 | 15.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 43 | 8.5% |
아 | 42 | 8.3% |
콘 | 42 | 8.3% |
식 | 31 | 6.2% |
공 | 30 | 6.0% |
룡 | 29 | 5.8% |
육 | 26 | 5.2% |
자 | 15 | 3.0% |
로 | 13 | 2.6% |
봇 | 13 | 2.6% |
Other values (94) | 220 | 43.7% |
ASCII
Value | Count | Frequency (%) |
1 | 16 | 17.4% |
2 | 10 | 10.9% |
( | 9 | 9.8% |
) | 9 | 9.8% |
(space) | 6 | 6.5% |
/ | 6 | 6.5% |
4 | 5 | 5.4% |
3 | 5 | 5.4% |
5 | 4 | 4.3% |
6 | 4 | 4.3% |
Other values (8) | 18 | 19.6% |
우선순위 (priority)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 22.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5660377 |
Minimum | 1 |
---|---|
Maximum | 24 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 5 |
Q3 | 8.75 |
95-th percentile | 18.75 |
Maximum | 24 |
Range | 23 |
Interquartile range (IQR) | 6.75 |
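Range and IQR are derived from the other entries in the table above. A short sketch that recomputes them from the reported quantiles; the variable names are illustrative:

```python
# Quantiles copied from the Quantile statistics table.
minimum, q1, q3, maximum = 1, 2, 8.75, 24

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(value_range, iqr)
```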
Descriptive statistics
Standard deviation | 5.6535179 |
---|---|
Coefficient of variation (CV) | 0.86102427 |
Kurtosis | 1.0905805 |
Mean | 6.5660377 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 1.3392032 |
Sum | 696 |
Variance | 31.962264 |
Monotonicity | Not monotonic |
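Several of the descriptive statistics above are tied together by simple identities: the mean is the sum divided by the number of observations, the standard deviation is the square root of the variance, and the coefficient of variation is the standard deviation divided by the mean. A sketch that checks these relations against the reported figures; because the inputs are rounded values from the report, the last digits may differ slightly:

```python
import math

n = 106               # "Number of observations"
total = 696           # "Sum"
variance = 31.962264  # "Variance" (as rounded in the report)

mean = total / n            # arithmetic mean
std = math.sqrt(variance)   # standard deviation
cv = std / mean             # coefficient of variation

print(f"mean={mean:.4f} std={std:.4f} cv={cv:.4f}")
```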
Value | Count | Frequency (%) |
1 | 14 | 13.2% |
2 | 14 | 13.2% |
3 | 12 | 11.3% |
4 | 11 | 10.4% |
5 | 9 | 8.5% |
6 | 9 | 8.5% |
7 | 5 | 4.7% |
8 | 5 | 4.7% |
9 | 3 | 2.8% |
10 | 3 | 2.8% |
Other values (14) | 21 | 19.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 14 | 13.2% |
2 | 14 | 13.2% |
3 | 12 | 11.3% |
4 | 11 | 10.4% |
5 | 9 | 8.5% |
6 | 9 | 8.5% |
7 | 5 | 4.7% |
8 | 5 | 4.7% |
9 | 3 | 2.8% |
10 | 3 | 2.8% |
Maximum 10 values
Value | Count | Frequency (%) |
24 | 1 | 0.9% |
23 | 1 | 0.9% |
22 | 1 | 0.9% |
21 | 1 | 0.9% |
20 | 1 | 0.9% |
19 | 1 | 0.9% |
18 | 1 | 0.9% |
17 | 2 | 1.9% |
16 | 2 | 1.9% |
15 | 2 | 1.9% |
Variable | 코드번호 | 코드상세번호 | 우선순위 |
---|---|---|---|
코드번호 | 1.000 | 0.000 | 0.000 |
코드상세번호 | 0.000 | 1.000 | 1.000 |
우선순위 | 0.000 | 1.000 | 1.000 |
Variable | 코드번호 | 코드상세번호 |
---|---|---|
코드번호 | 1.000 | 0.000 |
코드상세번호 | 0.000 | 1.000 |
Variable | 우선순위 | 코드번호 | 코드상세번호 |
---|---|---|---|
우선순위 | 1.000 | 0.000 | 0.854 |
코드번호 | 0.000 | 1.000 | 0.000 |
코드상세번호 | 0.854 | 0.000 | 1.000 |
Row | 코드번호 | 코드상세번호 | 코드명 | 우선순위 |
---|---|---|---|---|
0 | I002 | 5 | 활자아이콘5 | 5 |
1 | I002 | 6 | 활자아이콘6 | 6 |
2 | I002 | 7 | 활자아이콘7 | 7 |
3 | I002 | 8 | 활자아이콘8 | 8 |
4 | I003 | 6 | 로봇아이콘6 | 6 |
5 | I003 | 7 | 로봇아이콘7 | 7 |
6 | I003 | 8 | 로봇아이콘8 | 8 |
7 | I003 | 9 | 로봇아이콘9 | 9 |
8 | I003 | 10 | 로봇아이콘10 | 10 |
9 | 1 | C001 | 공룡 | 1 |
Row | 코드번호 | 코드상세번호 | 코드명 | 우선순위 |
---|---|---|---|---|
96 | 4 | 14 | 경상북도(대구) | 14 |
97 | 4 | 15 | 전라남도(광주) | 15 |
98 | 4 | 16 | 경상남도(울산) | 16 |
99 | 4 | 17 | 해외과학관 | 17 |
100 | 1 | C005 | 우리나라 텃새 | 5 |
101 | 1 | C008 | 축음기 | 8 |
102 | 1 | C006 | 수의역사 | 6 |
103 | C001 | 4 | 잡식공룡 | 4 |
104 | 1 | C004 | 우리나라 성곽축조과학 | 4 |
105 | 1 | C007 | 컴퓨터 | 7 |