Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 796 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 19.6 KiB |
Average record size in memory | 25.2 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | Information on the annual most popular keywords on the institution's main website; provides the date each keyword was registered, the keyword, and its view count. |
---|---|
Author | 한국보건산업진흥원 (Korea Health Industry Development Institute, KHIDI) |
URL | https://www.data.go.kr/data/15122038/fileData.do |
Reproduction
Analysis started | 2023-12-12 21:07:46.736951 |
---|---|
Analysis finished | 2023-12-12 21:07:47.237766 |
Duration | 0.5 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
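A report of this kind could be regenerated along the following lines. This is a minimal sketch, not the configuration actually used for this run: the CSV path, the output file name, the report title, and the `encoding` default are all assumptions, and the settings stored in config.json are not reproduced here. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are used.

```python
def build_report(csv_path, output_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report.

    csv_path, output_path and encoding are hypothetical parameters;
    adjust them to match the file downloaded from the dataset URL.
    """
    # Imported lazily so that defining the function does not require
    # the (fairly heavy) profiling dependencies to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is an assumption; the original report title is not known.
    profile = ProfileReport(df, title="Annual popular keywords")
    profile.to_file(output_path)
    return profile
```

Korean public-data CSV files are sometimes encoded as CP949 rather than UTF-8, so `encoding="cp949"` may be worth trying if the default fails to decode the file.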
키워드 날짜 (keyword date)
Categorical
Distinct | 8 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.3 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-12-31 |
---|---|
2nd row | 2020-12-31 |
3rd row | 2020-12-31 |
4th row | 2020-12-31 |
5th row | 2020-12-31 |
Common Values
Value | Count | Frequency (%) |
2020-12-31 | 100 | 12.6% |
2018-12-31 | 100 | 12.6% |
2021-12-31 | 100 | 12.6% |
2015-12-31 | 100 | 12.6% |
2017-12-31 | 100 | 12.6% |
2017-09-25 | 100 | 12.6% |
2022-12-31 | 100 | 12.6% |
2019-12-31 | 96 | 12.1% |
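The Frequency (%) column is each count divided by the number of observations. The sketch below recomputes it from the counts in the Common Values table; rounding to one decimal place is an assumption based on how the other tables in this report are displayed.

```python
# Counts copied from the Common Values table for the date column.
counts = {
    "2020-12-31": 100,
    "2018-12-31": 100,
    "2021-12-31": 100,
    "2015-12-31": 100,
    "2017-12-31": 100,
    "2017-09-25": 100,
    "2022-12-31": 100,
    "2019-12-31": 96,
}

# The counts should add up to the number of observations in the dataset.
total = sum(counts.values())


def frequency_percent(count, total):
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


freq = {value: frequency_percent(n, total) for value, n in counts.items()}

for value, pct in freq.items():
    print(f"{value} | {counts[value]} | {pct:.1f}% |")
```

Because each percentage is rounded independently, the column need not sum to exactly 100%.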
키워드 (keyword)
Text
Distinct | 464 |
---|---|
Distinct (%) | 58.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.3 KiB |
Value | Count | Frequency (%) |
medical | 35 | 3.3% |
korea | 19 | 1.8% |
bio | 18 | 1.7% |
device | 18 | 1.7% |
market | 18 | 1.7% |
global | 17 | 1.6% |
kimes | 11 | 1.0% |
2019 | 10 | 0.9% |
khidi | 9 | 0.8% |
ghkol | 8 | 0.7% |
Other values (513) | 908 | 84.8% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 290 | 4.4% |
e | 279 | 4.2% |
a | 239 | 3.6% |
, | 222 | 3.4% |
i | 184 | 2.8% |
l | 157 | 2.4% |
r | 157 | 2.4% |
o | 137 | 2.1% |
0 | 135 | 2.0% |
I | 134 | 2.0% |
Other values (342) | 4658 | 70.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2082 | 31.6% |
Lowercase Letter | 1988 | 30.2% |
Uppercase Letter | 1349 | 20.5% |
Decimal Number | 537 | 8.1% |
Space Separator | 290 | 4.4% |
Other Punctuation | 260 | 3.9% |
Dash Punctuation | 39 | 0.6% |
Close Punctuation | 20 | 0.3% |
Open Punctuation | 20 | 0.3% |
Math Symbol | 7 | 0.1% |
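The category names in this table appear to correspond to Unicode general categories (for example, Hangul syllables fall under "Other Letter"). As a sketch under that assumption, the standard-library `unicodedata` module can reproduce the classification; the `CATEGORY_NAMES` mapping and the `count_categories` helper are illustrative names, and the sample strings are keywords taken from the sample rows of this report.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general-category codes mapped to the long names
# shown in the report. Only the categories that occur here are listed.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Pe": "Close Punctuation",
    "Ps": "Open Punctuation",
    "Sm": "Math Symbol",
}


def count_categories(texts):
    """Count characters per Unicode general category across all texts."""
    counter = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unlisted categories.
            counter[CATEGORY_NAMES.get(code, code)] += 1
    return counter


sample = ["R&D", "2022년 제1회 KBIC STAR DAY", "개최 (2.24(목))"]
result = count_categories(sample)

for name, n in result.most_common():
    print(f"{name} | {n} |")
```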
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 62 | 3.0% |
업 | 55 | 2.6% |
고 | 47 | 2.3% |
년 | 43 | 2.1% |
인 | 42 | 2.0% |
산 | 39 | 1.9% |
공 | 38 | 1.8% |
가 | 38 | 1.8% |
시 | 35 | 1.7% |
차 | 35 | 1.7% |
Other values (267) | 1648 | 79.2% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 134 | 9.9% |
A | 110 | 8.2% |
M | 108 | 8.0% |
K | 100 | 7.4% |
D | 85 | 6.3% |
B | 82 | 6.1% |
O | 73 | 5.4% |
C | 72 | 5.3% |
E | 70 | 5.2% |
T | 70 | 5.2% |
Other values (16) | 445 | 33.0% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 279 | 14.0% |
a | 239 | 12.0% |
i | 184 | 9.3% |
l | 157 | 7.9% |
r | 157 | 7.9% |
o | 137 | 6.9% |
t | 114 | 5.7% |
n | 113 | 5.7% |
c | 100 | 5.0% |
s | 74 | 3.7% |
Other values (15) | 434 | 21.8% |
Decimal Number
Value | Count | Frequency (%) |
0 | 135 | 25.1% |
2 | 128 | 23.8% |
1 | 113 | 21.0% |
7 | 35 | 6.5% |
4 | 34 | 6.3% |
3 | 25 | 4.7% |
9 | 22 | 4.1% |
5 | 18 | 3.4% |
8 | 17 | 3.2% |
6 | 10 | 1.9% |
Other Punctuation
Value | Count | Frequency (%) |
, | 222 | 85.4% |
. | 25 | 9.6% |
& | 10 | 3.8% |
· | 2 | 0.8% |
' | 1 | 0.4% |
Math Symbol
Value | Count | Frequency (%) |
+ | 5 | 71.4% |
= | 1 | 14.3% |
~ | 1 | 14.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 19 | 95.0% |
] | 1 | 5.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 19 | 95.0% |
[ | 1 | 5.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 290 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 39 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3337 | 50.6% |
Hangul | 2082 | 31.6% |
Common | 1173 | 17.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 62 | 3.0% |
업 | 55 | 2.6% |
고 | 47 | 2.3% |
년 | 43 | 2.1% |
인 | 42 | 2.0% |
산 | 39 | 1.9% |
공 | 38 | 1.8% |
가 | 38 | 1.8% |
시 | 35 | 1.7% |
차 | 35 | 1.7% |
Other values (267) | 1648 | 79.2% |
Latin
Value | Count | Frequency (%) |
e | 279 | 8.4% |
a | 239 | 7.2% |
i | 184 | 5.5% |
l | 157 | 4.7% |
r | 157 | 4.7% |
o | 137 | 4.1% |
I | 134 | 4.0% |
t | 114 | 3.4% |
n | 113 | 3.4% |
A | 110 | 3.3% |
Other values (41) | 1713 | 51.3% |
Common
Value | Count | Frequency (%) |
(space) | 290 | 24.7% |
, | 222 | 18.9% |
0 | 135 | 11.5% |
2 | 128 | 10.9% |
1 | 113 | 9.6% |
- | 39 | 3.3% |
7 | 35 | 3.0% |
4 | 34 | 2.9% |
. | 25 | 2.1% |
3 | 25 | 2.1% |
Other values (14) | 127 | 10.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4508 | 68.4% |
Hangul | 2082 | 31.6% |
None | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 290 | 6.4% |
e | 279 | 6.2% |
a | 239 | 5.3% |
, | 222 | 4.9% |
i | 184 | 4.1% |
l | 157 | 3.5% |
r | 157 | 3.5% |
o | 137 | 3.0% |
0 | 135 | 3.0% |
I | 134 | 3.0% |
Other values (64) | 2574 | 57.1% |
Hangul
Value | Count | Frequency (%) |
기 | 62 | 3.0% |
업 | 55 | 2.6% |
고 | 47 | 2.3% |
년 | 43 | 2.1% |
인 | 42 | 2.0% |
산 | 39 | 1.9% |
공 | 38 | 1.8% |
가 | 38 | 1.8% |
시 | 35 | 1.7% |
차 | 35 | 1.7% |
Other values (267) | 1648 | 79.2% |
None
Value | Count | Frequency (%) |
· | 2 | 100.0% |
조회수 (view count)
Real number (ℝ)
Distinct | 672 |
---|---|
Distinct (%) | 84.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25566.974 |
Minimum | 1 |
---|---|
Maximum | 717746 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 68 |
Q1 | 252.75 |
median | 1043.5 |
Q3 | 7392.25 |
95-th percentile | 150013 |
Maximum | 717746 |
Range | 717745 |
Interquartile range (IQR) | 7139.5 |
Descriptive statistics
Standard deviation | 73374.681 |
---|---|
Coefficient of variation (CV) | 2.8699009 |
Kurtosis | 34.033378 |
Mean | 25566.974 |
Median Absolute Deviation (MAD) | 917 |
Skewness | 5.143721 |
Sum | 20351311 |
Variance | 5.3838438 × 10^9 |
Monotonicity | Not monotonic |
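Several of the figures above are derived from others in the same tables, which makes them easy to cross-check. The sketch below recomputes the range, interquartile range, mean, coefficient of variation, and variance from the reported minimum, maximum, quartiles, sum, observation count, and standard deviation. The variable names are illustrative, and small differences are expected because the inputs are already rounded.

```python
# Values copied from the quantile and descriptive statistics tables.
minimum, maximum = 1, 717746
q1, q3 = 252.75, 7392.25
total, n = 20351311, 796   # Sum and number of observations
std = 73374.681            # Standard deviation as reported

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
mean = total / n                  # Mean
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(f"Range    | {value_range}")
print(f"IQR      | {iqr}")
print(f"Mean     | {mean:.3f}")
print(f"CV       | {cv:.7f}")
print(f"Variance | {variance:.7e}")
```

A coefficient of variation well above 1, together with a mean far larger than the median, is consistent with the strong right skew reported for this column.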
Common values
Value | Count | Frequency (%) |
68 | 5 | 0.6% |
244 | 4 | 0.5% |
1142 | 4 | 0.5% |
208 | 4 | 0.5% |
211 | 4 | 0.5% |
221 | 3 | 0.4% |
194 | 3 | 0.4% |
77 | 3 | 0.4% |
3 | 3 | 0.4% |
79 | 3 | 0.4% |
Other values (662) | 760 | 95.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
3 | 3 | 0.4% |
6 | 3 | 0.4% |
10 | 1 | 0.1% |
12 | 1 | 0.1% |
16 | 1 | 0.1% |
23 | 1 | 0.1% |
24 | 1 | 0.1% |
29 | 1 | 0.1% |
30 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
717746 | 1 | 0.1% |
672011 | 1 | 0.1% |
671051 | 1 | 0.1% |
532779 | 1 | 0.1% |
532626 | 1 | 0.1% |
400329 | 1 | 0.1% |
398011 | 1 | 0.1% |
380790 | 1 | 0.1% |
368800 | 1 | 0.1% |
357822 | 1 | 0.1% |
Correlations
Variable | 키워드 날짜 | 조회수 |
---|---|---|
키워드 날짜 | 1.000 | 0.243 |
조회수 | 0.243 | 1.000 |
Variable | 조회수 | 키워드 날짜 |
---|---|---|
조회수 | 1.000 | 0.083 |
키워드 날짜 | 0.083 | 1.000 |
First rows
Index | 키워드 날짜 | 키워드 | 조회수 |
---|---|---|---|
0 | 2020-12-31 | 공고 | 532626 |
1 | 2020-12-31 | 2019년 | 380790 |
2 | 2020-12-31 | 2019 | 259956 |
3 | 2020-12-31 | 고령 | 218570 |
4 | 2020-12-31 | 개최 | 209983 |
5 | 2020-12-31 | 가이드라인 | 207761 |
6 | 2020-12-31 | KOTRA | 185716 |
7 | 2020-12-31 | NET | 168884 |
8 | 2020-12-31 | R&D | 151591 |
9 | 2020-12-31 | UAE | 150616 |
Last rows
Index | 키워드 날짜 | 키워드 | 조회수 |
---|---|---|---|
786 | 2022-12-31 | = | 309 |
787 | 2022-12-31 | 4분기 | 302 |
788 | 2022-12-31 | Industry | 284 |
789 | 2022-12-31 | 8월5일 | 270 |
790 | 2022-12-31 | 대상과제 | 257 |
791 | 2022-12-31 | 2022년 제1회 KBIC STAR DAY | 244 |
792 | 2022-12-31 | 개최 (2.24(목)) | 244 |
793 | 2022-12-31 | 디지털병리 | 172 |
794 | 2022-12-31 | 기술 | 3 |
795 | 2022-12-31 | 국가기술표준원 | 3 |