Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.6 KiB |
Average record size in memory | 54.4 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 3 |
Text | 2 |
Dataset
Description | Sample data |
---|---|
Author | Hanyang University |
URL | https://bigdata-region.kr/#/dataset/80a4842e-deee-45fd-85eb-74d3699ca208 |
Reproduction
Analysis started | 2023-12-10 13:46:36.762921 |
---|---|
Analysis finished | 2023-12-10 13:46:37.529810 |
Duration | 0.77 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
해시태그수집일자 (hashtag collection date)
Date
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2020-11-01 00:00:00 |
---|---|
Maximum | 2020-11-01 00:00:00 |
해시태그채널ID (hashtag channel ID)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 56 |
---|---|
Median length | 56 |
Mean length | 56 |
Min length | 56 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg |
---|---|
2nd row | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg |
3rd row | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ |
4th row | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg |
5th row | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg |
Common Values
Value | Count | Frequency (%) |
https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | 17 | 56.7% |
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | 13 | 43.3% |
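The Frequency (%) column is each count divided by the total number of observations (30 in this dataset). A minimal sketch of that arithmetic, using stand-in labels for the two channel URLs; the helper name `frequency_table` is hypothetical and this is not ydata-profiling's own code:

```python
from collections import Counter

def frequency_table(values):
    """Return (value, count, percent-string) rows, most common first."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(v, c, f"{c / total:.1%}") for v, c in counts.most_common()]

# Stand-in labels for the two channel URLs, repeated per the counts above.
rows = frequency_table(["channel_A"] * 17 + ["channel_B"] * 13)
for value, count, pct in rows:
    print(value, count, pct)
```

The same division applies to every Value | Count | Frequency (%) table in this report, with the denominator being the table's own total (observations, characters, or characters within a category).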
해시태그영상ID (hashtag video ID)
Text
Distinct | 27 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 43 |
---|---|
Median length | 43 |
Mean length | 43 |
Min length | 43 |
Characters and Unicode
Total characters | 1290 |
---|---|
Distinct characters | 69 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 24 |
---|---|
Unique (%) | 80.0% |
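In this report, Distinct counts the different values in a column, while Unique counts the values that occur exactly once. The figures for this column fit that reading: 27 distinct values, of which 24 occur once and 3 occur twice, give 24 + 3 × 2 = 30 observations. A minimal sketch with made-up values (the helper name `distinct_and_unique` is hypothetical):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Made-up data: "b" and "d" repeat, so they are distinct but not unique.
sample = ["a", "b", "b", "c", "d", "d"]
print(distinct_and_unique(sample))
```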
Sample
1st row | https://www.youtube.com/watch?v=vpv1k45SuSg |
---|---|
2nd row | https://www.youtube.com/watch?v=vVLeDvKJ4is |
3rd row | https://www.youtube.com/watch?v=zjRwVyQB5os |
4th row | https://www.youtube.com/watch?v=w2mfqaAxdCU |
5th row | https://www.youtube.com/watch?v=w2_1gb3qoho |
Value | Count | Frequency (%) |
https://www.youtube.com/watch?v=zo5p5oxqxg4 | 2 | 6.7% |
https://www.youtube.com/watch?v=znviox8ceis | 2 | 6.7% |
https://www.youtube.com/watch?v=zmsm38xl16m | 2 | 6.7% |
https://www.youtube.com/watch?v=wzwzrxylbni | 1 | 3.3% |
https://www.youtube.com/watch?v=vpv1k45susg | 1 | 3.3% |
https://www.youtube.com/watch?v=wjqx8nhfoe0 | 1 | 3.3% |
https://www.youtube.com/watch?v=x_jrsgklzba | 1 | 3.3% |
https://www.youtube.com/watch?v=zopz3y-wz0w | 1 | 3.3% |
https://www.youtube.com/watch?v=xz7fjo_ldfa | 1 | 3.3% |
https://www.youtube.com/watch?v=xzcdemoxics | 1 | 3.3% |
Other values (17) | 17 | 56.7% |
Most occurring characters
Value | Count | Frequency (%) |
w | 134 | 10.4% |
t | 120 | 9.3% |
/ | 90 | 7.0% |
o | 69 | 5.3% |
c | 66 | 5.1% |
h | 64 | 5.0% |
u | 62 | 4.8% |
. | 60 | 4.7% |
s | 37 | 2.9% |
y | 36 | 2.8% |
Other values (59) | 552 | 42.8% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 873 | 67.7% |
Other Punctuation | 210 | 16.3% |
Uppercase Letter | 121 | 9.4% |
Decimal Number | 48 | 3.7% |
Math Symbol | 30 | 2.3% |
Connector Punctuation | 6 | 0.5% |
Dash Punctuation | 2 | 0.2% |
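The category names in this table correspond to Unicode general categories, for example `Ll` (Lowercase Letter), `Lu` (Uppercase Letter), `Nd` (Decimal Number), `Po` (Other Punctuation), `Sm` (Math Symbol), `Pc` (Connector Punctuation), and `Pd` (Dash Punctuation). A minimal sketch using Python's standard `unicodedata` module on the first sample URL; it illustrates the classification only and is not necessarily how ydata-profiling computes these counts:

```python
import unicodedata
from collections import Counter

# First sample row of the video ID column.
url = "https://www.youtube.com/watch?v=vpv1k45SuSg"

# Tally the Unicode general category of every character in the URL.
category_counts = Counter(unicodedata.category(ch) for ch in url)
print(category_counts.most_common())
```

Each URL contributes seven `Po` characters (`:`, three `/`, two `.`, and `?`) and one `Sm` character (`=`), which matches the totals of 210 and 30 over 30 rows.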
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
w | 134 | 15.3% |
t | 120 | 13.7% |
o | 69 | 7.9% |
c | 66 | 7.6% |
h | 64 | 7.3% |
u | 62 | 7.1% |
s | 37 | 4.2% |
y | 36 | 4.1% |
p | 36 | 4.1% |
m | 36 | 4.1% |
Other values (16) | 213 | 24.4% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 12 | 9.9% |
Z | 10 | 8.3% |
O | 8 | 6.6% |
S | 8 | 6.6% |
X | 8 | 6.6% |
Q | 7 | 5.8% |
P | 6 | 5.0% |
M | 6 | 5.0% |
Y | 5 | 4.1% |
B | 5 | 4.1% |
Other values (16) | 46 | 38.0% |
Decimal Number
Value | Count | Frequency (%) |
5 | 8 | 16.7% |
4 | 7 | 14.6% |
3 | 6 | 12.5% |
8 | 5 | 10.4% |
2 | 5 | 10.4% |
0 | 4 | 8.3% |
6 | 4 | 8.3% |
1 | 4 | 8.3% |
7 | 3 | 6.2% |
9 | 2 | 4.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 90 | 42.9% |
. | 60 | 28.6% |
: | 30 | 14.3% |
? | 30 | 14.3% |
Math Symbol
Value | Count | Frequency (%) |
= | 30 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 6 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 994 | 77.1% |
Common | 296 | 22.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
w | 134 | 13.5% |
t | 120 | 12.1% |
o | 69 | 6.9% |
c | 66 | 6.6% |
h | 64 | 6.4% |
u | 62 | 6.2% |
s | 37 | 3.7% |
y | 36 | 3.6% |
p | 36 | 3.6% |
m | 36 | 3.6% |
Other values (42) | 334 | 33.6% |
Common
Value | Count | Frequency (%) |
/ | 90 | 30.4% |
. | 60 | 20.3% |
: | 30 | 10.1% |
? | 30 | 10.1% |
= | 30 | 10.1% |
5 | 8 | 2.7% |
4 | 7 | 2.4% |
_ | 6 | 2.0% |
3 | 6 | 2.0% |
8 | 5 | 1.7% |
Other values (7) | 24 | 8.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1290 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
w | 134 | 10.4% |
t | 120 | 9.3% |
/ | 90 | 7.0% |
o | 69 | 5.3% |
c | 66 | 5.1% |
h | 64 | 5.0% |
u | 62 | 4.8% |
. | 60 | 4.7% |
s | 37 | 2.9% |
y | 36 | 2.8% |
Other values (59) | 552 | 42.8% |
해시태그추출명사명 (hashtag extracted noun)
Text
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
영상 | 3 | 10.0% |
오브 | 2 | 6.7% |
공 | 2 | 6.7% |
토니 | 1 | 3.3% |
해물 | 1 | 3.3% |
수호 | 1 | 3.3% |
튜버 | 1 | 3.3% |
포식자 | 1 | 3.3% |
선 | 1 | 3.3% |
배틀 | 1 | 3.3% |
Other values (16) | 16 | 53.3% |
Most occurring characters
Value | Count | Frequency (%) |
영 | 3 | 5.1% |
상 | 3 | 5.1% |
오 | 3 | 5.1% |
자 | 2 | 3.4% |
우 | 2 | 3.4% |
공 | 2 | 3.4% |
브 | 2 | 3.4% |
현 | 1 | 1.7% |
틀 | 1 | 1.7% |
은 | 1 | 1.7% |
Other values (39) | 39 | 66.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 59 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
영 | 3 | 5.1% |
상 | 3 | 5.1% |
오 | 3 | 5.1% |
자 | 2 | 3.4% |
우 | 2 | 3.4% |
공 | 2 | 3.4% |
브 | 2 | 3.4% |
현 | 1 | 1.7% |
틀 | 1 | 1.7% |
은 | 1 | 1.7% |
Other values (39) | 39 | 66.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 59 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
영 | 3 | 5.1% |
상 | 3 | 5.1% |
오 | 3 | 5.1% |
자 | 2 | 3.4% |
우 | 2 | 3.4% |
공 | 2 | 3.4% |
브 | 2 | 3.4% |
현 | 1 | 1.7% |
틀 | 1 | 1.7% |
은 | 1 | 1.7% |
Other values (39) | 39 | 66.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 59 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
영 | 3 | 5.1% |
상 | 3 | 5.1% |
오 | 3 | 5.1% |
자 | 2 | 3.4% |
우 | 2 | 3.4% |
공 | 2 | 3.4% |
브 | 2 | 3.4% |
현 | 1 | 1.7% |
틀 | 1 | 1.7% |
은 | 1 | 1.7% |
Other values (39) | 39 | 66.1% |
해시태그빈도수 (hashtag frequency)
Categorical
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 27 | 90.0% |
2 | 2 | 6.7% |
3 | 1 | 3.3% |
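The IMBALANCE alert on this variable reflects how unevenly the three classes are distributed (27, 2, and 1 observations). One common way to quantify this is one minus the normalized Shannon entropy of the class counts. The sketch below assumes that measure and an alert threshold of 0.5; both are assumptions for illustration and are not stated anywhere in this report.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 means balanced, values near 1 mean one class dominates."""
    n_classes = len(counts)
    if n_classes <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

ASSUMED_THRESHOLD = 0.5  # assumption for illustration, not taken from the report

frequency_counts = [27, 2, 1]  # class counts of 해시태그빈도수
recency_counts = [22, 6, 2]    # class counts of 해시태그최근지수
for name, counts in [("해시태그빈도수", frequency_counts), ("해시태그최근지수", recency_counts)]:
    score = imbalance_score(counts)
    print(name, round(score, 3), score > ASSUMED_THRESHOLD)
```

Under these assumptions 해시태그빈도수 scores above the threshold and 해시태그최근지수 scores below it, which is consistent with only the former carrying the alert.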
해시태그최근지수 (hashtag recency index)
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 19 |
---|---|
2nd row | 19 |
3rd row | 19 |
4th row | 20 |
5th row | 19 |
Common Values
Value | Count | Frequency (%) |
19 | 22 | 73.3% |
20 | 6 | 20.0% |
18 | 2 | 6.7% |
Variable | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 |
---|---|---|---|---|---|
해시태그채널ID | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
해시태그영상ID | 1.000 | 1.000 | 0.800 | 1.000 | 1.000 |
해시태그추출명사명 | 1.000 | 0.800 | 1.000 | 0.000 | 0.801 |
해시태그빈도수 | 0.000 | 1.000 | 0.000 | 1.000 | 0.340 |
해시태그최근지수 | 0.000 | 1.000 | 0.801 | 0.340 | 1.000 |
Variable | 해시태그채널ID | 해시태그빈도수 | 해시태그최근지수 |
---|---|---|---|
해시태그채널ID | 1.000 | 0.000 | 0.000 |
해시태그빈도수 | 0.000 | 1.000 | 0.107 |
해시태그최근지수 | 0.000 | 0.107 | 1.000 |
Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 |
---|---|---|---|---|---|---|
0 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=vpv1k45SuSg | 영상 | 1 | 19 |
1 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=vVLeDvKJ4is | 콜로 | 1 | 19 |
2 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zjRwVyQB5os | 장르 | 1 | 19 |
3 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=w2mfqaAxdCU | 오브 | 1 | 20 |
4 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=w2_1gb3qoho | 피자 | 1 | 19 |
5 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zlB_z3Y2ZUI | 퀴즈쇼 | 1 | 19 |
6 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=wOcfWIpnNqs | 꿀 | 1 | 19 |
7 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zlkSEeXcESw | 허양임 | 1 | 19 |
8 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=wSbuXywhj7U | 강의 | 1 | 19 |
9 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zm0A2B4rjXo | 생활 | 1 | 19 |
Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 |
---|---|---|---|---|---|---|
20 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=znviOX8ceIs | 우주 | 1 | 19 |
21 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=x9HfCcQqrDM | 마이 | 1 | 19 |
22 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zo5P5OxQxG4 | 배틀 | 1 | 20 |
23 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xIIZf0qT9ds | 공 | 1 | 20 |
24 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zo5P5OxQxG4 | 선 | 1 | 20 |
25 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xZCDemoxIcs | 포식자 | 1 | 19 |
26 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xZ7fJO_lDFA | 튜버 | 1 | 19 |
27 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zopZ3Y-WZ0w | 수호 | 2 | 19 |
28 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=x_jRSgKlZBA | 오브 | 1 | 19 |
29 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zpI_n5RLNPE | 전현무 | 1 | 18 |