Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.6 KiB |
Average record size in memory | 54.4 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 3 |
Text | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한양대 |
URL | https://bigdata-region.kr/#/dataset/3c212550-81d5-4b91-b3ba-5a76e02ec943 |
Reproduction
Analysis started | 2023-12-10 14:15:49.361605 |
---|---|
Analysis finished | 2023-12-10 14:15:50.063942 |
Duration | 0.7 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
해시태그수집일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2021-10-01 00:00:00 |
---|---|
Maximum | 2021-10-01 00:00:00 |
해시태그채널ID
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | |
---|---|
https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw |
Length
Max length | 56 |
---|---|
Median length | 56 |
Mean length | 56 |
Min length | 56 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw |
---|---|
2nd row | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw |
3rd row | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw |
4th row | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw |
5th row | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw |
Common Values
Value | Count | Frequency (%) |
https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | 17 | |
https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | 13 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
https://www.youtube.com/channel/ucfm_07mxv6cglrek8qdkpaw | 17 | |
https://www.youtube.com/channel/uckinyts9ihqoewr1sze2jtw | 13 |
해시태그영상ID
Text
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 43 |
---|---|
Median length | 43 |
Mean length | 43 |
Min length | 43 |
Characters and Unicode
Total characters | 1290 |
---|---|
Distinct characters | 69 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 28 ? |
---|---|
Unique (%) | 93.3% |
Sample
1st row | https://www.youtube.com/watch?v=9bf_LM0p_nA |
---|---|
2nd row | https://www.youtube.com/watch?v=9bf_LM0p_nA |
3rd row | https://www.youtube.com/watch?v=cJ1AreifGxw |
4th row | https://www.youtube.com/watch?v=9pOC80Llvto |
5th row | https://www.youtube.com/watch?v=9pGhong3FgI |
Value | Count | Frequency (%) |
https://www.youtube.com/watch?v=9bf_lm0p_na | 2 | 6.7% |
https://www.youtube.com/watch?v=c_9yz5pnmga | 1 | 3.3% |
https://www.youtube.com/watch?v=bnapmj04dum | 1 | 3.3% |
https://www.youtube.com/watch?v=cdfgnbk0hdy | 1 | 3.3% |
https://www.youtube.com/watch?v=bdpsxlfzfco | 1 | 3.3% |
https://www.youtube.com/watch?v=bmdvfgdvypm | 1 | 3.3% |
https://www.youtube.com/watch?v=cbxewahrva4 | 1 | 3.3% |
https://www.youtube.com/watch?v=b_8oxvs5ele | 1 | 3.3% |
https://www.youtube.com/watch?v=cbpnypsyc_g | 1 | 3.3% |
https://www.youtube.com/watch?v=b6q54dz3-f8 | 1 | 3.3% |
Other values (19) | 19 |
Most occurring characters
Value | Count | Frequency (%) |
t | 124 | 9.6% |
w | 121 | 9.4% |
/ | 90 | 7.0% |
c | 74 | 5.7% |
o | 67 | 5.2% |
h | 63 | 4.9% |
u | 61 | 4.7% |
. | 60 | 4.7% |
b | 40 | 3.1% |
p | 40 | 3.1% |
Other values (59) | 550 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 865 | |
Other Punctuation | 210 | 16.3% |
Uppercase Letter | 122 | 9.5% |
Decimal Number | 51 | 4.0% |
Math Symbol | 30 | 2.3% |
Connector Punctuation | 8 | 0.6% |
Dash Punctuation | 4 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 124 | |
w | 121 | |
c | 74 | 8.6% |
o | 67 | 7.7% |
h | 63 | 7.3% |
u | 61 | 7.1% |
b | 40 | 4.6% |
p | 40 | 4.6% |
y | 37 | 4.3% |
s | 36 | 4.2% |
Other values (16) | 202 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 14 | 11.5% |
B | 9 | 7.4% |
O | 8 | 6.6% |
F | 8 | 6.6% |
U | 7 | 5.7% |
G | 7 | 5.7% |
Y | 6 | 4.9% |
M | 6 | 4.9% |
Q | 6 | 4.9% |
C | 5 | 4.1% |
Other values (16) | 46 |
Decimal Number
Value | Count | Frequency (%) |
9 | 9 | |
0 | 8 | |
8 | 8 | |
4 | 7 | |
5 | 4 | |
6 | 4 | |
3 | 4 | |
7 | 3 | 5.9% |
2 | 3 | 5.9% |
1 | 1 | 2.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 90 | |
. | 60 | |
: | 30 | 14.3% |
? | 30 | 14.3% |
Math Symbol
Value | Count | Frequency (%) |
= | 30 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 8 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 987 | |
Common | 303 | 23.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 124 | 12.6% |
w | 121 | 12.3% |
c | 74 | 7.5% |
o | 67 | 6.8% |
h | 63 | 6.4% |
u | 61 | 6.2% |
b | 40 | 4.1% |
p | 40 | 4.1% |
y | 37 | 3.7% |
s | 36 | 3.6% |
Other values (42) | 324 |
Common
Value | Count | Frequency (%) |
/ | 90 | |
. | 60 | |
: | 30 | 9.9% |
= | 30 | 9.9% |
? | 30 | 9.9% |
9 | 9 | 3.0% |
_ | 8 | 2.6% |
0 | 8 | 2.6% |
8 | 8 | 2.6% |
4 | 7 | 2.3% |
Other values (7) | 23 | 7.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1290 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
t | 124 | 9.6% |
w | 121 | 9.4% |
/ | 90 | 7.0% |
c | 74 | 5.7% |
o | 67 | 5.2% |
h | 63 | 4.9% |
u | 61 | 4.7% |
. | 60 | 4.7% |
b | 40 | 3.1% |
p | 40 | 3.1% |
Other values (59) | 550 |
해시태그추출명사명
Text
Distinct | 24 |
---|---|
Distinct (%) | 80.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
스톤 | 3 | 10.0% |
철권 | 3 | 10.0% |
에스에스비 | 2 | 6.7% |
공 | 2 | 6.7% |
에픽 | 1 | 3.3% |
도서관 | 1 | 3.3% |
민국 | 1 | 3.3% |
접종 | 1 | 3.3% |
생산 | 1 | 3.3% |
블리자드 | 1 | 3.3% |
Other values (14) | 14 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 10 | 15.4% |
에 | 6 | 9.2% |
철 | 3 | 4.6% |
권 | 3 | 4.6% |
톤 | 3 | 4.6% |
서 | 2 | 3.1% |
도 | 2 | 3.1% |
국 | 2 | 3.1% |
관 | 2 | 3.1% |
드 | 2 | 3.1% |
Other values (26) | 30 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 10 | 15.4% |
에 | 6 | 9.2% |
철 | 3 | 4.6% |
권 | 3 | 4.6% |
톤 | 3 | 4.6% |
서 | 2 | 3.1% |
도 | 2 | 3.1% |
국 | 2 | 3.1% |
관 | 2 | 3.1% |
드 | 2 | 3.1% |
Other values (26) | 30 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 10 | 15.4% |
에 | 6 | 9.2% |
철 | 3 | 4.6% |
권 | 3 | 4.6% |
톤 | 3 | 4.6% |
서 | 2 | 3.1% |
도 | 2 | 3.1% |
국 | 2 | 3.1% |
관 | 2 | 3.1% |
드 | 2 | 3.1% |
Other values (26) | 30 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 10 | 15.4% |
에 | 6 | 9.2% |
철 | 3 | 4.6% |
권 | 3 | 4.6% |
톤 | 3 | 4.6% |
서 | 2 | 3.1% |
도 | 2 | 3.1% |
국 | 2 | 3.1% |
관 | 2 | 3.1% |
드 | 2 | 3.1% |
Other values (26) | 30 |
해시태그빈도수
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
1 | |
---|---|
3 | 2 |
2 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 27 | |
3 | 2 | 6.7% |
2 | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 27 | |
3 | 2 | 6.7% |
2 | 1 | 3.3% |
해시태그최근지수
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
20 | |
---|---|
18 | |
19 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 18 |
4th row | 20 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 15 | |
18 | 11 | |
19 | 4 | 13.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 15 | |
18 | 11 | |
19 | 4 | 13.3% |
해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 | |
---|---|---|---|---|---|
해시태그채널ID | 1.000 | 1.000 | 1.000 | 0.000 | 0.600 |
해시태그영상ID | 1.000 | 1.000 | 0.948 | 1.000 | 1.000 |
해시태그추출명사명 | 1.000 | 0.948 | 1.000 | 1.000 | 0.967 |
해시태그빈도수 | 0.000 | 1.000 | 1.000 | 1.000 | 0.625 |
해시태그최근지수 | 0.600 | 1.000 | 0.967 | 0.625 | 1.000 |
해시태그최근지수 | 해시태그빈도수 | 해시태그채널ID | |
---|---|---|---|
해시태그최근지수 | 1.000 | 0.287 | 0.853 |
해시태그빈도수 | 0.287 | 1.000 | 0.000 |
해시태그채널ID | 0.853 | 0.000 | 1.000 |
해시태그채널ID | 해시태그빈도수 | 해시태그최근지수 | |
---|---|---|---|
해시태그채널ID | 1.000 | 0.000 | 0.853 |
해시태그빈도수 | 0.000 | 1.000 | 0.287 |
해시태그최근지수 | 0.853 | 0.287 | 1.000 |
해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 | |
---|---|---|---|---|---|---|
0 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=9bf_LM0p_nA | 도서관 | 1 | 20 |
1 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=9bf_LM0p_nA | 관 | 1 | 20 |
2 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cJ1AreifGxw | 산 | 1 | 18 |
3 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=9pOC80Llvto | 폴리 | 1 | 20 |
4 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=9pGhong3FgI | 스톤 | 1 | 20 |
5 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cNOFfzrOHGA | 뉴스 | 3 | 20 |
6 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=9qs976PQokg | 스톤 | 1 | 20 |
7 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cQVKvkddG-g | 에스에스비 | 1 | 18 |
8 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=A0CUrUTGiTY | 하스 | 1 | 20 |
9 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cQZnGaeBOIQ | 에스 | 2 | 19 |
해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수 | |
---|---|---|---|---|---|---|
20 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cantqg2Bt_4 | 서울 | 1 | 18 |
21 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=B6q54Dz3-f8 | 도 | 1 | 20 |
22 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cbpnYPsyc_g | 미국 | 1 | 18 |
23 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=B_8OXVS5ElE | 블리자드 | 1 | 19 |
24 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cbxeWaHrvA4 | 생산 | 1 | 18 |
25 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=BmdvFgdvYpM | 공 | 1 | 19 |
26 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=BdpsxLfZFCo | 스톤 | 1 | 20 |
27 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cdfgnbK0hdY | 접종 | 1 | 18 |
28 | 2021-10-01 | https://www.youtube.com/channel/UCFM_07Mxv6CglREk8qdkPaw | https://www.youtube.com/watch?v=BnAPMj04dUM | 철권 | 1 | 20 |
29 | 2021-10-01 | https://www.youtube.com/channel/UCkinYTS9IHqOEwR1Sze2JTw | https://www.youtube.com/watch?v=cfbxHYUVHtA | 법무 | 1 | 18 |