Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory54.4 B

Variable types

Categorical4
Text2

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/a10ca9b9-07bf-443f-b199-11ed58405c22

Alerts

해시태그수집일자 has constant value ""Constant
해시태그채널ID is highly overall correlated with 해시태그최근지수High correlation
해시태그최근지수 is highly overall correlated with 해시태그채널IDHigh correlation
해시태그빈도수 is highly imbalanced (78.9%)Imbalance

Reproduction

Analysis started2023-12-10 14:22:13.766335
Analysis finished2023-12-10 14:22:14.275935
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시태그수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2020-12-10
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-12-10
2nd row2020-12-10
3rd row2020-12-10
4th row2020-12-10
5th row2020-12-10

Common Values

ValueCountFrequency (%)
2020-12-10 30
100.0%

Length

2023-12-10T23:22:14.379374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:22:14.517620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-12-10 30
100.0%

해시태그채널ID
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw
17 
https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ
13 

Length

Max length56
Median length56
Mean length56
Min length56

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw
2nd rowhttps://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw
3rd rowhttps://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ
4th rowhttps://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw
5th rowhttps://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw

Common Values

ValueCountFrequency (%)
https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSw 17
56.7%
https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ 13
43.3%

Length

2023-12-10T23:22:14.626845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:22:14.736742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/channel/ucecuyraqhrvumrdod6fgksw 17
56.7%
https://www.youtube.com/channel/uczbyyvm2ux-iqwzh0oxp-mq 13
43.3%
Distinct27
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:22:15.095009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters68
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)80.0%

Sample

1st rowhttps://www.youtube.com/watch?v=cf36OBX0AYY
2nd rowhttps://www.youtube.com/watch?v=bW4FAdx6i18
3rd rowhttps://www.youtube.com/watch?v=iPc5D9zOld0
4th rowhttps://www.youtube.com/watch?v=gqKSuT_we7s
5th rowhttps://www.youtube.com/watch?v=fC0uHEMAK0U
ValueCountFrequency (%)
https://www.youtube.com/watch?v=jfofxq4d-mc 2
 
6.7%
https://www.youtube.com/watch?v=jftnqmlkaje 2
 
6.7%
https://www.youtube.com/watch?v=ijwihxkafj0 2
 
6.7%
https://www.youtube.com/watch?v=ikw2kl7joom 1
 
3.3%
https://www.youtube.com/watch?v=cf36obx0ayy 1
 
3.3%
https://www.youtube.com/watch?v=jc6a9vbaigy 1
 
3.3%
https://www.youtube.com/watch?v=nvs5jvjqwjk 1
 
3.3%
https://www.youtube.com/watch?v=ohfzb2xogwm 1
 
3.3%
https://www.youtube.com/watch?v=msyhyrhe2sg 1
 
3.3%
https://www.youtube.com/watch?v=m4yfdj8ghra 1
 
3.3%
Other values (17) 17
56.7%
2023-12-10T23:22:15.626161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 127
 
9.8%
t 124
 
9.6%
/ 90
 
7.0%
o 70
 
5.4%
c 67
 
5.2%
h 66
 
5.1%
u 66
 
5.1%
. 60
 
4.7%
m 38
 
2.9%
s 35
 
2.7%
Other values (58) 547
42.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 872
67.6%
Other Punctuation 210
 
16.3%
Uppercase Letter 126
 
9.8%
Decimal Number 46
 
3.6%
Math Symbol 30
 
2.3%
Connector Punctuation 4
 
0.3%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 127
14.6%
t 124
14.2%
o 70
 
8.0%
c 67
 
7.7%
h 66
 
7.6%
u 66
 
7.6%
m 38
 
4.4%
s 35
 
4.0%
e 34
 
3.9%
b 33
 
3.8%
Other values (16) 212
24.3%
Uppercase Letter
ValueCountFrequency (%)
A 15
 
11.9%
X 11
 
8.7%
Y 7
 
5.6%
F 7
 
5.6%
D 6
 
4.8%
B 6
 
4.8%
E 6
 
4.8%
M 5
 
4.0%
H 5
 
4.0%
S 5
 
4.0%
Other values (15) 53
42.1%
Decimal Number
ValueCountFrequency (%)
0 12
26.1%
4 7
15.2%
2 5
10.9%
9 5
10.9%
6 5
10.9%
5 4
 
8.7%
8 3
 
6.5%
7 2
 
4.3%
1 2
 
4.3%
3 1
 
2.2%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
: 30
 
14.3%
? 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 998
77.4%
Common 292
 
22.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 127
 
12.7%
t 124
 
12.4%
o 70
 
7.0%
c 67
 
6.7%
h 66
 
6.6%
u 66
 
6.6%
m 38
 
3.8%
s 35
 
3.5%
e 34
 
3.4%
b 33
 
3.3%
Other values (41) 338
33.9%
Common
ValueCountFrequency (%)
/ 90
30.8%
. 60
20.5%
: 30
 
10.3%
= 30
 
10.3%
? 30
 
10.3%
0 12
 
4.1%
4 7
 
2.4%
2 5
 
1.7%
9 5
 
1.7%
6 5
 
1.7%
Other values (7) 18
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 127
 
9.8%
t 124
 
9.6%
/ 90
 
7.0%
o 70
 
5.4%
c 67
 
5.2%
h 66
 
5.1%
u 66
 
5.1%
. 60
 
4.7%
m 38
 
2.9%
s 35
 
2.7%
Other values (58) 547
42.4%
Distinct21
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:22:15.858841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length2.7666667
Min length1

Characters and Unicode

Total characters83
Distinct characters41
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)56.7%

Sample

1st row나이츠
2nd row표창
3rd row큰고래
4th row게임
5th row포켓몬스터
ValueCountFrequency (%)
나이츠 7
23.3%
서바이벌 2
 
6.7%
모바일게임 2
 
6.7%
세븐 2
 
6.7%
초딩 1
 
3.3%
1
 
3.3%
몬스터 1
 
3.3%
범프 1
 
3.3%
엣지 1
 
3.3%
서양 1
 
3.3%
Other values (11) 11
36.7%
2023-12-10T23:22:16.307872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
10.8%
7
 
8.4%
7
 
8.4%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
2
 
2.4%
Other values (31) 38
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
10.8%
7
 
8.4%
7
 
8.4%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
2
 
2.4%
Other values (31) 38
45.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
10.8%
7
 
8.4%
7
 
8.4%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
2
 
2.4%
Other values (31) 38
45.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
10.8%
7
 
8.4%
7
 
8.4%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
2
 
2.4%
Other values (31) 38
45.8%

해시태그빈도수
Categorical

IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
1
29 
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 29
96.7%
3 1
 
3.3%

Length

2023-12-10T23:22:16.468189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:22:16.570286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 29
96.7%
3 1
 
3.3%

해시태그최근지수
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
20
18 
18
12 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row18
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 18
60.0%
18 12
40.0%

Length

2023-12-10T23:22:16.684811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:22:16.797291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 18
60.0%
18 12
40.0%

Correlations

2023-12-10T23:22:16.872101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
해시태그채널ID1.0001.0001.0000.0000.976
해시태그영상ID1.0001.0000.0000.0000.778
해시태그추출명사명1.0000.0001.0001.0001.000
해시태그빈도수0.0000.0001.0001.0000.000
해시태그최근지수0.9760.7781.0000.0001.000
2023-12-10T23:22:16.996448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그최근지수해시태그빈도수해시태그채널ID
해시태그최근지수1.0000.0000.860
해시태그빈도수0.0001.0000.000
해시태그채널ID0.8600.0001.000
2023-12-10T23:22:17.120832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그빈도수해시태그최근지수
해시태그채널ID1.0000.0000.860
해시태그빈도수0.0001.0000.000
해시태그최근지수0.8600.0001.000

Missing values

2023-12-10T23:22:14.080480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:22:14.207138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
02020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=cf36OBX0AYY나이츠120
12020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=bW4FAdx6i18표창120
22020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=iPc5D9zOld0큰고래118
32020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=gqKSuT_we7s게임120
42020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=fC0uHEMAK0U포켓몬스터120
52020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=idw8DzbLfo4효근118
62020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=hAPGLQXpov4나이츠120
72020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=ie0xu0ucU2A팔사118
82020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=hMx4YufE_Fo모바일게임120
92020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=if5lVXgJHEM동양118
해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
202020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=iut2WDSBwTc서양인118
212020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=m4Yfdj8GHRA나이츠120
222020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=jFTNqmLkAjE서양118
232020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=msyHYrhE2Sg세븐120
242020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=jFTNqmLkAjE엣지118
252020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=oHfzB2XogWM모바일게임120
262020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=nVS5JVJqWJk나이츠120
272020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=jFofXQ4D-mc범프118
282020-12-10https://www.youtube.com/channel/UCEcuyrAqhRVUmRdOd6FGKSwhttps://www.youtube.com/watch?v=opPXqe_KzA0몬스터120
292020-12-10https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=jFofXQ4D-mc비디오118