Overview

Dataset statistics

Number of variables: 6
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 54.4 B
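
The average record size is the total in-memory size divided by the number of observations. The sketch below checks that arithmetic. The exact byte total is not given in the report; 1632 B is an assumed value, inferred as 54.4 B × 30 rows, and it rounds to the reported 1.6 KiB.

```python
# Assumption: the exact total is inferred as 54.4 B x 30 rows = 1632 B.
# The report itself only shows the rounded figure, 1.6 KiB.
total_bytes = 1632
n_observations = 30

average_record_bytes = total_bytes / n_observations
total_kib = total_bytes / 1024

print(f"{average_record_bytes:.1f} B per record")
print(f"{total_kib:.1f} KiB in total")
```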

Variable types

DateTime: 1
Categorical: 3
Text: 2

Dataset

Description: Sample data
Author: Hanyang University (한양대)
URL: https://bigdata-region.kr/#/dataset/80a4842e-deee-45fd-85eb-74d3699ca208

Alerts

해시태그수집일자 has constant value "" [Constant]
해시태그빈도수 is highly imbalanced (64.6%) [Imbalance]
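
The 64.6% imbalance figure can be reproduced from the value counts reported for 해시태그빈도수 (27, 2 and 1 occurrences of the values 1, 2 and 3). The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the logarithm of the number of classes. That formula is an assumption, and the function name `imbalance_score` is hypothetical, but the result matches the reported value.

```python
import math
from collections import Counter


def imbalance_score(values):
    """Return 1 - H / log(k), where H is the Shannon entropy of the
    class frequencies and k is the number of distinct classes.
    A uniform column scores 0; a heavily skewed one approaches 1."""
    counts = Counter(values)
    k = len(counts)
    if k < 2:
        # Assumption: with a single class the ratio is undefined; use 0 here.
        return 0.0
    n = sum(counts.values())
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return 1.0 - entropy / math.log(k)


# Value counts taken from the Common Values table of 해시태그빈도수.
column = ["1"] * 27 + ["2"] * 2 + ["3"] * 1
score = imbalance_score(column)
print(f"{score:.1%}")
```

A perfectly balanced column, for example `["a", "b"]`, scores 0 under this definition.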

Reproduction

Analysis started: 2023-12-10 13:46:36.762921
Analysis finished: 2023-12-10 13:46:37.529810
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

해시태그수집일자 (hashtag collection date)
DateTime

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2020-11-01 00:00:00
Maximum: 2020-11-01 00:00:00
Histogram with fixed size bins (bins=1)

해시태그채널ID (hashtag channel ID)
Categorical

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg: 17
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ: 13

Length

Max length: 56
Median length: 56
Mean length: 56
Min length: 56

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg
2nd row: https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg
3rd row: https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ
4th row: https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg
5th row: https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg

Common Values

Value | Count | Frequency (%)
https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | 17 | 56.7%
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | 13 | 43.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
https://www.youtube.com/channel/uccl8poejcc6dwackyzw_eqg | 17 | 56.7%
https://www.youtube.com/channel/ucfl1scaksd6_7jizwwhcwjq | 13 | 43.3%

해시태그영상ID (hashtag video ID)
Text

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 43
Median length: 43
Mean length: 43
Min length: 43

Characters and Unicode

Total characters: 1290
Distinct characters: 69
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
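
As an illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the 1st-row value from the Sample section as input. The helper name `category_counts` is hypothetical, and the standard library reports two-letter codes (for example `Ll` for Lowercase Letter, `Po` for Other Punctuation) rather than the long names used in this report. Script and block properties are not available in `unicodedata` and are left out.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# 1st-row value of the video URL column, as listed under Sample.
sample = "https://www.youtube.com/watch?v=vpv1k45SuSg"
counts = category_counts(sample)

# Print categories from most to least frequent.
for code, n in counts.most_common():
    print(code, n)
```

Each of the 30 rows has the same URL prefix, which is why Other Punctuation (7 per row) and Math Symbol (1 per row) total 210 and 30 in the tables that follow.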

Unique

Unique: 24
Unique (%): 80.0%
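
Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. The sketch below shows the distinction on a hypothetical 30-row column built to mirror the reported counts (24 values seen once, 3 values seen twice). The values and the helper name `distinct_and_unique` are illustrative, not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique, distinct %, unique %) for a column.

    distinct: number of different values
    unique:   number of values that occur exactly once
    """
    counts = Counter(values)
    n = len(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique, 100 * distinct / n, 100 * unique / n


# Hypothetical column: 24 values appear once, 3 values appear twice (30 rows).
values = [f"video_{i}" for i in range(24)] + [f"repeat_{i}" for i in range(3)] * 2
distinct, unique, distinct_pct, unique_pct = distinct_and_unique(values)
print(distinct, unique, f"{distinct_pct:.1f}%", f"{unique_pct:.1f}%")
```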

Sample

1st row: https://www.youtube.com/watch?v=vpv1k45SuSg
2nd row: https://www.youtube.com/watch?v=vVLeDvKJ4is
3rd row: https://www.youtube.com/watch?v=zjRwVyQB5os
4th row: https://www.youtube.com/watch?v=w2mfqaAxdCU
5th row: https://www.youtube.com/watch?v=w2_1gb3qoho

Value | Count | Frequency (%)
https://www.youtube.com/watch?v=zo5p5oxqxg4 | 2 | 6.7%
https://www.youtube.com/watch?v=znviox8ceis | 2 | 6.7%
https://www.youtube.com/watch?v=zmsm38xl16m | 2 | 6.7%
https://www.youtube.com/watch?v=wzwzrxylbni | 1 | 3.3%
https://www.youtube.com/watch?v=vpv1k45susg | 1 | 3.3%
https://www.youtube.com/watch?v=wjqx8nhfoe0 | 1 | 3.3%
https://www.youtube.com/watch?v=x_jrsgklzba | 1 | 3.3%
https://www.youtube.com/watch?v=zopz3y-wz0w | 1 | 3.3%
https://www.youtube.com/watch?v=xz7fjo_ldfa | 1 | 3.3%
https://www.youtube.com/watch?v=xzcdemoxics | 1 | 3.3%
Other values (17) | 17 | 56.7%

Most occurring characters

Value | Count | Frequency (%)
w | 134 | 10.4%
t | 120 | 9.3%
/ | 90 | 7.0%
o | 69 | 5.3%
c | 66 | 5.1%
h | 64 | 5.0%
u | 62 | 4.8%
. | 60 | 4.7%
s | 37 | 2.9%
y | 36 | 2.8%
Other values (59) | 552 | 42.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 873 | 67.7%
Other Punctuation | 210 | 16.3%
Uppercase Letter | 121 | 9.4%
Decimal Number | 48 | 3.7%
Math Symbol | 30 | 2.3%
Connector Punctuation | 6 | 0.5%
Dash Punctuation | 2 | 0.2%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
w | 134 | 15.3%
t | 120 | 13.7%
o | 69 | 7.9%
c | 66 | 7.6%
h | 64 | 7.3%
u | 62 | 7.1%
s | 37 | 4.2%
y | 36 | 4.1%
p | 36 | 4.1%
m | 36 | 4.1%
Other values (16) | 213 | 24.4%

Uppercase Letter
Value | Count | Frequency (%)
I | 12 | 9.9%
Z | 10 | 8.3%
O | 8 | 6.6%
S | 8 | 6.6%
X | 8 | 6.6%
Q | 7 | 5.8%
P | 6 | 5.0%
M | 6 | 5.0%
Y | 5 | 4.1%
B | 5 | 4.1%
Other values (16) | 46 | 38.0%

Decimal Number
Value | Count | Frequency (%)
5 | 8 | 16.7%
4 | 7 | 14.6%
3 | 6 | 12.5%
8 | 5 | 10.4%
2 | 5 | 10.4%
0 | 4 | 8.3%
6 | 4 | 8.3%
1 | 4 | 8.3%
7 | 3 | 6.2%
9 | 2 | 4.2%

Other Punctuation
Value | Count | Frequency (%)
/ | 90 | 42.9%
. | 60 | 28.6%
: | 30 | 14.3%
? | 30 | 14.3%

Math Symbol
Value | Count | Frequency (%)
= | 30 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 6 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 994 | 77.1%
Common | 296 | 22.9%

Most frequent character per script

Latin
Value | Count | Frequency (%)
w | 134 | 13.5%
t | 120 | 12.1%
o | 69 | 6.9%
c | 66 | 6.6%
h | 64 | 6.4%
u | 62 | 6.2%
s | 37 | 3.7%
y | 36 | 3.6%
p | 36 | 3.6%
m | 36 | 3.6%
Other values (42) | 334 | 33.6%

Common
Value | Count | Frequency (%)
/ | 90 | 30.4%
. | 60 | 20.3%
: | 30 | 10.1%
? | 30 | 10.1%
= | 30 | 10.1%
5 | 8 | 2.7%
4 | 7 | 2.4%
_ | 6 | 2.0%
3 | 6 | 2.0%
8 | 5 | 1.7%
Other values (7) | 24 | 8.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1290 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
w | 134 | 10.4%
t | 120 | 9.3%
/ | 90 | 7.0%
o | 69 | 5.3%
c | 66 | 5.1%
h | 64 | 5.0%
u | 62 | 4.8%
. | 60 | 4.7%
s | 37 | 2.9%
y | 36 | 2.8%
Other values (59) | 552 | 42.8%

해시태그추출명사명 (hashtag extracted noun)
Text

Distinct: 26
Distinct (%): 86.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 3
Median length: 2
Mean length: 1.9666667
Min length: 1

Characters and Unicode

Total characters: 59
Distinct characters: 49
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 76.7%

Sample

1st row: 영상
2nd row: 콜로
3rd row: 장르
4th row: 오브
5th row: 피자

Value | Count | Frequency (%)
영상 | 3 | 10.0%
오브 | 2 | 6.7%
[not shown] | 2 | 6.7%
토니 | 1 | 3.3%
해물 | 1 | 3.3%
수호 | 1 | 3.3%
튜버 | 1 | 3.3%
포식자 | 1 | 3.3%
[not shown] | 1 | 3.3%
배틀 | 1 | 3.3%
Other values (16) | 16 | 53.3%

Most occurring characters

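Value | Count | Frequency (%)
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
Other values (39) | 39 | 66.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 59 | 100.0%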

Most frequent character per category

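Other Letter
Value | Count | Frequency (%)
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
Other values (39) | 39 | 66.1%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 59 | 100.0%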

Most frequent character per script

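Hangul
Value | Count | Frequency (%)
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
Other values (39) | 39 | 66.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 59 | 100.0%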

Most frequent character per block

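Hangul
Value | Count | Frequency (%)
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 3 | 5.1%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 2 | 3.4%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
[not shown] | 1 | 1.7%
Other values (39) | 39 | 66.1%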

해시태그빈도수 (hashtag frequency)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
1: 27
2: 2
3: 1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 1
Unique (%): 3.3%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 27 | 90.0%
2 | 2 | 6.7%
3 | 1 | 3.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 27 | 90.0%
2 | 2 | 6.7%
3 | 1 | 3.3%

해시태그최근지수 (hashtag recency index)
Categorical

Distinct: 3
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
19: 22
20: 6
18: 2

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 19
2nd row: 19
3rd row: 19
4th row: 20
5th row: 19

Common Values

Value | Count | Frequency (%)
19 | 22 | 73.3%
20 | 6 | 20.0%
18 | 2 | 6.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
19 | 22 | 73.3%
20 | 6 | 20.0%
18 | 2 | 6.7%

Correlations

Variable | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
해시태그채널ID | 1.000 | 1.000 | 1.000 | 0.000 | 0.000
해시태그영상ID | 1.000 | 1.000 | 0.800 | 1.000 | 1.000
해시태그추출명사명 | 1.000 | 0.800 | 1.000 | 0.000 | 0.801
해시태그빈도수 | 0.000 | 1.000 | 0.000 | 1.000 | 0.340
해시태그최근지수 | 0.000 | 1.000 | 0.801 | 0.340 | 1.000

Variable | 해시태그채널ID | 해시태그빈도수 | 해시태그최근지수
해시태그채널ID | 1.000 | 0.000 | 0.000
해시태그빈도수 | 0.000 | 1.000 | 0.107
해시태그최근지수 | 0.000 | 0.107 | 1.000

Variable | 해시태그채널ID | 해시태그빈도수 | 해시태그최근지수
해시태그채널ID | 1.000 | 0.000 | 0.000
해시태그빈도수 | 0.000 | 1.000 | 0.107
해시태그최근지수 | 0.000 | 0.107 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
0 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=vpv1k45SuSg | 영상 | 1 | 19
1 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=vVLeDvKJ4is | 콜로 | 1 | 19
2 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zjRwVyQB5os | 장르 | 1 | 19
3 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=w2mfqaAxdCU | 오브 | 1 | 20
4 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=w2_1gb3qoho | 피자 | 1 | 19
5 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zlB_z3Y2ZUI | 퀴즈쇼 | 1 | 19
6 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=wOcfWIpnNqs | [not shown] | 1 | 19
7 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zlkSEeXcESw | 허양임 | 1 | 19
8 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=wSbuXywhj7U | 강의 | 1 | 19
9 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zm0A2B4rjXo | 생활 | 1 | 19

Last rows

Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
20 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=znviOX8ceIs | 우주 | 1 | 19
21 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=x9HfCcQqrDM | 마이 | 1 | 19
22 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zo5P5OxQxG4 | 배틀 | 1 | 20
23 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xIIZf0qT9ds | [not shown] | 1 | 20
24 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zo5P5OxQxG4 | [not shown] | 1 | 20
25 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xZCDemoxIcs | 포식자 | 1 | 19
26 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=xZ7fJO_lDFA | 튜버 | 1 | 19
27 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zopZ3Y-WZ0w | 수호 | 2 | 19
28 | 2020-11-01 | https://www.youtube.com/channel/UCcL8PoeJCC6DwAcKyzw_eQg | https://www.youtube.com/watch?v=x_jRSgKlZBA | 오브 | 1 | 19
29 | 2020-11-01 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=zpI_n5RLNPE | 전현무 | 1 | 18