Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory54.4 B

Variable types

Categorical4
Text2

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/1e740658-fca2-4980-bcc5-e19a86a956a3

Alerts

해시태그수집일자 has constant value ""Constant
해시태그빈도수 is highly imbalanced (73.5%)Imbalance

Reproduction

Analysis started2023-12-10 14:12:39.933178
Analysis finished2023-12-10 14:12:40.620964
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시태그수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2021-07-01
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-07-01
2nd row2021-07-01
3rd row2021-07-01
4th row2021-07-01
5th row2021-07-01

Common Values

ValueCountFrequency (%)
2021-07-01 30
100.0%

Length

2023-12-10T23:12:40.719693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:12:40.895051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-07-01 30
100.0%
Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g
17 
https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQ
13 

Length

Max length56
Median length56
Mean length56
Min length56

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g
2nd rowhttps://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g
3rd rowhttps://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQ
4th rowhttps://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g
5th rowhttps://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g

Common Values

ValueCountFrequency (%)
https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7g 17
56.7%
https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQ 13
43.3%

Length

2023-12-10T23:12:41.052139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:12:41.292413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/channel/ucazs_xwau1ybmnoqdd1mr7g 17
56.7%
https://www.youtube.com/channel/ucfctzjtujhe18k8ixwmxtyq 13
43.3%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:12:41.697620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters69
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st rowhttps://www.youtube.com/watch?v=W1ZU0fBwV0w
2nd rowhttps://www.youtube.com/watch?v=W1-vSCeEclc
3rd rowhttps://www.youtube.com/watch?v=li8Q5-Ig470
4th rowhttps://www.youtube.com/watch?v=W1tD54KCfOE
5th rowhttps://www.youtube.com/watch?v=W1nTKZqqgJM
ValueCountFrequency (%)
https://www.youtube.com/watch?v=w6xcka9idsu 2
 
6.7%
https://www.youtube.com/watch?v=w1zu0fbwv0w 1
 
3.3%
https://www.youtube.com/watch?v=loj4dggtszi 1
 
3.3%
https://www.youtube.com/watch?v=wax4lhnk1zi 1
 
3.3%
https://www.youtube.com/watch?v=lu9cqhqwm7o 1
 
3.3%
https://www.youtube.com/watch?v=wa35-xh_zvk 1
 
3.3%
https://www.youtube.com/watch?v=wamsh9rlbbk 1
 
3.3%
https://www.youtube.com/watch?v=lt__-opxmxm 1
 
3.3%
https://www.youtube.com/watch?v=w9p8swkbo0a 1
 
3.3%
https://www.youtube.com/watch?v=lst3-tzs6oy 1
 
3.3%
Other values (19) 19
63.3%
2023-12-10T23:12:42.525662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 128
 
9.9%
t 126
 
9.8%
/ 90
 
7.0%
o 66
 
5.1%
c 66
 
5.1%
u 65
 
5.0%
h 64
 
5.0%
. 60
 
4.7%
s 36
 
2.8%
b 35
 
2.7%
Other values (59) 554
42.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 846
65.6%
Other Punctuation 210
 
16.3%
Uppercase Letter 130
 
10.1%
Decimal Number 61
 
4.7%
Math Symbol 30
 
2.3%
Dash Punctuation 8
 
0.6%
Connector Punctuation 5
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 128
15.1%
t 126
14.9%
o 66
 
7.8%
c 66
 
7.8%
u 65
 
7.7%
h 64
 
7.6%
s 36
 
4.3%
b 35
 
4.1%
v 34
 
4.0%
y 33
 
3.9%
Other values (16) 193
22.8%
Uppercase Letter
ValueCountFrequency (%)
W 21
16.2%
A 10
 
7.7%
K 9
 
6.9%
E 8
 
6.2%
S 7
 
5.4%
U 6
 
4.6%
M 6
 
4.6%
I 5
 
3.8%
P 5
 
3.8%
L 4
 
3.1%
Other values (16) 49
37.7%
Decimal Number
ValueCountFrequency (%)
5 9
14.8%
9 8
13.1%
4 7
11.5%
8 7
11.5%
0 7
11.5%
6 6
9.8%
1 5
8.2%
3 4
6.6%
2 4
6.6%
7 4
6.6%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
: 30
 
14.3%
? 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 976
75.7%
Common 314
 
24.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 128
 
13.1%
t 126
 
12.9%
o 66
 
6.8%
c 66
 
6.8%
u 65
 
6.7%
h 64
 
6.6%
s 36
 
3.7%
b 35
 
3.6%
v 34
 
3.5%
y 33
 
3.4%
Other values (42) 323
33.1%
Common
ValueCountFrequency (%)
/ 90
28.7%
. 60
19.1%
: 30
 
9.6%
? 30
 
9.6%
= 30
 
9.6%
5 9
 
2.9%
9 8
 
2.5%
- 8
 
2.5%
4 7
 
2.2%
8 7
 
2.2%
Other values (7) 35
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 128
 
9.9%
t 126
 
9.8%
/ 90
 
7.0%
o 66
 
5.1%
c 66
 
5.1%
u 65
 
5.0%
h 64
 
5.0%
. 60
 
4.7%
s 36
 
2.8%
b 35
 
2.7%
Other values (59) 554
42.9%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:12:42.900346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length1.9333333
Min length1

Characters and Unicode

Total characters58
Distinct characters52
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row월세
2nd row정씨
3rd row왕건
4th row
5th row
ValueCountFrequency (%)
김치 2
 
6.7%
월세 1
 
3.3%
1
 
3.3%
보호 1
 
3.3%
출근 1
 
3.3%
구조 1
 
3.3%
다이어트 1
 
3.3%
1
 
3.3%
탈라스 1
 
3.3%
실형 1
 
3.3%
Other values (19) 19
63.3%
2023-12-10T23:12:43.553075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3
 
5.2%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (42) 42
72.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
5.2%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (42) 42
72.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
5.2%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (42) 42
72.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3
 
5.2%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (42) 42
72.4%

해시태그빈도수
Categorical

IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
1
28 
3
 
1
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)6.7%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 28
93.3%
3 1
 
3.3%
2 1
 
3.3%

Length

2023-12-10T23:12:43.764338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:12:43.986353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 28
93.3%
3 1
 
3.3%
2 1
 
3.3%
Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
19
16 
18
20
17
16
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row16
2nd row19
3rd row19
4th row19
5th row19

Common Values

ValueCountFrequency (%)
19 16
53.3%
18 7
23.3%
20 4
 
13.3%
17 2
 
6.7%
16 1
 
3.3%

Length

2023-12-10T23:12:44.241139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:12:44.491209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
19 16
53.3%
18 7
23.3%
20 4
 
13.3%
17 2
 
6.7%
16 1
 
3.3%

Correlations

2023-12-10T23:12:44.730137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
해시태그채널ID1.0001.0000.0000.0000.000
해시태그영상ID1.0001.0000.9901.0001.000
해시태그추출명사명0.0000.9901.0001.0001.000
해시태그빈도수0.0001.0001.0001.0000.000
해시태그최근지수0.0001.0001.0000.0001.000
2023-12-10T23:12:45.011084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그최근지수해시태그빈도수해시태그채널ID
해시태그최근지수1.0000.0000.000
해시태그빈도수0.0001.0000.000
해시태그채널ID0.0000.0001.000
2023-12-10T23:12:45.169781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그빈도수해시태그최근지수
해시태그채널ID1.0000.0000.000
해시태그빈도수0.0001.0000.000
해시태그최근지수0.0000.0001.000

Missing values

2023-12-10T23:12:40.395405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:12:40.558312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
02021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W1ZU0fBwV0w월세116
12021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W1-vSCeEclc정씨119
22021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=li8Q5-Ig470왕건119
32021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W1tD54KCfOE119
42021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W1nTKZqqgJM119
52021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=ljY-EdETyxk우주119
62021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W3HaoEAE9QA본색118
72021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=ljfESkQTgRY김반장119
82021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W4P8nOLtay0120
92021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=lkN2f5mKmWs119
해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
202021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=ls6tGDwW-Gs상미118
212021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W9obj0I9YkA실형219
222021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=lst3-tzS6OY탈라스120
232021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=W9p8swKBo0A119
242021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=lt__-oPxMXM김치119
252021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=WAmsH9rLBbk다이어트118
262021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=WA35-xH_Zvk구조117
272021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=lu9cqhQWM7o출근120
282021-07-01https://www.youtube.com/channel/UCaZS_XwAu1yBMNoQDD1mR7ghttps://www.youtube.com/watch?v=WAx4LhnK1zI보호119
292021-07-01https://www.youtube.com/channel/UCFCtZJTuJhE18k8IXwmXTYQhttps://www.youtube.com/watch?v=luUWVrCf6L4전화119