Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory54.4 B

Variable types

Categorical4
Text2

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/29a22897-3912-414f-8a5a-1ab2c7a2dd62

Alerts

해시태그수집일자 has constant value ""Constant
해시태그빈도수 is highly imbalanced (57.4%)Imbalance

Reproduction

Analysis started2023-12-10 14:14:28.290102
Analysis finished2023-12-10 14:14:29.532588
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시태그수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2021-08-01
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-08-01
2nd row2021-08-01
3rd row2021-08-01
4th row2021-08-01
5th row2021-08-01

Common Values

ValueCountFrequency (%)
2021-08-01 30
100.0%

Length

2023-12-10T23:14:29.658077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:14:29.828627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-08-01 30
100.0%
Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
17 
https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQ
https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1w

Length

Max length56
Median length56
Mean length56
Min length56

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
2nd rowhttps://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
3rd rowhttps://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1w
4th rowhttps://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
5th rowhttps://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw

Common Values

ValueCountFrequency (%)
https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw 17
56.7%
https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQ 7
23.3%
https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1w 6
 
20.0%

Length

2023-12-10T23:14:30.055688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:14:30.368455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/channel/uc-4cn_iv6nbpef0ntbq3cmw 17
56.7%
https://www.youtube.com/channel/ucjundlkbjldzl60kffkm1vq 7
23.3%
https://www.youtube.com/channel/ucjlqst3nauar-nrkr1zve1w 6
 
20.0%
Distinct26
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:14:30.775754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters68
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)76.7%

Sample

1st rowhttps://www.youtube.com/watch?v=2gd-8motICo
2nd rowhttps://www.youtube.com/watch?v=1t3mPGGVb20
3rd rowhttps://www.youtube.com/watch?v=F-J5_EUjwIk
4th rowhttps://www.youtube.com/watch?v=4nOoQFYPEek
5th rowhttps://www.youtube.com/watch?v=2gd-8motICo
ValueCountFrequency (%)
https://www.youtube.com/watch?v=5upu0p1isxw 3
 
10.0%
https://www.youtube.com/watch?v=2gd-8motico 2
 
6.7%
https://www.youtube.com/watch?v=-wusuq-ixqa 2
 
6.7%
https://www.youtube.com/watch?v=-w73mypt20k 1
 
3.3%
https://www.youtube.com/watch?v=htk4easmujk 1
 
3.3%
https://www.youtube.com/watch?v=wgdkxdmkymw 1
 
3.3%
https://www.youtube.com/watch?v=1y_fg34uewm 1
 
3.3%
https://www.youtube.com/watch?v=lmpnzpwfhp4 1
 
3.3%
https://www.youtube.com/watch?v=uytzxdfpgdq 1
 
3.3%
https://www.youtube.com/watch?v=0zployercre 1
 
3.3%
Other values (16) 16
53.3%
2023-12-10T23:14:31.497966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 129
 
10.0%
t 127
 
9.8%
/ 90
 
7.0%
o 68
 
5.3%
c 65
 
5.0%
h 61
 
4.7%
u 61
 
4.7%
. 60
 
4.7%
p 41
 
3.2%
m 39
 
3.0%
Other values (58) 549
42.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 845
65.5%
Other Punctuation 210
 
16.3%
Uppercase Letter 135
 
10.5%
Decimal Number 54
 
4.2%
Math Symbol 30
 
2.3%
Dash Punctuation 12
 
0.9%
Connector Punctuation 4
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 129
15.3%
t 127
15.0%
o 68
 
8.0%
c 65
 
7.7%
h 61
 
7.2%
u 61
 
7.2%
p 41
 
4.9%
m 39
 
4.6%
e 35
 
4.1%
s 35
 
4.1%
Other values (16) 184
21.8%
Uppercase Letter
ValueCountFrequency (%)
U 17
 
12.6%
I 8
 
5.9%
F 8
 
5.9%
A 8
 
5.9%
E 7
 
5.2%
Y 6
 
4.4%
M 6
 
4.4%
N 6
 
4.4%
C 6
 
4.4%
W 6
 
4.4%
Other values (15) 57
42.2%
Decimal Number
ValueCountFrequency (%)
0 8
14.8%
4 7
13.0%
2 7
13.0%
5 7
13.0%
1 7
13.0%
8 6
11.1%
6 5
9.3%
3 4
7.4%
7 2
 
3.7%
9 1
 
1.9%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
: 30
 
14.3%
? 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 980
76.0%
Common 310
 
24.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 129
13.2%
t 127
 
13.0%
o 68
 
6.9%
c 65
 
6.6%
h 61
 
6.2%
u 61
 
6.2%
p 41
 
4.2%
m 39
 
4.0%
e 35
 
3.6%
s 35
 
3.6%
Other values (41) 319
32.6%
Common
ValueCountFrequency (%)
/ 90
29.0%
. 60
19.4%
: 30
 
9.7%
? 30
 
9.7%
= 30
 
9.7%
- 12
 
3.9%
0 8
 
2.6%
4 7
 
2.3%
2 7
 
2.3%
5 7
 
2.3%
Other values (7) 29
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 129
 
10.0%
t 127
 
9.8%
/ 90
 
7.0%
o 68
 
5.3%
c 65
 
5.0%
h 61
 
4.7%
u 61
 
4.7%
. 60
 
4.7%
p 41
 
3.2%
m 39
 
3.0%
Other values (58) 549
42.6%
Distinct28
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:14:31.946961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length1.9666667
Min length1

Characters and Unicode

Total characters59
Distinct characters46
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)86.7%

Sample

1st row직합
2nd row
3rd row정책
4th row
5th row행정
ValueCountFrequency (%)
행정 2
 
6.7%
영상 2
 
6.7%
직합 1
 
3.3%
공무원 1
 
3.3%
국가 1
 
3.3%
1
 
3.3%
수입 1
 
3.3%
합격 1
 
3.3%
걸즈 1
 
3.3%
1
 
3.3%
Other values (18) 18
60.0%
2023-12-10T23:14:32.559283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Other values (36) 36
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Other values (36) 36
61.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Other values (36) 36
61.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Other values (36) 36
61.0%

해시태그빈도수
Categorical

IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
1
26 
2
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 26
86.7%
2 3
 
10.0%
3 1
 
3.3%

Length

2023-12-10T23:14:32.807499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:14:33.008099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 26
86.7%
2 3
 
10.0%
3 1
 
3.3%
Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
19
14 
18
20
17

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row18
2nd row20
3rd row20
4th row19
5th row18

Common Values

ValueCountFrequency (%)
19 14
46.7%
18 9
30.0%
20 5
 
16.7%
17 2
 
6.7%

Length

2023-12-10T23:14:33.257843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:14:33.446471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
19 14
46.7%
18 9
30.0%
20 5
 
16.7%
17 2
 
6.7%

Correlations

2023-12-10T23:14:33.586709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
해시태그채널ID1.0001.0001.0000.2550.330
해시태그영상ID1.0001.0000.9661.0001.000
해시태그추출명사명1.0000.9661.0001.0000.786
해시태그빈도수0.2551.0001.0001.0000.364
해시태그최근지수0.3301.0000.7860.3641.000
2023-12-10T23:14:33.754103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그최근지수해시태그빈도수
해시태그채널ID1.0000.3070.066
해시태그최근지수0.3071.0000.343
해시태그빈도수0.0660.3431.000
2023-12-10T23:14:33.909684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그빈도수해시태그최근지수
해시태그채널ID1.0000.0660.307
해시태그빈도수0.0661.0000.343
해시태그최근지수0.3070.3431.000

Missing values

2023-12-10T23:14:29.022026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:14:29.365612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
02021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=2gd-8motICo직합118
12021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=1t3mPGGVb20120
22021-08-01https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1whttps://www.youtube.com/watch?v=F-J5_EUjwIk정책120
32021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=4nOoQFYPEek119
42021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=2gd-8motICo행정118
52021-08-01https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1whttps://www.youtube.com/watch?v=MHOFCKl2We4영상120
62021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=5UpU0p1isXw면접119
72021-08-01https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1whttps://www.youtube.com/watch?v=N-7WdF_eVCo개발118
82021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=5UpU0p1isXw영등포119
92021-08-01https://www.youtube.com/channel/UCJlqST3naUAR-nRkr1ZVE1whttps://www.youtube.com/watch?v=S3Kqn2cQ6SU영상119
해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
202021-08-01https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQhttps://www.youtube.com/watch?v=-WUSUq-ixqA자메이카218
212021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=Iy06m58zYRc320
222021-08-01https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQhttps://www.youtube.com/watch?v=0L15KsJky_I119
232021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=JLNINRNGx-A119
242021-08-01https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQhttps://www.youtube.com/watch?v=0ZPLoyercRE걸즈120
252021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=UYTZxdFPgdQ합격119
262021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=LmpNzpwfHp4수입119
272021-08-01https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQhttps://www.youtube.com/watch?v=1Y_FG34UEwM117
282021-08-01https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMwhttps://www.youtube.com/watch?v=WGDkXdMkYmw국가119
292021-08-01https://www.youtube.com/channel/UCJundLkbjldzL60kFfKm1vQhttps://www.youtube.com/watch?v=1gzqFnJRtNE끝말117