Overview

Dataset statistics

Number of variables: 6
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 54.4 B

Variable types

Categorical: 3
Text: 2
Numeric: 1

Dataset

Description: Sample data
Author: Hanyang University
URL: https://bigdata-region.kr/#/dataset/dde439f1-b116-43ff-93d5-992c9712eb5e

Alerts

해시태그수집일자 has constant value "2021-09-01" (Constant)

Reproduction

Analysis started: 2023-12-10 14:17:57.038413
Analysis finished: 2023-12-10 14:17:57.765165
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

해시태그수집일자 (hashtag collection date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-09-01
2nd row: 2021-09-01
3rd row: 2021-09-01
4th row: 2021-09-01
5th row: 2021-09-01

Common Values

Value | Count | Frequency (%)
2021-09-01 | 30 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

해시태그채널ID (hashtag channel ID)
Categorical

Distinct: 3
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 56
Median length: 56
Mean length: 56
Min length: 56

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
2nd row: https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
3rd row: https://www.youtube.com/channel/UCJa3a6Te2JncMHTwXYn5clw
4th row: https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw
5th row: https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw

Common Values

Value | Count | Frequency (%)
https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | 17 | 56.7%
https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | 10 | 33.3%
https://www.youtube.com/channel/UCJa3a6Te2JncMHTwXYn5clw | 3 | 10.0%
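
The counts and percentages in the Common Values table follow from tallying the column and dividing by the number of rows. A minimal Python sketch, assuming the 30 channel URLs are held in a list; the list here is rebuilt from the reported counts rather than read from the dataset, and `frequency_table` is a hypothetical helper name, not part of ydata-profiling:

```python
from collections import Counter


def frequency_table(values):
    """Return (value, count, percent) rows sorted by descending count."""
    total = len(values)
    counts = Counter(values)
    return [(v, c, round(100 * c / total, 1)) for v, c in counts.most_common()]


base = "https://www.youtube.com/channel/"
# Assumption: rebuilt from the counts reported above (17, 10 and 3 rows).
channels = (
    [base + "UC-4cn_Iv6NbpEF0Ntbq3cMw"] * 17
    + [base + "UCJh7YJLH8MaJQZxzMYS-gqg"] * 10
    + [base + "UCJa3a6Te2JncMHTwXYn5clw"] * 3
)

rows = frequency_table(channels)
for value, count, percent in rows:
    print(f"{value} | {count} | {percent}%")
```

With real data, the list would come from the 해시태그채널ID column instead of being reconstructed by hand.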

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

해시태그영상ID (hashtag video ID)
Text

Distinct: 26
Distinct (%): 86.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 43
Median length: 43
Mean length: 43
Min length: 43

Characters and Unicode

Total characters: 1290
Distinct characters: 68
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
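
As an illustration of these character properties, the general category of each code point can be looked up with Python's standard `unicodedata` module. The sketch below counts categories for one video URL taken from this variable's sample rows; `category_counts` is a hypothetical helper, and whether the report computes its figures this way is an assumption:

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Ll', 'Po')."""
    return Counter(unicodedata.category(ch) for ch in text)


url = "https://www.youtube.com/watch?v=8tYsYfwUJIk"  # a sample row of this variable
counts = category_counts(url)
print(len(url), dict(counts))
```

The two-letter codes correspond to the category names used in this report: `Ll` is Lowercase Letter, `Lu` is Uppercase Letter, `Nd` is Decimal Number, `Po` is Other Punctuation, `Sm` is Math Symbol, `Pc` is Connector Punctuation, and `Pd` is Dash Punctuation.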

Unique

Unique: 22
Unique (%): 73.3%

Sample

1st row: https://www.youtube.com/watch?v=8tYsYfwUJIk
2nd row: https://www.youtube.com/watch?v=4nOoQFYPEek
3rd row: https://www.youtube.com/watch?v=sGbkt4zZQsY
4th row: https://www.youtube.com/watch?v=Bg4x8FswW3w
5th row: https://www.youtube.com/watch?v=96Xi4tVFr5U

Value | Count | Frequency (%)
https://www.youtube.com/watch?v=kpegknmbdyc | 2 | 6.7%
https://www.youtube.com/watch?v=haociigegli | 2 | 6.7%
https://www.youtube.com/watch?v=zjrlq0bdjwc | 2 | 6.7%
https://www.youtube.com/watch?v=jlninrngx-a | 2 | 6.7%
https://www.youtube.com/watch?v=wfz4e1ksjmg | 1 | 3.3%
https://www.youtube.com/watch?v=8tysyfwujik | 1 | 3.3%
https://www.youtube.com/watch?v=nb54beoulem | 1 | 3.3%
https://www.youtube.com/watch?v=wqlvwtcru7o | 1 | 3.3%
https://www.youtube.com/watch?v=kk_hmjxg1ru | 1 | 3.3%
https://www.youtube.com/watch?v=t9s90dngxde | 1 | 3.3%
Other values (16) | 16 | 53.3%

Most occurring characters

Value | Count | Frequency (%)
w | 128 | 9.9%
t | 127 | 9.8%
/ | 90 | 7.0%
o | 68 | 5.3%
c | 67 | 5.2%
u | 63 | 4.9%
h | 62 | 4.8%
. | 60 | 4.7%
s | 44 | 3.4%
p | 35 | 2.7%
Other values (58) | 546 | 42.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 864 | 67.0%
Other Punctuation | 210 | 16.3%
Uppercase Letter | 137 | 10.6%
Decimal Number | 42 | 3.3%
Math Symbol | 30 | 2.3%
Connector Punctuation | 4 | 0.3%
Dash Punctuation | 3 | 0.2%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
w | 128 | 14.8%
t | 127 | 14.7%
o | 68 | 7.9%
c | 67 | 7.8%
u | 63 | 7.3%
h | 62 | 7.2%
s | 44 | 5.1%
p | 35 | 4.1%
e | 34 | 3.9%
m | 34 | 3.9%
Other values (16) | 202 | 23.4%

Uppercase Letter
Value | Count | Frequency (%)
E | 10 | 7.3%
N | 10 | 7.3%
G | 9 | 6.6%
I | 8 | 5.8%
U | 8 | 5.8%
Q | 8 | 5.8%
J | 7 | 5.1%
A | 7 | 5.1%
Y | 6 | 4.4%
B | 6 | 4.4%
Other values (15) | 58 | 42.3%

Decimal Number
Value | Count | Frequency (%)
9 | 7 | 16.7%
5 | 7 | 16.7%
4 | 7 | 16.7%
0 | 5 | 11.9%
8 | 5 | 11.9%
3 | 3 | 7.1%
7 | 3 | 7.1%
6 | 2 | 4.8%
1 | 2 | 4.8%
2 | 1 | 2.4%

Other Punctuation
Value | Count | Frequency (%)
/ | 90 | 42.9%
. | 60 | 28.6%
: | 30 | 14.3%
? | 30 | 14.3%

Math Symbol
Value | Count | Frequency (%)
= | 30 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 1001 | 77.6%
Common | 289 | 22.4%

Most frequent character per script

Latin
Value | Count | Frequency (%)
w | 128 | 12.8%
t | 127 | 12.7%
o | 68 | 6.8%
c | 67 | 6.7%
u | 63 | 6.3%
h | 62 | 6.2%
s | 44 | 4.4%
p | 35 | 3.5%
e | 34 | 3.4%
m | 34 | 3.4%
Other values (41) | 339 | 33.9%

Common
Value | Count | Frequency (%)
/ | 90 | 31.1%
. | 60 | 20.8%
: | 30 | 10.4%
? | 30 | 10.4%
= | 30 | 10.4%
9 | 7 | 2.4%
5 | 7 | 2.4%
4 | 7 | 2.4%
0 | 5 | 1.7%
8 | 5 | 1.7%
Other values (7) | 18 | 6.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1290 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
w | 128 | 9.9%
t | 127 | 9.8%
/ | 90 | 7.0%
o | 68 | 5.3%
c | 67 | 5.2%
u | 63 | 4.9%
h | 62 | 4.8%
. | 60 | 4.7%
s | 44 | 3.4%
p | 35 | 2.7%
Other values (58) | 546 | 42.3%

해시태그추출명사명 (hashtag extracted noun)
Text

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 3
Median length: 2
Mean length: 1.8333333
Min length: 1

Characters and Unicode

Total characters: 55
Distinct characters: 41
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
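
For the Korean nouns in this variable, the same properties can be inspected with Python's standard `unicodedata` module. The sketch below uses three nouns from this variable's sample rows. The `in_hangul_syllables` helper is hypothetical, and it checks the Hangul Syllables code point range (U+AC00 to U+D7A3) as a stand-in for the report's "Hangul" block, which is an assumption about how the report groups blocks:

```python
import unicodedata

nouns = ["공부", "서울시", "압해"]  # nouns taken from this variable's sample rows


def in_hangul_syllables(ch):
    """True if ch lies in the Hangul Syllables range U+AC00..U+D7A3."""
    return 0xAC00 <= ord(ch) <= 0xD7A3


for noun in nouns:
    for ch in noun:
        # Precomposed Hangul syllables have general category 'Lo' (Other Letter).
        print(ch, f"U+{ord(ch):04X}", unicodedata.category(ch), unicodedata.name(ch))
```

Every character printed here falls in category `Lo`, which matches the single "Other Letter" category reported for this variable.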

Unique

Unique: 25
Unique (%): 83.3%

Sample

1st row: 공부
2nd row: 서울시
3rd row: 압해
4th row:
5th row:

Value | Count | Frequency (%)
 | 3 | 10.0%
압해 | 2 | 6.7%
민원 | 1 | 3.3%
공부 | 1 | 3.3%
황매 | 1 | 3.3%
직합 | 1 | 3.3%
브이 | 1 | 3.3%
 | 1 | 3.3%
면접 | 1 | 3.3%
전북 | 1 | 3.3%
Other values (17) | 17 | 56.7%

Most occurring characters

Value | Count | Frequency (%)
 | 6 | 10.9%
 | 3 | 5.5%
 | 3 | 5.5%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 1 | 1.8%
 | 1 | 1.8%
Other values (31) | 31 | 56.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 55 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 6 | 10.9%
 | 3 | 5.5%
 | 3 | 5.5%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 1 | 1.8%
 | 1 | 1.8%
Other values (31) | 31 | 56.4%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 55 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 6 | 10.9%
 | 3 | 5.5%
 | 3 | 5.5%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 1 | 1.8%
 | 1 | 1.8%
Other values (31) | 31 | 56.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 55 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 6 | 10.9%
 | 3 | 5.5%
 | 3 | 5.5%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 2 | 3.6%
 | 1 | 1.8%
 | 1 | 1.8%
Other values (31) | 31 | 56.4%

해시태그빈도수 (hashtag frequency)
Real number (ℝ)

Distinct: 6
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.8333333
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1.75
95-th percentile: 6.2
Maximum: 9
Range: 8
Interquartile range (IQR): 0.75
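
These quantiles can be reproduced from the value counts reported for this variable. The sketch below uses Python's standard `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between neighbouring order statistics. That this matches the report's internal method is an assumption, although it does yield the same numbers for this data:

```python
from statistics import median, quantiles

# Assumption: rebuilt from the value counts reported for 해시태그빈도수
# (value -> number of rows), not read from the dataset itself.
value_counts = {1: 22, 2: 3, 3: 2, 4: 1, 8: 1, 9: 1}
data = sorted(v for v, n in value_counts.items() for _ in range(n))

q1, q2, q3 = quantiles(data, n=4, method="inclusive")
percentiles = quantiles(data, n=100, method="inclusive")  # 99 cut points
p5, p95 = percentiles[4], percentiles[94]

print("min/max:", data[0], data[-1], "range:", data[-1] - data[0])
print("Q1, median, Q3:", q1, q2, q3, "IQR:", q3 - q1)
print("5th/95th percentile:", p5, round(p95, 1))
```

Q3 is fractional because the 75% position falls between the last 1 and the first 2 in the sorted data, so the result is interpolated.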

Descriptive statistics

Standard deviation: 1.9666764
Coefficient of variation (CV): 1.0727326
Kurtosis: 8.3462296
Mean: 1.8333333
Median Absolute Deviation (MAD): 0
Skewness: 2.9298395
Sum: 55
Variance: 3.8678161
Monotonicity: Not monotonic
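
Several of these figures can be checked against the value counts reported for this variable. The following sketch uses only Python's standard `statistics` module and assumes the sample (n - 1) definitions of variance and standard deviation, which reproduce the reported values. Skewness and kurtosis are left out because their exact estimator is not stated in the report:

```python
from statistics import mean, median, stdev, variance

# Assumption: rebuilt from the value counts reported for 해시태그빈도수
# (value -> number of rows), not read from the dataset itself.
value_counts = {1: 22, 2: 3, 3: 2, 4: 1, 8: 1, 9: 1}
data = [v for v, n in value_counts.items() for _ in range(n)]

total = sum(data)
mu = mean(data)
var = variance(data)   # sample variance, divides by n - 1
sd = stdev(data)       # square root of the sample variance
cv = sd / mu           # coefficient of variation
med = median(data)
mad = median(abs(x - med) for x in data)  # median absolute deviation

print(f"sum={total} mean={mu:.7f} variance={var:.7f}")
print(f"std={sd:.7f} cv={cv:.7f} mad={mad}")
```

The MAD is 0 because more than half of the rows (22 of 30) share the median value 1.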

[Figure: histogram with fixed size bins (bins=6)]

Common Values

Value | Count | Frequency (%)
1 | 22 | 73.3%
2 | 3 | 10.0%
3 | 2 | 6.7%
4 | 1 | 3.3%
9 | 1 | 3.3%
8 | 1 | 3.3%

Smallest values

Value | Count | Frequency (%)
1 | 22 | 73.3%
2 | 3 | 10.0%
3 | 2 | 6.7%
4 | 1 | 3.3%
8 | 1 | 3.3%
9 | 1 | 3.3%

Largest values

Value | Count | Frequency (%)
9 | 1 | 3.3%
8 | 1 | 3.3%
4 | 1 | 3.3%
3 | 2 | 6.7%
2 | 3 | 10.0%
1 | 22 | 73.3%

해시태그최근지수 (hashtag recency index)
Categorical

Distinct: 5
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 3.3%
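
The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). A minimal Python sketch of that distinction, using the value counts reported for this variable; reading the two terms this way is consistent with the figures in this report, but the variable names below are illustrative only:

```python
from collections import Counter

# Assumption: rebuilt from the Common Values counts reported for
# 해시태그최근지수 (value -> number of rows).
value_counts = Counter({"19": 12, "17": 8, "18": 7, "16": 2, "20": 1})

n_rows = sum(value_counts.values())
distinct = len(value_counts)                              # different values
unique = sum(1 for c in value_counts.values() if c == 1)  # values seen once

print(f"rows={n_rows}")
print(f"distinct={distinct} ({100 * distinct / n_rows:.1f}%)")
print(f"unique={unique} ({100 * unique / n_rows:.1f}%)")
```

Here only the value "20" appears a single time, so one value is unique even though five are distinct.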

Sample

1st row: 18
2nd row: 19
3rd row: 17
4th row: 19
5th row: 19

Common Values

Value | Count | Frequency (%)
19 | 12 | 40.0%
17 | 8 | 26.7%
18 | 7 | 23.3%
16 | 2 | 6.7%
20 | 1 | 3.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
Variable | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
해시태그채널ID | 1.000 | 1.000 | 1.000 | 0.731 | 0.491
해시태그영상ID | 1.000 | 1.000 | 0.930 | 0.956 | 0.982
해시태그추출명사명 | 1.000 | 0.930 | 1.000 | 1.000 | 0.963
해시태그빈도수 | 0.731 | 0.956 | 1.000 | 1.000 | 0.438
해시태그최근지수 | 0.491 | 0.982 | 0.963 | 0.438 | 1.000

[Figure: correlation heatmap]
Variable | 해시태그채널ID | 해시태그최근지수
해시태그채널ID | 1.000 | 0.404
해시태그최근지수 | 0.404 | 1.000

[Figure: correlation heatmap]
Variable | 해시태그빈도수 | 해시태그채널ID | 해시태그최근지수
해시태그빈도수 | 1.000 | 0.380 | 0.299
해시태그채널ID | 0.380 | 1.000 | 0.404
해시태그최근지수 | 0.299 | 0.404 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
0 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=8tYsYfwUJIk | 공부 | 1 | 18
1 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=4nOoQFYPEek | 서울시 | 1 | 19
2 | 2021-09-01 | https://www.youtube.com/channel/UCJa3a6Te2JncMHTwXYn5clw | https://www.youtube.com/watch?v=sGbkt4zZQsY | 압해 | 1 | 17
3 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=Bg4x8FswW3w |  | 1 | 19
4 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=96Xi4tVFr5U |  | 1 | 19
5 | 2021-09-01 | https://www.youtube.com/channel/UCJa3a6Te2JncMHTwXYn5clw | https://www.youtube.com/watch?v=zjrLQ0BdJwc | 신안 | 3 | 19
6 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=JLNINRNGx-A |  | 1 | 19
7 | 2021-09-01 | https://www.youtube.com/channel/UCJa3a6Te2JncMHTwXYn5clw | https://www.youtube.com/watch?v=zjrLQ0BdJwc | 압해 | 1 | 17
8 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=JLNINRNGx-A | 행정 | 1 | 19
9 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=37s_z2oTQss |  | 2 | 18

Last rows

Index | 해시태그수집일자 | 해시태그채널ID | 해시태그영상ID | 해시태그추출명사명 | 해시태그빈도수 | 해시태그최근지수
20 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=dzq6GojpYVE | 서파 | 1 | 18
21 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=hAoCIigEglI | 서울 | 1 | 17
22 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=s3xM9v-5Uso | 가을 | 1 | 17
23 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=hAoCIigEglI |  | 1 | 17
24 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=t9S90dnGxdE | 전북 | 2 | 16
25 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=kpEGkNMBdyc | 면접 | 1 | 18
26 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=kk_HmjXG1RU |  | 3 | 20
27 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=wqlvWtcRu7o | 브이 | 2 | 17
28 | 2021-09-01 | https://www.youtube.com/channel/UC-4cn_Iv6NbpEF0Ntbq3cMw | https://www.youtube.com/watch?v=kpEGkNMBdyc | 직합 | 1 | 18
29 | 2021-09-01 | https://www.youtube.com/channel/UCJh7YJLH8MaJQZxzMYS-gqg | https://www.youtube.com/watch?v=z0R5XPXbwms | 왕궁 | 1 | 17