Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory54.4 B

Variable types

Categorical4
Text1
Numeric1

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/0aa4570a-2c55-4756-b123-397fcbc9db30

Alerts

해시태그수집일자 has constant value ""Constant
해시태그영상ID is highly overall correlated with 해시태그채널ID and 1 other fieldsHigh correlation
해시태그채널ID is highly overall correlated with 해시태그영상ID and 1 other fieldsHigh correlation
해시태그빈도수 is highly overall correlated with 해시태그최근지수High correlation
해시태그최근지수 is highly overall correlated with 해시태그빈도수 and 2 other fieldsHigh correlation
해시태그추출명사명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:16:15.673596
Analysis finished2023-12-10 14:16:16.448476
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시태그수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2020-09-01
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-09-01
2nd row2020-09-01
3rd row2020-09-01
4th row2020-09-01
5th row2020-09-01

Common Values

ValueCountFrequency (%)
2020-09-01 30
100.0%

Length

2023-12-10T23:16:16.551519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:16:16.694284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-09-01 30
100.0%

해시태그채널ID
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ
15 
https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ
https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ
 
1

Length

Max length56
Median length56
Mean length56
Min length56

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st rowhttps://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ
2nd rowhttps://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
3rd rowhttps://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
4th rowhttps://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
5th rowhttps://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA

Common Values

ValueCountFrequency (%)
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ 15
50.0%
https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA 9
30.0%
https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ 5
 
16.7%
https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ 1
 
3.3%

Length

2023-12-10T23:16:16.835591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:16:17.008630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/channel/ucfl1scaksd6_7jizwwhcwjq 15
50.0%
https://www.youtube.com/channel/ucd4vefdwg_kzvi-9eq8spha 9
30.0%
https://www.youtube.com/channel/uczbyyvm2ux-iqwzh0oxp-mq 5
 
16.7%
https://www.youtube.com/channel/ucl402yyy7rch7pbi3npmgpq 1
 
3.3%

해시태그영상ID
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/watch?v=---DUD-BgqU
15 
https://www.youtube.com/watch?v=---1gcmh6q0
https://www.youtube.com/watch?v=---HwiQbik4
https://www.youtube.com/watch?v=---1cHdqY74
 
1

Length

Max length43
Median length43
Mean length43
Min length43

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st rowhttps://www.youtube.com/watch?v=---1cHdqY74
2nd rowhttps://www.youtube.com/watch?v=---1gcmh6q0
3rd rowhttps://www.youtube.com/watch?v=---1gcmh6q0
4th rowhttps://www.youtube.com/watch?v=---1gcmh6q0
5th rowhttps://www.youtube.com/watch?v=---1gcmh6q0

Common Values

ValueCountFrequency (%)
https://www.youtube.com/watch?v=---DUD-BgqU 15
50.0%
https://www.youtube.com/watch?v=---1gcmh6q0 9
30.0%
https://www.youtube.com/watch?v=---HwiQbik4 5
 
16.7%
https://www.youtube.com/watch?v=---1cHdqY74 1
 
3.3%

Length

2023-12-10T23:16:17.174901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:16:17.326093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/watch?v=---dud-bgqu 15
50.0%
https://www.youtube.com/watch?v=---1gcmh6q0 9
30.0%
https://www.youtube.com/watch?v=---hwiqbik4 5
 
16.7%
https://www.youtube.com/watch?v=---1chdqy74 1
 
3.3%
Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:16:17.860263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.3666667
Min length1

Characters and Unicode

Total characters71
Distinct characters55
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row피파
2nd row유산
3rd row공허
4th row패치
5th row매크로
ValueCountFrequency (%)
피파 1
 
3.3%
유산 1
 
3.3%
1
 
3.3%
고래 1
 
3.3%
다이아 1
 
3.3%
1
 
3.3%
1
 
3.3%
류준열 1
 
3.3%
쿠바 1
 
3.3%
자전 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:16:18.364542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 51
71.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 51
71.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 51
71.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 51
71.8%

해시태그빈도수
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0333333
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:16:18.531270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5.65
Maximum9
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.8473342
Coefficient of variation (CV)0.90852502
Kurtosis7.5967553
Mean2.0333333
Median Absolute Deviation (MAD)0
Skewness2.6553697
Sum61
Variance3.4126437
MonotonicityNot monotonic
2023-12-10T23:16:18.704617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 17
56.7%
2 6
 
20.0%
3 4
 
13.3%
9 1
 
3.3%
7 1
 
3.3%
4 1
 
3.3%
ValueCountFrequency (%)
1 17
56.7%
2 6
 
20.0%
3 4
 
13.3%
4 1
 
3.3%
7 1
 
3.3%
9 1
 
3.3%
ValueCountFrequency (%)
9 1
 
3.3%
7 1
 
3.3%
4 1
 
3.3%
3 4
 
13.3%
2 6
 
20.0%
1 17
56.7%

해시태그최근지수
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
19
16 
20
18
16
17

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row19
4th row18
5th row17

Common Values

ValueCountFrequency (%)
19 16
53.3%
20 4
 
13.3%
18 4
 
13.3%
16 4
 
13.3%
17 2
 
6.7%

Length

2023-12-10T23:16:18.900642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:16:19.115343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
19 16
53.3%
20 4
 
13.3%
18 4
 
13.3%
16 4
 
13.3%
17 2
 
6.7%

Interactions

2023-12-10T23:16:15.988232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:16:19.657513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
해시태그채널ID1.0001.0001.0000.5550.665
해시태그영상ID1.0001.0001.0000.5550.665
해시태그추출명사명1.0001.0001.0001.0001.000
해시태그빈도수0.5550.5551.0001.0000.734
해시태그최근지수0.6650.6651.0000.7341.000
2023-12-10T23:16:19.828094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그영상ID해시태그채널ID해시태그최근지수
해시태그영상ID1.0001.0000.582
해시태그채널ID1.0001.0000.582
해시태그최근지수0.5820.5821.000
2023-12-10T23:16:19.990417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그빈도수해시태그채널ID해시태그영상ID해시태그최근지수
해시태그빈도수1.0000.3680.3680.589
해시태그채널ID0.3681.0001.0000.582
해시태그영상ID0.3681.0001.0000.582
해시태그최근지수0.5890.5820.5821.000

Missing values

2023-12-10T23:16:16.200596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:16:16.379087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
02020-09-01https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQhttps://www.youtube.com/watch?v=---1cHdqY74피파220
12020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0유산920
22020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0공허719
32020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0패치418
42020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0매크로317
52020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0밸런스317
62020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0커뮤니티216
72020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0공유216
82020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0피드백216
92020-09-01https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHAhttps://www.youtube.com/watch?v=---1gcmh6q0사도216
해시태그수집일자해시태그채널ID해시태그영상ID해시태그추출명사명해시태그빈도수해시태그최근지수
202020-09-01https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQhttps://www.youtube.com/watch?v=---DUD-BgqU이제훈119
212020-09-01https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQhttps://www.youtube.com/watch?v=---DUD-BgqU자전119
222020-09-01https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQhttps://www.youtube.com/watch?v=---DUD-BgqU쿠바119
232020-09-01https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQhttps://www.youtube.com/watch?v=---DUD-BgqU류준열119
242020-09-01https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQhttps://www.youtube.com/watch?v=---DUD-BgqU119
252020-09-01https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=---HwiQbik4320
262020-09-01https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=---HwiQbik4다이아219
272020-09-01https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=---HwiQbik4고래118
282020-09-01https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=---HwiQbik4118
292020-09-01https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQhttps://www.youtube.com/watch?v=---HwiQbik4김윤태118