Overview

Dataset statistics

Number of variables: 6
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 54.4 B

Variable types

Categorical: 4
Text: 1
Numeric: 1

Dataset

Description: Sample data
Author: Hanyang University
URL: https://bigdata-region.kr/#/dataset/b7dadb3c-c4ee-44f1-b20a-e23289f269ee

Alerts

생성기간일자 has constant value "" (Constant)
체널ID is highly overall correlated with 영상ID and 1 other field (High correlation)
영상ID is highly overall correlated with 체널ID and 1 other field (High correlation)
빈도수 is highly overall correlated with 해시태그지수 (High correlation)
해시태그지수 is highly overall correlated with 빈도수 and 2 other fields (High correlation)
영상추출명사명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 14:15:23.343413
Analysis finished: 2023-12-10 14:15:24.407380
Duration: 1.06 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
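
The Duration value can be checked from the two timestamps in this block. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-10 14:15:23.343413", FMT)
finished = datetime.strptime("2023-12-10 14:15:24.407380", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the 1.06 seconds reported.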

Variables

생성기간일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020.8.1
2nd row: 2020.8.1
3rd row: 2020.8.1
4th row: 2020.8.1
5th row: 2020.8.1

Common Values

Value | Count | Frequency (%)
2020.8.1 | 30 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values listed above]

체널ID
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 13.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 56
Median length: 56
Mean length: 56
Min length: 56

Unique

Unique: 1
Unique (%): 3.3%

Sample

1st row: https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ
2nd row: https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
3rd row: https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
4th row: https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA
5th row: https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA

Common Values

Value | Count | Frequency (%)
https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | 15 | 50.0%
https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | 9 | 30.0%
https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | 5 | 16.7%
https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ | 1 | 3.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values listed above]
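
The Distinct and Unique figures for 체널ID follow directly from the value counts in its Common Values table. The sketch below assumes that "Distinct" counts different values and "Unique" counts values that occur exactly once; the dictionary keys are the channel IDs with the common URL prefix dropped for brevity, and the helper `pct` is hypothetical.

```python
# Counts per channel, copied from the Common Values table for 체널ID.
# Keys are the channel IDs without the https://www.youtube.com/channel/ prefix.
counts = {
    "UCFL1sCAksD6_7JIZwwHcwjQ": 15,
    "UCd4vEfDwg_kzVI-9Eq8sPHA": 9,
    "UCZbYYvm2uX-iqwZH0oXp-MQ": 5,
    "UCl402YYy7RcH7PBI3npmGPQ": 1,
}


def pct(part, whole):
    """Percentage rounded to one decimal place, as displayed in the report."""
    return round(100 * part / whole, 1)


n = sum(counts.values())                            # number of observations
distinct = len(counts)                              # different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n, distinct, pct(distinct, n), unique, pct(unique, n))
```

The same arithmetic applies to the other categorical variables.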

영상ID
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 13.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 43
Median length: 43
Mean length: 43
Min length: 43

Unique

Unique: 1
Unique (%): 3.3%

Sample

1st row: https://www.youtube.com/watch?v=---1cHdqY74
2nd row: https://www.youtube.com/watch?v=---1gcmh6q0
3rd row: https://www.youtube.com/watch?v=---1gcmh6q0
4th row: https://www.youtube.com/watch?v=---1gcmh6q0
5th row: https://www.youtube.com/watch?v=---1gcmh6q0

Common Values

Value | Count | Frequency (%)
https://www.youtube.com/watch?v=---DUD-BgqU | 15 | 50.0%
https://www.youtube.com/watch?v=---1gcmh6q0 | 9 | 30.0%
https://www.youtube.com/watch?v=---HwiQbik4 | 5 | 16.7%
https://www.youtube.com/watch?v=---1cHdqY74 | 1 | 3.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values listed above]

영상추출명사명
Text

UNIQUE

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.3666667
Min length: 1

Characters and Unicode

Total characters: 71
Distinct characters: 56
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: 피파
2nd row: 유산
3rd row: 공허
4th row: 패치
5th row: 밸런스

Value | Count | Frequency (%)
피파 | 1 | 3.3%
유산 | 1 | 3.3%
동양 | 1 | 3.3%
데이브 | 1 | 3.3%
다이아 | 1 | 3.3%
 | 1 | 3.3%
쿠바 | 1 | 3.3%
자전 | 1 | 3.3%
이제훈 | 1 | 3.3%
운치 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Most occurring characters

The most frequent character occurs 3 times (4.2%), the next nine characters occur 2 times each (2.8%), and the other 46 characters account for 50 occurrences (70.4%). The individual characters are not shown.

Most occurring categories

ValueCountFrequency (%)
Other Letter 71
100.0%

Most frequent character per category

Other Letter
All 71 characters fall in this category, so the counts are identical to those under Most occurring characters.

Most occurring scripts

ValueCountFrequency (%)
Hangul 71
100.0%

Most frequent character per script

Hangul
All 71 characters fall in this script, so the counts are identical to those under Most occurring characters.

Most occurring blocks

ValueCountFrequency (%)
Hangul 71
100.0%

Most frequent character per block

Hangul
All 71 characters fall in this block, so the counts are identical to those under Most occurring characters.

빈도수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0333333
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 5.65
Maximum: 9
Range: 8
Interquartile range (IQR): 1
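
The quantile statistics can be reproduced from the value counts listed for 빈도수 in this section. The sketch below assumes the report uses linear interpolation between order statistics (the default of `numpy.quantile` and pandas `Series.quantile`); the function `quantile` is a hypothetical helper written for illustration, not the report's own code.

```python
# Value -> count for 빈도수, copied from the value tables in this section.
value_counts = {1: 17, 2: 6, 3: 4, 4: 1, 7: 1, 9: 1}
data = sorted(v for v, c in value_counts.items() for _ in range(c))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)        # fractional index into the data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


q1 = quantile(data, 0.25)
q3 = quantile(data, 0.75)
p95 = quantile(data, 0.95)
iqr = q3 - q1
value_range = data[-1] - data[0]

print(q1, q3, round(p95, 2), iqr, value_range)
```

For the 95th percentile the fractional index is 0.95 × 29 = 27.55, which falls between the sorted values 4 and 7, giving 4 + 0.55 × 3 = 5.65.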

Descriptive statistics

Standard deviation: 1.8473342
Coefficient of variation (CV): 0.90852502
Kurtosis: 7.5967553
Mean: 2.0333333
Median Absolute Deviation (MAD): 0
Skewness: 2.6553697
Sum: 61
Variance: 3.4126437
Monotonicity: Not monotonic
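
Most of the descriptive statistics can be recomputed from the value counts listed for 빈도수. The sketch below assumes sample (n − 1) variance and the bias-adjusted skewness that pandas reports; kurtosis is left out. Variable names are illustrative.

```python
import math
import statistics

# Value -> count for 빈도수, copied from the value tables in this section.
value_counts = {1: 17, 2: 6, 3: 4, 4: 1, 7: 1, 9: 1}
data = [v for v, c in value_counts.items() for _ in range(c)]
n = len(data)

total = sum(data)
mean = statistics.mean(data)
variance = statistics.variance(data)   # sample variance, divisor n - 1
std = statistics.stdev(data)           # square root of the sample variance
cv = std / mean                        # coefficient of variation
median = statistics.median(data)
mad = statistics.median(abs(x - median) for x in data)

# Adjusted Fisher-Pearson skewness (assumed to match the report's estimator).
m2 = sum((x - mean) ** 2 for x in data) / n
m3 = sum((x - mean) ** 3 for x in data) / n
skewness = math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5

print(total, mean, variance, std, cv, mad, skewness)
```

The MAD is 0 because 17 of the 30 values equal the median of 1, so more than half of the absolute deviations are zero.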
[Figure: Histogram with fixed size bins (bins=6)]

Common values (by count)

Value | Count | Frequency (%)
1 | 17 | 56.7%
2 | 6 | 20.0%
3 | 4 | 13.3%
9 | 1 | 3.3%
7 | 1 | 3.3%
4 | 1 | 3.3%

Values in ascending order

Value | Count | Frequency (%)
1 | 17 | 56.7%
2 | 6 | 20.0%
3 | 4 | 13.3%
4 | 1 | 3.3%
7 | 1 | 3.3%
9 | 1 | 3.3%

Values in descending order

Value | Count | Frequency (%)
9 | 1 | 3.3%
7 | 1 | 3.3%
4 | 1 | 3.3%
3 | 4 | 13.3%
2 | 6 | 20.0%
1 | 17 | 56.7%

해시태그지수
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 19
4th row: 18
5th row: 17

Common Values

Value | Count | Frequency (%)
19 | 16 | 53.3%
20 | 4 | 13.3%
18 | 4 | 13.3%
16 | 4 | 13.3%
17 | 2 | 6.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values listed above]

Interactions

[Figure: interactions plot]

Correlations

 | 체널ID | 영상ID | 영상추출명사명 | 빈도수 | 해시태그지수
체널ID | 1.000 | 1.000 | 1.000 | 0.555 | 0.665
영상ID | 1.000 | 1.000 | 1.000 | 0.555 | 0.665
영상추출명사명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
빈도수 | 0.555 | 0.555 | 1.000 | 1.000 | 0.734
해시태그지수 | 0.665 | 0.665 | 1.000 | 0.734 | 1.000

 | 체널ID | 영상ID | 해시태그지수
체널ID | 1.000 | 1.000 | 0.582
영상ID | 1.000 | 1.000 | 0.582
해시태그지수 | 0.582 | 0.582 | 1.000

 | 빈도수 | 체널ID | 영상ID | 해시태그지수
빈도수 | 1.000 | 0.368 | 0.368 | 0.589
체널ID | 0.368 | 1.000 | 1.000 | 0.582
영상ID | 0.368 | 1.000 | 1.000 | 0.582
해시태그지수 | 0.589 | 0.582 | 0.582 | 1.000
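
The 1.000 between 체널ID and 영상ID is what an association measure for categorical pairs gives when each value of one variable maps to exactly one value of the other. The sketch below illustrates this with Cramér's V without bias correction. Both the choice of measure and the one-to-one contingency table are assumptions: the report does not name the measure in this section, and the table is inferred from the matching counts (15, 9, 5, 1) in the two Common Values tables. The function `cramers_v` is a hypothetical helper.

```python
import math


def cramers_v(table):
    """Cramér's V (no bias correction) for a contingency table given as rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Assumed 체널ID x 영상ID table: every channel appears with exactly one video.
one_to_one = [
    [15, 0, 0, 0],
    [0, 9, 0, 0],
    [0, 0, 5, 0],
    [0, 0, 0, 1],
]
# Contrast case: two variables with no association at all.
independent = [
    [10, 10],
    [10, 10],
]

v_one_to_one = cramers_v(one_to_one)
v_independent = cramers_v(independent)
print(round(v_one_to_one, 3), round(v_independent, 3))
```

Under these assumptions the one-to-one table yields a value of 1 and the independent table yields 0, which brackets the intermediate values shown in the matrices.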

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

 | 생성기간일자 | 체널ID | 영상ID | 영상추출명사명 | 빈도수 | 해시태그지수
0 | 2020.8.1 | https://www.youtube.com/channel/UCl402YYy7RcH7PBI3npmGPQ | https://www.youtube.com/watch?v=---1cHdqY74 | 피파 | 2 | 20
1 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 유산 | 9 | 20
2 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 공허 | 7 | 19
3 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 패치 | 4 | 18
4 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 밸런스 | 3 | 17
5 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 매크로 | 3 | 17
6 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 피드백 | 2 | 16
7 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 공유 | 2 | 16
8 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 커뮤니티 | 2 | 16
9 | 2020.8.1 | https://www.youtube.com/channel/UCd4vEfDwg_kzVI-9Eq8sPHA | https://www.youtube.com/watch?v=---1gcmh6q0 | 사도 | 2 | 16

Last rows

 | 생성기간일자 | 체널ID | 영상ID | 영상추출명사명 | 빈도수 | 해시태그지수
20 | 2020.8.1 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=---DUD-BgqU | 배낭 | 1 | 19
21 | 2020.8.1 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=---DUD-BgqU | 운치 | 1 | 19
22 | 2020.8.1 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=---DUD-BgqU | 이제훈 | 1 | 19
23 | 2020.8.1 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=---DUD-BgqU | 자전 | 1 | 19
24 | 2020.8.1 | https://www.youtube.com/channel/UCFL1sCAksD6_7JIZwwHcwjQ | https://www.youtube.com/watch?v=---DUD-BgqU | 쿠바 | 1 | 19
25 | 2020.8.1 | https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | https://www.youtube.com/watch?v=---HwiQbik4 |  | 3 | 20
26 | 2020.8.1 | https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | https://www.youtube.com/watch?v=---HwiQbik4 | 다이아 | 2 | 19
27 | 2020.8.1 | https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | https://www.youtube.com/watch?v=---HwiQbik4 | 데이브 | 1 | 18
28 | 2020.8.1 | https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | https://www.youtube.com/watch?v=---HwiQbik4 | 동양 | 1 | 18
29 | 2020.8.1 | https://www.youtube.com/channel/UCZbYYvm2uX-iqwZH0oXp-MQ | https://www.youtube.com/watch?v=---HwiQbik4 |  | 1 | 18