Overview

Dataset statistics

Number of variables9
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory82.4 B

Variable types

Categorical4
Text1
DateTime1
Numeric3

Dataset

Description샘플 데이터
Author한양대
URLhttps://www.bigdata-region.kr/#/dataset/33cd631d-ff3a-4a87-8030-e01cd110df9b

Alerts

채널ID has constant value ""Constant
생성기간일자 has constant value ""Constant
채널조회수 has constant value ""Constant
구독자수 has constant value ""Constant
부정평가비율 has constant value ""Constant
영상조회수 is highly overall correlated with 긍정평가비율 and 1 other fieldsHigh correlation
긍정평가비율 is highly overall correlated with 영상조회수 and 1 other fieldsHigh correlation
영상덧글수 is highly overall correlated with 영상조회수 and 1 other fieldsHigh correlation
영상ID has unique valuesUnique
영상조회수 has unique valuesUnique
영상덧글수 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:18:37.486974
Analysis finished2023-12-10 14:18:39.134946
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

채널ID
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA
30 

Length

Max length56
Median length56
Mean length56
Min length56

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA
2nd rowhttps://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA
3rd rowhttps://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA
4th rowhttps://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA
5th rowhttps://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA

Common Values

ValueCountFrequency (%)
https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrA 30
100.0%

Length

2023-12-10T23:18:39.221478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:18:39.339237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
https://www.youtube.com/channel/ucqrhncabshhqm_8t2sumura 30
100.0%

영상ID
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:18:39.618349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters69
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowhttps://www.youtube.com/watch?v=rA4LsSS0sW0
2nd rowhttps://www.youtube.com/watch?v=jCEd9nzrgvo
3rd rowhttps://www.youtube.com/watch?v=v6PGcT5i5to
4th rowhttps://www.youtube.com/watch?v=UIa6zG-fysA
5th rowhttps://www.youtube.com/watch?v=uILIclcYc54
ValueCountFrequency (%)
https://www.youtube.com/watch?v=ra4lsss0sw0 1
 
3.3%
https://www.youtube.com/watch?v=jced9nzrgvo 1
 
3.3%
https://www.youtube.com/watch?v=wsdpx8hgite 1
 
3.3%
https://www.youtube.com/watch?v=tp-u3w3inku 1
 
3.3%
https://www.youtube.com/watch?v=c7__uzk56ry 1
 
3.3%
https://www.youtube.com/watch?v=6bpu8xj9n3s 1
 
3.3%
https://www.youtube.com/watch?v=zorvk24awdy 1
 
3.3%
https://www.youtube.com/watch?v=nps_qsxbwaq 1
 
3.3%
https://www.youtube.com/watch?v=lqug25s8198 1
 
3.3%
https://www.youtube.com/watch?v=h-sj3ekdhsa 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:18:40.060310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 126
 
9.8%
w 126
 
9.8%
/ 90
 
7.0%
o 67
 
5.2%
c 66
 
5.1%
h 66
 
5.1%
u 65
 
5.0%
. 60
 
4.7%
v 37
 
2.9%
s 36
 
2.8%
Other values (59) 551
42.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 843
65.3%
Other Punctuation 210
 
16.3%
Uppercase Letter 136
 
10.5%
Decimal Number 61
 
4.7%
Math Symbol 30
 
2.3%
Connector Punctuation 6
 
0.5%
Dash Punctuation 4
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 126
14.9%
w 126
14.9%
o 67
 
7.9%
c 66
 
7.8%
h 66
 
7.8%
u 65
 
7.7%
v 37
 
4.4%
s 36
 
4.3%
p 36
 
4.3%
e 35
 
4.2%
Other values (16) 183
21.7%
Uppercase Letter
ValueCountFrequency (%)
S 9
 
6.6%
I 9
 
6.6%
Y 9
 
6.6%
U 8
 
5.9%
Q 7
 
5.1%
A 7
 
5.1%
W 7
 
5.1%
V 6
 
4.4%
M 6
 
4.4%
L 6
 
4.4%
Other values (16) 62
45.6%
Decimal Number
ValueCountFrequency (%)
5 10
16.4%
8 9
14.8%
9 8
13.1%
4 6
9.8%
3 6
9.8%
2 5
8.2%
1 5
8.2%
6 5
8.2%
0 5
8.2%
7 2
 
3.3%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
: 30
 
14.3%
? 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 979
75.9%
Common 311
 
24.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 126
 
12.9%
w 126
 
12.9%
o 67
 
6.8%
c 66
 
6.7%
h 66
 
6.7%
u 65
 
6.6%
v 37
 
3.8%
s 36
 
3.7%
p 36
 
3.7%
e 35
 
3.6%
Other values (42) 319
32.6%
Common
ValueCountFrequency (%)
/ 90
28.9%
. 60
19.3%
= 30
 
9.6%
: 30
 
9.6%
? 30
 
9.6%
5 10
 
3.2%
8 9
 
2.9%
9 8
 
2.6%
4 6
 
1.9%
_ 6
 
1.9%
Other values (7) 32
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 126
 
9.8%
w 126
 
9.8%
/ 90
 
7.0%
o 67
 
5.2%
c 66
 
5.1%
h 66
 
5.1%
u 65
 
5.0%
. 60
 
4.7%
v 37
 
2.9%
s 36
 
2.8%
Other values (59) 551
42.7%

생성기간일자
Date

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2019-12-21 00:00:00
Maximum2019-12-21 00:00:00
2023-12-10T23:18:40.211807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:40.357073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

채널조회수
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
25812355
30 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row25812355
2nd row25812355
3rd row25812355
4th row25812355
5th row25812355

Common Values

ValueCountFrequency (%)
25812355 30
100.0%

Length

2023-12-10T23:18:40.527060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:18:40.627812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
25812355 30
100.0%

구독자수
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
146000
30 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row146000
2nd row146000
3rd row146000
4th row146000
5th row146000

Common Values

ValueCountFrequency (%)
146000 30
100.0%

Length

2023-12-10T23:18:40.758158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:18:40.903997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
146000 30
100.0%

영상조회수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean163351.7
Minimum1633
Maximum1189110
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:18:41.026262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1633
5-th percentile3716
Q110350.5
median18696
Q362183
95-th percentile1025130.9
Maximum1189110
Range1187477
Interquartile range (IQR)51832.5

Descriptive statistics

Standard deviation337740.2
Coefficient of variation (CV)2.0675646
Kurtosis4.3289937
Mean163351.7
Median Absolute Deviation (MAD)13370.5
Skewness2.3662732
Sum4900551
Variance1.1406844 × 1011
MonotonicityNot monotonic
2023-12-10T23:18:41.188920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
8792 1
 
3.3%
37987 1
 
3.3%
1115310 1
 
3.3%
8609 1
 
3.3%
17501 1
 
3.3%
6782 1
 
3.3%
10709 1
 
3.3%
53283 1
 
3.3%
218373 1
 
3.3%
49287 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1633 1
3.3%
3464 1
3.3%
4024 1
3.3%
6627 1
3.3%
6782 1
3.3%
8609 1
3.3%
8792 1
3.3%
10231 1
3.3%
10709 1
3.3%
13276 1
3.3%
ValueCountFrequency (%)
1189110 1
3.3%
1115310 1
3.3%
914912 1
3.3%
717077 1
3.3%
218373 1
3.3%
158377 1
3.3%
91139 1
3.3%
62972 1
3.3%
59816 1
3.3%
53283 1
3.3%

긍정평가비율
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99.81
Minimum98.8
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:18:41.313609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum98.8
5-th percentile99.29
Q199.8
median99.9
Q399.975
95-th percentile100
Maximum100
Range1.2
Interquartile range (IQR)0.175

Descriptive statistics

Standard deviation0.26175798
Coefficient of variation (CV)0.0026225627
Kurtosis7.6777374
Mean99.81
Median Absolute Deviation (MAD)0.1
Skewness-2.6133876
Sum2994.3
Variance0.068517241
MonotonicityNot monotonic
2023-12-10T23:18:41.469073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
99.9 11
36.7%
100.0 8
26.7%
99.7 4
 
13.3%
99.8 4
 
13.3%
99.2 1
 
3.3%
99.4 1
 
3.3%
98.8 1
 
3.3%
ValueCountFrequency (%)
98.8 1
 
3.3%
99.2 1
 
3.3%
99.4 1
 
3.3%
99.7 4
 
13.3%
99.8 4
 
13.3%
99.9 11
36.7%
100.0 8
26.7%
ValueCountFrequency (%)
100.0 8
26.7%
99.9 11
36.7%
99.8 4
 
13.3%
99.7 4
 
13.3%
99.4 1
 
3.3%
99.2 1
 
3.3%
98.8 1
 
3.3%

부정평가비율
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
0
30 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 30
100.0%

Length

2023-12-10T23:18:41.606571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:18:41.708144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 30
100.0%

영상덧글수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean817.46667
Minimum8
Maximum5987
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:18:41.815690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile17.9
Q165.25
median151
Q3455.25
95-th percentile4224.4
Maximum5987
Range5979
Interquartile range (IQR)390

Descriptive statistics

Standard deviation1538.2193
Coefficient of variation (CV)1.8816906
Kurtosis4.538241
Mean817.46667
Median Absolute Deviation (MAD)121
Skewness2.3209253
Sum24524
Variance2366118.7
MonotonicityNot monotonic
2023-12-10T23:18:41.996156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
139 1
 
3.3%
46 1
 
3.3%
4055 1
 
3.3%
41 1
 
3.3%
103 1
 
3.3%
29 1
 
3.3%
88 1
 
3.3%
321 1
 
3.3%
1690 1
 
3.3%
372 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
8 1
3.3%
17 1
3.3%
19 1
3.3%
29 1
3.3%
31 1
3.3%
41 1
3.3%
46 1
3.3%
65 1
3.3%
66 1
3.3%
88 1
3.3%
ValueCountFrequency (%)
5987 1
3.3%
4363 1
3.3%
4055 1
3.3%
3491 1
3.3%
1690 1
3.3%
1003 1
3.3%
687 1
3.3%
483 1
3.3%
372 1
3.3%
321 1
3.3%

Interactions

2023-12-10T23:18:38.503769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:37.640409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:37.888288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:38.628372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:37.727042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:38.297801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:38.770535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:37.809524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:18:38.407888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:18:42.108982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영상ID영상조회수긍정평가비율영상덧글수
영상ID1.0001.0001.0001.000
영상조회수1.0001.0000.0000.960
긍정평가비율1.0000.0001.0000.000
영상덧글수1.0000.9600.0001.000
2023-12-10T23:18:42.216977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영상조회수긍정평가비율영상덧글수
영상조회수1.0000.8990.869
긍정평가비율0.8991.0000.880
영상덧글수0.8690.8801.000

Missing values

2023-12-10T23:18:38.920070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:18:39.076454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

채널ID영상ID생성기간일자채널조회수구독자수영상조회수긍정평가비율부정평가비율영상덧글수
0https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=rA4LsSS0sW02019-12-2125812355146000879299.70139
1https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=jCEd9nzrgvo2019-12-2125812355146000346499.2031
2https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=v6PGcT5i5to2019-12-21258123551460001023199.90308
3https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=UIa6zG-fysA2019-12-21258123551460001711499.90163
4https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=uILIclcYc542019-12-21258123551460001327699.9093
5https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=WFrxx5_DFsA2019-12-21258123551460009113999.90267
6https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=95jlNM_9n9E2019-12-21258123551460001620999.8065
7https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=vSpBvxXPy282019-12-21258123551460001189110100.004363
8https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=efUZOwgtkUw2019-12-2125812355146000662799.7019
9https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=AR6Z8Qor7yc2019-12-21258123551460002929399.8066
채널ID영상ID생성기간일자채널조회수구독자수영상조회수긍정평가비율부정평가비율영상덧글수
20https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=UzGnLfhMX042019-12-212581235514600059816100.00196
21https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=H-SJ3eKdhSA2019-12-2125812355146000717077100.005987
22https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=lquG25S81982019-12-21258123551460004928799.90372
23https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=npS_QSxbWaQ2019-12-2125812355146000218373100.001690
24https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=zorVK24aWDY2019-12-21258123551460005328399.90321
25https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=6bPU8Xj9N3s2019-12-21258123551460001070999.8088
26https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=C7__uZk56RY2019-12-2125812355146000678299.7029
27https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=Tp-u3w3InkU2019-12-21258123551460001750199.90103
28https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=wSdPx8hGItE2019-12-2125812355146000860999.7041
29https://www.youtube.com/channel/UCqRHncabshHQm_8t2SuMUrAhttps://www.youtube.com/watch?v=YxZOWHl1L1o2019-12-21258123551460001115310100.004055