Overview

Dataset statistics

Number of variables7
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory64.4 B

Variable types

Text1
DateTime1
Numeric4
Categorical1

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/d46686df-a178-4088-af71-d0c63c16ab89

Alerts

수집일자 has constant value ""Constant
채널명 has constant value ""Constant
영상덧글수 is highly overall correlated with 영상좋아요수 and 2 other fieldsHigh correlation
영상좋아요수 is highly overall correlated with 영상덧글수 and 2 other fieldsHigh correlation
영상싫어요수 is highly overall correlated with 영상덧글수 and 2 other fieldsHigh correlation
영상조회수 is highly overall correlated with 영상덧글수 and 2 other fieldsHigh correlation
영상경로명 has unique valuesUnique
영상조회수 has unique valuesUnique
영상덧글수 has 5 (16.7%) zerosZeros
영상싫어요수 has 8 (26.7%) zerosZeros

Reproduction

Analysis started2023-12-10 14:01:34.786055
Analysis finished2023-12-10 14:01:37.656034
Duration2.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

영상경로명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:01:37.893263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters69
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowhttps://www.youtube.com/watch?v=2RH_HLImyPI
2nd rowhttps://www.youtube.com/watch?v=h7_Cwbw1Kcs
3rd rowhttps://www.youtube.com/watch?v=mPkQhidX1kY
4th rowhttps://www.youtube.com/watch?v=KlkVQMrZ7Xg
5th rowhttps://www.youtube.com/watch?v=gNA5Cm5Ydsc
ValueCountFrequency (%)
https://www.youtube.com/watch?v=2rh_hlimypi 1
 
3.3%
https://www.youtube.com/watch?v=h7_cwbw1kcs 1
 
3.3%
https://www.youtube.com/watch?v=rtmcng28rv4 1
 
3.3%
https://www.youtube.com/watch?v=otn49e1d3ai 1
 
3.3%
https://www.youtube.com/watch?v=8x_x2nzzuwq 1
 
3.3%
https://www.youtube.com/watch?v=brav_ls5-xe 1
 
3.3%
https://www.youtube.com/watch?v=ggbwbzxpzng 1
 
3.3%
https://www.youtube.com/watch?v=o3_r9qjwo5i 1
 
3.3%
https://www.youtube.com/watch?v=jzk-sq0b9t8 1
 
3.3%
https://www.youtube.com/watch?v=uxtjoxm3yc8 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:01:38.373920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 130
 
10.1%
t 124
 
9.6%
/ 90
 
7.0%
o 66
 
5.1%
h 66
 
5.1%
c 64
 
5.0%
u 63
 
4.9%
. 60
 
4.7%
m 38
 
2.9%
v 36
 
2.8%
Other values (59) 553
42.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 844
65.4%
Other Punctuation 210
 
16.3%
Uppercase Letter 135
 
10.5%
Decimal Number 60
 
4.7%
Math Symbol 30
 
2.3%
Connector Punctuation 6
 
0.5%
Dash Punctuation 5
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 130
15.4%
t 124
14.7%
o 66
 
7.8%
h 66
 
7.8%
c 64
 
7.6%
u 63
 
7.5%
m 38
 
4.5%
v 36
 
4.3%
b 36
 
4.3%
y 35
 
4.1%
Other values (16) 186
22.0%
Uppercase Letter
ValueCountFrequency (%)
Q 11
 
8.1%
I 11
 
8.1%
Z 10
 
7.4%
E 9
 
6.7%
N 8
 
5.9%
X 7
 
5.2%
H 7
 
5.2%
S 6
 
4.4%
Y 6
 
4.4%
J 6
 
4.4%
Other values (16) 54
40.0%
Decimal Number
ValueCountFrequency (%)
5 11
18.3%
4 7
11.7%
8 7
11.7%
2 7
11.7%
3 6
10.0%
9 6
10.0%
0 6
10.0%
1 5
8.3%
6 3
 
5.0%
7 2
 
3.3%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
: 30
 
14.3%
? 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 979
75.9%
Common 311
 
24.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 130
13.3%
t 124
 
12.7%
o 66
 
6.7%
h 66
 
6.7%
c 64
 
6.5%
u 63
 
6.4%
m 38
 
3.9%
v 36
 
3.7%
b 36
 
3.7%
y 35
 
3.6%
Other values (42) 321
32.8%
Common
ValueCountFrequency (%)
/ 90
28.9%
. 60
19.3%
= 30
 
9.6%
: 30
 
9.6%
? 30
 
9.6%
5 11
 
3.5%
4 7
 
2.3%
8 7
 
2.3%
2 7
 
2.3%
_ 6
 
1.9%
Other values (7) 33
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 130
 
10.1%
t 124
 
9.6%
/ 90
 
7.0%
o 66
 
5.1%
h 66
 
5.1%
c 64
 
5.0%
u 63
 
4.9%
. 60
 
4.7%
m 38
 
2.9%
v 36
 
2.8%
Other values (59) 553
42.9%

수집일자
Date

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2020-08-01 00:00:00
Maximum2020-08-01 00:00:00
2023-12-10T23:01:38.521162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:38.672315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

영상덧글수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct16
Distinct (%)53.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.63333
Minimum0
Maximum3326
Zeros5
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:01:38.827037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q39.25
95-th percentile115.65
Maximum3326
Range3326
Interquartile range (IQR)8.25

Descriptive statistics

Standard deviation605.84649
Coefficient of variation (CV)4.9403084
Kurtosis29.818631
Mean122.63333
Median Absolute Deviation (MAD)2.5
Skewness5.453894
Sum3679
Variance367049.96
MonotonicityNot monotonic
2023-12-10T23:01:38.996902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1 5
16.7%
0 5
16.7%
2 4
13.3%
4 3
10.0%
3 2
 
6.7%
48 1
 
3.3%
6 1
 
3.3%
171 1
 
3.3%
7 1
 
3.3%
19 1
 
3.3%
Other values (6) 6
20.0%
ValueCountFrequency (%)
0 5
16.7%
1 5
16.7%
2 4
13.3%
3 2
 
6.7%
4 3
10.0%
5 1
 
3.3%
6 1
 
3.3%
7 1
 
3.3%
10 1
 
3.3%
11 1
 
3.3%
ValueCountFrequency (%)
3326 1
3.3%
171 1
3.3%
48 1
3.3%
24 1
3.3%
21 1
3.3%
19 1
3.3%
11 1
3.3%
10 1
3.3%
7 1
3.3%
6 1
3.3%

영상좋아요수
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean342.53333
Minimum4
Maximum4717
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:01:39.182096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile7
Q113
median18
Q385
95-th percentile2254.05
Maximum4717
Range4713
Interquartile range (IQR)72

Descriptive statistics

Standard deviation1002.2844
Coefficient of variation (CV)2.926093
Kurtosis13.606964
Mean342.53333
Median Absolute Deviation (MAD)11
Skewness3.6382023
Sum10276
Variance1004574
MonotonicityNot monotonic
2023-12-10T23:01:39.390973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
15 3
 
10.0%
7 3
 
10.0%
13 2
 
6.7%
166 1
 
3.3%
73 1
 
3.3%
8 1
 
3.3%
44 1
 
3.3%
1643 1
 
3.3%
4 1
 
3.3%
9 1
 
3.3%
Other values (15) 15
50.0%
ValueCountFrequency (%)
4 1
 
3.3%
7 3
10.0%
8 1
 
3.3%
9 1
 
3.3%
11 1
 
3.3%
13 2
6.7%
14 1
 
3.3%
15 3
10.0%
16 1
 
3.3%
17 1
 
3.3%
ValueCountFrequency (%)
4717 1
3.3%
2754 1
3.3%
1643 1
3.3%
166 1
3.3%
164 1
3.3%
155 1
3.3%
124 1
3.3%
89 1
3.3%
73 1
3.3%
62 1
3.3%

영상싫어요수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct10
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.533333
Minimum0
Maximum403
Zeros8
Zeros (%)26.7%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:01:39.642728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.25
median1.5
Q33.75
95-th percentile104.05
Maximum403
Range403
Interquartile range (IQR)3.5

Descriptive statistics

Standard deviation78.505758
Coefficient of variation (CV)3.6457782
Kurtosis20.88732
Mean21.533333
Median Absolute Deviation (MAD)1.5
Skewness4.482595
Sum646
Variance6163.154
MonotonicityNot monotonic
2023-12-10T23:01:39.843084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0 8
26.7%
1 7
23.3%
2 5
16.7%
4 2
 
6.7%
5 2
 
6.7%
3 2
 
6.7%
9 1
 
3.3%
172 1
 
3.3%
403 1
 
3.3%
21 1
 
3.3%
ValueCountFrequency (%)
0 8
26.7%
1 7
23.3%
2 5
16.7%
3 2
 
6.7%
4 2
 
6.7%
5 2
 
6.7%
9 1
 
3.3%
21 1
 
3.3%
172 1
 
3.3%
403 1
 
3.3%
ValueCountFrequency (%)
403 1
 
3.3%
172 1
 
3.3%
21 1
 
3.3%
9 1
 
3.3%
5 2
 
6.7%
4 2
 
6.7%
3 2
 
6.7%
2 5
16.7%
1 7
23.3%
0 8
26.7%

영상조회수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35761.333
Minimum510
Maximum646841
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:01:40.036881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum510
5-th percentile639.65
Q11471.25
median1749.5
Q33332.25
95-th percentile195242.65
Maximum646841
Range646331
Interquartile range (IQR)1861

Descriptive statistics

Standard deviation129778.48
Coefficient of variation (CV)3.6290169
Kurtosis18.596175
Mean35761.333
Median Absolute Deviation (MAD)587
Skewness4.2576413
Sum1072840
Variance1.6842455 × 1010
MonotonicityNot monotonic
2023-12-10T23:01:40.241318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1548 1
 
3.3%
1758 1
 
3.3%
1211 1
 
3.3%
1421 1
 
3.3%
3414 1
 
3.3%
3087 1
 
3.3%
34146 1
 
3.3%
551 1
 
3.3%
1741 1
 
3.3%
748 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
510 1
3.3%
551 1
3.3%
748 1
3.3%
1127 1
3.3%
1196 1
3.3%
1211 1
3.3%
1421 1
3.3%
1448 1
3.3%
1541 1
3.3%
1548 1
3.3%
ValueCountFrequency (%)
646841 1
3.3%
327049 1
3.3%
34146 1
3.3%
10231 1
3.3%
7101 1
3.3%
4284 1
3.3%
3809 1
3.3%
3414 1
3.3%
3087 1
3.3%
2782 1
3.3%

채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
SBS Entertainment
30 

Length

Max length17
Median length17
Mean length17
Min length17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSBS Entertainment
2nd rowSBS Entertainment
3rd rowSBS Entertainment
4th rowSBS Entertainment
5th rowSBS Entertainment

Common Values

ValueCountFrequency (%)
SBS Entertainment 30
100.0%

Length

2023-12-10T23:01:40.504871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:01:40.668865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
sbs 30
50.0%
entertainment 30
50.0%

Interactions

2023-12-10T23:01:36.798482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.059008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.620641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.190037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.949360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.208538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.741760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.331221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:37.072305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.324120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.882258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.453259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:37.241992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:35.478798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.041343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:01:36.620817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:01:40.770872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영상경로명영상덧글수영상좋아요수영상싫어요수영상조회수
영상경로명1.0001.0001.0001.0001.000
영상덧글수1.0001.0001.0001.0001.000
영상좋아요수1.0001.0001.0001.0001.000
영상싫어요수1.0001.0001.0001.0001.000
영상조회수1.0001.0001.0001.0001.000
2023-12-10T23:01:40.963079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영상덧글수영상좋아요수영상싫어요수영상조회수
영상덧글수1.0000.6790.5470.512
영상좋아요수0.6791.0000.6090.692
영상싫어요수0.5470.6091.0000.674
영상조회수0.5120.6920.6741.000

Missing values

2023-12-10T23:01:37.410494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:01:37.586268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

영상경로명수집일자영상덧글수영상좋아요수영상싫어요수영상조회수채널명
0https://www.youtube.com/watch?v=2RH_HLImyPI2020-08-014816691548SBS Entertainment
1https://www.youtube.com/watch?v=h7_Cwbw1Kcs2020-08-011112441679SBS Entertainment
2https://www.youtube.com/watch?v=mPkQhidX1kY2020-08-012416457101SBS Entertainment
3https://www.youtube.com/watch?v=KlkVQMrZ7Xg2020-08-0133262754172327049SBS Entertainment
4https://www.youtube.com/watch?v=gNA5Cm5Ydsc2020-08-012115543809SBS Entertainment
5https://www.youtube.com/watch?v=Ho-8E5MfrNg2020-08-0111501448SBS Entertainment
6https://www.youtube.com/watch?v=NjaQBlLEQ-Q2020-08-0152914284SBS Entertainment
7https://www.youtube.com/watch?v=02EH_iv5vXU2020-08-0121532084SBS Entertainment
8https://www.youtube.com/watch?v=hfmpIWDh5ew2020-08-0121611554SBS Entertainment
9https://www.youtube.com/watch?v=9KcsBZ9v22A2020-08-0111311196SBS Entertainment
영상경로명수집일자영상덧글수영상좋아요수영상싫어요수영상조회수채널명
20https://www.youtube.com/watch?v=okF9I5lEoQE2020-08-0171722370SBS Entertainment
21https://www.youtube.com/watch?v=UXtjoxm3YC82020-08-0111122256SBS Entertainment
22https://www.youtube.com/watch?v=JZk-SQ0b9T82020-08-01071748SBS Entertainment
23https://www.youtube.com/watch?v=O3_R9QJwO5I2020-08-010921741SBS Entertainment
24https://www.youtube.com/watch?v=GgbWbZxPZNg2020-08-01241551SBS Entertainment
25https://www.youtube.com/watch?v=BraV_lS5-XE2020-08-0117116432134146SBS Entertainment
26https://www.youtube.com/watch?v=8X_x2nZZUwQ2020-08-0131523087SBS Entertainment
27https://www.youtube.com/watch?v=oTn49e1d3AI2020-08-0164433414SBS Entertainment
28https://www.youtube.com/watch?v=rtmCNG28rv42020-08-0101301421SBS Entertainment
29https://www.youtube.com/watch?v=yHcjhLzuEwU2020-08-011801211SBS Entertainment