Overview

Dataset statistics

Number of variables7
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory64.4 B

Variable types

Text1
Categorical2
Numeric4

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/3f736d93-c069-41b3-ba6c-c9ed364214fc

Alerts

참여영상수집일자 has constant value ""Constant
영상영상채널명 has constant value ""Constant
is highly overall correlated with 참여영상좋아요수 and 2 other fieldsHigh correlation
참여영상좋아요수 is highly overall correlated with and 2 other fieldsHigh correlation
참여영상싫어요수 is highly overall correlated with and 2 other fieldsHigh correlation
참여영상시청수 is highly overall correlated with and 2 other fieldsHigh correlation
참여영상경로명 has unique valuesUnique
has unique valuesUnique
참여영상좋아요수 has unique valuesUnique
참여영상시청수 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:57:17.920351
Analysis finished2023-12-10 13:57:22.252613
Duration4.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T22:57:22.532709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length43
Mean length43
Min length43

Characters and Unicode

Total characters1290
Distinct characters69
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowhttps://www.youtube.com/watch?v=xXh95-_Mxsc
2nd rowhttps://www.youtube.com/watch?v=udgboFStMkU
3rd rowhttps://www.youtube.com/watch?v=N5KDjJQ06Hw
4th rowhttps://www.youtube.com/watch?v=6IOwwtqx6SM
5th rowhttps://www.youtube.com/watch?v=QmREewMKM98
ValueCountFrequency (%)
https://www.youtube.com/watch?v=xxh95-_mxsc 1
 
3.3%
https://www.youtube.com/watch?v=udgbofstmku 1
 
3.3%
https://www.youtube.com/watch?v=8ctd-la9a2m 1
 
3.3%
https://www.youtube.com/watch?v=cdk42un3sbw 1
 
3.3%
https://www.youtube.com/watch?v=cdh5gpvcmye 1
 
3.3%
https://www.youtube.com/watch?v=dqtdkoee4ky 1
 
3.3%
https://www.youtube.com/watch?v=f7tnjcpemoy 1
 
3.3%
https://www.youtube.com/watch?v=imdvvfy9slk 1
 
3.3%
https://www.youtube.com/watch?v=-3khqshw9l4 1
 
3.3%
https://www.youtube.com/watch?v=x8sqjwkemg4 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T22:57:23.222776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 126
 
9.8%
t 124
 
9.6%
/ 90
 
7.0%
h 69
 
5.3%
u 66
 
5.1%
c 66
 
5.1%
o 65
 
5.0%
. 60
 
4.7%
m 38
 
2.9%
e 35
 
2.7%
Other values (59) 551
42.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 845
65.5%
Other Punctuation 210
 
16.3%
Uppercase Letter 141
 
10.9%
Decimal Number 54
 
4.2%
Math Symbol 30
 
2.3%
Dash Punctuation 5
 
0.4%
Connector Punctuation 5
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 126
14.9%
t 124
14.7%
h 69
 
8.2%
u 66
 
7.8%
c 66
 
7.8%
o 65
 
7.7%
m 38
 
4.5%
e 35
 
4.1%
v 34
 
4.0%
p 34
 
4.0%
Other values (16) 188
22.2%
Uppercase Letter
ValueCountFrequency (%)
M 12
 
8.5%
S 8
 
5.7%
E 8
 
5.7%
V 8
 
5.7%
A 8
 
5.7%
K 7
 
5.0%
I 7
 
5.0%
Q 6
 
4.3%
R 6
 
4.3%
Y 6
 
4.3%
Other values (16) 65
46.1%
Decimal Number
ValueCountFrequency (%)
3 7
13.0%
5 7
13.0%
9 7
13.0%
4 6
11.1%
6 6
11.1%
0 5
9.3%
2 4
7.4%
8 4
7.4%
1 4
7.4%
7 4
7.4%
Other Punctuation
ValueCountFrequency (%)
/ 90
42.9%
. 60
28.6%
? 30
 
14.3%
: 30
 
14.3%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 986
76.4%
Common 304
 
23.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 126
 
12.8%
t 124
 
12.6%
h 69
 
7.0%
u 66
 
6.7%
c 66
 
6.7%
o 65
 
6.6%
m 38
 
3.9%
e 35
 
3.5%
v 34
 
3.4%
p 34
 
3.4%
Other values (42) 329
33.4%
Common
ValueCountFrequency (%)
/ 90
29.6%
. 60
19.7%
= 30
 
9.9%
? 30
 
9.9%
: 30
 
9.9%
3 7
 
2.3%
5 7
 
2.3%
9 7
 
2.3%
4 6
 
2.0%
6 6
 
2.0%
Other values (7) 31
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 126
 
9.8%
t 124
 
9.6%
/ 90
 
7.0%
h 69
 
5.3%
u 66
 
5.1%
c 66
 
5.1%
o 65
 
5.0%
. 60
 
4.7%
m 38
 
2.9%
e 35
 
2.7%
Other values (59) 551
42.7%

참여영상수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2020-10-01
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-10-01
2nd row2020-10-01
3rd row2020-10-01
4th row2020-10-01
5th row2020-10-01

Common Values

ValueCountFrequency (%)
2020-10-01 30
100.0%

Length

2023-12-10T22:57:23.496020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:57:23.661655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-10-01 30
100.0%


Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean326.6
Minimum26
Maximum1770
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:57:23.815906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum26
5-th percentile35.45
Q158.75
median134.5
Q3239.25
95-th percentile1352.3
Maximum1770
Range1744
Interquartile range (IQR)180.5

Descriptive statistics

Standard deviation453.22413
Coefficient of variation (CV)1.387704
Kurtosis3.5382698
Mean326.6
Median Absolute Deviation (MAD)90
Skewness2.0377964
Sum9798
Variance205412.11
MonotonicityNot monotonic
2023-12-10T22:57:24.182430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
241 1
 
3.3%
77 1
 
3.3%
147 1
 
3.3%
1472 1
 
3.3%
190 1
 
3.3%
617 1
 
3.3%
36 1
 
3.3%
1770 1
 
3.3%
41 1
 
3.3%
116 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
26 1
3.3%
35 1
3.3%
36 1
3.3%
41 1
3.3%
43 1
3.3%
44 1
3.3%
49 1
3.3%
54 1
3.3%
73 1
3.3%
76 1
3.3%
ValueCountFrequency (%)
1770 1
3.3%
1472 1
3.3%
1206 1
3.3%
799 1
3.3%
743 1
3.3%
673 1
3.3%
617 1
3.3%
241 1
3.3%
234 1
3.3%
224 1
3.3%

참여영상좋아요수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6541
Minimum305
Maximum57033
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:57:24.601739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum305
5-th percentile490.15
Q11252.5
median2380
Q34696.25
95-th percentile23875.8
Maximum57033
Range56728
Interquartile range (IQR)3443.75

Descriptive statistics

Standard deviation11564.667
Coefficient of variation (CV)1.7680274
Kurtosis13.01872
Mean6541
Median Absolute Deviation (MAD)1476.5
Skewness3.4028732
Sum196230
Variance1.3374152 × 108
MonotonicityNot monotonic
2023-12-10T22:57:24.901912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
3733 1
 
3.3%
1244 1
 
3.3%
4939 1
 
3.3%
57033 1
 
3.3%
3572 1
 
3.3%
10637 1
 
3.3%
451 1
 
3.3%
31254 1
 
3.3%
538 1
 
3.3%
2185 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
305 1
3.3%
451 1
3.3%
538 1
3.3%
592 1
3.3%
640 1
3.3%
813 1
3.3%
994 1
3.3%
1244 1
3.3%
1278 1
3.3%
1290 1
3.3%
ValueCountFrequency (%)
57033 1
3.3%
31254 1
3.3%
14858 1
3.3%
13904 1
3.3%
12211 1
3.3%
11456 1
3.3%
10637 1
3.3%
4939 1
3.3%
3968 1
3.3%
3733 1
3.3%

참여영상싫어요수
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.666667
Minimum1
Maximum285
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:57:25.089139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q15.25
median10
Q320.75
95-th percentile96.4
Maximum285
Range284
Interquartile range (IQR)15.5

Descriptive statistics

Standard deviation54.319193
Coefficient of variation (CV)1.9633443
Kurtosis18.39292
Mean27.666667
Median Absolute Deviation (MAD)6
Skewness4.0804642
Sum830
Variance2950.5747
MonotonicityNot monotonic
2023-12-10T22:57:25.304804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
10 4
 
13.3%
5 3
 
10.0%
4 2
 
6.7%
6 2
 
6.7%
20 1
 
3.3%
70 1
 
3.3%
285 1
 
3.3%
118 1
 
3.3%
1 1
 
3.3%
58 1
 
3.3%
Other values (13) 13
43.3%
ValueCountFrequency (%)
1 1
 
3.3%
2 1
 
3.3%
3 1
 
3.3%
4 2
6.7%
5 3
10.0%
6 2
6.7%
7 1
 
3.3%
9 1
 
3.3%
10 4
13.3%
12 1
 
3.3%
ValueCountFrequency (%)
285 1
3.3%
118 1
3.3%
70 1
3.3%
58 1
3.3%
31 1
3.3%
27 1
3.3%
25 1
3.3%
21 1
3.3%
20 1
3.3%
18 1
3.3%

참여영상시청수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49747.767
Minimum1597
Maximum387961
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:57:25.588406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1597
5-th percentile4001.85
Q18839
median19428
Q354185.25
95-th percentile198988.15
Maximum387961
Range386364
Interquartile range (IQR)45346.25

Descriptive statistics

Standard deviation80974.606
Coefficient of variation (CV)1.6277033
Kurtosis10.702401
Mean49747.767
Median Absolute Deviation (MAD)12362.5
Skewness3.1034164
Sum1492433
Variance6.5568868 × 109
MonotonicityNot monotonic
2023-12-10T22:57:25.875223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
22382 1
 
3.3%
11759 1
 
3.3%
57685 1
 
3.3%
387961 1
 
3.3%
32898 1
 
3.3%
170755 1
 
3.3%
5340 1
 
3.3%
222088 1
 
3.3%
4493 1
 
3.3%
12099 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1597 1
3.3%
3600 1
3.3%
4493 1
3.3%
5340 1
3.3%
6791 1
3.3%
7340 1
3.3%
7750 1
3.3%
8733 1
3.3%
9157 1
3.3%
11759 1
3.3%
ValueCountFrequency (%)
387961 1
3.3%
222088 1
3.3%
170755 1
3.3%
89617 1
3.3%
83002 1
3.3%
70978 1
3.3%
65837 1
3.3%
57685 1
3.3%
43686 1
3.3%
32898 1
3.3%

영상영상채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
MBCkpop
30 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMBCkpop
2nd rowMBCkpop
3rd rowMBCkpop
4th rowMBCkpop
5th rowMBCkpop

Common Values

ValueCountFrequency (%)
MBCkpop 30
100.0%

Length

2023-12-10T22:57:26.468752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:57:26.775681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
mbckpop 30
100.0%

Interactions

2023-12-10T22:57:21.040448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:18.434477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:19.297074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.485733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:21.196847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:18.664566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:19.933074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.629617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:21.370122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:18.911119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.172191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.777874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:21.538422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:19.124242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.327030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:57:20.895519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:57:26.910062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여영상경로명참여영상좋아요수참여영상싫어요수참여영상시청수
참여영상경로명1.0001.0001.0001.0001.000
1.0001.0001.0000.8240.893
참여영상좋아요수1.0001.0001.0000.9570.928
참여영상싫어요수1.0000.8240.9571.0000.903
참여영상시청수1.0000.8930.9280.9031.000
2023-12-10T22:57:27.148678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여영상좋아요수참여영상싫어요수참여영상시청수
1.0000.9620.7330.883
참여영상좋아요수0.9621.0000.7890.907
참여영상싫어요수0.7330.7891.0000.831
참여영상시청수0.8830.9070.8311.000

Missing values

2023-12-10T22:57:21.783804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:57:22.166489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

참여영상경로명참여영상수집일자참여영상좋아요수참여영상싫어요수참여영상시청수영상영상채널명
0https://www.youtube.com/watch?v=xXh95-_Mxsc2020-10-0124137331822382MBCkpop
1https://www.youtube.com/watch?v=udgboFStMkU2020-10-0117025752118145MBCkpop
2https://www.youtube.com/watch?v=N5KDjJQ06Hw2020-10-0154129047750MBCkpop
3https://www.youtube.com/watch?v=6IOwwtqx6SM2020-10-01104127876791MBCkpop
4https://www.youtube.com/watch?v=QmREewMKM982020-10-011603017616145MBCkpop
5https://www.youtube.com/watch?v=V6Rvoiddm3Q2020-10-013559259157MBCkpop
6https://www.youtube.com/watch?v=VEOVP0MhS6Q2020-10-0173150967340MBCkpop
7https://www.youtube.com/watch?v=P_sRLyhnDco2020-10-01109148638733MBCkpop
8https://www.youtube.com/watch?v=mWIEal9Ao9A2020-10-014481343600MBCkpop
9https://www.youtube.com/watch?v=4s5IjVKuqU82020-10-0149994516325MBCkpop
참여영상경로명참여영상수집일자참여영상좋아요수참여영상싫어요수참여영상시청수영상영상채널명
20https://www.youtube.com/watch?v=W-ZK2uPq7jA2020-10-0114539682723767MBCkpop
21https://www.youtube.com/watch?v=x8SqJWKemG42020-10-012630521597MBCkpop
22https://www.youtube.com/watch?v=-3KhqShw9L42020-10-011162185512099MBCkpop
23https://www.youtube.com/watch?v=ImdVVFY9Slk2020-10-0141538104493MBCkpop
24https://www.youtube.com/watch?v=F7TNJCpEMOY2020-10-0117703125458222088MBCkpop
25https://www.youtube.com/watch?v=DqTdkOeE4kY2020-10-013645115340MBCkpop
26https://www.youtube.com/watch?v=cDh5gpvCMYE2020-10-0161710637118170755MBCkpop
27https://www.youtube.com/watch?v=Cdk42uN3SBw2020-10-0119035721032898MBCkpop
28https://www.youtube.com/watch?v=8cTD-lA9A2M2020-10-01147257033285387961MBCkpop
29https://www.youtube.com/watch?v=TRCkeUt1mZI2020-10-0114749397057685MBCkpop