Overview

Dataset statistics

Number of variables: 9
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 80.4 B

Variable types

Categorical: 6
Text: 1
Numeric: 2

Dataset

Description: Sample data
Author: Hanyang University
URL: https://bigdata-region.kr/#/dataset/56887451-456b-4c11-aa9c-c453bd5f4e98

Alerts

이용자활동평가채널ID has constant value "" (Constant)
이용자활동평가생성기간 has constant value "" (Constant)
이용자활동평가채널조회수 has constant value "" (Constant)
이용자활동평가구독자수 has constant value "" (Constant)
이용자활동평가긍정평가비율 is highly overall correlated with 이용자활동평가영상조회수 and 2 other fields (High correlation)
이용자활동평가부정평가비율 is highly overall correlated with 이용자활동평가영상조회수 and 2 other fields (High correlation)
이용자활동평가영상조회수 is highly overall correlated with 이용자활동평가긍정평가비율 and 1 other fields (High correlation)
(unnamed variable) is highly overall correlated with 이용자활동평가긍정평가비율 and 1 other fields (High correlation)
이용자활동평가영상ID has unique values (Unique)
이용자활동평가영상조회수 has unique values (Unique)
(unnamed variable) has 14 (46.7%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-10 14:05:31.032263
Analysis finished: 2023-12-10 14:05:32.442972
Duration: 1.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
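
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch, assuming the data is available as a CSV file. The function name `build_report` and both file paths are hypothetical. The dataset metadata is the English rendering of the Dataset block, not the original configuration from config.json.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write an HTML report (sketch; paths are hypothetical)."""
    # Imported inside the function so this module still loads where the
    # third-party libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(
        df,
        # Metadata shown in the "Dataset" block of the overview.
        dataset={
            "description": "Sample data",
            "author": "Hanyang University",
            "url": "https://bigdata-region.kr/#/dataset/56887451-456b-4c11-aa9c-c453bd5f4e98",
        },
    )
    report.to_file(html_path)
```

Calling `build_report("sample.csv", "report.html")` with a real input file would write the HTML report next to it.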

Variables

이용자활동평가채널ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | 30

Length

Max length: 56
Median length: 56
Mean length: 56
Min length: 56

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img
2nd row: https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img
3rd row: https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img
4th row: https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img
5th row: https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img

Common Values

Value | Count | Frequency (%)
https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
https://www.youtube.com/channel/ucyylilojyqkafklvjzx5img | 30 | 100.0%

이용자활동평가영상ID
Text

UNIQUE

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 43
Median length: 43
Mean length: 43
Min length: 43

Characters and Unicode

Total characters: 1290
Distinct characters: 69
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
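
As an illustration of how such character properties can be read, the sketch below tallies the Unicode general category of each character in the first sample value of this variable, using Python's standard `unicodedata` module. It is not the profiler's own implementation. The variable names are chosen for this example only.

```python
import unicodedata
from collections import Counter

# First sample row of this variable.
url = "https://www.youtube.com/watch?v=MNkYC-JQqqU"

# unicodedata.category returns a two-letter general category code, e.g.
# "Ll" (Lowercase Letter), "Lu" (Uppercase Letter), "Po" (Other Punctuation),
# "Sm" (Math Symbol), "Pd" (Dash Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in url)

for code, count in category_counts.most_common():
    print(code, count)
```

Each value contains exactly one `=`, which is consistent with the 30 Math Symbol characters reported for the 30 observations.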

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: https://www.youtube.com/watch?v=MNkYC-JQqqU
2nd row: https://www.youtube.com/watch?v=pOlTIlpsp9U
3rd row: https://www.youtube.com/watch?v=lW5vZIptBEU
4th row: https://www.youtube.com/watch?v=-1Zr6SQvPxk
5th row: https://www.youtube.com/watch?v=tLwOJBdoMqM

Value | Count | Frequency (%)
https://www.youtube.com/watch?v=mnkyc-jqqqu | 1 | 3.3%
https://www.youtube.com/watch?v=poltilpsp9u | 1 | 3.3%
https://www.youtube.com/watch?v=za74fjxddq0 | 1 | 3.3%
https://www.youtube.com/watch?v=ff3erqg-ud0 | 1 | 3.3%
https://www.youtube.com/watch?v=k3rfljgfx-y | 1 | 3.3%
https://www.youtube.com/watch?v=tqivsbqgyx8 | 1 | 3.3%
https://www.youtube.com/watch?v=gofy7oa_uco | 1 | 3.3%
https://www.youtube.com/watch?v=ftvaskgjbk4 | 1 | 3.3%
https://www.youtube.com/watch?v=wyh2hlfjwzi | 1 | 3.3%
https://www.youtube.com/watch?v=rl9urllahlc | 1 | 3.3%
Other values (20) | 20 | 66.7%

Most occurring characters

Value | Count | Frequency (%)
t | 127 | 9.8%
w | 126 | 9.8%
/ | 90 | 7.0%
o | 67 | 5.2%
h | 63 | 4.9%
c | 63 | 4.9%
u | 63 | 4.9%
. | 60 | 4.7%
p | 38 | 2.9%
v | 37 | 2.9%
Other values (59) | 556 | 43.1%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 852 | 66.0%
Other Punctuation | 210 | 16.3%
Uppercase Letter | 146 | 11.3%
Decimal Number | 43 | 3.3%
Math Symbol | 30 | 2.3%
Dash Punctuation | 8 | 0.6%
Connector Punctuation | 1 | 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
t | 127 | 14.9%
w | 126 | 14.8%
o | 67 | 7.9%
h | 63 | 7.4%
c | 63 | 7.4%
u | 63 | 7.4%
p | 38 | 4.5%
v | 37 | 4.3%
b | 36 | 4.2%
m | 35 | 4.1%
Other values (16) | 197 | 23.1%

Uppercase Letter
Value | Count | Frequency (%)
X | 11 | 7.5%
I | 10 | 6.8%
L | 9 | 6.2%
U | 8 | 5.5%
Z | 8 | 5.5%
Q | 8 | 5.5%
V | 6 | 4.1%
Y | 6 | 4.1%
C | 6 | 4.1%
O | 5 | 3.4%
Other values (16) | 69 | 47.3%

Decimal Number
Value | Count | Frequency (%)
9 | 8 | 18.6%
7 | 6 | 14.0%
0 | 6 | 14.0%
6 | 5 | 11.6%
3 | 4 | 9.3%
4 | 4 | 9.3%
8 | 3 | 7.0%
2 | 3 | 7.0%
1 | 2 | 4.7%
5 | 2 | 4.7%

Other Punctuation
Value | Count | Frequency (%)
/ | 90 | 42.9%
. | 60 | 28.6%
? | 30 | 14.3%
: | 30 | 14.3%

Math Symbol
Value | Count | Frequency (%)
= | 30 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 8 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 998 | 77.4%
Common | 292 | 22.6%

Most frequent character per script

Latin
Value | Count | Frequency (%)
t | 127 | 12.7%
w | 126 | 12.6%
o | 67 | 6.7%
h | 63 | 6.3%
c | 63 | 6.3%
u | 63 | 6.3%
p | 38 | 3.8%
v | 37 | 3.7%
b | 36 | 3.6%
m | 35 | 3.5%
Other values (42) | 343 | 34.4%

Common
Value | Count | Frequency (%)
/ | 90 | 30.8%
. | 60 | 20.5%
? | 30 | 10.3%
: | 30 | 10.3%
= | 30 | 10.3%
9 | 8 | 2.7%
- | 8 | 2.7%
7 | 6 | 2.1%
0 | 6 | 2.1%
6 | 5 | 1.7%
Other values (7) | 19 | 6.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1290 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
t | 127 | 9.8%
w | 126 | 9.8%
/ | 90 | 7.0%
o | 67 | 5.2%
h | 63 | 4.9%
c | 63 | 4.9%
u | 63 | 4.9%
. | 60 | 4.7%
p | 38 | 2.9%
v | 37 | 2.9%
Other values (59) | 556 | 43.1%

이용자활동평가생성기간
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

2020-09-01 | 30

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020-09-01
2nd row: 2020-09-01
3rd row: 2020-09-01
4th row: 2020-09-01
5th row: 2020-09-01

Common Values

Value | Count | Frequency (%)
2020-09-01 | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2020-09-01 | 30 | 100.0%

이용자활동평가채널조회수
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

824734019 | 30

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 824734019
2nd row: 824734019
3rd row: 824734019
4th row: 824734019
5th row: 824734019

Common Values

Value | Count | Frequency (%)
824734019 | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
824734019 | 30 | 100.0%

이용자활동평가구독자수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

979000 | 30

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 979000
2nd row: 979000
3rd row: 979000
4th row: 979000
5th row: 979000

Common Values

Value | Count | Frequency (%)
979000 | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
979000 | 30 | 100.0%

이용자활동평가영상조회수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 878.9
Minimum: 79
Maximum: 6102
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 79
5th percentile: 83.7
Q1: 317
Median: 615.5
Q3: 987
95th percentile: 2043.5
Maximum: 6102
Range: 6023
Interquartile range (IQR): 670

Descriptive statistics

Standard deviation: 1124.7895
Coefficient of variation (CV): 1.2797696
Kurtosis: 16.567365
Mean: 878.9
Median Absolute Deviation (MAD): 357.5
Skewness: 3.7002055
Sum: 26367
Variance: 1265151.3
Monotonicity: Not monotonic
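
Several of these figures follow from one another, which gives a quick consistency check. The sketch below recomputes the derived values from the reported ones, using the standard definitions: mean = sum / n, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum. The variable names are chosen for this example only.

```python
# Reported values for this variable (30 observations).
n = 30
total = 26367                   # Sum
std_dev = 1124.7895             # Standard deviation
q1, q3 = 317, 987               # Q1 and Q3
minimum, maximum = 79, 6102

mean = total / n                # arithmetic mean
cv = std_dev / mean             # coefficient of variation
iqr = q3 - q1                   # interquartile range
value_range = maximum - minimum
variance = std_dev ** 2         # variance is the squared standard deviation

print(f"mean={mean:.1f} cv={cv:.7f} iqr={iqr} range={value_range} variance={variance:.1f}")
```

A CV above 1 together with a skewness of about 3.7 indicates a long right tail, driven here largely by the single maximum of 6102.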
Histogram with fixed size bins (bins=30)

Common values
Value | Count | Frequency (%)
635 | 1 | 3.3%
907 | 1 | 3.3%
6102 | 1 | 3.3%
1697 | 1 | 3.3%
1332 | 1 | 3.3%
1080 | 1 | 3.3%
418 | 1 | 3.3%
537 | 1 | 3.3%
562 | 1 | 3.3%
426 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
79 | 1 | 3.3%
81 | 1 | 3.3%
87 | 1 | 3.3%
113 | 1 | 3.3%
124 | 1 | 3.3%
148 | 1 | 3.3%
242 | 1 | 3.3%
297 | 1 | 3.3%
377 | 1 | 3.3%
418 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
6102 | 1 | 3.3%
2075 | 1 | 3.3%
2005 | 1 | 3.3%
1697 | 1 | 3.3%
1332 | 1 | 3.3%
1160 | 1 | 3.3%
1080 | 1 | 3.3%
997 | 1 | 3.3%
957 | 1 | 3.3%
907 | 1 | 3.3%

이용자활동평가긍정평가비율
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 46.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

100.0 | 11
\N | 5
66.7 | 3
88.0 | 1
92.9 | 1
Other values (9) | 9

Length

Max length: 5
Median length: 4
Mean length: 4.0333333
Min length: 2

Unique

Unique: 11
Unique (%): 36.7%

Sample

1st row: 100.0
2nd row: 88.0
3rd row: 92.9
4th row: 100.0
5th row: 100.0

Common Values

Value | Count | Frequency (%)
100.0 | 11 | 36.7%
\N | 5 | 16.7%
66.7 | 3 | 10.0%
88.0 | 1 | 3.3%
92.9 | 1 | 3.3%
63.6 | 1 | 3.3%
85.7 | 1 | 3.3%
42.9 | 1 | 3.3%
90.0 | 1 | 3.3%
77.8 | 1 | 3.3%
Other values (4) | 4 | 13.3%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
100.0 | 11 | 36.7%
n | 5 | 16.7%
66.7 | 3 | 10.0%
88.0 | 1 | 3.3%
92.9 | 1 | 3.3%
63.6 | 1 | 3.3%
85.7 | 1 | 3.3%
42.9 | 1 | 3.3%
90.0 | 1 | 3.3%
77.8 | 1 | 3.3%
Other values (4) | 4 | 13.3%
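
This percentage column is reported as Categorical with zero missing values, most likely because 5 of its 30 cells hold the literal string `\N` rather than a number. The sketch below shows one way to turn such cells into numeric values before re-profiling. It assumes that `\N` marks a missing rating, which the report itself does not state. The names `NULL_MARKER` and `parse_ratio` are hypothetical.

```python
from typing import Optional

NULL_MARKER = r"\N"  # assumption: this sentinel stands for a missing rating


def parse_ratio(raw: str) -> Optional[float]:
    """Convert a raw cell to a float percentage, or None for the sentinel."""
    return None if raw == NULL_MARKER else float(raw)


# Values of this column in the first ten sample rows of the report.
raw_column = ["100.0", "88.0", "92.9", "100.0", "100.0",
              r"\N", "66.7", r"\N", r"\N", r"\N"]

parsed = [parse_ratio(cell) for cell in raw_column]
present = [value for value in parsed if value is not None]

print(f"missing={parsed.count(None)} present={len(present)}")
```

With the sentinel mapped to a real missing value, a profiler could treat the column as numeric and count those cells as missing.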

이용자활동평가부정평가비율
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 46.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

0.0 | 11
\N | 5
33.3 | 3
12.0 | 1
7.1 | 1
Other values (9) | 9

Length

Max length: 4
Median length: 3.5
Mean length: 3.1666667
Min length: 2

Unique

Unique: 11
Unique (%): 36.7%

Sample

1st row: 0.0
2nd row: 12.0
3rd row: 7.1
4th row: 0.0
5th row: 0.0

Common Values

Value | Count | Frequency (%)
0.0 | 11 | 36.7%
\N | 5 | 16.7%
33.3 | 3 | 10.0%
12.0 | 1 | 3.3%
7.1 | 1 | 3.3%
36.4 | 1 | 3.3%
14.3 | 1 | 3.3%
57.1 | 1 | 3.3%
10.0 | 1 | 3.3%
22.2 | 1 | 3.3%
Other values (4) | 4 | 13.3%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
0.0 | 11 | 36.7%
n | 5 | 16.7%
33.3 | 3 | 10.0%
12.0 | 1 | 3.3%
7.1 | 1 | 3.3%
36.4 | 1 | 3.3%
14.3 | 1 | 3.3%
57.1 | 1 | 3.3%
10.0 | 1 | 3.3%
22.2 | 1 | 3.3%
Other values (4) | 4 | 13.3%


Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 10
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.1666667
Minimum: 0
Maximum: 35
Zeros: 14
Zeros (%): 46.7%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 1
Q3: 4
95th percentile: 8.55
Maximum: 35
Range: 35
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 6.6232783
Coefficient of variation (CV): 2.0915616
Kurtosis: 19.383273
Mean: 3.1666667
Median Absolute Deviation (MAD): 1
Skewness: 4.098425
Sum: 95
Variance: 43.867816
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=10)
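
Common values
Value | Count | Frequency (%)
0 | 14 | 46.7%
1 | 4 | 13.3%
4 | 2 | 6.7%
5 | 2 | 6.7%
8 | 2 | 6.7%
2 | 2 | 6.7%
3 | 1 | 3.3%
9 | 1 | 3.3%
6 | 1 | 3.3%
35 | 1 | 3.3%

Minimum 10 values
Value | Count | Frequency (%)
0 | 14 | 46.7%
1 | 4 | 13.3%
2 | 2 | 6.7%
3 | 1 | 3.3%
4 | 2 | 6.7%
5 | 2 | 6.7%
6 | 1 | 3.3%
8 | 2 | 6.7%
9 | 1 | 3.3%
35 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
35 | 1 | 3.3%
9 | 1 | 3.3%
8 | 2 | 6.7%
6 | 1 | 3.3%
5 | 2 | 6.7%
4 | 2 | 6.7%
3 | 1 | 3.3%
2 | 2 | 6.7%
1 | 4 | 13.3%
0 | 14 | 46.7%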
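
Because this variable has only 10 distinct values, the whole column can be rebuilt from its frequency table and the summary statistics recomputed as a check. The sketch below does so with Python's standard `statistics` module. It assumes the reported variance uses the sample (n - 1) denominator, which is what `statistics.variance` computes.

```python
import statistics

# value -> count, copied from the frequency table of this variable.
frequency = {0: 14, 1: 4, 2: 2, 3: 1, 4: 2, 5: 2, 6: 1, 8: 2, 9: 1, 35: 1}

# Expand the table back into the 30 individual observations.
values = [value for value, count in frequency.items() for _ in range(count)]

n = len(values)
mean = statistics.mean(values)
median = statistics.median(values)
sample_variance = statistics.variance(values)  # n - 1 denominator
zero_share = frequency[0] / n                  # fraction of zero-valued rows

print(f"n={n} mean={mean:.7f} median={median} "
      f"variance={sample_variance:.6f} zeros={zero_share:.1%}")
```

The mean lies well above the median, and the gap is largely attributable to the single observation of 35.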

Interactions


Correlations

Variable | 이용자활동평가영상ID | 이용자활동평가영상조회수 | 이용자활동평가긍정평가비율 | 이용자활동평가부정평가비율 | (unnamed)
이용자활동평가영상ID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
이용자활동평가영상조회수 | 1.000 | 1.000 | 0.853 | 0.853 | 0.624
이용자활동평가긍정평가비율 | 1.000 | 0.853 | 1.000 | 1.000 | 0.943
이용자활동평가부정평가비율 | 1.000 | 0.853 | 1.000 | 1.000 | 0.943
(unnamed) | 1.000 | 0.624 | 0.943 | 0.943 | 1.000

Variable | 이용자활동평가긍정평가비율 | 이용자활동평가부정평가비율
이용자활동평가긍정평가비율 | 1.000 | 1.000
이용자활동평가부정평가비율 | 1.000 | 1.000
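
A coefficient of 1.000 between the positive and negative rating ratios is what one would expect if the two columns are complementary, so that each value fully determines the other. The sketch below checks this on the rated rows shown in the report's sample. It illustrates the relationship only and does not reproduce the correlation measure the profiler used.

```python
import math

# (positive %, negative %) pairs from sample rows that carry ratings.
pairs = [
    (100.0, 0.0), (88.0, 12.0), (92.9, 7.1), (66.7, 33.3),
    (95.2, 4.8), (75.0, 25.0), (93.3, 6.7), (94.9, 5.1),
]

# If the two columns are complements, every pair sums to 100
# (compared with a tolerance to absorb floating-point rounding).
sums = [positive + negative for positive, negative in pairs]
all_complementary = all(math.isclose(total, 100.0, abs_tol=1e-9) for total in sums)

print(all_complementary)
```

If this holds for every row, one of the two columns is redundant for modelling purposes.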

Variable | 이용자활동평가영상조회수 | (unnamed) | 이용자활동평가긍정평가비율 | 이용자활동평가부정평가비율
이용자활동평가영상조회수 | 1.000 | 0.493 | 0.517 | 0.517
(unnamed) | 0.493 | 1.000 | 0.661 | 0.661
이용자활동평가긍정평가비율 | 0.517 | 0.661 | 1.000 | 1.000
이용자활동평가부정평가비율 | 0.517 | 0.661 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 이용자활동평가채널ID | 이용자활동평가영상ID | 이용자활동평가생성기간 | 이용자활동평가채널조회수 | 이용자활동평가구독자수 | 이용자활동평가영상조회수 | 이용자활동평가긍정평가비율 | 이용자활동평가부정평가비율 | (unnamed)
0 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=MNkYC-JQqqU | 2020-09-01 | 824734019 | 979000 | 635 | 100.0 | 0.0 | 3
1 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=pOlTIlpsp9U | 2020-09-01 | 824734019 | 979000 | 1160 | 88.0 | 12.0 | 1
2 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=lW5vZIptBEU | 2020-09-01 | 824734019 | 979000 | 659 | 92.9 | 7.1 | 0
3 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=-1Zr6SQvPxk | 2020-09-01 | 824734019 | 979000 | 297 | 100.0 | 0.0 | 0
4 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=tLwOJBdoMqM | 2020-09-01 | 824734019 | 979000 | 242 | 100.0 | 0.0 | 0
5 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=LjGXVmiDzh4 | 2020-09-01 | 824734019 | 979000 | 81 | \N | \N | 0
6 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=pWwXiOmNHiw | 2020-09-01 | 824734019 | 979000 | 124 | 66.7 | 33.3 | 0
7 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=kZqxgVSQQeU | 2020-09-01 | 824734019 | 979000 | 148 | \N | \N | 0
8 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=90X3SbSmdz8 | 2020-09-01 | 824734019 | 979000 | 79 | \N | \N | 0
9 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=0GIQdKK2V0o | 2020-09-01 | 824734019 | 979000 | 113 | \N | \N | 0

Last rows

Index | 이용자활동평가채널ID | 이용자활동평가영상ID | 이용자활동평가생성기간 | 이용자활동평가채널조회수 | 이용자활동평가구독자수 | 이용자활동평가영상조회수 | 이용자활동평가긍정평가비율 | 이용자활동평가부정평가비율 | (unnamed)
20 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=Pu69V7NHjiw | 2020-09-01 | 824734019 | 979000 | 2075 | 100.0 | 0.0 | 2
21 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=rL9URLLAHlc | 2020-09-01 | 824734019 | 979000 | 2005 | 95.2 | 4.8 | 1
22 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=WYH2hLFjWZI | 2020-09-01 | 824734019 | 979000 | 426 | 75.0 | 25.0 | 4
23 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=fTVasKgjbK4 | 2020-09-01 | 824734019 | 979000 | 562 | 100.0 | 0.0 | 6
24 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=gofy7OA_uco | 2020-09-01 | 824734019 | 979000 | 537 | 66.7 | 33.3 | 8
25 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=tqIvsbQgyX8 | 2020-09-01 | 824734019 | 979000 | 418 | 100.0 | 0.0 | 1
26 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=k3rfLjGfX-Y | 2020-09-01 | 824734019 | 979000 | 1080 | 100.0 | 0.0 | 0
27 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=FF3erQG-UD0 | 2020-09-01 | 824734019 | 979000 | 1332 | 100.0 | 0.0 | 2
28 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=za74fjXDDq0 | 2020-09-01 | 824734019 | 979000 | 1697 | 93.3 | 6.7 | 0
29 | https://www.youtube.com/channel/UCYyLIlOJyqkAFKlVjzX5img | https://www.youtube.com/watch?v=93ZbCRg6IZE | 2020-09-01 | 824734019 | 979000 | 6102 | 94.9 | 5.1 | 35