Overview

Dataset statistics

Number of variables5
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory46.4 B

Variable types

Text2
DateTime1
Numeric2

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/44bc4c3b-a071-446f-af61-370bb8cca862

Alerts

참여채널수집일자 has constant value ""Constant
is highly overall correlated with 채널채널조회수High correlation
채널채널조회수 is highly overall correlated with High correlation
참여채널경로명 has unique valuesUnique
has unique valuesUnique
채널채널조회수 has unique valuesUnique
채널채널명 has unique valuesUnique
has 1 (3.3%) zerosZeros
채널채널조회수 has 1 (3.3%) zerosZeros

Reproduction

Analysis started2023-12-10 14:11:09.315376
Analysis finished2023-12-10 14:11:10.699868
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:11:11.107821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length56
Mean length56
Min length56

Characters and Unicode

Total characters1680
Distinct characters67
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowhttps://www.youtube.com/channel/UCJz-yzuop09u0M17oCMdi2A
2nd rowhttps://www.youtube.com/channel/UC2pq2pfYqLXSGh4ryIfoMdg
3rd rowhttps://www.youtube.com/channel/UCnrPF6de3KuHZy3c2ZkqcMA
4th rowhttps://www.youtube.com/channel/UCVuNSSnNg8BTbZwaTeBzXdA
5th rowhttps://www.youtube.com/channel/UCKgWbt6D7Hy9xnvJjAN_4Ug
ValueCountFrequency (%)
https://www.youtube.com/channel/ucjz-yzuop09u0m17ocmdi2a 1
 
3.3%
https://www.youtube.com/channel/uc2pq2pfyqlxsgh4ryifomdg 1
 
3.3%
https://www.youtube.com/channel/ucsuz7amaob56wsqgpqunefa 1
 
3.3%
https://www.youtube.com/channel/ucb0hqs8zw8qhkkpyp6dfy0g 1
 
3.3%
https://www.youtube.com/channel/uc9hupc-qo7mufw4ofckwwnw 1
 
3.3%
https://www.youtube.com/channel/ucueebclgesoagtbc3ezfavg 1
 
3.3%
https://www.youtube.com/channel/uccv5wvp85ob-vx_ewphex-w 1
 
3.3%
https://www.youtube.com/channel/ucxvx64o-lj9m4co9zm6druq 1
 
3.3%
https://www.youtube.com/channel/uc9wajb9xafyu8oz9_fcpfdq 1
 
3.3%
https://www.youtube.com/channel/ucw0dk4y2ev-3clla7mdcila 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:11:12.159881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 120
 
7.1%
w 107
 
6.4%
t 94
 
5.6%
e 75
 
4.5%
o 75
 
4.5%
u 75
 
4.5%
n 71
 
4.2%
c 68
 
4.0%
h 68
 
4.0%
. 60
 
3.6%
Other values (57) 867
51.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1019
60.7%
Uppercase Letter 324
 
19.3%
Other Punctuation 210
 
12.5%
Decimal Number 104
 
6.2%
Dash Punctuation 12
 
0.7%
Connector Punctuation 11
 
0.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 107
 
10.5%
t 94
 
9.2%
e 75
 
7.4%
o 75
 
7.4%
u 75
 
7.4%
n 71
 
7.0%
c 68
 
6.7%
h 68
 
6.7%
y 44
 
4.3%
p 42
 
4.1%
Other values (16) 300
29.4%
Uppercase Letter
ValueCountFrequency (%)
U 40
 
12.3%
C 37
 
11.4%
A 17
 
5.2%
Q 15
 
4.6%
H 14
 
4.3%
F 13
 
4.0%
M 13
 
4.0%
V 12
 
3.7%
E 12
 
3.7%
R 12
 
3.7%
Other values (16) 139
42.9%
Decimal Number
ValueCountFrequency (%)
6 13
12.5%
2 13
12.5%
9 13
12.5%
4 12
11.5%
0 12
11.5%
8 11
10.6%
7 11
10.6%
3 9
8.7%
5 5
 
4.8%
1 5
 
4.8%
Other Punctuation
ValueCountFrequency (%)
/ 120
57.1%
. 60
28.6%
: 30
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1343
79.9%
Common 337
 
20.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 107
 
8.0%
t 94
 
7.0%
e 75
 
5.6%
o 75
 
5.6%
u 75
 
5.6%
n 71
 
5.3%
c 68
 
5.1%
h 68
 
5.1%
y 44
 
3.3%
p 42
 
3.1%
Other values (42) 624
46.5%
Common
ValueCountFrequency (%)
/ 120
35.6%
. 60
17.8%
: 30
 
8.9%
6 13
 
3.9%
2 13
 
3.9%
9 13
 
3.9%
- 12
 
3.6%
4 12
 
3.6%
0 12
 
3.6%
8 11
 
3.3%
Other values (5) 41
 
12.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1680
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 120
 
7.1%
w 107
 
6.4%
t 94
 
5.6%
e 75
 
4.5%
o 75
 
4.5%
u 75
 
4.5%
n 71
 
4.2%
c 68
 
4.0%
h 68
 
4.0%
. 60
 
3.6%
Other values (57) 867
51.6%
Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2020-12-10 00:00:00
Maximum2020-12-10 00:00:00
2023-12-10T23:11:12.587890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:11:12.770581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)


Real number (ℝ)

HIGH CORRELATION  UNIQUE  ZEROS 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean177.13333
Minimum0
Maximum993
Zeros1
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:11:12.974382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile19.15
Q153.5
median140.5
Q3216
95-th percentile421.7
Maximum993
Range993
Interquartile range (IQR)162.5

Descriptive statistics

Standard deviation192.7179
Coefficient of variation (CV)1.0879821
Kurtosis10.776319
Mean177.13333
Median Absolute Deviation (MAD)86.5
Skewness2.8583538
Sum5314
Variance37140.189
MonotonicityNot monotonic
2023-12-10T23:11:13.216738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
155 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
53 1
 
3.3%
138 1
 
3.3%
189 1
 
3.3%
55 1
 
3.3%
50 1
 
3.3%
283 1
 
3.3%
65 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
0 1
3.3%
16 1
3.3%
23 1
3.3%
24 1
3.3%
38 1
3.3%
39 1
3.3%
50 1
3.3%
53 1
3.3%
55 1
3.3%
65 1
3.3%
ValueCountFrequency (%)
993 1
3.3%
491 1
3.3%
337 1
3.3%
323 1
3.3%
315 1
3.3%
283 1
3.3%
237 1
3.3%
217 1
3.3%
213 1
3.3%
189 1
3.3%

채널채널조회수
Real number (ℝ)

HIGH CORRELATION  UNIQUE  ZEROS 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68439432
Minimum0
Maximum9.1669911 × 108
Zeros1
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:11:13.435413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile13527.1
Q1195269.75
median3463573.5
Q317644880
95-th percentile4.4604647 × 108
Maximum9.1669911 × 108
Range9.1669911 × 108
Interquartile range (IQR)17449610

Descriptive statistics

Standard deviation2.0013893 × 108
Coefficient of variation (CV)2.9243219
Kurtosis13.090585
Mean68439432
Median Absolute Deviation (MAD)3437741
Skewness3.6338449
Sum2.053183 × 109
Variance4.0055591 × 1016
MonotonicityNot monotonic
2023-12-10T23:11:13.676208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1520392 1
 
3.3%
7086747 1
 
3.3%
30065 1
 
3.3%
53281 1
 
3.3%
143864 1
 
3.3%
18255025 1
 
3.3%
925750 1
 
3.3%
104400 1
 
3.3%
62796694 1
 
3.3%
1272792 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
0 1
3.3%
6922 1
3.3%
21600 1
3.3%
30065 1
3.3%
53281 1
3.3%
104400 1
3.3%
143864 1
3.3%
161033 1
3.3%
297980 1
3.3%
306358 1
3.3%
ValueCountFrequency (%)
916699108 1
3.3%
634568550 1
3.3%
215630601 1
3.3%
65974721 1
3.3%
62796694 1
3.3%
53997396 1
3.3%
24112036 1
3.3%
18255025 1
3.3%
15814446 1
3.3%
8086827 1
3.3%

채널채널명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:11:14.196469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length17
Mean length10.733333
Min length3

Characters and Unicode

Total characters322
Distinct characters137
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowSONGHYUNC FILM
2nd rowoppa coreano
3rd row기획재정부
4th row서울시립대학교종합사회복지관
5th row정원이의 여행일기Jeongwon Traveler
ValueCountFrequency (%)
ori 2
 
3.4%
songhyunc 1
 
1.7%
김하딩의 1
 
1.7%
잉화 1
 
1.7%
다이어트하는쿠몬 1
 
1.7%
coomon 1
 
1.7%
diet 1
 
1.7%
녹색사업단 1
 
1.7%
크뇽tv 1
 
1.7%
예스잼미 1
 
1.7%
Other values (47) 47
81.0%
2023-12-10T23:11:15.059206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
8.7%
a 12
 
3.7%
e 12
 
3.7%
o 12
 
3.7%
n 10
 
3.1%
O 6
 
1.9%
T 6
 
1.9%
6
 
1.9%
S 5
 
1.6%
t 5
 
1.6%
Other values (127) 220
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 121
37.6%
Lowercase Letter 98
30.4%
Uppercase Letter 72
22.4%
Space Separator 28
 
8.7%
Decimal Number 3
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
5.0%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (77) 85
70.2%
Lowercase Letter
ValueCountFrequency (%)
a 12
12.2%
e 12
12.2%
o 12
12.2%
n 10
10.2%
t 5
 
5.1%
k 5
 
5.1%
i 5
 
5.1%
r 5
 
5.1%
m 4
 
4.1%
u 4
 
4.1%
Other values (13) 24
24.5%
Uppercase Letter
ValueCountFrequency (%)
O 6
 
8.3%
T 6
 
8.3%
S 5
 
6.9%
A 5
 
6.9%
R 5
 
6.9%
L 5
 
6.9%
I 5
 
6.9%
M 4
 
5.6%
N 4
 
5.6%
G 4
 
5.6%
Other values (13) 23
31.9%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
0 1
33.3%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 170
52.8%
Hangul 121
37.6%
Common 31
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
5.0%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (77) 85
70.2%
Latin
ValueCountFrequency (%)
a 12
 
7.1%
e 12
 
7.1%
o 12
 
7.1%
n 10
 
5.9%
O 6
 
3.5%
T 6
 
3.5%
S 5
 
2.9%
t 5
 
2.9%
k 5
 
2.9%
i 5
 
2.9%
Other values (36) 92
54.1%
Common
ValueCountFrequency (%)
28
90.3%
1 1
 
3.2%
0 1
 
3.2%
2 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 201
62.4%
Hangul 121
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28
 
13.9%
a 12
 
6.0%
e 12
 
6.0%
o 12
 
6.0%
n 10
 
5.0%
O 6
 
3.0%
T 6
 
3.0%
S 5
 
2.5%
t 5
 
2.5%
k 5
 
2.5%
Other values (40) 100
49.8%
Hangul
ValueCountFrequency (%)
6
 
5.0%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (77) 85
70.2%

Interactions

2023-12-10T23:11:09.881311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:11:09.613232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:11:10.026252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:11:09.739292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:11:15.224189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여채널경로명채널채널조회수채널채널명
참여채널경로명1.0001.0001.0001.000
1.0001.0000.7531.000
채널채널조회수1.0000.7531.0001.000
채널채널명1.0001.0001.0001.000
2023-12-10T23:11:15.404262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채널채널조회수
1.0000.756
채널채널조회수0.7561.000

Missing values

2023-12-10T23:11:10.263762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:11:10.620576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

참여채널경로명참여채널수집일자채널채널조회수채널채널명
0https://www.youtube.com/channel/UCJz-yzuop09u0M17oCMdi2A2020-12-101551520392SONGHYUNC FILM
1https://www.youtube.com/channel/UC2pq2pfYqLXSGh4ryIfoMdg2020-12-101204105760oppa coreano
2https://www.youtube.com/channel/UCnrPF6de3KuHZy3c2ZkqcMA2020-12-1099324112036기획재정부
3https://www.youtube.com/channel/UCVuNSSnNg8BTbZwaTeBzXdA2020-12-10396922서울시립대학교종합사회복지관
4https://www.youtube.com/channel/UCKgWbt6D7Hy9xnvJjAN_4Ug2020-12-101493147467정원이의 여행일기Jeongwon Traveler
5https://www.youtube.com/channel/UC9RGd31vRfzDuLE_NH6nYpg2020-12-1023765974721김덕배 이야기
6https://www.youtube.com/channel/UCUbL5EMa0gHIMFIxkdtqy7A2020-12-1000게임창고
7https://www.youtube.com/channel/UCGohwmhhQEGv2uJ410FhfkQ2020-12-102132775078ORI ORI
8https://www.youtube.com/channel/UCAoyR-sL6B0S93AMR-HVTvg2020-12-1021753997396떡볶퀸 Tteokbokqueen
9https://www.youtube.com/channel/UCYPf4P4Dp-wueB_82VTg2Jw2020-12-1016297980은비로운 생활 Eunbi
참여채널경로명참여채널수집일자채널채널조회수채널채널명
20https://www.youtube.com/channel/UCW8ejls4SFrreNRIpEM86Fw2020-12-103153779680육사시미
21https://www.youtube.com/channel/UCW0dK4y2eV-3CllA7MdCILA2020-12-10112306358목포시
22https://www.youtube.com/channel/UC9Wajb9XAFyU8OZ9_fcpfdQ2020-12-10651272792시현하다 RECORDERS
23https://www.youtube.com/channel/UCxvX64o-lJ9m4Co9zm6druQ2020-12-1028362796694예소리SSOL
24https://www.youtube.com/channel/UCcV5wvP85Ob-Vx_EWpHeX-w2020-12-1050104400김하딩의 랜선다이어리
25https://www.youtube.com/channel/UCueeBCLgeSoaGTbc3ezfAvg2020-12-1055925750김목사 pastor kim
26https://www.youtube.com/channel/UC9HUPC-qo7mUfW4ofckWWNw2020-12-1018918255025Ivan Lam
27https://www.youtube.com/channel/UCb0Hqs8zw8QhkKPYP6dFy0g2020-12-10138143864진도군
28https://www.youtube.com/channel/UCsUZ7AMaOB56WsQgpQUNefA2020-12-105353281통일전망대 x 김팀장의 북한확대경
29https://www.youtube.com/channel/UCFyo2vDEMkBp6_NQlk-YzoQ2020-12-102330065102 LAB