Overview

Dataset statistics

Number of variables5
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory46.4 B

Variable types

Text2
Categorical1
Numeric2

Dataset

Description샘플 데이터
Author한양대
URLhttps://bigdata-region.kr/#/dataset/0e6aa056-7b20-4985-98d2-8a7e6772b604

Alerts

참여채널수집일자 has constant value ""Constant
is highly overall correlated with 채널채널조회수High correlation
채널채널조회수 is highly overall correlated with High correlation
참여채널경로명 has unique valuesUnique
채널채널조회수 has unique valuesUnique
채널채널명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:23:57.763569
Analysis finished2023-12-10 14:23:58.672500
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:23:58.870962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length56
Mean length56
Min length56

Characters and Unicode

Total characters1680
Distinct characters67
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowhttps://www.youtube.com/channel/UChX5JaALoA1o80Cc5zkQz_Q
2nd rowhttps://www.youtube.com/channel/UCSi-RZci6rZZu3ASym-gUFw
3rd rowhttps://www.youtube.com/channel/UC3htcPlye6noa3pAgegT2AQ
4th rowhttps://www.youtube.com/channel/UCY2YdDn7zXtRbNR0fJ9V_YQ
5th rowhttps://www.youtube.com/channel/UCeINdcOCC7Ztz1qK7LHsZdg
ValueCountFrequency (%)
https://www.youtube.com/channel/uchx5jaaloa1o80cc5zkqz_q 1
 
3.3%
https://www.youtube.com/channel/ucsi-rzci6rzzu3asym-gufw 1
 
3.3%
https://www.youtube.com/channel/ucxhs5ppzmsegt9v7qjnmgbg 1
 
3.3%
https://www.youtube.com/channel/uct15x5ehlwyp8ppntqtkudq 1
 
3.3%
https://www.youtube.com/channel/ucpvenftgbgd-cuuvrn7rrgq 1
 
3.3%
https://www.youtube.com/channel/ucs_hnpjlqtvbkqalgapi_4g 1
 
3.3%
https://www.youtube.com/channel/uc9c8t9xx52md-exhefmcfca 1
 
3.3%
https://www.youtube.com/channel/ucjuq86kvczvrolbyrm4nbdg 1
 
3.3%
https://www.youtube.com/channel/ucfs9uqjvlxrfucgkoauwozg 1
 
3.3%
https://www.youtube.com/channel/uc1rsggbqw0ftgdexi6emwga 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:23:59.265642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 120
 
7.1%
w 105
 
6.2%
t 102
 
6.1%
c 72
 
4.3%
u 71
 
4.2%
e 71
 
4.2%
n 69
 
4.1%
h 68
 
4.0%
o 66
 
3.9%
. 60
 
3.6%
Other values (57) 876
52.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1030
61.3%
Uppercase Letter 341
 
20.3%
Other Punctuation 210
 
12.5%
Decimal Number 81
 
4.8%
Connector Punctuation 10
 
0.6%
Dash Punctuation 8
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 105
 
10.2%
t 102
 
9.9%
c 72
 
7.0%
u 71
 
6.9%
e 71
 
6.9%
n 69
 
6.7%
h 68
 
6.6%
o 66
 
6.4%
p 43
 
4.2%
b 41
 
4.0%
Other values (16) 322
31.3%
Uppercase Letter
ValueCountFrequency (%)
C 44
 
12.9%
U 40
 
11.7%
Q 20
 
5.9%
Z 17
 
5.0%
A 14
 
4.1%
K 14
 
4.1%
P 14
 
4.1%
D 13
 
3.8%
B 12
 
3.5%
Y 12
 
3.5%
Other values (16) 141
41.3%
Decimal Number
ValueCountFrequency (%)
6 12
14.8%
5 11
13.6%
7 9
11.1%
1 8
9.9%
2 8
9.9%
9 8
9.9%
0 7
8.6%
4 6
7.4%
3 6
7.4%
8 6
7.4%
Other Punctuation
ValueCountFrequency (%)
/ 120
57.1%
. 60
28.6%
: 30
 
14.3%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1371
81.6%
Common 309
 
18.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 105
 
7.7%
t 102
 
7.4%
c 72
 
5.3%
u 71
 
5.2%
e 71
 
5.2%
n 69
 
5.0%
h 68
 
5.0%
o 66
 
4.8%
C 44
 
3.2%
p 43
 
3.1%
Other values (42) 660
48.1%
Common
ValueCountFrequency (%)
/ 120
38.8%
. 60
19.4%
: 30
 
9.7%
6 12
 
3.9%
5 11
 
3.6%
_ 10
 
3.2%
7 9
 
2.9%
1 8
 
2.6%
2 8
 
2.6%
9 8
 
2.6%
Other values (5) 33
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1680
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 120
 
7.1%
w 105
 
6.2%
t 102
 
6.1%
c 72
 
4.3%
u 71
 
4.2%
e 71
 
4.2%
n 69
 
4.1%
h 68
 
4.0%
o 66
 
3.9%
. 60
 
3.6%
Other values (57) 876
52.1%

참여채널수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2021-01-01
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-01-01
2nd row2021-01-01
3rd row2021-01-01
4th row2021-01-01
5th row2021-01-01

Common Values

ValueCountFrequency (%)
2021-01-01 30
100.0%

Length

2023-12-10T23:23:59.415636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:23:59.511345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-01-01 30
100.0%


Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1261.4333
Minimum5
Maximum22490
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:23:59.604790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile51.15
Q1188.75
median318
Q3560.75
95-th percentile3470.1
Maximum22490
Range22485
Interquartile range (IQR)372

Descriptive statistics

Standard deviation4122.9705
Coefficient of variation (CV)3.2684807
Kurtosis26.523698
Mean1261.4333
Median Absolute Deviation (MAD)169
Skewness5.0675776
Sum37843
Variance16998886
MonotonicityNot monotonic
2023-12-10T23:23:59.727910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
271 2
 
6.7%
231 1
 
3.3%
586 1
 
3.3%
387 1
 
3.3%
397 1
 
3.3%
1082 1
 
3.3%
213 1
 
3.3%
22490 1
 
3.3%
143 1
 
3.3%
55 1
 
3.3%
Other values (19) 19
63.3%
ValueCountFrequency (%)
5 1
3.3%
48 1
3.3%
55 1
3.3%
71 1
3.3%
114 1
3.3%
143 1
3.3%
147 1
3.3%
181 1
3.3%
212 1
3.3%
213 1
3.3%
ValueCountFrequency (%)
22490 1
3.3%
5424 1
3.3%
1082 1
3.3%
915 1
3.3%
769 1
3.3%
730 1
3.3%
639 1
3.3%
586 1
3.3%
485 1
3.3%
397 1
3.3%

채널채널조회수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.8672771 × 108
Minimum7261
Maximum3.0990534 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:23:59.838048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7261
5-th percentile106674.3
Q14560900
median19044965
Q375310780
95-th percentile6.8355295 × 108
Maximum3.0990534 × 109
Range3.0990461 × 109
Interquartile range (IQR)70749880

Descriptive statistics

Standard deviation5.785319 × 108
Coefficient of variation (CV)3.0982648
Kurtosis24.007553
Mean1.8672771 × 108
Median Absolute Deviation (MAD)18753091
Skewness4.7622599
Sum5.6018314 × 109
Variance3.3469916 × 1017
MonotonicityNot monotonic
2023-12-10T23:23:59.944217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
134932593 1
 
3.3%
12712671 1
 
3.3%
245917385 1
 
3.3%
4479422 1
 
3.3%
235736405 1
 
3.3%
87282576 1
 
3.3%
3099053362 1
 
3.3%
28152 1
 
3.3%
381102 1
 
3.3%
76821783 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
7261 1
3.3%
28152 1
3.3%
202646 1
3.3%
381102 1
3.3%
1358808 1
3.3%
1420099 1
3.3%
4397692 1
3.3%
4479422 1
3.3%
4805334 1
3.3%
5047616 1
3.3%
ValueCountFrequency (%)
3099053362 1
3.3%
895994243 1
3.3%
423902473 1
3.3%
245917385 1
3.3%
235736405 1
3.3%
134932593 1
3.3%
87282576 1
3.3%
76821783 1
3.3%
70777769 1
3.3%
68400207 1
3.3%

채널채널명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:24:00.181266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length10
Mean length8.3666667
Min length3

Characters and Unicode

Total characters251
Distinct characters129
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowsandy mandy
2nd rowEmily Rios
3rd row바이준 BYJUN
4th rowArang 아랑
5th row살림TV
ValueCountFrequency (%)
sandy 1
 
2.2%
최남매 1
 
2.2%
yootrue 1
 
2.2%
on 1
 
2.2%
air 1
 
2.2%
삼대장 1
 
2.2%
samdaejang 1
 
2.2%
방위사업청 1
 
2.2%
윤호찌 1
 
2.2%
todayommmi 1
 
2.2%
Other values (35) 35
77.8%
2023-12-10T23:24:00.579114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
6.4%
a 12
 
4.8%
T 7
 
2.8%
n 7
 
2.8%
m 6
 
2.4%
V 5
 
2.0%
y 5
 
2.0%
e 5
 
2.0%
i 5
 
2.0%
O 5
 
2.0%
Other values (119) 178
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 115
45.8%
Lowercase Letter 67
26.7%
Uppercase Letter 50
19.9%
Space Separator 16
 
6.4%
Close Punctuation 1
 
0.4%
Open Punctuation 1
 
0.4%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (78) 89
77.4%
Uppercase Letter
ValueCountFrequency (%)
T 7
14.0%
V 5
 
10.0%
O 5
 
10.0%
S 5
 
10.0%
B 3
 
6.0%
R 3
 
6.0%
K 2
 
4.0%
P 2
 
4.0%
Y 2
 
4.0%
N 2
 
4.0%
Other values (9) 14
28.0%
Lowercase Letter
ValueCountFrequency (%)
a 12
17.9%
n 7
10.4%
m 6
9.0%
y 5
 
7.5%
e 5
 
7.5%
i 5
 
7.5%
o 5
 
7.5%
d 4
 
6.0%
g 3
 
4.5%
s 2
 
3.0%
Other values (8) 13
19.4%
Space Separator
ValueCountFrequency (%)
16
100.0%
Close Punctuation
ValueCountFrequency (%)
] 1
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 1
100.0%
Other Punctuation
ValueCountFrequency (%)
? 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 117
46.6%
Hangul 108
43.0%
Common 19
 
7.6%
Katakana 7
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (72) 82
75.9%
Latin
ValueCountFrequency (%)
a 12
 
10.3%
T 7
 
6.0%
n 7
 
6.0%
m 6
 
5.1%
V 5
 
4.3%
y 5
 
4.3%
e 5
 
4.3%
i 5
 
4.3%
O 5
 
4.3%
o 5
 
4.3%
Other values (27) 55
47.0%
Katakana
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Common
ValueCountFrequency (%)
16
84.2%
] 1
 
5.3%
[ 1
 
5.3%
? 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 136
54.2%
Hangul 108
43.0%
Katakana 7
 
2.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16
 
11.8%
a 12
 
8.8%
T 7
 
5.1%
n 7
 
5.1%
m 6
 
4.4%
V 5
 
3.7%
y 5
 
3.7%
e 5
 
3.7%
i 5
 
3.7%
O 5
 
3.7%
Other values (31) 63
46.3%
Hangul
ValueCountFrequency (%)
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (72) 82
75.9%
Katakana
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Interactions

2023-12-10T23:23:58.201475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:23:57.990571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:23:58.327926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:23:58.070171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:24:00.716985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여채널경로명채널채널조회수채널채널명
참여채널경로명1.0001.0001.0001.000
1.0001.0000.6431.000
채널채널조회수1.0000.6431.0001.000
채널채널명1.0001.0001.0001.000
2023-12-10T23:24:00.823022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채널채널조회수
1.0000.531
채널채널조회수0.5311.000

Missing values

2023-12-10T23:23:58.502622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:23:58.623071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

참여채널경로명참여채널수집일자채널채널조회수채널채널명
0https://www.youtube.com/channel/UChX5JaALoA1o80Cc5zkQz_Q2021-01-01231134932593sandy mandy
1https://www.youtube.com/channel/UCSi-RZci6rZZu3ASym-gUFw2021-01-0151420099Emily Rios
2https://www.youtube.com/channel/UC3htcPlye6noa3pAgegT2AQ2021-01-017150072430바이준 BYJUN
3https://www.youtube.com/channel/UCY2YdDn7zXtRbNR0fJ9V_YQ2021-01-0127117863586Arang 아랑
4https://www.youtube.com/channel/UCeINdcOCC7Ztz1qK7LHsZdg2021-01-012714397692살림TV
5https://www.youtube.com/channel/UCnFFOjljp1_sacTz7PfIIyg2021-01-0121254146091LeoJ Makeup
6https://www.youtube.com/channel/UCQuB2-KDqHxG56e_ufJ0iiQ2021-01-013715047616쌔라튜브[적십자]
7https://www.youtube.com/channel/UCbPBCZhgG_kX_1Wb0cdu5lw2021-01-013481358808소방청
8https://www.youtube.com/channel/UCnfwIKyFYRuqZzzKBDt6JOA2021-01-01542415052424매일경제TV
9https://www.youtube.com/channel/UCyr69KqfuOVmpYecPfFKsaQ2021-01-01487261한국기상산업기술원
참여채널경로명참여채널수집일자채널채널조회수채널채널명
20https://www.youtube.com/channel/UCiWcXU1zYrEONKQtNVGr_Jw2021-01-0138270777769윤호찌
21https://www.youtube.com/channel/UC1RsGgbqw0FtgDexi6eMWGA2021-01-0111426162623Todayommmi
22https://www.youtube.com/channel/UCfs9UQJvLXRfUCGKOAuWozg2021-01-0148576821783수다쟁이쭌
23https://www.youtube.com/channel/UCjUQ86KVcZvrOlbYrM4NBDg2021-01-0155381102오공TV
24https://www.youtube.com/channel/UC9c8t9Xx52mD-ExhefmCFCA2021-01-0114328152키스타TV
25https://www.youtube.com/channel/UCS_hnpJLQTvBkqALgapi_4g2021-01-01224903099053362스브스케이팝 SBS KPOP
26https://www.youtube.com/channel/UCPVENfTGbgD-CUuvrN7rRGQ2021-01-0121387282576상해기SangHyuk
27https://www.youtube.com/channel/UCt15X5eHLwyP8PpNtQTkuDQ2021-01-011082235736405Official Dopa
28https://www.youtube.com/channel/UCxHS5PPZmSEgT9v7qJNMGbg2021-01-013974479422한국환경산업기술원
29https://www.youtube.com/channel/UCx6jsZ02B4K3SECUrkgPyzg2021-01-01387245917385놀면 뭐하니?