Overview

Dataset statistics

Number of variables: 5
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.1 KiB
Average record size in memory: 42.3 B

Variable types

Numeric: 1
Categorical: 2
Text: 2

Alerts

flag_nm has constant value "유투브" (Constant)
seq_no has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 10:09:05.313230
Analysis finished: 2023-12-10 10:09:06.562879
Duration: 1.25 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

seq_no
Real number (ℝ)

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56.74
Minimum: 1
Maximum: 223
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.95
Q1: 27.75
Median: 53.5
Q3: 78.25
95-th percentile: 98.05
Maximum: 223
Range: 222
Interquartile range (IQR): 50.5

Descriptive statistics

Standard deviation: 40.597711
Coefficient of variation (CV): 0.71550425
Kurtosis: 6.2605878
Mean: 56.74
Median Absolute Deviation (MAD): 25.5
Skewness: 1.8872987
Sum: 5674
Variance: 1648.1741
Monotonicity: Not monotonic
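
Several of the figures above are derived from one another, so they can be cross-checked by hand. The following is a minimal sketch, not part of the report itself: it copies the reported seq_no values and recomputes the range, IQR, coefficient of variation, variance, and mean. The function name derived_stats is hypothetical and chosen only for illustration.

```python
import math

def derived_stats(minimum, maximum, q1, q3, mean, std, total, n):
    """Recompute summary figures that follow from other reported values."""
    return {
        "range": maximum - minimum,   # Range = Maximum - Minimum
        "iqr": q3 - q1,               # IQR = Q3 - Q1
        "cv": std / mean,             # CV = standard deviation / mean
        "variance": std ** 2,         # Variance = standard deviation squared
        "mean": total / n,            # Mean = Sum / number of observations
    }

# Values copied from the seq_no section of this report.
stats = derived_stats(
    minimum=1, maximum=223,
    q1=27.75, q3=78.25,
    mean=56.74, std=40.597711,
    total=5674, n=100,
)

for name, value in stats.items():
    print(f"{name}: {value:.4f}")
```

Comparing the printed values with the reported Range, IQR, CV, Variance, and Mean is a quick way to confirm that the summary was transcribed consistently.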
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.0%
65 | 1 | 1.0%
75 | 1 | 1.0%
74 | 1 | 1.0%
73 | 1 | 1.0%
72 | 1 | 1.0%
71 | 1 | 1.0%
70 | 1 | 1.0%
69 | 1 | 1.0%
68 | 1 | 1.0%
Other values (90) | 90 | 90.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.0%
3 | 1 | 1.0%
4 | 1 | 1.0%
5 | 1 | 1.0%
6 | 1 | 1.0%
7 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%
11 | 1 | 1.0%
12 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
223 | 1 | 1.0%
222 | 1 | 1.0%
221 | 1 | 1.0%
100 | 1 | 1.0%
99 | 1 | 1.0%
98 | 1 | 1.0%
97 | 1 | 1.0%
96 | 1 | 1.0%
95 | 1 | 1.0%
94 | 1 | 1.0%

flag_nm
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
유투브: 100

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유투브
2nd row: 유투브
3rd row: 유투브
4th row: 유투브
5th row: 유투브

Common Values

Value | Count | Frequency (%)
유투브 | 100 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar chart of common values; 유투브 accounts for 100 of 100 rows (100.0%)]

chnnel_nm
Text

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 37
Median length: 21
Mean length: 11.29
Min length: 2

Characters and Unicode

Total characters: 1129
Distinct characters: 121
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
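
As an illustration of how such character properties can be read programmatically, here is a minimal sketch using Python's standard unicodedata module. It counts the general category of each character in one sample channel name. The function name category_counts and the CATEGORY_NAMES mapping are hypothetical helpers for this example; the report does not state how its own counts were produced.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category, using long names."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

# First sample row of chnnel_nm.
counts = category_counts("Jane ASMR 제인")
for name, count in counts.most_common():
    print(f"{name}: {count}")
```

Hangul syllables such as 제 and 인 fall under "Other Letter" because they have no upper or lower case, which is why that category tracks the Hangul script counts in this report.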

Unique

Unique: 98
Unique (%): 98.0%

Sample

1st row: Jane ASMR 제인
2nd row: 헨리 Henry Lau
3rd row: [Awesome Haeun]어썸하은
4th row: [Dorothy]도로시
5th row: [장난감티비]TOYTV

Value | Count | Frequency (%)
kbs | 4 | 2.2%
asmr | 4 | 2.2%
jtbc | 3 | 1.7%
tv | 3 | 1.7%
nct | 3 | 1.7%
world | 2 | 1.1%
official | 2 | 1.1%
mnet | 2 | 1.1%
k-pop | 2 | 1.1%
family | 2 | 1.1%
Other values (147) | 151 | 84.8%

Most occurring characters

Value | Count | Frequency (%)
(space) | 78 | 6.9%
a | 57 | 5.0%
e | 53 | 4.7%
n | 52 | 4.6%
i | 51 | 4.5%
A | 42 | 3.7%
T | 38 | 3.4%
o | 35 | 3.1%
O | 30 | 2.7%
t | 29 | 2.6%
Other values (111) | 664 | 58.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 499 | 44.2%
Uppercase Letter | 437 | 38.7%
Other Letter | 80 | 7.1%
Space Separator | 78 | 6.9%
Decimal Number | 12 | 1.1%
Open Punctuation | 9 | 0.8%
Close Punctuation | 9 | 0.8%
Other Punctuation | 3 | 0.3%
Dash Punctuation | 2 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not preserved] | 6 | 7.5%
[not preserved] | 4 | 5.0%
[not preserved] | 4 | 5.0%
[not preserved] | 3 | 3.8%
[not preserved] | 3 | 3.8%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
Other values (48) | 50 | 62.5%

Uppercase Letter
Value | Count | Frequency (%)
A | 42 | 9.6%
T | 38 | 8.7%
O | 30 | 6.9%
B | 29 | 6.6%
E | 28 | 6.4%
M | 28 | 6.4%
R | 25 | 5.7%
S | 25 | 5.7%
N | 24 | 5.5%
L | 21 | 4.8%
Other values (16) | 147 | 33.6%

Lowercase Letter
Value | Count | Frequency (%)
a | 57 | 11.4%
e | 53 | 10.6%
n | 52 | 10.4%
i | 51 | 10.2%
o | 35 | 7.0%
t | 29 | 5.8%
r | 29 | 5.8%
s | 25 | 5.0%
y | 23 | 4.6%
u | 22 | 4.4%
Other values (14) | 123 | 24.6%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 33.3%
2 | 4 | 33.3%
7 | 2 | 16.7%
5 | 1 | 8.3%
4 | 1 | 8.3%

Open Punctuation
Value | Count | Frequency (%)
[ | 7 | 77.8%
( | 2 | 22.2%

Close Punctuation
Value | Count | Frequency (%)
] | 7 | 77.8%
) | 2 | 22.2%

Other Punctuation
Value | Count | Frequency (%)
' | 2 | 66.7%
* | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 78 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 936 | 82.9%
Common | 113 | 10.0%
Hangul | 80 | 7.1%

Most frequent character per script

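Hangul
Value | Count | Frequency (%)
[not preserved] | 6 | 7.5%
[not preserved] | 4 | 5.0%
[not preserved] | 4 | 5.0%
[not preserved] | 3 | 3.8%
[not preserved] | 3 | 3.8%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
Other values (48) | 50 | 62.5%

Latin
Value | Count | Frequency (%)
a | 57 | 6.1%
e | 53 | 5.7%
n | 52 | 5.6%
i | 51 | 5.4%
A | 42 | 4.5%
T | 38 | 4.1%
o | 35 | 3.7%
O | 30 | 3.2%
t | 29 | 3.1%
B | 29 | 3.1%
Other values (40) | 520 | 55.6%

Common
Value | Count | Frequency (%)
(space) | 78 | 69.0%
[ | 7 | 6.2%
] | 7 | 6.2%
1 | 4 | 3.5%
2 | 4 | 3.5%
- | 2 | 1.8%
' | 2 | 1.8%
7 | 2 | 1.8%
( | 2 | 1.8%
) | 2 | 1.8%
Other values (3) | 3 | 2.7%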

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1049 | 92.9%
Hangul | 80 | 7.1%

Most frequent character per block

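ASCII
Value | Count | Frequency (%)
(space) | 78 | 7.4%
a | 57 | 5.4%
e | 53 | 5.1%
n | 52 | 5.0%
i | 51 | 4.9%
A | 42 | 4.0%
T | 38 | 3.6%
o | 35 | 3.3%
O | 30 | 2.9%
t | 29 | 2.8%
Other values (53) | 584 | 55.7%

Hangul
Value | Count | Frequency (%)
[not preserved] | 6 | 7.5%
[not preserved] | 4 | 5.0%
[not preserved] | 4 | 5.0%
[not preserved] | 3 | 3.8%
[not preserved] | 3 | 3.8%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
[not preserved] | 2 | 2.5%
Other values (48) | 50 | 62.5%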

genre_nm
Categorical

Distinct: 31
Distinct (%): 31.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
음악: 27
키즈: 14
음악엔터테인먼트: 7
영화/애니: 6
엔터테인먼트: 6
Other values (26): 40

Length

Max length: 15
Median length: 12
Mean length: 5.01
Min length: 2

Unique

Unique: 19
Unique (%): 19.0%

Sample

1st row: ASMR
2nd row: 음악Vlog/일상
3rd row: 구독자수
4th row: 푸드/먹방
5th row: 키즈

Common Values

Value | Count | Frequency (%)
음악 | 27 | 27.0%
키즈 | 14 | 14.0%
음악엔터테인먼트 | 7 | 7.0%
영화/애니 | 6 | 6.0%
엔터테인먼트 | 6 | 6.0%
ASMR | 5 | 5.0%
푸드/먹방 | 4 | 4.0%
음악Vlog/일상 | 4 | 4.0%
Vlog/일상지식/정보FUN | 2 | 2.0%
음악게임 | 2 | 2.0%
Other values (21) | 23 | 23.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
음악 | 28 | 27.7%
키즈 | 14 | 13.9%
음악엔터테인먼트 | 7 | 6.9%
엔터테인먼트 | 7 | 6.9%
영화/애니 | 6 | 5.9%
asmr | 5 | 5.0%
푸드/먹방 | 4 | 4.0%
음악vlog/일상 | 4 | 4.0%
뷰티 | 2 | 2.0%
fun | 2 | 2.0%
Other values (20) | 22 | 21.8%

reader_co_cn
Text

Distinct: 92
Distinct (%): 92.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 5
Median length: 4
Mean length: 4.13
Min length: 4

Characters and Unicode

Total characters: 413
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 85
Unique (%): 85.0%

Sample

1st row: 1530만
2nd row: 207만
3rd row: 517만
4th row: 416만
5th row: 571만
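
These values are profiled as text because each one carries the Korean unit suffix 만, which means ten thousand. To analyse the column numerically it would first need converting. The following is a minimal sketch under the assumption that every value is a whole number followed by 만, as in the samples above; the function name parse_man is hypothetical and not part of the profiled dataset or of ydata-profiling.

```python
MAN = 10_000  # 만 denotes ten thousand

def parse_man(value: str) -> int:
    """Convert a string such as '1530만' to an integer count.

    Assumes the format '<whole number>만'; anything else raises ValueError.
    """
    text = value.strip()
    if not text.endswith("만"):
        raise ValueError(f"expected a value ending in '만', got {value!r}")
    return int(text[:-1]) * MAN

# Sample rows copied from the report.
samples = ["1530만", "207만", "517만", "416만", "571만"]
converted = [parse_man(s) for s in samples]
print(converted)
```

After conversion the column could be profiled as a numeric variable, which would give quantiles and a histogram instead of character counts.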

Value | Count | Frequency (%)
225만 | 3 | 3.0%
328만 | 2 | 2.0%
457만 | 2 | 2.0%
248만 | 2 | 2.0%
220만 | 2 | 2.0%
846만 | 2 | 2.0%
445만 | 2 | 2.0%
281만 | 1 | 1.0%
1710만 | 1 | 1.0%
273만 | 1 | 1.0%
Other values (82) | 82 | 82.0%

Most occurring characters

Value | Count | Frequency (%)
만 | 100 | 24.2%
2 | 58 | 14.0%
5 | 34 | 8.2%
3 | 34 | 8.2%
4 | 33 | 8.0%
1 | 31 | 7.5%
0 | 30 | 7.3%
6 | 30 | 7.3%
8 | 28 | 6.8%
9 | 18 | 4.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 313 | 75.8%
Other Letter | 100 | 24.2%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 58 | 18.5%
5 | 34 | 10.9%
3 | 34 | 10.9%
4 | 33 | 10.5%
1 | 31 | 9.9%
0 | 30 | 9.6%
6 | 30 | 9.6%
8 | 28 | 8.9%
9 | 18 | 5.8%
7 | 17 | 5.4%

Other Letter
Value | Count | Frequency (%)
만 | 100 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 313 | 75.8%
Hangul | 100 | 24.2%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 58 | 18.5%
5 | 34 | 10.9%
3 | 34 | 10.9%
4 | 33 | 10.5%
1 | 31 | 9.9%
0 | 30 | 9.6%
6 | 30 | 9.6%
8 | 28 | 8.9%
9 | 18 | 5.8%
7 | 17 | 5.4%

Hangul
Value | Count | Frequency (%)
만 | 100 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 313 | 75.8%
Hangul | 100 | 24.2%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
만 | 100 | 100.0%

ASCII
Value | Count | Frequency (%)
2 | 58 | 18.5%
5 | 34 | 10.9%
3 | 34 | 10.9%
4 | 33 | 10.5%
1 | 31 | 9.9%
0 | 30 | 9.6%
6 | 30 | 9.6%
8 | 28 | 8.9%
9 | 18 | 5.8%
7 | 17 | 5.4%

Interactions

[Figure: interactions plot; image not preserved in this text version]

Correlations

 | seq_no | chnnel_nm | genre_nm | reader_co_cn
seq_no | 1.000 | 1.000 | 0.530 | 0.000
chnnel_nm | 1.000 | 1.000 | 1.000 | 1.000
genre_nm | 0.530 | 1.000 | 1.000 | 0.874
reader_co_cn | 0.000 | 1.000 | 0.874 | 1.000

 | seq_no | genre_nm
seq_no | 1.000 | 0.220
genre_nm | 0.220 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
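
The per-column nullity that these plots summarise can also be computed directly. The sketch below is a minimal illustration, assuming rows are held as dictionaries and that a missing cell is represented as None; the function name nullity_by_column is hypothetical, and this is not how ydata-profiling itself computes the figures.

```python
from typing import Dict, List, Optional

Row = Dict[str, Optional[str]]

def nullity_by_column(rows: List[Row]) -> Dict[str, int]:
    """Return the number of missing (None) cells for each column."""
    counts: Dict[str, int] = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (1 if value is None else 0)
    return counts

# Three rows copied from the sample of this dataset.
rows: List[Row] = [
    {"seq_no": "1", "flag_nm": "유투브", "chnnel_nm": "Jane ASMR 제인",
     "genre_nm": "ASMR", "reader_co_cn": "1530만"},
    {"seq_no": "221", "flag_nm": "유투브", "chnnel_nm": "헨리 Henry Lau",
     "genre_nm": "음악Vlog/일상", "reader_co_cn": "207만"},
    {"seq_no": "3", "flag_nm": "유투브", "chnnel_nm": "[Awesome Haeun]어썸하은",
     "genre_nm": "구독자수", "reader_co_cn": "517만"},
]

missing = nullity_by_column(rows)
print(missing)
```

For this dataset every count is zero, which matches the "Missing cells: 0" figure in the overview.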

Sample

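First rows

(index) | seq_no | flag_nm | chnnel_nm | genre_nm | reader_co_cn
0 | 1 | 유투브 | Jane ASMR 제인 | ASMR | 1530만
1 | 221 | 유투브 | 헨리 Henry Lau | 음악Vlog/일상 | 207만
2 | 3 | 유투브 | [Awesome Haeun]어썸하은 | 구독자수 | 517만
3 | 4 | 유투브 | [Dorothy]도로시 | 푸드/먹방 | 416만
4 | 5 | 유투브 | [장난감티비]TOYTV | 키즈 | 571만
5 | 6 | 유투브 | [햄지]Hamzy | 푸드/먹방 | 816만
6 | 7 | 유투브 | 1MILLION Dance Studio | 음악 | 2450만
7 | 222 | 유투브 | 흔한남매 | FUN | 225만
8 | 9 | 유투브 | 2NE1 | 음악 | 536만
9 | 10 | 유투브 | 4Minute 포미닛(Official YouTube Channel) | 음악 | 211만

Last rows

(index) | seq_no | flag_nm | chnnel_nm | genre_nm | reader_co_cn
90 | 91 | 유투브 | officialpsy | 음악 | 1520만
91 | 92 | 유투브 | OneRepublic | 음악 | 952만
92 | 93 | 유투브 | PinkyPopTOY | 키즈 | 563만
93 | 94 | 유투브 | PLAYLIST ORIGINALS 플레이리스트 오리지널 | 영화/애니 | 255만
94 | 95 | 유투브 | PONY Syndrome | 뷰티 | 588만
95 | 96 | 유투브 | Raon Lee | 음악 | 422만
96 | 97 | 유투브 | Red Velvet | 음악 | 457만
97 | 98 | 유투브 | RISABAE | 뷰티 | 225만
98 | 99 | 유투브 | RnC Fabrica | 지식/정보Vlog/일상 | 246만
99 | 100 | 유투브 | Romiyu Vlog로미유 브이로그 | 키즈 | 365만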