Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.8 KiB
Average record size in memory59.3 B

Variable types

DateTime1
Categorical4
Text1
Numeric1

Alerts

Social_Data_Collection_Date_YM has constant value ""Constant
Collection_CH_NM has constant value ""Constant
FILE_NAME has constant value ""Constant
BASE_YMD has constant value ""Constant

Reproduction

Analysis started2023-12-10 10:13:07.481435
Analysis finished2023-12-10 10:13:08.491763
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2017-01-01 00:00:00
Maximum2017-01-01 00:00:00
2023-12-10T19:13:08.643901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:13:08.883650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Collection_CH_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Twitter
100 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTwitter
2nd rowTwitter
3rd rowTwitter
4th rowTwitter
5th rowTwitter

Common Values

ValueCountFrequency (%)
Twitter 100
100.0%

Length

2023-12-10T19:13:09.121799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:13:09.381796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
twitter 100
100.0%

Entertainment_NM
Categorical

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
rbw
18 
yg
16 
jyp
15 
starship
15 
cube
14 
Other values (2)
22 

Length

Max length8
Median length6
Mean length3.95
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowyg
2nd rowyg
3rd rowyg
4th rowyg
5th rowyg

Common Values

ValueCountFrequency (%)
rbw 18
18.0%
yg 16
16.0%
jyp 15
15.0%
starship 15
15.0%
cube 14
14.0%
bighit 11
11.0%
cj 11
11.0%

Length

2023-12-10T19:13:09.604604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:13:10.275133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
rbw 18
18.0%
yg 16
16.0%
jyp 15
15.0%
starship 15
15.0%
cube 14
14.0%
bighit 11
11.0%
cj 11
11.0%
Distinct82
Distinct (%)82.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:13:10.755273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.5
Min length3

Characters and Unicode

Total characters550
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)72.0%

Sample

1st rowblackpink
2nd rowbts
3rd rowkpop
4th rowygentoffici
5th rowjackson
ValueCountFrequency (%)
bts 4
 
4.0%
jackson 3
 
3.0%
video 3
 
3.0%
youtube 3
 
3.0%
amp 3
 
3.0%
comeback 3
 
3.0%
kpop 3
 
3.0%
best 2
 
2.0%
eampm 2
 
2.0%
entertain 2
 
2.0%
Other values (72) 72
72.0%
2023-12-10T19:13:11.514281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 51
 
9.3%
e 51
 
9.3%
a 48
 
8.7%
t 46
 
8.4%
n 36
 
6.5%
c 30
 
5.5%
m 30
 
5.5%
i 29
 
5.3%
b 28
 
5.1%
p 24
 
4.4%
Other values (14) 177
32.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 550
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 51
 
9.3%
e 51
 
9.3%
a 48
 
8.7%
t 46
 
8.4%
n 36
 
6.5%
c 30
 
5.5%
m 30
 
5.5%
i 29
 
5.3%
b 28
 
5.1%
p 24
 
4.4%
Other values (14) 177
32.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 550
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 51
 
9.3%
e 51
 
9.3%
a 48
 
8.7%
t 46
 
8.4%
n 36
 
6.5%
c 30
 
5.5%
m 30
 
5.5%
i 29
 
5.3%
b 28
 
5.1%
p 24
 
4.4%
Other values (14) 177
32.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 550
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 51
 
9.3%
e 51
 
9.3%
a 48
 
8.7%
t 46
 
8.4%
n 36
 
6.5%
c 30
 
5.5%
m 30
 
5.5%
i 29
 
5.3%
b 28
 
5.1%
p 24
 
4.4%
Other values (14) 177
32.2%

Keyword_FQ
Real number (ℝ)

Distinct40
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.86
Minimum2
Maximum225
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:13:11.803262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q12
median13
Q325
95-th percentile92.65
Maximum225
Range223
Interquartile range (IQR)23

Descriptive statistics

Standard deviation36.703293
Coefficient of variation (CV)1.6055684
Kurtosis14.053095
Mean22.86
Median Absolute Deviation (MAD)11
Skewness3.4691468
Sum2286
Variance1347.1317
MonotonicityNot monotonic
2023-12-10T19:13:12.038232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
2 30
30.0%
4 5
 
5.0%
14 5
 
5.0%
16 5
 
5.0%
3 4
 
4.0%
25 4
 
4.0%
37 3
 
3.0%
36 3
 
3.0%
13 3
 
3.0%
19 2
 
2.0%
Other values (30) 36
36.0%
ValueCountFrequency (%)
2 30
30.0%
3 4
 
4.0%
4 5
 
5.0%
5 1
 
1.0%
7 2
 
2.0%
8 1
 
1.0%
9 2
 
2.0%
10 1
 
1.0%
11 2
 
2.0%
12 1
 
1.0%
ValueCountFrequency (%)
225 1
1.0%
194 1
1.0%
145 1
1.0%
109 1
1.0%
105 1
1.0%
92 1
1.0%
76 1
1.0%
75 1
1.0%
60 1
1.0%
53 1
1.0%

FILE_NAME
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
KC_KEYWORD_TWITTER_ENTERTAINMENT_2019
100 

Length

Max length37
Median length37
Mean length37
Min length37

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKC_KEYWORD_TWITTER_ENTERTAINMENT_2019
2nd rowKC_KEYWORD_TWITTER_ENTERTAINMENT_2019
3rd rowKC_KEYWORD_TWITTER_ENTERTAINMENT_2019
4th rowKC_KEYWORD_TWITTER_ENTERTAINMENT_2019
5th rowKC_KEYWORD_TWITTER_ENTERTAINMENT_2019

Common Values

ValueCountFrequency (%)
KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 100
100.0%

Length

2023-12-10T19:13:12.286938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:13:12.487327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kc_keyword_twitter_entertainment_2019 100
100.0%

BASE_YMD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 100
100.0%

Length

2023-12-10T19:13:12.704588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:13:12.953500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 100
100.0%

Interactions

2023-12-10T19:13:07.792191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:13:13.065942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Entertainment_NMTwitter_KEY_WKeyword_FQ
Entertainment_NM1.0000.0000.432
Twitter_KEY_W0.0001.0000.000
Keyword_FQ0.4320.0001.000
2023-12-10T19:13:13.229100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Keyword_FQEntertainment_NM
Keyword_FQ1.0000.245
Entertainment_NM0.2451.000

Missing values

2023-12-10T19:13:08.095818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:13:08.372253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Social_Data_Collection_Date_YMCollection_CH_NMEntertainment_NMTwitter_KEY_WKeyword_FQFILE_NAMEBASE_YMD
02017-01Twitterygblackpink225KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
12017-01Twitterygbts75KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
22017-01Twitterygkpop60KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
32017-01Twitterygygentoffici40KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
42017-01Twitterygjackson39KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
52017-01Twitterygvideo37KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
62017-01Twitterygphoto36KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
72017-01Twitterygyoutube36KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
82017-01Twitterygscott35KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
92017-01Twitterygamp32KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
Social_Data_Collection_Date_YMCollection_CH_NMEntertainment_NMTwitter_KEY_WKeyword_FQFILE_NAMEBASE_YMD
902017-01Twittercubecomeback4KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
912017-01Twittercubepentagon4KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
922017-01Twittercubecubetv3KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
932017-01Twittercubegood3KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
942017-01Twittercubeconcept3KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
952017-01Twittercubeweek2KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
962017-01Twittercubecut2KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
972017-01Twittercubefull2KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
982017-01Twittercubevid2KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019
992017-01Twittercubeamp2KC_KEYWORD_TWITTER_ENTERTAINMENT_20192019