Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory67.3 B

Variable types

DateTime1
Categorical5
Text1
Numeric1

Alerts

Collection_CH_NM has constant value ""Constant
FILE_NAME has constant value ""Constant
BASE_YMD has constant value ""Constant
Fanclub_NM is highly overall correlated with Artist_NMHigh correlation
Artist_NM is highly overall correlated with Fanclub_NMHigh correlation

Reproduction

Analysis started2023-12-10 09:53:55.121786
Analysis finished2023-12-10 09:53:56.709180
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2017-01-01 00:00:00
Maximum2017-02-01 00:00:00
2023-12-10T18:53:56.791793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:53:57.002655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

Collection_CH_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Twitter
100 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTwitter
2nd rowTwitter
3rd rowTwitter
4th rowTwitter
5th rowTwitter

Common Values

ValueCountFrequency (%)
Twitter 100
100.0%

Length

2023-12-10T18:53:57.258125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:57.455939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
twitter 100
100.0%

Artist_NM
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
blackpink
32 
BTS
26 
got7
17 
EXO
13 
TXT
12 

Length

Max length9
Median length3
Mean length5.09
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBTS
2nd rowBTS
3rd rowBTS
4th rowBTS
5th rowBTS

Common Values

ValueCountFrequency (%)
blackpink 32
32.0%
BTS 26
26.0%
got7 17
17.0%
EXO 13
13.0%
TXT 12
 
12.0%

Length

2023-12-10T18:53:57.672392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:57.911471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
blackpink 32
32.0%
bts 26
26.0%
got7 17
17.0%
exo 13
13.0%
txt 12
 
12.0%

Fanclub_NM
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
blink
32 
army
26 
igot7
17 
exol
13 
moa
12 

Length

Max length5
Median length4
Mean length4.37
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowarmy
2nd rowarmy
3rd rowarmy
4th rowarmy
5th rowarmy

Common Values

ValueCountFrequency (%)
blink 32
32.0%
army 26
26.0%
igot7 17
17.0%
exol 13
13.0%
moa 12
 
12.0%

Length

2023-12-10T18:53:58.212066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:58.463702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
blink 32
32.0%
army 26
26.0%
igot7 17
17.0%
exol 13
13.0%
moa 12
 
12.0%
Distinct73
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T18:53:58.951912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length5.38
Min length3

Characters and Unicode

Total characters538
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)59.0%

Sample

1st rowbts
2nd rowarmi
3rd rowbangtan
4th rowvote
5th rowlove
ValueCountFrequency (%)
bts 5
 
5.0%
love 5
 
5.0%
will 4
 
4.0%
amp 4
 
4.0%
armi 3
 
3.0%
kpop 3
 
3.0%
stan 3
 
3.0%
bangtan 2
 
2.0%
watch 2
 
2.0%
now 2
 
2.0%
Other values (63) 67
67.0%
2023-12-10T18:53:59.681666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 57
 
10.6%
n 46
 
8.6%
i 46
 
8.6%
t 41
 
7.6%
o 38
 
7.1%
e 34
 
6.3%
l 31
 
5.8%
b 26
 
4.8%
s 26
 
4.8%
m 25
 
4.6%
Other values (14) 168
31.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 538
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 57
 
10.6%
n 46
 
8.6%
i 46
 
8.6%
t 41
 
7.6%
o 38
 
7.1%
e 34
 
6.3%
l 31
 
5.8%
b 26
 
4.8%
s 26
 
4.8%
m 25
 
4.6%
Other values (14) 168
31.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 538
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 57
 
10.6%
n 46
 
8.6%
i 46
 
8.6%
t 41
 
7.6%
o 38
 
7.1%
e 34
 
6.3%
l 31
 
5.8%
b 26
 
4.8%
s 26
 
4.8%
m 25
 
4.6%
Other values (14) 168
31.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 538
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 57
 
10.6%
n 46
 
8.6%
i 46
 
8.6%
t 41
 
7.6%
o 38
 
7.1%
e 34
 
6.3%
l 31
 
5.8%
b 26
 
4.8%
s 26
 
4.8%
m 25
 
4.6%
Other values (14) 168
31.2%

Keyword_FQ
Real number (ℝ)

Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.79
Minimum1
Maximum558
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T18:54:00.052188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median11
Q333.75
95-th percentile99.15
Maximum558
Range557
Interquartile range (IQR)29.75

Descriptive statistics

Standard deviation99.043231
Coefficient of variation (CV)2.4891488
Kurtosis19.232072
Mean39.79
Median Absolute Deviation (MAD)9
Skewness4.4360185
Sum3979
Variance9809.5615
MonotonicityNot monotonic
2023-12-10T18:54:00.409118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 9
 
9.0%
1 8
 
8.0%
11 7
 
7.0%
10 5
 
5.0%
4 5
 
5.0%
8 5
 
5.0%
5 4
 
4.0%
3 4
 
4.0%
13 3
 
3.0%
7 3
 
3.0%
Other values (40) 47
47.0%
ValueCountFrequency (%)
1 8
8.0%
2 9
9.0%
3 4
4.0%
4 5
5.0%
5 4
4.0%
6 3
 
3.0%
7 3
 
3.0%
8 5
5.0%
9 1
 
1.0%
10 5
5.0%
ValueCountFrequency (%)
558 1
1.0%
511 1
1.0%
502 1
1.0%
466 1
1.0%
102 1
1.0%
99 1
1.0%
78 1
1.0%
75 1
1.0%
72 1
1.0%
71 1
1.0%

FILE_NAME
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
KC_KEYWORD_TWITTER_FANCLUB_2019
100 

Length

Max length31
Median length31
Mean length31
Min length31

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKC_KEYWORD_TWITTER_FANCLUB_2019
2nd rowKC_KEYWORD_TWITTER_FANCLUB_2019
3rd rowKC_KEYWORD_TWITTER_FANCLUB_2019
4th rowKC_KEYWORD_TWITTER_FANCLUB_2019
5th rowKC_KEYWORD_TWITTER_FANCLUB_2019

Common Values

ValueCountFrequency (%)
KC_KEYWORD_TWITTER_FANCLUB_2019 100
100.0%

Length

2023-12-10T18:54:00.718913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:54:00.967971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kc_keyword_twitter_fanclub_2019 100
100.0%

BASE_YMD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 100
100.0%

Length

2023-12-10T18:54:01.153208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:54:01.348254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 100
100.0%

Interactions

2023-12-10T18:53:55.620052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T18:54:01.462571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Social_Data_Collection_Date_YMArtist_NMFanclub_NMTwitter_KEY_WKeyword_FQ
Social_Data_Collection_Date_YM1.0000.3880.3880.0000.285
Artist_NM0.3881.0001.0000.6860.195
Fanclub_NM0.3881.0001.0000.6860.195
Twitter_KEY_W0.0000.6860.6861.0000.000
Keyword_FQ0.2850.1950.1950.0001.000
2023-12-10T18:54:01.653482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Fanclub_NMArtist_NM
Fanclub_NM1.0001.000
Artist_NM1.0001.000
2023-12-10T18:54:01.815580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Keyword_FQArtist_NMFanclub_NM
Keyword_FQ1.0000.1580.158
Artist_NM0.1581.0001.000
Fanclub_NM0.1581.0001.000

Missing values

2023-12-10T18:53:56.303652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:53:56.607971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Social_Data_Collection_Date_YMCollection_CH_NMArtist_NMFanclub_NMTwitter_KEY_WKeyword_FQFILE_NAMEBASE_YMD
02017-01TwitterBTSarmybts502KC_KEYWORD_TWITTER_FANCLUB_20192019
12017-01TwitterBTSarmyarmi466KC_KEYWORD_TWITTER_FANCLUB_20192019
22017-01TwitterBTSarmybangtan75KC_KEYWORD_TWITTER_FANCLUB_20192019
32017-01TwitterBTSarmyvote59KC_KEYWORD_TWITTER_FANCLUB_20192019
42017-01TwitterBTSarmylove54KC_KEYWORD_TWITTER_FANCLUB_20192019
52017-01TwitterBTSarmyisac49KC_KEYWORD_TWITTER_FANCLUB_20192019
62017-01TwitterBTSarmybomb36KC_KEYWORD_TWITTER_FANCLUB_20192019
72017-01TwitterBTSarmywill32KC_KEYWORD_TWITTER_FANCLUB_20192019
82017-01TwitterBTSarmykpop31KC_KEYWORD_TWITTER_FANCLUB_20192019
92017-01TwitterBTSarmybighit30KC_KEYWORD_TWITTER_FANCLUB_20192019
Social_Data_Collection_Date_YMCollection_CH_NMArtist_NMFanclub_NMTwitter_KEY_WKeyword_FQFILE_NAMEBASE_YMD
902017-02Twitterblackpinkblinkrealli3KC_KEYWORD_TWITTER_FANCLUB_20192019
912017-02Twitterblackpinkblinksinc3KC_KEYWORD_TWITTER_FANCLUB_20192019
922017-02Twitterblackpinkblinklong3KC_KEYWORD_TWITTER_FANCLUB_20192019
932017-02Twitterblackpinkblinkros3KC_KEYWORD_TWITTER_FANCLUB_20192019
942017-02Twitterblackpinkblinkfli2KC_KEYWORD_TWITTER_FANCLUB_20192019
952017-02Twitterblackpinkblinkmonth2KC_KEYWORD_TWITTER_FANCLUB_20192019
962017-02Twitterblackpinkblinkheart2KC_KEYWORD_TWITTER_FANCLUB_20192019
972017-02Twitterblackpinkblinkinsta2KC_KEYWORD_TWITTER_FANCLUB_20192019
982017-02Twitterblackpinkblinkblackpinksnap2KC_KEYWORD_TWITTER_FANCLUB_20192019
992017-02Twitterblackpinkblinkinstagramv2KC_KEYWORD_TWITTER_FANCLUB_20192019