Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.6 KiB |
Average record size in memory | 67.3 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 5 |
Text | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국문화정보원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=4724c500-6271-11ea-8b67-7b32ce18203a |
Collection_CH_NM has constant value "" | Constant |
FILE_NAME has constant value "" | Constant |
BASE_YMD has constant value "" | Constant |
Fanclub_NM is highly overall correlated with Artist_NM | High correlation |
Artist_NM is highly overall correlated with Fanclub_NM | High correlation |
Reproduction
Analysis started | 2023-12-10 09:53:55.121786 |
---|---|
Analysis finished | 2023-12-10 09:53:56.709180 |
Duration | 1.59 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2017-01-01 00:00:00 |
---|---|
Maximum | 2017-02-01 00:00:00 |
Collection_CH_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
100 |
Artist_NM
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
blackpink | |
---|---|
BTS | |
got7 | |
EXO | |
TXT |
Length
Max length | 9 |
---|---|
Median length | 3 |
Mean length | 5.09 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | BTS |
---|---|
2nd row | BTS |
3rd row | BTS |
4th row | BTS |
5th row | BTS |
Common Values
Value | Count | Frequency (%) |
blackpink | 32 | |
BTS | 26 | |
got7 | 17 | |
EXO | 13 | |
TXT | 12 | 12.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
blackpink | 32 | |
bts | 26 | |
got7 | 17 | |
exo | 13 | |
txt | 12 | 12.0% |
Fanclub_NM
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
blink | |
---|---|
army | |
igot7 | |
exol | |
moa |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.37 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | army |
---|---|
2nd row | army |
3rd row | army |
4th row | army |
5th row | army |
Common Values
Value | Count | Frequency (%) |
blink | 32 | |
army | 26 | |
igot7 | 17 | |
exol | 13 | |
moa | 12 | 12.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
blink | 32 | |
army | 26 | |
igot7 | 17 | |
exol | 13 | |
moa | 12 | 12.0% |
Twitter_KEY_W
Text
Distinct | 73 |
---|---|
Distinct (%) | 73.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
bts | 5 | 5.0% |
love | 5 | 5.0% |
will | 4 | 4.0% |
amp | 4 | 4.0% |
armi | 3 | 3.0% |
kpop | 3 | 3.0% |
stan | 3 | 3.0% |
bangtan | 2 | 2.0% |
watch | 2 | 2.0% |
now | 2 | 2.0% |
Other values (63) | 67 |
Most occurring characters
Value | Count | Frequency (%) |
a | 57 | 10.6% |
n | 46 | 8.6% |
i | 46 | 8.6% |
t | 41 | 7.6% |
o | 38 | 7.1% |
e | 34 | 6.3% |
l | 31 | 5.8% |
b | 26 | 4.8% |
s | 26 | 4.8% |
m | 25 | 4.6% |
Other values (14) | 168 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 538 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 57 | 10.6% |
n | 46 | 8.6% |
i | 46 | 8.6% |
t | 41 | 7.6% |
o | 38 | 7.1% |
e | 34 | 6.3% |
l | 31 | 5.8% |
b | 26 | 4.8% |
s | 26 | 4.8% |
m | 25 | 4.6% |
Other values (14) | 168 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 538 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 57 | 10.6% |
n | 46 | 8.6% |
i | 46 | 8.6% |
t | 41 | 7.6% |
o | 38 | 7.1% |
e | 34 | 6.3% |
l | 31 | 5.8% |
b | 26 | 4.8% |
s | 26 | 4.8% |
m | 25 | 4.6% |
Other values (14) | 168 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 538 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 57 | 10.6% |
n | 46 | 8.6% |
i | 46 | 8.6% |
t | 41 | 7.6% |
o | 38 | 7.1% |
e | 34 | 6.3% |
l | 31 | 5.8% |
b | 26 | 4.8% |
s | 26 | 4.8% |
m | 25 | 4.6% |
Other values (14) | 168 |
Keyword_FQ
Real number (ℝ)
Distinct | 50 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.79 |
Minimum | 1 |
---|---|
Maximum | 558 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 11 |
Q3 | 33.75 |
95-th percentile | 99.15 |
Maximum | 558 |
Range | 557 |
Interquartile range (IQR) | 29.75 |
Descriptive statistics
Standard deviation | 99.043231 |
---|---|
Coefficient of variation (CV) | 2.4891488 |
Kurtosis | 19.232072 |
Mean | 39.79 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 4.4360185 |
Sum | 3979 |
Variance | 9809.5615 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 9 | 9.0% |
1 | 8 | 8.0% |
11 | 7 | 7.0% |
10 | 5 | 5.0% |
4 | 5 | 5.0% |
8 | 5 | 5.0% |
5 | 4 | 4.0% |
3 | 4 | 4.0% |
13 | 3 | 3.0% |
7 | 3 | 3.0% |
Other values (40) | 47 |
Value | Count | Frequency (%) |
1 | 8 | |
2 | 9 | |
3 | 4 | |
4 | 5 | |
5 | 4 | |
6 | 3 | 3.0% |
7 | 3 | 3.0% |
8 | 5 | |
9 | 1 | 1.0% |
10 | 5 |
Value | Count | Frequency (%) |
558 | 1 | |
511 | 1 | |
502 | 1 | |
466 | 1 | |
102 | 1 | |
99 | 1 | |
78 | 1 | |
75 | 1 | |
72 | 1 | |
71 | 1 |
FILE_NAME
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
KC_KEYWORD_TWITTER_FANCLUB_2019 |
---|
Length
Max length | 31 |
---|---|
Median length | 31 |
Mean length | 31 |
Min length | 31 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_KEYWORD_TWITTER_FANCLUB_2019 |
---|---|
2nd row | KC_KEYWORD_TWITTER_FANCLUB_2019 |
3rd row | KC_KEYWORD_TWITTER_FANCLUB_2019 |
4th row | KC_KEYWORD_TWITTER_FANCLUB_2019 |
5th row | KC_KEYWORD_TWITTER_FANCLUB_2019 |
Common Values
Value | Count | Frequency (%) |
KC_KEYWORD_TWITTER_FANCLUB_2019 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kc_keyword_twitter_fanclub_2019 | 100 |
BASE_YMD
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2019 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019 | 100 |
Social_Data_Collection_Date_YM | Artist_NM | Fanclub_NM | Twitter_KEY_W | Keyword_FQ | |
---|---|---|---|---|---|
Social_Data_Collection_Date_YM | 1.000 | 0.388 | 0.388 | 0.000 | 0.285 |
Artist_NM | 0.388 | 1.000 | 1.000 | 0.686 | 0.195 |
Fanclub_NM | 0.388 | 1.000 | 1.000 | 0.686 | 0.195 |
Twitter_KEY_W | 0.000 | 0.686 | 0.686 | 1.000 | 0.000 |
Keyword_FQ | 0.285 | 0.195 | 0.195 | 0.000 | 1.000 |
Fanclub_NM | Artist_NM | |
---|---|---|
Fanclub_NM | 1.000 | 1.000 |
Artist_NM | 1.000 | 1.000 |
Keyword_FQ | Artist_NM | Fanclub_NM | |
---|---|---|---|
Keyword_FQ | 1.000 | 0.158 | 0.158 |
Artist_NM | 0.158 | 1.000 | 1.000 |
Fanclub_NM | 0.158 | 1.000 | 1.000 |
Social_Data_Collection_Date_YM | Collection_CH_NM | Artist_NM | Fanclub_NM | Twitter_KEY_W | Keyword_FQ | FILE_NAME | BASE_YMD | |
---|---|---|---|---|---|---|---|---|
0 | 2017-01 | BTS | army | bts | 502 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
1 | 2017-01 | BTS | army | armi | 466 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
2 | 2017-01 | BTS | army | bangtan | 75 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
3 | 2017-01 | BTS | army | vote | 59 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
4 | 2017-01 | BTS | army | love | 54 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
5 | 2017-01 | BTS | army | isac | 49 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
6 | 2017-01 | BTS | army | bomb | 36 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
7 | 2017-01 | BTS | army | will | 32 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
8 | 2017-01 | BTS | army | kpop | 31 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
9 | 2017-01 | BTS | army | bighit | 30 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 |
Social_Data_Collection_Date_YM | Collection_CH_NM | Artist_NM | Fanclub_NM | Twitter_KEY_W | Keyword_FQ | FILE_NAME | BASE_YMD | |
---|---|---|---|---|---|---|---|---|
90 | 2017-02 | blackpink | blink | realli | 3 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
91 | 2017-02 | blackpink | blink | sinc | 3 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
92 | 2017-02 | blackpink | blink | long | 3 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
93 | 2017-02 | blackpink | blink | ros | 3 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
94 | 2017-02 | blackpink | blink | fli | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
95 | 2017-02 | blackpink | blink | month | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
96 | 2017-02 | blackpink | blink | heart | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
97 | 2017-02 | blackpink | blink | insta | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
98 | 2017-02 | blackpink | blink | blackpinksnap | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 | |
99 | 2017-02 | blackpink | blink | instagramv | 2 | KC_KEYWORD_TWITTER_FANCLUB_2019 | 2019 |