Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.8 KiB |
Average record size in memory | 59.3 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 4 |
Text | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | Korea Culture Information Service Agency (한국문화정보원) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=18473dd0-6271-11ea-8b67-7b32ce18203a |
Alerts
Social_Data_Collection_Date_YM has constant value "" | Constant |
Collection_CH_NM has constant value "" | Constant |
FILE_NAME has constant value "KC_KEYWORD_TWITTER_ENTERTAINMENT_2019" | Constant |
BASE_YMD has constant value "2019" | Constant |
Reproduction
Analysis started | 2023-12-10 10:13:07.481435 |
---|---|
Analysis finished | 2023-12-10 10:13:08.491763 |
Duration | 1.01 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
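A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used for this run (that is what config.json records): the CSV file name, the report title, and the output path are hypothetical, and it assumes pandas and ydata-profiling 4.x are installed.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV file and write the report to disk (sketch; paths are hypothetical)."""
    # Imported lazily so the module loads even without the optional dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Dataset profile")  # title is an assumption
    profile.to_file(out_path)  # output format follows the file extension
    return Path(out_path)


# Hypothetical usage, with an illustrative file name:
# build_report("KC_KEYWORD_TWITTER_ENTERTAINMENT_2019.csv")
```

Timestamps and duration will differ between runs, so only the statistics themselves should be expected to match.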
Social_Data_Collection_Date_YM
Date
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2017-01-01 00:00:00 |
---|---|
Maximum | 2017-01-01 00:00:00 |
Collection_CH_NM
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
100 |
Entertainment_NM
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
rbw | 18 |
---|---|
yg | 16 |
jyp | 15 |
starship | 15 |
cube | 14 |
Other values (2) | 22 |
Length
Max length | 8 |
---|---|
Median length | 6 |
Mean length | 3.95 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | yg |
---|---|
2nd row | yg |
3rd row | yg |
4th row | yg |
5th row | yg |
Common Values
Value | Count | Frequency (%) |
rbw | 18 | 18.0% |
yg | 16 | 16.0% |
jyp | 15 | 15.0% |
starship | 15 | 15.0% |
cube | 14 | 14.0% |
bighit | 11 | 11.0% |
cj | 11 | 11.0% |
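The length and frequency figures for Entertainment_NM can be cross-checked from the counts in the Common Values table. The following is a small sketch using only the standard library; the variable names are illustrative and the counts are copied from the table, not read from the original data.

```python
# Counts per agency label, copied from the Common Values table.
counts = {
    "rbw": 18, "yg": 16, "jyp": 15, "starship": 15,
    "cube": 14, "bighit": 11, "cj": 11,
}

total = sum(counts.values())
lengths = {label: len(label) for label in counts}

# Mean label length, weighted by how often each label occurs.
mean_length = sum(lengths[label] * n for label, n in counts.items()) / total
# Share of rows per label, in percent.
frequency_pct = {label: 100 * n / total for label, n in counts.items()}

print(total)                                          # 100
print(round(mean_length, 2))                          # 3.95
print(min(lengths.values()), max(lengths.values()))   # 2 8
```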
Twitter_KEY_W
Text
Distinct | 82 |
---|---|
Distinct (%) | 82.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
bts | 4 | 4.0% |
jackson | 3 | 3.0% |
video | 3 | 3.0% |
youtube | 3 | 3.0% |
amp | 3 | 3.0% |
comeback | 3 | 3.0% |
kpop | 3 | 3.0% |
best | 2 | 2.0% |
eampm | 2 | 2.0% |
entertain | 2 | 2.0% |
Other values (72) | 72 | 72.0% |
Most occurring characters
Value | Count | Frequency (%) |
o | 51 | 9.3% |
e | 51 | 9.3% |
a | 48 | 8.7% |
t | 46 | 8.4% |
n | 36 | 6.5% |
c | 30 | 5.5% |
m | 30 | 5.5% |
i | 29 | 5.3% |
b | 28 | 5.1% |
p | 24 | 4.4% |
Other values (14) | 177 | 32.2% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 550 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 51 | 9.3% |
e | 51 | 9.3% |
a | 48 | 8.7% |
t | 46 | 8.4% |
n | 36 | 6.5% |
c | 30 | 5.5% |
m | 30 | 5.5% |
i | 29 | 5.3% |
b | 28 | 5.1% |
p | 24 | 4.4% |
Other values (14) | 177 | 32.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 550 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 51 | 9.3% |
e | 51 | 9.3% |
a | 48 | 8.7% |
t | 46 | 8.4% |
n | 36 | 6.5% |
c | 30 | 5.5% |
m | 30 | 5.5% |
i | 29 | 5.3% |
b | 28 | 5.1% |
p | 24 | 4.4% |
Other values (14) | 177 | 32.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 550 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 51 | 9.3% |
e | 51 | 9.3% |
a | 48 | 8.7% |
t | 46 | 8.4% |
n | 36 | 6.5% |
c | 30 | 5.5% |
m | 30 | 5.5% |
i | 29 | 5.3% |
b | 28 | 5.1% |
p | 24 | 4.4% |
Other values (14) | 177 | 32.2% |
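Character-level tables like the ones above can be approximated with the standard library. The sketch below is not the profiler's own implementation: it counts characters and Unicode general categories over the ten keywords visible in the first sample rows of this report, so its numbers will not match the full-column totals (550 characters). Script and block lookups are omitted because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Keywords from the first ten sample rows of this report (not the full column).
keywords = [
    "blackpink", "bts", "kpop", "ygentoffici", "jackson",
    "video", "photo", "youtube", "scott", "amp",
]

text = "".join(keywords)
char_counts = Counter(text)
# General category codes, e.g. "Ll" for lowercase letters.
category_counts = Counter(unicodedata.category(ch) for ch in text)

total_chars = sum(char_counts.values())
for ch, n in char_counts.most_common(5):
    print(f"{ch} | {n} | {100 * n / total_chars:.1f}%")
print(dict(category_counts))
```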
Keyword_FQ
Real number (ℝ)
Distinct | 40 |
---|---|
Distinct (%) | 40.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 22.86 |
Minimum | 2 |
---|---|
Maximum | 225 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2 |
Q1 | 2 |
median | 13 |
Q3 | 25 |
95-th percentile | 92.65 |
Maximum | 225 |
Range | 223 |
Interquartile range (IQR) | 23 |
Descriptive statistics
Standard deviation | 36.703293 |
---|---|
Coefficient of variation (CV) | 1.6055684 |
Kurtosis | 14.053095 |
Mean | 22.86 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 3.4691468 |
Sum | 2286 |
Variance | 1347.1317 |
Monotonicity | Not monotonic |
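Several of the Keyword_FQ figures are derived from others, which gives a quick consistency check on the report. The sketch below assumes the usual definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, range = max - min, IQR = Q3 - Q1); the variable names are illustrative and the inputs are copied from the tables above.

```python
import math

# Values copied from the Keyword_FQ tables in this report.
n, total = 100, 2286
std = 36.703293
minimum, q1, q3, maximum = 2, 2, 25, 225

mean = total / n                  # reported: 22.86
variance = std ** 2               # reported: 1347.1317
cv = std / mean                   # reported: 1.6055684
value_range = maximum - minimum   # reported: 223
iqr = q3 - q1                     # reported: 23

print(math.isclose(mean, 22.86))
print(math.isclose(variance, 1347.1317, rel_tol=1e-6))
print(math.isclose(cv, 1.6055684, rel_tol=1e-6))
print(value_range, iqr)
```

A mean (22.86) well above the median (13), together with a skewness of about 3.47, is consistent with a long right tail driven by a few very frequent keywords.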
Common Values
Value | Count | Frequency (%) |
2 | 30 | 30.0% |
4 | 5 | 5.0% |
14 | 5 | 5.0% |
16 | 5 | 5.0% |
3 | 4 | 4.0% |
25 | 4 | 4.0% |
37 | 3 | 3.0% |
36 | 3 | 3.0% |
13 | 3 | 3.0% |
19 | 2 | 2.0% |
Other values (30) | 36 | 36.0% |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 30 | 30.0% |
3 | 4 | 4.0% |
4 | 5 | 5.0% |
5 | 1 | 1.0% |
7 | 2 | 2.0% |
8 | 1 | 1.0% |
9 | 2 | 2.0% |
10 | 1 | 1.0% |
11 | 2 | 2.0% |
12 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
225 | 1 | 1.0% |
194 | 1 | 1.0% |
145 | 1 | 1.0% |
109 | 1 | 1.0% |
105 | 1 | 1.0% |
92 | 1 | 1.0% |
76 | 1 | 1.0% |
75 | 1 | 1.0% |
60 | 1 | 1.0% |
53 | 1 | 1.0% |
FILE_NAME
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 100 |
---|---|
Length
Max length | 37 |
---|---|
Median length | 37 |
Mean length | 37 |
Min length | 37 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 |
---|---|
2nd row | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 |
3rd row | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 |
4th row | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 |
5th row | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 |
Common Values
Value | Count | Frequency (%) |
KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 100 | 100.0% |
BASE_YMD
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019 | 100 |
---|---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2019 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 100 | 100.0% |
|  | Entertainment_NM | Twitter_KEY_W | Keyword_FQ |
---|---|---|---|
Entertainment_NM | 1.000 | 0.000 | 0.432 |
Twitter_KEY_W | 0.000 | 1.000 | 0.000 |
Keyword_FQ | 0.432 | 0.000 | 1.000 |
|  | Keyword_FQ | Entertainment_NM |
---|---|---|
Keyword_FQ | 1.000 | 0.245 |
Entertainment_NM | 0.245 | 1.000 |
First rows
|  | Social_Data_Collection_Date_YM | Collection_CH_NM | Entertainment_NM | Twitter_KEY_W | Keyword_FQ | FILE_NAME | BASE_YMD |
---|---|---|---|---|---|---|---|
0 | 2017-01 |  | yg | blackpink | 225 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
1 | 2017-01 |  | yg | bts | 75 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
2 | 2017-01 |  | yg | kpop | 60 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
3 | 2017-01 |  | yg | ygentoffici | 40 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
4 | 2017-01 |  | yg | jackson | 39 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
5 | 2017-01 |  | yg | video | 37 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
6 | 2017-01 |  | yg | photo | 36 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
7 | 2017-01 |  | yg | youtube | 36 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
8 | 2017-01 |  | yg | scott | 35 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
9 | 2017-01 |  | yg | amp | 32 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
Last rows
|  | Social_Data_Collection_Date_YM | Collection_CH_NM | Entertainment_NM | Twitter_KEY_W | Keyword_FQ | FILE_NAME | BASE_YMD |
---|---|---|---|---|---|---|---|
90 | 2017-01 |  | cube | comeback | 4 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
91 | 2017-01 |  | cube | pentagon | 4 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
92 | 2017-01 |  | cube | cubetv | 3 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
93 | 2017-01 |  | cube | good | 3 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
94 | 2017-01 |  | cube | concept | 3 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
95 | 2017-01 |  | cube | week | 2 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
96 | 2017-01 |  | cube | cut | 2 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
97 | 2017-01 |  | cube | full | 2 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
98 | 2017-01 |  | cube | vid | 2 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |
99 | 2017-01 |  | cube | amp | 2 | KC_KEYWORD_TWITTER_ENTERTAINMENT_2019 | 2019 |