Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory50.3 B

Variable types

Categorical5
Numeric1

Alerts

Country_CD has constant value ""Constant
Collection_CH_NM has constant value ""Constant

Reproduction

Analysis started2023-12-10 09:53:49.395842
Analysis finished2023-12-10 09:53:50.343151
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2017-01
46 
2017-02
42 
2017-03
12 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017-01
2nd row2017-01
3rd row2017-01
4th row2017-01
5th row2017-01

Common Values

ValueCountFrequency (%)
2017-01 46
46.0%
2017-02 42
42.0%
2017-03 12
 
12.0%

Length

2023-12-10T18:53:50.467225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:50.714629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-01 46
46.0%
2017-02 42
42.0%
2017-03 12
 
12.0%

Country_CD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
GB
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGB
2nd rowGB
3rd rowGB
4th rowGB
5th rowGB

Common Values

ValueCountFrequency (%)
GB 100
100.0%

Length

2023-12-10T18:53:50.907764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:51.078684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
gb 100
100.0%

Collection_CH_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
community
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcommunity
2nd rowcommunity
3rd rowcommunity
4th rowcommunity
5th rowcommunity

Common Values

ValueCountFrequency (%)
community 100
100.0%

Length

2023-12-10T18:53:51.257334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:51.444299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
community 100
100.0%

Artist_NM
Categorical

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
bts
45 
exo
13 
got7
13 
blackpink
10 
twice
Other values (4)
12 

Length

Max length9
Median length3
Mean length4.17
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowblackpink
2nd rowblackpink
3rd rowblackpink
4th rowblackpink
5th rowbts

Common Values

ValueCountFrequency (%)
bts 45
45.0%
exo 13
 
13.0%
got7 13
 
13.0%
blackpink 10
 
10.0%
twice 7
 
7.0%
jhope 4
 
4.0%
mamamoo 3
 
3.0%
nct 3
 
3.0%
everglow 2
 
2.0%

Length

2023-12-10T18:53:51.664609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:51.902654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
bts 45
45.0%
exo 13
 
13.0%
got7 13
 
13.0%
blackpink 10
 
10.0%
twice 7
 
7.0%
jhope 4
 
4.0%
mamamoo 3
 
3.0%
nct 3
 
3.0%
everglow 2
 
2.0%

Community_KEY_W
Categorical

Distinct40
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
youtube
14 
bts
blackpink
 
5
comeback
 
5
amp
 
5
Other values (35)
62 

Length

Max length11
Median length10
Mean length5.54
Min length2

Unique

Unique19 ?
Unique (%)19.0%

Sample

1st rowblackpink
2nd rowyoutube
3rd rowfandom
4th rowcomeback
5th rowbts

Common Values

ValueCountFrequency (%)
youtube 14
 
14.0%
bts 9
 
9.0%
blackpink 5
 
5.0%
comeback 5
 
5.0%
amp 5
 
5.0%
jackson 4
 
4.0%
nct 4
 
4.0%
kpop 4
 
4.0%
jungkook 3
 
3.0%
jimin 3
 
3.0%
Other values (30) 44
44.0%

Length

2023-12-10T18:53:52.174461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
youtube 14
 
14.0%
bts 9
 
9.0%
blackpink 5
 
5.0%
comeback 5
 
5.0%
amp 5
 
5.0%
jackson 4
 
4.0%
nct 4
 
4.0%
kpop 4
 
4.0%
jungkook 3
 
3.0%
jimin 3
 
3.0%
Other values (30) 44
44.0%

Keyword_FQ
Real number (ℝ)

Distinct27
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.59
Minimum1
Maximum283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T18:53:52.416049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13.75
median5
Q38
95-th percentile104
Maximum283
Range282
Interquartile range (IQR)4.25

Descriptive statistics

Standard deviation45.850459
Coefficient of variation (CV)2.6066207
Kurtosis19.10572
Mean17.59
Median Absolute Deviation (MAD)2
Skewness4.3476639
Sum1759
Variance2102.2645
MonotonicityNot monotonic
2023-12-10T18:53:52.646682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
4 19
19.0%
6 11
11.0%
2 10
10.0%
3 9
9.0%
7 8
8.0%
5 7
 
7.0%
1 6
 
6.0%
8 6
 
6.0%
9 3
 
3.0%
11 2
 
2.0%
Other values (17) 19
19.0%
ValueCountFrequency (%)
1 6
 
6.0%
2 10
10.0%
3 9
9.0%
4 19
19.0%
5 7
 
7.0%
6 11
11.0%
7 8
8.0%
8 6
 
6.0%
9 3
 
3.0%
10 1
 
1.0%
ValueCountFrequency (%)
283 1
1.0%
231 1
1.0%
206 1
1.0%
182 1
1.0%
104 2
2.0%
35 1
1.0%
32 1
1.0%
28 1
1.0%
25 1
1.0%
22 1
1.0%

Interactions

2023-12-10T18:53:49.754326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T18:53:52.817740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Social_Data_Collection_Date_YMArtist_NMCommunity_KEY_WKeyword_FQ
Social_Data_Collection_Date_YM1.0000.0000.0000.301
Artist_NM0.0001.0000.6920.000
Community_KEY_W0.0000.6921.0000.000
Keyword_FQ0.3010.0000.0001.000
2023-12-10T18:53:53.011532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Artist_NMSocial_Data_Collection_Date_YMCommunity_KEY_W
Artist_NM1.0000.0000.263
Social_Data_Collection_Date_YM0.0001.0000.000
Community_KEY_W0.2630.0001.000
2023-12-10T18:53:53.180305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Keyword_FQSocial_Data_Collection_Date_YMArtist_NMCommunity_KEY_W
Keyword_FQ1.0000.2060.0000.000
Social_Data_Collection_Date_YM0.2061.0000.0000.000
Artist_NM0.0000.0001.0000.263
Community_KEY_W0.0000.0000.2631.000

Missing values

2023-12-10T18:53:50.057545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:53:50.257133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Social_Data_Collection_Date_YMCountry_CDCollection_CH_NMArtist_NMCommunity_KEY_WKeyword_FQ
02017-01GBcommunityblackpinkblackpink18
12017-01GBcommunityblackpinkyoutube11
22017-01GBcommunityblackpinkfandom4
32017-01GBcommunityblackpinkcomeback2
42017-01GBcommunitybtsbts182
52017-01GBcommunitybtsyoutube104
62017-01GBcommunitybtsmbc20
72017-01GBcommunitybtsamp11
82017-01GBcommunitybtsbangtan7
92017-01GBcommunitybtsbtsbighit6
Social_Data_Collection_Date_YMCountry_CDCollection_CH_NMArtist_NMCommunity_KEY_WKeyword_FQ
902017-03GBcommunityblackpink불장난2
912017-03GBcommunitybtsbts283
922017-03GBcommunitybtsyoutube206
932017-03GBcommunitybtsjimin25
942017-03GBcommunitybtsamp22
952017-03GBcommunitybtsjackson17
962017-03GBcommunitybtskpop12
972017-03GBcommunitybtsjungkook10
982017-03GBcommunitybtsjin9
992017-03GBcommunitybtsmbc9