Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory42.3 B

Variable types

Categorical4
Numeric1

Alerts

Country_CD has constant value ""Constant
Collection_CH_NM has constant value ""Constant
Keyword_FQ is highly overall correlated with News_KEY_WHigh correlation
News_KEY_W is highly overall correlated with Keyword_FQHigh correlation

Reproduction

Analysis started2023-12-10 10:08:19.484081
Analysis finished2023-12-10 10:08:20.362565
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2017-02
15 
2017-05
14 
2017-07
14 
2017-01
11 
2017-04
11 
Other values (4)
35 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017-01
2nd row2017-01
3rd row2017-01
4th row2017-01
5th row2017-01

Common Values

ValueCountFrequency (%)
2017-02 15
15.0%
2017-05 14
14.0%
2017-07 14
14.0%
2017-01 11
11.0%
2017-04 11
11.0%
2017-08 11
11.0%
2017-09 9
9.0%
2017-03 8
8.0%
2017-06 7
7.0%

Length

2023-12-10T19:08:20.521970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:08:20.778903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-02 15
15.0%
2017-05 14
14.0%
2017-07 14
14.0%
2017-01 11
11.0%
2017-04 11
11.0%
2017-08 11
11.0%
2017-09 9
9.0%
2017-03 8
8.0%
2017-06 7
7.0%

Country_CD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
US
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS

Common Values

ValueCountFrequency (%)
US 100
100.0%

Length

2023-12-10T19:08:21.095686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:08:21.270642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
us 100
100.0%

Collection_CH_NM
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
NEWS
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNEWS
2nd rowNEWS
3rd rowNEWS
4th rowNEWS
5th rowNEWS

Common Values

ValueCountFrequency (%)
NEWS 100
100.0%

Length

2023-12-10T19:08:21.445832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:08:21.629019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
news 100
100.0%

News_KEY_W
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
kpop
music
bts
korean
song
Other values (22)
58 

Length

Max length11
Median length9
Mean length5.33
Min length3

Unique

Unique8 ?
Unique (%)8.0%

Sample

1st rowkpop
2nd rowmusic
3rd rowbts
4th rowkorean
5th rowkorea

Common Values

ValueCountFrequency (%)
kpop 9
 
9.0%
music 9
 
9.0%
bts 8
 
8.0%
korean 8
 
8.0%
song 8
 
8.0%
korea 7
 
7.0%
fans 7
 
7.0%
album 6
 
6.0%
facebook 5
 
5.0%
awards 4
 
4.0%
Other values (17) 29
29.0%

Length

2023-12-10T19:08:21.848999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
kpop 9
 
9.0%
music 9
 
9.0%
bts 8
 
8.0%
korean 8
 
8.0%
song 8
 
8.0%
korea 7
 
7.0%
fans 7
 
7.0%
album 6
 
6.0%
facebook 5
 
5.0%
awards 4
 
4.0%
Other values (17) 29
29.0%

Keyword_FQ
Real number (ℝ)

HIGH CORRELATION 

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1507.54
Minimum302
Maximum14345
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:08:22.479747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum302
5-th percentile388.95
Q1571.75
median787
Q31304
95-th percentile5007.2
Maximum14345
Range14043
Interquartile range (IQR)732.25

Descriptive statistics

Standard deviation2116.3391
Coefficient of variation (CV)1.4038361
Kurtosis17.034893
Mean1507.54
Median Absolute Deviation (MAD)272.5
Skewness3.8310912
Sum150754
Variance4478891.4
MonotonicityNot monotonic
2023-12-10T19:08:23.043488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
875 2
 
2.0%
729 2
 
2.0%
630 1
 
1.0%
663 1
 
1.0%
695 1
 
1.0%
817 1
 
1.0%
1072 1
 
1.0%
1118 1
 
1.0%
1271 1
 
1.0%
1933 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
302 1
1.0%
354 1
1.0%
371 1
1.0%
385 1
1.0%
388 1
1.0%
389 1
1.0%
415 1
1.0%
418 1
1.0%
433 1
1.0%
470 1
1.0%
ValueCountFrequency (%)
14345 1
1.0%
10225 1
1.0%
8296 1
1.0%
7602 1
1.0%
6987 1
1.0%
4903 1
1.0%
4022 1
1.0%
3841 1
1.0%
3683 1
1.0%
3005 1
1.0%

Interactions

2023-12-10T19:08:19.886314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:08:23.351717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Social_Data_Collection_Date_YMNews_KEY_WKeyword_FQ
Social_Data_Collection_Date_YM1.0000.0000.307
News_KEY_W0.0001.0000.891
Keyword_FQ0.3070.8911.000
2023-12-10T19:08:23.546049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Social_Data_Collection_Date_YMNews_KEY_W
Social_Data_Collection_Date_YM1.0000.000
News_KEY_W0.0001.000
2023-12-10T19:08:23.792678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Keyword_FQSocial_Data_Collection_Date_YMNews_KEY_W
Keyword_FQ1.0000.1550.559
Social_Data_Collection_Date_YM0.1551.0000.000
News_KEY_W0.5590.0001.000

Missing values

2023-12-10T19:08:20.107865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:08:20.288954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Social_Data_Collection_Date_YMCountry_CDCollection_CH_NMNews_KEY_WKeyword_FQ
02017-01USNEWSkpop947
12017-01USNEWSmusic800
22017-01USNEWSbts641
32017-01USNEWSkorean608
42017-01USNEWSkorea508
52017-01USNEWSawards490
62017-01USNEWSfans478
72017-01USNEWSsong389
82017-01USNEWSseoul371
92017-01USNEWSalbum354
Social_Data_Collection_Date_YMCountry_CDCollection_CH_NMNews_KEY_WKeyword_FQ
902017-08USNEWSfacebook385
912017-09USNEWSkorea2082
922017-09USNEWSbts1618
932017-09USNEWSkorean1561
942017-09USNEWSmusic1035
952017-09USNEWSkpop998
962017-09USNEWSalbum831
972017-09USNEWSfans707
982017-09USNEWSfacebook535
992017-09USNEWSsong523