Overview

Dataset statistics

Number of variables: 5
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.1 KiB
Average record size in memory: 42.3 B
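
The counts above can be recomputed from the raw rows. The following is a minimal sketch, assuming the data is held as a list of dictionaries, that a missing cell is a `None` value, and that a duplicate row is one identical to an earlier row. The helper name `dataset_statistics` is hypothetical and is not part of the profiling tool; the two example rows are taken from the sample table in this report.

```python
def dataset_statistics(rows):
    """Recompute overview counts for a list of dict records (hypothetical helper)."""
    columns = list(rows[0].keys()) if rows else []
    n_cells = len(rows) * len(columns)
    # Assumption: a cell is missing when its value is None.
    missing = sum(row.get(col) is None for row in rows for col in columns)
    # Assumption: a row is a duplicate when an identical row appeared earlier.
    seen, duplicates = set(), 0
    for row in rows:
        key = tuple(row.get(col) for col in columns)
        duplicates += key in seen
        seen.add(key)

    def pct(part, whole):
        return round(100.0 * part / whole, 1) if whole else 0.0

    return {
        "variables": len(columns),
        "observations": len(rows),
        "missing_cells": missing,
        "missing_cells_pct": pct(missing, n_cells),
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": pct(duplicates, len(rows)),
    }


# First two rows of the report's sample table.
rows = [
    {"Social_Data_Collection_Date_YM": "2017-01", "Country_CD": "gb",
     "Collection_CH_NM": "NEWS", "News_KEY_W": "korea", "Keyword_FQ": 78},
    {"Social_Data_Collection_Date_YM": "2017-01", "Country_CD": "gb",
     "Collection_CH_NM": "NEWS", "News_KEY_W": "song", "Keyword_FQ": 70},
]
stats = dataset_statistics(rows)
print(stats)
```

Run on all 100 observations, the same logic would be expected to give 5 variables, 0 missing cells and 0 duplicate rows, in line with the figures reported here.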

Variable types

DateTime: 1
Categorical: 3
Numeric: 1

Alerts

Country_CD has constant value "gb" (Constant)
Collection_CH_NM has constant value "NEWS" (Constant)

Reproduction

Analysis started: 2023-12-10 10:15:14.636286
Analysis finished: 2023-12-10 10:15:15.108200
Duration: 0.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
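
The reported duration follows directly from the two timestamps. The sketch below checks that arithmetic and also shows one way a report like this might be regenerated, assuming `pandas` and `ydata-profiling` are installed. The function names, the report title, and the `csv_path` and `html_path` parameters are hypothetical; the report does not state where its input data came from.

```python
from datetime import datetime


def elapsed_seconds(started, finished):
    """Seconds between two ISO-formatted timestamps."""
    delta = datetime.fromisoformat(finished) - datetime.fromisoformat(started)
    return delta.total_seconds()


def build_report(csv_path, html_path):
    """Profile a CSV file and write an HTML report.

    Assumes pandas and ydata-profiling are installed; both paths are
    hypothetical placeholders supplied by the caller.
    """
    # Imported lazily so the timing helper works without these packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Profiling report")
    report.to_file(html_path)


duration = elapsed_seconds("2023-12-10 10:15:14.636286",
                           "2023-12-10 10:15:15.108200")
print(f"{duration:.2f} seconds")  # 0.47 seconds
```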

Variables

Social_Data_Collection_Date_YM
DateTime

Distinct: 9
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
Minimum: 2017-01-01 00:00:00
Maximum: 2017-09-01 00:00:00
Histogram with fixed size bins (bins=9)
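
The minimum and maximum fall at midnight on the first day of a month, which is what parsing a year-month string such as `2017-01` produces. A small sketch of that parsing, assuming the column holds `YYYY-MM` strings (as in the sample table) and that the nine distinct values are the nine calendar months from January to September 2017:

```python
from datetime import datetime

# Assumption: one YYYY-MM string per calendar month in the reported range.
months = [f"2017-{month:02d}" for month in range(1, 10)]

# strptime fills the unspecified day and time with the 1st at 00:00:00.
parsed = [datetime.strptime(value, "%Y-%m") for value in months]

distinct = len(set(parsed))
minimum, maximum = min(parsed), max(parsed)
print(distinct, minimum, maximum)
```

With nine distinct months and nine bins, each histogram bin would cover roughly one month under these assumptions.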

Country_CD
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
gb 100

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: gb
2nd row: gb
3rd row: gb
4th row: gb
5th row: gb

Common Values

Value  Count  Frequency (%)
gb     100    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
gb     100    100.0%

Collection_CH_NM
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
NEWS 100

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: NEWS
2nd row: NEWS
3rd row: NEWS
4th row: NEWS
5th row: NEWS

Common Values

Value  Count  Frequency (%)
NEWS   100    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
news   100    100.0%

News_KEY_W
Categorical

Distinct: 35
Distinct (%): 35.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
korean
redvelvet
song
music
kpop
Other values (30) 61

Length

Max length: 12
Median length: 11
Mean length: 5.5
Min length: 3

Unique

Unique: 18
Unique (%): 18.0%

Sample

1st row: korea
2nd row: song
3rd row: korean
4th row: music
5th row: kpop

Common Values

Value              Count  Frequency (%)
korean             8      8.0%
redvelvet          8      8.0%
song               8      8.0%
music              8      8.0%
kpop               7      7.0%
album              7      7.0%
bts                7      7.0%
korea              6      6.0%
fans               4      4.0%
billboard          3      3.0%
Other values (25)  34     34.0%
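
The table lists the ten most frequent values and folds the remaining 25 distinct values into one "Other values" row, so the counts add up to 66 + 34 = 100. One way such a table could be built is sketched below with the standard library. The helper name `common_values` and the short keyword list are hypothetical and are used only to illustrate the roll-up.

```python
from collections import Counter


def common_values(values, top_n=10):
    """Return (value, count, percent) rows, folding the tail into 'Other values'."""
    counts = Counter(values)
    total = len(values)
    top = counts.most_common(top_n)
    rows = [(value, count, 100.0 * count / total) for value, count in top]
    # Everything outside the top_n entries is rolled into a single row.
    n_other = len(counts) - len(top)
    if n_other > 0:
        other_count = total - sum(count for _, count in top)
        rows.append((f"Other values ({n_other})", other_count,
                     100.0 * other_count / total))
    return rows


# Hypothetical keyword list, much shorter than the real column.
keywords = ["korean"] * 3 + ["song"] * 2 + ["bts", "fans"]
table = common_values(keywords, top_n=2)
for value, count, percent in table:
    print(f"{value:<18} {count:>3} {percent:5.1f}%")
```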

Length

Histogram of lengths of the category
Value              Count  Frequency (%)
korean             8      8.0%
song               8      8.0%
music              8      8.0%
redvelvet          8      8.0%
kpop               7      7.0%
album              7      7.0%
bts                7      7.0%
korea              6      6.0%
fans               4      4.0%
billboard          3      3.0%
Other values (25)  34     34.0%

Keyword_FQ
Real number (ℝ)

Distinct: 70
Distinct (%): 70.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 82.09
Minimum: 24
Maximum: 244
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 24
5-th percentile: 30.95
Q1: 43
Median: 63.5
Q3: 111.5
95-th percentile: 189.35
Maximum: 244
Range: 220
Interquartile range (IQR): 68.5
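
Range and IQR are simple differences of the values above (244 - 24 and 111.5 - 43). The two fractional percentiles are consistent with linear interpolation between neighbouring sorted values, although the report does not state which method it uses. The sketch below illustrates this under that assumption. The six smallest and six largest values are taken from the extreme-value tables in this report, while the 88 middle values are placeholder padding, so only the 5th and 95th percentiles are meaningful here.

```python
def percentile(sorted_values, p):
    """Percentile by linear interpolation between neighbouring sorted values."""
    pos = (p / 100) * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


minimum, q1, q3, maximum = 24, 43, 111.5, 244
value_range = maximum - minimum  # 220
iqr = q3 - q1                    # 68.5

# Six smallest and six largest values from the report; the middle 88
# entries are placeholder padding and do not reflect the real data.
values = [24, 26, 26, 30, 30, 31] + [100] * 88 + [189, 196, 200, 229, 234, 244]
p5 = percentile(values, 5)
p95 = percentile(values, 95)
print(value_range, iqr, round(p5, 2), round(p95, 2))
```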

Descriptive statistics

Standard deviation: 51.820415
Coefficient of variation (CV): 0.63126344
Kurtosis: 1.0424281
Mean: 82.09
Median Absolute Deviation (MAD): 24
Skewness: 1.2955879
Sum: 8209
Variance: 2685.3555
Monotonicity: Not monotonic
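
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A short check using only the numbers reported above:

```python
n_observations = 100
total = 8209
std = 51.820415

mean = total / n_observations  # sum / n
variance = std ** 2            # square of the standard deviation
cv = std / mean                # coefficient of variation

print(mean, round(variance, 4), round(cv, 6))
```

The results agree with the reported mean, variance and CV up to the rounding of the printed standard deviation.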
Histogram with fixed size bins (bins=50)
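
With 50 equal-width bins over the observed range of 24 to 244, each bin is (244 - 24) / 50 = 4.4 units wide. The sketch below shows one way a value could be assigned to a bin, assuming the common convention that every bin is closed on the left and the last bin also includes the maximum. The function name `bin_index` is hypothetical, and the report does not document its exact binning rule.

```python
def bin_index(value, lo=24, hi=244, bins=50):
    """Index of the fixed-width bin containing value (last bin includes hi)."""
    if not lo <= value <= hi:
        raise ValueError("value outside the histogram range")
    width = (hi - lo) / bins
    # Clamp so that the maximum falls into the last bin instead of a new one.
    return min(int((value - lo) / width), bins - 1)


width = (244 - 24) / 50
print(width, bin_index(24), bin_index(100), bin_index(244))
```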
Value              Count  Frequency (%)
71                 4      4.0%
37                 4      4.0%
78                 3      3.0%
51                 3      3.0%
41                 3      3.0%
44                 3      3.0%
60                 3      3.0%
80                 2      2.0%
139                2      2.0%
38                 2      2.0%
Other values (60)  71     71.0%

Value  Count  Frequency (%)
24     1      1.0%
26     2      2.0%
30     2      2.0%
31     1      1.0%
32     1      1.0%
33     1      1.0%
34     2      2.0%
36     1      1.0%
37     4      4.0%
38     2      2.0%

Value  Count  Frequency (%)
244    1      1.0%
234    1      1.0%
229    1      1.0%
200    1      1.0%
196    1      1.0%
189    1      1.0%
186    1      1.0%
180    1      1.0%
179    1      1.0%
156    1      1.0%

Correlations

                                Social_Data_Collection_Date_YM  News_KEY_W  Keyword_FQ
Social_Data_Collection_Date_YM  1.000                           0.000       0.419
News_KEY_W                      0.000                           1.000       0.411
Keyword_FQ                      0.419                           0.411       1.000

            Keyword_FQ  News_KEY_W
Keyword_FQ  1.000       0.150
News_KEY_W  0.150       1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
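
Both views are derived from the same information: whether each cell holds a value. A minimal standard-library sketch, assuming rows are dictionaries and a missing cell is `None`. The helper names and the two example rows are hypothetical; the profiled dataset itself has no missing cells.

```python
def nullity_by_column(rows):
    """Number of missing (None) cells in each column."""
    columns = list(rows[0].keys()) if rows else []
    return {col: sum(row[col] is None for row in rows) for col in columns}


def nullity_matrix(rows):
    """One list of booleans per row; True means the cell is present."""
    return [[value is not None for value in row.values()] for row in rows]


# Hypothetical rows with one missing cell, for illustration only.
example = [
    {"News_KEY_W": "korea", "Keyword_FQ": 78},
    {"News_KEY_W": "song", "Keyword_FQ": None},
]
per_column = nullity_by_column(example)
matrix = nullity_matrix(example)
print(per_column)
print(matrix)
```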

Sample

    Social_Data_Collection_Date_YM  Country_CD  Collection_CH_NM  News_KEY_W  Keyword_FQ
0   2017-01                         gb          NEWS              korea       78
1   2017-01                         gb          NEWS              song        70
2   2017-01                         gb          NEWS              korean      69
3   2017-01                         gb          NEWS              music       49
4   2017-01                         gb          NEWS              kpop        43
5   2017-01                         gb          NEWS              redvelvet   40
6   2017-01                         gb          NEWS              album       34
7   2017-01                         gb          NEWS              pop         32
8   2017-01                         gb          NEWS              fans        30
9   2017-01                         gb          NEWS              food        26

    Social_Data_Collection_Date_YM  Country_CD  Collection_CH_NM  News_KEY_W  Keyword_FQ
90  2017-08                         gb          NEWS              redvelvet   41
91  2017-08                         gb          NEWS              billboard   39
92  2017-09                         gb          NEWS              korean      180
93  2017-09                         gb          NEWS              korea       139
94  2017-09                         gb          NEWS              music       94
95  2017-09                         gb          NEWS              song        71
96  2017-09                         gb          NEWS              bts         60
97  2017-09                         gb          NEWS              album       58
98  2017-09                         gb          NEWS              kpop        50
99  2017-09                         gb          NEWS              billboard   45