Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.8 KiB
Average record size in memory59.3 B

Variable types

Categorical5
Text1
Numeric1

Alerts

Country_CD has constant value ""Constant
FILE_NAME has constant value ""Constant
BASE_YMD has constant value ""Constant
Survey_Base_Date_DE is highly imbalanced (91.9%)Imbalance

Reproduction

Analysis started2023-12-10 10:11:58.002887
Analysis finished2023-12-10 10:11:58.830548
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Survey_Base_Date_DE
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-01-07
99 
2020-01-08
 
1

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row2020-01-07
2nd row2020-01-07
3rd row2020-01-07
4th row2020-01-07
5th row2020-01-07

Common Values

ValueCountFrequency (%)
2020-01-07 99
99.0%
2020-01-08 1
 
1.0%

Length

2023-12-10T19:11:58.935135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:11:59.104740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-01-07 99
99.0%
2020-01-08 1
 
1.0%

Country_CD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
vn
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowvn
2nd rowvn
3rd rowvn
4th rowvn
5th rowvn

Common Values

ValueCountFrequency (%)
vn 100
100.0%

Length

2023-12-10T19:11:59.325098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:11:59.493802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vn 100
100.0%

Music_NM
Categorical

Distinct43
Distinct (%)43.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
BTS
24 
BIGBANG
BLACKPINK
Red Velvet
 
4
IU
 
4
Other values (38)
52 

Length

Max length29
Median length19
Mean length6.86
Min length2

Unique

Unique27 ?
Unique (%)27.0%

Sample

1st rowYounha
2nd rowRed Velvet
3rd rowATEEZ
4th rowNg?c Dolil
5th rowCHANYEOL & Punch

Common Values

ValueCountFrequency (%)
BTS 24
24.0%
BIGBANG 8
 
8.0%
BLACKPINK 8
 
8.0%
Red Velvet 4
 
4.0%
IU 4
 
4.0%
TAEYEON 4
 
4.0%
WINNER 3
 
3.0%
iKON 2
 
2.0%
EXO 2
 
2.0%
BOL4 2
 
2.0%
Other values (33) 39
39.0%

Length

2023-12-10T19:11:59.677745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
bts 26
 
19.0%
blackpink 9
 
6.6%
bigbang 8
 
5.8%
5
 
3.6%
red 4
 
2.9%
velvet 4
 
2.9%
iu 4
 
2.9%
taeyeon 4
 
2.9%
winner 3
 
2.2%
momoland 2
 
1.5%
Other values (55) 68
49.6%
Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:12:00.292205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length108
Median length33
Mean length16.19
Min length3

Characters and Unicode

Total characters1619
Distinct characters77
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st rowWinter Flower (feat. RM)
2nd rowPsycho
3rd rowAnswer
4th rowCung Anh - Ngoc Dolil, Hagi, STee
5th rowStay With Me
ValueCountFrequency (%)
feat 12
 
4.0%
10
 
3.3%
you 9
 
3.0%
love 8
 
2.6%
it 4
 
1.3%
me 4
 
1.3%
i 4
 
1.3%
really 3
 
1.0%
with 3
 
1.0%
bang 3
 
1.0%
Other values (203) 242
80.1%
2023-12-10T19:12:01.310856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
202
 
12.5%
e 92
 
5.7%
o 82
 
5.1%
i 76
 
4.7%
a 74
 
4.6%
t 62
 
3.8%
r 55
 
3.4%
L 46
 
2.8%
n 45
 
2.8%
s 41
 
2.5%
Other values (67) 844
52.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 804
49.7%
Uppercase Letter 490
30.3%
Space Separator 202
 
12.5%
Other Punctuation 42
 
2.6%
Open Punctuation 23
 
1.4%
Close Punctuation 22
 
1.4%
Decimal Number 22
 
1.4%
Dash Punctuation 6
 
0.4%
Other Letter 5
 
0.3%
Math Symbol 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 92
11.4%
o 82
10.2%
i 76
 
9.5%
a 74
 
9.2%
t 62
 
7.7%
r 55
 
6.8%
n 45
 
5.6%
s 41
 
5.1%
l 40
 
5.0%
u 38
 
4.7%
Other values (14) 199
24.8%
Uppercase Letter
ValueCountFrequency (%)
L 46
 
9.4%
A 40
 
8.2%
O 39
 
8.0%
S 38
 
7.8%
E 28
 
5.7%
I 27
 
5.5%
T 26
 
5.3%
D 26
 
5.3%
N 24
 
4.9%
B 22
 
4.5%
Other values (14) 174
35.5%
Decimal Number
ValueCountFrequency (%)
2 9
40.9%
1 3
 
13.6%
0 3
 
13.6%
4 2
 
9.1%
6 1
 
4.5%
5 1
 
4.5%
9 1
 
4.5%
7 1
 
4.5%
8 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 17
40.5%
' 6
 
14.3%
? 5
 
11.9%
, 5
 
11.9%
: 4
 
9.5%
& 3
 
7.1%
/ 2
 
4.8%
Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 18
78.3%
[ 5
 
21.7%
Close Punctuation
ValueCountFrequency (%)
) 17
77.3%
] 5
 
22.7%
Space Separator
ValueCountFrequency (%)
202
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1294
79.9%
Common 320
 
19.8%
Hangul 3
 
0.2%
Han 2
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 92
 
7.1%
o 82
 
6.3%
i 76
 
5.9%
a 74
 
5.7%
t 62
 
4.8%
r 55
 
4.3%
L 46
 
3.6%
n 45
 
3.5%
s 41
 
3.2%
A 40
 
3.1%
Other values (38) 681
52.6%
Common
ValueCountFrequency (%)
202
63.1%
( 18
 
5.6%
. 17
 
5.3%
) 17
 
5.3%
2 9
 
2.8%
' 6
 
1.9%
- 6
 
1.9%
] 5
 
1.6%
? 5
 
1.6%
, 5
 
1.6%
Other values (14) 30
 
9.4%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1613
99.6%
Hangul 3
 
0.2%
CJK 2
 
0.1%
Punctuation 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
202
 
12.5%
e 92
 
5.7%
o 82
 
5.1%
i 76
 
4.7%
a 74
 
4.6%
t 62
 
3.8%
r 55
 
3.4%
L 46
 
2.9%
n 45
 
2.8%
s 41
 
2.5%
Other values (61) 838
52.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Punctuation
ValueCountFrequency (%)
1
100.0%

Music_RN
Real number (ℝ)

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.99
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:12:01.554166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.95
Q124.75
median49.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)50.5

Descriptive statistics

Standard deviation29.430107
Coefficient of variation (CV)0.58871989
Kurtosis-1.2211691
Mean49.99
Median Absolute Deviation (MAD)25.5
Skewness0.004009349
Sum4999
Variance866.13121
MonotonicityNot monotonic
2023-12-10T19:12:01.801564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 2
 
2.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
1 2
2.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

FILE_NAME
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
KC_music_chart_vn_2019
100 

Length

Max length22
Median length22
Mean length22
Min length22

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKC_music_chart_vn_2019
2nd rowKC_music_chart_vn_2019
3rd rowKC_music_chart_vn_2019
4th rowKC_music_chart_vn_2019
5th rowKC_music_chart_vn_2019

Common Values

ValueCountFrequency (%)
KC_music_chart_vn_2019 100
100.0%

Length

2023-12-10T19:12:01.989967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:12:02.146011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kc_music_chart_vn_2019 100
100.0%

BASE_YMD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 100
100.0%

Length

2023-12-10T19:12:02.316584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:12:02.575928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 100
100.0%

Interactions

2023-12-10T19:11:58.364104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:12:03.141555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Survey_Base_Date_DEMusic_NMArtist_NMMusic_RN
Survey_Base_Date_DE1.0000.4290.0000.000
Music_NM0.4291.0001.0000.547
Artist_NM0.0001.0001.0001.000
Music_RN0.0000.5471.0001.000
2023-12-10T19:12:03.361302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Survey_Base_Date_DEMusic_NM
Survey_Base_Date_DE1.0000.267
Music_NM0.2671.000
2023-12-10T19:12:03.530165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Music_RNSurvey_Base_Date_DEMusic_NM
Music_RN1.0000.0000.163
Survey_Base_Date_DE0.0001.0000.267
Music_NM0.1630.2671.000

Missing values

2023-12-10T19:11:58.545359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:11:58.752543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Survey_Base_Date_DECountry_CDMusic_NMArtist_NMMusic_RNFILE_NAMEBASE_YMD
02020-01-07vnYounhaWinter Flower (feat. RM)1KC_music_chart_vn_20192019
12020-01-07vnRed VelvetPsycho2KC_music_chart_vn_20192019
22020-01-07vnATEEZAnswer3KC_music_chart_vn_20192019
32020-01-07vnNg?c DolilCung Anh - Ngoc Dolil, Hagi, STee4KC_music_chart_vn_20192019
42020-01-07vnCHANYEOL & PunchStay With Me5KC_music_chart_vn_20192019
52020-01-07vnG-DRAGONUntitled, 20146KC_music_chart_vn_20192019
62020-01-07vnWe Are The NightTiramisu Cake7KC_music_chart_vn_20192019
72020-01-07vnTOMORROW X TOGETHERCROWN8KC_music_chart_vn_20192019
82020-01-07vnThe HiddenSay I Love You9KC_music_chart_vn_20192019
92020-01-07vnDamSoNe GongBangLoving You With All Of My Heart10KC_music_chart_vn_20192019
Survey_Base_Date_DECountry_CDMusic_NMArtist_NMMusic_RNFILE_NAMEBASE_YMD
902020-01-07vnHEIZEWe don't talk together (feat. Giriboy) [Prod. SUGA]92KC_music_chart_vn_20192019
912020-01-07vnBTSLights93KC_music_chart_vn_20192019
922020-01-07vnBLACKPINKReally94KC_music_chart_vn_20192019
932020-01-07vnBTSJamais vu95KC_music_chart_vn_20192019
942020-01-07vnEXOObsession96KC_music_chart_vn_20192019
952020-01-07vnCHUNG HARoller Coaster97KC_music_chart_vn_20192019
962020-01-07vnEXOKo Ko Bop98KC_music_chart_vn_20192019
972020-01-07vnBTSSingularity99KC_music_chart_vn_20192019
982020-01-07vnBTSEpilogue: Young Forever100KC_music_chart_vn_20192019
992020-01-08vnYounhaWinter Flower (feat. RM)1KC_music_chart_vn_20192019