Overview

Dataset statistics

Number of variables10
Number of observations97
Missing cells735
Missing cells (%)75.8%
Duplicate rows7
Duplicate rows (%)7.2%
Total size in memory8.2 KiB
Average record size in memory86.4 B

Variable types

Text4
Categorical1
Numeric1
Unsupported4

Dataset

Description샘플 데이터
AuthorMBN
URLhttps://kdx.kr/data/view/26945

Alerts

Dataset has 7 (7.2%) duplicate rowsDuplicates
STD_YEAR is highly overall correlated with MDA_CGR_NMHigh correlation
MDA_CGR_NM is highly overall correlated with STD_YEARHigh correlation
MDA_CGR_NM is highly imbalanced (58.3%)Imbalance
MBN_MDA_SP_CD has 19 (19.6%) missing valuesMissing
MDA_ART_ESSN_NO has 77 (79.4%) missing valuesMissing
STD_YEAR has 77 (79.4%) missing valuesMissing
ART_SJ_CN has 87 (89.7%) missing valuesMissing
ART_CN has 87 (89.7%) missing valuesMissing
ATCH_IMG_NM has 97 (100.0%) missing valuesMissing
JRNL_NM has 97 (100.0%) missing valuesMissing
WRT_DATE has 97 (100.0%) missing valuesMissing
Unnamed: 9 has 97 (100.0%) missing valuesMissing
ATCH_IMG_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
JRNL_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
WRT_DATE is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 20:37:18.111985
Analysis finished2023-12-11 20:37:19.973009
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

MBN_MDA_SP_CD
Text

MISSING 

Distinct58
Distinct (%)74.4%
Missing19
Missing (%)19.6%
Memory size908.0 B
2023-12-12T05:37:20.167268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length206
Median length103
Mean length72.666667
Min length3

Characters and Unicode

Total characters5668
Distinct characters495
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)61.5%

Sample

1st rowMBN
2nd row'VIP' 이상윤과 '의사요한' 이세영이 ‘2019 SBS 연기대상’에서 미니시리즈 부문 우수 연기상을 수상했다.
3rd row31일 오후 8시 55분 서울시 마포구 상암동 SBS프리즘타워에서는 신동엽, 장나라의 진행으로 ‘2019 SBS 연기대상’ 시상식이 열렸다.
4th row이날 이상윤은 "촬영을 하면서도, 방송을 할 때도 신기한 경험을 많이 해서 그것 만으로도 감사한 작품이었다. 저 때문에 화가 나신 시청자분들께 죄송하고, 작품을 하면서 바람은 피지 말아야겠다는 생각을 하게 됐다"라고 말해 웃음을 자아냈다. 이어 "다른 결의 인물을 연기할 수 있게 해주신 감독님, 작가님에게도 감사하고, 함께 촬영을 한 배우들에게 고맙다"라고 덧붙였다.
5th row이어 이세영은 "너무 큰 상 주셔서 감사하고, 부끄럽다. 현장에 갈 때마다 제가 밥값을 잘하고 있는지 힘들었는데 감독님, 배우들이 이끌어줘서 잘 마칠 수 있었다. 더운 여름에 더 덥게 고생하신 스태프분들 고생 많으셨고, 함께할 수 있어서 행복한 시간이었다"라고 울컥하는 모습을 보였다.
ValueCountFrequency (%)
‘2019 16
 
1.3%
sbs 14
 
1.1%
mbn 12
 
1.0%
31일 10
 
0.8%
kbs 9
 
0.7%
매일경제 8
 
0.6%
8
 
0.6%
7
 
0.6%
연기대상’에서 7
 
0.6%
오후 7
 
0.6%
Other values (782) 1133
92.0%
2023-12-12T05:37:20.554723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1181
 
20.8%
146
 
2.6%
. 122
 
2.2%
115
 
2.0%
, 70
 
1.2%
68
 
1.2%
67
 
1.2%
64
 
1.1%
62
 
1.1%
59
 
1.0%
Other values (485) 3714
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3608
63.7%
Space Separator 1181
 
20.8%
Other Punctuation 263
 
4.6%
Uppercase Letter 155
 
2.7%
Decimal Number 137
 
2.4%
Lowercase Letter 135
 
2.4%
Final Punctuation 78
 
1.4%
Initial Punctuation 78
 
1.4%
Dash Punctuation 10
 
0.2%
Open Punctuation 10
 
0.2%
Other values (2) 13
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
4.0%
115
 
3.2%
68
 
1.9%
67
 
1.9%
64
 
1.8%
62
 
1.7%
59
 
1.6%
58
 
1.6%
55
 
1.5%
54
 
1.5%
Other values (430) 2860
79.3%
Lowercase Letter
ValueCountFrequency (%)
k 26
19.3%
m 17
12.6%
r 16
11.9%
c 16
11.9%
o 11
8.1%
u 10
 
7.4%
t 7
 
5.2%
e 6
 
4.4%
l 5
 
3.7%
s 4
 
3.0%
Other values (7) 17
12.6%
Uppercase Letter
ValueCountFrequency (%)
S 47
30.3%
B 44
28.4%
M 16
 
10.3%
N 15
 
9.7%
K 9
 
5.8%
V 7
 
4.5%
I 6
 
3.9%
P 6
 
3.9%
E 2
 
1.3%
T 1
 
0.6%
Other values (2) 2
 
1.3%
Decimal Number
ValueCountFrequency (%)
1 40
29.2%
2 23
16.8%
0 19
13.9%
9 18
13.1%
3 15
 
10.9%
8 8
 
5.8%
4 7
 
5.1%
5 6
 
4.4%
7 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 122
46.4%
, 70
26.6%
' 38
 
14.4%
" 22
 
8.4%
@ 8
 
3.0%
& 3
 
1.1%
Final Punctuation
ValueCountFrequency (%)
57
73.1%
21
 
26.9%
Initial Punctuation
ValueCountFrequency (%)
57
73.1%
21
 
26.9%
Open Punctuation
ValueCountFrequency (%)
[ 8
80.0%
( 2
 
20.0%
Close Punctuation
ValueCountFrequency (%)
] 8
80.0%
) 2
 
20.0%
Space Separator
ValueCountFrequency (%)
1181
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3608
63.7%
Common 1770
31.2%
Latin 290
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
4.0%
115
 
3.2%
68
 
1.9%
67
 
1.9%
64
 
1.8%
62
 
1.7%
59
 
1.6%
58
 
1.6%
55
 
1.5%
54
 
1.5%
Other values (430) 2860
79.3%
Latin
ValueCountFrequency (%)
S 47
16.2%
B 44
15.2%
k 26
 
9.0%
m 17
 
5.9%
M 16
 
5.5%
r 16
 
5.5%
c 16
 
5.5%
N 15
 
5.2%
o 11
 
3.8%
u 10
 
3.4%
Other values (19) 72
24.8%
Common
ValueCountFrequency (%)
1181
66.7%
. 122
 
6.9%
, 70
 
4.0%
57
 
3.2%
57
 
3.2%
1 40
 
2.3%
' 38
 
2.1%
2 23
 
1.3%
" 22
 
1.2%
21
 
1.2%
Other values (16) 139
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3608
63.7%
ASCII 1901
33.5%
Punctuation 156
 
2.8%
Enclosed Alphanum 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1181
62.1%
. 122
 
6.4%
, 70
 
3.7%
S 47
 
2.5%
B 44
 
2.3%
1 40
 
2.1%
' 38
 
2.0%
k 26
 
1.4%
2 23
 
1.2%
" 22
 
1.2%
Other values (40) 288
 
15.1%
Hangul
ValueCountFrequency (%)
146
 
4.0%
115
 
3.2%
68
 
1.9%
67
 
1.9%
64
 
1.8%
62
 
1.7%
59
 
1.6%
58
 
1.6%
55
 
1.5%
54
 
1.5%
Other values (430) 2860
79.3%
Punctuation
ValueCountFrequency (%)
57
36.5%
57
36.5%
21
 
13.5%
21
 
13.5%
Enclosed Alphanum
ValueCountFrequency (%)
3
100.0%

MDA_ART_ESSN_NO
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing77
Missing (%)79.4%
Memory size908.0 B
2023-12-12T05:37:20.732980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length41.5
Mean length41.5
Min length7

Characters and Unicode

Total characters830
Distinct characters35
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row4023071
2nd rowhttp://img.mbn.co.kr/filewww/news/other/2020/01/01/010101000010.jpg,,,,,,,,,
3rd row4023072
4th rowhttp://img.mbn.co.kr/filewww/news/other/2020/01/01/112200001100.jpg,,,,,,,,,
5th row4023073
ValueCountFrequency (%)
4023071 1
 
5.0%
http://img.mbn.co.kr/filewww/news/other/2020/01/01/010101000010.jpg 1
 
5.0%
4023081 1
 
5.0%
http://img.mbn.co.kr/filewww/news/other/2020/01/01/000922022211.jpg 1
 
5.0%
4023080 1
 
5.0%
http://img.mbn.co.kr/filewww/news/other/2020/01/01/300020000003.jpg 1
 
5.0%
4023079 1
 
5.0%
http://img.mbn.co.kr/filewww/news/other/2020/01/01/121000001111.jpg 1
 
5.0%
4023078 1
 
5.0%
http://img.mbn.co.kr/filewww/news/other/2020/01/01/100212000010.jpg 1
 
5.0%
Other values (10) 10
50.0%
2023-12-12T05:37:21.030349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 125
15.1%
/ 90
 
10.8%
, 90
 
10.8%
2 49
 
5.9%
1 49
 
5.9%
w 40
 
4.8%
. 40
 
4.8%
e 30
 
3.6%
t 30
 
3.6%
m 20
 
2.4%
Other values (25) 267
32.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 327
39.4%
Decimal Number 270
32.5%
Other Punctuation 230
27.7%
Uppercase Letter 3
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 40
12.2%
e 30
 
9.2%
t 30
 
9.2%
m 20
 
6.1%
o 20
 
6.1%
n 20
 
6.1%
r 20
 
6.1%
i 20
 
6.1%
h 20
 
6.1%
g 19
 
5.8%
Other values (8) 88
26.9%
Decimal Number
ValueCountFrequency (%)
0 125
46.3%
2 49
 
18.1%
1 49
 
18.1%
3 15
 
5.6%
4 12
 
4.4%
7 8
 
3.0%
9 7
 
2.6%
8 3
 
1.1%
5 1
 
0.4%
6 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
/ 90
39.1%
, 90
39.1%
. 40
17.4%
: 10
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
J 1
33.3%
P 1
33.3%
G 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 500
60.2%
Latin 330
39.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 40
12.1%
e 30
 
9.1%
t 30
 
9.1%
m 20
 
6.1%
o 20
 
6.1%
n 20
 
6.1%
r 20
 
6.1%
i 20
 
6.1%
h 20
 
6.1%
g 19
 
5.8%
Other values (11) 91
27.6%
Common
ValueCountFrequency (%)
0 125
25.0%
/ 90
18.0%
, 90
18.0%
2 49
 
9.8%
1 49
 
9.8%
. 40
 
8.0%
3 15
 
3.0%
4 12
 
2.4%
: 10
 
2.0%
7 8
 
1.6%
Other values (4) 12
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 830
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 125
15.1%
/ 90
 
10.8%
, 90
 
10.8%
2 49
 
5.9%
1 49
 
5.9%
w 40
 
4.8%
. 40
 
4.8%
e 30
 
3.6%
t 30
 
3.6%
m 20
 
2.4%
Other values (25) 267
32.2%

MDA_CGR_NM
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size908.0 B
<NA>
77 
mbn00012
10 
양소영
 
3
이다겸
 
2
서지경
 
2
Other values (2)
 
3

Length

Max length8
Median length4
Mean length4.3092784
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row<NA>
2nd rowmbn00012
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 77
79.4%
mbn00012 10
 
10.3%
양소영 3
 
3.1%
이다겸 2
 
2.1%
서지경 2
 
2.1%
안하나 2
 
2.1%
신미래 1
 
1.0%

Length

2023-12-12T05:37:21.142305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:37:21.225164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 77
79.4%
mbn00012 10
 
10.3%
양소영 3
 
3.1%
이다겸 2
 
2.1%
서지경 2
 
2.1%
안하나 2
 
2.1%
신미래 1
 
1.0%

STD_YEAR
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9
Distinct (%)45.0%
Missing77
Missing (%)79.4%
Infinite0
Infinite (%)0.0%
Mean1.0100051 × 1013
Minimum2020
Maximum2.0200101 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1005.0 B
2023-12-12T05:37:21.302709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2020
5-th percentile2020
Q12020
median1.0100051 × 1013
Q32.0200101 × 1013
95-th percentile2.0200101 × 1013
Maximum2.0200101 × 1013
Range2.0200101 × 1013
Interquartile range (IQR)2.0200101 × 1013

Descriptive statistics

Standard deviation1.0362433 × 1013
Coefficient of variation (CV)1.0259784
Kurtosis-2.2352941
Mean1.0100051 × 1013
Median Absolute Deviation (MAD)1.010005 × 1013
Skewness1.5883943 × 10-17
Sum2.0200101 × 1014
Variance1.0738002 × 1026
MonotonicityNot monotonic
2023-12-12T05:37:21.390747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2020 10
 
10.3%
20200101000138 2
 
2.1%
20200101001137 2
 
2.1%
20200101000207 1
 
1.0%
20200101000237 1
 
1.0%
20200101000639 1
 
1.0%
20200101000708 1
 
1.0%
20200101001307 1
 
1.0%
20200101001437 1
 
1.0%
(Missing) 77
79.4%
ValueCountFrequency (%)
2020 10
10.3%
20200101000138 2
 
2.1%
20200101000207 1
 
1.0%
20200101000237 1
 
1.0%
20200101000639 1
 
1.0%
20200101000708 1
 
1.0%
20200101001137 2
 
2.1%
20200101001307 1
 
1.0%
20200101001437 1
 
1.0%
ValueCountFrequency (%)
20200101001437 1
 
1.0%
20200101001307 1
 
1.0%
20200101001137 2
 
2.1%
20200101000708 1
 
1.0%
20200101000639 1
 
1.0%
20200101000237 1
 
1.0%
20200101000207 1
 
1.0%
20200101000138 2
 
2.1%
2020 10
10.3%

ART_SJ_CN
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing87
Missing (%)89.7%
Memory size908.0 B
2023-12-12T05:37:21.579680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length36.4
Min length16

Characters and Unicode

Total characters364
Distinct characters141
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row이상윤X이세영, 미니시리즈 우수연기상 “밥값 하는지 몰라 힘들었다”[2019 SBS 연기대상]
2nd row김성균X한예리, 중편드라마 우수연기상 “심장 터질 것 같아”[2019 SBS 연기대상]
3rd row김명수 김세정, 한류스타상 "한류스타들 아프지 않고 건강하길"[KBS 연기대상]
4th row신혜선 "강하늘, 동기의 자랑…얼굴 찌푸린 걸 본 적 없다"[KBS 연기대상]
5th row`보이스퀸-스페셜` 3라운드 조2위 늴리리맘마...눈물바다 만든 진한 여운
ValueCountFrequency (%)
연기대상 5
 
7.4%
보이스퀸-스페셜 2
 
2.9%
우수연기상 2
 
2.9%
3라운드 2
 
2.9%
sbs 2
 
2.9%
1위 1
 
1.5%
펭수 1
 
1.5%
무대 1
 
1.5%
판소리 1
 
1.5%
소리퀸즈...압도적인 1
 
1.5%
Other values (50) 50
73.5%
2023-12-12T05:37:21.895004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
15.9%
14
 
3.8%
9
 
2.5%
8
 
2.2%
, 8
 
2.2%
7
 
1.9%
7
 
1.9%
S 7
 
1.9%
7
 
1.9%
6
 
1.6%
Other values (131) 233
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 230
63.2%
Space Separator 58
 
15.9%
Other Punctuation 19
 
5.2%
Uppercase Letter 17
 
4.7%
Decimal Number 12
 
3.3%
Close Punctuation 6
 
1.6%
Open Punctuation 6
 
1.6%
Modifier Symbol 4
 
1.1%
Dash Punctuation 3
 
0.8%
Initial Punctuation 3
 
0.8%
Other values (3) 6
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
6.1%
9
 
3.9%
8
 
3.5%
7
 
3.0%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
4
 
1.7%
Other values (107) 157
68.3%
Decimal Number
ValueCountFrequency (%)
2 3
25.0%
1 3
25.0%
3 2
16.7%
0 2
16.7%
9 2
16.7%
Other Punctuation
ValueCountFrequency (%)
, 8
42.1%
. 6
31.6%
" 4
21.1%
1
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
S 7
41.2%
B 5
29.4%
K 3
17.6%
X 2
 
11.8%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
58
100.0%
Close Punctuation
ValueCountFrequency (%)
] 6
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 230
63.2%
Common 117
32.1%
Latin 17
 
4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
6.1%
9
 
3.9%
8
 
3.5%
7
 
3.0%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
4
 
1.7%
Other values (107) 157
68.3%
Common
ValueCountFrequency (%)
58
49.6%
, 8
 
6.8%
. 6
 
5.1%
] 6
 
5.1%
[ 6
 
5.1%
` 4
 
3.4%
" 4
 
3.4%
- 3
 
2.6%
2 3
 
2.6%
1 3
 
2.6%
Other values (10) 16
 
13.7%
Latin
ValueCountFrequency (%)
S 7
41.2%
B 5
29.4%
K 3
17.6%
X 2
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 230
63.2%
ASCII 124
34.1%
Punctuation 7
 
1.9%
Misc Symbols 2
 
0.5%
Arrows 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
58
46.8%
, 8
 
6.5%
S 7
 
5.6%
. 6
 
4.8%
] 6
 
4.8%
[ 6
 
4.8%
B 5
 
4.0%
` 4
 
3.2%
" 4
 
3.2%
- 3
 
2.4%
Other values (7) 17
 
13.7%
Hangul
ValueCountFrequency (%)
14
 
6.1%
9
 
3.9%
8
 
3.5%
7
 
3.0%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
4
 
1.7%
Other values (107) 157
68.3%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
Arrows
ValueCountFrequency (%)
1
100.0%

ART_CN
Text

MISSING 

Distinct5
Distinct (%)50.0%
Missing87
Missing (%)89.7%
Memory size908.0 B
2023-12-12T05:37:22.043067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length90
Median length86
Mean length56.1
Min length41

Characters and Unicode

Total characters561
Distinct characters74
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)30.0%

Sample

1st row<!------------ PHOTO_POS_0 ------------> [매일경제 스타투데이 이다겸 기자]
2nd row<!------------ PHOTO_POS_0 ------------> [매일경제 스타투데이 이다겸 기자]
3rd row<!------------ PHOTO_POS_0 ------------>
4th row<!------------ PHOTO_POS_0 ------------>
5th row<!------------ PHOTO_POS_0 ------------>
ValueCountFrequency (%)
20
32.8%
photo_pos_0 10
16.4%
수상했다 2
 
3.3%
연기대상’에서 2
 
3.3%
sbs 2
 
3.3%
‘2019 2
 
3.3%
배우 2
 
3.3%
기자 2
 
3.3%
이다겸 2
 
3.3%
매일경제 2
 
3.3%
Other values (14) 15
24.6%
2023-12-12T05:37:22.466739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 240
42.8%
57
 
10.2%
O 30
 
5.3%
P 20
 
3.6%
_ 20
 
3.6%
S 14
 
2.5%
0 12
 
2.1%
< 10
 
1.8%
H 10
 
1.8%
T 10
 
1.8%
Other values (64) 138
24.6%

Most occurring categories

ValueCountFrequency (%)
Dash Punctuation 240
42.8%
Other Letter 98
17.5%
Uppercase Letter 86
 
15.3%
Space Separator 57
 
10.2%
Connector Punctuation 20
 
3.6%
Math Symbol 20
 
3.6%
Decimal Number 18
 
3.2%
Other Punctuation 14
 
2.5%
Open Punctuation 2
 
0.4%
Initial Punctuation 2
 
0.4%
Other values (2) 4
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
5
 
5.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
Other values (42) 56
57.1%
Uppercase Letter
ValueCountFrequency (%)
O 30
34.9%
P 20
23.3%
S 14
16.3%
H 10
 
11.6%
T 10
 
11.6%
B 2
 
2.3%
Decimal Number
ValueCountFrequency (%)
0 12
66.7%
2 2
 
11.1%
1 2
 
11.1%
9 2
 
11.1%
Other Punctuation
ValueCountFrequency (%)
! 10
71.4%
. 3
 
21.4%
, 1
 
7.1%
Math Symbol
ValueCountFrequency (%)
< 10
50.0%
> 10
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 240
100.0%
Space Separator
ValueCountFrequency (%)
57
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 20
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
] 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 377
67.2%
Hangul 98
 
17.5%
Latin 86
 
15.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
5
 
5.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
Other values (42) 56
57.1%
Common
ValueCountFrequency (%)
- 240
63.7%
57
 
15.1%
_ 20
 
5.3%
0 12
 
3.2%
< 10
 
2.7%
> 10
 
2.7%
! 10
 
2.7%
. 3
 
0.8%
[ 2
 
0.5%
2
 
0.5%
Other values (6) 11
 
2.9%
Latin
ValueCountFrequency (%)
O 30
34.9%
P 20
23.3%
S 14
16.3%
H 10
 
11.6%
T 10
 
11.6%
B 2
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 459
81.8%
Hangul 98
 
17.5%
Punctuation 4
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 240
52.3%
57
 
12.4%
O 30
 
6.5%
P 20
 
4.4%
_ 20
 
4.4%
S 14
 
3.1%
0 12
 
2.6%
< 10
 
2.2%
H 10
 
2.2%
T 10
 
2.2%
Other values (10) 36
 
7.8%
Hangul
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
5
 
5.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
Other values (42) 56
57.1%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%

ATCH_IMG_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing97
Missing (%)100.0%
Memory size1005.0 B

JRNL_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing97
Missing (%)100.0%
Memory size1005.0 B

WRT_DATE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing97
Missing (%)100.0%
Memory size1005.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing97
Missing (%)100.0%
Memory size1005.0 B

Interactions

2023-12-12T05:37:19.518610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T05:37:22.544236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
MBN_MDA_SP_CDMDA_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CN
MBN_MDA_SP_CD1.0001.0001.000NaNNaNNaN
MDA_ART_ESSN_NO1.0001.0001.000NaN1.0001.000
MDA_CGR_NM1.0001.0001.000NaNNaNNaN
STD_YEARNaNNaNNaN1.000NaNNaN
ART_SJ_CNNaN1.000NaNNaN1.0001.000
ART_CNNaN1.000NaNNaN1.0001.000
2023-12-12T05:37:22.633638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
STD_YEARMDA_CGR_NM
STD_YEAR1.0000.882
MDA_CGR_NM0.8821.000

Missing values

2023-12-12T05:37:19.672041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T05:37:19.802608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T05:37:19.905587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

MBN_MDA_SP_CDMDA_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CNATCH_IMG_NMJRNL_NMWRT_DATEUnnamed: 9
0<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1MBN4023071mbn000122020이상윤X이세영, 미니시리즈 우수연기상 “밥값 하는지 몰라 힘들었다”[2019 SBS 연기대상]<!------------ PHOTO_POS_0 ------------> [매일경제 스타투데이 이다겸 기자]<NA><NA><NA><NA>
2'VIP' 이상윤과 '의사요한' 이세영이 ‘2019 SBS 연기대상’에서 미니시리즈 부문 우수 연기상을 수상했다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
331일 오후 8시 55분 서울시 마포구 상암동 SBS프리즘타워에서는 신동엽, 장나라의 진행으로 ‘2019 SBS 연기대상’ 시상식이 열렸다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
4이날 이상윤은 "촬영을 하면서도, 방송을 할 때도 신기한 경험을 많이 해서 그것 만으로도 감사한 작품이었다. 저 때문에 화가 나신 시청자분들께 죄송하고, 작품을 하면서 바람은 피지 말아야겠다는 생각을 하게 됐다"라고 말해 웃음을 자아냈다. 이어 "다른 결의 인물을 연기할 수 있게 해주신 감독님, 작가님에게도 감사하고, 함께 촬영을 한 배우들에게 고맙다"라고 덧붙였다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
5이어 이세영은 "너무 큰 상 주셔서 감사하고, 부끄럽다. 현장에 갈 때마다 제가 밥값을 잘하고 있는지 힘들었는데 감독님, 배우들이 이끌어줘서 잘 마칠 수 있었다. 더운 여름에 더 덥게 고생하신 스태프분들 고생 많으셨고, 함께할 수 있어서 행복한 시간이었다"라고 울컥하는 모습을 보였다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
6한편 ‘2019 SBS 연기대상’은 ‘열혈사제’, ‘배가본드’, ‘스토브리그’, ‘VIP’, ‘의사요한’, ‘녹두꽃’, ‘시크릿 부티크’ 등 올해를 빛낸 SBS 드라마를 총 결산 하는 자리다. SBS에서 생중계된다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
7trdk0114@mk.co.krhttp://img.mbn.co.kr/filewww/news/other/2020/01/01/010101000010.jpg,,,,,,,,,이다겸20200101000138<NA><NA><NA><NA><NA><NA>
8MBN4023072mbn000122020김성균X한예리, 중편드라마 우수연기상 “심장 터질 것 같아”[2019 SBS 연기대상]<!------------ PHOTO_POS_0 ------------> [매일경제 스타투데이 이다겸 기자]<NA><NA><NA><NA>
9'열혈사제' 김성균과 '녹두꽃' 한예리가 ‘2019 SBS 연기대상’에서 중편드라마 부문 우수연기상을 수상했다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
MBN_MDA_SP_CDMDA_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CNATCH_IMG_NMJRNL_NMWRT_DATEUnnamed: 9
87김소현 역시 “사극치고 뽀뽀신이 많았다. 사극인데 왜 이렇게 많냐고 하더라. 저희는 새로운 시대를 열어가는 커플이었다”고 너스레를 떨었다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
88<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
89skyb1842@mk.co.krhttp://img.mbn.co.kr/filewww/news/other/2020/01/01/000922022211.jpg,,,,,,,,,양소영20200101001307<NA><NA><NA><NA><NA><NA>
90MBN4023081mbn000122020정문성, 많이 떨려요 [포토]<!------------ PHOTO_POS_0 ------------> 배우 정문성이 ‘2019 SBS 연기대상’에서 베스트 캐릭터상을 수상했다.<NA><NA><NA><NA>
91‘2019 SBS 연기대상’이 지난해 31일 오후 서울 마포구 상암동 SBS 미디어센터에서 신동엽, 장나라의 사회로 진행됐다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
92<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
93이번 ‘2019 SBS 연기대상’에서 가장 큰 관심사는 대상 수상자에 대한 궁금증이다. 특히 ‘열혈사제’, ‘황후의 품격’, ‘배가본드’, ‘녹두꽃’, ‘의사 요한’, ‘VIP’ 등 시청률과 화제성, 작품성을 인정받으며 시청자들의 마음을 사로잡은 쟁쟁한 작품들이 수두룩한 가운데 영예의 대상을 누가 받게 될지가 초미의 관심사다.<NA><NA><NA><NA><NA><NA><NA><NA><NA>
94<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
95MBN스타 대중문화부 안하나 기자 mkculture@mkculture.comhttp://img.mbn.co.kr/filewww/news/other/2020/01/01/002010010104.JPG,,,,,,,,,안하나20200101001437<NA><NA><NA><NA><NA><NA>
96<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

MBN_MDA_SP_CDMDA_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CN# duplicates
6<NA><NA><NA><NA><NA><NA>19
131일 오후 서울 KBS 여의도홀에서 ‘2019 KBS 연기대상’이 열렸다. 방송인 전현무, 배우 신혜선이 진행을 맡았다.<NA><NA><NA><NA><NA>3
3[매일경제 스타투데이 양소영 기자]<NA><NA><NA><NA><NA>3
031일 오후 8시 55분 서울시 마포구 상암동 SBS프리즘타워에서는 신동엽, 장나라의 진행으로 ‘2019 SBS 연기대상’ 시상식이 열렸다.<NA><NA><NA><NA><NA>2
2[ 매일경제 스타투데이 서지경 객원기자 ]<NA><NA><NA><NA><NA>2
4이번 ‘2019 SBS 연기대상’에서 가장 큰 관심사는 대상 수상자에 대한 궁금증이다. 특히 ‘열혈사제’, ‘황후의 품격’, ‘배가본드’, ‘녹두꽃’, ‘의사 요한’, ‘VIP’ 등 시청률과 화제성, 작품성을 인정받으며 시청자들의 마음을 사로잡은 쟁쟁한 작품들이 수두룩한 가운데 영예의 대상을 누가 받게 될지가 초미의 관심사다.<NA><NA><NA><NA><NA>2
5한편 ‘2019 SBS 연기대상’은 ‘열혈사제’, ‘배가본드’, ‘스토브리그’, ‘VIP’, ‘의사요한’, ‘녹두꽃’, ‘시크릿 부티크’ 등 올해를 빛낸 SBS 드라마를 총 결산 하는 자리다. SBS에서 생중계된다.<NA><NA><NA><NA><NA>2