Overview

Dataset statistics

Number of variables: 6
Number of observations: 52
Missing cells: 242
Missing cells (%): 77.6%
Duplicate rows: 1
Duplicate rows (%): 1.9%
Total size in memory: 2.7 KiB
Average record size in memory: 53.5 B
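
The missing-cell figures can be cross-checked against the per-variable missing counts listed under Alerts. The snippet below is a minimal sketch of that arithmetic; the dictionary and variable names are illustrative and are not part of the report's own tooling.

```python
# Per-variable missing counts, copied from the Alerts section of this report.
missing_by_variable = {
    "RSTRC_VID_ESSN_NO": 22,
    "VID_SJ_CN": 32,
    "VID_CN": 32,
    "REG_DATE": 52,
    "VOD_CRS_NM": 52,
    "Unnamed: 5": 52,
}
n_observations = 52

# Total cells = variables x observations; missing cells = sum over variables.
total_cells = len(missing_by_variable) * n_observations
missing_cells = sum(missing_by_variable.values())
missing_pct = round(100 * missing_cells / total_cells, 1)

print(missing_cells, total_cells, missing_pct)
```

The three all-missing variables account for 156 of the 242 missing cells, which is why the overall percentage is so high.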

Variable types

Text: 3
Unsupported: 3

Dataset

Description: Sample data
Author: MBN
URL: https://kdx.kr/data/view/147

Alerts

Dataset has 1 (1.9%) duplicate row [Duplicates]
RSTRC_VID_ESSN_NO has 22 (42.3%) missing values [Missing]
VID_SJ_CN has 32 (61.5%) missing values [Missing]
VID_CN has 32 (61.5%) missing values [Missing]
REG_DATE has 52 (100.0%) missing values [Missing]
VOD_CRS_NM has 52 (100.0%) missing values [Missing]
Unnamed: 5 has 52 (100.0%) missing values [Missing]
REG_DATE is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
VOD_CRS_NM is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2023-12-11 21:20:47.193462
Analysis finished: 2023-12-11 21:20:47.701369
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

RSTRC_VID_ESSN_NO
Text

MISSING 

Distinct: 30
Distinct (%): 100.0%
Missing: 22
Missing (%): 42.3%
Memory size: 548.0 B

Length

Max length: 45
Median length: 40.5
Mean length: 23.633333
Min length: 7
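
The length statistics summarise the character counts of the non-missing values. The following is a minimal sketch of how such figures can be computed; `length_stats` and the toy column are hypothetical and only illustrate the calculation. It also checks that the reported mean is consistent with the reported totals (709 characters over 30 non-missing values).

```python
from statistics import mean, median

def length_stats(values):
    """Summarise string lengths, skipping missing (None) entries."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
        "total_characters": sum(lengths),
    }

# Hypothetical column: two ID-like strings, one free-text value, one missing value.
column = ["1010583", None, "1010584", "a longer free-text caption"]
stats = length_stats(column)
print(stats)

# Consistency check on the reported figures: total characters / non-missing values.
reported_mean = round(709 / 30, 6)
print(reported_mean)
```

A column that mixes short IDs with long sentences, as this one does, tends to show a median far from the mean.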

Characters and Unicode

Total characters: 709
Distinct characters: 189
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
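
As an illustration of those character properties, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. It is not the profiler's own implementation, and `category_counts` is a hypothetical helper. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Po` is Other Punctuation, `Sm` is Math Symbol, `Ps` and `Pe` are Open and Close Punctuation. The standard library does not expose script or block properties, so those are not covered here.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample strings shown in this report.
sample = "5분 호흡법을 배우는 도중 갑자기 신호(?)가 온 MC 허참!"
counts = category_counts(sample)

# Print categories from most to least frequent.
for category, n in counts.most_common():
    print(category, n)
```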

Unique

Unique: 30
Unique (%): 100.0%

Sample

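1st row: 1010583
2nd row: 몸속 쓰레기 축적으로 각종 질병에 취약해진다고 하는데... [They say a buildup of waste inside the body makes you vulnerable to all kinds of diseases...]
3rd row: 그렇다면 이 쓰레기들을 방치하면 어떻게 될까? [So what happens if this waste is left alone?]
4th row: 1010584
5th row: 이 호흡법을 잘 배우면 몸속 쓰레기를 배출할 수 있다고 하는데~ [They say that learning this breathing method well lets you expel waste from the body~]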
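Value | Count | Frequency (%)
몸속 [inside the body] | 6 | 3.4%
(not recovered) | 5 | 2.9%
하는데 [they say, but] | 4 | 2.3%
무엇일까 [what could it be] | 4 | 2.3%
있는 [that is / that has] | 4 | 2.3%
(not recovered) | 3 | 1.7%
쓰레기의 [of waste] | 2 | 1.1%
쓰레기를 [waste, as object] | 2 | 1.1%
밥상에서 [at the dinner table] | 2 | 1.1%
(not recovered) | 2 | 1.1%
Other values (132) | 140 | 80.5%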

Most occurring characters

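Value | Count | Frequency (%)
(space) | 144 | 20.3%
0 | 25 | 3.5%
(not recovered) | 23 | 3.2%
1 | 22 | 3.1%
(not recovered) | 20 | 2.8%
(not recovered) | 11 | 1.6%
? | 11 | 1.6%
. | 10 | 1.4%
(not recovered) | 9 | 1.3%
(not recovered) | 9 | 1.3%
Other values (179) | 425 | 59.9%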

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 459 | 64.7%
Space Separator | 144 | 20.3%
Decimal Number | 72 | 10.2%
Other Punctuation | 28 | 3.9%
Math Symbol | 4 | 0.6%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

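Other Letter
Value | Count | Frequency (%)
(not recovered) | 23 | 5.0%
(not recovered) | 20 | 4.4%
(not recovered) | 11 | 2.4%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 8 | 1.7%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
Other values (160) | 349 | 76.0%

Decimal Number
Value | Count | Frequency (%)
0 | 25 | 34.7%
1 | 22 | 30.6%
5 | 9 | 12.5%
6 | 6 | 8.3%
8 | 4 | 5.6%
9 | 2 | 2.8%
3 | 1 | 1.4%
7 | 1 | 1.4%
2 | 1 | 1.4%
4 | 1 | 1.4%

Other Punctuation
Value | Count | Frequency (%)
? | 11 | 39.3%
. | 10 | 35.7%
! | 3 | 10.7%
' | 2 | 7.1%
, | 2 | 7.1%

Space Separator
Value | Count | Frequency (%)
(space) | 144 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 4 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%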

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 459 | 64.7%
Common | 250 | 35.3%

Most frequent character per script

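Hangul
Value | Count | Frequency (%)
(not recovered) | 23 | 5.0%
(not recovered) | 20 | 4.4%
(not recovered) | 11 | 2.4%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 8 | 1.7%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
Other values (160) | 349 | 76.0%

Common
Value | Count | Frequency (%)
(space) | 144 | 57.6%
0 | 25 | 10.0%
1 | 22 | 8.8%
? | 11 | 4.4%
. | 10 | 4.0%
5 | 9 | 3.6%
6 | 6 | 2.4%
~ | 4 | 1.6%
8 | 4 | 1.6%
! | 3 | 1.2%
Other values (9) | 12 | 4.8%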

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 459 | 64.7%
ASCII | 250 | 35.3%

Most frequent character per block

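ASCII
Value | Count | Frequency (%)
(space) | 144 | 57.6%
0 | 25 | 10.0%
1 | 22 | 8.8%
? | 11 | 4.4%
. | 10 | 4.0%
5 | 9 | 3.6%
6 | 6 | 2.4%
~ | 4 | 1.6%
8 | 4 | 1.6%
! | 3 | 1.2%
Other values (9) | 12 | 4.8%

Hangul
Value | Count | Frequency (%)
(not recovered) | 23 | 5.0%
(not recovered) | 20 | 4.4%
(not recovered) | 11 | 2.4%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 9 | 2.0%
(not recovered) | 8 | 1.7%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
(not recovered) | 7 | 1.5%
Other values (160) | 349 | 76.0%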

VID_SJ_CN
Text

MISSING 

Distinct: 11
Distinct (%): 55.0%
Missing: 32
Missing (%): 61.5%
Memory size: 548.0 B

Length

Max length: 39
Median length: 37
Mean length: 17.95
Min length: 8

Characters and Unicode

Total characters: 359
Distinct characters: 114
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10
Unique (%): 50.0%

Sample

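1st row: 추운 겨울철에 몸속 노폐물이 더 쌓이는 이유는?! [Why does more waste build up in the body during the cold winter?!]
2nd row: 20160106
3rd row: 스튜디오를 몸속 쓰레기 배출 현장으로 만든 '5분 호흡법'을 배워보자! [Let's learn the '5-minute breathing method' that turned the studio into a body-waste-expelling scene!]
4th row: 20160106
5th row: 돌발 상황! 허참의 몸속 쓰레기를 배출하는 현장 포착?! [Unexpected situation! Heo Cham caught in the act of expelling body waste?!]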
Value | Count | Frequency (%)
20160106 | 10 | 11.8%
몸속 [inside the body] | 5 | 5.9%
쓰레기를 [waste, as object] | 2 | 2.4%
5분 [5 minutes] | 2 | 2.4%
육류 [meat] | 2 | 2.4%
쓰레기의 [of waste] | 1 | 1.2%
원인이 [cause, as subject] | 1 | 1.2%
있다 [there is] | 1 | 1.2%
현대인은 [modern people, as topic] | 1 | 1.2%
(not recovered) | 1 | 1.2%
Other values (59) | 59 | 69.4%

Most occurring characters

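Value | Count | Frequency (%)
(space) | 65 | 18.1%
0 | 30 | 8.4%
1 | 20 | 5.6%
6 | 20 | 5.6%
2 | 10 | 2.8%
! | 10 | 2.8%
? | 8 | 2.2%
(not recovered) | 6 | 1.7%
(not recovered) | 6 | 1.7%
(not recovered) | 5 | 1.4%
Other values (104) | 179 | 49.9%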

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 191 | 53.2%
Decimal Number | 82 | 22.8%
Space Separator | 65 | 18.1%
Other Punctuation | 21 | 5.8%

Most frequent character per category

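Other Letter
Value | Count | Frequency (%)
(not recovered) | 6 | 3.1%
(not recovered) | 6 | 3.1%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
Other values (94) | 144 | 75.4%

Decimal Number
Value | Count | Frequency (%)
0 | 30 | 36.6%
1 | 20 | 24.4%
6 | 20 | 24.4%
2 | 10 | 12.2%
5 | 2 | 2.4%

Other Punctuation
Value | Count | Frequency (%)
! | 10 | 47.6%
? | 8 | 38.1%
' | 2 | 9.5%
, | 1 | 4.8%

Space Separator
Value | Count | Frequency (%)
(space) | 65 | 100.0%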

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 191 | 53.2%
Common | 168 | 46.8%

Most frequent character per script

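Hangul
Value | Count | Frequency (%)
(not recovered) | 6 | 3.1%
(not recovered) | 6 | 3.1%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
Other values (94) | 144 | 75.4%

Common
Value | Count | Frequency (%)
(space) | 65 | 38.7%
0 | 30 | 17.9%
1 | 20 | 11.9%
6 | 20 | 11.9%
2 | 10 | 6.0%
! | 10 | 6.0%
? | 8 | 4.8%
5 | 2 | 1.2%
' | 2 | 1.2%
, | 1 | 0.6%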

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 191 | 53.2%
ASCII | 168 | 46.8%

Most frequent character per block

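ASCII
Value | Count | Frequency (%)
(space) | 65 | 38.7%
0 | 30 | 17.9%
1 | 20 | 11.9%
6 | 20 | 11.9%
2 | 10 | 6.0%
! | 10 | 6.0%
? | 8 | 4.8%
5 | 2 | 1.2%
' | 2 | 1.2%
, | 1 | 0.6%

Hangul
Value | Count | Frequency (%)
(not recovered) | 6 | 3.1%
(not recovered) | 6 | 3.1%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 5 | 2.6%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
(not recovered) | 4 | 2.1%
Other values (94) | 144 | 75.4%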

VID_CN
Text

MISSING 

Distinct: 20
Distinct (%): 100.0%
Missing: 32
Missing (%): 61.5%
Memory size: 548.0 B

Length

Max length: 82
Median length: 58
Mean length: 56
Min length: 24

Characters and Unicode

Total characters: 1120
Distinct characters: 164
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 100.0%

Sample

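1st row: 추운 겨울철, 몸속 쓰레기가 더 많이 쌓인다?! [In the cold winter, more waste builds up in the body?!]
2nd row: http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010583
3rd row: 편안하게 앉은 자세로 쉽게 따라 해볼 수 있는 5분 호흡법. [A 5-minute breathing method you can easily follow while sitting comfortably.]
4th row: http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010584
5th row: 5분 호흡법을 배우는 도중 갑자기 신호(?)가 온 MC 허참! [MC Heo Cham suddenly gets a signal(?) while learning the 5-minute breathing method!]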
Value | Count | Frequency (%)
5분 [5 minutes] | 3 | 3.2%
밥상 [dinner table] | 2 | 2.2%
호흡법을 [breathing method, as object] | 2 | 2.2%
있다 [there is] | 2 | 2.2%
(not recovered) | 2 | 2.2%
추운 [cold] | 1 | 1.1%
섭취가 [intake, as subject] | 1 | 1.1%
습관 [habit] | 1 | 1.1%
먹는 [eating] | 1 | 1.1%
빨리 [quickly] | 1 | 1.1%
Other values (77) | 77 | 82.8%

Most occurring characters

Value | Count | Frequency (%)
t | 80 | 7.1%
n | 80 | 7.1%
(space) | 73 | 6.5%
o | 50 | 4.5%
e | 50 | 4.5%
c | 50 | 4.5%
. | 42 | 3.8%
/ | 40 | 3.6%
1 | 32 | 2.9%
m | 30 | 2.7%
Other values (154) | 593 | 52.9%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 560 | 50.0%
Other Letter | 203 | 18.1%
Other Punctuation | 126 | 11.2%
Decimal Number | 94 | 8.4%
Space Separator | 73 | 6.5%
Connector Punctuation | 30 | 2.7%
Math Symbol | 20 | 1.8%
Uppercase Letter | 12 | 1.1%
Open Punctuation | 1 | 0.1%
Close Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
Other values (111) | 154 | 75.9%

Lowercase Letter
Value | Count | Frequency (%)
t | 80 | 14.3%
n | 80 | 14.3%
o | 50 | 8.9%
e | 50 | 8.9%
c | 50 | 8.9%
m | 30 | 5.4%
w | 30 | 5.4%
d | 20 | 3.6%
l | 20 | 3.6%
r | 20 | 3.6%
Other values (9) | 130 | 23.2%

Decimal Number
Value | Count | Frequency (%)
1 | 32 | 34.0%
0 | 25 | 26.6%
2 | 12 | 12.8%
5 | 10 | 10.6%
6 | 6 | 6.4%
8 | 4 | 4.3%
9 | 2 | 2.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
7 | 1 | 1.1%

Other Punctuation
Value | Count | Frequency (%)
. | 42 | 33.3%
/ | 40 | 31.7%
? | 14 | 11.1%
: | 10 | 7.9%
& | 10 | 7.9%
! | 8 | 6.3%
, | 2 | 1.6%

Uppercase Letter
Value | Count | Frequency (%)
C | 11 | 91.7%
M | 1 | 8.3%

Space Separator
Value | Count | Frequency (%)
(space) | 73 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 30 | 100.0%

Math Symbol
Value | Count | Frequency (%)
= | 20 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 572 | 51.1%
Common | 345 | 30.8%
Hangul | 203 | 18.1%

Most frequent character per script

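Hangul
Value | Count | Frequency (%)
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
Other values (111) | 154 | 75.9%

Common
Value | Count | Frequency (%)
(space) | 73 | 21.2%
. | 42 | 12.2%
/ | 40 | 11.6%
1 | 32 | 9.3%
_ | 30 | 8.7%
0 | 25 | 7.2%
= | 20 | 5.8%
? | 14 | 4.1%
2 | 12 | 3.5%
: | 10 | 2.9%
Other values (12) | 47 | 13.6%

Latin
Value | Count | Frequency (%)
t | 80 | 14.0%
n | 80 | 14.0%
o | 50 | 8.7%
e | 50 | 8.7%
c | 50 | 8.7%
m | 30 | 5.2%
w | 30 | 5.2%
d | 20 | 3.5%
l | 20 | 3.5%
r | 20 | 3.5%
Other values (11) | 142 | 24.8%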

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 917 | 81.9%
Hangul | 203 | 18.1%

Most frequent character per block

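ASCII
Value | Count | Frequency (%)
t | 80 | 8.7%
n | 80 | 8.7%
(space) | 73 | 8.0%
o | 50 | 5.5%
e | 50 | 5.5%
c | 50 | 5.5%
. | 42 | 4.6%
/ | 40 | 4.4%
1 | 32 | 3.5%
m | 30 | 3.3%
Other values (33) | 390 | 42.5%

Hangul
Value | Count | Frequency (%)
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
Other values (111) | 154 | 75.9%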

REG_DATE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 52
Missing (%): 100.0%
Memory size: 600.0 B

VOD_CRS_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 52
Missing (%): 100.0%
Memory size: 600.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 52
Missing (%): 100.0%
Memory size: 600.0 B

Correlations

Variable | RSTRC_VID_ESSN_NO | VID_SJ_CN | VID_CN
RSTRC_VID_ESSN_NO | 1.000 | 1.000 | 1.000
VID_SJ_CN | 1.000 | 1.000 | 1.000
VID_CN | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
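
Nullity correlation can be read as the Pearson correlation between the missingness indicators of two columns. The sketch below illustrates that idea in plain Python; `nullity_correlation` and the toy columns are hypothetical, and this is not claimed to be the code that drew the heatmap. A column that is entirely missing, such as REG_DATE in this dataset, has a constant indicator, so its correlation with any other column is undefined.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missingness indicators of two columns.

    Returns None when either indicator is constant (column entirely present
    or entirely missing), because the correlation is then undefined.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)

# Hypothetical toy columns (None marks a missing value).
title = ["a", None, "b", None]
url = ["u1", None, "u2", None]
always_missing = [None, None, None, None]

together = nullity_correlation(title, url)               # missing in the same rows
undefined = nullity_correlation(title, always_missing)   # constant indicator
print(together, undefined)
```

A value near 1 means the two columns tend to be missing in the same rows; a value near -1 means one tends to be present when the other is missing.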

Sample

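Index | RSTRC_VID_ESSN_NO | VID_SJ_CN | VID_CN | REG_DATE | VOD_CRS_NM | Unnamed: 5
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
1 | 1010583 | 추운 겨울철에 몸속 노폐물이 더 쌓이는 이유는?! [Why does more waste build up in the body during the cold winter?!] | 추운 겨울철, 몸속 쓰레기가 더 많이 쌓인다?! [In the cold winter, more waste builds up in the body?!] | <NA> | <NA> | <NA>
2 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
3 | 몸속 쓰레기 축적으로 각종 질병에 취약해진다고 하는데... [They say a buildup of waste inside the body makes you vulnerable to all kinds of diseases...] | <NA> | <NA> | <NA> | <NA> | <NA>
4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5 | 그렇다면 이 쓰레기들을 방치하면 어떻게 될까? [So what happens if this waste is left alone?] | 20160106 | http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010583 | <NA> | <NA> | <NA>
6 | 1010584 | 스튜디오를 몸속 쓰레기 배출 현장으로 만든 '5분 호흡법'을 배워보자! [Let's learn the '5-minute breathing method' that turned the studio into a body-waste-expelling scene!] | 편안하게 앉은 자세로 쉽게 따라 해볼 수 있는 5분 호흡법. [A 5-minute breathing method you can easily follow while sitting comfortably.] | <NA> | <NA> | <NA>
7 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
8 | 이 호흡법을 잘 배우면 몸속 쓰레기를 배출할 수 있다고 하는데~ [They say that learning this breathing method well lets you expel waste from the body~] | <NA> | <NA> | <NA> | <NA> | <NA>
9 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Index | RSTRC_VID_ESSN_NO | VID_SJ_CN | VID_CN | REG_DATE | VOD_CRS_NM | Unnamed: 5
42 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
43 | 그러나 밥상만큼이나 중요한 '무언가'가 있다고 하는데? [But they say there is 'something' just as important as the dinner table?] | <NA> | <NA> | <NA> | <NA> | <NA>
44 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
45 | 우리 몸속 쓰레기를 줄일 수 있는 건강한 식사법은 무엇일까? [What is a healthy way of eating that can reduce the waste in our bodies?] | 20160106 | http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010606 | <NA> | <NA> | <NA>
46 | 1010607 | 물 없이 족욕, 반신욕의 효과를 누리는 방법?! [A way to enjoy the effects of a foot bath and a half-body bath without water?!] | 자면서 물도 없이 족욕과 반신욕의 효과를 내는 방법이 있다?! [There is a way to get the effects of a foot bath and a half-body bath while sleeping, without any water?!] | <NA> | <NA> | <NA>
47 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
48 | 그 비밀은 바로 현미로 만든 찜질팩?! [The secret is a heat pack made with brown rice?!] | <NA> | <NA> | <NA> | <NA> | <NA>
49 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
50 | 긴 시간 동안 지속되는 반신욕 효과를 낼 수 있는 현미 찜질팩 만드는 법은 과연? [So how do you make a brown-rice heat pack that gives a long-lasting half-body-bath effect?] | 20160106 | http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010607 | <NA> | <NA> | <NA>
51 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>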

Duplicate rows

Most frequently occurring

Index | RSTRC_VID_ESSN_NO | VID_SJ_CN | VID_CN | # duplicates
0 | <NA> | <NA> | <NA> | 22
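
The overview reports 1 duplicate row (1.9%), while the table above lists 22 occurrences of the all-missing row. One reading, offered here as an assumption rather than documented behaviour, is that the overview counts distinct duplicated rows (1 of 52 is 1.9%) and the `# duplicates` column counts occurrences. The sketch below shows the difference between the two counts on hypothetical rows; `duplicate_groups` is an illustrative helper.

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Hypothetical rows; None marks a missing value.
rows = [
    (None, None, None),
    ("1010583", "title A", "description A"),
    (None, None, None),
    ("1010584", "title B", "description B"),
    (None, None, None),
]
groups = duplicate_groups(rows)

print(len(groups))                  # number of distinct duplicated rows
print(groups[(None, None, None)])   # occurrences of the all-missing row
```

In this dataset the only duplicated pattern is the row with no values in any supported column, so dropping all-missing rows would also remove every duplicate.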