Overview

Dataset statistics

Number of variables6
Number of observations59
Missing cells280
Missing cells (%)79.1%
Duplicate rows1
Duplicate rows (%)1.7%
Total size in memory3.1 KiB
Average record size in memory53.2 B

Variable types

Text3
Unsupported3

Dataset

Description샘플 데이터
AuthorMBN
URLhttps://kdx.kr/data/view/156

Alerts

Dataset has 1 (1.7%) duplicate rowsDuplicates
RSTRC_VID_ESSN_NO has 25 (42.4%) missing valuesMissing
VID_SJ_CN has 39 (66.1%) missing valuesMissing
VID_CN has 39 (66.1%) missing valuesMissing
REG_DATE has 59 (100.0%) missing valuesMissing
VOD_CRS_NM has 59 (100.0%) missing valuesMissing
Unnamed: 5 has 59 (100.0%) missing valuesMissing
REG_DATE is an unsupported type, check if it needs cleaning or further analysisUnsupported
VOD_CRS_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-17 03:37:08.800826
Analysis finished2024-04-17 03:37:09.235889
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

RSTRC_VID_ESSN_NO
Text

MISSING 

Distinct34
Distinct (%)100.0%
Missing25
Missing (%)42.4%
Memory size604.0 B
2024-04-17T12:37:09.463737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length35
Mean length23.235294
Min length7

Characters and Unicode

Total characters790
Distinct characters212
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row1010617
2nd row그 험한 산행 끝에 겨우 찾은 자연인의 보금자리~
3rd row자연인은 추운 겨울날 깊은 산 속까지 찾아온 윤택 씨를 반갑게 맞아주는데~
4th row1010618
5th row기발한 아이디어를 엿볼 수 있는 자연인의 보금자리!
ValueCountFrequency (%)
윤택 6
 
3.1%
자연인은 4
 
2.1%
자연인 4
 
2.1%
씨에게 3
 
1.6%
3
 
1.6%
2
 
1.0%
없는 2
 
1.0%
자연인과 2
 
1.0%
보금자리 2
 
1.0%
자연인의 2
 
1.0%
Other values (158) 163
84.5%
2024-04-17T12:37:09.870487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
 
20.3%
1 24
 
3.0%
23
 
2.9%
0 22
 
2.8%
19
 
2.4%
19
 
2.4%
. 17
 
2.2%
14
 
1.8%
13
 
1.6%
11
 
1.4%
Other values (202) 468
59.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 523
66.2%
Space Separator 160
 
20.3%
Decimal Number 71
 
9.0%
Other Punctuation 27
 
3.4%
Math Symbol 9
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
4.4%
19
 
3.6%
19
 
3.6%
14
 
2.7%
13
 
2.5%
11
 
2.1%
9
 
1.7%
9
 
1.7%
9
 
1.7%
8
 
1.5%
Other values (186) 389
74.4%
Decimal Number
ValueCountFrequency (%)
1 24
33.8%
0 22
31.0%
6 10
14.1%
4 4
 
5.6%
2 3
 
4.2%
3 3
 
4.2%
9 2
 
2.8%
8 1
 
1.4%
5 1
 
1.4%
7 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 17
63.0%
! 6
 
22.2%
, 2
 
7.4%
? 2
 
7.4%
Space Separator
ValueCountFrequency (%)
160
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 523
66.2%
Common 267
33.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
4.4%
19
 
3.6%
19
 
3.6%
14
 
2.7%
13
 
2.5%
11
 
2.1%
9
 
1.7%
9
 
1.7%
9
 
1.7%
8
 
1.5%
Other values (186) 389
74.4%
Common
ValueCountFrequency (%)
160
59.9%
1 24
 
9.0%
0 22
 
8.2%
. 17
 
6.4%
6 10
 
3.7%
~ 9
 
3.4%
! 6
 
2.2%
4 4
 
1.5%
2 3
 
1.1%
3 3
 
1.1%
Other values (6) 9
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 523
66.2%
ASCII 267
33.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
59.9%
1 24
 
9.0%
0 22
 
8.2%
. 17
 
6.4%
6 10
 
3.7%
~ 9
 
3.4%
! 6
 
2.2%
4 4
 
1.5%
2 3
 
1.1%
3 3
 
1.1%
Other values (6) 9
 
3.4%
Hangul
ValueCountFrequency (%)
23
 
4.4%
19
 
3.6%
19
 
3.6%
14
 
2.7%
13
 
2.5%
11
 
2.1%
9
 
1.7%
9
 
1.7%
9
 
1.7%
8
 
1.5%
Other values (186) 389
74.4%

VID_SJ_CN
Text

MISSING 

Distinct11
Distinct (%)55.0%
Missing39
Missing (%)66.1%
Memory size604.0 B
2024-04-17T12:37:10.068241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length30.5
Mean length17.6
Min length8

Characters and Unicode

Total characters352
Distinct characters123
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)50.0%

Sample

1st row험한 산길을 헤매고 헤맨 끝에 만난 유쾌한 자연인!
2nd row20160107
3rd row'혹시 과학자?' 기발한 아이디어가 담긴 자연인의 집!
4th row20160107
5th row윤택 씨에게 뭔가를 자꾸 주고 싶은 자연인!
ValueCountFrequency (%)
20160107 10
 
12.3%
자연인의 5
 
6.2%
자연인 3
 
3.7%
윤택 2
 
2.5%
야심 1
 
1.2%
비장의 1
 
1.2%
1
 
1.2%
꺼내 1
 
1.2%
차게 1
 
1.2%
되자 1
 
1.2%
Other values (55) 55
67.9%
2024-04-17T12:37:10.358699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61
 
17.3%
0 30
 
8.5%
1 20
 
5.7%
12
 
3.4%
2 10
 
2.8%
6 10
 
2.8%
7 10
 
2.8%
10
 
2.8%
9
 
2.6%
! 8
 
2.3%
Other values (113) 172
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 197
56.0%
Decimal Number 80
22.7%
Space Separator 61
 
17.3%
Other Punctuation 13
 
3.7%
Math Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
6.1%
10
 
5.1%
9
 
4.6%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
3
 
1.5%
Other values (103) 137
69.5%
Decimal Number
ValueCountFrequency (%)
0 30
37.5%
1 20
25.0%
2 10
 
12.5%
6 10
 
12.5%
7 10
 
12.5%
Other Punctuation
ValueCountFrequency (%)
! 8
61.5%
' 4
30.8%
? 1
 
7.7%
Space Separator
ValueCountFrequency (%)
61
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 197
56.0%
Common 155
44.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
6.1%
10
 
5.1%
9
 
4.6%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
3
 
1.5%
Other values (103) 137
69.5%
Common
ValueCountFrequency (%)
61
39.4%
0 30
19.4%
1 20
 
12.9%
2 10
 
6.5%
6 10
 
6.5%
7 10
 
6.5%
! 8
 
5.2%
' 4
 
2.6%
~ 1
 
0.6%
? 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 197
56.0%
ASCII 155
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
61
39.4%
0 30
19.4%
1 20
 
12.9%
2 10
 
6.5%
6 10
 
6.5%
7 10
 
6.5%
! 8
 
5.2%
' 4
 
2.6%
~ 1
 
0.6%
? 1
 
0.6%
Hangul
ValueCountFrequency (%)
12
 
6.1%
10
 
5.1%
9
 
4.6%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
3
 
1.5%
Other values (103) 137
69.5%

VID_CN
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing39
Missing (%)66.1%
Memory size604.0 B
2024-04-17T12:37:10.493013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length82
Median length60.5
Mean length55.15
Min length18

Characters and Unicode

Total characters1103
Distinct characters156
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row한발 한발 내밀기도 힘든 거친 산길!
2nd rowhttp://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010617
3rd row비닐하우스를 열어보니 황토방이?!
4th rowhttp://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010618
5th row5성급(?) 닭 호텔을 소개해주는 자연인.
ValueCountFrequency (%)
자연인 4
 
4.9%
한발 2
 
2.4%
2
 
2.4%
윤택 2
 
2.4%
해주셨던 1
 
1.2%
준비하는 1
 
1.2%
회사에 1
 
1.2%
시절 1
 
1.2%
하던 1
 
1.2%
운전을 1
 
1.2%
Other values (66) 66
80.5%
2024-04-17T12:37:10.726999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 80
 
7.3%
t 80
 
7.3%
64
 
5.8%
e 50
 
4.5%
o 50
 
4.5%
c 50
 
4.5%
. 46
 
4.2%
/ 40
 
3.6%
1 34
 
3.1%
_ 30
 
2.7%
Other values (146) 579
52.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 560
50.8%
Other Letter 202
 
18.3%
Other Punctuation 122
 
11.1%
Decimal Number 91
 
8.3%
Space Separator 64
 
5.8%
Connector Punctuation 30
 
2.7%
Math Symbol 22
 
2.0%
Uppercase Letter 10
 
0.9%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
4.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
Other values (103) 145
71.8%
Lowercase Letter
ValueCountFrequency (%)
n 80
14.3%
t 80
14.3%
e 50
 
8.9%
o 50
 
8.9%
c 50
 
8.9%
m 30
 
5.4%
w 30
 
5.4%
l 20
 
3.6%
r 20
 
3.6%
b 20
 
3.6%
Other values (9) 130
23.2%
Decimal Number
ValueCountFrequency (%)
1 34
37.4%
0 22
24.2%
2 12
 
13.2%
6 10
 
11.0%
4 4
 
4.4%
3 3
 
3.3%
9 2
 
2.2%
5 2
 
2.2%
7 1
 
1.1%
8 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 46
37.7%
/ 40
32.8%
? 12
 
9.8%
: 10
 
8.2%
& 10
 
8.2%
! 3
 
2.5%
, 1
 
0.8%
Math Symbol
ValueCountFrequency (%)
= 20
90.9%
~ 2
 
9.1%
Space Separator
ValueCountFrequency (%)
64
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 30
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 570
51.7%
Common 331
30.0%
Hangul 202
 
18.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
4.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
Other values (103) 145
71.8%
Common
ValueCountFrequency (%)
64
19.3%
. 46
13.9%
/ 40
12.1%
1 34
10.3%
_ 30
9.1%
0 22
 
6.6%
= 20
 
6.0%
? 12
 
3.6%
2 12
 
3.6%
: 10
 
3.0%
Other values (13) 41
12.4%
Latin
ValueCountFrequency (%)
n 80
14.0%
t 80
14.0%
e 50
 
8.8%
o 50
 
8.8%
c 50
 
8.8%
m 30
 
5.3%
w 30
 
5.3%
l 20
 
3.5%
r 20
 
3.5%
b 20
 
3.5%
Other values (10) 140
24.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 901
81.7%
Hangul 202
 
18.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 80
 
8.9%
t 80
 
8.9%
64
 
7.1%
e 50
 
5.5%
o 50
 
5.5%
c 50
 
5.5%
. 46
 
5.1%
/ 40
 
4.4%
1 34
 
3.8%
_ 30
 
3.3%
Other values (33) 377
41.8%
Hangul
ValueCountFrequency (%)
9
 
4.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
7
 
3.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
Other values (103) 145
71.8%

REG_DATE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing59
Missing (%)100.0%
Memory size663.0 B

VOD_CRS_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing59
Missing (%)100.0%
Memory size663.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing59
Missing (%)100.0%
Memory size663.0 B

Correlations

2024-04-17T12:37:10.791640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RSTRC_VID_ESSN_NOVID_SJ_CNVID_CN
RSTRC_VID_ESSN_NO1.0001.0001.000
VID_SJ_CN1.0001.0001.000
VID_CN1.0001.0001.000

Missing values

2024-04-17T12:37:09.049283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T12:37:09.126989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-17T12:37:09.197501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

RSTRC_VID_ESSN_NOVID_SJ_CNVID_CNREG_DATEVOD_CRS_NMUnnamed: 5
0<NA><NA><NA><NA><NA><NA>
11010617험한 산길을 헤매고 헤맨 끝에 만난 유쾌한 자연인!한발 한발 내밀기도 힘든 거친 산길!<NA><NA><NA>
2<NA><NA><NA><NA><NA><NA>
3그 험한 산행 끝에 겨우 찾은 자연인의 보금자리~<NA><NA><NA><NA><NA>
4<NA><NA><NA><NA><NA><NA>
5자연인은 추운 겨울날 깊은 산 속까지 찾아온 윤택 씨를 반갑게 맞아주는데~20160107http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010617<NA><NA><NA>
61010618'혹시 과학자?' 기발한 아이디어가 담긴 자연인의 집!비닐하우스를 열어보니 황토방이?!<NA><NA><NA>
7<NA><NA><NA><NA><NA><NA>
8기발한 아이디어를 엿볼 수 있는 자연인의 보금자리!<NA><NA><NA><NA><NA>
9<NA><NA><NA><NA><NA><NA>
RSTRC_VID_ESSN_NOVID_SJ_CNVID_CNREG_DATEVOD_CRS_NMUnnamed: 5
49<NA><NA><NA><NA><NA><NA>
50두 사람은 추위를 쫓기 위해 소란스러운 아침 운동을 시작하는데~<NA><NA><NA><NA><NA>
51<NA><NA><NA><NA><NA><NA>
52일어나자마자 제대로 몸 푸는 자연인과 윤택 씨!20160107http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010640<NA><NA><NA>
531010641홀로 육 형제를 키워내신 자연인의 위대한 어머니과거 어머니가 자주 해주셨던 도토리묵을 만드는 자연인.<NA><NA><NA>
54<NA><NA><NA><NA><NA><NA>
55자연인은 도토리묵을 만들 때면 늘 어머니 생각이 난다고...<NA><NA><NA><NA><NA>
56<NA><NA><NA><NA><NA><NA>
57육 형제를 키우느라 고생하신 어머니를 생각하자 눈물이 절로 나는 자연인.20160107http://www.mbn.co.kr/player/movieContents.mbn?content_cls_cd=21&content_id=1010641<NA><NA><NA>
58<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

RSTRC_VID_ESSN_NOVID_SJ_CNVID_CN# duplicates
0<NA><NA><NA>25