Overview

Dataset statistics

Number of variables25
Number of observations1517
Missing cells36565
Missing cells (%)96.4%
Duplicate rows93
Duplicate rows (%)6.1%
Total size in memory315.7 KiB
Average record size in memory213.1 B

Variable types

Text12
Unsupported13

Dataset

Description샘플 데이터
AuthorKDX한국데이터거래소
URLhttps://kdx.kr/data/view/31059

Alerts

Dataset has 93 (6.1%) duplicate rowsDuplicates
MBN_MDA_SP_CD has 329 (21.7%) missing valuesMissing
MBN_ART_ESSN_NO has 1487 (98.0%) missing valuesMissing
MDA_CGR_NM has 1495 (98.5%) missing valuesMissing
STD_YEAR has 1487 (98.0%) missing valuesMissing
ART_SJ_CN has 1487 (98.0%) missing valuesMissing
ART_CN has 1487 (98.0%) missing valuesMissing
ATCH_IMG_NM has 1507 (99.3%) missing valuesMissing
JRNL_NM has 1512 (99.7%) missing valuesMissing
WRT_DATE has 1509 (99.5%) missing valuesMissing
ART_POSA has 1515 (99.9%) missing valuesMissing
ART_NOUN has 1515 (99.9%) missing valuesMissing
ART_TAG has 1517 (100.0%) missing valuesMissing
ART_PRS_NM has 1514 (99.8%) missing valuesMissing
ART_RNK_NM has 1517 (100.0%) missing valuesMissing
ART_INST_NM has 1517 (100.0%) missing valuesMissing
ART_AREA_NM has 1517 (100.0%) missing valuesMissing
ART_GD_NM has 1517 (100.0%) missing valuesMissing
ART_QY has 1517 (100.0%) missing valuesMissing
ART_EVT has 1517 (100.0%) missing valuesMissing
ART_DT has 1517 (100.0%) missing valuesMissing
ART_TIME has 1517 (100.0%) missing valuesMissing
ART_ANM has 1517 (100.0%) missing valuesMissing
ART_PLNT has 1517 (100.0%) missing valuesMissing
ART_AF has 1517 (100.0%) missing valuesMissing
Unnamed: 24 has 1517 (100.0%) missing valuesMissing
ART_TAG is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_RNK_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_INST_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_AREA_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_GD_NM is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_QY is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_EVT is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_DT is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_TIME is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_ANM is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_PLNT is an unsupported type, check if it needs cleaning or further analysisUnsupported
ART_AF is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-11 03:28:55.051312
Analysis finished2024-03-11 03:28:56.168766
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

MBN_MDA_SP_CD
Text

MISSING 

Distinct981
Distinct (%)82.6%
Missing329
Missing (%)21.7%
Memory size12.0 KiB
2024-03-11T12:28:56.399948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length106
Median length81
Mean length16.56229
Min length1

Characters and Unicode

Total characters19676
Distinct characters565
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique885 ?
Unique (%)74.5%

Sample

1st rowMBN
2nd row 런던에 입성한 성화는 시내 최고 명물인 노스
3rd row그리니치 아레나 지붕 위까지 올라갔습니다.
4th row NBA 선수들로 구성된 미국 농구 대표팀은 선수촌 대신 호텔에서 머물기로 해 눈총을 사고 있습니다.
5th row 런던올림픽 소식, 김동환 기자가 전합니다.
ValueCountFrequency (%)
55
 
1.2%
선수 50
 
1.1%
스포츠 42
 
0.9%
올림픽 38
 
0.8%
mbn 38
 
0.8%
기자 36
 
0.8%
인터뷰 34
 
0.8%
ioc 33
 
0.7%
컴퍼니 25
 
0.6%
해외 24
 
0.5%
Other values (1950) 4106
91.6%
2024-03-11T12:28:56.852791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3611
 
18.4%
, 1797
 
9.1%
_ 351
 
1.8%
257
 
1.3%
214
 
1.1%
199
 
1.0%
195
 
1.0%
. 194
 
1.0%
187
 
1.0%
186
 
0.9%
Other values (555) 12485
63.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12197
62.0%
Space Separator 3611
 
18.4%
Other Punctuation 2118
 
10.8%
Lowercase Letter 468
 
2.4%
Decimal Number 439
 
2.2%
Uppercase Letter 379
 
1.9%
Connector Punctuation 351
 
1.8%
Open Punctuation 35
 
0.2%
Close Punctuation 35
 
0.2%
Dash Punctuation 26
 
0.1%
Other values (2) 17
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
257
 
2.1%
214
 
1.8%
199
 
1.6%
195
 
1.6%
187
 
1.5%
186
 
1.5%
169
 
1.4%
167
 
1.4%
159
 
1.3%
158
 
1.3%
Other values (484) 10306
84.5%
Lowercase Letter
ValueCountFrequency (%)
o 61
13.0%
c 57
12.2%
m 52
11.1%
n 52
11.1%
b 39
8.3%
k 33
7.1%
r 30
 
6.4%
a 30
 
6.4%
i 23
 
4.9%
l 15
 
3.2%
Other values (12) 76
16.2%
Uppercase Letter
ValueCountFrequency (%)
C 71
18.7%
N 60
15.8%
O 60
15.8%
I 51
13.5%
B 47
12.4%
M 41
10.8%
A 12
 
3.2%
D 10
 
2.6%
J 9
 
2.4%
E 5
 
1.3%
Other values (4) 13
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 1797
84.8%
. 194
 
9.2%
" 40
 
1.9%
: 22
 
1.0%
/ 20
 
0.9%
' 18
 
0.8%
@ 13
 
0.6%
% 6
 
0.3%
· 5
 
0.2%
3
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 96
21.9%
1 95
21.6%
0 58
13.2%
4 42
9.6%
5 40
9.1%
7 34
 
7.7%
9 30
 
6.8%
8 21
 
4.8%
3 17
 
3.9%
6 6
 
1.4%
Open Punctuation
ValueCountFrequency (%)
12
34.3%
10
28.6%
[ 7
20.0%
( 6
17.1%
Close Punctuation
ValueCountFrequency (%)
12
34.3%
10
28.6%
] 7
20.0%
) 6
17.1%
Other Symbol
ValueCountFrequency (%)
13
86.7%
2
 
13.3%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
3611
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 351
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12197
62.0%
Common 6632
33.7%
Latin 847
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
257
 
2.1%
214
 
1.8%
199
 
1.6%
195
 
1.6%
187
 
1.5%
186
 
1.5%
169
 
1.4%
167
 
1.4%
159
 
1.3%
158
 
1.3%
Other values (484) 10306
84.5%
Latin
ValueCountFrequency (%)
C 71
 
8.4%
o 61
 
7.2%
N 60
 
7.1%
O 60
 
7.1%
c 57
 
6.7%
m 52
 
6.1%
n 52
 
6.1%
I 51
 
6.0%
B 47
 
5.5%
M 41
 
4.8%
Other values (26) 295
34.8%
Common
ValueCountFrequency (%)
3611
54.4%
, 1797
27.1%
_ 351
 
5.3%
. 194
 
2.9%
2 96
 
1.4%
1 95
 
1.4%
0 58
 
0.9%
4 42
 
0.6%
" 40
 
0.6%
5 40
 
0.6%
Other values (25) 308
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12197
62.0%
ASCII 7412
37.7%
None 49
 
0.2%
Geometric Shapes 13
 
0.1%
Punctuation 3
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3611
48.7%
, 1797
24.2%
_ 351
 
4.7%
. 194
 
2.6%
2 96
 
1.3%
1 95
 
1.3%
C 71
 
1.0%
o 61
 
0.8%
N 60
 
0.8%
O 60
 
0.8%
Other values (53) 1016
 
13.7%
Hangul
ValueCountFrequency (%)
257
 
2.1%
214
 
1.8%
199
 
1.6%
195
 
1.6%
187
 
1.5%
186
 
1.5%
169
 
1.4%
167
 
1.4%
159
 
1.3%
158
 
1.3%
Other values (484) 10306
84.5%
Geometric Shapes
ValueCountFrequency (%)
13
100.0%
None
ValueCountFrequency (%)
12
24.5%
12
24.5%
10
20.4%
10
20.4%
· 5
10.2%
Punctuation
ValueCountFrequency (%)
3
100.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%

MBN_ART_ESSN_NO
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing1487
Missing (%)98.0%
Memory size12.0 KiB
2024-03-11T12:28:57.026028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length17
Mean length6.8
Min length2

Characters and Unicode

Total characters204
Distinct characters80
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row1030271
2nd row스타, 이스라엘
3rd row자크 로게
4th row1030336
5th row1981년_IOC, 투혼
ValueCountFrequency (%)
장미란 2
 
4.8%
이수영 2
 
4.8%
1030271 1
 
2.4%
명단 1
 
2.4%
이영학 1
 
2.4%
조중건 1
 
2.4%
1047815 1
 
2.4%
뉴스타 1
 
2.4%
1052891 1
 
2.4%
국세청 1
 
2.4%
Other values (30) 30
71.4%
2024-03-11T12:28:57.291883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18
 
8.8%
0 15
 
7.4%
12
 
5.9%
, 11
 
5.4%
3 10
 
4.9%
5 8
 
3.9%
8 6
 
2.9%
6
 
2.9%
7 6
 
2.9%
2 5
 
2.5%
Other values (70) 107
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
45.6%
Decimal Number 80
39.2%
Space Separator 12
 
5.9%
Other Punctuation 11
 
5.4%
Connector Punctuation 5
 
2.5%
Uppercase Letter 3
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
6.5%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (54) 62
66.7%
Decimal Number
ValueCountFrequency (%)
1 18
22.5%
0 15
18.8%
3 10
12.5%
5 8
10.0%
8 6
 
7.5%
7 6
 
7.5%
2 5
 
6.2%
6 5
 
6.2%
9 4
 
5.0%
4 3
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
O 1
33.3%
C 1
33.3%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 108
52.9%
Hangul 93
45.6%
Latin 3
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
6.5%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (54) 62
66.7%
Common
ValueCountFrequency (%)
1 18
16.7%
0 15
13.9%
12
11.1%
, 11
10.2%
3 10
9.3%
5 8
7.4%
8 6
 
5.6%
7 6
 
5.6%
2 5
 
4.6%
_ 5
 
4.6%
Other values (3) 12
11.1%
Latin
ValueCountFrequency (%)
I 1
33.3%
O 1
33.3%
C 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 111
54.4%
Hangul 93
45.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18
16.2%
0 15
13.5%
12
10.8%
, 11
9.9%
3 10
9.0%
5 8
7.2%
8 6
 
5.4%
7 6
 
5.4%
2 5
 
4.5%
_ 5
 
4.5%
Other values (6) 15
13.5%
Hangul
ValueCountFrequency (%)
6
 
6.5%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (54) 62
66.7%

MDA_CGR_NM
Text

MISSING 

Distinct21
Distinct (%)95.5%
Missing1495
Missing (%)98.5%
Memory size12.0 KiB
2024-03-11T12:28:57.482693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length46
Mean length17.090909
Min length3

Characters and Unicode

Total characters376
Distinct characters97
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)90.9%

Sample

1st row김동환
2nd row선수, 기자, 체조, 위원장
3rd row윤범기
4th row마라톤, 선수, 아리랑, 금메달, 기자, 위원장, MBN뉴스
5th row이권열
ValueCountFrequency (%)
기자 10
 
11.8%
선수 7
 
8.2%
mbn뉴스 5
 
5.9%
한국인 4
 
4.7%
김동환 2
 
2.4%
은메달 2
 
2.4%
현직 2
 
2.4%
장남 2
 
2.4%
회장 2
 
2.4%
선수위원 2
 
2.4%
Other values (44) 47
55.3%
2024-03-11T12:28:57.813789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
16.8%
, 60
 
16.0%
12
 
3.2%
11
 
2.9%
11
 
2.9%
10
 
2.7%
10
 
2.7%
10
 
2.7%
9
 
2.4%
0 7
 
1.9%
Other values (87) 173
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 222
59.0%
Space Separator 63
 
16.8%
Other Punctuation 60
 
16.0%
Uppercase Letter 15
 
4.0%
Decimal Number 10
 
2.7%
Lowercase Letter 6
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.4%
11
 
5.0%
11
 
5.0%
10
 
4.5%
10
 
4.5%
10
 
4.5%
9
 
4.1%
7
 
3.2%
6
 
2.7%
5
 
2.3%
Other values (76) 131
59.0%
Decimal Number
ValueCountFrequency (%)
0 7
70.0%
1 2
 
20.0%
6 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
B 5
33.3%
N 5
33.3%
M 5
33.3%
Lowercase Letter
ValueCountFrequency (%)
m 2
33.3%
b 2
33.3%
n 2
33.3%
Space Separator
ValueCountFrequency (%)
63
100.0%
Other Punctuation
ValueCountFrequency (%)
, 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 222
59.0%
Common 133
35.4%
Latin 21
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.4%
11
 
5.0%
11
 
5.0%
10
 
4.5%
10
 
4.5%
10
 
4.5%
9
 
4.1%
7
 
3.2%
6
 
2.7%
5
 
2.3%
Other values (76) 131
59.0%
Latin
ValueCountFrequency (%)
B 5
23.8%
N 5
23.8%
M 5
23.8%
m 2
 
9.5%
b 2
 
9.5%
n 2
 
9.5%
Common
ValueCountFrequency (%)
63
47.4%
, 60
45.1%
0 7
 
5.3%
1 2
 
1.5%
6 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 222
59.0%
ASCII 154
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
63
40.9%
, 60
39.0%
0 7
 
4.5%
B 5
 
3.2%
N 5
 
3.2%
M 5
 
3.2%
m 2
 
1.3%
b 2
 
1.3%
n 2
 
1.3%
1 2
 
1.3%
Hangul
ValueCountFrequency (%)
12
 
5.4%
11
 
5.0%
11
 
5.0%
10
 
4.5%
10
 
4.5%
10
 
4.5%
9
 
4.1%
7
 
3.2%
6
 
2.7%
5
 
2.3%
Other values (76) 131
59.0%

STD_YEAR
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing1487
Missing (%)98.0%
Memory size12.0 KiB
2024-03-11T12:28:58.000132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length46
Mean length17.666667
Min length8

Characters and Unicode

Total characters530
Distinct characters88
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row20120722
2nd row20120722202449
3rd row바르셀로나올림픽 대표팀, 국제올림픽위원회, NBA 스타 존 아매치, MBN뉴스, NBA, IOC
4th row20120723
5th row20120723212058
ValueCountFrequency (%)
ioc 6
 
7.6%
국제올림픽위원회 4
 
5.1%
mbn뉴스 3
 
3.8%
oci 3
 
3.8%
한국 2
 
2.5%
국세청 2
 
2.5%
mbn 2
 
2.5%
대한체육회 2
 
2.5%
뉴스타파 2
 
2.5%
nba 2
 
2.5%
Other values (50) 51
64.6%
2024-03-11T12:28:58.286216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 57
 
10.8%
0 54
 
10.2%
49
 
9.2%
1 42
 
7.9%
, 39
 
7.4%
3 22
 
4.2%
C 15
 
2.8%
8 12
 
2.3%
4 11
 
2.1%
O 11
 
2.1%
Other values (78) 218
41.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 220
41.5%
Other Letter 160
30.2%
Uppercase Letter 62
 
11.7%
Space Separator 49
 
9.2%
Other Punctuation 39
 
7.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
6.2%
10
 
6.2%
9
 
5.6%
8
 
5.0%
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
Other values (58) 86
53.8%
Decimal Number
ValueCountFrequency (%)
2 57
25.9%
0 54
24.5%
1 42
19.1%
3 22
 
10.0%
8 12
 
5.5%
4 11
 
5.0%
5 8
 
3.6%
7 8
 
3.6%
9 4
 
1.8%
6 2
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
C 15
24.2%
O 11
17.7%
I 9
14.5%
N 8
12.9%
B 7
11.3%
M 5
 
8.1%
J 4
 
6.5%
A 3
 
4.8%
Space Separator
ValueCountFrequency (%)
49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 308
58.1%
Hangul 160
30.2%
Latin 62
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
6.2%
10
 
6.2%
9
 
5.6%
8
 
5.0%
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
Other values (58) 86
53.8%
Common
ValueCountFrequency (%)
2 57
18.5%
0 54
17.5%
49
15.9%
1 42
13.6%
, 39
12.7%
3 22
 
7.1%
8 12
 
3.9%
4 11
 
3.6%
5 8
 
2.6%
7 8
 
2.6%
Other values (2) 6
 
1.9%
Latin
ValueCountFrequency (%)
C 15
24.2%
O 11
17.7%
I 9
14.5%
N 8
12.9%
B 7
11.3%
M 5
 
8.1%
J 4
 
6.5%
A 3
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 370
69.8%
Hangul 160
30.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 57
15.4%
0 54
14.6%
49
13.2%
1 42
11.4%
, 39
10.5%
3 22
 
5.9%
C 15
 
4.1%
8 12
 
3.2%
4 11
 
3.0%
O 11
 
3.0%
Other values (10) 58
15.7%
Hangul
ValueCountFrequency (%)
10
 
6.2%
10
 
6.2%
9
 
5.6%
8
 
5.0%
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
Other values (58) 86
53.8%

ART_SJ_CN
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing1487
Missing (%)98.0%
Memory size12.0 KiB
2024-03-11T12:28:58.520831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length27
Mean length353.4
Min length1

Characters and Unicode

Total characters10602
Distinct characters428
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row지붕 위에 오른 성화…런던올림픽 화제
2nd row지붕/NNG, 위/NNG, 에/JKB, 오른/VV, 성화/NNG, …/SE, 런던/NNP, 올림픽/NNG, 화제/NNG, 【/SS, 앵커/NNG, 멘트/NNG, 】/SW, 런던/NNP, 에/JKB, 입성/NNG, 한/XSV, 성화/NNG, 는/JX, 시내/NNG, 최고/NNG, 명물/NNG, 인/VCP, 노스/NNP, 그리니치/NNP, 아레나/NNP, 지붕/NNG, 위/NNG, 까지/JX, 올라갔/VV, 습니다/EF, ./SF, NBA/SL, 선수/NNG, 들/XSN, 로/JKB, 구성/NNG, 된/XSV, 미국/NNP, 농구/NNG, 대표/NNG, 팀/NNG, 은/JX, 선수촌/NNG, 대신/NNG, 호텔/NNG, 에서/JKB, 머물/VV, 기/ETN, 로/JKB, 해/VV, 눈총/NNG, 을/JKO, 사/VV, 고/EC, 있/VX, 습니다/EF, ./SF, 런던/NNP, 올림픽/NNG, 소식/NNG, ,/SP, 김동환/NNP, 기자/NNG, 가/JKS, 전합/VV, 니다/EF, ./SF, 【/SS, 기자/NNG, 】/SS, 전설/NNG, 적/XSN, 인/VCP, 체조/NNG, 스타/NNG, 나디아/NNP, 코마네치/NNP, 가/JKS, 성화/NNG, 를/JKO, 들/VV, 고/EC, 100/SN, m/SL, 높이/NNG, 의/JKG, 지붕/NNG, 위/NNG, 로/JKB, 오릅/VV, 니다/EF, ./SF, 체조/NNG, 사상/NNG, 처음/NNG, 으로/JKB, 만점/NNG, 을/JKO, 받/VV, 은/ETM, 그녀/NP, 도/JX, 세계/NNG, 최대/NNG, 의/JKG, 돔/NNG, 건물/NNG, 위/NNG, 에선/JKB, 안전/NNG, 고리/NNG, 를/JKO, 매/VV, 야/EC, 합/VX, 니다/EF, ./SF, 정상/NNG, 에서/JKB, 기다리/VV, 고/EC, 있/VX, 던/ETM, NBA/SL, 스타/NNG, 존/NNP, 아/NNP, 매치/NNG, 가/JKS, 코마네치/NNP, 를/JKO, 번쩍/MAG, 들/VV, 어/EC, 올린/VV, 뒤/NNG, 성화/NNG, 를/JKO, 이어
3rd row그리니치, 이스라엘, 미국, 런던, 아레나
4th row"아! 올림픽"…고난과 영광의 순간들
5th row"/SS, 아/IC, !/SF, 올림픽/NNG, "/SS, …/SE, 고난/NNG, 과/JC, 영광/NNG, 의/JKG, 순간/NNG, 들/XSN, 【/SS, 앵커/NNG, 멘트/NNG, 】/SW, 런던/NNP, 올림픽/NNG, 에/JKB, 출전/NNG, 하/XSV, 기/ETN, 까지/JX, 우리나라/NNG, 올림픽/NNG, 의/JKG, 역사/NNG, 는/JX, 국력/NNG, 과/JKB, 궤/NNG, 를/JKO, 같이/MAG, 해/XSV, 왔/VX, 습니다/EF, ./SF, 현대사/NNG, 와/JKB, 함께/MAG, 해/VV, 온/VX, 대한민국/NNP, 올림픽/NNG, 의/JKG, 주요/NNG, 장면/NNG, 을/JKO, 담/VV, 은/ETM, 기록/NNG, 영상/NNG, 이/JKS, 공개/NNG, 됐/XSV, 습니다/EF, ./SF, 윤범기/NNP, 기자/NNG, 입/VCP, 니다/EF, ./SF, , 【/SS, 기자/NNG, 】/SS, 4/SN, ./SP, 19/SN, 혁명/NNG, 의/JKG, 열기/NNG, 가/JKS, 채/MAG, 식/VV, 지/EC, 않/VX, 았/EP, 던/ETM, 17/SN, 회/NNB, 로마/NNP, 올림픽/NNG, ./SF, 올림픽/NNG, 의/JKG, 꽃/NNG, 인/VCP, 마라톤/NNG, 대회/NNG, 에/JKB, 는/JX, 우리나라/NNG, 의/JKG, 이창훈/NNP, ,/SP, 김연범/NNP, ,/SP, 이상철/NNP, 선수/NNG, 가/JKS, 출전/NNG, 했/XSV, 습니다/EF, ./SF, 하지만/MAJ, ,/SP, 세계/NNG, 의/JKG, 벽/NNG, 은/JX, 높/VA, 았/EP, 고/EC, ,/SP, 1/SN, 위/NNB, 는/JX, 맨/XPN, 발/NNG, 투혼/NNG, 의/JKG, 에티오피아/NNP, '/SS, 아베베/NNP, '/SS, 선수/NNG, 에게/JKB, 돌아갔/VV, 습니다/EF, ./SF, 「/SS, "/SS, 이창훈/NNP, 선수/NNG, 는/JX, 좋/VA, 은/ETM, 성과/NNG, 를/JKO, 올리/VV, 지/EC, 못하
ValueCountFrequency (%)
sf 58
 
3.9%
ss 34
 
2.3%
을/jko 31
 
2.1%
에/jkb 30
 
2.0%
습니다/ef 26
 
1.7%
의/jkg 26
 
1.7%
이/jks 23
 
1.5%
니다/ef 22
 
1.5%
기자/nng 20
 
1.3%
【/ss 20
 
1.3%
Other values (649) 1202
80.6%
2024-03-11T12:28:58.877470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1463
13.8%
, 1431
13.5%
/ 1401
13.2%
N 1206
 
11.4%
G 495
 
4.7%
S 376
 
3.5%
V 263
 
2.5%
J 229
 
2.2%
K 175
 
1.7%
E 151
 
1.4%
Other values (418) 3412
32.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 3717
35.1%
Other Punctuation 2959
27.9%
Other Letter 2339
22.1%
Space Separator 1463
 
13.8%
Decimal Number 70
 
0.7%
Open Punctuation 26
 
0.2%
Close Punctuation 25
 
0.2%
Lowercase Letter 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
2.7%
62
 
2.7%
57
 
2.4%
45
 
1.9%
44
 
1.9%
36
 
1.5%
35
 
1.5%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (366) 1899
81.2%
Uppercase Letter
ValueCountFrequency (%)
N 1206
32.4%
G 495
13.3%
S 376
 
10.1%
V 263
 
7.1%
J 229
 
6.2%
K 175
 
4.7%
E 151
 
4.1%
P 151
 
4.1%
X 133
 
3.6%
F 110
 
3.0%
Other values (11) 428
 
11.5%
Other Punctuation
ValueCountFrequency (%)
, 1431
48.4%
/ 1401
47.3%
. 58
 
2.0%
' 29
 
1.0%
" 15
 
0.5%
14
 
0.5%
· 6
 
0.2%
? 2
 
0.1%
! 2
 
0.1%
: 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 16
22.9%
2 13
18.6%
0 11
15.7%
4 9
12.9%
5 9
12.9%
7 5
 
7.1%
3 4
 
5.7%
8 2
 
2.9%
9 1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
20
80.0%
] 2
 
8.0%
2
 
8.0%
) 1
 
4.0%
Open Punctuation
ValueCountFrequency (%)
20
76.9%
3
 
11.5%
[ 2
 
7.7%
( 1
 
3.8%
Space Separator
ValueCountFrequency (%)
1463
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4545
42.9%
Latin 3718
35.1%
Hangul 2339
22.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
2.7%
62
 
2.7%
57
 
2.4%
45
 
1.9%
44
 
1.9%
36
 
1.5%
35
 
1.5%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (366) 1899
81.2%
Common
ValueCountFrequency (%)
1463
32.2%
, 1431
31.5%
/ 1401
30.8%
. 58
 
1.3%
' 29
 
0.6%
20
 
0.4%
20
 
0.4%
1 16
 
0.4%
" 15
 
0.3%
14
 
0.3%
Other values (20) 78
 
1.7%
Latin
ValueCountFrequency (%)
N 1206
32.4%
G 495
13.3%
S 376
 
10.1%
V 263
 
7.1%
J 229
 
6.2%
K 175
 
4.7%
E 151
 
4.1%
P 151
 
4.1%
X 133
 
3.6%
F 110
 
3.0%
Other values (12) 429
 
11.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8197
77.3%
Hangul 2339
 
22.1%
None 51
 
0.5%
Punctuation 14
 
0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1463
17.8%
, 1431
17.5%
/ 1401
17.1%
N 1206
14.7%
G 495
 
6.0%
S 376
 
4.6%
V 263
 
3.2%
J 229
 
2.8%
K 175
 
2.1%
E 151
 
1.8%
Other values (35) 1007
12.3%
Hangul
ValueCountFrequency (%)
63
 
2.7%
62
 
2.7%
57
 
2.4%
45
 
1.9%
44
 
1.9%
36
 
1.5%
35
 
1.5%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (366) 1899
81.2%
None
ValueCountFrequency (%)
20
39.2%
20
39.2%
· 6
 
11.8%
3
 
5.9%
2
 
3.9%
Punctuation
ValueCountFrequency (%)
14
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

ART_CN
Text

MISSING 

Distinct21
Distinct (%)70.0%
Missing1487
Missing (%)98.0%
Memory size12.0 KiB
2024-03-11T12:28:59.116768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length349
Median length137.5
Mean length66
Min length8

Characters and Unicode

Total characters1980
Distinct characters221
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)66.7%

Sample

1st row【 앵커멘트 】
2nd row지붕, 위, 성화, 런던, 올림픽, 화제
3rd row나디##생활/건강#발건강용품#발건강용품, 드림##디지털/가전#멀티미디어장비#휴대용스피커, 이어##패션잡화#모자#귀마개, NBA, 바르셀로나##패션잡화#신발#운동화, 하나##디지털/가전#스마트디바이스액세서리#기타휴대폰액세서리, 아레나, 아레나##생활/건강#욕실용품#욕실잡화, 대신##디지털/가전#생활가전#구강청정기, 세계##디지털/가전#생활가전#청소기, 스타##디지털/가전#계절가전#온열기, 코비##디지털/가전#멀티미디어장비#PC마이크, 대한##디지털/가전#PC액세서리#USB액세서리, 브라##스포츠/레저#등산#등산의류, 미국##식품#축산#축산가공식품, 보안##디지털/가전#저장장치#스토리지, 앵커##스포츠/레저#등산#등산장비
4th row【 앵커멘트 】
5th row올림픽, 고난, 영광, 순간
ValueCountFrequency (%)
10
 
6.1%
10
 
6.1%
앵커멘트 10
 
6.1%
앵커##스포츠/레저#등산#등산장비 10
 
6.1%
인터뷰##디지털/가전#음향가전#마이크 8
 
4.8%
대한##디지털/가전#pc액세서리#usb액세서리 5
 
3.0%
스포츠##디지털/가전#생활가전#기타생활가전 4
 
2.4%
세계##디지털/가전#생활가전#청소기 4
 
2.4%
이어##패션잡화#모자#귀마개 3
 
1.8%
하나##디지털/가전#스마트디바이스액세서리#기타휴대폰액세서리 3
 
1.8%
Other values (87) 98
59.4%
2024-03-11T12:28:59.467849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
# 292
 
14.7%
135
 
6.8%
, 115
 
5.8%
81
 
4.1%
/ 75
 
3.8%
70
 
3.5%
45
 
2.3%
41
 
2.1%
40
 
2.0%
39
 
2.0%
Other values (211) 1047
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1296
65.5%
Other Punctuation 482
 
24.3%
Space Separator 135
 
6.8%
Uppercase Letter 34
 
1.7%
Open Punctuation 10
 
0.5%
Close Punctuation 10
 
0.5%
Decimal Number 10
 
0.5%
Lowercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
6.2%
70
 
5.4%
45
 
3.5%
41
 
3.2%
40
 
3.1%
39
 
3.0%
28
 
2.2%
26
 
2.0%
23
 
1.8%
23
 
1.8%
Other values (188) 880
67.9%
Uppercase Letter
ValueCountFrequency (%)
C 8
23.5%
P 8
23.5%
B 6
17.6%
S 5
14.7%
U 5
14.7%
A 1
 
2.9%
N 1
 
2.9%
Decimal Number
ValueCountFrequency (%)
2 3
30.0%
5 2
20.0%
1 1
 
10.0%
4 1
 
10.0%
3 1
 
10.0%
7 1
 
10.0%
0 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
# 292
60.6%
, 115
 
23.9%
/ 75
 
15.6%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
o 1
33.3%
i 1
33.3%
Space Separator
ValueCountFrequency (%)
135
100.0%
Open Punctuation
ValueCountFrequency (%)
10
100.0%
Close Punctuation
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1296
65.5%
Common 647
32.7%
Latin 37
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
6.2%
70
 
5.4%
45
 
3.5%
41
 
3.2%
40
 
3.1%
39
 
3.0%
28
 
2.2%
26
 
2.0%
23
 
1.8%
23
 
1.8%
Other values (188) 880
67.9%
Common
ValueCountFrequency (%)
# 292
45.1%
135
20.9%
, 115
 
17.8%
/ 75
 
11.6%
10
 
1.5%
10
 
1.5%
2 3
 
0.5%
5 2
 
0.3%
1 1
 
0.2%
4 1
 
0.2%
Other values (3) 3
 
0.5%
Latin
ValueCountFrequency (%)
C 8
21.6%
P 8
21.6%
B 6
16.2%
S 5
13.5%
U 5
13.5%
A 1
 
2.7%
N 1
 
2.7%
c 1
 
2.7%
o 1
 
2.7%
i 1
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1296
65.5%
ASCII 664
33.5%
None 20
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
# 292
44.0%
135
20.3%
, 115
 
17.3%
/ 75
 
11.3%
C 8
 
1.2%
P 8
 
1.2%
B 6
 
0.9%
S 5
 
0.8%
U 5
 
0.8%
2 3
 
0.5%
Other values (11) 12
 
1.8%
Hangul
ValueCountFrequency (%)
81
 
6.2%
70
 
5.4%
45
 
3.5%
41
 
3.2%
40
 
3.1%
39
 
3.0%
28
 
2.2%
26
 
2.0%
23
 
1.8%
23
 
1.8%
Other values (188) 880
67.9%
None
ValueCountFrequency (%)
10
50.0%
10
50.0%

ATCH_IMG_NM
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing1507
Missing (%)99.3%
Memory size12.0 KiB
2024-03-11T12:28:59.632031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length26.5
Mean length21.2
Min length2

Characters and Unicode

Total characters212
Distinct characters49
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row하나, 100m, 제2
2nd row88올림픽, 2, 1위, 17회, 4.19 혁명, 3:1, 하나, 7.4 남북, 27위
3rd row3,4위, 2kwon@mbn.co.kr, 하나
4th row1급
5th row2막
ValueCountFrequency (%)
하나 3
 
6.1%
2
 
4.1%
2막 2
 
4.1%
10억 2
 
4.1%
245명 2
 
4.1%
350여 1
 
2.0%
1
 
2.0%
2개 1
 
2.0%
2차 1
 
2.0%
4개 1
 
2.0%
Other values (33) 33
67.3%
2024-03-11T12:28:59.960251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
18.4%
, 33
15.6%
1 15
 
7.1%
2 12
 
5.7%
11
 
5.2%
4 9
 
4.2%
0 9
 
4.2%
5 8
 
3.8%
8 5
 
2.4%
5
 
2.4%
Other values (39) 66
31.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 71
33.5%
Other Letter 50
23.6%
Other Punctuation 40
18.9%
Space Separator 39
18.4%
Lowercase Letter 12
 
5.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
22.0%
5
 
10.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
Other values (15) 15
30.0%
Decimal Number
ValueCountFrequency (%)
1 15
21.1%
2 12
16.9%
4 9
12.7%
0 9
12.7%
5 8
11.3%
8 5
 
7.0%
3 5
 
7.0%
7 5
 
7.0%
9 2
 
2.8%
6 1
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
k 2
16.7%
n 2
16.7%
o 2
16.7%
m 2
16.7%
r 1
8.3%
c 1
8.3%
b 1
8.3%
w 1
8.3%
Other Punctuation
ValueCountFrequency (%)
, 33
82.5%
. 4
 
10.0%
% 1
 
2.5%
@ 1
 
2.5%
: 1
 
2.5%
Space Separator
ValueCountFrequency (%)
39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 150
70.8%
Hangul 50
 
23.6%
Latin 12
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
22.0%
5
 
10.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
Other values (15) 15
30.0%
Common
ValueCountFrequency (%)
39
26.0%
, 33
22.0%
1 15
 
10.0%
2 12
 
8.0%
4 9
 
6.0%
0 9
 
6.0%
5 8
 
5.3%
8 5
 
3.3%
3 5
 
3.3%
7 5
 
3.3%
Other values (6) 10
 
6.7%
Latin
ValueCountFrequency (%)
k 2
16.7%
n 2
16.7%
o 2
16.7%
m 2
16.7%
r 1
8.3%
c 1
8.3%
b 1
8.3%
w 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 162
76.4%
Hangul 50
 
23.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39
24.1%
, 33
20.4%
1 15
 
9.3%
2 12
 
7.4%
4 9
 
5.6%
0 9
 
5.6%
5 8
 
4.9%
8 5
 
3.1%
3 5
 
3.1%
7 5
 
3.1%
Other values (14) 22
13.6%
Hangul
ValueCountFrequency (%)
11
22.0%
5
 
10.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
Other values (15) 15
30.0%

JRNL_NM
Text

MISSING 

Distinct5
Distinct (%)100.0%
Missing1512
Missing (%)99.7%
Memory size12.0 KiB
2024-03-11T12:29:00.095813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length22
Mean length23.4
Min length6

Characters and Unicode

Total characters117
Distinct characters44
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)100.0%

Sample

1st row런던올림픽 소식, 뮌헨올림픽, 런던올림픽
2nd row대한민국 올림픽, 로마 올림픽, 몬트리올 올림픽, 뮌헨올림픽, 런던 올림픽
3rd row베이징올림픽
4th row·일월드컵, 소치동계올림픽
5th row부산아시안게임, 인천아시안게임, 국제종합대회, 인천 아시안게임
ValueCountFrequency (%)
올림픽 4
19.0%
런던올림픽 2
 
9.5%
뮌헨올림픽 2
 
9.5%
소식 1
 
4.8%
대한민국 1
 
4.8%
로마 1
 
4.8%
몬트리올 1
 
4.8%
런던 1
 
4.8%
베이징올림픽 1
 
4.8%
·일월드컵 1
 
4.8%
Other values (6) 6
28.6%
2024-03-11T12:29:00.327579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
13.7%
11
 
9.4%
10
 
8.5%
10
 
8.5%
, 10
 
8.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (34) 45
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90
76.9%
Space Separator 16
 
13.7%
Other Punctuation 11
 
9.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
12.2%
10
 
11.1%
10
 
11.1%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
Other values (31) 38
42.2%
Other Punctuation
ValueCountFrequency (%)
, 10
90.9%
· 1
 
9.1%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90
76.9%
Common 27
 
23.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
12.2%
10
 
11.1%
10
 
11.1%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
Other values (31) 38
42.2%
Common
ValueCountFrequency (%)
16
59.3%
, 10
37.0%
· 1
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90
76.9%
ASCII 26
 
22.2%
None 1
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16
61.5%
, 10
38.5%
Hangul
ValueCountFrequency (%)
11
 
12.2%
10
 
11.1%
10
 
11.1%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
Other values (31) 38
42.2%
None
ValueCountFrequency (%)
· 1
100.0%

WRT_DATE
Text

MISSING 

Distinct8
Distinct (%)100.0%
Missing1509
Missing (%)99.5%
Memory size12.0 KiB
2024-03-11T12:29:00.476487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length16
Mean length14
Min length2

Characters and Unicode

Total characters112
Distinct characters34
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)100.0%

Sample

1st row40주년, 1972년, 1992년
2nd row오는 29일, 1972년, 1981년
3rd row수년째
4th row10년간, 지난해 런던올림픽, 15년간, 8년간
5th row오늘(27일, 지난 22일
ValueCountFrequency (%)
1972년 2
 
7.7%
2
 
7.7%
다음 2
 
7.7%
내년 2
 
7.7%
40주년 1
 
3.8%
2002년 1
 
3.8%
2월 1
 
3.8%
10년 1
 
3.8%
22일 1
 
3.8%
지난 1
 
3.8%
Other values (12) 12
46.2%
2024-03-11T12:29:00.722926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
16.1%
13
11.6%
, 11
 
9.8%
2 10
 
8.9%
1 8
 
7.1%
9 6
 
5.4%
0 5
 
4.5%
7 3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (24) 32
28.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46
41.1%
Decimal Number 36
32.1%
Space Separator 18
 
16.1%
Other Punctuation 11
 
9.8%
Open Punctuation 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
28.3%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (13) 13
28.3%
Decimal Number
ValueCountFrequency (%)
2 10
27.8%
1 8
22.2%
9 6
16.7%
0 5
13.9%
7 3
 
8.3%
8 2
 
5.6%
5 1
 
2.8%
4 1
 
2.8%
Space Separator
ValueCountFrequency (%)
18
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 66
58.9%
Hangul 46
41.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
28.3%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (13) 13
28.3%
Common
ValueCountFrequency (%)
18
27.3%
, 11
16.7%
2 10
15.2%
1 8
12.1%
9 6
 
9.1%
0 5
 
7.6%
7 3
 
4.5%
8 2
 
3.0%
( 1
 
1.5%
5 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66
58.9%
Hangul 46
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18
27.3%
, 11
16.7%
2 10
15.2%
1 8
12.1%
9 6
 
9.1%
0 5
 
7.6%
7 3
 
4.5%
8 2
 
3.0%
( 1
 
1.5%
5 1
 
1.5%
Hangul
ValueCountFrequency (%)
13
28.3%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (13) 13
28.3%

ART_POSA
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing1515
Missing (%)99.9%
Memory size12.0 KiB
2024-03-11T12:29:00.848049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length6
Mean length6
Min length2

Characters and Unicode

Total characters12
Distinct characters8
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row2시간 25분 2초
2nd row1초
ValueCountFrequency (%)
2시간 1
25.0%
25분 1
25.0%
2초 1
25.0%
1초 1
25.0%
2024-03-11T12:29:01.089046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 3
25.0%
2
16.7%
2
16.7%
1
 
8.3%
1
 
8.3%
5 1
 
8.3%
1
 
8.3%
1 1
 
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5
41.7%
Other Letter 5
41.7%
Space Separator 2
 
16.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Decimal Number
ValueCountFrequency (%)
2 3
60.0%
5 1
 
20.0%
1 1
 
20.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7
58.3%
Hangul 5
41.7%

Most frequent character per script

Common
ValueCountFrequency (%)
2 3
42.9%
2
28.6%
5 1
 
14.3%
1 1
 
14.3%
Hangul
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7
58.3%
Hangul 5
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 3
42.9%
2
28.6%
5 1
 
14.3%
1 1
 
14.3%
Hangul
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

ART_NOUN
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing1515
Missing (%)99.9%
Memory size12.0 KiB
2024-03-11T12:29:01.187878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row
2nd row
ValueCountFrequency (%)
1
50.0%
1
50.0%
2024-03-11T12:29:01.376132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

ART_TAG
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_PRS_NM
Text

MISSING 

Distinct3
Distinct (%)100.0%
Missing1514
Missing (%)99.8%
Memory size12.0 KiB
2024-03-11T12:29:01.489684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5
Min length3

Characters and Unicode

Total characters15
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row아리랑 고개
2nd row과태료 폭탄
3rd row비행기
ValueCountFrequency (%)
아리랑 1
20.0%
고개 1
20.0%
과태료 1
20.0%
폭탄 1
20.0%
비행기 1
20.0%
2024-03-11T12:29:01.723857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13
86.7%
Space Separator 2
 
13.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13
86.7%
Common 2
 
13.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13
86.7%
ASCII 2
 
13.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2
100.0%
Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%

ART_RNK_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_INST_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_AREA_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_GD_NM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_QY
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_EVT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_DT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_TIME
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_ANM
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_PLNT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

ART_AF
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1517
Missing (%)100.0%
Memory size13.5 KiB

Sample

MBN_MDA_SP_CDMBN_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CNATCH_IMG_NMJRNL_NMWRT_DATEART_POSAART_NOUNART_TAGART_PRS_NMART_RNK_NMART_INST_NMART_AREA_NMART_GD_NMART_QYART_EVTART_DTART_TIMEART_ANMART_PLNTART_AFUnnamed: 24
0<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1MBN1030271<NA>20120722지붕 위에 오른 성화…런던올림픽 화제【 앵커멘트 】<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2런던에 입성한 성화는 시내 최고 명물인 노스<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
3그리니치 아레나 지붕 위까지 올라갔습니다.<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
4NBA 선수들로 구성된 미국 농구 대표팀은 선수촌 대신 호텔에서 머물기로 해 눈총을 사고 있습니다.<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
5런던올림픽 소식, 김동환 기자가 전합니다.<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
6<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
7<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
8【 기자 】<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9전설적인 체조스타 나디아 코마네치가 성화를 들고 100m 높이의 지붕 위로 오릅니다.<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
MBN_MDA_SP_CDMBN_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CNATCH_IMG_NMJRNL_NMWRT_DATEART_POSAART_NOUNART_TAGART_PRS_NMART_RNK_NMART_INST_NMART_AREA_NMART_GD_NMART_QYART_EVTART_DTART_TIMEART_ANMART_PLNTART_AFUnnamed: 24
1507OCA<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1508아시아올림픽평의회<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1509참가_신청서, 참가_등록, 팀장<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1510<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1511아시아올림픽평의회<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1512OCA<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1513통보<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1514<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1515북한_NOC, 영상, 마감일송광호선수, 부위원장, 기자, MBN뉴스, 사무총장NOC, 대회 조직위원회, 아시아올림픽평의회, 북한, OCA서해, 평양, 부산, 인천, 북한아시아##스포츠/레저#탁구#탁구용품, 인터뷰##디지털/가전#음향가전#마이크, 앵커##스포츠/레저#등산#등산장비703명, 18개, 14개, 350여 명, 352명, 150명, 184명부산아시안게임, 인천아시안게임, 국제종합대회, 인천 아시안게임2002년, 다음 달, 다음 달 초<NA><NA><NA>비행기<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1516<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

MBN_MDA_SP_CDMBN_ART_ESSN_NOMDA_CGR_NMSTD_YEARART_SJ_CNART_CNATCH_IMG_NMJRNL_NMWRT_DATEART_POSAART_NOUNART_PRS_NM# duplicates
92<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>329
67인터뷰<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>12
10【 기자 】<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>10
23기자<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>10
55앵커, 멘트<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>10
85포함<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>9
15공개<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>6
32명단<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>6
45선수<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>6
64은퇴<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>6