Overview

Dataset statistics

Number of variables: 12
Number of observations: 601
Missing cells: 83
Missing cells (%): 1.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 56.5 KiB
Average record size in memory: 96.2 B
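
The missing-cell percentage can be checked from the other figures in this block. The short sketch below assumes the percentage is the missing cells divided by all cells (variables times observations); the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 12
n_observations = 601
missing_cells = 83

# Assumption: every cell of the 12 x 601 table counts toward the denominator.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Rounded to one decimal place, this agrees with the 1.2% shown in the report.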

Variable types

Categorical: 2
Text: 9
DateTime: 1

Dataset

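Description: The stolen cultural heritage list (문화재 도난목록) lets you look up the status of stolen or lost cultural heritage items, together with their designation type, theft date, quantity, owner, and period.
Author: 문화재청 (Cultural Heritage Administration)
URL: https://www.data.go.kr/data/15089237/fileData.do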

Alerts

구분 has constant value "도난" (Constant)
담당부서 has constant value "안전기준과" (Constant)
규격 has 80 (13.3%) missing values (Missing)
제목 has unique values (Unique)
문화재명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 23:53:00.943460
Analysis finished: 2023-12-12 23:53:02.274705
Duration: 1.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
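
The reported duration follows from the two timestamps. A minimal check using only the Python standard library, with the timestamp format string being an assumption about how the values are written:

```python
from datetime import datetime

# Assumed layout of the timestamps shown under "Reproduction".
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 23:53:00.943460", FMT)
finished = datetime.strptime("2023-12-12 23:53:02.274705", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```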

Variables

구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
도난: 601

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2
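
As a rough illustration of how such length statistics arise, the sketch below rebuilds the constant 구분 column and summarises the string lengths. It assumes length means the number of Unicode code points (Python's `len`), which may not be exactly how the profiler measures it.

```python
from statistics import mean, median

# The 구분 column holds the same two-character value in all 601 rows.
values = ["도난"] * 601

# Assumption: "length" is the number of code points in each string.
lengths = [len(v) for v in values]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)
```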

Unique

Unique: 0
Unique (%): 0.0%
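
The difference between "Distinct" and "Unique" is easy to miss. The sketch below assumes that Distinct counts the different values present, while Unique counts the values that occur exactly once; that reading is consistent with the figures reported for this constant column. The helper name `distinct_and_unique` is hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# A constant column: one distinct value, and it is repeated, so nothing is unique.
constant = ["도난"] * 601
d, u = distinct_and_unique(constant)
distinct_pct = 100 * d / len(constant)

print(d, u, f"{distinct_pct:.1f}%")
```

Under this reading, a column where every row differs (such as 제목) would report the same number for both.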

Sample

1st row: 도난
2nd row: 도난
3rd row: 도난
4th row: 도난
5th row: 도난

Common Values

Value | Count | Frequency (%)
도난 | 601 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 도난 accounts for 601 rows (100.0%)]

제목
Text

UNIQUE 

Distinct: 601
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 71
Median length: 43
Mean length: 24.672213
Min length: 7

Characters and Unicode

Total characters: 14828
Distinct characters: 503
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
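
To make the idea of character properties concrete, the sketch below counts Unicode general categories for one of the sample titles using Python's standard `unicodedata` module. It covers categories only; the standard library does not expose scripts or blocks, so those parts of the report are not reproduced here. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (two-letter codes)."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample rows shown for this variable.
sample = "[도난]전라남도 진도군 덕병리 석장승 2점"
counts = category_counts(sample)

# Lo = Other Letter, Zs = Space Separator, Ps = Open Punctuation,
# Pe = Close Punctuation, Nd = Decimal Number.
for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes map onto the category names used in the tables: Hangul syllables fall under Other Letter, square brackets under Open and Close Punctuation, and digits under Decimal Number.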

Unique

Unique: 601
Unique (%): 100.0%

Sample

1st row: [유실]경상북도 도유형문화재 제399-21호 청도군 덕사 영산전 복장발원문 1점
2nd row: [도난]경상북도 예천군 연안이씨 별좌공 종택 편액 긍구헌 1점
3rd row: [도난]포항시 법광사지(사적 제493호) 내 삼층석탑 구형부재 상륜부 1점
4th row: [도난]경주시 천관사지(사적 제340호) 내 석등 상대석, 석등 하대석 2점
5th row: [도난]전라남도 진도군 덕병리 석장승 2점
ValueCountFrequency (%)
도난 176
 
5.2%
113
 
3.4%
소장 85
 
2.5%
47
 
1.4%
석조물 34
 
1.0%
영정 29
 
0.9%
묘의 27
 
0.8%
26
 
0.8%
1점 26
 
0.8%
경주 26
 
0.8%
Other values (1726) 2767
82.4%

Most occurring characters

ValueCountFrequency (%)
2773
 
18.7%
[ 582
 
3.9%
] 582
 
3.9%
341
 
2.3%
281
 
1.9%
246
 
1.7%
242
 
1.6%
230
 
1.6%
230
 
1.6%
212
 
1.4%
Other values (493) 9109
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10060
67.8%
Space Separator 2773
 
18.7%
Close Punctuation 702
 
4.7%
Open Punctuation 701
 
4.7%
Decimal Number 280
 
1.9%
Other Punctuation 242
 
1.6%
Lowercase Letter 40
 
0.3%
Uppercase Letter 17
 
0.1%
Dash Punctuation 11
 
0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.4%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.8%
Other values (439) 7711
76.7%
Lowercase Letter
ValueCountFrequency (%)
a 8
20.0%
o 5
12.5%
n 5
12.5%
u 4
10.0%
e 3
 
7.5%
i 3
 
7.5%
q 2
 
5.0%
l 2
 
5.0%
y 2
 
5.0%
c 1
 
2.5%
Other values (5) 5
12.5%
Uppercase Letter
ValueCountFrequency (%)
R 4
23.5%
S 2
11.8%
N 2
11.8%
T 2
11.8%
I 1
 
5.9%
E 1
 
5.9%
P 1
 
5.9%
O 1
 
5.9%
L 1
 
5.9%
M 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 82
29.3%
2 39
13.9%
3 33
11.8%
4 29
 
10.4%
0 22
 
7.9%
6 21
 
7.5%
5 16
 
5.7%
8 15
 
5.4%
7 13
 
4.6%
9 10
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 183
75.6%
· 24
 
9.9%
' 14
 
5.8%
. 13
 
5.4%
/ 5
 
2.1%
" 2
 
0.8%
: 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
] 582
82.9%
) 93
 
13.2%
24
 
3.4%
} 3
 
0.4%
Open Punctuation
ValueCountFrequency (%)
[ 582
83.0%
( 95
 
13.6%
24
 
3.4%
Space Separator
ValueCountFrequency (%)
2773
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9986
67.3%
Common 4711
31.8%
Han 74
 
0.5%
Latin 57
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.5%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.9%
Other values (375) 7637
76.5%
Han
ValueCountFrequency (%)
4
 
5.4%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (54) 54
73.0%
Common
ValueCountFrequency (%)
2773
58.9%
[ 582
 
12.4%
] 582
 
12.4%
, 183
 
3.9%
( 95
 
2.0%
) 93
 
2.0%
1 82
 
1.7%
2 39
 
0.8%
3 33
 
0.7%
4 29
 
0.6%
Other values (18) 220
 
4.7%
Latin
ValueCountFrequency (%)
a 8
14.0%
o 5
 
8.8%
n 5
 
8.8%
R 4
 
7.0%
u 4
 
7.0%
e 3
 
5.3%
i 3
 
5.3%
S 2
 
3.5%
q 2
 
3.5%
l 2
 
3.5%
Other values (16) 19
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9986
67.3%
ASCII 4694
31.7%
None 72
 
0.5%
CJK 71
 
0.5%
CJK Compat Ideographs 3
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2773
59.1%
[ 582
 
12.4%
] 582
 
12.4%
, 183
 
3.9%
( 95
 
2.0%
) 93
 
2.0%
1 82
 
1.7%
2 39
 
0.8%
3 33
 
0.7%
4 29
 
0.6%
Other values (39) 203
 
4.3%
Hangul
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.5%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.9%
Other values (375) 7637
76.5%
None
ValueCountFrequency (%)
24
33.3%
24
33.3%
· 24
33.3%
CJK
ValueCountFrequency (%)
4
 
5.6%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (51) 51
71.8%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

담당부서
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
안전기준과: 601

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 안전기준과
2nd row: 안전기준과
3rd row: 안전기준과
4th row: 안전기준과
5th row: 안전기준과

Common Values

Value | Count | Frequency (%)
안전기준과 | 601 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 안전기준과 accounts for 601 rows (100.0%)]
Distinct: 119
Distinct (%): 19.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
Minimum: 2003-05-28 00:00:00
Maximum: 2021-07-19 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

문화재명
Text

UNIQUE 

Distinct: 601
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 71
Median length: 43
Mean length: 24.672213
Min length: 7

Characters and Unicode

Total characters: 14828
Distinct characters: 503
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 601
Unique (%): 100.0%

Sample

1st row: [유실]경상북도 도유형문화재 제399-21호 청도군 덕사 영산전 복장발원문 1점
2nd row: [도난]경상북도 예천군 연안이씨 별좌공 종택 편액 긍구헌 1점
3rd row: [도난]포항시 법광사지(사적 제493호) 내 삼층석탑 구형부재 상륜부 1점
4th row: [도난]경주시 천관사지(사적 제340호) 내 석등 상대석, 석등 하대석 2점
5th row: [도난]전라남도 진도군 덕병리 석장승 2점
ValueCountFrequency (%)
도난 176
 
5.2%
113
 
3.4%
소장 85
 
2.5%
47
 
1.4%
석조물 34
 
1.0%
영정 29
 
0.9%
묘의 27
 
0.8%
26
 
0.8%
1점 26
 
0.8%
경주 26
 
0.8%
Other values (1726) 2767
82.4%

Most occurring characters

ValueCountFrequency (%)
2773
 
18.7%
[ 582
 
3.9%
] 582
 
3.9%
341
 
2.3%
281
 
1.9%
246
 
1.7%
242
 
1.6%
230
 
1.6%
230
 
1.6%
212
 
1.4%
Other values (493) 9109
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10060
67.8%
Space Separator 2773
 
18.7%
Close Punctuation 702
 
4.7%
Open Punctuation 701
 
4.7%
Decimal Number 280
 
1.9%
Other Punctuation 242
 
1.6%
Lowercase Letter 40
 
0.3%
Uppercase Letter 17
 
0.1%
Dash Punctuation 11
 
0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.4%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.8%
Other values (439) 7711
76.7%
Lowercase Letter
ValueCountFrequency (%)
a 8
20.0%
o 5
12.5%
n 5
12.5%
u 4
10.0%
e 3
 
7.5%
i 3
 
7.5%
q 2
 
5.0%
l 2
 
5.0%
y 2
 
5.0%
c 1
 
2.5%
Other values (5) 5
12.5%
Uppercase Letter
ValueCountFrequency (%)
R 4
23.5%
S 2
11.8%
N 2
11.8%
T 2
11.8%
I 1
 
5.9%
E 1
 
5.9%
P 1
 
5.9%
O 1
 
5.9%
L 1
 
5.9%
M 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 82
29.3%
2 39
13.9%
3 33
11.8%
4 29
 
10.4%
0 22
 
7.9%
6 21
 
7.5%
5 16
 
5.7%
8 15
 
5.4%
7 13
 
4.6%
9 10
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 183
75.6%
· 24
 
9.9%
' 14
 
5.8%
. 13
 
5.4%
/ 5
 
2.1%
" 2
 
0.8%
: 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
] 582
82.9%
) 93
 
13.2%
24
 
3.4%
} 3
 
0.4%
Open Punctuation
ValueCountFrequency (%)
[ 582
83.0%
( 95
 
13.6%
24
 
3.4%
Space Separator
ValueCountFrequency (%)
2773
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9986
67.3%
Common 4711
31.8%
Han 74
 
0.5%
Latin 57
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.5%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.9%
Other values (375) 7637
76.5%
Han
ValueCountFrequency (%)
4
 
5.4%
3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (54) 54
73.0%
Common
ValueCountFrequency (%)
2773
58.9%
[ 582
 
12.4%
] 582
 
12.4%
, 183
 
3.9%
( 95
 
2.0%
) 93
 
2.0%
1 82
 
1.7%
2 39
 
0.8%
3 33
 
0.7%
4 29
 
0.6%
Other values (18) 220
 
4.7%
Latin
ValueCountFrequency (%)
a 8
14.0%
o 5
 
8.8%
n 5
 
8.8%
R 4
 
7.0%
u 4
 
7.0%
e 3
 
5.3%
i 3
 
5.3%
S 2
 
3.5%
q 2
 
3.5%
l 2
 
3.5%
Other values (16) 19
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9986
67.3%
ASCII 4694
31.7%
None 72
 
0.5%
CJK 71
 
0.5%
CJK Compat Ideographs 3
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2773
59.1%
[ 582
 
12.4%
] 582
 
12.4%
, 183
 
3.9%
( 95
 
2.0%
) 93
 
2.0%
1 82
 
1.7%
2 39
 
0.8%
3 33
 
0.7%
4 29
 
0.6%
Other values (39) 203
 
4.3%
Hangul
ValueCountFrequency (%)
341
 
3.4%
281
 
2.8%
246
 
2.5%
242
 
2.4%
230
 
2.3%
230
 
2.3%
212
 
2.1%
194
 
1.9%
187
 
1.9%
186
 
1.9%
Other values (375) 7637
76.5%
None
ValueCountFrequency (%)
24
33.3%
24
33.3%
· 24
33.3%
CJK
ValueCountFrequency (%)
4
 
5.6%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (51) 51
71.8%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct: 129
Distinct (%): 21.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 41
Median length: 6
Mean length: 7.5407654
Min length: 2

Characters and Unicode

Total characters: 4532
Distinct characters: 79
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 125
Unique (%): 20.8%

Sample

1st row: 경상북도 유형문화재 제399-21호
2nd row: 비지정문화재
3rd row: 비지정문화재
4th row: 비지정문화재
5th row: 비지정문화재
ValueCountFrequency (%)
비지정문화재 469
57.8%
문화재자료 30
 
3.7%
유형문화재 27
 
3.3%
경상남도 19
 
2.3%
기념물 18
 
2.2%
경상북도 16
 
2.0%
보물 13
 
1.6%
경기도 11
 
1.4%
충청남도 9
 
1.1%
경남 8
 
1.0%
Other values (152) 192
23.6%

Most occurring characters

ValueCountFrequency (%)
553
12.2%
553
12.2%
553
12.2%
476
10.5%
475
10.5%
473
10.4%
211
 
4.7%
122
 
2.7%
121
 
2.7%
76
 
1.7%
Other values (69) 919
20.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3984
87.9%
Decimal Number 322
 
7.1%
Space Separator 211
 
4.7%
Dash Punctuation 5
 
0.1%
Close Punctuation 4
 
0.1%
Open Punctuation 3
 
0.1%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
553
13.9%
553
13.9%
553
13.9%
476
11.9%
475
11.9%
473
11.9%
122
 
3.1%
121
 
3.0%
76
 
1.9%
63
 
1.6%
Other values (53) 519
13.0%
Decimal Number
ValueCountFrequency (%)
1 51
15.8%
2 43
13.4%
3 40
12.4%
4 37
11.5%
9 34
10.6%
6 31
9.6%
0 28
8.7%
8 22
6.8%
7 22
6.8%
5 14
 
4.3%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
211
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3984
87.9%
Common 548
 
12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
553
13.9%
553
13.9%
553
13.9%
476
11.9%
475
11.9%
473
11.9%
122
 
3.1%
121
 
3.0%
76
 
1.9%
63
 
1.6%
Other values (53) 519
13.0%
Common
ValueCountFrequency (%)
211
38.5%
1 51
 
9.3%
2 43
 
7.8%
3 40
 
7.3%
4 37
 
6.8%
9 34
 
6.2%
6 31
 
5.7%
0 28
 
5.1%
8 22
 
4.0%
7 22
 
4.0%
Other values (6) 29
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3984
87.9%
ASCII 548
 
12.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
553
13.9%
553
13.9%
553
13.9%
476
11.9%
475
11.9%
473
11.9%
122
 
3.1%
121
 
3.0%
76
 
1.9%
63
 
1.6%
Other values (53) 519
13.0%
ASCII
ValueCountFrequency (%)
211
38.5%
1 51
 
9.3%
2 43
 
7.8%
3 40
 
7.3%
4 37
 
6.8%
9 34
 
6.2%
6 31
 
5.7%
0 28
 
5.1%
8 22
 
4.0%
7 22
 
4.0%
Other values (6) 29
 
5.3%
Distinct: 580
Distinct (%): 96.7%
Missing: 1
Missing (%): 0.2%
Memory size: 4.8 KiB

Length

Max length: 32
Median length: 27
Mean length: 11.795
Min length: 4

Characters and Unicode

Total characters: 7077
Distinct characters: 72
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 562
Unique (%): 93.7%

Sample

1st row: 2016년~2018년
2nd row: 2020.3.19.~2021.3.15(추정)
3rd row: 2021년 3월 이전
4th row: 2021-04-28
5th row: 1989년 3월경
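
The sample rows show why this variable is typed as Text rather than DateTime: the values mix ranges, approximate dates, and several separators. (The variable's name did not survive extraction; the samples suggest it records the theft date.) As one possible first normalisation step, the sketch below pulls out the first four-digit year from each value. The regular expression and the helper `first_year` are illustrative assumptions, not part of the report or of ydata-profiling.

```python
import re

# Assumption: years of interest fall between 1800 and 2099 and are written
# as four digits that are not part of a longer digit run.
YEAR = re.compile(r"(?<!\d)(1[89]\d{2}|20\d{2})(?!\d)")

def first_year(raw):
    """Return the first plausible four-digit year in raw, or None if absent."""
    match = YEAR.search(raw)
    return int(match.group(1)) if match else None

samples = [
    "2016년~2018년",
    "2020.3.19.~2021.3.15(추정)",
    "2021년 3월 이전",
    "2021-04-28",
    "1989년 3월경",
]
years = [first_year(s) for s in samples]
print(years)
```

For ranges this keeps only the starting year, which discards information; a fuller treatment would keep both endpoints and any qualifier such as 추정 (estimated) or 이전 (before).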
ValueCountFrequency (%)
이전 33
 
4.2%
사이 16
 
2.0%
15
 
1.9%
10
 
1.3%
추정 6
 
0.8%
10 4
 
0.5%
2001/01/06-08 3
 
0.4%
7 3
 
0.4%
4 3
 
0.4%
1992-12-29 3
 
0.4%
Other values (636) 692
87.8%

Most occurring characters

ValueCountFrequency (%)
0 1272
18.0%
1 979
13.8%
2 769
10.9%
9 681
9.6%
- 590
8.3%
. 499
 
7.1%
8 276
 
3.9%
/ 275
 
3.9%
3 241
 
3.4%
5 207
 
2.9%
Other values (62) 1288
18.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4973
70.3%
Other Punctuation 781
 
11.0%
Dash Punctuation 590
 
8.3%
Other Letter 468
 
6.6%
Space Separator 188
 
2.7%
Math Symbol 58
 
0.8%
Close Punctuation 8
 
0.1%
Open Punctuation 8
 
0.1%
Lowercase Letter 2
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
22.6%
73
15.6%
62
13.2%
44
9.4%
43
9.2%
32
 
6.8%
11
 
2.4%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (39) 70
15.0%
Decimal Number
ValueCountFrequency (%)
0 1272
25.6%
1 979
19.7%
2 769
15.5%
9 681
13.7%
8 276
 
5.5%
3 241
 
4.8%
5 207
 
4.2%
6 205
 
4.1%
4 182
 
3.7%
7 161
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 499
63.9%
/ 275
35.2%
' 3
 
0.4%
, 3
 
0.4%
: 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
e 1
50.0%
p 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 590
100.0%
Space Separator
ValueCountFrequency (%)
188
100.0%
Math Symbol
ValueCountFrequency (%)
~ 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6606
93.3%
Hangul 468
 
6.6%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
22.6%
73
15.6%
62
13.2%
44
9.4%
43
9.2%
32
 
6.8%
11
 
2.4%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (39) 70
15.0%
Common
ValueCountFrequency (%)
0 1272
19.3%
1 979
14.8%
2 769
11.6%
9 681
10.3%
- 590
8.9%
. 499
 
7.6%
8 276
 
4.2%
/ 275
 
4.2%
3 241
 
3.6%
5 207
 
3.1%
Other values (10) 817
12.4%
Latin
ValueCountFrequency (%)
S 1
33.3%
e 1
33.3%
p 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6609
93.4%
Hangul 468
 
6.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1272
19.2%
1 979
14.8%
2 769
11.6%
9 681
10.3%
- 590
8.9%
. 499
 
7.6%
8 276
 
4.2%
/ 275
 
4.2%
3 241
 
3.6%
5 207
 
3.1%
Other values (13) 820
12.4%
Hangul
ValueCountFrequency (%)
106
22.6%
73
15.6%
62
13.2%
44
9.4%
43
9.2%
32
 
6.8%
11
 
2.4%
10
 
2.1%
9
 
1.9%
8
 
1.7%
Other values (39) 70
15.0%

수량
Text

Distinct: 184
Distinct (%): 30.7%
Missing: 2
Missing (%): 0.3%
Memory size: 4.8 KiB

Length

Max length: 31
Median length: 2
Mean length: 2.966611
Min length: 1

Characters and Unicode

Total characters: 1777
Distinct characters: 111
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 150
Unique (%): 25.0%

Sample

1st row: 1점
2nd row: 1점
3rd row: 1점
4th row: 2점
5th row: 2점
ValueCountFrequency (%)
1점 198
29.6%
2점 90
 
13.4%
3점 36
 
5.4%
1기 25
 
3.7%
6점 14
 
2.1%
4점 13
 
1.9%
1 10
 
1.5%
5점 8
 
1.2%
8
 
1.2%
8점 7
 
1.0%
Other values (203) 261
39.0%

Most occurring characters

ValueCountFrequency (%)
498
28.0%
1 343
19.3%
2 152
 
8.6%
3 85
 
4.8%
80
 
4.5%
4 56
 
3.2%
0 52
 
2.9%
5 46
 
2.6%
6 39
 
2.2%
35
 
2.0%
Other values (101) 391
22.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 850
47.8%
Other Letter 756
42.5%
Space Separator 80
 
4.5%
Other Punctuation 36
 
2.0%
Open Punctuation 27
 
1.5%
Close Punctuation 27
 
1.5%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
498
65.9%
35
 
4.6%
17
 
2.2%
13
 
1.7%
13
 
1.7%
12
 
1.6%
8
 
1.1%
8
 
1.1%
7
 
0.9%
5
 
0.7%
Other values (85) 140
 
18.5%
Decimal Number
ValueCountFrequency (%)
1 343
40.4%
2 152
17.9%
3 85
 
10.0%
4 56
 
6.6%
0 52
 
6.1%
5 46
 
5.4%
6 39
 
4.6%
7 29
 
3.4%
8 28
 
3.3%
9 20
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 34
94.4%
/ 2
 
5.6%
Space Separator
ValueCountFrequency (%)
80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1021
57.5%
Hangul 756
42.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
498
65.9%
35
 
4.6%
17
 
2.2%
13
 
1.7%
13
 
1.7%
12
 
1.6%
8
 
1.1%
8
 
1.1%
7
 
0.9%
5
 
0.7%
Other values (85) 140
 
18.5%
Common
ValueCountFrequency (%)
1 343
33.6%
2 152
14.9%
3 85
 
8.3%
80
 
7.8%
4 56
 
5.5%
0 52
 
5.1%
5 46
 
4.5%
6 39
 
3.8%
, 34
 
3.3%
7 29
 
2.8%
Other values (6) 105
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1021
57.5%
Hangul 756
42.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
498
65.9%
35
 
4.6%
17
 
2.2%
13
 
1.7%
13
 
1.7%
12
 
1.6%
8
 
1.1%
8
 
1.1%
7
 
0.9%
5
 
0.7%
Other values (85) 140
 
18.5%
ASCII
ValueCountFrequency (%)
1 343
33.6%
2 152
14.9%
3 85
 
8.3%
80
 
7.8%
4 56
 
5.5%
0 52
 
5.1%
5 46
 
4.5%
6 39
 
3.8%
, 34
 
3.3%
7 29
 
2.8%
Other values (6) 105
 
10.3%
Distinct: 512
Distinct (%): 85.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 41
Median length: 3
Mean length: 5.5058236
Min length: 1

Characters and Unicode

Total characters: 3309
Distinct characters: 291
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 459
Unique (%): 76.4%

Sample

1st row: 청도 덕사
2nd row: 연안이씨 별좌공 종택
3rd row: 포항시
4th row: 경상북도 경주시
5th row: 진도군 덕병마을
ValueCountFrequency (%)
문중 28
 
3.5%
종중 25
 
3.1%
국유 11
 
1.4%
11
 
1.4%
선암사 7
 
0.9%
안정사 6
 
0.7%
페루 5
 
0.6%
청곡사 4
 
0.5%
동화사 4
 
0.5%
신흥사 4
 
0.5%
Other values (616) 697
86.9%

Most occurring characters

ValueCountFrequency (%)
202
 
6.1%
163
 
4.9%
141
 
4.3%
126
 
3.8%
97
 
2.9%
82
 
2.5%
80
 
2.4%
( 78
 
2.4%
) 77
 
2.3%
71
 
2.1%
Other values (281) 2192
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2899
87.6%
Space Separator 202
 
6.1%
Open Punctuation 83
 
2.5%
Close Punctuation 82
 
2.5%
Other Punctuation 16
 
0.5%
Lowercase Letter 12
 
0.4%
Uppercase Letter 6
 
0.2%
Dash Punctuation 5
 
0.2%
Decimal Number 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
163
 
5.6%
141
 
4.9%
126
 
4.3%
97
 
3.3%
82
 
2.8%
80
 
2.8%
71
 
2.4%
53
 
1.8%
53
 
1.8%
44
 
1.5%
Other values (252) 1989
68.6%
Lowercase Letter
ValueCountFrequency (%)
o 3
25.0%
n 2
16.7%
i 1
 
8.3%
j 1
 
8.3%
y 1
 
8.3%
e 1
 
8.3%
u 1
 
8.3%
q 1
 
8.3%
a 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
O 2
33.3%
B 1
16.7%
S 1
16.7%
K 1
16.7%
R 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 7
43.8%
: 7
43.8%
· 1
 
6.2%
/ 1
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 78
94.0%
[ 4
 
4.8%
1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 77
93.9%
] 4
 
4.9%
1
 
1.2%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 1
25.0%
3 1
25.0%
Space Separator
ValueCountFrequency (%)
202
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2899
87.6%
Common 392
 
11.8%
Latin 18
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
163
 
5.6%
141
 
4.9%
126
 
4.3%
97
 
3.3%
82
 
2.8%
80
 
2.8%
71
 
2.4%
53
 
1.8%
53
 
1.8%
44
 
1.5%
Other values (252) 1989
68.6%
Common
ValueCountFrequency (%)
202
51.5%
( 78
 
19.9%
) 77
 
19.6%
, 7
 
1.8%
: 7
 
1.8%
- 5
 
1.3%
[ 4
 
1.0%
] 4
 
1.0%
1 2
 
0.5%
1
 
0.3%
Other values (5) 5
 
1.3%
Latin
ValueCountFrequency (%)
o 3
16.7%
n 2
11.1%
O 2
11.1%
B 1
 
5.6%
i 1
 
5.6%
S 1
 
5.6%
j 1
 
5.6%
y 1
 
5.6%
K 1
 
5.6%
e 1
 
5.6%
Other values (4) 4
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2899
87.6%
ASCII 407
 
12.3%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
202
49.6%
( 78
 
19.2%
) 77
 
18.9%
, 7
 
1.7%
: 7
 
1.7%
- 5
 
1.2%
[ 4
 
1.0%
] 4
 
1.0%
o 3
 
0.7%
n 2
 
0.5%
Other values (16) 18
 
4.4%
Hangul
ValueCountFrequency (%)
163
 
5.6%
141
 
4.9%
126
 
4.3%
97
 
3.3%
82
 
2.8%
80
 
2.8%
71
 
2.4%
53
 
1.8%
53
 
1.8%
44
 
1.5%
Other values (252) 1989
68.6%
None
ValueCountFrequency (%)
1
33.3%
· 1
33.3%
1
33.3%

규격
Text

MISSING 

Distinct: 399
Distinct (%): 76.6%
Missing: 80
Missing (%): 13.3%
Memory size: 4.8 KiB
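
The percentages for 규격 use two different denominators, which the short check below makes explicit. It assumes that Missing (%) is taken over all 601 rows, while Distinct (%) and Unique (%) are taken over the non-missing rows only; the reported figures are consistent with that reading, but it is an inference from the numbers rather than something the report states.

```python
# Counts copied from the 규격 variable summary.
n_rows = 601
missing = 80
distinct = 399
unique = 387

# Assumption: distinct and unique shares are relative to non-missing rows.
present = n_rows - missing

missing_pct = 100 * missing / n_rows
distinct_pct = 100 * distinct / present
unique_pct = 100 * unique / present

print(f"missing {missing_pct:.1f}%, distinct {distinct_pct:.1f}%, unique {unique_pct:.1f}%")
```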

Length

Max length: 152
Median length: 79
Mean length: 17.690979
Min length: 2

Characters and Unicode

Total characters: 9217
Distinct characters: 290
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 387
Unique (%): 74.3%

Sample

1st row: - 상대석 : 지름 52cm, 높이 20cm, - 하대석 : 한변 80cm내외, 복련 지름 58cm, 높이 21cm
2nd row: 대장군(높이200cm,두께31cm), 진살등(높이210cm,두께30cm)
3rd row: 55.5cm*26.5cm
4th row: 130cm*153cm
5th row: 132.4 *31.7
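
The 규격 (dimensions) values are free text with inconsistent units and separators. As a hedged starting point for cleaning them, the sketch below extracts every decimal number from a value. The helper `extract_numbers` and its pattern are illustrative assumptions; it does not interpret which number is a height, width, or thickness, and it ignores units.

```python
import re

# Matches integers and simple decimals such as 130 or 55.5.
NUMBER = re.compile(r"\d+(?:\.\d+)?")

def extract_numbers(raw):
    """Return all numbers found in raw, in order of appearance, as floats."""
    return [float(token) for token in NUMBER.findall(raw)]

samples = ["55.5cm*26.5cm", "130cm*153cm", "132.4 *31.7"]
parsed = [extract_numbers(s) for s in samples]
print(parsed)
```

Values such as the first two sample rows, which label each measurement in Korean (높이 for height, 두께 for thickness, 지름 for diameter), would need label-aware parsing on top of this.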
ValueCountFrequency (%)
288
 
13.0%
높이 151
 
6.8%
가로 145
 
6.6%
세로 134
 
6.1%
미상 106
 
4.8%
40
 
1.8%
참조 27
 
1.2%
26
 
1.2%
길이 15
 
0.7%
둘레 14
 
0.6%
Other values (849) 1264
57.2%

Most occurring characters

ValueCountFrequency (%)
1708
18.5%
0 590
 
6.4%
m 559
 
6.1%
c 546
 
5.9%
1 441
 
4.8%
389
 
4.2%
2 343
 
3.7%
5 300
 
3.3%
218
 
2.4%
3 215
 
2.3%
Other values (280) 3908
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2934
31.8%
Decimal Number 2502
27.1%
Space Separator 1708
18.5%
Lowercase Letter 1129
 
12.2%
Other Punctuation 732
 
7.9%
Other Symbol 60
 
0.7%
Close Punctuation 55
 
0.6%
Open Punctuation 51
 
0.6%
Dash Punctuation 24
 
0.3%
Math Symbol 20
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
389
 
13.3%
218
 
7.4%
206
 
7.0%
195
 
6.6%
191
 
6.5%
142
 
4.8%
116
 
4.0%
66
 
2.2%
51
 
1.7%
45
 
1.5%
Other values (248) 1315
44.8%
Decimal Number
ValueCountFrequency (%)
0 590
23.6%
1 441
17.6%
2 343
13.7%
5 300
12.0%
3 215
 
8.6%
6 144
 
5.8%
7 136
 
5.4%
4 136
 
5.4%
8 106
 
4.2%
9 91
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
m 559
49.5%
c 546
48.4%
x 15
 
1.3%
g 4
 
0.4%
k 4
 
0.4%
t 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 211
28.8%
* 184
25.1%
: 148
20.2%
. 99
13.5%
/ 90
12.3%
Math Symbol
ValueCountFrequency (%)
~ 10
50.0%
× 9
45.0%
+ 1
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 51
92.7%
] 4
 
7.3%
Open Punctuation
ValueCountFrequency (%)
( 47
92.2%
[ 4
 
7.8%
Space Separator
ValueCountFrequency (%)
1708
100.0%
Other Symbol
ValueCountFrequency (%)
60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5152
55.9%
Hangul 2932
31.8%
Latin 1131
 
12.3%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
389
 
13.3%
218
 
7.4%
206
 
7.0%
195
 
6.7%
191
 
6.5%
142
 
4.8%
116
 
4.0%
66
 
2.3%
51
 
1.7%
45
 
1.5%
Other values (246) 1313
44.8%
Common
ValueCountFrequency (%)
1708
33.2%
0 590
 
11.5%
1 441
 
8.6%
2 343
 
6.7%
5 300
 
5.8%
3 215
 
4.2%
, 211
 
4.1%
* 184
 
3.6%
: 148
 
2.9%
6 144
 
2.8%
Other values (15) 868
16.8%
Latin
ValueCountFrequency (%)
m 559
49.4%
c 546
48.3%
x 15
 
1.3%
g 4
 
0.4%
k 4
 
0.4%
X 2
 
0.2%
t 1
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6214
67.4%
Hangul 2893
31.4%
CJK Compat 60
 
0.7%
Compat Jamo 39
 
0.4%
None 9
 
0.1%
CJK 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1708
27.5%
0 590
 
9.5%
m 559
 
9.0%
c 546
 
8.8%
1 441
 
7.1%
2 343
 
5.5%
5 300
 
4.8%
3 215
 
3.5%
, 211
 
3.4%
* 184
 
3.0%
Other values (20) 1117
18.0%
Hangul
ValueCountFrequency (%)
389
 
13.4%
218
 
7.5%
206
 
7.1%
195
 
6.7%
191
 
6.6%
142
 
4.9%
116
 
4.0%
66
 
2.3%
51
 
1.8%
45
 
1.6%
Other values (245) 1274
44.0%
CJK Compat
ValueCountFrequency (%)
60
100.0%
Compat Jamo
ValueCountFrequency (%)
39
100.0%
None
ValueCountFrequency (%)
× 9
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

시대
Text

Distinct: 265
Distinct (%): 44.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 61
Median length: 51
Mean length: 5.9467554
Min length: 1

Characters and Unicode

Total characters: 3574
Distinct characters: 182
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 227
Unique (%): 37.8%

Sample

1st row: 미상
2nd row: 미상
3rd row: 미상
4th row: 신라시대
5th row: 미상
ValueCountFrequency (%)
조선 150
 
16.8%
미상 96
 
10.8%
76
 
8.5%
조선시대 31
 
3.5%
조선후기 24
 
2.7%
20
 
2.2%
고려시대 11
 
1.2%
통일신라 8
 
0.9%
근대 7
 
0.8%
추정 7
 
0.8%
Other values (355) 461
51.7%

Most occurring characters

ValueCountFrequency (%)
296
 
8.3%
293
 
8.2%
1 269
 
7.5%
268
 
7.5%
226
 
6.3%
0 139
 
3.9%
114
 
3.2%
111
 
3.1%
103
 
2.9%
9 102
 
2.9%
Other values (172) 1653
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2016
56.4%
Decimal Number 977
27.3%
Space Separator 296
 
8.3%
Other Punctuation 96
 
2.7%
Open Punctuation 72
 
2.0%
Close Punctuation 71
 
2.0%
Dash Punctuation 34
 
1.0%
Math Symbol 10
 
0.3%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
293
14.5%
268
 
13.3%
226
 
11.2%
114
 
5.7%
111
 
5.5%
103
 
5.1%
69
 
3.4%
68
 
3.4%
44
 
2.2%
42
 
2.1%
Other values (151) 678
33.6%
Decimal Number
ValueCountFrequency (%)
1 269
27.5%
0 139
14.2%
9 102
 
10.4%
7 92
 
9.4%
8 82
 
8.4%
6 73
 
7.5%
2 62
 
6.3%
5 57
 
5.8%
4 54
 
5.5%
3 47
 
4.8%
Other Punctuation
ValueCountFrequency (%)
: 44
45.8%
/ 25
26.0%
, 23
24.0%
' 2
 
2.1%
. 2
 
2.1%
Space Separator
ValueCountFrequency (%)
296
100.0%
Open Punctuation
ValueCountFrequency (%)
( 72
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2014
56.4%
Common 1556
43.5%
Latin 2
 
0.1%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
293
14.5%
268
 
13.3%
226
 
11.2%
114
 
5.7%
111
 
5.5%
103
 
5.1%
69
 
3.4%
68
 
3.4%
44
 
2.2%
42
 
2.1%
Other values (149) 676
33.6%
Common
ValueCountFrequency (%)
296
19.0%
1 269
17.3%
0 139
8.9%
9 102
 
6.6%
7 92
 
5.9%
8 82
 
5.3%
6 73
 
4.7%
( 72
 
4.6%
) 71
 
4.6%
2 62
 
4.0%
Other values (10) 298
19.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%
Latin
ValueCountFrequency (%)
c 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1991
55.7%
ASCII 1558
43.6%
Compat Jamo 23
 
0.6%
CJK 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
296
19.0%
1 269
17.3%
0 139
8.9%
9 102
 
6.5%
7 92
 
5.9%
8 82
 
5.3%
6 73
 
4.7%
( 72
 
4.6%
) 71
 
4.6%
2 62
 
4.0%
Other values (11) 300
19.3%
Hangul
ValueCountFrequency (%)
293
14.7%
268
13.5%
226
 
11.4%
114
 
5.7%
111
 
5.6%
103
 
5.2%
69
 
3.5%
68
 
3.4%
44
 
2.2%
42
 
2.1%
Other values (148) 653
32.8%
Compat Jamo
ValueCountFrequency (%)
23
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct: 579
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 60
Median length: 35
Mean length: 21.600666
Min length: 1

Characters and Unicode

Total characters: 12982
Distinct characters: 375
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 561
Unique (%): 93.3%

Sample

1st row: 경상북도 청도군 화양읍 소라리1
2nd row: 경상북도 예천군 호명면 송곡길 14
3rd row: 경상북도 포항시 북구 신광면 상읍리 874-3
4th row: 경상북도 경주시 교동 243번지 천관사지(사적 제340호)내
5th row: 전라남도 진도군 군내면 덕병리 1762
ValueCountFrequency (%)
경북 159
 
4.8%
경남 89
 
2.7%
전남 55
 
1.7%
충남 46
 
1.4%
충북 42
 
1.3%
경기도 39
 
1.2%
전북 33
 
1.0%
경주시 28
 
0.9%
24
 
0.7%
강원도 21
 
0.6%
Other values (1841) 2755
83.7%

Most occurring characters

ValueCountFrequency (%)
2702
 
20.8%
473
 
3.6%
406
 
3.1%
364
 
2.8%
337
 
2.6%
311
 
2.4%
1 299
 
2.3%
276
 
2.1%
268
 
2.1%
243
 
1.9%
Other values (365) 7303
56.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8546 | 65.8%
Space Separator | 2702 | 20.8%
Decimal Number | 1388 | 10.7%
Dash Punctuation | 140 | 1.1%
Lowercase Letter | 109 | 0.8%
Uppercase Letter | 26 | 0.2%
Open Punctuation | 24 | 0.2%
Close Punctuation | 24 | 0.2%
Other Punctuation | 23 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 473 | 5.5%
 | 406 | 4.8%
 | 364 | 4.3%
 | 337 | 3.9%
 | 311 | 3.6%
 | 276 | 3.2%
 | 268 | 3.1%
 | 243 | 2.8%
 | 207 | 2.4%
 | 183 | 2.1%
Other values (317) | 5478 | 64.1%
Lowercase Letter
Value | Count | Frequency (%)
a | 20 | 18.3%
o | 14 | 12.8%
e | 10 | 9.2%
u | 9 | 8.3%
n | 7 | 6.4%
m | 6 | 5.5%
r | 6 | 5.5%
c | 5 | 4.6%
i | 5 | 4.6%
l | 5 | 4.6%
Other values (10) | 22 | 20.2%
Uppercase Letter
Value | Count | Frequency (%)
N | 4 | 15.4%
A | 3 | 11.5%
O | 3 | 11.5%
S | 3 | 11.5%
P | 3 | 11.5%
R | 2 | 7.7%
T | 2 | 7.7%
B | 2 | 7.7%
H | 2 | 7.7%
G | 1 | 3.8%
Decimal Number
Value | Count | Frequency (%)
1 | 299 | 21.5%
2 | 196 | 14.1%
3 | 155 | 11.2%
4 | 150 | 10.8%
5 | 111 | 8.0%
8 | 107 | 7.7%
6 | 101 | 7.3%
0 | 100 | 7.2%
7 | 94 | 6.8%
9 | 75 | 5.4%
Other Punctuation
Value | Count | Frequency (%)
. | 10 | 43.5%
, | 8 | 34.8%
/ | 5 | 21.7%
Space Separator
Value | Count | Frequency (%)
(space) | 2702 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 140 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 24 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 24 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8546 | 65.8%
Common | 4301 | 33.1%
Latin | 135 | 1.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 473 | 5.5%
 | 406 | 4.8%
 | 364 | 4.3%
 | 337 | 3.9%
 | 311 | 3.6%
 | 276 | 3.2%
 | 268 | 3.1%
 | 243 | 2.8%
 | 207 | 2.4%
 | 183 | 2.1%
Other values (317) | 5478 | 64.1%
Latin
Value | Count | Frequency (%)
a | 20 | 14.8%
o | 14 | 10.4%
e | 10 | 7.4%
u | 9 | 6.7%
n | 7 | 5.2%
m | 6 | 4.4%
r | 6 | 4.4%
c | 5 | 3.7%
i | 5 | 3.7%
l | 5 | 3.7%
Other values (21) | 48 | 35.6%
Common
Value | Count | Frequency (%)
(space) | 2702 | 62.8%
1 | 299 | 7.0%
2 | 196 | 4.6%
3 | 155 | 3.6%
4 | 150 | 3.5%
- | 140 | 3.3%
5 | 111 | 2.6%
8 | 107 | 2.5%
6 | 101 | 2.3%
0 | 100 | 2.3%
Other values (7) | 240 | 5.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8546 | 65.8%
ASCII | 4436 | 34.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 2702 | 60.9%
1 | 299 | 6.7%
2 | 196 | 4.4%
3 | 155 | 3.5%
4 | 150 | 3.4%
- | 140 | 3.2%
5 | 111 | 2.5%
8 | 107 | 2.4%
6 | 101 | 2.3%
0 | 100 | 2.3%
Other values (38) | 375 | 8.5%
Hangul
Value | Count | Frequency (%)
 | 473 | 5.5%
 | 406 | 4.8%
 | 364 | 4.3%
 | 337 | 3.9%
 | 311 | 3.6%
 | 276 | 3.2%
 | 268 | 3.1%
 | 243 | 2.8%
 | 207 | 2.4%
 | 183 | 2.1%
Other values (317) | 5478 | 64.1%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
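
As a rough illustration of what nullity correlation measures, the sketch below computes the Pearson correlation between the missingness indicators of two columns using only the standard library. It is an assumption-laden example rather than the report's own code: the function name `nullity_correlation` is hypothetical, and the two lists only encode which of the first ten sample rows show `<NA>` for 수량 and 규격 (non-missing cells are replaced by a placeholder string).

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missingness indicators of two columns.

    A value of None marks a missing cell. Returns None when either column
    is never or always missing, because the correlation is then undefined.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Missingness pattern of 수량 and 규격 in sample rows 0-9 ("x" = value present).
quantity = ["x", "x", "x", "x", "x", None, "x", None, "x", "x"]
size = [None, None, None, "x", "x", None, None, None, None, "x"]

r = nullity_correlation(quantity, size)
print(round(r, 3))
```

A positive value means the two columns tend to be missing in the same rows; a value near zero means their gaps are unrelated. Ten rows are far too few to draw conclusions about the full 601-row dataset.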

Sample

First rows

Index | 구분 | 제목 | 담당부서 | 작성일 | 문화재명 | 지정유형 | 도난일자설명 | 수량 | 소유자 | 규격 | 시대 | 도난장소
0 | 도난 | [유실]경상북도 도유형문화재 제399-21호 청도군 덕사 영산전 복장발원문 1점 | 안전기준과 | 2021-07-19 | [유실]경상북도 도유형문화재 제399-21호 청도군 덕사 영산전 복장발원문 1점 | 경상북도 유형문화재 제399-21호 | 2016년~2018년 | 1점 | 청도 덕사 | <NA> | 미상 | 경상북도 청도군 화양읍 소라리1
1 | 도난 | [도난]경상북도 예천군 연안이씨 별좌공 종택 편액 긍구헌 1점 | 안전기준과 | 2021-07-19 | [도난]경상북도 예천군 연안이씨 별좌공 종택 편액 긍구헌 1점 | 비지정문화재 | 2020.3.19.~2021.3.15(추정) | 1점 | 연안이씨 별좌공 종택 | <NA> | 미상 | 경상북도 예천군 호명면 송곡길 14
2 | 도난 | [도난]포항시 법광사지(사적 제493호) 내 삼층석탑 구형부재 상륜부 1점 | 안전기준과 | 2021-05-13 | [도난]포항시 법광사지(사적 제493호) 내 삼층석탑 구형부재 상륜부 1점 | 비지정문화재 | 2021년 3월 이전 | 1점 | 포항시 | <NA> | 미상 | 경상북도 포항시 북구 신광면 상읍리 874-3
3 | 도난 | [도난]경주시 천관사지(사적 제340호) 내 석등 상대석, 석등 하대석 2점 | 안전기준과 | 2021-05-12 | [도난]경주시 천관사지(사적 제340호) 내 석등 상대석, 석등 하대석 2점 | 비지정문화재 | 2021-04-28 | 2점 | 경상북도 경주시 | - 상대석 : 지름 52cm, 높이 20cm, - 하대석 : 한변 80cm내외, 복련 지름 58cm, 높이 21cm | 신라시대 | 경상북도 경주시 교동 243번지 천관사지(사적 제340호)내
4 | 도난 | [도난]전라남도 진도군 덕병리 석장승 2점 | 안전기준과 | 2021-03-16 | [도난]전라남도 진도군 덕병리 석장승 2점 | 비지정문화재 | 1989년 3월경 | 2점 | 진도군 덕병마을 | 대장군(높이200cm,두께31cm), 진살등(높이210cm,두께30cm) | 미상 | 전라남도 진도군 군내면 덕병리 1762
5 | 도난 | [도굴]경기도 용인시 향토유적 제70호 죽산박씨 문헌공파 묘역 | 안전기준과 | 2020-12-30 | [도굴]경기도 용인시 향토유적 제70호 죽산박씨 문헌공파 묘역 | 경기도 용인시 향토유적 제70호 | 2020년 9월 15일 이전 | <NA> | 죽산박씨 문헌공파 종중 | <NA> | 조선 | 경기도 용인시 처인구 백암면 옥산리 산 48-1
6 | 도난 | [도난]경기도 안양시 석수동 분청도자기 등 4점 | 안전기준과 | 2020-11-03 | [도난]경기도 안양시 석수동 분청도자기 등 4점 | 비지정문화재 | 2020.6.27. | 4점 | 이OO | <NA> | 조선시대(추정) | 경기도 안양시 만안구 석수동 300-4번지, 3OO호
7 | 도난 | [도굴]경상북도 문화재자료 제647호 영주 류빈묘씨 | 안전기준과 | 2020-09-14 | [도굴]경상북도 문화재자료 제647호 영주 류빈묘씨 | 경상북도 문화재자료 제647호 | 2020년 9월 1일 이전 | <NA> | 전주류씨 영흥공파 종중 | <NA> | 고려말~조선 | 경상북도 영주시 문수면 승문리 산173
8 | 도난 | [도난]국가민속문화재 제40호 홍극가묘출토복식 4점 | 안전기준과 | 2020-09-11 | [도난]국가민속문화재 제40호 홍극가묘출토복식 4점 | 국가민속문화재 제40호 | 2018년 12월~2019년 8월 | 4점 | 안동대학교 | <NA> | 조선 | 안동대학교 박물관(경북 안동시 경동로 1375)
9 | 도난 | [도난]보물 제260호 유희춘 미암집목판 6점 | 안전기준과 | 2020-06-16 | [도난]보물 제260호 유희춘 미암집목판 6점 | 보물 제260호 | 1982년도 추정 | 6점 | 미암종중(유근영) | 55.5cm*26.5cm | 조선 | 전라남도 담양군 대덕면 장동길 89-4

Last rows

Index | 구분 | 제목 | 담당부서 | 작성일 | 문화재명 | 지정유형 | 도난일자설명 | 수량 | 소유자 | 규격 | 시대 | 도난장소
591 | 도난 | 순천 선암사 부도 [화산대사사리탑 기단(동물상)] | 안전기준과 | 2010-07-20 | 순천 선암사 부도 [화산대사사리탑 기단(동물상)] | 전라남도 문화재자료 제42호 | 1986-02-06 | 2점 | 선암사 | 높이 80cm | 미상 | 전남 순천시 승주읍 죽학리 802 선암사
592 | 도난 | [도난] 보령 성주리 성주사지내 [석계단(聖住寺址石階段)] | 안전기준과 | 2011-08-24 | [도난] 보령 성주리 성주사지내 [석계단(聖住寺址石階段)] | 충청남도 문화재자료 제140호 | 1986/01/28-29 | 2점 | 보령시 | 5단석 | 통일신라 | 충남 보령시 성주면 성주리 72 성주사지
593 | 도난 | [도난] 마산 두척동 모덕사 소장 영정 [최치원 영정 등] | 안전기준과 | 2011-08-24 | [도난] 마산 두척동 모덕사 소장 영정 [최치원 영정 등] | 비지정문화재 | 1985-11-26 | 25점 | 모덕사 | 가로 60 * 세로 100cm | 1904년 | 경남 마산시 두척동 63번지 최치원선생 영당
594 | 도난 | [도난] 아산 유곡리 봉곡사 소장 불화 [관음보살상] | 안전기준과 | 2011-08-24 | [도난] 아산 유곡리 봉곡사 소장 불화 [관음보살상] | 충청남도 문화재자료 제242호 | 1985.11.07-11.08 | 1점 | 봉곡사 | 가로 40cm 세로 75cm | 미상 | 충남 아산시 송악면 유곡리 595 봉곡사
595 | 도난 | [도난] 서산 대산중학교 향토관 소장 불상 등 [철조여래불상,청화백자, 장승업그림, 민화, 동경, 마패, 와당, 엽전] 114점 | 안전기준과 | 2011-08-24 | [도난] 서산 대산중학교 향토관 소장 불상 등 [철조여래불상,청화백자, 장승업그림, 민화, 동경, 마패, 와당, 엽전] 114점 | 비지정문화재 | 1985. 10. 28 | 1점, 3점, 5점, 105 | 김기풍 | 동경 23, 백자 27*43*7cm | 고려-조선 | 충남 서산군 대산면 대산중학교 향토관
596 | 도난 | [도난] 상주 금혼리 충의사유물전시관 정기룡장군유물 중 [유서] 1점 | 안전기준과 | 2011-08-24 | [도난] 상주 금혼리 충의사유물전시관 정기룡장군유물 중 [유서] 1점 | 보물 제669호 | 1985-10-07 | 1매 | 정기목 | 가로 91cm 세로 262cm | 1617년 | 경북 상주군 사벌면 금혼리 충의사유물전시관
597 | 도난 | [도난] 종로구 이화장 소장 서화류 등 [시화, 병풍 등] 19점 | 안전기준과 | 2011-08-24 | [도난] 종로구 이화장 소장 서화류 등 [시화, 병풍 등] 19점 | 비지정문화재 | 1985. 08. 14 - 15 | 19점 | 이인수 | 미상 | 1959년 | 서울 종로구 이화동 이화장
598 | 도난 | [도난] 김천 상원리 정양공위 사당 소장 [일산, 호패, 좌리공신교서] 6점 | 안전기준과 | 2011-08-24 | [도난] 김천 상원리 정양공위 사당 소장 [일산, 호패, 좌리공신교서] 6점 | 비지정문화재 | 1985-04-21 | 6점 | 이철웅 | 일산 : 높이 251cm /교서 가로 139 * 세로 27cm | 일산(1) : 조선/호패(4) : 미상/교서(1) : 1472년 | 경북 김천시 구성면 상원리 53번지
599 | 도난 | [도난] 구미 임수동 동락서원 소장 영정[여헌 영정, 가죽신] | 안전기준과 | 2011-08-24 | [도난] 구미 임수동 동락서원 소장 영정[여헌 영정, 가죽신] | 비지정문화재 | 1985/01/19-20 | 1매, 1켤레 | 동락서원 | 영정 : 가로 121 * 세로 182cm/ 가죽신 40cm | 1554-1632 | 경북 구미시 임수동 37번지 동락서원
600 | 도난 | [도난] 용인 상현리 심곡서원 소장 고문서 등[소학 등, 향합] 98점 | 안전기준과 | 2011-08-24 | [도난] 용인 상현리 심곡서원 소장 고문서 등[소학 등, 향합] 98점 | 비지정문화재 | 1985.01.03 | 96책, 2점 | 한양조씨종중 | <NA> | 미상 | 경기도 용인군 누지면 상현리 심곡서원