The file name was changed to "경상북도 상주시_상주박물관소장유물_20171019.csv". The original name was mis-encoded: it appears only as a run of percent-encoded U+FFFD replacement characters (%EF%BF%BD) ending in "_20171019.csv".

Overview

Dataset statistics

Number of variables: 11
Number of observations: 1040
Missing cells: 803
Missing cells (%): 7.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 91.5 KiB
Average record size in memory: 90.1 B
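
The derived figures in this table follow from the raw counts. The sketch below recomputes them, assuming that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 11
n_observations = 1040
missing_cells = 803
total_size_kib = 91.5

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
print(f"{avg_record_bytes:.1f} B per record")
```

Both results round to the values shown in the table (7.0% and 90.1 B).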

Variable types

Numeric: 2
Text: 3
Categorical: 6

Dataset

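Description: Data on the artifacts held by the Sangju Museum (상주박물관), providing fields such as artifact name (유물명), quantity (주수량), period (시대), genre (장르), material (재질), and size (크기).
Author: Sangju City, Gyeongsangbuk-do (경상북도 상주시)
URL: https://www.data.go.kr/data/3049752/fileData.do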

Alerts

데이터기준일 has constant value "2017-10-19" (Constant)
번호 is highly overall correlated with 출토지/소장자 and 1 other field (High correlation)
주수량 is highly overall correlated with 출토지/소장자 (High correlation)
시대 is highly overall correlated with 장르 and 1 other field (High correlation)
장르 is highly overall correlated with 시대 and 3 other fields (High correlation)
재질 is highly overall correlated with 장르 and 1 other field (High correlation)
출토지/소장자 is highly overall correlated with 번호 and 5 other fields (High correlation)
문화재지정 is highly overall correlated with 번호 and 2 other fields (High correlation)
시대 is highly imbalanced (53.6%) (Imbalance)
재질 is highly imbalanced (64.2%) (Imbalance)
출토지/소장자 is highly imbalanced (81.7%) (Imbalance)
유물설명 has 791 (76.1%) missing values (Missing)
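
The report does not say how the imbalance percentage is computed. The sketch below assumes it is one minus the Shannon entropy of the category frequencies divided by the maximum possible entropy for that number of categories. The helper name `imbalance_score` is hypothetical; the counts are taken from the 시대 "Common Values" table, with the two remaining categories assumed to hold one row each (two values sharing a count of 2).

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for category counts.

    0.0 means perfectly balanced categories; values near 1.0 mean one
    category dominates. Assumed formula, not confirmed by the report.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        raise ValueError("need at least two non-empty categories")
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))

# 시대 counts: 조선, 근/현대, 기타, 일제강점기, 대한제국, 광복이후, 삼국, 고려,
# <NA>, 통일신라, plus two categories with one row each.
sidae_counts = [659, 200, 93, 52, 9, 8, 6, 6, 3, 2, 1, 1]

score = imbalance_score(sidae_counts)
print(f"{score:.3f}")
```

Under this assumption the score for 시대 comes out at roughly 0.536, which agrees with the 53.6% in the alert.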

Reproduction

Analysis started: 2023-12-12 11:01:18.795001
Analysis finished: 2023-12-12 11:01:21.703132
Duration: 2.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1038
Distinct (%): 100.0%
Missing: 2
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 519.5
Minimum: 1
Maximum: 1038
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 52.85
Q1: 260.25
Median: 519.5
Q3: 778.75
95-th percentile: 986.15
Maximum: 1038
Range: 1037
Interquartile range (IQR): 518.5

Descriptive statistics

Standard deviation: 299.78909
Coefficient of variation (CV): 0.57707236
Kurtosis: -1.2
Mean: 519.5
Median Absolute Deviation (MAD): 259.5
Skewness: 0
Sum: 539241
Variance: 89873.5
Monotonicity: Strictly increasing
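
The 번호 statistics are what a plain running index would produce. The sketch below assumes the 1038 non-missing values are exactly the integers 1 to 1038 (consistent with the distinct count, minimum, maximum, and sum reported here) and recomputes the summary figures. The `percentile` helper is hypothetical and uses linear interpolation between neighbouring ranks.

```python
import statistics

# Assumption: the non-missing 번호 values are exactly 1, 2, ..., 1038.
values = list(range(1, 1039))

def percentile(sorted_vals, q):
    """Linear-interpolation percentile for q in [0, 1] on sorted data."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
stdev = statistics.stdev(values)
cv = stdev / mean
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)

print(total, mean, variance)
print(round(stdev, 5), round(cv, 5), round(q1, 2), round(q3, 2))
```

The reported variance of 89873.5 matches the sample variance (n − 1 denominator) rather than the population variance, so the sketch uses `statistics.variance`.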
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
715 1
 
0.1%
685 1
 
0.1%
686 1
 
0.1%
687 1
 
0.1%
688 1
 
0.1%
689 1
 
0.1%
690 1
 
0.1%
691 1
 
0.1%
692 1
 
0.1%
693 1
 
0.1%
Other values (1028) 1028
98.8%
(Missing) 2
 
0.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1038 1
0.1%
1037 1
0.1%
1036 1
0.1%
1035 1
0.1%
1034 1
0.1%
1033 1
0.1%
1032 1
0.1%
1031 1
0.1%
1030 1
0.1%
1029 1
0.1%

유물명
Text

Distinct650
Distinct (%)62.6%
Missing2
Missing (%)0.2%
Memory size8.3 KiB

Length

Max length33
Median length24
Mean length6.0635838
Min length1

Characters and Unicode

Total characters6294
Distinct characters620
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
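
As a rough illustration of the character-category breakdown, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The function name `category_counts` is hypothetical, and the standard library returns two-letter codes (for example `Lo` for Other Letter, `Zs` for Space Separator, `Nd` for Decimal Number) rather than the long names shown in this report. Script and block statistics are not covered, since the standard library does not expose them.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (two-letter codes) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

# First sample row of 유물명 from this report.
sample = "휘찬려사 彙纂麗史 (1)~(23)"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables and Han ideographs both fall under `Lo`, which is why Other Letter dominates the category table for this column.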

Unique

Unique553 ?
Unique (%)53.3%

Sample

1st row휘찬려사 彙纂麗史 (1)~(23)
2nd row후집 後集 (1) ~(5)
3rd row효자공실록부양리 공문집 孝子公實錄附陽里 公文集 (1)~(2)
4th row함창향교교지 咸昌鄕校校誌
5th row학용요의변정록 學庸要義卞正錄
ValueCountFrequency (%)
간찰 175
 
10.1%
고신 77
 
4.4%
준호구 71
 
4.1%
김영기 34
 
2.0%
영남지도 34
 
2.0%
교지 29
 
1.7%
26
 
1.5%
지형도 24
 
1.4%
시문집 18
 
1.0%
1950년대 16
 
0.9%
Other values (816) 1236
71.0%

Most occurring characters

ValueCountFrequency (%)
702
 
11.2%
185
 
2.9%
178
 
2.8%
150
 
2.4%
135
 
2.1%
) 119
 
1.9%
( 119
 
1.9%
112
 
1.8%
104
 
1.7%
97
 
1.5%
Other values (610) 4393
69.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5067
80.5%
Space Separator 702
 
11.2%
Decimal Number 206
 
3.3%
Close Punctuation 119
 
1.9%
Open Punctuation 119
 
1.9%
Math Symbol 32
 
0.5%
Dash Punctuation 26
 
0.4%
Other Punctuation 13
 
0.2%
Other Symbol 7
 
0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
185
 
3.7%
178
 
3.5%
150
 
3.0%
135
 
2.7%
112
 
2.2%
104
 
2.1%
97
 
1.9%
96
 
1.9%
93
 
1.8%
92
 
1.8%
Other values (587) 3825
75.5%
Decimal Number
ValueCountFrequency (%)
1 79
38.3%
9 28
 
13.6%
0 27
 
13.1%
5 27
 
13.1%
2 17
 
8.3%
3 10
 
4.9%
4 8
 
3.9%
7 6
 
2.9%
6 3
 
1.5%
8 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 6
46.2%
? 4
30.8%
· 2
 
15.4%
/ 1
 
7.7%
Lowercase Letter
ValueCountFrequency (%)
x 1
33.3%
k 1
33.3%
g 1
33.3%
Space Separator
ValueCountFrequency (%)
702
100.0%
Close Punctuation
ValueCountFrequency (%)
) 119
100.0%
Open Punctuation
ValueCountFrequency (%)
( 119
100.0%
Math Symbol
ValueCountFrequency (%)
~ 32
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4610
73.2%
Common 1224
 
19.4%
Han 456
 
7.2%
Latin 3
 
< 0.1%
Katakana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
185
 
4.0%
178
 
3.9%
150
 
3.3%
135
 
2.9%
112
 
2.4%
104
 
2.3%
97
 
2.1%
96
 
2.1%
93
 
2.0%
92
 
2.0%
Other values (355) 3368
73.1%
Han
ValueCountFrequency (%)
14
 
3.1%
13
 
2.9%
9
 
2.0%
9
 
2.0%
8
 
1.8%
7
 
1.5%
7
 
1.5%
7
 
1.5%
7
 
1.5%
7
 
1.5%
Other values (221) 368
80.7%
Common
ValueCountFrequency (%)
702
57.4%
) 119
 
9.7%
( 119
 
9.7%
1 79
 
6.5%
~ 32
 
2.6%
9 28
 
2.3%
0 27
 
2.2%
5 27
 
2.2%
- 26
 
2.1%
2 17
 
1.4%
Other values (10) 48
 
3.9%
Latin
ValueCountFrequency (%)
x 1
33.3%
k 1
33.3%
g 1
33.3%
Katakana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4610
73.2%
ASCII 1218
 
19.4%
CJK 446
 
7.1%
CJK Compat Ideographs 10
 
0.2%
Geometric Shapes 7
 
0.1%
None 2
 
< 0.1%
Katakana 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
702
57.6%
) 119
 
9.8%
( 119
 
9.8%
1 79
 
6.5%
~ 32
 
2.6%
9 28
 
2.3%
0 27
 
2.2%
5 27
 
2.2%
- 26
 
2.1%
2 17
 
1.4%
Other values (11) 42
 
3.4%
Hangul
ValueCountFrequency (%)
185
 
4.0%
178
 
3.9%
150
 
3.3%
135
 
2.9%
112
 
2.4%
104
 
2.3%
97
 
2.1%
96
 
2.1%
93
 
2.0%
92
 
2.0%
Other values (355) 3368
73.1%
CJK
ValueCountFrequency (%)
14
 
3.1%
13
 
2.9%
9
 
2.0%
9
 
2.0%
8
 
1.8%
7
 
1.6%
7
 
1.6%
7
 
1.6%
7
 
1.6%
7
 
1.6%
Other values (216) 358
80.3%
Geometric Shapes
ValueCountFrequency (%)
7
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
5
50.0%
2
 
20.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
None
ValueCountFrequency (%)
· 2
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%

주수량
Real number (ℝ)

HIGH CORRELATION 

Distinct: 20
Distinct (%): 1.9%
Missing: 2
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6252408
Minimum: 1
Maximum: 47
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 4
Maximum: 47
Range: 46
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2.8996756
Coefficient of variation (CV): 1.7841513
Kurtosis: 109.22685
Mean: 1.6252408
Median Absolute Deviation (MAD): 0
Skewness: 8.9992367
Sum: 1687
Variance: 8.4081183
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
1 899
86.4%
2 52
 
5.0%
3 25
 
2.4%
4 14
 
1.3%
5 14
 
1.3%
14 4
 
0.4%
16 4
 
0.4%
7 4
 
0.4%
12 4
 
0.4%
6 3
 
0.3%
Other values (10) 15
 
1.4%
ValueCountFrequency (%)
1 899
86.4%
2 52
 
5.0%
3 25
 
2.4%
4 14
 
1.3%
5 14
 
1.3%
6 3
 
0.3%
7 4
 
0.4%
8 2
 
0.2%
9 1
 
0.1%
10 2
 
0.2%
ValueCountFrequency (%)
47 1
 
0.1%
43 1
 
0.1%
25 1
 
0.1%
23 1
 
0.1%
17 1
 
0.1%
16 4
0.4%
15 2
0.2%
14 4
0.4%
13 3
0.3%
12 4
0.4%
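
The ten smallest and ten largest values listed above together cover all 20 distinct 주수량 values, so the summary statistics can be rebuilt from the frequency tables alone. The sketch below does that; the dictionary name `freq` is illustrative, and the variance is assumed to be the sample variance (n − 1 denominator).

```python
import math

# value -> count, read from the minimum and maximum value tables for 주수량.
freq = {
    1: 899, 2: 52, 3: 25, 4: 14, 5: 14, 6: 3, 7: 4, 8: 2, 9: 1, 10: 2,
    12: 4, 13: 3, 14: 4, 15: 2, 16: 4, 17: 1, 23: 1, 25: 1, 43: 1, 47: 1,
}

n = sum(freq.values())                          # non-missing observations
total = sum(v * c for v, c in freq.items())     # "Sum" in the report
mean = total / n

# Assumption: sample variance, dividing by n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 7), round(variance, 4), round(cv, 4))
```

With 899 of 1038 rows equal to 1, the median, quartiles, and MAD all collapse to the same value, which is why the IQR and MAD are reported as 0 while the skewness and kurtosis are very large.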

시대
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct12
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
조선
659 
근/현대
200 
기타
93 
일제강점기
 
52
대한제국
 
9
Other values (7)
 
27

Length

Max length5
Median length2
Mean length2.5769231
Min length1

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

Value | Count | Frequency (%)
조선 | 659 | 63.4%
근/현대 | 200 | 19.2%
기타 | 93 | 8.9%
일제강점기 | 52 | 5.0%
대한제국 | 9 | 0.9%
광복이후 | 8 | 0.8%
삼국 | 6 | 0.6%
고려 | 6 | 0.6%
<NA> | 3 | 0.3%
통일신라 | 2 | 0.2%
Other values (2) | 2 | 0.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
조선 659
63.4%
근/현대 200
 
19.2%
기타 93
 
8.9%
일제강점기 52
 
5.0%
대한제국 9
 
0.9%
광복이후 8
 
0.8%
삼국 6
 
0.6%
고려 6
 
0.6%
na 3
 
0.3%
통일신라 2
 
0.2%
Other values (2) 2
 
0.2%

장르
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
고문서
551 
민속품
214 
고서
143 
서화
63 
공예
 
49
Other values (4)
 
20

Length

Max length4
Median length3
Mean length2.7413462
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row고서
2nd row고서
3rd row고서
4th row고서
5th row고서

Common Values

ValueCountFrequency (%)
고문서 551
53.0%
민속품 214
 
20.6%
고서 143
 
13.8%
서화 63
 
6.1%
공예 49
 
4.7%
기타 13
 
1.2%
<NA> 3
 
0.3%
건축 3
 
0.3%
조선 1
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
고문서 551
53.0%
민속품 214
 
20.6%
고서 143
 
13.8%
서화 63
 
6.1%
공예 49
 
4.7%
기타 13
 
1.2%
na 3
 
0.3%
건축 3
 
0.3%
조선 1
 
0.1%

재질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct11
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
822 
금속
 
75
목재
 
60
도자기
 
44
사직
 
13
Other values (6)
 
26

Length

Max length4
Median length1
Mean length1.2576923
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
822
79.0%
금속 75
 
7.2%
목재 60
 
5.8%
도자기 44
 
4.2%
사직 13
 
1.2%
토제 11
 
1.1%
기타 6
 
0.6%
석재 3
 
0.3%
<NA> 3
 
0.3%
유리 2
 
0.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
822
79.0%
금속 75
 
7.2%
목재 60
 
5.8%
도자기 44
 
4.2%
사직 13
 
1.2%
토제 11
 
1.1%
기타 6
 
0.6%
석재 3
 
0.3%
na 3
 
0.3%
유리 2
 
0.2%

크기
Text

Distinct970
Distinct (%)93.8%
Missing6
Missing (%)0.6%
Memory size8.3 KiB

Length

Max length72
Median length11
Mean length11.846228
Min length3

Characters and Unicode

Total characters12249
Distinct characters113
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique928 ?
Unique (%)89.7%

Sample

1st row27.6×19.8cm
2nd row28.3×19.2cm
3rd row25.9×18.6cm
4th row26.5 × 19.0cm
5th row29.1 × 18.5cm
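
The 크기 values mix several spellings of the same pattern: with or without spaces around the separator, and with either the multiplication sign "×" or the Latin letter "x". A minimal sketch of pulling the numeric parts out of such strings follows; `parse_dimensions` is a hypothetical helper, and it assumes plain decimal numbers with no thousands separators.

```python
import re

# Matches integers and simple decimals such as "21" or "27.6".
NUMBER = re.compile(r"\d+(?:\.\d+)?")

def parse_dimensions(size):
    """Return the numeric parts of a size string as floats, in order."""
    return [float(token) for token in NUMBER.findall(size)]

for raw in ["27.6×19.8cm", "26.5 × 19.0cm", "56.1x46.8cm", "30.5 × 21cm"]:
    print(raw, "->", parse_dimensions(raw))
```

This ignores the unit and any Korean labels such as 높이, 길이, or 지름 that appear in some entries, so values carrying those labels would need extra handling before the numbers can be compared across rows.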
ValueCountFrequency (%)
× 15
 
1.2%
높이 14
 
1.1%
1 12
 
0.9%
길이 8
 
0.6%
14.1×9.1 7
 
0.5%
2 7
 
0.5%
78.5×98.5cm 7
 
0.5%
전체길이 7
 
0.5%
56.1x46.8cm 6
 
0.5%
지름 6
 
0.5%
Other values (1106) 1205
93.1%

Most occurring characters

ValueCountFrequency (%)
. 1961
16.0%
m 1110
9.1%
c 1109
9.1%
× 925
 
7.6%
5 863
 
7.0%
2 854
 
7.0%
1 816
 
6.7%
3 708
 
5.8%
0 631
 
5.2%
4 604
 
4.9%
Other values (103) 2668
21.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6194
50.6%
Lowercase Letter 2342
 
19.1%
Other Punctuation 2007
 
16.4%
Math Symbol 925
 
7.6%
Other Letter 406
 
3.3%
Space Separator 260
 
2.1%
Close Punctuation 75
 
0.6%
Open Punctuation 32
 
0.3%
Dash Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
14.0%
27
 
6.7%
23
 
5.7%
20
 
4.9%
19
 
4.7%
18
 
4.4%
14
 
3.4%
14
 
3.4%
13
 
3.2%
12
 
3.0%
Other values (80) 189
46.6%
Decimal Number
ValueCountFrequency (%)
5 863
13.9%
2 854
13.8%
1 816
13.2%
3 708
11.4%
0 631
10.2%
4 604
9.8%
7 475
7.7%
8 467
7.5%
6 427
6.9%
9 349
5.6%
Other Punctuation
ValueCountFrequency (%)
. 1961
97.7%
, 45
 
2.2%
: 1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
m 1110
47.4%
c 1109
47.4%
x 123
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 73
97.3%
] 2
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 30
93.8%
[ 2
 
6.2%
Math Symbol
ValueCountFrequency (%)
× 925
100.0%
Space Separator
ValueCountFrequency (%)
260
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9501
77.6%
Latin 2342
 
19.1%
Hangul 406
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
14.0%
27
 
6.7%
23
 
5.7%
20
 
4.9%
19
 
4.7%
18
 
4.4%
14
 
3.4%
14
 
3.4%
13
 
3.2%
12
 
3.0%
Other values (80) 189
46.6%
Common
ValueCountFrequency (%)
. 1961
20.6%
× 925
9.7%
5 863
9.1%
2 854
9.0%
1 816
8.6%
3 708
 
7.5%
0 631
 
6.6%
4 604
 
6.4%
7 475
 
5.0%
8 467
 
4.9%
Other values (10) 1197
12.6%
Latin
ValueCountFrequency (%)
m 1110
47.4%
c 1109
47.4%
x 123
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10918
89.1%
None 925
 
7.6%
Hangul 406
 
3.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1961
18.0%
m 1110
10.2%
c 1109
10.2%
5 863
7.9%
2 854
7.8%
1 816
7.5%
3 708
 
6.5%
0 631
 
5.8%
4 604
 
5.5%
7 475
 
4.4%
Other values (12) 1787
16.4%
None
ValueCountFrequency (%)
× 925
100.0%
Hangul
ValueCountFrequency (%)
57
 
14.0%
27
 
6.7%
23
 
5.7%
20
 
4.9%
19
 
4.7%
18
 
4.4%
14
 
3.4%
14
 
3.4%
13
 
3.2%
12
 
3.0%
Other values (80) 189
46.6%

출토지/소장자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct17
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
<NA>
921 
조용중 기증
 
68
경북 상주시 모서면 호음리 일원
 
29
이상무 기증
 
4
김행일 기증
 
4
Other values (12)
 
14

Length

Max length17
Median length4
Mean length4.5711538
Min length4

Unique

Unique10 ?
Unique (%)1.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 921
88.6%
조용중 기증 68
 
6.5%
경북 상주시 모서면 호음리 일원 29
 
2.8%
이상무 기증 4
 
0.4%
김행일 기증 4
 
0.4%
정춘목 기증 2
 
0.2%
경북 상주시 개운동 일원 2
 
0.2%
권기순 기증 1
 
0.1%
김주진 기증 1
 
0.1%
김경락 기증 1
 
0.1%
Other values (7) 7
 
0.7%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
na 921
73.4%
기증 86
 
6.9%
조용중 68
 
5.4%
경북 33
 
2.6%
상주시 33
 
2.6%
일원 33
 
2.6%
모서면 29
 
2.3%
호음리 29
 
2.3%
이상무 4
 
0.3%
김행일 4
 
0.3%
Other values (13) 15
 
1.2%

문화재지정
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
<NA>
793 
X
179 
보물 1004호
 
61
보물 1003호
 
7

Length

Max length8
Median length4
Mean length3.7451923
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 793 | 76.2%
X | 179 | 17.2%
보물 1004호 | 61 | 5.9%
보물 1003호 | 7 | 0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 793
71.6%
x 179
 
16.2%
보물 68
 
6.1%
1004호 61
 
5.5%
1003호 7
 
0.6%

유물설명
Text

MISSING 

Distinct244
Distinct (%)98.0%
Missing791
Missing (%)76.1%
Memory size8.3 KiB

Length

Max length631
Median length140
Mean length97.51004
Min length7

Characters and Unicode

Total characters24280
Distinct characters941
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)96.4%

Sample

1st row일제강점기에 엄선하여 제작한 회엽서로 당시의 경상북도 상주수비대의 실경모습으로 전체적으로 엽서의 보존상태는 양호?
2nd row일제강점기에 엄선하여 제작한 회엽서로 당시의 경상북도 상주성내 시가의 실경모습으로 전체적으로 엽서의 보존상태는 양호?
3rd row일제강점기에 엄선하여 제작한 회엽서로 당시의 경상북도 상주구 재판소의 실경모습으로 전체적으로 엽서의 보존상태는 양호?
4th row일제강점기에 엄선하여 제작한 회엽서로 당시의 경상북도 상주성 남문의 실경모습으로 전체적으로 엽서의 보존상태는 양호?
5th row일제강점기에 엄선하여 제작한 회엽서로 당시의 경상북도 상주성 북문의 실경모습으로 전체적으로 엽서의 보존상태는 양호?
ValueCountFrequency (%)
있다 170
 
3.1%
있으며 108
 
2.0%
조금 77
 
1.4%
상태는 62
 
1.1%
있고 49
 
0.9%
명하는 45
 
0.8%
양호한 45
 
0.8%
되어 37
 
0.7%
부분에 35
 
0.6%
접히는 34
 
0.6%
Other values (2280) 4845
88.0%

Most occurring characters

ValueCountFrequency (%)
5258
 
21.7%
695
 
2.9%
. 503
 
2.1%
478
 
2.0%
471
 
1.9%
446
 
1.8%
415
 
1.7%
377
 
1.6%
368
 
1.5%
361
 
1.5%
Other values (931) 14908
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17497
72.1%
Space Separator 5258
 
21.7%
Other Punctuation 760
 
3.1%
Decimal Number 477
 
2.0%
Close Punctuation 90
 
0.4%
Open Punctuation 86
 
0.4%
Lowercase Letter 44
 
0.2%
Dash Punctuation 37
 
0.2%
Uppercase Letter 10
 
< 0.1%
Modifier Symbol 7
 
< 0.1%
Other values (3) 14
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
695
 
4.0%
478
 
2.7%
471
 
2.7%
446
 
2.5%
415
 
2.4%
377
 
2.2%
368
 
2.1%
361
 
2.1%
293
 
1.7%
271
 
1.5%
Other values (896) 13322
76.1%
Decimal Number
ValueCountFrequency (%)
1 89
18.7%
2 63
13.2%
6 55
11.5%
5 55
11.5%
3 51
10.7%
9 47
9.9%
4 42
8.8%
8 39
8.2%
7 24
 
5.0%
0 12
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
V 4
40.0%
S 1
 
10.0%
O 1
 
10.0%
C 1
 
10.0%
T 1
 
10.0%
E 1
 
10.0%
U 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
. 503
66.2%
, 217
28.6%
? 19
 
2.5%
' 15
 
2.0%
/ 4
 
0.5%
· 2
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
m 20
45.5%
c 20
45.5%
g 2
 
4.5%
k 2
 
4.5%
Space Separator
ValueCountFrequency (%)
5258
100.0%
Close Punctuation
ValueCountFrequency (%)
) 90
100.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15734
64.8%
Common 6729
27.7%
Han 1763
 
7.3%
Latin 54
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
695
 
4.4%
478
 
3.0%
471
 
3.0%
446
 
2.8%
415
 
2.6%
377
 
2.4%
368
 
2.3%
361
 
2.3%
293
 
1.9%
271
 
1.7%
Other values (561) 11559
73.5%
Han
ValueCountFrequency (%)
104
 
5.9%
53
 
3.0%
49
 
2.8%
48
 
2.7%
47
 
2.7%
44
 
2.5%
43
 
2.4%
40
 
2.3%
38
 
2.2%
37
 
2.1%
Other values (325) 1260
71.5%
Common
ValueCountFrequency (%)
5258
78.1%
. 503
 
7.5%
, 217
 
3.2%
) 90
 
1.3%
1 89
 
1.3%
( 86
 
1.3%
2 63
 
0.9%
6 55
 
0.8%
5 55
 
0.8%
3 51
 
0.8%
Other values (14) 262
 
3.9%
Latin
ValueCountFrequency (%)
m 20
37.0%
c 20
37.0%
V 4
 
7.4%
g 2
 
3.7%
k 2
 
3.7%
S 1
 
1.9%
O 1
 
1.9%
C 1
 
1.9%
T 1
 
1.9%
E 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15723
64.8%
ASCII 6773
27.9%
CJK 1731
 
7.1%
CJK Compat Ideographs 32
 
0.1%
Compat Jamo 11
 
< 0.1%
Punctuation 8
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5258
77.6%
. 503
 
7.4%
, 217
 
3.2%
) 90
 
1.3%
1 89
 
1.3%
( 86
 
1.3%
2 63
 
0.9%
6 55
 
0.8%
5 55
 
0.8%
3 51
 
0.8%
Other values (22) 306
 
4.5%
Hangul
ValueCountFrequency (%)
695
 
4.4%
478
 
3.0%
471
 
3.0%
446
 
2.8%
415
 
2.6%
377
 
2.4%
368
 
2.3%
361
 
2.3%
293
 
1.9%
271
 
1.7%
Other values (560) 11548
73.4%
CJK
ValueCountFrequency (%)
104
 
6.0%
53
 
3.1%
49
 
2.8%
48
 
2.8%
47
 
2.7%
44
 
2.5%
43
 
2.5%
40
 
2.3%
38
 
2.2%
37
 
2.1%
Other values (313) 1228
70.9%
Compat Jamo
ValueCountFrequency (%)
11
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
10
31.2%
4
 
12.5%
4
 
12.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (2) 2
 
6.2%
Punctuation
ValueCountFrequency (%)
4
50.0%
4
50.0%
None
ValueCountFrequency (%)
· 2
100.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
2017-10-19
1040 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017-10-19
2nd row2017-10-19
3rd row2017-10-19
4th row2017-10-19
5th row2017-10-19

Common Values

ValueCountFrequency (%)
2017-10-19 1040
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2017-10-19 1040
100.0%

Interactions


Correlations

 | 번호 | 주수량 | 시대 | 장르 | 재질 | 출토지/소장자 | 문화재지정
번호 | 1.000 | 0.137 | 0.616 | 0.663 | 0.607 | 0.880 | 0.839
주수량 | 0.137 | 1.000 | 0.044 | 0.000 | 0.000 | NaN | 0.000
시대 | 0.616 | 0.044 | 1.000 | 0.792 | 0.789 | 0.971 | 0.589
장르 | 0.663 | 0.000 | 0.792 | 1.000 | 0.847 | 0.936 | 0.857
재질 | 0.607 | 0.000 | 0.789 | 0.847 | 1.000 | 0.992 | 0.797
출토지/소장자 | 0.880 | NaN | 0.971 | 0.936 | 0.992 | 1.000 | 0.813
문화재지정 | 0.839 | 0.000 | 0.589 | 0.857 | 0.797 | 0.813 | 1.000

 | 출토지/소장자 | 재질 | 시대 | 장르 | 문화재지정
출토지/소장자 | 1.000 | 0.823 | 0.863 | 0.761 | 0.616
재질 | 0.823 | 1.000 | 0.488 | 0.623 | 0.497
시대 | 0.863 | 0.488 | 1.000 | 0.534 | 0.427
장르 | 0.761 | 0.623 | 0.534 | 1.000 | 0.804
문화재지정 | 0.616 | 0.497 | 0.427 | 0.804 | 1.000

 | 번호 | 주수량 | 시대 | 장르 | 재질 | 출토지/소장자 | 문화재지정
번호 | 1.000 | -0.069 | 0.318 | 0.393 | 0.226 | 0.715 | 0.516
주수량 | -0.069 | 1.000 | 0.021 | 0.000 | 0.000 | 1.000 | 0.000
시대 | 0.318 | 0.021 | 1.000 | 0.534 | 0.488 | 0.863 | 0.427
장르 | 0.393 | 0.000 | 0.534 | 1.000 | 0.623 | 0.761 | 0.804
재질 | 0.226 | 0.000 | 0.488 | 0.623 | 1.000 | 0.823 | 0.497
출토지/소장자 | 0.715 | 1.000 | 0.863 | 0.761 | 0.823 | 1.000 | 0.616
문화재지정 | 0.516 | 0.000 | 0.427 | 0.804 | 0.497 | 0.616 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

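First rows

 | 번호 | 유물명 | 주수량 | 시대 | 장르 | 재질 | 크기 | 출토지/소장자 | 문화재지정 | 유물설명 | 데이터기준일
0 | 1 | 휘찬려사 彙纂麗史 (1)~(23) | 23 | 기타 | 고서 |  | 27.6×19.8cm | <NA> | <NA> | <NA> | 2017-10-19
1 | 2 | 후집 後集 (1) ~(5) | 5 | 기타 | 고서 |  | 28.3×19.2cm | <NA> | <NA> | <NA> | 2017-10-19
2 | 3 | 효자공실록부양리 공문집 孝子公實錄附陽里 公文集 (1)~(2) | 2 | 기타 | 고서 |  | 25.9×18.6cm | <NA> | <NA> | <NA> | 2017-10-19
3 | 4 | 함창향교교지 咸昌鄕校校誌 | 1 | 기타 | 고서 |  | 26.5 × 19.0cm | <NA> | <NA> | <NA> | 2017-10-19
4 | 5 | 학용요의변정록 學庸要義卞正錄 | 1 | 기타 | 고서 |  | 29.1 × 18.5cm | <NA> | <NA> | <NA> | 2017-10-19
5 | 6 | 청구풍아 靑邱風雅 | 1 | 기타 | 고서 |  | 27.0×17.6cm | <NA> | <NA> | <NA> | 2017-10-19
6 | 7 | 징비록 懲毖錄 (1)~(6) | 6 | 기타 | 고서 |  | 29.6 × 20.0cm | <NA> | <NA> | <NA> | 2017-10-19
7 | 8 | 주자봉사 朱子封事 | 1 | 기타 | 고서 |  | 30.5 × 21cm | <NA> | <NA> | <NA> | 2017-10-19
8 | 9 | 주역 周易 (1) ~(13) | 13 | 기타 | 고서 |  | 31.6 × 20.4cm | <NA> | <NA> | <NA> | 2017-10-19
9 | 10 | 주서백선 朱書百選 (1)~(2) | 2 | 기타 | 고서 |  | 32.5 × 22.0cm | <NA> | <NA> | <NA> | 2017-10-19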

Last rows

 | 번호 | 유물명 | 주수량 | 시대 | 장르 | 재질 | 크기 | 출토지/소장자 | 문화재지정 | 유물설명 | 데이터기준일
1030 | 1029 | 족두리 | 1 | 근/현대 | 민속품 | 사직 | 7cm | <NA> | X | 혼례식때 신부의 머리 위에 쓰던 족두리로 끈이 달려있다. 족두리 꾸밈은 잘 남아 있다. | 2017-10-19
1031 | 1030 | 목기러기 | 1 | 근/현대 | 민속품 | 목재 | 12×26cm | <NA> | X | 혼례상에 올려놓는 기러기로 몸통과 머리 부분을 따로 만들어 붙인 것이다. 간단 모양을 만든 후 색을 칠하였다. | 2017-10-19
1032 | 1031 | 혼수함 | 1 | 근/현대 | 민속품 | 목재 | 48×25×12.5cm | <NA> | X | 나무 합판으로 만든 다음 종이로 도배한 것으로 현재 상태는 도배한 종이가 일부분 떨어져 나갔고 윗부분이 조금 휘어졌다. 안쪽도 도배를 하였다. 함 내부에 여자 원삼이 들어있는데 내용물은 상의,족두리 꾸밈이 있다. | 2017-10-19
1033 | 1032 | 손저울 | 1 | 근/현대 | 민속품 | 금속 | 32cm | <NA> | X | 최대 측량 22kg까지 측정할 수 있는 휴대용 손저울이다. 이래쪽은 물건을 걸 수 있는 갈고리가 있고, 중간부분은 무게를 표시했는데 좌측은 貫, 우측은 kg을 표시하였다. 상단은 아치형태의 손잡이가 있다. 전체적으로 녹이 조금 쓸었을 뿐 양호하다. | 2017-10-19
1034 | 1033 | 놋재털이 | 1 | 근/현대 | 민속품 | 금속 | 2×15×18cm | <NA> | X | 유기로 만든 재떨이로 계속 사용하였던 흔적이 남아 있다. 제작시 기는 그리 오래 되지 않았던 것으로 보이는데 재떨이 둘레에 `祝壽宴' 이라는 글과 아래쪽에 `三三楔員一同'이라는 글을 새겨놓았다. | 2017-10-19
1035 | 1034 | 주판 | 1 | 근/현대 | 민속품 | 목재 | 22×7×1.5cm | <NA> | X | 주 재료를 나무로 하고 5와 1단위 구분을 플라스틱으로 구분하였다. 전체 주판알은 15열을 이루고 있다. | 2017-10-19
1036 | 1035 | 놋수저 | 2 | 근/현대 | 민속품 | 금속 | 21cm | <NA> | X | 2쌍으로 이루어진 숟가락,젓가락 세트로 상태는 양호하다. 수저의 모양은 현대에 쓰는 수저 모양과 거의 비슷하면 근대까지 사용한 것으로 보인다. | 2017-10-19
1037 | 1036 | 청동젓가락 | 3 | 고려 | 민속품 | 금속 | 26cm | <NA> | X | 청동제 젓가락으로 상태는 비교적 양호하다. 3개중 2개는 한 쌍을 이루고 있다. 나머지 하나는 상주36과 한 벌을 이루고 있는 것인데, 상주37-1의 젓가락에는 줄선과 점선 그리고 지느러미 모양이 세겨져 있다. | 2017-10-19
1038 | 1037 | 청동숟가락 | 1 | 고려 | 민속품 | 금속 | 25cm | <NA> | X | 청동의 재료로 만든 숟가락으로 목 부분이 떨어진 것을 붙인 것이다. 몸체에 비해 숟가락 머리가 크고 길게 표현되었다. 전체적인 모습은 누운 S자 형태이다. 숟가락 끝부분도 크게 반전되어 물고기 꼬리지느러미 문양을 표현하고 있다. | 2017-10-19
1039 | 1038 | 장경호 | 1 | 통일신라 | 공예 | 토제 | 29×15×15cm | <NA> | X | 굽다리가 있는 장경호로서 전체높이 29cm에서 굽다리 높이 5cm, 목길이 6cm이다. 목은 위로 갈수록 밖으로 벌어진 형태이며 어깨와 몸통부분에서 각이 꺽여 몸통으로 이어지는데, 어깨와 몸통의 경계에 요철선이 둘러져 있다. 굽다리에는 5개의 구멍이 있고 몸통부분이 깨진 것을 붙인 것이며 몸과 배 일부분에 검색 유약이 뭉쳐져 있다. | 2017-10-19