Overview

Dataset statistics

Number of variables10
Number of observations1145
Missing cells22
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory90.7 KiB
Average record size in memory81.1 B

Variable types

Numeric1
Text5
Categorical3
DateTime1

Dataset

Description제주특별자치도에서 관리하는 각종 기록물에 관련한 데이터로 고유번호, 제목, 내용, 유형, 수량, 규격, 재질등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/3067447/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
수량(권_건_점) is highly imbalanced (65.8%)Imbalance
재질 is highly imbalanced (79.8%)Imbalance
생산정보(년도) has 13 (1.1%) missing valuesMissing
일련번호 has unique valuesUnique
고유번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:25:46.010301
Analysis finished2023-12-12 22:25:47.467941
Duration1.46 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct1145
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean573
Minimum1
Maximum1145
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.2 KiB
2023-12-13T07:25:47.532729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile58.2
Q1287
median573
Q3859
95-th percentile1087.8
Maximum1145
Range1144
Interquartile range (IQR)572

Descriptive statistics

Standard deviation330.67734
Coefficient of variation (CV)0.57709832
Kurtosis-1.2
Mean573
Median Absolute Deviation (MAD)286
Skewness0
Sum656085
Variance109347.5
MonotonicityStrictly increasing
2023-12-13T07:25:47.648163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
762 1
 
0.1%
768 1
 
0.1%
767 1
 
0.1%
766 1
 
0.1%
765 1
 
0.1%
764 1
 
0.1%
763 1
 
0.1%
761 1
 
0.1%
770 1
 
0.1%
Other values (1135) 1135
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1145 1
0.1%
1144 1
0.1%
1143 1
0.1%
1142 1
0.1%
1141 1
0.1%
1140 1
0.1%
1139 1
0.1%
1138 1
0.1%
1137 1
0.1%
1136 1
0.1%

고유번호
Text

UNIQUE 

Distinct1145
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2023-12-13T07:25:47.932004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.9624454
Min length5

Characters and Unicode

Total characters7972
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1145 ?
Unique (%)100.0%

Sample

1st row001-001
2nd row001-002
3rd row001-003
4th row001-004
5th row001-005
ValueCountFrequency (%)
001-001 1
 
0.1%
156-018 1
 
0.1%
157-004 1
 
0.1%
157-003 1
 
0.1%
157-002 1
 
0.1%
157-001 1
 
0.1%
156-020 1
 
0.1%
157-008 1
 
0.1%
156-017 1
 
0.1%
157-006 1
 
0.1%
Other values (1135) 1135
99.1%
2023-12-13T07:25:48.343043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2801
35.1%
1 1210
15.2%
- 1146
14.4%
2 591
 
7.4%
3 443
 
5.6%
4 356
 
4.5%
6 308
 
3.9%
5 307
 
3.9%
9 300
 
3.8%
8 263
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6826
85.6%
Dash Punctuation 1146
 
14.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2801
41.0%
1 1210
17.7%
2 591
 
8.7%
3 443
 
6.5%
4 356
 
5.2%
6 308
 
4.5%
5 307
 
4.5%
9 300
 
4.4%
8 263
 
3.9%
7 247
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 1146
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7972
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2801
35.1%
1 1210
15.2%
- 1146
14.4%
2 591
 
7.4%
3 443
 
5.6%
4 356
 
4.5%
6 308
 
3.9%
5 307
 
3.9%
9 300
 
3.8%
8 263
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7972
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2801
35.1%
1 1210
15.2%
- 1146
14.4%
2 591
 
7.4%
3 443
 
5.6%
4 356
 
4.5%
6 308
 
3.9%
5 307
 
3.9%
9 300
 
3.8%
8 263
 
3.3%

제목
Text

Distinct928
Distinct (%)81.0%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2023-12-13T07:25:48.603727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length26
Mean length9.4681223
Min length1

Characters and Unicode

Total characters10841
Distinct characters534
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique862 ?
Unique (%)75.3%

Sample

1st row가례증해 전10권
2nd row가례증해 전10권
3rd row주역 전14권
4th row주역언해 전7권
5th row맹자
ValueCountFrequency (%)
사진 66
 
2.8%
개인사진 60
 
2.5%
표창장 29
 
1.2%
사본 27
 
1.1%
졸업앨범 21
 
0.9%
20
 
0.8%
수료증 20
 
0.8%
상장 20
 
0.8%
임명장 18
 
0.8%
신문 17
 
0.7%
Other values (1492) 2070
87.4%
2023-12-13T07:25:48.980086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1223
 
11.3%
399
 
3.7%
233
 
2.1%
225
 
2.1%
217
 
2.0%
178
 
1.6%
157
 
1.4%
157
 
1.4%
) 148
 
1.4%
( 148
 
1.4%
Other values (524) 7756
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8716
80.4%
Space Separator 1223
 
11.3%
Decimal Number 447
 
4.1%
Close Punctuation 148
 
1.4%
Open Punctuation 148
 
1.4%
Uppercase Letter 52
 
0.5%
Dash Punctuation 49
 
0.5%
Other Punctuation 42
 
0.4%
Math Symbol 12
 
0.1%
Letter Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
399
 
4.6%
233
 
2.7%
225
 
2.6%
217
 
2.5%
178
 
2.0%
157
 
1.8%
157
 
1.8%
142
 
1.6%
141
 
1.6%
140
 
1.6%
Other values (482) 6727
77.2%
Uppercase Letter
ValueCountFrequency (%)
C 7
13.5%
R 5
9.6%
E 5
9.6%
B 5
9.6%
A 5
9.6%
I 5
9.6%
O 4
7.7%
J 3
5.8%
K 3
5.8%
S 3
5.8%
Other values (5) 7
13.5%
Decimal Number
ValueCountFrequency (%)
1 90
20.1%
0 87
19.5%
2 64
14.3%
9 49
11.0%
3 33
 
7.4%
4 33
 
7.4%
8 27
 
6.0%
6 26
 
5.8%
7 20
 
4.5%
5 18
 
4.0%
Other Punctuation
ValueCountFrequency (%)
· 23
54.8%
. 10
23.8%
' 3
 
7.1%
? 2
 
4.8%
" 2
 
4.8%
! 1
 
2.4%
/ 1
 
2.4%
Math Symbol
ValueCountFrequency (%)
< 4
33.3%
> 4
33.3%
~ 4
33.3%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
1223
100.0%
Close Punctuation
ValueCountFrequency (%)
) 148
100.0%
Open Punctuation
ValueCountFrequency (%)
( 148
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Modifier Symbol
ValueCountFrequency (%)
˙ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8695
80.2%
Common 2070
 
19.1%
Latin 55
 
0.5%
Han 21
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
399
 
4.6%
233
 
2.7%
225
 
2.6%
217
 
2.5%
178
 
2.0%
157
 
1.8%
157
 
1.8%
142
 
1.6%
141
 
1.6%
140
 
1.6%
Other values (464) 6706
77.1%
Common
ValueCountFrequency (%)
1223
59.1%
) 148
 
7.1%
( 148
 
7.1%
1 90
 
4.3%
0 87
 
4.2%
2 64
 
3.1%
9 49
 
2.4%
- 49
 
2.4%
3 33
 
1.6%
4 33
 
1.6%
Other values (15) 146
 
7.1%
Han
ValueCountFrequency (%)
2
 
9.5%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (8) 8
38.1%
Latin
ValueCountFrequency (%)
C 7
12.7%
R 5
9.1%
E 5
9.1%
B 5
9.1%
A 5
9.1%
I 5
9.1%
O 4
 
7.3%
J 3
 
5.5%
K 3
 
5.5%
S 3
 
5.5%
Other values (7) 10
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8695
80.2%
ASCII 2098
 
19.4%
None 23
 
0.2%
CJK 18
 
0.2%
Number Forms 3
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Modifier Letters 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1223
58.3%
) 148
 
7.1%
( 148
 
7.1%
1 90
 
4.3%
0 87
 
4.1%
2 64
 
3.1%
9 49
 
2.3%
- 49
 
2.3%
3 33
 
1.6%
4 33
 
1.6%
Other values (28) 174
 
8.3%
Hangul
ValueCountFrequency (%)
399
 
4.6%
233
 
2.7%
225
 
2.6%
217
 
2.5%
178
 
2.0%
157
 
1.8%
157
 
1.8%
142
 
1.6%
141
 
1.6%
140
 
1.6%
Other values (464) 6706
77.1%
None
ValueCountFrequency (%)
· 23
100.0%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Modifier Letters
ValueCountFrequency (%)
˙ 1
100.0%

내용
Text

Distinct1087
Distinct (%)95.2%
Missing3
Missing (%)0.3%
Memory size9.1 KiB
2023-12-13T07:25:49.280933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length200
Median length110
Mean length28.505254
Min length1

Characters and Unicode

Total characters32553
Distinct characters728
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1063 ?
Unique (%)93.1%

Sample

1st row가례증해 목록(10권)과 1권부터 9권까지. 2권 통례·관례·계례·혼례. 3권 상례(초종·습). 4권 상례(소렴·대렴·성복). 5권 상례(성복·조석곡존상식·조존부·문상분상). 6권 상례(치규·천구·유존·발인·하관·제주·성분·반곡·여묘). 7권 상례(우제·졸곡·부제·소상). 8권 상례(대상·선제·길제·개규). 9권 제례(사시제·초조제·예제·기제)
2nd row가례증해 목록과 1권부터 9권까지.
3rd row주역전집·주역1권-13권까지 (주역 4권은 편집·도판·조형 형태가 나머지 책자와 다름)
4th row주역언해 개별 권으로 1권·2권·4권이 있고·5-6권 합권·7-9권 합권·9-11권 합권·12-13권 합권. 제목에 표기 오류가 있는 것이 있음. 마지막 권에는 경진신간 내각장판 표시로 마지막 권에 쓰는 출처를 명시하고 있음.
5th row총 14권. 맹자1권-맹자7권 (맹자집주서설·맹자집주대전권지2-권지14까지 포함하고 있으며 누락된 부분 있으며·마지막 7권에는 경진신간 내각장판 표시로 마지막 권에 쓰는 출처 명시하고 있음)·맹자 2권-7권. 1권은 누락되었고·4권의 경우 표시는 없지만 내용상 4권으로 확인됨. 맹자7권은 동일한 내용의 권으로 확인되는 2권이 존재하나 한 권은 훼손이 심함.
ValueCountFrequency (%)
사진 102
 
1.6%
90
 
1.4%
제주도 36
 
0.6%
있음 30
 
0.5%
수여 29
 
0.5%
대한 23
 
0.4%
기념 23
 
0.4%
22
 
0.4%
대통령 21
 
0.3%
자료 19
 
0.3%
Other values (4172) 5868
93.7%
2023-12-13T07:25:49.729376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5121
 
15.7%
596
 
1.8%
· 587
 
1.8%
1 565
 
1.7%
505
 
1.6%
431
 
1.3%
427
 
1.3%
9 393
 
1.2%
2 380
 
1.2%
366
 
1.1%
Other values (718) 23182
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22442
68.9%
Space Separator 5121
 
15.7%
Decimal Number 2744
 
8.4%
Other Punctuation 1037
 
3.2%
Open Punctuation 349
 
1.1%
Close Punctuation 345
 
1.1%
Dash Punctuation 188
 
0.6%
Lowercase Letter 164
 
0.5%
Uppercase Letter 145
 
0.4%
Math Symbol 11
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
596
 
2.7%
505
 
2.3%
431
 
1.9%
427
 
1.9%
366
 
1.6%
348
 
1.6%
333
 
1.5%
330
 
1.5%
322
 
1.4%
304
 
1.4%
Other values (644) 18480
82.3%
Uppercase Letter
ValueCountFrequency (%)
O 14
 
9.7%
C 12
 
8.3%
R 12
 
8.3%
E 11
 
7.6%
B 10
 
6.9%
U 9
 
6.2%
S 9
 
6.2%
D 8
 
5.5%
A 8
 
5.5%
T 7
 
4.8%
Other values (12) 45
31.0%
Lowercase Letter
ValueCountFrequency (%)
i 20
12.2%
e 19
11.6%
n 18
11.0%
t 15
9.1%
o 13
 
7.9%
s 12
 
7.3%
r 11
 
6.7%
a 9
 
5.5%
h 8
 
4.9%
c 5
 
3.0%
Other values (10) 34
20.7%
Other Punctuation
ValueCountFrequency (%)
· 587
56.6%
. 357
34.4%
' 52
 
5.0%
/ 19
 
1.8%
" 8
 
0.8%
: 4
 
0.4%
* 4
 
0.4%
3
 
0.3%
% 2
 
0.2%
? 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 565
20.6%
9 393
14.3%
2 380
13.8%
0 342
12.5%
8 209
 
7.6%
7 192
 
7.0%
3 185
 
6.7%
6 173
 
6.3%
4 164
 
6.0%
5 141
 
5.1%
Math Symbol
ValueCountFrequency (%)
~ 7
63.6%
< 2
 
18.2%
> 2
 
18.2%
Open Punctuation
ValueCountFrequency (%)
( 348
99.7%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 344
99.7%
1
 
0.3%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
5121
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 188
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22397
68.8%
Common 9795
30.1%
Latin 312
 
1.0%
Han 49
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
596
 
2.7%
505
 
2.3%
431
 
1.9%
427
 
1.9%
366
 
1.6%
348
 
1.6%
333
 
1.5%
330
 
1.5%
322
 
1.4%
304
 
1.4%
Other values (603) 18435
82.3%
Latin
ValueCountFrequency (%)
i 20
 
6.4%
e 19
 
6.1%
n 18
 
5.8%
t 15
 
4.8%
O 14
 
4.5%
o 13
 
4.2%
C 12
 
3.8%
R 12
 
3.8%
s 12
 
3.8%
r 11
 
3.5%
Other values (34) 166
53.2%
Han
ValueCountFrequency (%)
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
1
 
2.0%
1
 
2.0%
1
 
2.0%
1
 
2.0%
Other values (32) 32
65.3%
Common
ValueCountFrequency (%)
5121
52.3%
· 587
 
6.0%
1 565
 
5.8%
9 393
 
4.0%
2 380
 
3.9%
. 357
 
3.6%
( 348
 
3.6%
) 344
 
3.5%
0 342
 
3.5%
8 209
 
2.1%
Other values (19) 1149
 
11.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22390
68.8%
ASCII 9512
29.2%
None 593
 
1.8%
CJK 45
 
0.1%
CJK Compat Ideographs 4
 
< 0.1%
Punctuation 3
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5121
53.8%
1 565
 
5.9%
9 393
 
4.1%
2 380
 
4.0%
. 357
 
3.8%
( 348
 
3.7%
) 344
 
3.6%
0 342
 
3.6%
8 209
 
2.2%
7 192
 
2.0%
Other values (57) 1261
 
13.3%
Hangul
ValueCountFrequency (%)
596
 
2.7%
505
 
2.3%
431
 
1.9%
427
 
1.9%
366
 
1.6%
348
 
1.6%
333
 
1.5%
330
 
1.5%
322
 
1.4%
304
 
1.4%
Other values (600) 18428
82.3%
None
ValueCountFrequency (%)
· 587
99.0%
4
 
0.7%
1
 
0.2%
1
 
0.2%
Punctuation
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (29) 29
64.4%
CJK Compat Ideographs
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

생산정보(년도)
Text

MISSING 

Distinct360
Distinct (%)31.8%
Missing13
Missing (%)1.1%
Memory size9.1 KiB
2023-12-13T07:25:49.958948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length4
Mean length5.0954064
Min length1

Characters and Unicode

Total characters5768
Distinct characters70
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)22.5%

Sample

1st row미상
2nd row미상
3rd row미상
4th row미상
5th row미상
ValueCountFrequency (%)
미상 81
 
6.9%
1988 24
 
2.0%
1990 24
 
2.0%
1980 24
 
2.0%
1970년대 22
 
1.9%
1986 20
 
1.7%
1985 20
 
1.7%
1991 19
 
1.6%
1982 19
 
1.6%
1989 19
 
1.6%
Other values (358) 904
76.9%
2023-12-13T07:25:50.304045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 1314
22.8%
1 1159
20.1%
0 607
10.5%
8 418
 
7.2%
2 316
 
5.5%
6 267
 
4.6%
7 267
 
4.6%
5 231
 
4.0%
177
 
3.1%
4 158
 
2.7%
Other values (60) 854
14.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4836
83.8%
Other Letter 638
 
11.1%
Dash Punctuation 143
 
2.5%
Space Separator 44
 
0.8%
Math Symbol 41
 
0.7%
Other Punctuation 32
 
0.6%
Open Punctuation 17
 
0.3%
Close Punctuation 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
177
27.7%
114
17.9%
88
13.8%
88
13.8%
14
 
2.2%
14
 
2.2%
11
 
1.7%
10
 
1.6%
7
 
1.1%
7
 
1.1%
Other values (41) 108
16.9%
Decimal Number
ValueCountFrequency (%)
9 1314
27.2%
1 1159
24.0%
0 607
12.6%
8 418
 
8.6%
2 316
 
6.5%
6 267
 
5.5%
7 267
 
5.5%
5 231
 
4.8%
4 158
 
3.3%
3 99
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 21
65.6%
. 5
 
15.6%
· 3
 
9.4%
' 3
 
9.4%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Math Symbol
ValueCountFrequency (%)
~ 41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5130
88.9%
Hangul 638
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
177
27.7%
114
17.9%
88
13.8%
88
13.8%
14
 
2.2%
14
 
2.2%
11
 
1.7%
10
 
1.6%
7
 
1.1%
7
 
1.1%
Other values (41) 108
16.9%
Common
ValueCountFrequency (%)
9 1314
25.6%
1 1159
22.6%
0 607
11.8%
8 418
 
8.1%
2 316
 
6.2%
6 267
 
5.2%
7 267
 
5.2%
5 231
 
4.5%
4 158
 
3.1%
- 143
 
2.8%
Other values (9) 250
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5127
88.9%
Hangul 638
 
11.1%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 1314
25.6%
1 1159
22.6%
0 607
11.8%
8 418
 
8.2%
2 316
 
6.2%
6 267
 
5.2%
7 267
 
5.2%
5 231
 
4.5%
4 158
 
3.1%
- 143
 
2.8%
Other values (8) 247
 
4.8%
Hangul
ValueCountFrequency (%)
177
27.7%
114
17.9%
88
13.8%
88
13.8%
14
 
2.2%
14
 
2.2%
11
 
1.7%
10
 
1.6%
7
 
1.1%
7
 
1.1%
Other values (41) 108
16.9%
None
ValueCountFrequency (%)
· 3
100.0%

유형
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
문서류
587 
박물류
333 
시청각류
225 

Length

Max length4
Median length3
Mean length3.1965066
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row문서류
2nd row문서류
3rd row문서류
4th row문서류
5th row문서류

Common Values

ValueCountFrequency (%)
문서류 587
51.3%
박물류 333
29.1%
시청각류 225
 
19.7%

Length

2023-12-13T07:25:50.411094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:25:50.485040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문서류 587
51.3%
박물류 333
29.1%
시청각류 225
 
19.7%

수량(권_건_점)
Categorical

IMBALANCE 

Distinct41
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
1
840 
2
 
81
3
 
56
4
 
36
8
 
17
Other values (36)
115 

Length

Max length3
Median length1
Mean length1.060262
Min length1

Unique

Unique23 ?
Unique (%)2.0%

Sample

1st row10
2nd row9
3rd row14
4th row7
5th row15

Common Values

ValueCountFrequency (%)
1 840
73.4%
2 81
 
7.1%
3 56
 
4.9%
4 36
 
3.1%
8 17
 
1.5%
5 16
 
1.4%
6 13
 
1.1%
14 11
 
1.0%
10 10
 
0.9%
7 10
 
0.9%
Other values (31) 55
 
4.8%

Length

2023-12-13T07:25:50.575395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 840
73.4%
2 81
 
7.1%
3 56
 
4.9%
4 36
 
3.1%
8 17
 
1.5%
5 16
 
1.4%
6 13
 
1.1%
14 11
 
1.0%
10 10
 
0.9%
7 10
 
0.9%
Other values (31) 55
 
4.8%

규격
Text

Distinct505
Distinct (%)44.3%
Missing6
Missing (%)0.5%
Memory size9.1 KiB
2023-12-13T07:25:50.843641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length5
Mean length5.4064969
Min length1

Characters and Unicode

Total characters6158
Distinct characters25
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique353 ?
Unique (%)31.0%

Sample

1st row21*31
2nd row21*31
3rd row20*30
4th row21*31
5th row20*30
ValueCountFrequency (%)
21*30 81
 
7.0%
19*26 56
 
4.9%
20*30 24
 
2.1%
20*27 23
 
2.0%
19*27 21
 
1.8%
15*22 18
 
1.6%
13*19 16
 
1.4%
11*8 15
 
1.3%
20*26 14
 
1.2%
15*11 14
 
1.2%
Other values (505) 868
75.5%
2023-12-13T07:25:51.228670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 1165
18.9%
2 1089
17.7%
1 960
15.6%
5 531
8.6%
3 489
7.9%
0 432
 
7.0%
. 289
 
4.7%
9 279
 
4.5%
6 252
 
4.1%
8 227
 
3.7%
Other values (15) 445
 
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4656
75.6%
Other Punctuation 1454
 
23.6%
Other Letter 36
 
0.6%
Space Separator 11
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
30.6%
11
30.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%
Decimal Number
ValueCountFrequency (%)
2 1089
23.4%
1 960
20.6%
5 531
11.4%
3 489
10.5%
0 432
 
9.3%
9 279
 
6.0%
6 252
 
5.4%
8 227
 
4.9%
7 223
 
4.8%
4 174
 
3.7%
Other Punctuation
ValueCountFrequency (%)
* 1165
80.1%
. 289
 
19.9%
Space Separator
ValueCountFrequency (%)
11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6122
99.4%
Hangul 36
 
0.6%

Most frequent character per script

Common
ValueCountFrequency (%)
* 1165
19.0%
2 1089
17.8%
1 960
15.7%
5 531
8.7%
3 489
8.0%
0 432
 
7.1%
. 289
 
4.7%
9 279
 
4.6%
6 252
 
4.1%
8 227
 
3.7%
Other values (4) 409
 
6.7%
Hangul
ValueCountFrequency (%)
11
30.6%
11
30.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6122
99.4%
Hangul 36
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 1165
19.0%
2 1089
17.8%
1 960
15.7%
5 531
8.7%
3 489
8.0%
0 432
 
7.1%
. 289
 
4.7%
9 279
 
4.6%
6 252
 
4.1%
8 227
 
3.7%
Other values (4) 409
 
6.7%
Hangul
ValueCountFrequency (%)
11
30.6%
11
30.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%

재질
Categorical

IMBALANCE 

Distinct15
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
종이
1023 
금속
 
33
나무
 
33
플라스틱
 
17
유리
 
16
Other values (10)
 
23

Length

Max length5
Median length2
Mean length2.0419214
Min length1

Unique

Unique5 ?
Unique (%)0.4%

Sample

1st row종이
2nd row종이
3rd row종이
4th row종이
5th row종이

Common Values

ValueCountFrequency (%)
종이 1023
89.3%
금속 33
 
2.9%
나무 33
 
2.9%
플라스틱 17
 
1.5%
유리 16
 
1.4%
도자기 8
 
0.7%
섬유 3
 
0.3%
사진 3
 
0.3%
석재 2
 
0.2%
종이,금속 2
 
0.2%
Other values (5) 5
 
0.4%

Length

2023-12-13T07:25:51.349993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
종이 1023
89.3%
금속 33
 
2.9%
나무 33
 
2.9%
플라스틱 17
 
1.5%
유리 16
 
1.4%
도자기 8
 
0.7%
섬유 3
 
0.3%
사진 3
 
0.3%
석재 2
 
0.2%
종이,금속 2
 
0.2%
Other values (5) 5
 
0.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
Minimum2022-11-18 00:00:00
Maximum2022-11-18 00:00:00
2023-12-13T07:25:51.433990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:25:51.525988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T07:25:46.956830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:25:51.585441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호유형수량(권_건_점)재질
일련번호1.0000.4110.3610.267
유형0.4111.0000.3650.635
수량(권_건_점)0.3610.3651.0000.742
재질0.2670.6350.7421.000
2023-12-13T07:25:51.657932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형수량(권_건_점)재질
유형1.0000.1910.367
수량(권_건_점)0.1911.0000.305
재질0.3670.3051.000
2023-12-13T07:25:51.732575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호유형수량(권_건_점)재질
일련번호1.0000.2710.1300.102
유형0.2711.0000.1910.367
수량(권_건_점)0.1300.1911.0000.305
재질0.1020.3670.3051.000

Missing values

2023-12-13T07:25:47.091548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:25:47.292264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:25:47.394566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

일련번호고유번호제목내용생산정보(년도)유형수량(권_건_점)규격재질데이터기준일자
01001-001가례증해 전10권가례증해 목록(10권)과 1권부터 9권까지. 2권 통례·관례·계례·혼례. 3권 상례(초종·습). 4권 상례(소렴·대렴·성복). 5권 상례(성복·조석곡존상식·조존부·문상분상). 6권 상례(치규·천구·유존·발인·하관·제주·성분·반곡·여묘). 7권 상례(우제·졸곡·부제·소상). 8권 상례(대상·선제·길제·개규). 9권 제례(사시제·초조제·예제·기제)미상문서류1021*31종이2022-11-18
12001-002가례증해 전10권가례증해 목록과 1권부터 9권까지.미상문서류921*31종이2022-11-18
23001-003주역 전14권주역전집·주역1권-13권까지 (주역 4권은 편집·도판·조형 형태가 나머지 책자와 다름)미상문서류1420*30종이2022-11-18
34001-004주역언해 전7권주역언해 개별 권으로 1권·2권·4권이 있고·5-6권 합권·7-9권 합권·9-11권 합권·12-13권 합권. 제목에 표기 오류가 있는 것이 있음. 마지막 권에는 경진신간 내각장판 표시로 마지막 권에 쓰는 출처를 명시하고 있음.미상문서류721*31종이2022-11-18
45001-005맹자총 14권. 맹자1권-맹자7권 (맹자집주서설·맹자집주대전권지2-권지14까지 포함하고 있으며 누락된 부분 있으며·마지막 7권에는 경진신간 내각장판 표시로 마지막 권에 쓰는 출처 명시하고 있음)·맹자 2권-7권. 1권은 누락되었고·4권의 경우 표시는 없지만 내용상 4권으로 확인됨. 맹자7권은 동일한 내용의 권으로 확인되는 2권이 존재하나 한 권은 훼손이 심함.미상문서류1520*30종이2022-11-18
56001-006맹자언해총 15권. 맹자 또는 맹해라고 제목이 붙은 1권-7권으로 총 6권(5권 누락. 3권은 제목 없으나 내용상 3권으로 확인). 맹해·맹자해·언해맹자 등의 이름이 붙은 1권-7권으로 총 7권·다른 버전의 맹자언해5·6권 총 2권이 더 존재.미상문서류1520*30종이2022-11-18
67001-007중용· 중용언해총 3권. 중용전집(애월면책 직인이 찍힌 중용장구대전. 경진신간 내각장판 표시로 마지막 권에 쓰는 출처 명시)·중용언해(표지파손으로 제목 훼손. 애월면납읍리치첩 표기. 한 권은 계몽편 표기로 보아 중용 외에 여러 언해본 섞인 것으로 확인)미상문서류321*31종이2022-11-18
78001-008논어총 8권. 논어1-7권(3권 누락). 제목불명의 2권은 각 논어1권과 5권으로 확인(학이·위정편과 자로편이 확인됨)미상문서류820*30종이2022-11-18
89001-009논어언해총 6권. 논어언해 1편부터 20편에 해당하는 1권-7권(6-7편에 해당하는 3권이 누락). 각 권 마다 표지에 연필로 납읍이라는 표시가 있으며·7권에는 경진신간 내각장판 표시로 마지막 권에 쓰는 출처 명시되어 있음.미상문서류620*30종이2022-11-18
910001-010서전 전10권각 권 마다 서전대전권지1·2 등 표시가 있음. 서전 송두준 이름 표시와 함께 연필로 전명창·현재송·납읍·용창 등의 이름 또는 지명이 표기 되어 있음. 신우면장 인장 찍혀 있는 권(4권)이 있으며·경진신간 내각장판 표시로 마지막 권에 쓰는 출처 명시하고 있는 권(10권)도 있음.미상문서류1020*30종이2022-11-18
일련번호고유번호제목내용생산정보(년도)유형수량(권_건_점)규격재질데이터기준일자
11351136237-6상예2리지상예리 6대 이장이신 강정옥 님이 작성한 필사본 형태의 향토지1980년대문서류120*25종이2022-11-18
11361137237-7상예3리지상예리 7대 이장이신 강정옥 님이 작성한 필사본 형태의 향토지1981년문서류120*26종이2022-11-18
11371138237-8농가경영카드당시 농가의 경영상태를 파악할 수 있는 사료1966년문서류120*15종이2022-11-18
11381139237-9각서시멘트 45대에 따른 모래를 마을리장에게 인계하겠다는 각서1980년문서류115*8종이2022-11-18
11391140238-1아라중 체육과 교육·교수안기증자가 교사 재임시 사용했던 교육자료2000년문서류120*30종이2022-11-18
11401141238-2중등(3학년) 체육교육˙학습 과정안기증자가 교사 재임시 사용했던 교육자료2001년문서류120*30종이2022-11-18
11411142238-3정구지도안기증자가 애월중학교 재직시에 학생들에게 정구를 가르치위한 지도안1997년문서류120*30종이2022-11-18
11421143238-4열린학습과정안기증자가 애월중학교 재직시에 학생들에게 정구를 가르치위한 지도안1997년문서류120*30종이2022-11-18
11431144238-5청소년체조 도해본86년 대한체육회에서 제작한 청소년체조 도해본1986년문서류230*40종이2022-11-18
11441145238-6강강술래 사진87년 6월 한림여중 축제 때 기증자의 지도아래 한림체육광에서 한림여중 학생들이 강강술래 공연을 펼치고 있는 모습1987년시청각류411*8종이2022-11-18