Overview

Dataset statistics

Number of variables5
Number of observations1000
Missing cells51
Missing cells (%)1.0%
Duplicate rows16
Duplicate rows (%)1.6%
Total size in memory40.2 KiB
Average record size in memory41.1 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description전라남도 농업박물관에 보유중인 도서자료 입니다.(도서명, 저자 및 발행처, 소장년원일,소장경위, 발행연도 등)
URLhttps://www.data.go.kr/data/15041799/fileData.do

Alerts

Dataset has 16 (1.6%) duplicate rowsDuplicates
발행연도 is highly overall correlated with 소장경위High correlation
소장경위 is highly overall correlated with 발행연도High correlation
발행연도 has 37 (3.7%) missing valuesMissing
저자 및 발행처 has 14 (1.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:36:19.392642
Analysis finished2023-12-12 03:36:20.742387
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소장경위
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
죽암도서
735 
기증도서
166 
구입도서
98 
기타
 
1

Length

Max length4
Median length4
Mean length3.998
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row기증도서
2nd row기증도서
3rd row기타
4th row기증도서
5th row기증도서

Common Values

ValueCountFrequency (%)
죽암도서 735
73.5%
기증도서 166
 
16.6%
구입도서 98
 
9.8%
기타 1
 
0.1%

Length

2023-12-12T12:36:20.877512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:36:21.055190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
죽암도서 735
73.5%
기증도서 166
 
16.6%
구입도서 98
 
9.8%
기타 1
 
0.1%
Distinct948
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-12T12:36:21.371030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length27
Mean length12.083
Min length2

Characters and Unicode

Total characters12083
Distinct characters661
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique921 ?
Unique (%)92.1%

Sample

1st row인문연구 제30편
2nd row개관 6주년기념 운반용구특별전
3rd row동두천시의 역사와 문화유적
4th row삼국시대의 동물원
5th row부산의 역사와 복천동 고분군
ValueCountFrequency (%)
한국의 42
 
1.5%
한국 36
 
1.3%
브리테니커 26
 
1.0%
대백과사전 26
 
1.0%
1 21
 
0.8%
도록 21
 
0.8%
연구 19
 
0.7%
특별전 18
 
0.7%
2 17
 
0.6%
논총 15
 
0.5%
Other values (1752) 2488
91.2%
2023-12-12T12:36:21.887131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1733
 
14.3%
261
 
2.2%
246
 
2.0%
221
 
1.8%
208
 
1.7%
202
 
1.7%
201
 
1.7%
186
 
1.5%
168
 
1.4%
159
 
1.3%
Other values (651) 8498
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9196
76.1%
Space Separator 1733
 
14.3%
Decimal Number 579
 
4.8%
Lowercase Letter 319
 
2.6%
Connector Punctuation 84
 
0.7%
Other Punctuation 71
 
0.6%
Uppercase Letter 71
 
0.6%
Open Punctuation 9
 
0.1%
Close Punctuation 9
 
0.1%
Dash Punctuation 7
 
0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
261
 
2.8%
246
 
2.7%
221
 
2.4%
208
 
2.3%
202
 
2.2%
201
 
2.2%
186
 
2.0%
168
 
1.8%
159
 
1.7%
135
 
1.5%
Other values (584) 7209
78.4%
Uppercase Letter
ValueCountFrequency (%)
K 12
16.9%
C 7
 
9.9%
T 6
 
8.5%
B 6
 
8.5%
A 5
 
7.0%
S 4
 
5.6%
G 3
 
4.2%
M 3
 
4.2%
H 3
 
4.2%
R 2
 
2.8%
Other values (15) 20
28.2%
Lowercase Letter
ValueCountFrequency (%)
e 44
13.8%
o 32
10.0%
a 30
9.4%
r 30
9.4%
i 25
7.8%
u 21
 
6.6%
s 20
 
6.3%
d 19
 
6.0%
n 19
 
6.0%
t 18
 
5.6%
Other values (12) 61
19.1%
Decimal Number
ValueCountFrequency (%)
1 145
25.0%
2 94
16.2%
9 81
14.0%
3 65
11.2%
0 44
 
7.6%
4 41
 
7.1%
8 31
 
5.4%
5 31
 
5.4%
7 26
 
4.5%
6 21
 
3.6%
Other Punctuation
ValueCountFrequency (%)
. 55
77.5%
, 16
 
22.5%
Space Separator
ValueCountFrequency (%)
1733
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 84
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9075
75.1%
Common 2496
 
20.7%
Latin 379
 
3.1%
Han 117
 
1.0%
Cyrillic 12
 
0.1%
Katakana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
261
 
2.9%
246
 
2.7%
221
 
2.4%
208
 
2.3%
202
 
2.2%
201
 
2.2%
186
 
2.0%
168
 
1.9%
159
 
1.8%
135
 
1.5%
Other values (484) 7088
78.1%
Han
ValueCountFrequency (%)
4
 
3.4%
4
 
3.4%
3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (86) 92
78.6%
Latin
ValueCountFrequency (%)
e 44
 
11.6%
o 32
 
8.4%
a 30
 
7.9%
r 30
 
7.9%
i 25
 
6.6%
u 21
 
5.5%
s 20
 
5.3%
d 19
 
5.0%
n 19
 
5.0%
t 18
 
4.7%
Other values (29) 121
31.9%
Common
ValueCountFrequency (%)
1733
69.4%
1 145
 
5.8%
2 94
 
3.8%
_ 84
 
3.4%
9 81
 
3.2%
3 65
 
2.6%
. 55
 
2.2%
0 44
 
1.8%
4 41
 
1.6%
8 31
 
1.2%
Other values (9) 123
 
4.9%
Cyrillic
ValueCountFrequency (%)
Н 2
16.7%
А 2
16.7%
С 2
16.7%
Д 1
8.3%
Э 1
8.3%
И 1
8.3%
Й 1
8.3%
Т 1
8.3%
Л 1
8.3%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9075
75.1%
ASCII 2874
 
23.8%
CJK 111
 
0.9%
Cyrillic 12
 
0.1%
CJK Compat Ideographs 6
 
< 0.1%
Katakana 4
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1733
60.3%
1 145
 
5.0%
2 94
 
3.3%
_ 84
 
2.9%
9 81
 
2.8%
3 65
 
2.3%
. 55
 
1.9%
0 44
 
1.5%
e 44
 
1.5%
4 41
 
1.4%
Other values (47) 488
 
17.0%
Hangul
ValueCountFrequency (%)
261
 
2.9%
246
 
2.7%
221
 
2.4%
208
 
2.3%
202
 
2.2%
201
 
2.2%
186
 
2.0%
168
 
1.9%
159
 
1.8%
135
 
1.5%
Other values (484) 7088
78.1%
CJK
ValueCountFrequency (%)
4
 
3.6%
4
 
3.6%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (81) 86
77.5%
CJK Compat Ideographs
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Cyrillic
ValueCountFrequency (%)
Н 2
16.7%
А 2
16.7%
С 2
16.7%
Д 1
8.3%
Э 1
8.3%
И 1
8.3%
Й 1
8.3%
Т 1
8.3%
Л 1
8.3%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

발행연도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct34
Distinct (%)3.5%
Missing37
Missing (%)3.7%
Infinite0
Infinite (%)0.0%
Mean1990.1776
Minimum1959
Maximum1999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T12:36:22.061594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1959
5-th percentile1977.1
Q11987
median1992
Q31995
95-th percentile1999
Maximum1999
Range40
Interquartile range (IQR)8

Descriptive statistics

Standard deviation6.5888013
Coefficient of variation (CV)0.00331066
Kurtosis1.5855515
Mean1990.1776
Median Absolute Deviation (MAD)4
Skewness-1.1675401
Sum1916541
Variance43.412303
MonotonicityNot monotonic
2023-12-12T12:36:22.237016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1997 105
 
10.5%
1993 99
 
9.9%
1995 70
 
7.0%
1992 61
 
6.1%
1990 59
 
5.9%
1988 57
 
5.7%
1991 56
 
5.6%
1999 54
 
5.4%
1994 54
 
5.4%
1986 51
 
5.1%
Other values (24) 297
29.7%
ValueCountFrequency (%)
1959 1
 
0.1%
1964 1
 
0.1%
1965 2
 
0.2%
1968 5
0.5%
1970 3
0.3%
1971 6
0.6%
1972 6
0.6%
1973 2
 
0.2%
1974 5
0.5%
1975 5
0.5%
ValueCountFrequency (%)
1999 54
5.4%
1998 16
 
1.6%
1997 105
10.5%
1996 25
 
2.5%
1995 70
7.0%
1994 54
5.4%
1993 99
9.9%
1992 61
6.1%
1991 56
5.6%
1990 59
5.9%

저자 및 발행처
Text

MISSING 

Distinct575
Distinct (%)58.3%
Missing14
Missing (%)1.4%
Memory size7.9 KiB
2023-12-12T12:36:22.620869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length21
Mean length8.2119675
Min length1

Characters and Unicode

Total characters8097
Distinct characters402
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique438 ?
Unique (%)44.4%

Sample

1st row인하대 인문과학연구소
2nd row전라남도농업박물관
3rd row한양대학교박물관, 동두천시
4th row부산광역시립박물관
5th row부산광역시립박물관
ValueCountFrequency (%)
문화재관리국 30
 
2.3%
브리테니커 26
 
2.0%
국립민속박물관 23
 
1.8%
문화체육부 18
 
1.4%
한국동굴학회 17
 
1.3%
아세아문화사 16
 
1.2%
국립중앙박물관 14
 
1.1%
대원사 13
 
1.0%
정태진.홍시환 13
 
1.0%
한국문화예술진흥원 13
 
1.0%
Other values (702) 1120
86.0%
2023-12-12T12:36:23.117776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
329
 
4.1%
317
 
3.9%
278
 
3.4%
271
 
3.3%
253
 
3.1%
248
 
3.1%
211
 
2.6%
205
 
2.5%
196
 
2.4%
, 190
 
2.3%
Other values (392) 5599
69.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7154
88.4%
Space Separator 317
 
3.9%
Other Punctuation 225
 
2.8%
Lowercase Letter 216
 
2.7%
Uppercase Letter 88
 
1.1%
Close Punctuation 42
 
0.5%
Open Punctuation 42
 
0.5%
Decimal Number 11
 
0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
329
 
4.6%
278
 
3.9%
271
 
3.8%
253
 
3.5%
248
 
3.5%
211
 
2.9%
205
 
2.9%
196
 
2.7%
185
 
2.6%
180
 
2.5%
Other values (334) 4798
67.1%
Uppercase Letter
ValueCountFrequency (%)
K 15
17.0%
S 14
15.9%
O 10
11.4%
C 9
10.2%
B 7
 
8.0%
А 6
 
6.8%
A 3
 
3.4%
R 2
 
2.3%
H 2
 
2.3%
E 2
 
2.3%
Other values (15) 18
20.5%
Lowercase Letter
ValueCountFrequency (%)
e 37
17.1%
r 30
13.9%
o 25
11.6%
a 17
 
7.9%
i 12
 
5.6%
s 11
 
5.1%
n 9
 
4.2%
t 9
 
4.2%
u 8
 
3.7%
l 8
 
3.7%
Other values (10) 50
23.1%
Decimal Number
ValueCountFrequency (%)
9 4
36.4%
3 2
18.2%
0 2
18.2%
5 1
 
9.1%
4 1
 
9.1%
6 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 190
84.4%
. 34
 
15.1%
· 1
 
0.4%
Space Separator
ValueCountFrequency (%)
317
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7148
88.3%
Common 639
 
7.9%
Latin 287
 
3.5%
Cyrillic 17
 
0.2%
Han 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
329
 
4.6%
278
 
3.9%
271
 
3.8%
253
 
3.5%
248
 
3.5%
211
 
3.0%
205
 
2.9%
196
 
2.7%
185
 
2.6%
180
 
2.5%
Other values (328) 4792
67.0%
Latin
ValueCountFrequency (%)
e 37
 
12.9%
r 30
 
10.5%
o 25
 
8.7%
a 17
 
5.9%
K 15
 
5.2%
S 14
 
4.9%
i 12
 
4.2%
s 11
 
3.8%
O 10
 
3.5%
C 9
 
3.1%
Other values (23) 107
37.3%
Common
ValueCountFrequency (%)
317
49.6%
, 190
29.7%
) 42
 
6.6%
( 42
 
6.6%
. 34
 
5.3%
9 4
 
0.6%
3 2
 
0.3%
` 2
 
0.3%
0 2
 
0.3%
5 1
 
0.2%
Other values (3) 3
 
0.5%
Cyrillic
ValueCountFrequency (%)
А 6
35.3%
Р 1
 
5.9%
О 1
 
5.9%
М 1
 
5.9%
Т 1
 
5.9%
Б 1
 
5.9%
Н 1
 
5.9%
Л 1
 
5.9%
У 1
 
5.9%
С 1
 
5.9%
Other values (2) 2
 
11.8%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7148
88.3%
ASCII 925
 
11.4%
Cyrillic 17
 
0.2%
CJK 6
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
329
 
4.6%
278
 
3.9%
271
 
3.8%
253
 
3.5%
248
 
3.5%
211
 
3.0%
205
 
2.9%
196
 
2.7%
185
 
2.6%
180
 
2.5%
Other values (328) 4792
67.0%
ASCII
ValueCountFrequency (%)
317
34.3%
, 190
20.5%
) 42
 
4.5%
( 42
 
4.5%
e 37
 
4.0%
. 34
 
3.7%
r 30
 
3.2%
o 25
 
2.7%
a 17
 
1.8%
K 15
 
1.6%
Other values (35) 176
19.0%
Cyrillic
ValueCountFrequency (%)
А 6
35.3%
Р 1
 
5.9%
О 1
 
5.9%
М 1
 
5.9%
Т 1
 
5.9%
Б 1
 
5.9%
Н 1
 
5.9%
Л 1
 
5.9%
У 1
 
5.9%
С 1
 
5.9%
Other values (2) 2
 
11.8%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct57
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum1994-04-26 00:00:00
Maximum2000-01-10 00:00:00
2023-12-12T12:36:23.315504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:36:23.525673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T12:36:20.147003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:36:23.635370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소장경위발행연도소장년월일
소장경위1.0000.6920.948
발행연도0.6921.0000.721
소장년월일0.9480.7211.000
2023-12-12T12:36:23.727063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발행연도소장경위
발행연도1.0000.525
소장경위0.5251.000

Missing values

2023-12-12T12:36:20.374721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:36:20.551620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:36:20.677063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소장경위도서명발행연도저자 및 발행처소장년월일
0기증도서인문연구 제30편1999인하대 인문과학연구소2000-01-10
1기증도서개관 6주년기념 운반용구특별전1999전라남도농업박물관1999-12-29
2기타동두천시의 역사와 문화유적1999한양대학교박물관, 동두천시1999-12-29
3기증도서삼국시대의 동물원1997부산광역시립박물관1999-12-29
4기증도서부산의 역사와 복천동 고분군1996부산광역시립박물관1999-12-29
5기증도서유물에 새겨진 고대문자1997부산광역시립박물관1999-12-29
6기증도서조선후기 완도 청산진의 치폐와 그 배경1999이명헌1999-12-29
7구입도서청년영웅 칭기즈칸 81998해냄1999-12-29
8죽암도서불멸의 증언1986정웅구악부1994-05-10
9죽암도서베른 조약 축조 해설1984허희성, 범우사1994-05-10
소장경위도서명발행연도저자 및 발행처소장년월일
990죽암도서근대인물한국사 408_최현배1993허웅, 동아일보사1996-03-20
991죽암도서93년 12월의 문화인물 윤백남 작품세계1993문화체육부1996-03-20
992죽암도서노산문학상 수상기념 수필집 오늘과 내일1978노산문학회1996-03-20
993죽암도서명수 산문록1985김명수, 삼형문화1996-03-20
994죽암도서삼우 정해근 선생 문집1986덕수상업고등학교1996-03-20
995죽암도서남명 조식의 교학은상1990한상규, 세종출판사1996-03-20
996죽암도서월궁항아 원강.원영례1991고글1996-03-20
997죽암도서빛깔있는 책들 168 궁중유물_둘1995대원사1996-03-20
998죽암도서시와 그림이 걸린 풍류산방1994따비밭1996-03-20
999죽암도서빛깔있는 책들 167 궁중유물_하나1995대원사1996-03-20

Duplicate rows

Most frequently occurring

소장경위도서명발행연도저자 및 발행처소장년월일# duplicates
9죽암도서일본 시찰 보고 동굴과 부대시설1988정태진.홍시환, 한국동굴학회1994-04-2612
4죽암도서건조물 문화재 지정 조사보고서_석조및목조1987문화재관리국1994-04-264
5죽암도서고.중세시대 한중 문화교류사1993문화체육부1996-03-204
2죽암도서Korea Buddhism1988Chogye Order1994-05-103
6죽암도서문화재 조사보고서_경남 산청지역<NA>문화재관리국1994-04-263
14죽암도서한국의 금속공예1987이호관, 문화재관리국1994-04-263
0죽암도서549돌 한글날기념 한글사랑 나라사랑1995문체부, KBS1996-04-062
1죽암도서Korea Buddhism1986Chogye Order1994-05-102
3죽암도서개교 40주년 기념 박물관 도록1986국민대학교 박물관1994-05-022
7죽암도서빛깔있는 책들 한국의 철새1990윤무부, 대원사1994-05-102