Overview

Dataset statistics

Number of variables: 5
Number of observations: 1310
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 22
Duplicate rows (%): 1.7%
Total size in memory: 52.6 KiB
Average record size in memory: 41.1 B
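
The two derived figures above follow from the raw counts. A minimal Python sketch of the arithmetic, assuming 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Raw figures copied from the dataset statistics.
n_observations = 1310
n_duplicate_rows = 22
total_size_kib = 52.6  # assumption: 1 KiB = 1024 bytes

# Share of duplicate rows, as a percentage rounded to one decimal.
duplicate_pct = round(100 * n_duplicate_rows / n_observations, 1)

# Average memory per record in bytes, rounded to one decimal.
avg_record_bytes = round(total_size_kib * 1024 / n_observations, 1)

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Average record size in memory: {avg_record_bytes} B")
```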

Variable types

Text: 1
Categorical: 3
Numeric: 1

Dataset

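Description: List of exhibition materials at the 전북특별자치도 산림박물관 (Jeonbuk Forest Museum), covering item name, storage room location, acquisition date, and so on. Names of the items held in the museum's collection.
Author: 전북특별자치도 (Jeonbuk Special Self-Governing Province)
URL: https://www.data.go.kr/data/15055675/fileData.do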

Alerts

Duplicates: Dataset has 22 (1.7%) duplicate rows
High correlation: 단위 is highly overall correlated with 취득일자
High correlation: 취득일자 is highly overall correlated with 단위
Imbalance: 단위 is highly imbalanced (55.1%)
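
The 55.1% imbalance figure can be reproduced from the category counts the report lists for 단위. The sketch below assumes the score is one minus the Shannon entropy of the counts divided by its maximum (log2 of the number of classes). That definition is an assumption about the tool, not something the report states. The count of 1 for the eleventh category is inferred from the totals (1310 rows, one unique value), and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumed definition; the report itself does not spell it out.
    """
    total = sum(counts)
    if len(counts) < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(len(counts))


# Counts of the 11 distinct 단위 values; the final 1 is inferred, not listed.
unit_counts = [808, 346, 82, 28, 23, 6, 5, 4, 4, 3, 1]
score = imbalance_score(unit_counts)
print(f"{score:.1%}")
```

Under these assumptions the score rounds to 55.1%, which matches the alert.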

Reproduction

Analysis started: 2024-03-14 14:43:19.916813
Analysis finished: 2024-03-14 14:43:21.390957
Duration: 1.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

물품명
Text

Distinct: 1115
Distinct (%): 85.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.4 KiB

Length

Max length: 45
Median length: 41
Mean length: 5.8519084
Min length: 1

Characters and Unicode

Total characters: 7666
Distinct characters: 634
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
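
As an illustration of how such character properties can be read, the sketch below counts the characters of one item name by Unicode general category using Python's standard `unicodedata` module. It is not the profiler's own code; `category_counts` is a hypothetical helper. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return dict(Counter(unicodedata.category(ch) for ch in text))


# Item name taken from the sample rows of this report.
counts = category_counts("백두대간사료(고서적)")
print(counts)
```

Here the nine Hangul syllables fall under `Lo`, and the two parentheses under `Ps` and `Pe`.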

Unique

Unique: 980
Unique (%): 74.8%

Sample

1st row: 물관과 체관
2nd row: 고려사
3rd row: 고려사절요
4th row: 고소설
5th row: 고암문집

Value | Count | Frequency (%)
환경스페셜 | 35 | 2.0%
(not recoverable) | 21 | 1.2%
kbs자연다큐멘터리 | 13 | 0.7%
pdp | 10 | 0.6%
(not recoverable) | 8 | 0.4%
모니터 | 8 | 0.4%
느티나무 | 7 | 0.4%
하회탈 | 6 | 0.3%
나이테 | 6 | 0.3%
스크린 | 5 | 0.3%
Other values (1368) | 1674 | 93.4%

Most occurring characters

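Value | Count | Frequency (%)
(space) | 555 | 7.2%
0 | 226 | 2.9%
2 | 159 | 2.1%
(not recoverable) | 153 | 2.0%
(not recoverable) | 140 | 1.8%
1 | 128 | 1.7%
(not recoverable) | 119 | 1.6%
(not recoverable) | 117 | 1.5%
(not recoverable) | 116 | 1.5%
(not recoverable) | 112 | 1.5%
Other values (624) | 5841 | 76.2%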

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5945 | 77.6%
Decimal Number | 739 | 9.6%
Space Separator | 555 | 7.2%
Uppercase Letter | 140 | 1.8%
Open Punctuation | 104 | 1.4%
Close Punctuation | 104 | 1.4%
Other Punctuation | 70 | 0.9%
Lowercase Letter | 4 | 0.1%
Other Symbol | 3 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

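Other Letter
Value | Count | Frequency (%)
(not recoverable) | 153 | 2.6%
(not recoverable) | 140 | 2.4%
(not recoverable) | 119 | 2.0%
(not recoverable) | 117 | 2.0%
(not recoverable) | 116 | 2.0%
(not recoverable) | 112 | 1.9%
(not recoverable) | 88 | 1.5%
(not recoverable) | 76 | 1.3%
(not recoverable) | 68 | 1.1%
(not recoverable) | 66 | 1.1%
Other values (585) | 4890 | 82.3%

Uppercase Letter
Value | Count | Frequency (%)
D | 25 | 17.9%
S | 25 | 17.9%
P | 23 | 16.4%
B | 19 | 13.6%
K | 19 | 13.6%
C | 6 | 4.3%
L | 4 | 2.9%
V | 4 | 2.9%
W | 3 | 2.1%
I | 3 | 2.1%
Other values (6) | 9 | 6.4%

Decimal Number
Value | Count | Frequency (%)
0 | 226 | 30.6%
2 | 159 | 21.5%
1 | 128 | 17.3%
3 | 50 | 6.8%
6 | 36 | 4.9%
4 | 35 | 4.7%
5 | 35 | 4.7%
9 | 30 | 4.1%
8 | 22 | 3.0%
7 | 18 | 2.4%

Other Punctuation
Value | Count | Frequency (%)
, | 57 | 81.4%
/ | 8 | 11.4%
' | 2 | 2.9%
" | 2 | 2.9%
. | 1 | 1.4%

Other Symbol
Value | Count | Frequency (%)
(not recoverable) | 2 | 66.7%
(not recoverable) | 1 | 33.3%

Lowercase Letter
Value | Count | Frequency (%)
m | 2 | 50.0%
k | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 555 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 104 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 104 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%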

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5943 | 77.5%
Common | 1577 | 20.6%
Latin | 144 | 1.9%
Han | 2 | < 0.1%

Most frequent character per script

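Hangul
Value | Count | Frequency (%)
(not recoverable) | 153 | 2.6%
(not recoverable) | 140 | 2.4%
(not recoverable) | 119 | 2.0%
(not recoverable) | 117 | 2.0%
(not recoverable) | 116 | 2.0%
(not recoverable) | 112 | 1.9%
(not recoverable) | 88 | 1.5%
(not recoverable) | 76 | 1.3%
(not recoverable) | 68 | 1.1%
(not recoverable) | 66 | 1.1%
Other values (583) | 4888 | 82.2%

Common
Value | Count | Frequency (%)
(space) | 555 | 35.2%
0 | 226 | 14.3%
2 | 159 | 10.1%
1 | 128 | 8.1%
( | 104 | 6.6%
) | 104 | 6.6%
, | 57 | 3.6%
3 | 50 | 3.2%
6 | 36 | 2.3%
4 | 35 | 2.2%
Other values (11) | 123 | 7.8%

Latin
Value | Count | Frequency (%)
D | 25 | 17.4%
S | 25 | 17.4%
P | 23 | 16.0%
B | 19 | 13.2%
K | 19 | 13.2%
C | 6 | 4.2%
L | 4 | 2.8%
V | 4 | 2.8%
W | 3 | 2.1%
I | 3 | 2.1%
Other values (8) | 13 | 9.0%

Han
Value | Count | Frequency (%)
(not recoverable) | 1 | 50.0%
(not recoverable) | 1 | 50.0%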

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5942 | 77.5%
ASCII | 1718 | 22.4%
Misc Symbols | 3 | < 0.1%
CJK | 2 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

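ASCII
Value | Count | Frequency (%)
(space) | 555 | 32.3%
0 | 226 | 13.2%
2 | 159 | 9.3%
1 | 128 | 7.5%
( | 104 | 6.1%
) | 104 | 6.1%
, | 57 | 3.3%
3 | 50 | 2.9%
6 | 36 | 2.1%
4 | 35 | 2.0%
Other values (27) | 264 | 15.4%

Hangul
Value | Count | Frequency (%)
(not recoverable) | 153 | 2.6%
(not recoverable) | 140 | 2.4%
(not recoverable) | 119 | 2.0%
(not recoverable) | 117 | 2.0%
(not recoverable) | 116 | 2.0%
(not recoverable) | 112 | 1.9%
(not recoverable) | 88 | 1.5%
(not recoverable) | 76 | 1.3%
(not recoverable) | 68 | 1.1%
(not recoverable) | 66 | 1.1%
Other values (582) | 4887 | 82.2%

Misc Symbols
Value | Count | Frequency (%)
(not recoverable) | 2 | 66.7%
(not recoverable) | 1 | 33.3%

CJK
Value | Count | Frequency (%)
(not recoverable) | 1 | 50.0%
(not recoverable) | 1 | 50.0%

Compat Jamo
Value | Count | Frequency (%)
(not recoverable) | 1 | 100.0%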

단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 11
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 10.4 KiB

Value | Count
(not recoverable) | 808
<NA> | 346
마리 | 82
(not recoverable) | 28
(not recoverable) | 23
Other values (6) | 23

Length

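Max length: 4
Median length: 1
Mean length: 1.8572519
Min length: 1

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: (not recoverable)
2nd row: (not recoverable)
3rd row: (not recoverable)
4th row: (not recoverable)
5th row: (not recoverable)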

Common Values

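Value | Count | Frequency (%)
(not recoverable) | 808 | 61.7%
<NA> | 346 | 26.4%
마리 | 82 | 6.3%
(not recoverable) | 28 | 2.1%
(not recoverable) | 23 | 1.8%
(not recoverable) | 6 | 0.5%
(not recoverable) | 5 | 0.4%
(not recoverable) | 4 | 0.3%
(not recoverable) | 4 | 0.3%
세트 | 3 | 0.2%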
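
The report counts 0 missing values for 단위, yet `<NA>` appears 346 times and the maximum length is 4. This suggests the marker is stored as a literal four-character string rather than as a true missing value. The sketch below shows one way to normalize such placeholders before further analysis, assuming that reading is correct. `NA_MARKERS` and `normalize_unit` are hypothetical names, and the sample list is made up from values that appear in this report.

```python
# Assumed spellings of "no unit"; adjust to what the source file contains.
NA_MARKERS = {"<NA>", ""}


def normalize_unit(value):
    """Strip whitespace and map placeholder strings to None."""
    if value is None:
        return None
    cleaned = value.strip()
    return None if cleaned in NA_MARKERS else cleaned


units = ["마리", "<NA>", " 세트 ", "<NA>"]
normalized = [normalize_unit(u) for u in units]
n_missing = sum(u is None for u in normalized)
print(normalized, n_missing)
```

After normalization the placeholders count as missing, so a profiler would report them under Missing rather than as a category.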

Length

Histogram of lengths of the category
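Value | Count | Frequency (%)
(not recoverable) | 808 | 61.7%
na | 346 | 26.4%
마리 | 82 | 6.3%
(not recoverable) | 28 | 2.1%
(not recoverable) | 23 | 1.8%
(not recoverable) | 6 | 0.5%
(not recoverable) | 5 | 0.4%
(not recoverable) | 4 | 0.3%
(not recoverable) | 4 | 0.3%
세트 | 3 | 0.2%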

수량
Real number (ℝ)

Distinct: 46
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7938931
Minimum: 1
Maximum: 182
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 7
Maximum: 182
Range: 181
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 9.4818532
Coefficient of variation (CV): 3.393778
Kurtosis: 160.27401
Mean: 2.7938931
Median Absolute Deviation (MAD): 0
Skewness: 11.066946
Sum: 3660
Variance: 89.905539
Monotonicity: Not monotonic
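
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, the coefficient of variation is the standard deviation divided by the mean, the range is maximum minus minimum, and the IQR is Q3 minus Q1. A small Python sketch that recomputes them from the reported values as a consistency check; the variable names are illustrative:

```python
# Figures reported for 수량 in this report.
n_observations = 1310
total = 3660
mean = 2.7938931
std = 9.4818532
minimum, maximum = 1, 182
q1, q3 = 1, 1

# Recompute the dependent statistics from the independent ones.
derived = {
    "mean": total / n_observations,
    "variance": std ** 2,
    "cv": std / mean,
    "range": maximum - minimum,
    "iqr": q3 - q1,
}

for name, value in derived.items():
    print(f"{name}: {value:.6f}")
```

The recomputed values agree with the reported ones to the displayed precision.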
Histogram with fixed size bins (bins=46)

Value | Count | Frequency (%)
1 | 1048 | 80.0%
2 | 82 | 6.3%
3 | 40 | 3.1%
4 | 27 | 2.1%
5 | 22 | 1.7%
6 | 20 | 1.5%
7 | 9 | 0.7%
10 | 6 | 0.5%
11 | 5 | 0.4%
9 | 5 | 0.4%
Other values (36) | 46 | 3.5%

Value | Count | Frequency (%)
1 | 1048 | 80.0%
2 | 82 | 6.3%
3 | 40 | 3.1%
4 | 27 | 2.1%
5 | 22 | 1.7%
6 | 20 | 1.5%
7 | 9 | 0.7%
8 | 3 | 0.2%
9 | 5 | 0.4%
10 | 6 | 0.5%

Value | Count | Frequency (%)
182 | 1 | 0.1%
149 | 1 | 0.1%
96 | 1 | 0.1%
82 | 1 | 0.1%
74 | 1 | 0.1%
69 | 1 | 0.1%
68 | 1 | 0.1%
65 | 1 | 0.1%
52 | 1 | 0.1%
51 | 1 | 0.1%

취득일자
Categorical

HIGH CORRELATION 

Distinct: 47
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Memory size: 10.4 KiB

Value | Count
2004-12-10 | 274
2001-12-01 | 206
2005-11-19 | 171
2004-12-01 | 82
2007-12-18 | 81
Other values (42) | 496

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 14
Unique (%): 1.1%

Sample

1st row: 2007-04-14
2nd row: 2001-12-01
3rd row: 2001-12-01
4th row: 2001-12-01
5th row: 2001-12-01

Common Values

Value | Count | Frequency (%)
2004-12-10 | 274 | 20.9%
2001-12-01 | 206 | 15.7%
2005-11-19 | 171 | 13.1%
2004-12-01 | 82 | 6.3%
2007-12-18 | 81 | 6.2%
2007-04-14 | 73 | 5.6%
2007-12-24 | 69 | 5.3%
2003-06-01 | 51 | 3.9%
2008-12-05 | 41 | 3.1%
2003-06-05 | 34 | 2.6%
Other values (37) | 228 | 17.4%
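
취득일자 is profiled as Categorical, although every value shown is a 10-character string in YYYY-MM-DD form. Parsing the strings as dates allows aggregation by year or month. The sketch below uses the five most common values from this report and assumes the remaining rows follow the same format; `rows_per_year` is an illustrative name.

```python
from collections import Counter
from datetime import date

# (value, count) pairs copied from the most common 취득일자 values.
common_values = [
    ("2004-12-10", 274),
    ("2001-12-01", 206),
    ("2005-11-19", 171),
    ("2004-12-01", 82),
    ("2007-12-18", 81),
]

rows_per_year = Counter()
for text, count in common_values:
    acquired = date.fromisoformat(text)  # raises ValueError on malformed input
    rows_per_year[acquired.year] += count

for year in sorted(rows_per_year):
    print(year, rows_per_year[year])
```

Because only the top five values are included, the yearly totals are partial and serve only to show the parsing step.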

Length

Histogram of lengths of the category

보관실
Categorical

Distinct: 27
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.4 KiB

Value | Count
수장고 | 361
제1전시실 | 251
제4전시실 | 153
제5전시실 | 116
로비홀 | 103
Other values (22) | 326

Length

Max length: 7
Median length: 5
Mean length: 4.0312977
Min length: 2

Unique

Unique: 9
Unique (%): 0.7%

Sample

1st row: 숲속친구들
2nd row: 제1전시실
3rd row: 제1전시실
4th row: 제1전시실
5th row: 제1전시실

Common Values

Value | Count | Frequency (%)
수장고 | 361 | 27.6%
제1전시실 | 251 | 19.2%
제4전시실 | 153 | 11.7%
제5전시실 | 116 | 8.9%
로비홀 | 103 | 7.9%
숲속친구들 | 57 | 4.4%
샛집 | 54 | 4.1%
제3전시실 | 49 | 3.7%
표본실 | 42 | 3.2%
제2전시실 | 41 | 3.1%
Other values (17) | 83 | 6.3%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
수장고 | 361 | 27.5%
제1전시실 | 251 | 19.1%
제4전시실 | 153 | 11.7%
제5전시실 | 116 | 8.8%
로비홀 | 103 | 7.9%
숲속친구들 | 57 | 4.3%
샛집 | 54 | 4.1%
제3전시실 | 49 | 3.7%
표본실 | 42 | 3.2%
제2전시실 | 41 | 3.1%
Other values (17) | 84 | 6.4%

Interactions


Correlations

Variable | 단위 | 수량 | 취득일자 | 보관실
단위 | 1.000 | 0.000 | 0.880 | 0.684
수량 | 0.000 | 1.000 | 0.663 | 0.000
취득일자 | 0.880 | 0.663 | 1.000 | 0.902
보관실 | 0.684 | 0.000 | 0.902 | 1.000

Variable | 단위 | 보관실 | 취득일자
단위 | 1.000 | 0.327 | 0.539
보관실 | 0.327 | 1.000 | 0.412
취득일자 | 0.539 | 0.412 | 1.000

Variable | 수량 | 단위 | 취득일자 | 보관실
수량 | 1.000 | 0.000 | 0.314 | 0.000
단위 | 0.000 | 1.000 | 0.539 | 0.327
취득일자 | 0.314 | 0.539 | 1.000 | 0.412
보관실 | 0.000 | 0.327 | 0.412 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

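Index | 물품명 | 단위 | 수량 | 취득일자 | 보관실
0 | 물관과 체관 | (not recoverable) | 1 | 2007-04-14 | 숲속친구들
1 | 고려사 | (not recoverable) | 1 | 2001-12-01 | 제1전시실
2 | 고려사절요 | (not recoverable) | 1 | 2001-12-01 | 제1전시실
3 | 고소설 | (not recoverable) | 1 | 2001-12-01 | 제1전시실
4 | 고암문집 | (not recoverable) | 1 | 2001-12-01 | 제1전시실
5 | 충재일기 | (not recoverable) | 1 | 2001-12-01 | 제1전시실
6 | 고서적 | (not recoverable) | 4 | 2003-06-05 | 표본실
7 | 백두대간사료(고서적) | (not recoverable) | 4 | 2007-04-14 | 제1전시실
8 | 백두대간사료(지도사본) | (not recoverable) | 2 | 2007-04-14 | 제1전시실
9 | 네발나비 (지층속생태) | 마리 | 1 | 2001-12-01 | 제1전시실

Index | 물품명 | 단위 | 수량 | 취득일자 | 보관실
1300 | 오작교 및 전통문양목교 | (not recoverable) | 5 | 2004-12-07 | 제2전시실
1301 | 국산재를활용한건축자재 | (not recoverable) | 36 | 2004-12-07 | 제2전시실
1302 | 각종 몰딩제품 및 소재 | (not recoverable) | 68 | 2004-12-07 | 제2전시실
1303 | 제지생산과정별 원료 등 | (not recoverable) | 42 | 2004-12-07 | 제2전시실
1304 | 영지버섯 외 18종 | (not recoverable) | 18 | 2001-12-01 | 제1전시실
1305 | 버섯표본 | (not recoverable) | 69 | 2003-06-05 | 표본실
1306 | 석엽표본 | (not recoverable) | 74 | 2003-12-31 | 표본실
1307 | 야생화 압화 | (not recoverable) | 22 | 2003-12-31 | 표본실
1308 | 종자표본 | (not recoverable) | 32 | 2003-06-05 | 수장고
1309 | 파충류 표본 | (not recoverable) | 5 | 2003-06-05 | 표본실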

Duplicate rows

Most frequently occurring

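Index | 물품명 | 단위 | 수량 | 취득일자 | 보관실 | # duplicates
1 | PDP | <NA> | 1 | 2008-12-05 | 제1전시실 | 5
5 | 느티나무 | (not recoverable) | 1 | 2001-12-01 | 로비홀 | 4
13 | 장기알 | (not recoverable) | 1 | 2004-12-10 | 제2전시실 | 4
17 | 큰소쩍새 | 마리 | 2 | 2001-08-29 | 제1전시실 | 3
0 | KBS자연다큐멘터리 숲 (20020101) | <NA> | 1 | 2007-12-24 | 수장고 | 2
2 | 계류보전 | (not recoverable) | 1 | 2008-12-05 | 제1전시실 | 2
3 | 고누알 | (not recoverable) | 1 | 2004-12-10 | 제2전시실 | 2
4 | 광주리 | (not recoverable) | 2 | 2005-11-19 | 샛집 | 2
6 | 딱새 | 마리 | 1 | 2001-08-29 | 제1전시실 | 2
7 | 반닫이 | (not recoverable) | 1 | 2001-12-01 | 수장고 | 2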