Overview

Dataset statistics

Number of variables6
Number of observations128
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.4 KiB
Average record size in memory51.0 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description인천광역시 문화유산표준관리시스템 보존처리 정보입니다. 소장구분, 일련번호, 보존처리시작일자, 보존처리종료일자, 보존처리기관에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15119575/fileData.do

Alerts

소장구분 is highly overall correlated with 일련번호 and 1 other fieldsHigh correlation
일련번호 is highly overall correlated with 소장구분 and 1 other fieldsHigh correlation
보존처리기관 is highly overall correlated with 소장구분 and 1 other fieldsHigh correlation
소장구분 is highly imbalanced (83.4%)Imbalance
일련번호 is highly imbalanced (90.1%)Imbalance
보존처리기관 is highly imbalanced (58.9%)Imbalance
순번 has unique valuesUnique
보존처리종료일자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:31:14.096250
Analysis finished2023-12-12 11:31:15.073252
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct128
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64.5
Minimum1
Maximum128
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:31:15.230399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.35
Q132.75
median64.5
Q396.25
95-th percentile121.65
Maximum128
Range127
Interquartile range (IQR)63.5

Descriptive statistics

Standard deviation37.094474
Coefficient of variation (CV)0.57510812
Kurtosis-1.2
Mean64.5
Median Absolute Deviation (MAD)32
Skewness0
Sum8256
Variance1376
MonotonicityStrictly increasing
2023-12-12T20:31:15.529792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
66 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
Other values (118) 118
92.2%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%
119 1
0.8%

소장구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
PS01003001001
120 
PS01003116001
 
1
PS01003316001
 
1
PS01003117001
 
1
PS01003001002
 
1
Other values (4)
 
4

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique8 ?
Unique (%)6.2%

Sample

1st rowPS01003001001
2nd rowPS01003116001
3rd rowPS01003316001
4th rowPS01003117001
5th rowPS01003001002

Common Values

ValueCountFrequency (%)
PS01003001001 120
93.8%
PS01003116001 1
 
0.8%
PS01003316001 1
 
0.8%
PS01003117001 1
 
0.8%
PS01003001002 1
 
0.8%
PS01003001007 1
 
0.8%
PS01003001004 1
 
0.8%
PS01003117007 1
 
0.8%
PS01003414001 1
 
0.8%

Length

2023-12-12T20:31:15.801996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:31:15.994447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ps01003001001 120
93.8%
ps01003116001 1
 
0.8%
ps01003316001 1
 
0.8%
ps01003117001 1
 
0.8%
ps01003001002 1
 
0.8%
ps01003001007 1
 
0.8%
ps01003001004 1
 
0.8%
ps01003117007 1
 
0.8%
ps01003414001 1
 
0.8%

일련번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
1
125 
2
 
1
3
 
1
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique3 ?
Unique (%)2.3%

Sample

1st row1
2nd row2
3rd row3
4th row4
5th row1

Common Values

ValueCountFrequency (%)
1 125
97.7%
2 1
 
0.8%
3 1
 
0.8%
4 1
 
0.8%

Length

2023-12-12T20:31:16.166797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:31:16.368608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 125
97.7%
2 1
 
0.8%
3 1
 
0.8%
4 1
 
0.8%
Distinct114
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T20:31:17.043236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9765625
Min length7

Characters and Unicode

Total characters1277
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)78.1%

Sample

1st row2021-09-30
2nd row2005-06-01
3rd row2008-06-30
4th row2010-07-08
5th row2012-08-02
ValueCountFrequency (%)
2021-09-30 2
 
1.6%
2005-06-01 2
 
1.6%
2014-02-05 2
 
1.6%
2013-10-11 2
 
1.6%
2013-08-13 2
 
1.6%
2010-02-12 2
 
1.6%
2016-10-12 2
 
1.6%
2015-02-04 2
 
1.6%
2012-02-13 2
 
1.6%
2012-08-02 2
 
1.6%
Other values (104) 108
84.4%
2023-12-12T20:31:17.812711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 348
27.3%
- 254
19.9%
2 241
18.9%
1 167
13.1%
7 48
 
3.8%
3 45
 
3.5%
8 44
 
3.4%
6 36
 
2.8%
9 32
 
2.5%
5 32
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1023
80.1%
Dash Punctuation 254
 
19.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 348
34.0%
2 241
23.6%
1 167
16.3%
7 48
 
4.7%
3 45
 
4.4%
8 44
 
4.3%
6 36
 
3.5%
9 32
 
3.1%
5 32
 
3.1%
4 30
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 254
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1277
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 348
27.3%
- 254
19.9%
2 241
18.9%
1 167
13.1%
7 48
 
3.8%
3 45
 
3.5%
8 44
 
3.4%
6 36
 
2.8%
9 32
 
2.5%
5 32
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1277
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 348
27.3%
- 254
19.9%
2 241
18.9%
1 167
13.1%
7 48
 
3.8%
3 45
 
3.5%
8 44
 
3.4%
6 36
 
2.8%
9 32
 
2.5%
5 32
 
2.5%
Distinct128
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T20:31:18.264802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9765625
Min length7

Characters and Unicode

Total characters1277
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)100.0%

Sample

1st row2021-10-08
2nd row2005-12-31
3rd row2008-10-31
4th row2010-11-30
5th row2013-01-31
ValueCountFrequency (%)
2021-10-08 1
 
0.8%
2005-12-31 1
 
0.8%
2009-11-19 1
 
0.8%
2007-09-12 1
 
0.8%
2021-10-25 1
 
0.8%
2022-12-27 1
 
0.8%
2021-12-16 1
 
0.8%
2014-07-18 1
 
0.8%
2009-08-31 1
 
0.8%
2008-12-20 1
 
0.8%
Other values (118) 118
92.2%
2023-12-12T20:31:19.001394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 304
23.8%
2 255
20.0%
- 254
19.9%
1 210
16.4%
3 47
 
3.7%
7 41
 
3.2%
8 38
 
3.0%
9 38
 
3.0%
5 37
 
2.9%
6 27
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1023
80.1%
Dash Punctuation 254
 
19.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 304
29.7%
2 255
24.9%
1 210
20.5%
3 47
 
4.6%
7 41
 
4.0%
8 38
 
3.7%
9 38
 
3.7%
5 37
 
3.6%
6 27
 
2.6%
4 26
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 254
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1277
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 304
23.8%
2 255
20.0%
- 254
19.9%
1 210
16.4%
3 47
 
3.7%
7 41
 
3.2%
8 38
 
3.0%
9 38
 
3.0%
5 37
 
2.9%
6 27
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1277
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 304
23.8%
2 255
20.0%
- 254
19.9%
1 210
16.4%
3 47
 
3.7%
7 41
 
3.2%
8 38
 
3.0%
9 38
 
3.0%
5 37
 
2.9%
6 27
 
2.1%

보존처리기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct30
Distinct (%)23.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
인천시립박물관
96 
본관 유물관리부 보존처리실
 
2
선광문화재보존연구소
 
2
한국종합방제주식회사
 
2
한송칠피공방
 
1
Other values (25)
25 

Length

Max length15
Median length7
Mean length7.140625
Min length2

Unique

Unique26 ?
Unique (%)20.3%

Sample

1st row인천시립박물관
2nd row인천시립박물관 보존처리실
3rd row엔가드
4th row한국종합방제주식회사
5th row인천광역시립박물관

Common Values

ValueCountFrequency (%)
인천시립박물관 96
75.0%
본관 유물관리부 보존처리실 2
 
1.6%
선광문화재보존연구소 2
 
1.6%
한국종합방제주식회사 2
 
1.6%
한송칠피공방 1
 
0.8%
엔가드 1
 
0.8%
인천광역시립박물관 1
 
0.8%
(주)엔가드 1
 
0.8%
서진 1
 
0.8%
시립박물관 유물관리부 1
 
0.8%
Other values (20) 20
 
15.6%

Length

2023-12-12T20:31:19.250673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천시립박물관 98
71.5%
유물관리부 4
 
2.9%
보존처리실 3
 
2.2%
본관 3
 
2.2%
선광문화재보존연구소 2
 
1.5%
한국종합방제주식회사 2
 
1.5%
보존과학실 2
 
1.5%
한송공방 1
 
0.7%
주)한켐문화재보존 1
 
0.7%
경담연구소 1
 
0.7%
Other values (20) 20
 
14.6%

Interactions

2023-12-12T20:31:14.605210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:31:19.429759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번소장구분일련번호보존처리기관
순번1.0000.1330.0370.419
소장구분0.1331.0001.0000.988
일련번호0.0371.0001.0000.972
보존처리기관0.4190.9880.9721.000
2023-12-12T20:31:19.732250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소장구분보존처리기관일련번호
소장구분1.0000.8300.980
보존처리기관0.8301.0000.786
일련번호0.9800.7861.000
2023-12-12T20:31:19.995465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번소장구분일련번호보존처리기관
순번1.0000.0550.0000.126
소장구분0.0551.0000.9800.830
일련번호0.0000.9801.0000.786
보존처리기관0.1260.8300.7861.000

Missing values

2023-12-12T20:31:14.823931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:31:15.001004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번소장구분일련번호보존처리시작일자보존처리종료일자보존처리기관
01PS0100300100112021-09-302021-10-08인천시립박물관
12PS0100311600122005-06-012005-12-31인천시립박물관 보존처리실
23PS0100331600132008-06-302008-10-31엔가드
34PS0100311700142010-07-082010-11-30한국종합방제주식회사
45PS0100300100212012-08-022013-01-31인천광역시립박물관
56PS0100300100712013-02-062010-06-30(주)엔가드
67PS0100300100412009-07-072013-06-30본관 유물관리부 보존처리실
78PS0100311700712015-02-042009-12-04서진
89PS0100341400112012-02-132012-06-30시립박물관 유물관리부
910PS0100300100112014-02-052015-07-31엔마스타
순번소장구분일련번호보존처리시작일자보존처리종료일자보존처리기관
118119PS0100300100112012-08-022014-07-04인천시립박물관
119120PS0100300100112013-02-062018-12-21인천시립박물관
120121PS0100300100112009-07-072020-05-19인천시립박물관
121122PS0100300100112015-02-042018-08-30인천시립박물관
122123PS0100300100112012-02-132015-05-28인천시립박물관
123124PS0100300100112014-02-052013-01-30인천시립박물관
124125PS0100300100112016-10-122013-09-30인천시립박물관
125126PS0100300100112010-02-122001231인천시립박물관
126127PS0100300100112013-08-132021-01-29인천시립박물관
127128PS0100300100112013-10-112020-03-09인천시립박물관