Overview

Dataset statistics

Number of variables: 5
Number of observations: 200
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 41.7 B

Variable types

Numeric: 1
Text: 1
Categorical: 3

Dataset

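Description: Approval-processing information from the Cultural Heritage Standard Management System of the Incheon Metropolitan City Museum (인천시립박물관). It provides the artifact name (유물명), holding classification (소장구분), and approval status (승인상태).
Author: Incheon Metropolitan City (인천광역시)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15119580&srcSe=7661IVAWM27C61E190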

Alerts

요청일자 is highly overall correlated with 소장구분 and 1 other field [High correlation]
소장구분 is highly overall correlated with 요청일자 and 1 other field [High correlation]
승인상태 is highly overall correlated with 소장구분 and 1 other field [High correlation]
소장구분 is highly imbalanced (90.3%) [Imbalance]
요청일자 is highly imbalanced (78.1%) [Imbalance]
승인상태 is highly imbalanced (94.3%) [Imbalance]
순번 has unique values [Unique]
유물명 has unique values [Unique]
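
The three imbalance percentages in the alerts above are consistent with a score of one minus the Shannon entropy of the class frequencies, divided by the logarithm of the number of classes. The sketch below is an assumption about how such a score can be computed, not a statement of the profiler's exact implementation. The function name `imbalance_score` is hypothetical, and the class counts are copied from the Common Values tables of this report.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    H is the Shannon entropy of the class frequencies and k is the number
    of classes. 0 means perfectly balanced; values near 1 mean that one
    class dominates. (Assumed formula, see the lead-in.)
    """
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# Class counts taken from this report (one dominant value plus singletons).
holding = [194] + [1] * 6    # 소장구분: 7 distinct values
request = [178] + [1] * 22   # 요청일자: 23 distinct values
approval = [198, 1, 1]       # 승인상태: 3 distinct values

for name, counts in [("소장구분", holding), ("요청일자", request), ("승인상태", approval)]:
    print(f"{name}: {imbalance_score(counts):.1%}")
```

Under this assumed formula, a column dominated by a single value scores close to 100% even when it has many distinct values, which is why all three categorical columns are flagged.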

Reproduction

Analysis started: 2024-03-18 03:38:11.252454
Analysis finished: 2024-03-18 03:38:12.981983
Duration: 1.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (sequence number)
Real number (ℝ)

UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.5
Minimum: 1
Maximum: 200
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 10.95
Q1: 50.75
Median: 100.5
Q3: 150.25
95th percentile: 190.05
Maximum: 200
Range: 199
Interquartile range (IQR): 99.5
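
The quantile figures above are consistent with linear interpolation between order statistics. The sketch below is a minimal illustration, assuming that interpolation rule and assuming the column holds the integers 1 through 200, which matches the minimum, maximum, distinct count, and sum reported for 순번. The helper `percentile` is hypothetical and written only for this example.

```python
def percentile(sorted_values, p):
    """Linear-interpolation percentile of an ascending list; p is in [0, 1]."""
    position = (len(sorted_values) - 1) * p   # fractional 0-based index
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


values = list(range(1, 201))  # assumed contents of the column

q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
for label, p in [("5th", 0.05), ("Q1", 0.25), ("median", 0.50), ("Q3", 0.75), ("95th", 0.95)]:
    print(label, round(percentile(values, p), 2))
print("IQR", q3 - q1)
```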

Descriptive statistics

Standard deviation: 57.879185
Coefficient of variation (CV): 0.57591228
Kurtosis: -1.2
Mean: 100.5
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 20100
Variance: 3350
Monotonicity: Strictly increasing
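
Several of the descriptive statistics above can be re-derived with Python's standard `statistics` module. This is a sketch under the assumption that the column holds the integers 1 through 200; it uses the sample (n - 1) variance, which is what reproduces the reported value of 3350. Skewness and kurtosis are left out because the standard library has no functions for them.

```python
import statistics

values = list(range(1, 201))  # assumed contents of the column

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```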
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.5%
139 | 1 | 0.5%
129 | 1 | 0.5%
130 | 1 | 0.5%
131 | 1 | 0.5%
132 | 1 | 0.5%
133 | 1 | 0.5%
134 | 1 | 0.5%
135 | 1 | 0.5%
136 | 1 | 0.5%
Other values (190) | 190 | 95.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
200 | 1 | 0.5%
199 | 1 | 0.5%
198 | 1 | 0.5%
197 | 1 | 0.5%
196 | 1 | 0.5%
195 | 1 | 0.5%
194 | 1 | 0.5%
193 | 1 | 0.5%
192 | 1 | 0.5%
191 | 1 | 0.5%

유물명 (artifact name)
Text

UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 26
Median length: 15
Mean length: 5.56
Min length: 1

Characters and Unicode

Total characters: 1112
Distinct characters: 272
Distinct categories: 8
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
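
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this column; the helper `category_counts` is hypothetical, and the profiler's own implementation may differ. The standard library does not expose scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "남창문구사 영수증"  # 3rd sample row of 유물명 in this report
counts = category_counts(sample)

# 'Lo' is Other Letter (Hangul syllables), 'Zs' is Space Separator.
for code, count in counts.most_common():
    print(code, count)
```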

Unique

Unique: 200
Unique (%): 100.0%

Sample

1st row: 토기편
2nd row: 모조석기
3rd row: 남창문구사 영수증
4th row: 패각류
5th row: 암키와편
Value | Count | Frequency (%)
성냥 | 7 | 2.8%
동체부편 | 4 | 1.6%
도기 | 4 | 1.6%
상표 | 3 | 1.2%
재떨이 | 2 | 0.8%
전화기 | 2 | 0.8%
 | 2 | 0.8%
쌀포대 | 2 | 0.8%
영수증 | 2 | 0.8%
백자 | 2 | 0.8%
Other values (217) | 219 | 88.0%

Most occurring characters

Value | Count | Frequency (%)
 | 49 | 4.4%
 | 40 | 3.6%
 | 37 | 3.3%
0 | 37 | 3.3%
 | 31 | 2.8%
 | 25 | 2.2%
 | 24 | 2.2%
 | 19 | 1.7%
 | 19 | 1.7%
 | 19 | 1.7%
Other values (262) | 812 | 73.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 944 | 84.9%
Decimal Number | 73 | 6.6%
Space Separator | 49 | 4.4%
Lowercase Letter | 17 | 1.5%
Uppercase Letter | 13 | 1.2%
Open Punctuation | 7 | 0.6%
Close Punctuation | 7 | 0.6%
Other Punctuation | 2 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 40 | 4.2%
 | 37 | 3.9%
 | 31 | 3.3%
 | 25 | 2.6%
 | 24 | 2.5%
 | 19 | 2.0%
 | 19 | 2.0%
 | 19 | 2.0%
 | 18 | 1.9%
 | 17 | 1.8%
Other values (233) | 695 | 73.6%

Lowercase Letter
Value | Count | Frequency (%)
t | 2 | 11.8%
p | 2 | 11.8%
a | 2 | 11.8%
s | 2 | 11.8%
c | 1 | 5.9%
h | 1 | 5.9%
d | 1 | 5.9%
i | 1 | 5.9%
r | 1 | 5.9%
e | 1 | 5.9%
Other values (3) | 3 | 17.6%

Uppercase Letter
Value | Count | Frequency (%)
R | 3 | 23.1%
C | 3 | 23.1%
E | 2 | 15.4%
T | 1 | 7.7%
I | 1 | 7.7%
W | 1 | 7.7%
K | 1 | 7.7%
H | 1 | 7.7%

Decimal Number
Value | Count | Frequency (%)
0 | 37 | 50.7%
1 | 17 | 23.3%
5 | 15 | 20.5%
2 | 4 | 5.5%

Space Separator
Value | Count | Frequency (%)
 | 49 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 7 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 7 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
 | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 932 | 83.8%
Common | 138 | 12.4%
Latin | 30 | 2.7%
Han | 12 | 1.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 40 | 4.3%
 | 37 | 4.0%
 | 31 | 3.3%
 | 25 | 2.7%
 | 24 | 2.6%
 | 19 | 2.0%
 | 19 | 2.0%
 | 19 | 2.0%
 | 18 | 1.9%
 | 17 | 1.8%
Other values (224) | 683 | 73.3%

Latin
Value | Count | Frequency (%)
R | 3 | 10.0%
C | 3 | 10.0%
t | 2 | 6.7%
p | 2 | 6.7%
a | 2 | 6.7%
E | 2 | 6.7%
s | 2 | 6.7%
c | 1 | 3.3%
h | 1 | 3.3%
d | 1 | 3.3%
Other values (11) | 11 | 36.7%

Han
Value | Count | Frequency (%)
 | 4 | 33.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%

Common
Value | Count | Frequency (%)
 | 49 | 35.5%
0 | 37 | 26.8%
1 | 17 | 12.3%
5 | 15 | 10.9%
( | 7 | 5.1%
) | 7 | 5.1%
2 | 4 | 2.9%
 | 2 | 1.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 932 | 83.8%
ASCII | 166 | 14.9%
CJK | 10 | 0.9%
None | 2 | 0.2%
CJK Compat Ideographs | 2 | 0.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
 | 49 | 29.5%
0 | 37 | 22.3%
1 | 17 | 10.2%
5 | 15 | 9.0%
( | 7 | 4.2%
) | 7 | 4.2%
2 | 4 | 2.4%
R | 3 | 1.8%
C | 3 | 1.8%
t | 2 | 1.2%
Other values (18) | 22 | 13.3%

Hangul
Value | Count | Frequency (%)
 | 40 | 4.3%
 | 37 | 4.0%
 | 31 | 3.3%
 | 25 | 2.7%
 | 24 | 2.6%
 | 19 | 2.0%
 | 19 | 2.0%
 | 19 | 2.0%
 | 18 | 1.9%
 | 17 | 1.8%
Other values (224) | 683 | 73.3%

CJK
Value | Count | Frequency (%)
 | 4 | 40.0%
 | 1 | 10.0%
 | 1 | 10.0%
 | 1 | 10.0%
 | 1 | 10.0%
 | 1 | 10.0%
 | 1 | 10.0%

None
Value | Count | Frequency (%)
 | 2 | 100.0%

CJK Compat Ideographs
Value | Count | Frequency (%)
 | 1 | 50.0%
 | 1 | 50.0%

소장구분 (holding classification)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

PS01003001001: 194
PS01003001005: 1
PS01003117007: 1
PS01003414001: 1
PS01003116001: 1
Other values (2): 2

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 6
Unique (%): 3.0%
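
The figures for this column are consistent with "Distinct" counting the number of different values and "Unique" counting the values that occur exactly once. A minimal sketch of that distinction follows, assuming this interpretation; the helper `distinct_and_unique` is hypothetical, and the column is rebuilt from the counts in the Common Values table of this report, so row order is not preserved.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# 소장구분 rebuilt from its reported counts: one dominant code plus six singletons.
column = ["PS01003001001"] * 194 + [
    "PS01003001005", "PS01003117007", "PS01003414001",
    "PS01003116001", "PS01003316001", "PS01003316003",
]

distinct, unique = distinct_and_unique(column)
print(distinct, f"{distinct / len(column):.1%}")
print(unique, f"{unique / len(column):.1%}")
```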

Sample

1st row: PS01003001001
2nd row: PS01003001005
3rd row: PS01003117007
4th row: PS01003414001
5th row: PS01003116001

Common Values

Value | Count | Frequency (%)
PS01003001001 | 194 | 97.0%
PS01003001005 | 1 | 0.5%
PS01003117007 | 1 | 0.5%
PS01003414001 | 1 | 0.5%
PS01003116001 | 1 | 0.5%
PS01003316001 | 1 | 0.5%
PS01003316003 | 1 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
ps01003001001 | 194 | 97.0%
ps01003001005 | 1 | 0.5%
ps01003117007 | 1 | 0.5%
ps01003414001 | 1 | 0.5%
ps01003116001 | 1 | 0.5%
ps01003316001 | 1 | 0.5%
ps01003316003 | 1 | 0.5%

요청일자 (request date)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 23
Distinct (%): 11.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

2017-10-10: 178
2018-12-20: 1
2018-12-24: 1
2016-07-25: 1
2018-12-21: 1
Other values (18): 18

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 22
Unique (%): 11.0%

Sample

1st row: 2017-10-10
2nd row: 2018-12-20
3rd row: 2018-12-24
4th row: 2016-07-25
5th row: 2018-12-21

Common Values

Value | Count | Frequency (%)
2017-10-10 | 178 | 89.0%
2018-12-20 | 1 | 0.5%
2018-12-24 | 1 | 0.5%
2016-07-25 | 1 | 0.5%
2018-12-21 | 1 | 0.5%
2018-12-17 | 1 | 0.5%
2016-10-17 | 1 | 0.5%
2019-08-30 | 1 | 0.5%
2016-10-18 | 1 | 0.5%
2018-12-19 | 1 | 0.5%
Other values (13) | 13 | 6.5%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
2017-10-10 | 178 | 89.0%
2017-10-25 | 1 | 0.5%
2023-06-23 | 1 | 0.5%
2016-11-03 | 1 | 0.5%
2022-09-22 | 1 | 0.5%
2016-12-29 | 1 | 0.5%
2016-07-22 | 1 | 0.5%
2020-09-11 | 1 | 0.5%
2020-07-30 | 1 | 0.5%
2019-08-20 | 1 | 0.5%
Other values (13) | 13 | 6.5%

승인상태 (approval status)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Y: 198
R: 1
N: 1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 2
Unique (%): 1.0%

Sample

1st row: Y
2nd row: R
3rd row: N
4th row: Y
5th row: Y

Common Values

Value | Count | Frequency (%)
Y | 198 | 99.0%
R | 1 | 0.5%
N | 1 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
y | 198 | 99.0%
r | 1 | 0.5%
n | 1 | 0.5%

Interactions


Correlations

 | 순번 | 소장구분 | 요청일자 | 승인상태
순번 | 1.000 | 0.076 | 0.278 | 0.037
소장구분 | 0.076 | 1.000 | 1.000 | 1.000
요청일자 | 0.278 | 1.000 | 1.000 | 1.000
승인상태 | 0.037 | 1.000 | 1.000 | 1.000

 | 요청일자 | 소장구분 | 승인상태
요청일자 | 1.000 | 0.958 | 0.948
소장구분 | 0.958 | 1.000 | 0.990
승인상태 | 0.948 | 0.990 | 1.000

 | 순번 | 소장구분 | 요청일자 | 승인상태
순번 | 1.000 | 0.035 | 0.100 | 0.015
소장구분 | 0.035 | 1.000 | 0.958 | 0.990
요청일자 | 0.100 | 0.958 | 1.000 | 0.948
승인상태 | 0.015 | 0.990 | 0.948 | 1.000
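
The report does not state which association measure produced each matrix above. As a generic illustration of how association between two categorical columns can be quantified, the sketch below computes the plain (uncorrected) Cramér's V from a chi-squared statistic. This is a swapped-in textbook technique, not the profiler's confirmed method; the function `cramers_v` and the toy data are hypothetical and are not taken from the dataset.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    if len(row_totals) < 2 or len(col_totals) < 2:
        return 0.0  # association is undefined for a constant column; report 0
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            chi2 += (observed.get((r, c), 0) - expected) ** 2 / expected
    return sqrt(chi2 / (n * (min(len(row_totals), len(col_totals)) - 1)))


# Hypothetical toy data: the second column is fully determined by the first.
codes = ["A"] * 5 + ["B"] * 5
status = ["Y"] * 5 + ["N"] * 5
print(cramers_v(codes, status))

# Hypothetical toy data: the two columns are independent.
print(cramers_v(["A", "A", "B", "B"], ["Y", "N", "Y", "N"]))
```

Measures of this kind approach 1 whenever rare values in one column line up with rare values in another, which is one plausible reading of the high values among the three heavily imbalanced categorical columns.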

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows
 | 순번 | 유물명 | 소장구분 | 요청일자 | 승인상태
0 | 1 | 토기편 | PS01003001001 | 2017-10-10 | Y
1 | 2 | 모조석기 | PS01003001005 | 2018-12-20 | R
2 | 3 | 남창문구사 영수증 | PS01003117007 | 2018-12-24 | N
3 | 4 | 패각류 | PS01003414001 | 2016-07-25 | Y
4 | 5 | 암키와편 | PS01003116001 | 2018-12-21 | Y
5 | 6 |  | PS01003316001 | 2018-12-17 | Y
6 | 7 | 청자편 | PS01003316003 | 2016-10-17 | Y
7 | 8 | 청자저부편 | PS01003001001 | 2019-08-30 | Y
8 | 9 | 관정 | PS01003001001 | 2016-10-18 | Y
9 | 10 | 청자구연부편 | PS01003001001 | 2018-12-19 | Y

Last rows
 | 순번 | 유물명 | 소장구분 | 요청일자 | 승인상태
190 | 191 | 사진엽서(오타후쿠와타주식회사) | PS01003001001 | 2017-10-10 | Y
191 | 192 | 나전매화모양시회연상 | PS01003001001 | 2017-10-10 | Y
192 | 193 | 백자청화이화학문 주자 | PS01003001001 | 2017-10-10 | Y
193 | 194 | 도태칠기 매병 | PS01003001001 | 2017-10-10 | Y
194 | 195 | 인천고려성냥합동공업사 성냥 | PS01003001001 | 2017-10-10 | Y
195 | 196 | 인천밀쌀공장 준공기념 재떨이 | PS01003001001 | 2017-10-10 | Y
196 | 197 | 유마정미부 쌀포대 | PS01003001001 | 2017-10-10 | Y
197 | 198 | 가등정미소 쌀포대 | PS01003001001 | 2017-10-10 | Y
198 | 199 | 흥한방적주식회사 상표 | PS01003001001 | 2017-10-10 | Y
199 | 200 | 조선화상무역조합 | PS01003001001 | 2017-10-10 | Y