Overview

Dataset statistics

Number of variables4
Number of observations124
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory36.1 B

Variable types

Numeric2
Categorical2

Dataset

Description인천시립박물관 문화유산표준관리시스템 연관유물정보입니다. 연관유물번호, 일련번호, 소장구분에 대한 정보를 제공합니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15119583&srcSe=7661IVAWM27C61E190

Alerts

일련번호 is highly overall correlated with 소장구분High correlation
소장구분 is highly overall correlated with 일련번호High correlation
일련번호 is highly imbalanced (91.5%)Imbalance
소장구분 is highly imbalanced (89.9%)Imbalance
순번 has unique valuesUnique
연관유물번호 has unique valuesUnique

Reproduction

Analysis started2024-03-18 03:38:04.746126
Analysis finished2024-03-18 03:38:06.462341
Duration1.72 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct124
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62.5
Minimum1
Maximum124
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-03-18T12:38:06.535764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.15
Q131.75
median62.5
Q393.25
95-th percentile117.85
Maximum124
Range123
Interquartile range (IQR)61.5

Descriptive statistics

Standard deviation35.939764
Coefficient of variation (CV)0.57503623
Kurtosis-1.2
Mean62.5
Median Absolute Deviation (MAD)31
Skewness0
Sum7750
Variance1291.6667
MonotonicityStrictly increasing
2024-03-18T12:38:06.689566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
80 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
Other values (114) 114
91.9%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%
119 1
0.8%
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%

연관유물번호
Real number (ℝ)

UNIQUE 

Distinct124
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5044.3629
Minimum80
Maximum9596
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-03-18T12:38:06.829659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum80
5-th percentile771.45
Q11976.75
median5684
Q37647.75
95-th percentile8273.85
Maximum9596
Range9516
Interquartile range (IQR)5671

Descriptive statistics

Standard deviation2895.9344
Coefficient of variation (CV)0.57409319
Kurtosis-1.4724943
Mean5044.3629
Median Absolute Deviation (MAD)2346
Skewness-0.32761817
Sum625501
Variance8386435.9
MonotonicityNot monotonic
2024-03-18T12:38:06.941740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1076 1
 
0.8%
8254 1
 
0.8%
8149 1
 
0.8%
3827 1
 
0.8%
6142 1
 
0.8%
3822 1
 
0.8%
7268 1
 
0.8%
80 1
 
0.8%
7023 1
 
0.8%
5615 1
 
0.8%
Other values (114) 114
91.9%
ValueCountFrequency (%)
80 1
0.8%
128 1
0.8%
424 1
0.8%
555 1
0.8%
747 1
0.8%
765 1
0.8%
771 1
0.8%
774 1
0.8%
826 1
0.8%
830 1
0.8%
ValueCountFrequency (%)
9596 1
0.8%
9593 1
0.8%
8823 1
0.8%
8626 1
0.8%
8621 1
0.8%
8277 1
0.8%
8274 1
0.8%
8273 1
0.8%
8257 1
0.8%
8255 1
0.8%

일련번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
0
122 
1
 
1
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)1.6%

Sample

1st row0
2nd row1
3rd row2
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 122
98.4%
1 1
 
0.8%
2 1
 
0.8%

Length

2024-03-18T12:38:07.050434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:38:07.128441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 122
98.4%
1 1
 
0.8%
2 1
 
0.8%

소장구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
PS01003001001
121 
PS01003001004
 
1
PS01003316001
 
1
PS01003316003
 
1

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique3 ?
Unique (%)2.4%

Sample

1st rowPS01003001001
2nd rowPS01003001004
3rd rowPS01003316001
4th rowPS01003316003
5th rowPS01003001001

Common Values

ValueCountFrequency (%)
PS01003001001 121
97.6%
PS01003001004 1
 
0.8%
PS01003316001 1
 
0.8%
PS01003316003 1
 
0.8%

Length

2024-03-18T12:38:07.217646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:38:07.311459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ps01003001001 121
97.6%
ps01003001004 1
 
0.8%
ps01003316001 1
 
0.8%
ps01003316003 1
 
0.8%

Interactions

2024-03-18T12:38:06.003543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:38:05.813067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:38:06.231990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:38:05.930856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T12:38:07.371533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연관유물번호일련번호소장구분
순번1.0000.0000.0000.000
연관유물번호0.0001.0000.0000.234
일련번호0.0000.0001.0001.000
소장구분0.0000.2341.0001.000
2024-03-18T12:38:07.449460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호소장구분
일련번호1.0000.996
소장구분0.9961.000
2024-03-18T12:38:07.521898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연관유물번호일련번호소장구분
순번1.000-0.0520.0000.000
연관유물번호-0.0521.0000.0000.136
일련번호0.0000.0001.0000.996
소장구분0.0000.1360.9961.000

Missing values

2024-03-18T12:38:06.336077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T12:38:06.419039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번연관유물번호일련번호소장구분
0110760PS01003001001
127711PS01003001004
2311172PS01003316001
3426280PS01003316003
4573340PS01003001001
5668280PS01003001001
6710450PS01003001001
7882730PS01003001001
8980800PS01003001001
91080810PS01003001001
순번연관유물번호일련번호소장구분
11411577900PS01003001001
11511695930PS01003001001
11611710980PS01003001001
11711867190PS01003001001
11811974490PS01003001001
11912010990PS01003001001
12012129990PS01003001001
12112276180PS01003001001
12212381100PS01003001001
1231245550PS01003001001