Overview

Dataset statistics

Number of variables3
Number of observations619
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.8 KiB
Average record size in memory26.2 B

Variable types

Numeric1
Categorical2

Dataset

Description품목별규격DB 관리(일련번호, 최종수정일, 평균별점) 품목의 일련번호 규격DB의 마지막 수정일자 별점의 평균값
URLhttps://www.data.go.kr/data/15070457/fileData.do

Alerts

일련번호 is highly overall correlated with 최종수정일High correlation
최종수정일 is highly overall correlated with 일련번호High correlation
평균별점 is highly imbalanced (98.3%)Imbalance
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:19:41.705917
Analysis finished2023-12-12 23:19:42.125147
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct619
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean548.88691
Minimum22
Maximum996
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-13T08:19:42.202375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22
5-th percentile149.9
Q1320.5
median602
Q3756.5
95-th percentile949.1
Maximum996
Range974
Interquartile range (IQR)436

Descriptive statistics

Standard deviation259.55234
Coefficient of variation (CV)0.47287034
Kurtosis-1.2316506
Mean548.88691
Median Absolute Deviation (MAD)226
Skewness-0.061778035
Sum339761
Variance67367.418
MonotonicityNot monotonic
2023-12-13T08:19:42.341228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
199 1
 
0.2%
435 1
 
0.2%
842 1
 
0.2%
320 1
 
0.2%
309 1
 
0.2%
312 1
 
0.2%
322 1
 
0.2%
324 1
 
0.2%
450 1
 
0.2%
444 1
 
0.2%
Other values (609) 609
98.4%
ValueCountFrequency (%)
22 1
0.2%
43 1
0.2%
88 1
0.2%
92 1
0.2%
93 1
0.2%
119 1
0.2%
120 1
0.2%
121 1
0.2%
124 1
0.2%
125 1
0.2%
ValueCountFrequency (%)
996 1
0.2%
995 1
0.2%
991 1
0.2%
990 1
0.2%
989 1
0.2%
988 1
0.2%
987 1
0.2%
986 1
0.2%
985 1
0.2%
984 1
0.2%

최종수정일
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2017-01-07
168 
2018-12-11
65 
2013-12-19
58 
2015-12-29
58 
2013-01-22
48 
Other values (19)
222 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row2013-01-22
2nd row2013-01-22
3rd row2015-12-20
4th row2014-12-25
5th row2015-12-20

Common Values

ValueCountFrequency (%)
2017-01-07 168
27.1%
2018-12-11 65
 
10.5%
2013-12-19 58
 
9.4%
2015-12-29 58
 
9.4%
2013-01-22 48
 
7.8%
2015-12-21 43
 
6.9%
2013-01-30 36
 
5.8%
2015-12-20 28
 
4.5%
2015-12-24 25
 
4.0%
2020-01-26 23
 
3.7%
Other values (14) 67
 
10.8%

Length

2023-12-13T08:19:42.467798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2017-01-07 168
27.1%
2018-12-11 65
 
10.5%
2013-12-19 58
 
9.4%
2015-12-29 58
 
9.4%
2013-01-22 48
 
7.8%
2015-12-21 43
 
6.9%
2013-01-30 36
 
5.8%
2015-12-20 28
 
4.5%
2015-12-24 25
 
4.0%
2020-01-26 23
 
3.7%
Other values (14) 67
 
10.8%

평균별점
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
0
618 
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 618
99.8%
4 1
 
0.2%

Length

2023-12-13T08:19:42.574003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:19:42.695954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 618
99.8%
4 1
 
0.2%

Interactions

2023-12-13T08:19:41.849921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:19:42.770714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호최종수정일평균별점
일련번호1.0000.9080.508
최종수정일0.9081.0000.000
평균별점0.5080.0001.000
2023-12-13T08:19:42.888529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최종수정일평균별점
최종수정일1.0000.000
평균별점0.0001.000
2023-12-13T08:19:42.967759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호최종수정일평균별점
일련번호1.0000.6270.389
최종수정일0.6271.0000.000
평균별점0.3890.0001.000

Missing values

2023-12-13T08:19:41.987858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:19:42.087583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호최종수정일평균별점
01992013-01-220
12012013-01-220
25982015-12-200
3882014-12-250
41352015-12-200
5932013-01-304
61402013-01-220
71212020-01-260
81302013-01-220
91432013-01-220
일련번호최종수정일평균별점
6099752020-01-260
6109772020-01-260
6119782020-01-260
6129792020-01-260
6139822020-01-260
6149832020-01-260
6159882020-01-270
6169892020-01-270
6179952021-05-280
6189962021-05-280