Overview

Dataset statistics

Number of variables5
Number of observations3248
Missing cells0
Missing cells (%)0.0%
Duplicate rows182
Duplicate rows (%)5.6%
Total size in memory133.3 KiB
Average record size in memory42.0 B

Variable types

Categorical2
Boolean2
Numeric1

Dataset

Description우수축산물학교급식전산시스템 정산 자료
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=XH484NLT8KBEPAHQMWIV31456025&infSeq=1

Alerts

품목분류 has constant value ""Constant
품명 has constant value ""Constant
부위 has constant value ""Constant
Dataset has 182 (5.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 22:35:32.860599
Analysis finished2023-12-10 22:35:33.171002
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

정산년월
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.5 KiB
202204
812 
202203
812 
202202
812 
202201
812 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202204
2nd row202204
3rd row202204
4th row202204
5th row202204

Common Values

ValueCountFrequency (%)
202204 812
25.0%
202203 812
25.0%
202202 812
25.0%
202201 812
25.0%

Length

2023-12-11T07:35:33.220434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:35:33.491769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202204 812
25.0%
202203 812
25.0%
202202 812
25.0%
202201 812
25.0%

품목분류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.5 KiB
축산물
3248 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row축산물
2nd row축산물
3rd row축산물
4th row축산물
5th row축산물

Common Values

ValueCountFrequency (%)
축산물 3248
100.0%

Length

2023-12-11T07:35:33.586796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:35:33.671629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
축산물 3248
100.0%

품명
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
False
3248 
ValueCountFrequency (%)
False 3248
100.0%
2023-12-11T07:35:33.744170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부위
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
False
3248 
ValueCountFrequency (%)
False 3248
100.0%
2023-12-11T07:35:33.819133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

총수량
Real number (ℝ)

Distinct2824
Distinct (%)86.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36488.822
Minimum7
Maximum820384
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size28.7 KiB
2023-12-11T07:35:33.904139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile140
Q12075
median10845.5
Q336745.75
95-th percentile155186.1
Maximum820384
Range820377
Interquartile range (IQR)34670.75

Descriptive statistics

Standard deviation73827.209
Coefficient of variation (CV)2.0232829
Kurtosis32.895092
Mean36488.822
Median Absolute Deviation (MAD)10222.5
Skewness4.9176337
Sum1.1851569 × 108
Variance5.4504568 × 109
MonotonicityNot monotonic
2023-12-11T07:35:34.012467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
120 9
 
0.3%
60 9
 
0.3%
70 8
 
0.2%
160 8
 
0.2%
480 8
 
0.2%
150 8
 
0.2%
240 7
 
0.2%
80 7
 
0.2%
75 7
 
0.2%
35 7
 
0.2%
Other values (2814) 3170
97.6%
ValueCountFrequency (%)
7 2
0.1%
12 2
0.1%
14 3
0.1%
15 2
0.1%
16 2
0.1%
21 4
0.1%
22 1
 
< 0.1%
24 3
0.1%
28 2
0.1%
30 3
0.1%
ValueCountFrequency (%)
820384 1
< 0.1%
769110 1
< 0.1%
759184 1
< 0.1%
723771 1
< 0.1%
722912 1
< 0.1%
711735 1
< 0.1%
678536 1
< 0.1%
677730 1
< 0.1%
615288 1
< 0.1%
612877 1
< 0.1%

Interactions

2023-12-11T07:35:32.977582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:35:34.074903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정산년월총수량
정산년월1.0000.110
총수량0.1101.000
2023-12-11T07:35:34.135128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
총수량정산년월
총수량1.0000.066
정산년월0.0661.000

Missing values

2023-12-11T07:35:33.072430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:35:33.141379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

정산년월품목분류품명부위총수량
0202204축산물NN18769
1202204축산물NN18791
2202204축산물NN156
3202204축산물NN26550
4202204축산물NN28986
5202204축산물NN2546
6202204축산물NN34754
7202204축산물NN73962
8202204축산물NN47004
9202204축산물NN8888
정산년월품목분류품명부위총수량
3238202201축산물NN3840
3239202201축산물NN4083
3240202201축산물NN15528
3241202201축산물NN8555
3242202201축산물NN1424
3243202201축산물NN10403
3244202201축산물NN42842
3245202201축산물NN23760
3246202201축산물NN204434
3247202201축산물NN34235

Duplicate rows

Most frequently occurring

정산년월품목분류품명부위총수량# duplicates
7202201축산물NN1608
52202202축산물NN708
99202203축산물NN1508
144202204축산물NN1208
4202201축산물NN807
49202202축산물NN357
96202203축산물NN757
141202204축산물NN607
13202201축산물NN3205
58202202축산물NN1405