Overview

Dataset statistics

Number of variables7
Number of observations184
Missing cells179
Missing cells (%)13.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.1 KiB
Average record size in memory61.7 B

Variable types

Numeric5
DateTime2

Dataset

Description대구광역시 서구의 쓰레기종량제봉투관리시스템의 발주 상세내역 현황입니다. 2024. 2. 23. 기준 자료이며, 발주일자, 발주수량, 발주금액 등의 데이터를 포함합니다.
Author대구광역시 서구
URLhttps://www.data.go.kr/data/15084559/fileData.do

Alerts

발주순서 is highly overall correlated with 미입고수량High correlation
발주수량 is highly overall correlated with 입고수량 and 2 other fieldsHigh correlation
입고수량 is highly overall correlated with 발주수량 and 1 other fieldsHigh correlation
발주금액 is highly overall correlated with 발주수량 and 2 other fieldsHigh correlation
미입고수량 is highly overall correlated with 발주순서 and 2 other fieldsHigh correlation
입고수량 has 9 (4.9%) missing valuesMissing
미입고수량 has 170 (92.4%) missing valuesMissing
발주순서 has unique valuesUnique

Reproduction

Analysis started2024-03-16 04:17:09.720938
Analysis finished2024-03-16 04:17:15.745551
Duration6.02 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발주순서
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct184
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean92.5
Minimum1
Maximum184
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T13:17:15.848267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.15
Q146.75
median92.5
Q3138.25
95-th percentile174.85
Maximum184
Range183
Interquartile range (IQR)91.5

Descriptive statistics

Standard deviation53.260367
Coefficient of variation (CV)0.57578775
Kurtosis-1.2
Mean92.5
Median Absolute Deviation (MAD)46
Skewness0
Sum17020
Variance2836.6667
MonotonicityStrictly increasing
2024-03-16T13:17:16.095072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
128 1
 
0.5%
119 1
 
0.5%
120 1
 
0.5%
121 1
 
0.5%
122 1
 
0.5%
123 1
 
0.5%
124 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
Other values (174) 174
94.6%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
184 1
0.5%
183 1
0.5%
182 1
0.5%
181 1
0.5%
180 1
0.5%
179 1
0.5%
178 1
0.5%
177 1
0.5%
176 1
0.5%
175 1
0.5%
Distinct56
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2016-11-03 00:00:00
Maximum2024-02-23 00:00:00
2024-03-16T13:17:16.345346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:16.572041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

발주수량
Real number (ℝ)

HIGH CORRELATION 

Distinct70
Distinct (%)38.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean270228.26
Minimum1000
Maximum1400000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T13:17:16.770385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile9000
Q150000
median110000
Q3350000
95-th percentile1100000
Maximum1400000
Range1399000
Interquartile range (IQR)300000

Descriptive statistics

Standard deviation334156.33
Coefficient of variation (CV)1.2365706
Kurtosis2.0042082
Mean270228.26
Median Absolute Deviation (MAD)90000
Skewness1.6816028
Sum49722000
Variance1.1166045 × 1011
MonotonicityNot monotonic
2024-03-16T13:17:16.956122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 15
 
8.2%
200000 14
 
7.6%
100000 10
 
5.4%
700000 9
 
4.9%
60000 8
 
4.3%
20000 7
 
3.8%
30000 6
 
3.3%
400000 6
 
3.3%
300000 5
 
2.7%
70000 5
 
2.7%
Other values (60) 99
53.8%
ValueCountFrequency (%)
1000 2
1.1%
2000 3
1.6%
3000 1
 
0.5%
4000 1
 
0.5%
5000 1
 
0.5%
6000 1
 
0.5%
9000 2
1.1%
10000 1
 
0.5%
11000 1
 
0.5%
15000 2
1.1%
ValueCountFrequency (%)
1400000 2
1.1%
1300000 1
 
0.5%
1200000 3
1.6%
1140000 1
 
0.5%
1100000 4
2.2%
1000000 2
1.1%
990000 1
 
0.5%
900000 3
1.6%
800000 1
 
0.5%
750000 1
 
0.5%

입고수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct70
Distinct (%)40.0%
Missing9
Missing (%)4.9%
Infinite0
Infinite (%)0.0%
Mean256171.43
Minimum1000
Maximum1400000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T13:17:17.175174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile9000
Q150000
median100000
Q3300000
95-th percentile1000000
Maximum1400000
Range1399000
Interquartile range (IQR)250000

Descriptive statistics

Standard deviation320769.95
Coefficient of variation (CV)1.2521691
Kurtosis2.5642559
Mean256171.43
Median Absolute Deviation (MAD)85000
Skewness1.7918115
Sum44830000
Variance1.0289336 × 1011
MonotonicityNot monotonic
2024-03-16T13:17:17.391145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 15
 
8.2%
200000 13
 
7.1%
100000 10
 
5.4%
700000 9
 
4.9%
60000 8
 
4.3%
20000 7
 
3.8%
400000 6
 
3.3%
30000 6
 
3.3%
70000 5
 
2.7%
250000 4
 
2.2%
Other values (60) 92
50.0%
(Missing) 9
 
4.9%
ValueCountFrequency (%)
1000 2
1.1%
2000 3
1.6%
3000 1
 
0.5%
4000 1
 
0.5%
6000 1
 
0.5%
9000 2
1.1%
10000 1
 
0.5%
11000 1
 
0.5%
15000 2
1.1%
17000 1
 
0.5%
ValueCountFrequency (%)
1400000 2
1.1%
1300000 1
 
0.5%
1200000 2
1.1%
1140000 1
 
0.5%
1100000 2
1.1%
1000000 2
1.1%
990000 1
 
0.5%
900000 3
1.6%
800000 1
 
0.5%
750000 1
 
0.5%

발주금액
Real number (ℝ)

HIGH CORRELATION 

Distinct119
Distinct (%)64.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13074712
Minimum1000
Maximum71500000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T13:17:17.668224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile20000
Q1760000
median6954000
Q318312500
95-th percentile53509500
Maximum71500000
Range71499000
Interquartile range (IQR)17552500

Descriptive statistics

Standard deviation16737266
Coefficient of variation (CV)1.280125
Kurtosis2.3040556
Mean13074712
Median Absolute Deviation (MAD)6854500
Skewness1.6940745
Sum2.405747 × 109
Variance2.8013607 × 1014
MonotonicityNot monotonic
2024-03-16T13:17:17.894283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 8
 
4.3%
2340000 6
 
3.3%
700000 5
 
2.7%
100000 5
 
2.7%
7200000 5
 
2.7%
22800000 4
 
2.2%
22000000 4
 
2.2%
20000 4
 
2.2%
1560000 3
 
1.6%
6840000 3
 
1.6%
Other values (109) 137
74.5%
ValueCountFrequency (%)
1000 1
 
0.5%
2000 2
1.1%
3000 1
 
0.5%
5000 1
 
0.5%
6000 1
 
0.5%
9000 1
 
0.5%
17000 2
1.1%
20000 4
2.2%
26000 1
 
0.5%
27000 1
 
0.5%
ValueCountFrequency (%)
71500000 1
0.5%
69290000 1
0.5%
66000000 2
1.1%
62700000 1
0.5%
59150000 1
0.5%
55000000 2
1.1%
54450000 1
0.5%
53820000 1
0.5%
51750000 1
0.5%
50700000 1
0.5%

미입고수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct12
Distinct (%)85.7%
Missing170
Missing (%)92.4%
Infinite0
Infinite (%)0.0%
Mean349428.57
Minimum5000
Maximum1200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T13:17:18.164034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5000
5-th percentile8250
Q173500
median150000
Q3382500
95-th percentile1135000
Maximum1200000
Range1195000
Interquartile range (IQR)309000

Descriptive statistics

Standard deviation439867.24
Coefficient of variation (CV)1.2588188
Kurtosis0.1360985
Mean349428.57
Median Absolute Deviation (MAD)136500
Skewness1.3375549
Sum4892000
Variance1.9348319 × 1011
MonotonicityNot monotonic
2024-03-16T13:17:18.363072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
150000 2
 
1.1%
1100000 2
 
1.1%
84000 1
 
0.5%
70000 1
 
0.5%
10000 1
 
0.5%
96000 1
 
0.5%
5000 1
 
0.5%
17000 1
 
0.5%
200000 1
 
0.5%
300000 1
 
0.5%
Other values (2) 2
 
1.1%
(Missing) 170
92.4%
ValueCountFrequency (%)
5000 1
0.5%
10000 1
0.5%
17000 1
0.5%
70000 1
0.5%
84000 1
0.5%
96000 1
0.5%
150000 2
1.1%
200000 1
0.5%
300000 1
0.5%
410000 1
0.5%
ValueCountFrequency (%)
1200000 1
0.5%
1100000 2
1.1%
410000 1
0.5%
300000 1
0.5%
200000 1
0.5%
150000 2
1.1%
96000 1
0.5%
84000 1
0.5%
70000 1
0.5%
17000 1
0.5%
Distinct56
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2016-11-03 00:00:00
Maximum2024-02-23 00:00:00
2024-03-16T13:17:18.998114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:19.167708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-03-16T13:17:14.608471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:10.131925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:11.850550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:12.935309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:13.861943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.728407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:10.675968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:12.046275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:13.137191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.027175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.850267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:10.891564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:12.272105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:13.309418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.209901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:15.004813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:11.189016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:12.489458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:13.504671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.352019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:15.145520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:11.586107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:12.744386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:13.704686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:17:14.494417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T13:17:19.279732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
발주순서1.0000.9960.0500.1150.2760.1330.996
발주일자0.9961.0000.6600.6920.0000.0001.000
발주수량0.0500.6601.0001.0000.8540.3240.660
입고수량0.1150.6921.0001.0000.8761.0000.692
발주금액0.2760.0000.8540.8761.0000.6070.000
미입고수량0.1330.0000.3241.0000.6071.0000.000
생성일자0.9961.0000.6600.6920.0000.0001.000
2024-03-16T13:17:19.447936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주순서발주수량입고수량발주금액미입고수량
발주순서1.000-0.150-0.220-0.1040.678
발주수량-0.1501.0001.0000.6740.757
입고수량-0.2201.0001.0000.6670.500
발주금액-0.1040.6740.6671.0000.520
미입고수량0.6780.7570.5000.5201.000

Missing values

2024-03-16T13:17:15.352403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:17:15.535428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-16T13:17:15.675021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
012016-11-03700000700000700000<NA>2016-11-03
122016-11-03100000100000100000<NA>2016-11-03
232017-01-2690000090000049500000<NA>2017-01-26
342017-01-261000001000007800000<NA>2017-01-26
452017-01-2620000020000022800000<NA>2017-01-26
562017-02-1090000900002340000<NA>2017-02-10
672017-02-1025000025000051750000<NA>2017-02-10
782017-02-101200000120000043200000<NA>2017-02-10
892017-03-14800000800000800000<NA>2017-03-14
9102017-05-24500000500000500000<NA>2017-05-24
발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
1741752024-01-26300030003000<NA>2024-01-26
1751762024-02-195000<NA>500050002024-02-19
1761772024-02-191100000<NA>110000011000002024-02-19
1771782024-02-1917000<NA>17000170002024-02-19
1781792024-02-23200000<NA>156000002000002024-02-23
1791802024-02-23300000<NA>342000003000002024-02-23
1801812024-02-23410000<NA>692900004100002024-02-23
1811822024-02-23150000<NA>39000001500002024-02-23
1821832024-02-231100000<NA>3960000011000002024-02-23
1831842024-02-231200000<NA>6600000012000002024-02-23