Overview

Dataset statistics

Number of variables7
Number of observations184
Missing cells179
Missing cells (%)13.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.1 KiB
Average record size in memory61.7 B

Variable types

Numeric5
DateTime2

Dataset

Description대구광역시 서구_쓰레기종량제봉투 발주 상세내역 현황_20240223
Author대구광역시 서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15084559&dataSetDetailId=150845591e0d16d5eb218&provdMethod=FILE

Alerts

발주순서 is highly overall correlated with 미입고수량High correlation
발주수량 is highly overall correlated with 입고수량 and 2 other fieldsHigh correlation
입고수량 is highly overall correlated with 발주수량 and 1 other fieldsHigh correlation
발주금액 is highly overall correlated with 발주수량 and 2 other fieldsHigh correlation
미입고수량 is highly overall correlated with 발주순서 and 2 other fieldsHigh correlation
입고수량 has 9 (4.9%) missing valuesMissing
미입고수량 has 170 (92.4%) missing valuesMissing
발주순서 has unique valuesUnique

Reproduction

Analysis started2024-03-16 03:54:46.635409
Analysis finished2024-03-16 03:54:59.793736
Duration13.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발주순서
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct184
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean92.5
Minimum1
Maximum184
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T03:55:00.061632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.15
Q146.75
median92.5
Q3138.25
95-th percentile174.85
Maximum184
Range183
Interquartile range (IQR)91.5

Descriptive statistics

Standard deviation53.260367
Coefficient of variation (CV)0.57578775
Kurtosis-1.2
Mean92.5
Median Absolute Deviation (MAD)46
Skewness0
Sum17020
Variance2836.6667
MonotonicityStrictly increasing
2024-03-16T03:55:00.480531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
128 1
 
0.5%
119 1
 
0.5%
120 1
 
0.5%
121 1
 
0.5%
122 1
 
0.5%
123 1
 
0.5%
124 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
Other values (174) 174
94.6%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
184 1
0.5%
183 1
0.5%
182 1
0.5%
181 1
0.5%
180 1
0.5%
179 1
0.5%
178 1
0.5%
177 1
0.5%
176 1
0.5%
175 1
0.5%
Distinct56
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2016-11-03 00:00:00
Maximum2024-02-23 00:00:00
2024-03-16T03:55:01.034028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:55:01.466057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

발주수량
Real number (ℝ)

HIGH CORRELATION 

Distinct70
Distinct (%)38.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean270228.26
Minimum1000
Maximum1400000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T03:55:01.868600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile9000
Q150000
median110000
Q3350000
95-th percentile1100000
Maximum1400000
Range1399000
Interquartile range (IQR)300000

Descriptive statistics

Standard deviation334156.33
Coefficient of variation (CV)1.2365706
Kurtosis2.0042082
Mean270228.26
Median Absolute Deviation (MAD)90000
Skewness1.6816028
Sum49722000
Variance1.1166045 × 1011
MonotonicityNot monotonic
2024-03-16T03:55:02.570930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 15
 
8.2%
200000 14
 
7.6%
100000 10
 
5.4%
700000 9
 
4.9%
60000 8
 
4.3%
20000 7
 
3.8%
30000 6
 
3.3%
400000 6
 
3.3%
300000 5
 
2.7%
70000 5
 
2.7%
Other values (60) 99
53.8%
ValueCountFrequency (%)
1000 2
1.1%
2000 3
1.6%
3000 1
 
0.5%
4000 1
 
0.5%
5000 1
 
0.5%
6000 1
 
0.5%
9000 2
1.1%
10000 1
 
0.5%
11000 1
 
0.5%
15000 2
1.1%
ValueCountFrequency (%)
1400000 2
1.1%
1300000 1
 
0.5%
1200000 3
1.6%
1140000 1
 
0.5%
1100000 4
2.2%
1000000 2
1.1%
990000 1
 
0.5%
900000 3
1.6%
800000 1
 
0.5%
750000 1
 
0.5%

입고수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct70
Distinct (%)40.0%
Missing9
Missing (%)4.9%
Infinite0
Infinite (%)0.0%
Mean256171.43
Minimum1000
Maximum1400000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T03:55:03.196400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile9000
Q150000
median100000
Q3300000
95-th percentile1000000
Maximum1400000
Range1399000
Interquartile range (IQR)250000

Descriptive statistics

Standard deviation320769.95
Coefficient of variation (CV)1.2521691
Kurtosis2.5642559
Mean256171.43
Median Absolute Deviation (MAD)85000
Skewness1.7918115
Sum44830000
Variance1.0289336 × 1011
MonotonicityNot monotonic
2024-03-16T03:55:03.696094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 15
 
8.2%
200000 13
 
7.1%
100000 10
 
5.4%
700000 9
 
4.9%
60000 8
 
4.3%
20000 7
 
3.8%
400000 6
 
3.3%
30000 6
 
3.3%
70000 5
 
2.7%
250000 4
 
2.2%
Other values (60) 92
50.0%
(Missing) 9
 
4.9%
ValueCountFrequency (%)
1000 2
1.1%
2000 3
1.6%
3000 1
 
0.5%
4000 1
 
0.5%
6000 1
 
0.5%
9000 2
1.1%
10000 1
 
0.5%
11000 1
 
0.5%
15000 2
1.1%
17000 1
 
0.5%
ValueCountFrequency (%)
1400000 2
1.1%
1300000 1
 
0.5%
1200000 2
1.1%
1140000 1
 
0.5%
1100000 2
1.1%
1000000 2
1.1%
990000 1
 
0.5%
900000 3
1.6%
800000 1
 
0.5%
750000 1
 
0.5%

발주금액
Real number (ℝ)

HIGH CORRELATION 

Distinct119
Distinct (%)64.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13074712
Minimum1000
Maximum71500000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T03:55:04.301383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile20000
Q1760000
median6954000
Q318312500
95-th percentile53509500
Maximum71500000
Range71499000
Interquartile range (IQR)17552500

Descriptive statistics

Standard deviation16737266
Coefficient of variation (CV)1.280125
Kurtosis2.3040556
Mean13074712
Median Absolute Deviation (MAD)6854500
Skewness1.6940745
Sum2.405747 × 109
Variance2.8013607 × 1014
MonotonicityNot monotonic
2024-03-16T03:55:04.839851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000 8
 
4.3%
2340000 6
 
3.3%
700000 5
 
2.7%
100000 5
 
2.7%
7200000 5
 
2.7%
22800000 4
 
2.2%
22000000 4
 
2.2%
20000 4
 
2.2%
1560000 3
 
1.6%
6840000 3
 
1.6%
Other values (109) 137
74.5%
ValueCountFrequency (%)
1000 1
 
0.5%
2000 2
1.1%
3000 1
 
0.5%
5000 1
 
0.5%
6000 1
 
0.5%
9000 1
 
0.5%
17000 2
1.1%
20000 4
2.2%
26000 1
 
0.5%
27000 1
 
0.5%
ValueCountFrequency (%)
71500000 1
0.5%
69290000 1
0.5%
66000000 2
1.1%
62700000 1
0.5%
59150000 1
0.5%
55000000 2
1.1%
54450000 1
0.5%
53820000 1
0.5%
51750000 1
0.5%
50700000 1
0.5%

미입고수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct12
Distinct (%)85.7%
Missing170
Missing (%)92.4%
Infinite0
Infinite (%)0.0%
Mean349428.57
Minimum5000
Maximum1200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-16T03:55:05.331799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5000
5-th percentile8250
Q173500
median150000
Q3382500
95-th percentile1135000
Maximum1200000
Range1195000
Interquartile range (IQR)309000

Descriptive statistics

Standard deviation439867.24
Coefficient of variation (CV)1.2588188
Kurtosis0.1360985
Mean349428.57
Median Absolute Deviation (MAD)136500
Skewness1.3375549
Sum4892000
Variance1.9348319 × 1011
MonotonicityNot monotonic
2024-03-16T03:55:05.930642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
150000 2
 
1.1%
1100000 2
 
1.1%
84000 1
 
0.5%
70000 1
 
0.5%
10000 1
 
0.5%
96000 1
 
0.5%
5000 1
 
0.5%
17000 1
 
0.5%
200000 1
 
0.5%
300000 1
 
0.5%
Other values (2) 2
 
1.1%
(Missing) 170
92.4%
ValueCountFrequency (%)
5000 1
0.5%
10000 1
0.5%
17000 1
0.5%
70000 1
0.5%
84000 1
0.5%
96000 1
0.5%
150000 2
1.1%
200000 1
0.5%
300000 1
0.5%
410000 1
0.5%
ValueCountFrequency (%)
1200000 1
0.5%
1100000 2
1.1%
410000 1
0.5%
300000 1
0.5%
200000 1
0.5%
150000 2
1.1%
96000 1
0.5%
84000 1
0.5%
70000 1
0.5%
17000 1
0.5%
Distinct56
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2016-11-03 00:00:00
Maximum2024-02-23 00:00:00
2024-03-16T03:55:06.312455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:55:06.781018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-03-16T03:54:57.342386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:51.674101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:53.135561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:54.529787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:55.935127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:57.591171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:51.958129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:53.435666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:54.836084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:56.202670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:57.845991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:52.310287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:53.738842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:55.110719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:56.472872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:58.094466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:52.598078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:54.009439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:55.399386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:56.766971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:58.464226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:52.870986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:54.283201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:55.669114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T03:54:57.084877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T03:55:07.034883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
발주순서1.0000.9960.0500.1150.2760.1330.996
발주일자0.9961.0000.6600.6920.0000.0001.000
발주수량0.0500.6601.0001.0000.8540.3240.660
입고수량0.1150.6921.0001.0000.8761.0000.692
발주금액0.2760.0000.8540.8761.0000.6070.000
미입고수량0.1330.0000.3241.0000.6071.0000.000
생성일자0.9961.0000.6600.6920.0000.0001.000
2024-03-16T03:55:07.330002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주순서발주수량입고수량발주금액미입고수량
발주순서1.000-0.150-0.220-0.1040.678
발주수량-0.1501.0001.0000.6740.757
입고수량-0.2201.0001.0000.6670.500
발주금액-0.1040.6740.6671.0000.520
미입고수량0.6780.7570.5000.5201.000

Missing values

2024-03-16T03:54:58.828357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T03:54:59.213560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-16T03:54:59.670466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
012016-11-03700000700000700000<NA>2016-11-03
122016-11-03100000100000100000<NA>2016-11-03
232017-01-2690000090000049500000<NA>2017-01-26
342017-01-261000001000007800000<NA>2017-01-26
452017-01-2620000020000022800000<NA>2017-01-26
562017-02-1090000900002340000<NA>2017-02-10
672017-02-1025000025000051750000<NA>2017-02-10
782017-02-101200000120000043200000<NA>2017-02-10
892017-03-14800000800000800000<NA>2017-03-14
9102017-05-24500000500000500000<NA>2017-05-24
발주순서발주일자발주수량입고수량발주금액미입고수량생성일자
1741752024-01-26300030003000<NA>2024-01-26
1751762024-02-195000<NA>500050002024-02-19
1761772024-02-191100000<NA>110000011000002024-02-19
1771782024-02-1917000<NA>17000170002024-02-19
1781792024-02-23200000<NA>156000002000002024-02-23
1791802024-02-23300000<NA>342000003000002024-02-23
1801812024-02-23410000<NA>692900004100002024-02-23
1811822024-02-23150000<NA>39000001500002024-02-23
1821832024-02-231100000<NA>3960000011000002024-02-23
1831842024-02-231200000<NA>6600000012000002024-02-23