Overview

Dataset statistics

Number of variables9
Number of observations4149
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory320.2 KiB
Average record size in memory79.0 B

Variable types

Numeric6
Categorical1
DateTime2

Dataset

Description인천광역시 서구 쓰레기종량제봉투 LOT정보에 대한 데이터로 로트(LOT), 봉투단위, 봉투종류 등의 정보가 포함되어 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15090797/fileData.do

Alerts

시작번호 has constant value ""Constant
데이터기준일자 has constant value ""Constant
판매가 is highly overall correlated with 도매가High correlation
도매가 is highly overall correlated with 판매가High correlation
종료번호 is highly skewed (γ1 = 25.64733239)Skewed
로트(LOT) has unique valuesUnique
제작업체코드 has 87 (2.1%) zerosZeros

Reproduction

Analysis started2024-03-14 08:48:30.501530
Analysis finished2024-03-14 08:48:41.326328
Duration10.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

로트(LOT)
Real number (ℝ)

UNIQUE 

Distinct4149
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40849.492
Minimum12
Maximum99998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:41.472259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile1365.4
Q112446
median33754
Q369084
95-th percentile94453.6
Maximum99998
Range99986
Interquartile range (IQR)56638

Descriptive statistics

Standard deviation31087.158
Coefficient of variation (CV)0.76101699
Kurtosis-1.1786673
Mean40849.492
Median Absolute Deviation (MAD)25428
Skewness0.39566777
Sum1.6948454 × 108
Variance9.6641137 × 108
MonotonicityNot monotonic
2024-03-14T17:48:41.726221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135 1
 
< 0.1%
1166 1
 
< 0.1%
38288 1
 
< 0.1%
92541 1
 
< 0.1%
92542 1
 
< 0.1%
92543 1
 
< 0.1%
86151 1
 
< 0.1%
86152 1
 
< 0.1%
86153 1
 
< 0.1%
91353 1
 
< 0.1%
Other values (4139) 4139
99.8%
ValueCountFrequency (%)
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
24 1
< 0.1%
25 1
< 0.1%
26 1
< 0.1%
27 1
< 0.1%
28 1
< 0.1%
29 1
< 0.1%
30 1
< 0.1%
ValueCountFrequency (%)
99998 1
< 0.1%
99997 1
< 0.1%
99996 1
< 0.1%
99995 1
< 0.1%
99994 1
< 0.1%
99993 1
< 0.1%
99992 1
< 0.1%
99991 1
< 0.1%
99990 1
< 0.1%
99989 1
< 0.1%

제작업체코드
Real number (ℝ)

ZEROS 

Distinct18
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.582791
Minimum0
Maximum99
Zeros87
Zeros (%)2.1%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:41.944014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q17
median12
Q317
95-th percentile17
Maximum99
Range99
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.1148761
Coefficient of variation (CV)0.52792769
Kurtosis28.745639
Mean11.582791
Median Absolute Deviation (MAD)5
Skewness1.7030466
Sum48057
Variance37.39171
MonotonicityNot monotonic
2024-03-14T17:48:42.152053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
17 1779
42.9%
7 714
17.2%
9 531
 
12.8%
12 444
 
10.7%
1 306
 
7.4%
2 126
 
3.0%
0 87
 
2.1%
16 51
 
1.2%
10 30
 
0.7%
6 24
 
0.6%
Other values (8) 57
 
1.4%
ValueCountFrequency (%)
0 87
 
2.1%
1 306
7.4%
2 126
 
3.0%
5 12
 
0.3%
6 24
 
0.6%
7 714
17.2%
8 15
 
0.4%
9 531
12.8%
10 30
 
0.7%
11 12
 
0.3%
ValueCountFrequency (%)
99 3
 
0.1%
26 6
 
0.1%
17 1779
42.9%
16 51
 
1.2%
15 3
 
0.1%
14 3
 
0.1%
13 3
 
0.1%
12 444
 
10.7%
11 12
 
0.3%
10 30
 
0.7%

봉투종류
Real number (ℝ)

Distinct12
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.636298
Minimum10
Maximum81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:42.343677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile10
Q110
median60
Q370
95-th percentile80
Maximum81
Range71
Interquartile range (IQR)60

Descriptive statistics

Standard deviation28.48948
Coefficient of variation (CV)0.6528849
Kurtosis-1.7370321
Mean43.636298
Median Absolute Deviation (MAD)20
Skewness-0.14618799
Sum181047
Variance811.65048
MonotonicityNot monotonic
2024-03-14T17:48:42.723073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
10 1242
29.9%
70 762
18.4%
80 432
 
10.4%
40 369
 
8.9%
61 282
 
6.8%
64 276
 
6.7%
12 240
 
5.8%
63 198
 
4.8%
11 153
 
3.7%
81 111
 
2.7%
Other values (2) 84
 
2.0%
ValueCountFrequency (%)
10 1242
29.9%
11 153
 
3.7%
12 240
 
5.8%
40 369
 
8.9%
60 81
 
2.0%
61 282
 
6.8%
63 198
 
4.8%
64 276
 
6.7%
70 762
18.4%
71 3
 
0.1%
ValueCountFrequency (%)
81 111
 
2.7%
80 432
10.4%
71 3
 
0.1%
70 762
18.4%
64 276
 
6.7%
63 198
 
4.8%
61 282
 
6.8%
60 81
 
2.0%
40 369
8.9%
12 240
 
5.8%

시작번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.5 KiB
1
4149 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 4149
100.0%

Length

2024-03-14T17:48:43.113901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T17:48:43.420687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4149
100.0%

종료번호
Real number (ℝ)

SKEWED 

Distinct271
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60825.004
Minimum1
Maximum9999999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:43.774894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10
Q1200
median1500
Q320000
95-th percentile400000
Maximum9999999
Range9999998
Interquartile range (IQR)19800

Descriptive statistics

Standard deviation213047.37
Coefficient of variation (CV)3.5026281
Kurtosis1144.7132
Mean60825.004
Median Absolute Deviation (MAD)1484
Skewness25.647332
Sum2.5236294 × 108
Variance4.5389182 × 1010
MonotonicityNot monotonic
2024-03-14T17:48:44.229620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1000 303
 
7.3%
100 191
 
4.6%
2000 181
 
4.4%
500 179
 
4.3%
5000 159
 
3.8%
100000 153
 
3.7%
10000 143
 
3.4%
200 134
 
3.2%
200000 130
 
3.1%
20000 121
 
2.9%
Other values (261) 2455
59.2%
ValueCountFrequency (%)
1 90
2.2%
2 5
 
0.1%
3 4
 
0.1%
4 5
 
0.1%
5 24
 
0.6%
6 2
 
< 0.1%
7 4
 
0.1%
8 3
 
0.1%
9 2
 
< 0.1%
10 73
1.8%
ValueCountFrequency (%)
9999999 1
 
< 0.1%
1400000 1
 
< 0.1%
1300000 1
 
< 0.1%
1200000 3
 
0.1%
1000000 10
0.2%
970000 1
 
< 0.1%
910000 1
 
< 0.1%
900000 2
 
< 0.1%
850000 2
 
< 0.1%
800000 6
0.1%

판매가
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2030.3868
Minimum0
Maximum10000
Zeros6
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:44.635757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100
Q1310
median620
Q33070
95-th percentile7660
Maximum10000
Range10000
Interquartile range (IQR)2760

Descriptive statistics

Standard deviation2481.2768
Coefficient of variation (CV)1.222071
Kurtosis2.3762948
Mean2030.3868
Median Absolute Deviation (MAD)490
Skewness1.700714
Sum8424075
Variance6156734.5
MonotonicityNot monotonic
2024-03-14T17:48:45.055452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
620 690
16.6%
310 546
13.2%
1540 342
 
8.2%
160 264
 
6.4%
1000 204
 
4.9%
5000 201
 
4.8%
3000 198
 
4.8%
3070 198
 
4.8%
1 162
 
3.9%
10000 162
 
3.9%
Other values (29) 1182
28.5%
ValueCountFrequency (%)
0 6
 
0.1%
1 162
3.9%
60 12
 
0.3%
70 9
 
0.2%
93 6
 
0.1%
100 30
 
0.7%
120 63
 
1.5%
130 3
 
0.1%
155 9
 
0.2%
160 264
6.4%
ValueCountFrequency (%)
10000 162
3.9%
7660 135
3.3%
7200 6
 
0.1%
6770 3
 
0.1%
5060 99
2.4%
5000 201
4.8%
4730 87
2.1%
4400 12
 
0.3%
3720 12
 
0.3%
3710 144
3.5%

도매가
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1874.7304
Minimum0
Maximum10000
Zeros6
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size36.6 KiB
2024-03-14T17:48:45.459781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile93
Q1285
median571
Q32825
95-th percentile7050
Maximum10000
Range10000
Interquartile range (IQR)2540

Descriptive statistics

Standard deviation2296.3517
Coefficient of variation (CV)1.224897
Kurtosis2.4638075
Mean1874.7304
Median Absolute Deviation (MAD)450
Skewness1.7171012
Sum7778256.3
Variance5273231
MonotonicityNot monotonic
2024-03-14T17:48:45.896996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
571.0 684
16.5%
285.0 546
13.2%
1417.0 342
 
8.2%
148.0 264
 
6.4%
2825.0 198
 
4.8%
921.0 186
 
4.5%
4605.0 183
 
4.4%
2763.0 180
 
4.3%
1.0 162
 
3.9%
9210.0 147
 
3.5%
Other values (38) 1257
30.3%
ValueCountFrequency (%)
0.0 6
 
0.1%
1.0 162
3.9%
56.0 12
 
0.3%
65.0 9
 
0.2%
85.0 3
 
0.1%
85.6 3
 
0.1%
93.0 30
 
0.7%
112.0 63
 
1.5%
121.0 3
 
0.1%
142.0 6
 
0.1%
ValueCountFrequency (%)
10000.0 15
 
0.4%
9210.0 147
3.5%
7660.0 3
 
0.1%
7050.0 132
3.2%
6770.0 3
 
0.1%
6696.0 6
 
0.1%
5000.0 18
 
0.4%
4605.0 183
4.4%
4600.0 99
2.4%
4300.0 87
2.1%
Distinct358
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size32.5 KiB
Minimum2000-12-18 00:00:00
Maximum2023-10-27 00:00:00
2024-03-14T17:48:46.290740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:46.725225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.5 KiB
Minimum2023-12-06 00:00:00
Maximum2023-12-06 00:00:00
2024-03-14T17:48:47.079485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:47.392848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-14T17:48:39.454544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:30.862288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:32.669497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:34.417803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:36.134806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:38.256849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:39.710478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:31.125013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:32.976056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:34.591013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:36.523720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:38.429013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:39.968195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:31.385129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:33.299170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:34.869560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:36.815412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:38.589233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:40.388468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:31.673645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:33.653101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:35.202758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:37.157356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:38.758821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:40.548982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:31.995342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:34.067203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:35.527512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:37.524604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:38.940774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:40.702657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:32.364449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:34.256714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:35.831203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:37.888430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:48:39.197557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T17:48:47.604205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
로트(LOT)제작업체코드봉투종류종료번호판매가도매가
로트(LOT)1.0000.1990.2620.0230.1680.152
제작업체코드0.1991.0000.4370.4160.4560.584
봉투종류0.2620.4371.0000.0800.7500.716
종료번호0.0230.4160.0801.0000.0000.000
판매가0.1680.4560.7500.0001.0000.964
도매가0.1520.5840.7160.0000.9641.000
2024-03-14T17:48:47.885790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
로트(LOT)제작업체코드봉투종류종료번호판매가도매가
로트(LOT)1.000-0.154-0.113-0.072-0.090-0.089
제작업체코드-0.1541.000-0.0240.155-0.391-0.395
봉투종류-0.113-0.0241.000-0.1680.0680.072
종료번호-0.0720.155-0.1681.000-0.131-0.130
판매가-0.090-0.3910.068-0.1311.0000.999
도매가-0.089-0.3950.072-0.1300.9991.000

Missing values

2024-03-14T17:48:40.910739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T17:48:41.151846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

로트(LOT)제작업체코드봉투종류시작번호종료번호판매가도매가LOT발생일데이터기준일자
01351101100130121.02000-12-182023-12-06
113611015000130121.02000-12-182023-12-06
21371101100000130121.02000-12-182023-12-06
37621101500250231.02000-12-182023-12-06
4763110125000250231.02000-12-182023-12-06
57641101500000250231.02000-12-182023-12-06
61062110140024402247.02000-12-182023-12-06
710631101400024402247.02000-12-182023-12-06
8106411014000024402247.02000-12-182023-12-06
913921101600500462.02000-12-182023-12-06
로트(LOT)제작업체코드봉투종류시작번호종료번호판매가도매가LOT발생일데이터기준일자
413928544174015000076607050.02022-02-042023-12-06
41408426717801300310285.02022-02-042023-12-06
414184268178013000310285.02022-02-042023-12-06
41428426917801300000310285.02022-02-042023-12-06
41431875317801600620571.02022-02-042023-12-06
414418754178016000620571.02022-02-042023-12-06
41451875517801600000620571.02022-02-042023-12-06
41462430317811100620571.02022-02-042023-12-06
414724304178111000620571.02022-02-042023-12-06
41482430517811100000620571.02022-02-042023-12-06