Overview

Dataset statistics

Number of variables10
Number of observations3735
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory313.8 KiB
Average record size in memory86.0 B

Variable types

Numeric5
Categorical4
DateTime1

Dataset

Description인천광역시 서구 쓰레기종량제봉투 LOT정보에 대한 데이터로 LOT코드, 봉투단위, 봉투종류 등의 정보가 포함되어 있습니다.
Author인천광역시 서구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15090797&srcSe=7661IVAWM27C61E190

Alerts

시작번호 has constant value ""Constant
데이터기준일자 has constant value ""Constant
제작업체코드 is highly overall correlated with 봉투종류High correlation
판매가 is highly overall correlated with 도매가 and 1 other fieldsHigh correlation
도매가 is highly overall correlated with 판매가 and 1 other fieldsHigh correlation
봉투종류 is highly overall correlated with 제작업체코드 and 2 other fieldsHigh correlation
종료번호 is highly skewed (γ1 = 26.6644523)Skewed
LOT코드 has unique valuesUnique
제작업체코드 has 87 (2.3%) zerosZeros

Reproduction

Analysis started2024-03-18 01:56:26.970336
Analysis finished2024-03-18 01:56:29.698539
Duration2.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

LOT코드
Real number (ℝ)

UNIQUE 

Distinct3735
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41693.569
Minimum12
Maximum99998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size33.0 KiB
2024-03-18T10:56:29.762233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile1350.7
Q112885.5
median34711
Q369805.5
95-th percentile94912.3
Maximum99998
Range99986
Interquartile range (IQR)56920

Descriptive statistics

Standard deviation31371.587
Coefficient of variation (CV)0.75243227
Kurtosis-1.2180453
Mean41693.569
Median Absolute Deviation (MAD)26385
Skewness0.35746095
Sum1.5572548 × 108
Variance9.8417645 × 108
MonotonicityNot monotonic
2024-03-18T10:56:29.871141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135 1
 
< 0.1%
15608 1
 
< 0.1%
26435 1
 
< 0.1%
13245 1
 
< 0.1%
13246 1
 
< 0.1%
13247 1
 
< 0.1%
6213 1
 
< 0.1%
6214 1
 
< 0.1%
6215 1
 
< 0.1%
20670 1
 
< 0.1%
Other values (3725) 3725
99.7%
ValueCountFrequency (%)
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
24 1
< 0.1%
25 1
< 0.1%
26 1
< 0.1%
27 1
< 0.1%
28 1
< 0.1%
29 1
< 0.1%
30 1
< 0.1%
ValueCountFrequency (%)
99998 1
< 0.1%
99997 1
< 0.1%
99996 1
< 0.1%
99995 1
< 0.1%
99994 1
< 0.1%
99993 1
< 0.1%
99992 1
< 0.1%
99991 1
< 0.1%
99990 1
< 0.1%
99989 1
< 0.1%

제작업체코드
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.334137
Minimum0
Maximum99
Zeros87
Zeros (%)2.3%
Negative0
Negative (%)0.0%
Memory size33.0 KiB
2024-03-18T10:56:29.982868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q17
median12
Q317
95-th percentile17
Maximum99
Range99
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2381113
Coefficient of variation (CV)0.55038258
Kurtosis29.807596
Mean11.334137
Median Absolute Deviation (MAD)5
Skewness1.8632484
Sum42333
Variance38.914033
MonotonicityNot monotonic
2024-03-18T10:56:30.124195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
17 1515
40.6%
7 657
17.6%
12 444
 
11.9%
9 438
 
11.7%
1 306
 
8.2%
2 126
 
3.4%
0 87
 
2.3%
16 51
 
1.4%
10 30
 
0.8%
6 24
 
0.6%
Other values (8) 57
 
1.5%
ValueCountFrequency (%)
0 87
 
2.3%
1 306
8.2%
2 126
 
3.4%
5 12
 
0.3%
6 24
 
0.6%
7 657
17.6%
8 15
 
0.4%
9 438
11.7%
10 30
 
0.8%
11 12
 
0.3%
ValueCountFrequency (%)
99 3
 
0.1%
26 6
 
0.2%
17 1515
40.6%
16 51
 
1.4%
15 3
 
0.1%
14 3
 
0.1%
13 3
 
0.1%
12 444
 
11.9%
11 12
 
0.3%
10 30
 
0.8%

봉투단위
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size29.3 KiB
낱장
1247 
묶음
1245 
박스
1243 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row박스
2nd row묶음
3rd row낱장
4th row박스
5th row묶음

Common Values

ValueCountFrequency (%)
낱장 1247
33.4%
묶음 1245
33.3%
박스 1243
33.3%

Length

2024-03-18T10:56:30.268209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T10:56:30.358250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
낱장 1247
33.4%
묶음 1245
33.3%
박스 1243
33.3%

봉투종류
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size29.3 KiB
일반용 10L
288 
일반용 20L
284 
일반용 50L
252 
일반용 100L
 
222
스티커 5000원권
 
189
Other values (34)
2500 

Length

Max length15
Median length12
Mean length8.3419009
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반용 5L
2nd row일반용 5L
3rd row일반용 5L
4th row일반용 10L
5th row일반용 10L

Common Values

ValueCountFrequency (%)
일반용 10L 288
 
7.7%
일반용 20L 284
 
7.6%
일반용 50L 252
 
6.7%
일반용 100L 222
 
5.9%
스티커 5000원권 189
 
5.1%
스티커 1000원권 189
 
5.1%
스티커 3000원권 186
 
5.0%
일반용 5L 166
 
4.4%
사업계용125L 153
 
4.1%
스티커 10000원권 153
 
4.1%
Other values (29) 1653
44.3%

Length

2024-03-18T10:56:30.454921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반용 1437
19.9%
스티커 783
 
10.8%
20l 488
 
6.7%
10l 399
 
5.5%
음식물 348
 
4.8%
재사용 327
 
4.5%
50l 324
 
4.5%
5l 244
 
3.4%
필증 234
 
3.2%
사업계용 225
 
3.1%
Other values (25) 2427
33.5%

시작번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size29.3 KiB
1
3735 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 3735
100.0%

Length

2024-03-18T10:56:30.544906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T10:56:30.618880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 3735
100.0%

종료번호
Real number (ℝ)

SKEWED 

Distinct267
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60468.92
Minimum1
Maximum9999999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size33.0 KiB
2024-03-18T10:56:30.710800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10
Q1200
median1600
Q320000
95-th percentile400000
Maximum9999999
Range9999998
Interquartile range (IQR)19800

Descriptive statistics

Standard deviation217187.87
Coefficient of variation (CV)3.5917272
Kurtosis1176.8175
Mean60468.92
Median Absolute Deviation (MAD)1580
Skewness26.664452
Sum2.2585142 × 108
Variance4.7170569 × 1010
MonotonicityNot monotonic
2024-03-18T10:56:31.004813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1000 262
 
7.0%
2000 168
 
4.5%
500 159
 
4.3%
100 157
 
4.2%
5000 152
 
4.1%
10000 134
 
3.6%
100000 126
 
3.4%
200 122
 
3.3%
20000 114
 
3.1%
200000 107
 
2.9%
Other values (257) 2234
59.8%
ValueCountFrequency (%)
1 89
2.4%
2 4
 
0.1%
3 4
 
0.1%
4 3
 
0.1%
5 18
 
0.5%
6 2
 
0.1%
7 4
 
0.1%
8 2
 
0.1%
9 1
 
< 0.1%
10 65
1.7%
ValueCountFrequency (%)
9999999 1
 
< 0.1%
1400000 1
 
< 0.1%
1300000 1
 
< 0.1%
1000000 7
0.2%
970000 1
 
< 0.1%
910000 1
 
< 0.1%
900000 1
 
< 0.1%
850000 2
 
0.1%
800000 6
0.2%
780000 1
 
< 0.1%

판매가
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2067.4096
Minimum0
Maximum10000
Zeros6
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size33.0 KiB
2024-03-18T10:56:31.104212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile93
Q1310
median930
Q33070
95-th percentile7660
Maximum10000
Range10000
Interquartile range (IQR)2760

Descriptive statistics

Standard deviation2507.1206
Coefficient of variation (CV)1.2126869
Kurtosis2.2588884
Mean2067.4096
Median Absolute Deviation (MAD)750
Skewness1.6801246
Sum7721775
Variance6285653.8
MonotonicityNot monotonic
2024-03-18T10:56:31.220178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
620 621
16.6%
310 477
12.8%
1540 318
 
8.5%
160 231
 
6.2%
3070 198
 
5.3%
1000 189
 
5.1%
5000 186
 
5.0%
3000 183
 
4.9%
1 162
 
4.3%
10000 150
 
4.0%
Other values (29) 1020
27.3%
ValueCountFrequency (%)
0 6
 
0.2%
1 162
4.3%
60 6
 
0.2%
70 9
 
0.2%
93 6
 
0.2%
100 30
 
0.8%
120 51
 
1.4%
130 3
 
0.1%
155 9
 
0.2%
160 231
6.2%
ValueCountFrequency (%)
10000 150
4.0%
7660 135
3.6%
7200 6
 
0.2%
6770 3
 
0.1%
5060 81
2.2%
5000 186
5.0%
4730 69
 
1.8%
4400 3
 
0.1%
3720 12
 
0.3%
3710 135
3.6%

도매가
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1909.8692
Minimum0
Maximum10000
Zeros6
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size33.0 KiB
2024-03-18T10:56:31.364858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile85.6
Q1285
median856
Q32825
95-th percentile7050
Maximum10000
Range10000
Interquartile range (IQR)2540

Descriptive statistics

Standard deviation2321.8573
Coefficient of variation (CV)1.2157154
Kurtosis2.3432492
Mean1909.8692
Median Absolute Deviation (MAD)688
Skewness1.6961409
Sum7133361.3
Variance5391021.5
MonotonicityNot monotonic
2024-03-18T10:56:31.501021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
571.0 615
16.5%
285.0 477
12.8%
1417.0 318
 
8.5%
148.0 231
 
6.2%
2825.0 198
 
5.3%
921.0 171
 
4.6%
4605.0 168
 
4.5%
2763.0 165
 
4.4%
1.0 162
 
4.3%
9210.0 135
 
3.6%
Other values (38) 1095
29.3%
ValueCountFrequency (%)
0.0 6
 
0.2%
1.0 162
4.3%
56.0 6
 
0.2%
65.0 9
 
0.2%
85.0 3
 
0.1%
85.6 3
 
0.1%
93.0 30
 
0.8%
112.0 51
 
1.4%
121.0 3
 
0.1%
142.0 6
 
0.2%
ValueCountFrequency (%)
10000.0 15
 
0.4%
9210.0 135
3.6%
7660.0 3
 
0.1%
7050.0 132
3.5%
6770.0 3
 
0.1%
6696.0 6
 
0.2%
5000.0 18
 
0.5%
4605.0 168
4.5%
4600.0 81
2.2%
4300.0 69
1.8%
Distinct326
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size29.3 KiB
Minimum2000-12-18 00:00:00
Maximum2022-08-05 00:00:00
2024-03-18T10:56:31.617455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:31.736742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size29.3 KiB
2022-09-06
3735 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-09-06
2nd row2022-09-06
3rd row2022-09-06
4th row2022-09-06
5th row2022-09-06

Common Values

ValueCountFrequency (%)
2022-09-06 3735
100.0%

Length

2024-03-18T10:56:31.867646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T10:56:31.959002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-09-06 3735
100.0%

Interactions

2024-03-18T10:56:29.130385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.414765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.994487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.392682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.780332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:29.212709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.482484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.078888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.461501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.849740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:29.290067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.552996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.158422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.531845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.917511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:29.367643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.621398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.240059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.603551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.986162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:29.435559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:27.898240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.318560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:28.708273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T10:56:29.055766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T10:56:32.020335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
LOT코드제작업체코드봉투단위봉투종류종료번호판매가도매가
LOT코드1.0000.2000.0000.3810.0170.1840.169
제작업체코드0.2001.0000.0000.8630.4160.4440.568
봉투단위0.0000.0001.0000.0000.0550.0000.000
봉투종류0.3810.8630.0001.0000.6610.9300.930
종료번호0.0170.4160.0550.6611.0000.0000.000
판매가0.1840.4440.0000.9300.0001.0000.964
도매가0.1690.5680.0000.9300.0000.9641.000
2024-03-18T10:56:32.134388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
봉투종류봉투단위
봉투종류1.0000.000
봉투단위0.0001.000
2024-03-18T10:56:32.240667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
LOT코드제작업체코드종료번호판매가도매가봉투단위봉투종류
LOT코드1.000-0.156-0.065-0.096-0.0950.0000.141
제작업체코드-0.1561.0000.148-0.367-0.3700.0000.634
종료번호-0.0650.1481.000-0.128-0.1290.0160.398
판매가-0.096-0.367-0.1281.0000.9990.0000.676
도매가-0.095-0.370-0.1290.9991.0000.0000.691
봉투단위0.0000.0000.0160.0000.0001.0000.000
봉투종류0.1410.6340.3980.6760.6910.0001.000

Missing values

2024-03-18T10:56:29.528350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T10:56:29.645316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

LOT코드제작업체코드봉투단위봉투종류시작번호종료번호판매가도매가LOT발생일데이터기준일자
01351박스일반용 5L1100130121.02000-12-182022-09-06
11361묶음일반용 5L15000130121.02000-12-182022-09-06
21371낱장일반용 5L1100000130121.02000-12-182022-09-06
37621박스일반용 10L1500250231.02000-12-182022-09-06
47631묶음일반용 10L125000250231.02000-12-182022-09-06
57641낱장일반용 10L1500000250231.02000-12-182022-09-06
610621박스일반용 100L140024402247.02000-12-182022-09-06
710631묶음일반용 100L1400024402247.02000-12-182022-09-06
810641낱장일반용 100L14000024402247.02000-12-182022-09-06
913921박스일반용 20L1600500462.02000-12-182022-09-06
LOT코드제작업체코드봉투단위봉투종류시작번호종료번호판매가도매가LOT발생일데이터기준일자
37252854417묶음필증 5L15000076607050.02022-02-042022-09-06
37268426717낱장필증 5L1300310285.02022-02-042022-09-06
37278426817박스음식물 5L(청라)13000310285.02022-02-042022-09-06
37288426917묶음음식물 5L(청라)1300000310285.02022-02-042022-09-06
37291875317낱장일반용 5L1600620571.02022-02-042022-09-06
37301875417박스일반용 10L16000620571.02022-02-042022-09-06
37311875517묶음일반용 10L1600000620571.02022-02-042022-09-06
37322430317낱장일반용 10L1100620571.02022-02-042022-09-06
37332430417박스일반용 20L11000620571.02022-02-042022-09-06
37342430517묶음일반용 20L1100000620571.02022-02-042022-09-06