Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 22 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 5 |
Duplicate rows (%) | 22.7% |
Total size in memory | 2.2 KiB |
Average record size in memory | 104.0 B |
Variable types
Categorical | 10 |
---|---|
DateTime | 1 |
Dataset
Description | 인천광역시 부평구 음식물류폐기물 기간별 판매현황(site코드, 전표일자, 상품코드, 확정수량, 매출단가, 스티커시작번호, 스티커최종번호, 출고단위, 출고단위수량, 묶음당매수, BOX당묶음수량) |
---|---|
Author | 인천광역시 부평구 |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15062412&srcSe=7661IVAWM27C61E190 |
site코드 has constant value "" | Constant |
전표일자 has constant value "" | Constant |
출고단위수량 has constant value "" | Constant |
Dataset has 5 (22.7%) duplicate rows | Duplicates |
BOX당묶음수량 is highly overall correlated with 상품코드 and 5 other fields | High correlation |
스티커시작번호 is highly overall correlated with 상품코드 and 5 other fields | High correlation |
상품코드 is highly overall correlated with 확정수량 and 5 other fields | High correlation |
매출단가 is highly overall correlated with 상품코드 and 5 other fields | High correlation |
출고단위 is highly overall correlated with 확정수량 | High correlation |
스티커최종번호 is highly overall correlated with 상품코드 and 5 other fields | High correlation |
묶음당매수 is highly overall correlated with 상품코드 and 5 other fields | High correlation |
확정수량 is highly overall correlated with 상품코드 and 6 other fields | High correlation |
출고단위 is highly imbalanced (73.3%) | Imbalance |
Reproduction
Analysis started | 2024-04-17 09:51:06.130183 |
---|---|
Analysis finished | 2024-04-17 09:51:06.712545 |
Duration | 0.58 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
site코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
306 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 306 |
---|---|
2nd row | 306 |
3rd row | 306 |
4th row | 306 |
5th row | 306 |
Common Values
Value | Count | Frequency (%) |
306 | 22 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
306 | 22 |
전표일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
Minimum | 2020-07-27 00:00:00 |
---|---|
Maximum | 2020-07-27 00:00:00 |
상품코드
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 22.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
5050 | |
---|---|
5005 | |
7020 | |
5010 | |
6125 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5050 |
---|---|
2nd row | 5050 |
3rd row | 5050 |
4th row | 5050 |
5th row | 5050 |
Common Values
Value | Count | Frequency (%) |
5050 | 11 | |
5005 | 5 | |
7020 | 2 | 9.1% |
5010 | 2 | 9.1% |
6125 | 2 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
5050 | 11 | |
5005 | 5 | |
7020 | 2 | 9.1% |
5010 | 2 | 9.1% |
6125 | 2 | 9.1% |
확정수량
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 18.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
10 | |
---|---|
20 | |
50 | |
200 | 1 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.0454545 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.5% |
Sample
1st row | 10 |
---|---|
2nd row | 10 |
3rd row | 10 |
4th row | 10 |
5th row | 10 |
Common Values
Value | Count | Frequency (%) |
10 | 12 | |
20 | 5 | |
50 | 4 | 18.2% |
200 | 1 | 4.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
10 | 12 | |
20 | 5 | |
50 | 4 | 18.2% |
200 | 1 | 4.5% |
매출단가
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 22.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
1699 | |
---|---|
193 | |
689 | |
363 | |
7060 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.5909091 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1699 |
---|---|
2nd row | 1699 |
3rd row | 1699 |
4th row | 1699 |
5th row | 1699 |
Common Values
Value | Count | Frequency (%) |
1699 | 11 | |
193 | 5 | |
689 | 2 | 9.1% |
363 | 2 | 9.1% |
7060 | 2 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1699 | 11 | |
193 | 5 | |
689 | 2 | 9.1% |
363 | 2 | 9.1% |
7060 | 2 | 9.1% |
스티커시작번호
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 13.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
20100000000000000 | |
---|---|
20000000000000000 | |
19100000000000000 |
Length
Max length | 17 |
---|---|
Median length | 17 |
Mean length | 17 |
Min length | 17 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20100000000000000 |
---|---|
2nd row | 20100000000000000 |
3rd row | 20100000000000000 |
4th row | 20100000000000000 |
5th row | 20100000000000000 |
Common Values
Value | Count | Frequency (%) |
20100000000000000 | 11 | |
20000000000000000 | 9 | |
19100000000000000 | 2 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20100000000000000 | 11 | |
20000000000000000 | 9 | |
19100000000000000 | 2 | 9.1% |
스티커최종번호
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 13.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
20100000000000000 | |
---|---|
20000000000000000 | |
19100000000000000 |
Length
Max length | 17 |
---|---|
Median length | 17 |
Mean length | 17 |
Min length | 17 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20100000000000000 |
---|---|
2nd row | 20100000000000000 |
3rd row | 20100000000000000 |
4th row | 20100000000000000 |
5th row | 20100000000000000 |
Common Values
Value | Count | Frequency (%) |
20100000000000000 | 11 | |
20000000000000000 | 9 | |
19100000000000000 | 2 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20100000000000000 | 11 | |
20000000000000000 | 9 | |
19100000000000000 | 2 | 9.1% |
출고단위
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 9.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
2 | |
---|---|
3 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.5% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 21 | |
3 | 1 | 4.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 21 | |
3 | 1 | 4.5% |
출고단위수량
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 22 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 22 |
묶음당매수
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 13.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
10 | |
---|---|
20 | |
50 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 10 |
---|---|
2nd row | 10 |
3rd row | 10 |
4th row | 10 |
5th row | 10 |
Common Values
Value | Count | Frequency (%) |
10 | 13 | |
20 | 5 | 22.7% |
50 | 4 | 18.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
10 | 13 | |
20 | 5 | 22.7% |
50 | 4 | 18.2% |
BOX당묶음수량
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 13.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 308.0 B |
20 | |
---|---|
50 | |
10 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 20 |
4th row | 20 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 15 | |
50 | 5 | 22.7% |
10 | 2 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 15 | |
50 | 5 | 22.7% |
10 | 2 | 9.1% |
상품코드 | 확정수량 | 매출단가 | 스티커시작번호 | 스티커최종번호 | 출고단위 | 묶음당매수 | BOX당묶음수량 | |
---|---|---|---|---|---|---|---|---|
상품코드 | 1.000 | 0.811 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
확정수량 | 0.811 | 1.000 | 0.811 | 0.634 | 0.634 | 1.000 | 1.000 | 0.649 |
매출단가 | 1.000 | 0.811 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
스티커시작번호 | 1.000 | 0.634 | 1.000 | 1.000 | 1.000 | 0.000 | 0.927 | 0.979 |
스티커최종번호 | 1.000 | 0.634 | 1.000 | 1.000 | 1.000 | 0.000 | 0.927 | 0.979 |
출고단위 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
묶음당매수 | 1.000 | 1.000 | 1.000 | 0.927 | 0.927 | 0.000 | 1.000 | 0.935 |
BOX당묶음수량 | 1.000 | 0.649 | 1.000 | 0.979 | 0.979 | 0.000 | 0.935 | 1.000 |
BOX당묶음수량 | 스티커시작번호 | 상품코드 | 매출단가 | 출고단위 | 스티커최종번호 | 묶음당매수 | 확정수량 | |
---|---|---|---|---|---|---|---|---|
BOX당묶음수량 | 1.000 | 0.820 | 0.946 | 0.946 | 0.000 | 0.820 | 0.686 | 0.652 |
스티커시작번호 | 0.820 | 1.000 | 0.946 | 0.946 | 0.000 | 1.000 | 0.669 | 0.635 |
상품코드 | 0.946 | 0.946 | 1.000 | 1.000 | 0.000 | 0.946 | 0.946 | 0.749 |
매출단가 | 0.946 | 0.946 | 1.000 | 1.000 | 0.000 | 0.946 | 0.946 | 0.749 |
출고단위 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.949 |
스티커최종번호 | 0.820 | 1.000 | 0.946 | 0.946 | 0.000 | 1.000 | 0.669 | 0.635 |
묶음당매수 | 0.686 | 0.669 | 0.946 | 0.946 | 0.000 | 0.669 | 1.000 | 0.973 |
확정수량 | 0.652 | 0.635 | 0.749 | 0.749 | 0.949 | 0.635 | 0.973 | 1.000 |
상품코드 | 확정수량 | 매출단가 | 스티커시작번호 | 스티커최종번호 | 출고단위 | 묶음당매수 | BOX당묶음수량 | |
---|---|---|---|---|---|---|---|---|
상품코드 | 1.000 | 0.749 | 1.000 | 0.946 | 0.946 | 0.000 | 0.946 | 0.946 |
확정수량 | 0.749 | 1.000 | 0.749 | 0.635 | 0.635 | 0.949 | 0.973 | 0.652 |
매출단가 | 1.000 | 0.749 | 1.000 | 0.946 | 0.946 | 0.000 | 0.946 | 0.946 |
스티커시작번호 | 0.946 | 0.635 | 0.946 | 1.000 | 1.000 | 0.000 | 0.669 | 0.820 |
스티커최종번호 | 0.946 | 0.635 | 0.946 | 1.000 | 1.000 | 0.000 | 0.669 | 0.820 |
출고단위 | 0.000 | 0.949 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
묶음당매수 | 0.946 | 0.973 | 0.946 | 0.669 | 0.669 | 0.000 | 1.000 | 0.686 |
BOX당묶음수량 | 0.946 | 0.652 | 0.946 | 0.820 | 0.820 | 0.000 | 0.686 | 1.000 |
site코드 | 전표일자 | 상품코드 | 확정수량 | 매출단가 | 스티커시작번호 | 스티커최종번호 | 출고단위 | 출고단위수량 | 묶음당매수 | BOX당묶음수량 | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
1 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
2 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
3 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
4 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
5 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
6 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
7 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
8 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
9 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 |
site코드 | 전표일자 | 상품코드 | 확정수량 | 매출단가 | 스티커시작번호 | 스티커최종번호 | 출고단위 | 출고단위수량 | 묶음당매수 | BOX당묶음수량 | |
---|---|---|---|---|---|---|---|---|---|---|---|
12 | 306 | 2020-07-27 | 5010 | 50 | 363 | 20000000000000000 | 20000000000000000 | 2 | 1 | 50 | 20 |
13 | 306 | 2020-07-27 | 5010 | 50 | 363 | 20000000000000000 | 20000000000000000 | 2 | 1 | 50 | 20 |
14 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 |
15 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 |
16 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 |
17 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 |
18 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 |
19 | 306 | 2020-07-27 | 5050 | 200 | 1699 | 20100000000000000 | 20100000000000000 | 3 | 1 | 10 | 20 |
20 | 306 | 2020-07-27 | 6125 | 10 | 7060 | 19100000000000000 | 19100000000000000 | 2 | 1 | 10 | 10 |
21 | 306 | 2020-07-27 | 6125 | 10 | 7060 | 19100000000000000 | 19100000000000000 | 2 | 1 | 10 | 10 |
Most frequently occurring
site코드 | 전표일자 | 상품코드 | 확정수량 | 매출단가 | 스티커시작번호 | 스티커최종번호 | 출고단위 | 출고단위수량 | 묶음당매수 | BOX당묶음수량 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2 | 306 | 2020-07-27 | 5050 | 10 | 1699 | 20100000000000000 | 20100000000000000 | 2 | 1 | 10 | 20 | 10 |
0 | 306 | 2020-07-27 | 5005 | 20 | 193 | 20000000000000000 | 20000000000000000 | 2 | 1 | 20 | 50 | 5 |
1 | 306 | 2020-07-27 | 5010 | 50 | 363 | 20000000000000000 | 20000000000000000 | 2 | 1 | 50 | 20 | 2 |
3 | 306 | 2020-07-27 | 6125 | 10 | 7060 | 19100000000000000 | 19100000000000000 | 2 | 1 | 10 | 10 | 2 |
4 | 306 | 2020-07-27 | 7020 | 50 | 689 | 20000000000000000 | 20000000000000000 | 2 | 1 | 50 | 20 | 2 |