Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 2815 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 408 |
Duplicate rows (%) | 14.5% |
Total size in memory | 231.0 KiB |
Average record size in memory | 84.0 B |
Variable types
DateTime | 3 |
---|---|
Numeric | 3 |
Categorical | 3 |
Text | 1 |
Dataset
Description | 충청북도 충주시 대형폐기물인터넷 배출신고처리시스템 환불처리내역에 대한 정보(환불처리일, 환불접수일, 환불금액,환불처리사유, 환불결제수단, 금액, 수량, 단가, 품목코드, 데이터기준일자) |
---|---|
URL | https://www.data.go.kr/data/15122270/fileData.do |
수량 has constant value "" | Constant |
데이터기준일자 has constant value "" | Constant |
Dataset has 408 (14.5%) duplicate rows | Duplicates |
금액 is highly overall correlated with 단가 | High correlation |
단가 is highly overall correlated with 금액 | High correlation |
환불처리사유 is highly overall correlated with 환불결제수단 | High correlation |
환불결제수단 is highly overall correlated with 환불처리사유 | High correlation |
환불처리사유 is highly imbalanced (91.1%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 07:39:17.353990 |
---|---|
Analysis finished | 2023-12-12 07:39:19.206408 |
Duration | 1.85 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
환불처리일자
Date
Distinct | 234 |
---|---|
Distinct (%) | 8.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
Minimum | 2023-01-01 00:00:00 |
---|---|
Maximum | 2023-09-01 00:00:00 |
환불접수일자
Date
Distinct | 105 |
---|---|
Distinct (%) | 3.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
Minimum | 2023-01-02 00:00:00 |
---|---|
Maximum | 2023-09-04 00:00:00 |
환불금액
Real number (ℝ)
Distinct | 58 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23198.934 |
Minimum | 1000 |
---|---|
Maximum | 168000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 24.9 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 2000 |
Q1 | 5000 |
median | 11000 |
Q3 | 24000 |
95-th percentile | 94000 |
Maximum | 168000 |
Range | 167000 |
Interquartile range (IQR) | 19000 |
Descriptive statistics
Standard deviation | 32296.127 |
---|---|
Coefficient of variation (CV) | 1.3921384 |
Kurtosis | 7.067053 |
Mean | 23198.934 |
Median Absolute Deviation (MAD) | 7000 |
Skewness | 2.5957792 |
Sum | 65305000 |
Variance | 1.0430398 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6000 | 203 | 7.2% |
2000 | 202 | 7.2% |
4000 | 188 | 6.7% |
3000 | 179 | 6.4% |
8000 | 175 | 6.2% |
5000 | 157 | 5.6% |
10000 | 153 | 5.4% |
12000 | 132 | 4.7% |
14000 | 83 | 2.9% |
9000 | 71 | 2.5% |
Other values (48) | 1272 |
Value | Count | Frequency (%) |
1000 | 2 | 0.1% |
2000 | 202 | |
3000 | 179 | |
4000 | 188 | |
5000 | 157 | |
5500 | 1 | < 0.1% |
6000 | 203 | |
7000 | 69 | 2.5% |
8000 | 175 | |
9000 | 71 | 2.5% |
Value | Count | Frequency (%) |
168000 | 55 | |
119000 | 28 | |
101000 | 18 | 0.6% |
95000 | 25 | |
94000 | 35 | |
92000 | 28 | |
82000 | 25 | |
80000 | 17 | 0.6% |
79000 | 20 | 0.7% |
78000 | 23 |
환불처리사유
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 41 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
미수거 | |
---|---|
클린센터에 민원인이 직접 입고 | 15 |
없어짐 | 10 |
환불 | 9 |
재접수 | 8 |
Other values (36) | 84 |
Length
Max length | 39 |
---|---|
Median length | 3 |
Mean length | 3.19254 |
Min length | 2 |
Unique
Unique | 16 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | 미수거 |
---|---|
2nd row | 미수거 |
3rd row | 미수거 |
4th row | 미수거 |
5th row | 미수거 |
Common Values
Value | Count | Frequency (%) |
미수거 | 2689 | |
클린센터에 민원인이 직접 입고 | 15 | 0.5% |
없어짐 | 10 | 0.4% |
환불 | 9 | 0.3% |
재접수 | 8 | 0.3% |
다시 접수처리 | 7 | 0.2% |
변심 | 7 | 0.2% |
착오송금 | 6 | 0.2% |
제접수함 | 5 | 0.2% |
금가면처리 | 5 | 0.2% |
Other values (31) | 54 | 1.9% |
Length
Value | Count | Frequency (%) |
미수거 | 2693 | |
민원인이 | 16 | 0.6% |
직접 | 16 | 0.6% |
입고 | 15 | 0.5% |
클린센터에 | 15 | 0.5% |
없어짐 | 11 | 0.4% |
재접수 | 9 | 0.3% |
환불 | 9 | 0.3% |
다시 | 8 | 0.3% |
접수처리 | 7 | 0.2% |
Other values (52) | 110 | 3.8% |
환불결제수단
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
카드 | |
---|---|
계좌이체 | |
방문 | 17 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.8554174 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 카드 |
---|---|
2nd row | 카드 |
3rd row | 카드 |
4th row | 카드 |
5th row | 카드 |
Common Values
Value | Count | Frequency (%) |
카드 | 1594 | |
계좌이체 | 1204 | |
방문 | 17 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
카드 | 1594 | |
계좌이체 | 1204 | |
방문 | 17 | 0.6% |
금액
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3665.0089 |
Minimum | 1000 |
---|---|
Maximum | 30000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 24.9 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 3000 |
Q3 | 4000 |
95-th percentile | 8000 |
Maximum | 30000 |
Range | 29000 |
Interquartile range (IQR) | 2000 |
Descriptive statistics
Standard deviation | 2403.5114 |
---|---|
Coefficient of variation (CV) | 0.6557996 |
Kurtosis | 17.052494 |
Mean | 3665.0089 |
Median Absolute Deviation (MAD) | 1000 |
Skewness | 2.9968206 |
Sum | 10317000 |
Variance | 5776866.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 1092 | |
3000 | 657 | |
4000 | 344 | 12.2% |
5000 | 250 | 8.9% |
6000 | 171 | 6.1% |
8000 | 106 | 3.8% |
10000 | 96 | 3.4% |
1000 | 40 | 1.4% |
7000 | 22 | 0.8% |
9000 | 12 | 0.4% |
Other values (5) | 25 | 0.9% |
Value | Count | Frequency (%) |
1000 | 40 | 1.4% |
2000 | 1092 | |
3000 | 657 | |
3500 | 1 | < 0.1% |
4000 | 344 | 12.2% |
5000 | 250 | 8.9% |
5500 | 3 | 0.1% |
6000 | 171 | 6.1% |
7000 | 22 | 0.8% |
8000 | 106 | 3.8% |
Value | Count | Frequency (%) |
30000 | 2 | 0.1% |
20000 | 7 | 0.2% |
15000 | 12 | 0.4% |
10000 | 96 | 3.4% |
9000 | 12 | 0.4% |
8000 | 106 | |
7000 | 22 | 0.8% |
6000 | 171 | |
5500 | 3 | 0.1% |
5000 | 250 |
수량
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 2815 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 2815 |
단가
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3665.0089 |
Minimum | 1000 |
---|---|
Maximum | 30000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 24.9 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 3000 |
Q3 | 4000 |
95-th percentile | 8000 |
Maximum | 30000 |
Range | 29000 |
Interquartile range (IQR) | 2000 |
Descriptive statistics
Standard deviation | 2403.5114 |
---|---|
Coefficient of variation (CV) | 0.6557996 |
Kurtosis | 17.052494 |
Mean | 3665.0089 |
Median Absolute Deviation (MAD) | 1000 |
Skewness | 2.9968206 |
Sum | 10317000 |
Variance | 5776866.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 1092 | |
3000 | 657 | |
4000 | 344 | 12.2% |
5000 | 250 | 8.9% |
6000 | 171 | 6.1% |
8000 | 106 | 3.8% |
10000 | 96 | 3.4% |
1000 | 40 | 1.4% |
7000 | 22 | 0.8% |
9000 | 12 | 0.4% |
Other values (5) | 25 | 0.9% |
Value | Count | Frequency (%) |
1000 | 40 | 1.4% |
2000 | 1092 | |
3000 | 657 | |
3500 | 1 | < 0.1% |
4000 | 344 | 12.2% |
5000 | 250 | 8.9% |
5500 | 3 | 0.1% |
6000 | 171 | 6.1% |
7000 | 22 | 0.8% |
8000 | 106 | 3.8% |
Value | Count | Frequency (%) |
30000 | 2 | 0.1% |
20000 | 7 | 0.2% |
15000 | 12 | 0.4% |
10000 | 96 | 3.4% |
9000 | 12 | 0.4% |
8000 | 106 | |
7000 | 22 | 0.8% |
6000 | 171 | |
5500 | 3 | 0.1% |
5000 | 250 |
품목코드
Text
Distinct | 217 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
Value | Count | Frequency (%) |
2020055 | 166 | 5.9% |
2020053 | 119 | 4.2% |
2090115 | 110 | 3.9% |
2020054 | 93 | 3.3% |
2010012 | 70 | 2.5% |
2010001 | 68 | 2.4% |
2020074 | 63 | 2.2% |
2090071 | 52 | 1.8% |
2020002 | 51 | 1.8% |
2020070 | 48 | 1.7% |
Other values (207) | 1975 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8610 | |
2 | 4741 | |
1 | 1228 | 6.4% |
9 | 1089 | 5.7% |
5 | 889 | 4.6% |
7 | 624 | 3.2% |
3 | 601 | 3.1% |
8 | 558 | 2.9% |
4 | 502 | 2.6% |
6 | 376 | 2.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 19218 | |
Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8610 | |
2 | 4741 | |
1 | 1228 | 6.4% |
9 | 1089 | 5.7% |
5 | 889 | 4.6% |
7 | 624 | 3.2% |
3 | 601 | 3.1% |
8 | 558 | 2.9% |
4 | 502 | 2.6% |
6 | 376 | 2.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 19222 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8610 | |
2 | 4741 | |
1 | 1228 | 6.4% |
9 | 1089 | 5.7% |
5 | 889 | 4.6% |
7 | 624 | 3.2% |
3 | 601 | 3.1% |
8 | 558 | 2.9% |
4 | 502 | 2.6% |
6 | 376 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19222 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8610 | |
2 | 4741 | |
1 | 1228 | 6.4% |
9 | 1089 | 5.7% |
5 | 889 | 4.6% |
7 | 624 | 3.2% |
3 | 601 | 3.1% |
8 | 558 | 2.9% |
4 | 502 | 2.6% |
6 | 376 | 2.0% |
데이터기준일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 22.1 KiB |
Minimum | 2023-08-31 00:00:00 |
---|---|
Maximum | 2023-08-31 00:00:00 |
환불금액 | 환불처리사유 | 환불결제수단 | 금액 | 단가 | |
---|---|---|---|---|---|
환불금액 | 1.000 | 0.482 | 0.425 | 0.170 | 0.170 |
환불처리사유 | 0.482 | 1.000 | 0.862 | 0.706 | 0.706 |
환불결제수단 | 0.425 | 0.862 | 1.000 | 0.029 | 0.029 |
금액 | 0.170 | 0.706 | 0.029 | 1.000 | 1.000 |
단가 | 0.170 | 0.706 | 0.029 | 1.000 | 1.000 |
환불결제수단 | 환불처리사유 | |
---|---|---|
환불결제수단 | 1.000 | 0.665 |
환불처리사유 | 0.665 | 1.000 |
환불금액 | 금액 | 단가 | 환불처리사유 | 환불결제수단 | |
---|---|---|---|---|---|
환불금액 | 1.000 | 0.129 | 0.129 | 0.204 | 0.299 |
금액 | 0.129 | 1.000 | 1.000 | 0.348 | 0.018 |
단가 | 0.129 | 1.000 | 1.000 | 0.348 | 0.018 |
환불처리사유 | 0.204 | 0.348 | 0.348 | 1.000 | 0.665 |
환불결제수단 | 0.299 | 0.018 | 0.018 | 0.665 | 1.000 |
환불처리일자 | 환불접수일자 | 환불금액 | 환불처리사유 | 환불결제수단 | 금액 | 수량 | 단가 | 품목코드 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|
0 | 2023-01-01 | 2023-01-02 | 7000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2010012 | 2023-08-31 |
1 | 2023-01-01 | 2023-01-02 | 7000 | 미수거 | 카드 | 3000 | 1 | 3000 | 2010038 | 2023-08-31 |
2 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 카드 | 8000 | 1 | 8000 | 2020057 | 2023-08-31 |
3 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 카드 | 8000 | 1 | 8000 | 2020057 | 2023-08-31 |
4 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 카드 | 9000 | 1 | 9000 | 2020065 | 2023-08-31 |
5 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 카드 | 4000 | 1 | 4000 | 2010009 | 2023-08-31 |
6 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 계좌이체 | 2000 | 1 | 2000 | 2020048 | 2023-08-31 |
7 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 계좌이체 | 2000 | 1 | 2000 | 2020055 | 2023-08-31 |
8 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 계좌이체 | 3000 | 1 | 3000 | 2020060 | 2023-08-31 |
9 | 2023-01-01 | 2023-01-02 | 29000 | 미수거 | 계좌이체 | 4000 | 1 | 4000 | 2020037 | 2023-08-31 |
환불처리일자 | 환불접수일자 | 환불금액 | 환불처리사유 | 환불결제수단 | 금액 | 수량 | 단가 | 품목코드 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|
2805 | 2023-08-31 | 2023-09-03 | 21000 | 미수거 | 카드 | 5000 | 1 | 5000 | 2010043 | 2023-08-31 |
2806 | 2023-08-31 | 2023-09-03 | 21000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2010012 | 2023-08-31 |
2807 | 2023-08-31 | 2023-09-03 | 21000 | 미수거 | 카드 | 3000 | 1 | 3000 | 2010004 | 2023-08-31 |
2808 | 2023-08-31 | 2023-09-03 | 21000 | 미수거 | 카드 | 4000 | 1 | 4000 | 2090005 | 2023-08-31 |
2809 | 2023-08-31 | 2023-09-03 | 21000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 |
2810 | 2023-08-31 | 2023-09-04 | 8000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 |
2811 | 2023-08-31 | 2023-09-04 | 8000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 |
2812 | 2023-08-31 | 2023-09-04 | 8000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 |
2813 | 2023-08-31 | 2023-09-04 | 8000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 |
2814 | 2023-09-01 | 2023-09-03 | 2000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2020068 | 2023-08-31 |
Most frequently occurring
환불처리일자 | 환불접수일자 | 환불금액 | 환불처리사유 | 환불결제수단 | 금액 | 수량 | 단가 | 품목코드 | 데이터기준일자 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|
337 | 2023-07-31 | 2023-08-07 | 82000 | 미수거 | 카드 | 3000 | 1 | 3000 | 2020053 | 2023-08-31 | 21 |
58 | 2023-02-17 | 2023-02-19 | 45000 | 클린센터에 민원인이 직접 입고 | 방문 | 3000 | 1 | 3000 | 2090044 | 2023-08-31 | 15 |
80 | 2023-02-27 | 2023-03-02 | 75000 | 미수거 | 계좌이체 | 5000 | 1 | 5000 | 2090063 | 2023-08-31 | 14 |
267 | 2023-06-15 | 2023-06-18 | 58000 | 미수거 | 계좌이체 | 4000 | 1 | 4000 | 2020009 | 2023-08-31 | 14 |
381 | 2023-08-16 | 2023-08-20 | 28000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 | 14 |
357 | 2023-08-06 | 2023-08-07 | 94000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2020054 | 2023-08-31 | 11 |
142 | 2023-04-04 | 2023-04-06 | 34000 | 미수거 | 계좌이체 | 2000 | 1 | 2000 | 2090053 | 2023-08-31 | 10 |
174 | 2023-04-28 | 2023-04-30 | 20000 | 미수거 | 계좌이체 | 2000 | 1 | 2000 | 2090115 | 2023-08-31 | 10 |
319 | 2023-07-16 | 2023-07-23 | 119000 | 미수거 | 계좌이체 | 2000 | 1 | 2000 | 2020002 | 2023-08-31 | 10 |
331 | 2023-07-24 | 2023-07-30 | 20000 | 미수거 | 카드 | 2000 | 1 | 2000 | 2020055 | 2023-08-31 | 10 |