Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 898.4 KiB |
Average record size in memory | 92.0 B |
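The average record size follows from the two figures above: total memory divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative, not part of the report):

```python
# Values copied from the dataset statistics table.
total_kib = 898.4         # total size in memory, in KiB
n_observations = 10_000   # number of rows

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")  # prints "92.0 B"
```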
Variable types
Categorical | 3 |
---|---|
Numeric | 4 |
Boolean | 1 |
DateTime | 2 |
Dataset
Description | N/A |
---|---|
Author | 인천시설공단 (Incheon Facilities Corporation) |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15119158&srcSe=7661IVAWM27C61E190 |
취소여부 has constant value "False" | Constant |
환불인 has constant value "***" | Constant |
확인자 has constant value "***" | Constant |
지불금액 is highly overall correlated with 받은금액 | High correlation |
받은금액 is highly overall correlated with 지불금액 | High correlation |
코드명 is highly imbalanced (52.3%) | Imbalance |
환불금액 is highly skewed (γ1 = 38.96158009) | Skewed |
영수증번호 has unique values | Unique |
환불금액 has 9971 (99.7%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-18 03:39:03.582585 |
---|---|
Analysis finished | 2024-03-18 03:39:06.996800 |
Duration | 3.41 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
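A report like this one can typically be regenerated with ydata-profiling's `ProfileReport` class. The sketch below is an assumption about the workflow, not the author's actual script: the function name `build_profile`, the CSV path, the encoding default, and the report title are all hypothetical, and the original `config.json` is not applied here.

```python
def build_profile(csv_path: str, output_html: str = "report.html",
                  encoding: str = "utf-8") -> None:
    """Profile a CSV export and write an HTML report.

    Requires the third-party packages pandas and ydata-profiling.
    """
    # Imported lazily so this sketch can be loaded without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the dataset was downloaded as a CSV file; adjust `encoding`
    # if the export is not UTF-8.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Hypothetical title; default profiling settings are used.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(output_html)
```

Calling `build_profile("data.csv")` with a local copy of the dataset would write `report.html` to the working directory.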
코드명 (code name)
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
I_FMC4 | 7775 |
---|---|
I_FMC5 | 1176 |
I_FMC1 | 664 |
I_FMC3 | 331 |
I_FMC2 | 54 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | I_FMC4 |
---|---|
2nd row | I_FMC4 |
3rd row | I_FMC4 |
4th row | I_FMC4 |
5th row | I_FMC4 |
Common Values
Value | Count | Frequency (%) |
I_FMC4 | 7775 | 77.8% |
I_FMC5 | 1176 | 11.8% |
I_FMC1 | 664 | 6.6% |
I_FMC3 | 331 | 3.3% |
I_FMC2 | 54 | 0.5% |
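The 52.3% imbalance figure reported for 코드명 is consistent with a normalized-entropy score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below assumes that definition (the function name `imbalance_score` is illustrative) and applies it to the counts in the table above; the result lands at roughly 52%, in line with the alert.

```python
import math

# Value counts for 코드명, copied from the Common Values table.
counts = {"I_FMC4": 7775, "I_FMC5": 1176, "I_FMC1": 664, "I_FMC3": 331, "I_FMC2": 54}

def imbalance_score(value_counts):
    """Return 1 - H / log2(k): 0 for a uniform column, near 1 for one dominant class."""
    total = sum(value_counts)
    k = len(value_counts)
    if k < 2:
        return 0.0  # a single class has no distribution to compare against
    # Shannon entropy in bits of the observed class frequencies.
    entropy = -sum((c / total) * math.log2(c / total) for c in value_counts if c > 0)
    return 1.0 - entropy / math.log2(k)

score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```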
영수증번호 (receipt number)
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 486539.25 |
Minimum | 14 |
---|---|
Maximum | 641113 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 14 |
---|---|
5-th percentile | 376026.8 |
Q1 | 422417.75 |
median | 484992 |
Q3 | 549735.75 |
95-th percentile | 610092.6 |
Maximum | 641113 |
Range | 641099 |
Interquartile range (IQR) | 127318 |
Descriptive statistics
Standard deviation | 76627.833 |
---|---|
Coefficient of variation (CV) | 0.15749569 |
Kurtosis | 0.4668799 |
Mean | 486539.25 |
Median Absolute Deviation (MAD) | 63511.5 |
Skewness | -0.15900367 |
Sum | 4.8653925 × 10⁹ |
Variance | 5.8718248 × 10⁹ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
560335 | 1 | < 0.1% |
536617 | 1 | < 0.1% |
388716 | 1 | < 0.1% |
521276 | 1 | < 0.1% |
424994 | 1 | < 0.1% |
397486 | 1 | < 0.1% |
429803 | 1 | < 0.1% |
611096 | 1 | < 0.1% |
493170 | 1 | < 0.1% |
482910 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Value | Count | Frequency (%) |
14 | 1 | < 0.1% |
22 | 1 | < 0.1% |
49 | 1 | < 0.1% |
87 | 1 | < 0.1% |
95 | 1 | < 0.1% |
101 | 1 | < 0.1% |
123 | 1 | < 0.1% |
177 | 1 | < 0.1% |
181 | 1 | < 0.1% |
204 | 1 | < 0.1% |
Value | Count | Frequency (%) |
641113 | 1 | < 0.1% |
641108 | 1 | < 0.1% |
641101 | 1 | < 0.1% |
641082 | 1 | < 0.1% |
641069 | 1 | < 0.1% |
641051 | 1 | < 0.1% |
641028 | 1 | < 0.1% |
640529 | 1 | < 0.1% |
640522 | 1 | < 0.1% |
639734 | 1 | < 0.1% |
지불금액 (amount paid)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 119 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5749.7609 |
Minimum | 19 |
---|---|
Maximum | 409100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 19 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 2000 |
Q3 | 2000 |
95-th percentile | 30000 |
Maximum | 409100 |
Range | 409081 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 11580.268 |
---|---|
Coefficient of variation (CV) | 2.0140433 |
Kurtosis | 178.89823 |
Mean | 5749.7609 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.0793135 |
Sum | 57497609 |
Variance | 1.341026 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 7486 | 74.9% |
4000 | 609 | 6.1% |
8000 | 195 | 1.9% |
3000 | 177 | 1.8% |
25000 | 169 | 1.7% |
1500 | 138 | 1.4% |
20000 | 83 | 0.8% |
6000 | 72 | 0.7% |
10000 | 67 | 0.7% |
40000 | 64 | 0.6% |
Other values (109) | 940 | 9.4% |
Value | Count | Frequency (%) |
19 | 1 | < 0.1% |
1000 | 6 | 0.1% |
1500 | 138 | 1.4% |
2000 | 7486 | 74.9% |
2300 | 1 | < 0.1% |
3000 | 177 | 1.8% |
3300 | 8 | 0.1% |
3500 | 6 | 0.1% |
4000 | 609 | 6.1% |
4500 | 26 | 0.3% |
Value | Count | Frequency (%) |
409100 | 1 | < 0.1% |
245000 | 1 | < 0.1% |
147000 | 1 | < 0.1% |
144000 | 1 | < 0.1% |
139200 | 1 | < 0.1% |
110000 | 1 | < 0.1% |
92000 | 1 | < 0.1% |
90000 | 2 | < 0.1% |
88200 | 1 | < 0.1% |
83200 | 1 | < 0.1% |
받은금액 (amount received)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 119 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5768.1609 |
Minimum | 19 |
---|---|
Maximum | 409100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 19 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 2000 |
Q3 | 2000 |
95-th percentile | 30000 |
Maximum | 409100 |
Range | 409081 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 11624.451 |
---|---|
Coefficient of variation (CV) | 2.0152786 |
Kurtosis | 176.31848 |
Mean | 5768.1609 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.0195861 |
Sum | 57681609 |
Variance | 1.3512786 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 7486 | 74.9% |
4000 | 609 | 6.1% |
8000 | 192 | 1.9% |
3000 | 177 | 1.8% |
25000 | 163 | 1.6% |
1500 | 138 | 1.4% |
20000 | 83 | 0.8% |
6000 | 72 | 0.7% |
10000 | 71 | 0.7% |
40000 | 64 | 0.6% |
Other values (109) | 945 | 9.4% |
Value | Count | Frequency (%) |
19 | 1 | < 0.1% |
1000 | 6 | 0.1% |
1500 | 138 | 1.4% |
2000 | 7486 | 74.9% |
2300 | 1 | < 0.1% |
3000 | 177 | 1.8% |
3300 | 5 | 0.1% |
3500 | 7 | 0.1% |
4000 | 609 | 6.1% |
4500 | 26 | 0.3% |
Value | Count | Frequency (%) |
409100 | 1 | < 0.1% |
245000 | 1 | < 0.1% |
147000 | 1 | < 0.1% |
144000 | 1 | < 0.1% |
139200 | 1 | < 0.1% |
114400 | 1 | < 0.1% |
92000 | 1 | < 0.1% |
90000 | 2 | < 0.1% |
88200 | 1 | < 0.1% |
83200 | 1 | < 0.1% |
환불금액 (refund amount)
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18.5 |
Minimum | -5000 |
---|---|
Maximum | 25000 |
Zeros | 9971 |
Zeros (%) | 99.7% |
Negative | 1 |
Negative (%) | < 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -5000 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 25000 |
Range | 30000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 508.52887 |
---|---|
Coefficient of variation (CV) | 27.488047 |
Kurtosis | 1795.7115 |
Mean | 18.5 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 38.96158 |
Sum | 185000 |
Variance | 258601.61 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9971 | 99.7% |
5000 | 5 | 0.1% |
7000 | 5 | 0.1% |
2000 | 4 | < 0.1% |
25000 | 3 | < 0.1% |
200 | 2 | < 0.1% |
4000 | 2 | < 0.1% |
6500 | 1 | < 0.1% |
700 | 1 | < 0.1% |
3000 | 1 | < 0.1% |
Other values (5) | 5 | 0.1% |
Value | Count | Frequency (%) |
-5000 | 1 | < 0.1% |
0 | 9971 | 99.7% |
200 | 2 | < 0.1% |
700 | 1 | < 0.1% |
2000 | 4 | < 0.1% |
3000 | 1 | < 0.1% |
4000 | 2 | < 0.1% |
4400 | 1 | < 0.1% |
5000 | 5 | 0.1% |
6000 | 1 | < 0.1% |
Value | Count | Frequency (%) |
25000 | 3 | < 0.1% |
10000 | 1 | < 0.1% |
8000 | 1 | < 0.1% |
7000 | 5 | 0.1% |
6500 | 1 | < 0.1% |
6000 | 1 | < 0.1% |
5000 | 5 | 0.1% |
4400 | 1 | < 0.1% |
4000 | 2 | < 0.1% |
3000 | 1 | < 0.1% |
취소여부 (cancellation status)
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False | 10000 |
---|---|
Value | Count | Frequency (%) |
False | 10000 | 100.0% |
환불날짜 (refund date)
Date
Distinct | 304 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2010-03-03 00:00:00 |
---|---|
Maximum | 2011-02-09 00:00:00 |
환불인 (refund recipient)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
*** | 10000 |
---|---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *** |
---|---|
2nd row | *** |
3rd row | *** |
4th row | *** |
5th row | *** |
Common Values
Value | Count | Frequency (%) |
*** | 10000 | 100.0% |
처리날짜 (processing date)
Date
Distinct | 304 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2010-03-03 00:00:00 |
---|---|
Maximum | 2011-02-09 00:00:00 |
확인자 (confirmer)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
*** | 10000 |
---|---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *** |
---|---|
2nd row | *** |
3rd row | *** |
4th row | *** |
5th row | *** |
Common Values
Value | Count | Frequency (%) |
*** | 10000 | 100.0% |
Variable | 코드명 | 영수증번호 | 지불금액 | 받은금액 | 환불금액 |
---|---|---|---|---|---|
코드명 | 1.000 | 0.210 | 0.311 | 0.309 | 0.138 |
영수증번호 | 0.210 | 1.000 | 0.066 | 0.072 | 0.181 |
지불금액 | 0.311 | 0.066 | 1.000 | 1.000 | 0.088 |
받은금액 | 0.309 | 0.072 | 1.000 | 1.000 | 0.115 |
환불금액 | 0.138 | 0.181 | 0.088 | 0.115 | 1.000 |
Variable | 영수증번호 | 지불금액 | 받은금액 | 환불금액 | 코드명 |
---|---|---|---|---|---|
영수증번호 | 1.000 | -0.123 | -0.124 | -0.061 | 0.144 |
지불금액 | -0.123 | 1.000 | 1.000 | 0.089 | 0.216 |
받은금액 | -0.124 | 1.000 | 1.000 | 0.099 | 0.215 |
환불금액 | -0.061 | 0.089 | 0.099 | 1.000 | 0.054 |
코드명 | 0.144 | 0.216 | 0.215 | 0.054 | 1.000 |
Row index | 코드명 | 영수증번호 | 지불금액 | 받은금액 | 환불금액 | 취소여부 | 환불날짜 | 환불인 | 처리날짜 | 확인자 |
---|---|---|---|---|---|---|---|---|---|---|
77282 | I_FMC4 | 560335 | 8000 | 4000 | 0 | N | 2010-11-25 | *** | 2010-11-25 | *** |
3025 | I_FMC4 | 368014 | 25000 | 25000 | 0 | N | 2010-04-21 | *** | 2010-04-21 | *** |
8091 | I_FMC4 | 380383 | 2000 | 2000 | 0 | N | 2010-05-13 | *** | 2010-05-13 | *** |
65174 | I_FMC4 | 525498 | 2000 | 2000 | 0 | N | 2010-10-20 | *** | 2010-10-20 | *** |
97251 | I_FMC4 | 614484 | 2000 | 2000 | 0 | N | 2011-01-18 | *** | 2011-01-18 | *** |
50473 | I_FMC4 | 484690 | 2000 | 2000 | 0 | N | 2010-09-06 | *** | 2010-09-06 | *** |
5860 | I_FMC4 | 378045 | 23000 | 23000 | 0 | N | 2010-05-06 | *** | 2010-05-06 | *** |
1663 | I_FMC4 | 366931 | 2000 | 2000 | 0 | N | 2010-04-20 | *** | 2010-04-20 | *** |
47967 | I_FMC4 | 470290 | 2000 | 2000 | 0 | N | 2010-08-26 | *** | 2010-08-26 | *** |
18117 | I_FMC4 | 406217 | 2000 | 2000 | 0 | N | 2010-06-22 | *** | 2010-06-22 | *** |
Row index | 코드명 | 영수증번호 | 지불금액 | 받은금액 | 환불금액 | 취소여부 | 환불날짜 | 환불인 | 처리날짜 | 확인자 |
---|---|---|---|---|---|---|---|---|---|---|
51613 | I_FMC4 | 489329 | 2000 | 2000 | 0 | N | 2010-09-13 | *** | 2010-09-13 | *** |
29334 | I_FMC4 | 449098 | 2000 | 2000 | 0 | N | 2010-08-09 | *** | 2010-08-09 | *** |
95272 | I_FMC4 | 609109 | 2000 | 2000 | 0 | N | 2011-01-10 | *** | 2011-01-10 | *** |
5014 | I_FMC4 | 377284 | 2000 | 2000 | 0 | N | 2010-05-04 | *** | 2010-05-04 | *** |
91398 | I_FMC4 | 610299 | 2000 | 2000 | 0 | N | 2011-01-12 | *** | 2011-01-12 | *** |
40247 | I_FMC4 | 450510 | 2000 | 2000 | 0 | N | 2010-08-11 | *** | 2010-08-11 | *** |
23897 | I_FMC4 | 422555 | 2000 | 2000 | 0 | N | 2010-07-12 | *** | 2010-07-12 | *** |
78130 | I_FMC4 | 558429 | 2000 | 2000 | 0 | N | 2010-11-24 | *** | 2010-11-24 | *** |
13440 | I_FMC4 | 408190 | 2000 | 2000 | 0 | N | 2010-06-24 | *** | 2010-06-24 | *** |
30286 | I_FMC4 | 428415 | 2000 | 2000 | 0 | N | 2010-07-20 | *** | 2010-07-20 | *** |