Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory898.4 KiB
Average record size in memory92.0 B

Variable types

Categorical3
Numeric4
Boolean1
DateTime2

Alerts

취소여부 has constant value ""Constant
환불인 has constant value ""Constant
확인자 has constant value ""Constant
지불금액 is highly overall correlated with 받은금액High correlation
받은금액 is highly overall correlated with 지불금액High correlation
코드명 is highly imbalanced (52.3%)Imbalance
환불금액 is highly skewed (γ1 = 38.96158009)Skewed
영수증번호 has unique valuesUnique
환불금액 has 9971 (99.7%) zerosZeros

Reproduction

Analysis started2024-03-18 03:39:03.582585
Analysis finished2024-03-18 03:39:06.996800
Duration3.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

코드명
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I_FMC4
7775 
I_FMC5
1176 
I_FMC1
 
664
I_FMC3
 
331
I_FMC2
 
54

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI_FMC4
2nd rowI_FMC4
3rd rowI_FMC4
4th rowI_FMC4
5th rowI_FMC4

Common Values

ValueCountFrequency (%)
I_FMC4 7775
77.8%
I_FMC5 1176
 
11.8%
I_FMC1 664
 
6.6%
I_FMC3 331
 
3.3%
I_FMC2 54
 
0.5%

Length

2024-03-18T12:39:07.061733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:39:07.151266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i_fmc4 7775
77.8%
i_fmc5 1176
 
11.8%
i_fmc1 664
 
6.6%
i_fmc3 331
 
3.3%
i_fmc2 54
 
0.5%

영수증번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean486539.25
Minimum14
Maximum641113
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-18T12:39:07.245329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14
5-th percentile376026.8
Q1422417.75
median484992
Q3549735.75
95-th percentile610092.6
Maximum641113
Range641099
Interquartile range (IQR)127318

Descriptive statistics

Standard deviation76627.833
Coefficient of variation (CV)0.15749569
Kurtosis0.4668799
Mean486539.25
Median Absolute Deviation (MAD)63511.5
Skewness-0.15900367
Sum4.8653925 × 109
Variance5.8718248 × 109
MonotonicityNot monotonic
2024-03-18T12:39:07.353485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
560335 1
 
< 0.1%
536617 1
 
< 0.1%
388716 1
 
< 0.1%
521276 1
 
< 0.1%
424994 1
 
< 0.1%
397486 1
 
< 0.1%
429803 1
 
< 0.1%
611096 1
 
< 0.1%
493170 1
 
< 0.1%
482910 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
14 1
< 0.1%
22 1
< 0.1%
49 1
< 0.1%
87 1
< 0.1%
95 1
< 0.1%
101 1
< 0.1%
123 1
< 0.1%
177 1
< 0.1%
181 1
< 0.1%
204 1
< 0.1%
ValueCountFrequency (%)
641113 1
< 0.1%
641108 1
< 0.1%
641101 1
< 0.1%
641082 1
< 0.1%
641069 1
< 0.1%
641051 1
< 0.1%
641028 1
< 0.1%
640529 1
< 0.1%
640522 1
< 0.1%
639734 1
< 0.1%

지불금액
Real number (ℝ)

HIGH CORRELATION 

Distinct119
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5749.7609
Minimum19
Maximum409100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-18T12:39:07.462077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19
5-th percentile2000
Q12000
median2000
Q32000
95-th percentile30000
Maximum409100
Range409081
Interquartile range (IQR)0

Descriptive statistics

Standard deviation11580.268
Coefficient of variation (CV)2.0140433
Kurtosis178.89823
Mean5749.7609
Median Absolute Deviation (MAD)0
Skewness8.0793135
Sum57497609
Variance1.341026 × 108
MonotonicityNot monotonic
2024-03-18T12:39:07.585119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 7486
74.9%
4000 609
 
6.1%
8000 195
 
1.9%
3000 177
 
1.8%
25000 169
 
1.7%
1500 138
 
1.4%
20000 83
 
0.8%
6000 72
 
0.7%
10000 67
 
0.7%
40000 64
 
0.6%
Other values (109) 940
 
9.4%
ValueCountFrequency (%)
19 1
 
< 0.1%
1000 6
 
0.1%
1500 138
 
1.4%
2000 7486
74.9%
2300 1
 
< 0.1%
3000 177
 
1.8%
3300 8
 
0.1%
3500 6
 
0.1%
4000 609
 
6.1%
4500 26
 
0.3%
ValueCountFrequency (%)
409100 1
< 0.1%
245000 1
< 0.1%
147000 1
< 0.1%
144000 1
< 0.1%
139200 1
< 0.1%
110000 1
< 0.1%
92000 1
< 0.1%
90000 2
< 0.1%
88200 1
< 0.1%
83200 1
< 0.1%

받은금액
Real number (ℝ)

HIGH CORRELATION 

Distinct119
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5768.1609
Minimum19
Maximum409100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-18T12:39:07.717409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19
5-th percentile2000
Q12000
median2000
Q32000
95-th percentile30000
Maximum409100
Range409081
Interquartile range (IQR)0

Descriptive statistics

Standard deviation11624.451
Coefficient of variation (CV)2.0152786
Kurtosis176.31848
Mean5768.1609
Median Absolute Deviation (MAD)0
Skewness8.0195861
Sum57681609
Variance1.3512786 × 108
MonotonicityNot monotonic
2024-03-18T12:39:07.848390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 7486
74.9%
4000 609
 
6.1%
8000 192
 
1.9%
3000 177
 
1.8%
25000 163
 
1.6%
1500 138
 
1.4%
20000 83
 
0.8%
6000 72
 
0.7%
10000 71
 
0.7%
40000 64
 
0.6%
Other values (109) 945
 
9.4%
ValueCountFrequency (%)
19 1
 
< 0.1%
1000 6
 
0.1%
1500 138
 
1.4%
2000 7486
74.9%
2300 1
 
< 0.1%
3000 177
 
1.8%
3300 5
 
0.1%
3500 7
 
0.1%
4000 609
 
6.1%
4500 26
 
0.3%
ValueCountFrequency (%)
409100 1
< 0.1%
245000 1
< 0.1%
147000 1
< 0.1%
144000 1
< 0.1%
139200 1
< 0.1%
114400 1
< 0.1%
92000 1
< 0.1%
90000 2
< 0.1%
88200 1
< 0.1%
83200 1
< 0.1%

환불금액
Real number (ℝ)

SKEWED  ZEROS 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.5
Minimum-5000
Maximum25000
Zeros9971
Zeros (%)99.7%
Negative1
Negative (%)< 0.1%
Memory size166.0 KiB
2024-03-18T12:39:07.966094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-5000
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum25000
Range30000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation508.52887
Coefficient of variation (CV)27.488047
Kurtosis1795.7115
Mean18.5
Median Absolute Deviation (MAD)0
Skewness38.96158
Sum185000
Variance258601.61
MonotonicityNot monotonic
2024-03-18T12:39:08.218154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 9971
99.7%
5000 5
 
0.1%
7000 5
 
0.1%
2000 4
 
< 0.1%
25000 3
 
< 0.1%
200 2
 
< 0.1%
4000 2
 
< 0.1%
6500 1
 
< 0.1%
700 1
 
< 0.1%
3000 1
 
< 0.1%
Other values (5) 5
 
0.1%
ValueCountFrequency (%)
-5000 1
 
< 0.1%
0 9971
99.7%
200 2
 
< 0.1%
700 1
 
< 0.1%
2000 4
 
< 0.1%
3000 1
 
< 0.1%
4000 2
 
< 0.1%
4400 1
 
< 0.1%
5000 5
 
0.1%
6000 1
 
< 0.1%
ValueCountFrequency (%)
25000 3
< 0.1%
10000 1
 
< 0.1%
8000 1
 
< 0.1%
7000 5
0.1%
6500 1
 
< 0.1%
6000 1
 
< 0.1%
5000 5
0.1%
4400 1
 
< 0.1%
4000 2
 
< 0.1%
3000 1
 
< 0.1%

취소여부
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%
2024-03-18T12:39:08.290059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct304
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2010-03-03 00:00:00
Maximum2011-02-09 00:00:00
2024-03-18T12:39:08.364879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:08.465157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

환불인
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
***
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row***
2nd row***
3rd row***
4th row***
5th row***

Common Values

ValueCountFrequency (%)
*** 10000
100.0%

Length

2024-03-18T12:39:08.560901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:39:08.627114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10000
100.0%
Distinct304
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2010-03-03 00:00:00
Maximum2011-02-09 00:00:00
2024-03-18T12:39:08.703012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:08.810924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

확인자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
***
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row***
2nd row***
3rd row***
4th row***
5th row***

Common Values

ValueCountFrequency (%)
*** 10000
100.0%

Length

2024-03-18T12:39:08.908817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:39:08.982497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10000
100.0%

Interactions

2024-03-18T12:39:06.424542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.166217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.674159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.078356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.504020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.288519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.819798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.162331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.586936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.499628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.911285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.254401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.673817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.577941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:05.992841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:39:06.345874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T12:39:09.031456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
코드명영수증번호지불금액받은금액환불금액
코드명1.0000.2100.3110.3090.138
영수증번호0.2101.0000.0660.0720.181
지불금액0.3110.0661.0001.0000.088
받은금액0.3090.0721.0001.0000.115
환불금액0.1380.1810.0880.1151.000
2024-03-18T12:39:09.129243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영수증번호지불금액받은금액환불금액코드명
영수증번호1.000-0.123-0.124-0.0610.144
지불금액-0.1231.0001.0000.0890.216
받은금액-0.1241.0001.0000.0990.215
환불금액-0.0610.0890.0991.0000.054
코드명0.1440.2160.2150.0541.000

Missing values

2024-03-18T12:39:06.784411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T12:39:06.916528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

코드명영수증번호지불금액받은금액환불금액취소여부환불날짜환불인처리날짜확인자
77282I_FMC4560335800040000N2010-11-25***2010-11-25***
3025I_FMC436801425000250000N2010-04-21***2010-04-21***
8091I_FMC4380383200020000N2010-05-13***2010-05-13***
65174I_FMC4525498200020000N2010-10-20***2010-10-20***
97251I_FMC4614484200020000N2011-01-18***2011-01-18***
50473I_FMC4484690200020000N2010-09-06***2010-09-06***
5860I_FMC437804523000230000N2010-05-06***2010-05-06***
1663I_FMC4366931200020000N2010-04-20***2010-04-20***
47967I_FMC4470290200020000N2010-08-26***2010-08-26***
18117I_FMC4406217200020000N2010-06-22***2010-06-22***
코드명영수증번호지불금액받은금액환불금액취소여부환불날짜환불인처리날짜확인자
51613I_FMC4489329200020000N2010-09-13***2010-09-13***
29334I_FMC4449098200020000N2010-08-09***2010-08-09***
95272I_FMC4609109200020000N2011-01-10***2011-01-10***
5014I_FMC4377284200020000N2010-05-04***2010-05-04***
91398I_FMC4610299200020000N2011-01-12***2011-01-12***
40247I_FMC4450510200020000N2010-08-11***2010-08-11***
23897I_FMC4422555200020000N2010-07-12***2010-07-12***
78130I_FMC4558429200020000N2010-11-24***2010-11-24***
13440I_FMC4408190200020000N2010-06-24***2010-06-24***
30286I_FMC4428415200020000N2010-07-20***2010-07-20***