Overview

Dataset statistics

Number of variables5
Number of observations250
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.6 KiB
Average record size in memory43.5 B

Variable types

DateTime1
Categorical1
Numeric3

Dataset

Description한국남동발전의 상품권 구입 현황입니다. 구매일자, 종류, 구매수량, 구매단가, 구매금액의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15064205/fileData.do

Alerts

구매수량 is highly overall correlated with 구매단가 and 1 other fieldsHigh correlation
구매단가 is highly overall correlated with 구매수량 High correlation
구매금액 is highly overall correlated with 구매수량 High correlation
종류 is highly imbalanced (63.6%)Imbalance

Reproduction

Analysis started2023-12-12 13:07:56.752758
Analysis finished2023-12-12 13:07:58.312194
Duration1.56 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct201
Distinct (%)80.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2021-01-06 00:00:00
Maximum2022-12-30 00:00:00
2023-12-12T22:07:58.382789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:58.511376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

종류
Categorical

IMBALANCE 

Distinct23
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
온누리상품권
186 
온누리 상품권
28 
상품권
 
8
스타벅스 상품권
 
4
(주)스타벅스코리아
 
3
Other values (18)
21 

Length

Max length13
Median length6
Mean length6.212
Min length3

Unique

Unique15 ?
Unique (%)6.0%

Sample

1st row온누리상품권
2nd row온누리상품권
3rd row여수사랑상품권
4th row온누리상품권
5th row온누리상품권

Common Values

ValueCountFrequency (%)
온누리상품권 186
74.4%
온누리 상품권 28
 
11.2%
상품권 8
 
3.2%
스타벅스 상품권 4
 
1.6%
(주)스타벅스코리아 3
 
1.2%
국민관광상품권 2
 
0.8%
문화상품권 2
 
0.8%
기프트카드 2
 
0.8%
여수사랑상품권 1
 
0.4%
커피카드 1
 
0.4%
Other values (13) 13
 
5.2%

Length

2023-12-12T22:07:58.660272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
온누리상품권 186
64.4%
상품권 42
 
14.5%
온누리 28
 
9.7%
스타벅스 4
 
1.4%
주)스타벅스코리아 3
 
1.0%
국민관광상품권 2
 
0.7%
문화상품권 2
 
0.7%
기프트카드 2
 
0.7%
동반성장몰 2
 
0.7%
모바일커피쿠폰(기프티콘 1
 
0.3%
Other values (17) 17
 
5.9%

구매수량
Real number (ℝ)

HIGH CORRELATION 

Distinct121
Distinct (%)48.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean227.86
Minimum1
Maximum2050
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T22:07:58.784452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.45
Q135
median100
Q3245
95-th percentile1019.25
Maximum2050
Range2049
Interquartile range (IQR)210

Descriptive statistics

Standard deviation356.99131
Coefficient of variation (CV)1.5667134
Kurtosis8.6845362
Mean227.86
Median Absolute Deviation (MAD)80
Skewness2.8162712
Sum56965
Variance127442.8
MonotonicityNot monotonic
2023-12-12T22:07:58.907185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 13
 
5.2%
10 11
 
4.4%
20 10
 
4.0%
100 9
 
3.6%
70 8
 
3.2%
40 8
 
3.2%
300 7
 
2.8%
120 7
 
2.8%
140 6
 
2.4%
80 6
 
2.4%
Other values (111) 165
66.0%
ValueCountFrequency (%)
1 13
5.2%
2 3
 
1.2%
3 1
 
0.4%
5 1
 
0.4%
6 3
 
1.2%
7 1
 
0.4%
8 1
 
0.4%
10 11
4.4%
11 3
 
1.2%
12 1
 
0.4%
ValueCountFrequency (%)
2050 1
0.4%
2000 1
0.4%
1831 1
0.4%
1775 1
0.4%
1569 1
0.4%
1500 1
0.4%
1240 1
0.4%
1190 1
0.4%
1169 1
0.4%
1070 2
0.8%

구매단가
Real number (ℝ)

HIGH CORRELATION 

Distinct16
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80965.544
Minimum10000
Maximum5000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T22:07:59.024549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000
5-th percentile10000
Q110000
median10000
Q310000
95-th percentile100000
Maximum5000000
Range4990000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation509174.77
Coefficient of variation (CV)6.2887834
Kurtosis79.145095
Mean80965.544
Median Absolute Deviation (MAD)0
Skewness8.8781011
Sum20241386
Variance2.5925895 × 1011
MonotonicityNot monotonic
2023-12-12T22:07:59.127284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
10000 207
82.8%
100000 14
 
5.6%
50000 9
 
3.6%
30000 5
 
2.0%
20000 3
 
1.2%
300000 2
 
0.8%
13286 1
 
0.4%
4800000 1
 
0.4%
69300 1
 
0.4%
500000 1
 
0.4%
Other values (6) 6
 
2.4%
ValueCountFrequency (%)
10000 207
82.8%
13286 1
 
0.4%
18800 1
 
0.4%
20000 3
 
1.2%
30000 5
 
2.0%
50000 9
 
3.6%
69300 1
 
0.4%
80000 1
 
0.4%
90000 1
 
0.4%
100000 14
 
5.6%
ValueCountFrequency (%)
5000000 1
 
0.4%
4800000 1
 
0.4%
4100000 1
 
0.4%
840000 1
 
0.4%
500000 1
 
0.4%
300000 2
 
0.8%
100000 14
5.6%
90000 1
 
0.4%
80000 1
 
0.4%
69300 1
 
0.4%

구매금액
Real number (ℝ)

HIGH CORRELATION 

Distinct119
Distinct (%)47.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2831130.2
Minimum30000
Maximum70560000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T22:07:59.237007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30000
5-th percentile100000
Q1500000
median1075000
Q32800000
95-th percentile10700000
Maximum70560000
Range70530000
Interquartile range (IQR)2300000

Descriptive statistics

Standard deviation5805411.2
Coefficient of variation (CV)2.0505631
Kurtosis75.584482
Mean2831130.2
Median Absolute Deviation (MAD)775000
Skewness7.2484154
Sum7.0778256 × 108
Variance3.3702799 × 1013
MonotonicityNot monotonic
2023-12-12T22:07:59.383566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200000 12
 
4.8%
1000000 12
 
4.8%
700000 9
 
3.6%
100000 9
 
3.6%
1200000 8
 
3.2%
600000 8
 
3.2%
3000000 7
 
2.8%
1400000 7
 
2.8%
400000 7
 
2.8%
500000 6
 
2.4%
Other values (109) 165
66.0%
ValueCountFrequency (%)
30000 4
 
1.6%
50000 2
 
0.8%
80000 1
 
0.4%
90000 1
 
0.4%
100000 9
3.6%
132860 1
 
0.4%
140000 1
 
0.4%
150000 1
 
0.4%
160000 1
 
0.4%
200000 12
4.8%
ValueCountFrequency (%)
70560000 1
0.4%
25000000 1
0.4%
20500000 1
0.4%
20000000 1
0.4%
18310000 1
0.4%
17750000 1
0.4%
15690000 1
0.4%
15500000 1
0.4%
15000000 1
0.4%
12400000 1
0.4%

Interactions

2023-12-12T22:07:57.790516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:56.908030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:57.189105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:57.890291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:56.993033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:57.264196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:58.011589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:57.097671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:57.673437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:07:59.478360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류구매수량구매단가구매금액
종류1.0000.0000.0000.000
구매수량0.0001.0000.0000.917
구매단가0.0000.0001.0000.585
구매금액0.0000.9170.5851.000
2023-12-12T22:07:59.561934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구매수량구매단가구매금액종류
구매수량1.000-0.5530.8710.000
구매단가-0.5531.000-0.1770.282
구매금액0.871-0.1771.0000.000
종류0.0000.2820.0001.000

Missing values

2023-12-12T22:07:58.158054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:07:58.263181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구매일자종류구매수량구매단가구매금액
02021-01-06온누리상품권7010000700000
12021-01-13온누리상품권3010000300000
22021-01-21여수사랑상품권180100001800000
32021-01-28온누리상품권7010000700000
42021-02-02온누리상품권300100003000000
52021-02-03온누리상품권396100003960000
62021-02-04온누리상품권150100001500000
72021-02-15커피카드13000030000
82021-03-02온누리상품권4010000400000
92021-03-03(주)스타벅스코리아1150000550000
구매일자종류구매수량구매단가구매금액
2402022-12-26온누리상품권219100002190000
2412022-12-26스타벅스 상품권100100001000000
2422022-12-27온누리상품권115100001150000
2432022-12-28온누리상품권965100009650000
2442022-12-28상품권1020000200000
2452022-12-28상품권11901000011900000
2462022-12-28스타벅스 상품권1320000260000
2472022-12-29온누리 상품권8010000800000
2482022-12-29온누리상품권1010000100000
2492022-12-30온누리상품권10701000010700000