Overview

Dataset statistics

Number of variables: 6
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 54.6 B

Variable types

Categorical: 4
Numeric: 2

Dataset

Description: Sample data
Author: 경기도경제과학진흥원
URL: https://www.bigdata-region.kr/#/dataset/e6681bd5-8ab5-4ab1-a764-91e4a394fbf0

Alerts

시도명 has constant value "경기도" (Constant)
년월 is highly overall correlated with 시군구명 and 1 other field (High correlation)
시군구명 is highly overall correlated with 년월 and 1 other field (High correlation)
결제상품명 is highly overall correlated with 년월 and 1 other field (High correlation)
년월 is highly imbalanced (78.4%) (Imbalance)
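
The 78.4% imbalance figure for 년월 can be reproduced from its two value counts (28 and 1). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for the number of classes. The report does not state its formula, so treat this as a plausible reconstruction that happens to match the reported number; the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (bits) of the class counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption for this sketch: a single class is scored as 0.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts for 년월: 28 rows of 2019-04 and 1 row of 2019-03.
score = imbalance_score([28, 1])
print(f"{score:.1%}")  # prints 78.4%
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.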

Reproduction

Analysis started: 2023-12-10 14:17:06.037461
Analysis finished: 2023-12-10 14:17:07.271624
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년월
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 1
Unique (%): 3.4%

Sample

1st row: 2019-03
2nd row: 2019-04
3rd row: 2019-04
4th row: 2019-04
5th row: 2019-04

Common Values

Value | Count | Frequency (%)
2019-04 | 28 | 96.6%
2019-03 | 1 | 3.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
2019-04 | 28 | 96.6%
2019-03 | 1 | 3.4%

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 29 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
경기도 | 29 | 100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 13.8%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 8
Median length: 7
Mean length: 6
Min length: 3

Unique

Unique: 1
Unique (%): 3.4%

Sample

1st row: 양주시
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
고양시 덕양구 | 13 | 44.8%
가평군 | 8 | 27.6%
고양시 일산동구 | 7 | 24.1%
양주시 | 1 | 3.4%
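
The length statistics reported for 시군구명 follow from the value counts above if each value's length is counted in characters, spaces included. The sketch below is a quick check of that assumption; the variable names are illustrative only.

```python
import statistics

# Value counts copied from the Common Values table for 시군구명.
value_counts = {"고양시 덕양구": 13, "가평군": 8, "고양시 일산동구": 7, "양주시": 1}

# One length per row, counting characters (the space counts as one character).
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```

Under this assumption the result matches the reported max 8, median 7, mean 6 and min 3.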

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
고양시 | 20 | 40.8%
덕양구 | 13 | 26.5%
가평군 | 8 | 16.3%
일산동구 | 7 | 14.3%
양주시 | 1 | 2.0%

연령대코드
Real number (ℝ)

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 40
Minimum: 10
Maximum: 60
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 393.0 B

Quantile statistics

Minimum: 10
5-th percentile: 20
Q1: 30
Median: 40
Q3: 50
95-th percentile: 60
Maximum: 60
Range: 50
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 13.627703
Coefficient of variation (CV): 0.34069257
Kurtosis: -0.52713296
Mean: 40
Median Absolute Deviation (MAD): 10
Skewness: -0.1818819
Sum: 1160
Variance: 185.71429
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value | Count | Frequency (%)
40 | 9 | 31.0%
30 | 6 | 20.7%
50 | 5 | 17.2%
60 | 5 | 17.2%
20 | 3 | 10.3%
10 | 1 | 3.4%

Minimum 10 values

Value | Count | Frequency (%)
10 | 1 | 3.4%
20 | 3 | 10.3%
30 | 6 | 20.7%
40 | 9 | 31.0%
50 | 5 | 17.2%
60 | 5 | 17.2%

Maximum 10 values

Value | Count | Frequency (%)
60 | 5 | 17.2%
50 | 5 | 17.2%
40 | 9 | 31.0%
30 | 6 | 20.7%
20 | 3 | 10.3%
10 | 1 | 3.4%
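
The descriptive statistics for 연령대코드 can be recomputed from the value counts listed above. The sketch below uses only the Python standard library and assumes the reported variance and standard deviation are the sample versions (divisor n - 1), which is what matches the figures in the report.

```python
import statistics

# Value counts for 연령대코드, copied from the frequency table.
value_counts = {10: 1, 20: 3, 30: 6, 40: 9, 50: 5, 60: 5}
values = [v for v, count in value_counts.items() for _ in range(count)]

total = sum(values)                      # reported Sum: 1160
mean = statistics.mean(values)           # reported Mean: 40
variance = statistics.variance(values)   # sample variance (n - 1)
std = statistics.stdev(values)           # sample standard deviation
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of |x - median|.
mad = statistics.median([abs(v - median) for v in values])

print(total, mean, round(variance, 5), round(std, 6), round(cv, 8), mad)
```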

결제상품명
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 34.5%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 10
Median length: 9
Mean length: 6.1034483
Min length: 4

Unique

Unique: 5
Unique (%): 17.2%

Sample

1st row: 양주사랑카드
2nd row: 가평사랑상품권
3rd row: 가평사랑상품권
4th row: 오산화폐 오색전
5th row: 가평사랑상품권

Common Values

Value | Count | Frequency (%)
고양페이카드 | 11 | 37.9%
가평사랑상품권 | 5 | 17.2%
수원페이 | 4 | 13.8%
용인와이페이 | 2 | 6.9%
부천페이 | 2 | 6.9%
양주사랑카드 | 1 | 3.4%
오산화폐 오색전 | 1 | 3.4%
안산사랑상품권 다온 | 1 | 3.4%
과천화폐 과천토리 | 1 | 3.4%
의정부사랑카드 | 1 | 3.4%
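
The report lists both a Distinct and a Unique count for each categorical variable. Judging from the numbers for 결제상품명, Distinct is the number of different values and Unique is the number of values that occur exactly once. The sketch below checks that reading against the counts in the Common Values table; it is an interpretation of the figures, not a statement of how the tool computes them.

```python
# Per-value row counts for 결제상품명, copied from the Common Values table.
counts = [11, 5, 4, 2, 2, 1, 1, 1, 1, 1]

n_rows = sum(counts)                          # 29 observations
distinct = len(counts)                        # number of different values
unique = sum(1 for c in counts if c == 1)     # values that appear exactly once

print(f"Distinct: {distinct} ({distinct / n_rows:.1%})")
print(f"Unique: {unique} ({unique / n_rows:.1%})")
```

With these counts the sketch gives 10 distinct values (34.5%) and 5 unique values (17.2%), the same as the report.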

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
고양페이카드 | 11 | 34.4%
가평사랑상품권 | 5 | 15.6%
수원페이 | 4 | 12.5%
용인와이페이 | 2 | 6.2%
부천페이 | 2 | 6.2%
양주사랑카드 | 1 | 3.1%
오산화폐 | 1 | 3.1%
오색전 | 1 | 3.1%
안산사랑상품권 | 1 | 3.1%
다온 | 1 | 3.1%
Other values (3) | 3 | 9.4%

사용빈도
Real number (ℝ)

Distinct: 16
Distinct (%): 55.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.517241
Minimum: 1
Maximum: 85
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 393.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 3
Q3: 26
95-th percentile: 61.6
Maximum: 85
Range: 84
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 22.598389
Coefficient of variation (CV): 1.3681697
Kurtosis: 2.0229261
Mean: 16.517241
Median Absolute Deviation (MAD): 2
Skewness: 1.5943747
Sum: 479
Variance: 510.68719
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=16)]

Common values

Value | Count | Frequency (%)
1 | 10 | 34.5%
3 | 3 | 10.3%
2 | 3 | 10.3%
8 | 1 | 3.4%
26 | 1 | 3.4%
30 | 1 | 3.4%
40 | 1 | 3.4%
19 | 1 | 3.4%
4 | 1 | 3.4%
58 | 1 | 3.4%
Other values (6) | 6 | 20.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 10 | 34.5%
2 | 3 | 10.3%
3 | 3 | 10.3%
4 | 1 | 3.4%
8 | 1 | 3.4%
19 | 1 | 3.4%
21 | 1 | 3.4%
23 | 1 | 3.4%
26 | 1 | 3.4%
30 | 1 | 3.4%

Maximum 10 values

Value | Count | Frequency (%)
85 | 1 | 3.4%
64 | 1 | 3.4%
58 | 1 | 3.4%
40 | 1 | 3.4%
39 | 1 | 3.4%
37 | 1 | 3.4%
30 | 1 | 3.4%
26 | 1 | 3.4%
23 | 1 | 3.4%
21 | 1 | 3.4%
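
The quantile statistics for 사용빈도 can be checked by rebuilding the 29 values from the three frequency tables above; the rebuilt list sums to 479, which agrees with the reported Sum. The sketch below assumes linear interpolation between neighbouring order statistics, a common default. The report does not say which interpolation rule it uses, and the helper `quantile` is hypothetical.

```python
# 사용빈도 values rebuilt from the frequency tables (sorted ascending).
values = [1] * 10 + [2] * 3 + [3] * 3 + [4, 8, 19, 21, 23, 26, 30, 37, 39, 40, 58, 64, 85]


def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(q1, median, q3, round(p95, 1), iqr)
```

Under that assumption the results agree with the report: Q1 of 1, median of 3, Q3 of 26, a 95th percentile of about 61.6, and an IQR of 25.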

Interactions

[Figures: 4 interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 년월 | 시군구명 | 연령대코드 | 결제상품명 | 사용빈도
년월 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000
시군구명 | 1.000 | 1.000 | 0.000 | 0.890 | 0.000
연령대코드 | 0.000 | 0.000 | 1.000 | 0.000 | 0.362
결제상품명 | 1.000 | 0.890 | 0.000 | 1.000 | 0.000
사용빈도 | 0.000 | 0.000 | 0.362 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 결제상품명 | 년월 | 시군구명
결제상품명 | 1.000 | 0.839 | 0.661
년월 | 0.839 | 1.000 | 0.962
시군구명 | 0.661 | 0.962 | 1.000

[Figure: correlation heatmap]

Variable | 연령대코드 | 사용빈도 | 년월 | 시군구명 | 결제상품명
연령대코드 | 1.000 | -0.224 | 0.000 | 0.000 | 0.000
사용빈도 | -0.224 | 1.000 | 0.000 | 0.000 | 0.000
년월 | 0.000 | 0.000 | 1.000 | 0.962 | 0.839
시군구명 | 0.000 | 0.000 | 0.962 | 1.000 | 0.661
결제상품명 | 0.000 | 0.000 | 0.839 | 0.661 | 1.000
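
The 0.962 reported between 년월 and 시군구명 appears consistent with a bias-corrected Cramér's V. The report text does not name the measure behind each matrix, so that is an inference from the number. The sketch below computes the statistic in plain Python. The contingency table is derived from the counts in this report: the single 2019-03 row is the 양주시 row, and the other 28 rows fall in 2019-04. The function name is hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and every column has at least one observation.
    """
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic, without continuity correction.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    # Bias correction applied to phi squared and to the table dimensions.
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Rows: 년월 (2019-03, 2019-04).
# Columns: 시군구명 (양주시, 가평군, 고양시 덕양구, 고양시 일산동구).
table = [
    [1, 0, 0, 0],
    [0, 8, 13, 7],
]
v = cramers_v_corrected(table)
print(round(v, 3))  # prints 0.962
```

Without the correction this table would give exactly 1, because 년월 is fully determined by 시군구명. The correction pulls the value down for small samples such as these 29 rows.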

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

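First rows

Index | 년월 | 시도명 | 시군구명 | 연령대코드 | 결제상품명 | 사용빈도
0 | 2019-03 | 경기도 | 양주시 | 40 | 양주사랑카드 | 3
1 | 2019-04 | 경기도 | 가평군 | 20 | 가평사랑상품권 | 3
2 | 2019-04 | 경기도 | 가평군 | 30 | 가평사랑상품권 | 37
3 | 2019-04 | 경기도 | 가평군 | 40 | 오산화폐 오색전 | 1
4 | 2019-04 | 경기도 | 가평군 | 40 | 가평사랑상품권 | 23
5 | 2019-04 | 경기도 | 가평군 | 50 | 가평사랑상품권 | 85
6 | 2019-04 | 경기도 | 가평군 | 60 | 고양페이카드 | 1
7 | 2019-04 | 경기도 | 가평군 | 60 | 가평사랑상품권 | 2
8 | 2019-04 | 경기도 | 가평군 | 60 | 수원페이 | 1
9 | 2019-04 | 경기도 | 고양시 덕양구 | 20 | 고양페이카드 | 21

Last rows

Index | 년월 | 시도명 | 시군구명 | 연령대코드 | 결제상품명 | 사용빈도
19 | 2019-04 | 경기도 | 고양시 덕양구 | 50 | 용인와이페이 | 1
20 | 2019-04 | 경기도 | 고양시 덕양구 | 60 | 고양페이카드 | 8
21 | 2019-04 | 경기도 | 고양시 덕양구 | 60 | 수원페이 | 1
22 | 2019-04 | 경기도 | 고양시 일산동구 | 10 | 고양페이카드 | 4
23 | 2019-04 | 경기도 | 고양시 일산동구 | 20 | 고양페이카드 | 19
24 | 2019-04 | 경기도 | 고양시 일산동구 | 30 | 고양페이카드 | 40
25 | 2019-04 | 경기도 | 고양시 일산동구 | 40 | 부천페이 | 2
26 | 2019-04 | 경기도 | 고양시 일산동구 | 40 | 고양페이카드 | 30
27 | 2019-04 | 경기도 | 고양시 일산동구 | 50 | 고양페이카드 | 26
28 | 2019-04 | 경기도 | 고양시 일산동구 | 50 | 부천페이 | 1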