Overview

Dataset statistics

Number of variables9
Number of observations1537
Missing cells363
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory117.2 KiB
Average record size in memory78.1 B

Variable types

Categorical4
Numeric5

Dataset

Description지역화폐 연령별 성별 이용현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=DYIQ115949LR11ULQ2JU32226136&infSeq=1

Alerts

결제건수 is highly overall correlated with 결제금액 and 2 other fieldsHigh correlation
결제금액 is highly overall correlated with 결제건수 and 2 other fieldsHigh correlation
결제취소건수 is highly overall correlated with 결제건수 and 2 other fieldsHigh correlation
결제취소금액 is highly overall correlated with 결제건수 and 2 other fieldsHigh correlation
1회평균결제금액 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 1회평균결제금액 and 1 other fieldsHigh correlation
성별 is highly overall correlated with 연령대High correlation
성별 is highly imbalanced (50.4%)Imbalance
결제취소건수 has 121 (7.9%) missing valuesMissing
결제취소금액 has 121 (7.9%) missing valuesMissing
1회평균결제금액 has 121 (7.9%) missing valuesMissing

Reproduction

Analysis started2024-04-14 03:17:02.582730
Analysis finished2024-04-14 03:17:06.370060
Duration3.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
2022
560 
2023
524 
2021
453 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2022 560
36.4%
2023 524
34.1%
2021 453
29.5%

Length

2024-04-14T12:17:06.427775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:17:06.529921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 560
36.4%
2023 524
34.1%
2021 453
29.5%

시군명
Categorical

Distinct31
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
안양시
 
54
하남시
 
54
김포시
 
54
동두천시
 
54
의왕시
 
54
Other values (26)
1267 

Length

Max length4
Median length3
Mean length3.0917372
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
안양시 54
 
3.5%
하남시 54
 
3.5%
김포시 54
 
3.5%
동두천시 54
 
3.5%
의왕시 54
 
3.5%
시흥시 54
 
3.5%
과천시 53
 
3.4%
광명시 53
 
3.4%
광주시 53
 
3.4%
파주시 53
 
3.4%
Other values (21) 1001
65.1%

Length

2024-04-14T12:17:06.624462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안양시 54
 
3.5%
하남시 54
 
3.5%
김포시 54
 
3.5%
동두천시 54
 
3.5%
의왕시 54
 
3.5%
시흥시 54
 
3.5%
과천시 53
 
3.4%
광명시 53
 
3.4%
광주시 53
 
3.4%
파주시 53
 
3.4%
Other values (21) 1001
65.1%

연령대
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
10
168 
20
168 
30
168 
40
168 
50
168 
Other values (23)
697 

Length

Max length9
Median length2
Mean length2.2635003
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row10
2nd row10
3rd row20
4th row20
5th row30

Common Values

ValueCountFrequency (%)
10 168
10.9%
20 168
10.9%
30 168
10.9%
40 168
10.9%
50 168
10.9%
60 168
10.9%
70 168
10.9%
80 148
9.6%
기타 81
5.3%
전체 (기타포함) 34
 
2.2%
Other values (18) 98
6.4%

Length

2024-04-14T12:17:06.722355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10 168
10.6%
30 168
10.6%
40 168
10.6%
50 168
10.6%
60 168
10.6%
70 168
10.6%
20 168
10.6%
80 148
9.4%
기타 81
5.1%
전체 34
 
2.2%
Other values (19) 140
8.9%

성별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
711 
711 
-
77 
전체
 
34
기타
 
2
Other values (2)
 
2

Length

Max length4
Median length1
Mean length1.0266753
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
711
46.3%
711
46.3%
- 77
 
5.0%
전체 34
 
2.2%
기타 2
 
0.1%
<NA> 1
 
0.1%
미분류 1
 
0.1%

Length

2024-04-14T12:17:06.977681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:17:07.067461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
711
46.3%
711
46.3%
77
 
5.0%
전체 34
 
2.2%
기타 2
 
0.1%
na 1
 
0.1%
미분류 1
 
0.1%

결제건수
Real number (ℝ)

HIGH CORRELATION 

Distinct1534
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean556810.9
Minimum0
Maximum17524460
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size13.6 KiB
2024-04-14T12:17:07.168570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5817
Q150158
median193383
Q3588580
95-th percentile1907573.6
Maximum17524460
Range17524460
Interquartile range (IQR)538422

Descriptive statistics

Standard deviation1319317.4
Coefficient of variation (CV)2.3694174
Kurtosis75.120712
Mean556810.9
Median Absolute Deviation (MAD)173475
Skewness7.6799392
Sum8.5581835 × 108
Variance1.7405984 × 1012
MonotonicityNot monotonic
2024-04-14T12:17:07.304742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15021 2
 
0.1%
20192 2
 
0.1%
0 2
 
0.1%
8409 1
 
0.1%
228460 1
 
0.1%
11024 1
 
0.1%
9531 1
 
0.1%
6940581 1
 
0.1%
280003 1
 
0.1%
9163 1
 
0.1%
Other values (1524) 1524
99.2%
ValueCountFrequency (%)
0 2
0.1%
1 1
0.1%
132 1
0.1%
283 1
0.1%
366 1
0.1%
500 1
0.1%
535 1
0.1%
562 1
0.1%
686 1
0.1%
1005 1
0.1%
ValueCountFrequency (%)
17524460 1
0.1%
16953052 1
0.1%
15682281 1
0.1%
14920179 1
0.1%
14105390 1
0.1%
12223755 1
0.1%
10653080 1
0.1%
10535661 1
0.1%
9989797 1
0.1%
9625485 1
0.1%

결제금액
Real number (ℝ)

HIGH CORRELATION 

Distinct1536
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3510526 × 1010
Minimum0
Maximum4.1642798 × 1011
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size13.6 KiB
2024-04-14T12:17:07.428660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.3839162 × 108
Q11.2176234 × 109
median4.4065095 × 109
Q31.3272831 × 1010
95-th percentile4.8020067 × 1010
Maximum4.1642798 × 1011
Range4.1642798 × 1011
Interquartile range (IQR)1.2055208 × 1010

Descriptive statistics

Standard deviation3.2047633 × 1010
Coefficient of variation (CV)2.3720493
Kurtosis65.29163
Mean1.3510526 × 1010
Median Absolute Deviation (MAD)3.9927589 × 109
Skewness7.1272417
Sum2.0765679 × 1013
Variance1.0270508 × 1021
MonotonicityNot monotonic
2024-04-14T12:17:07.532781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2
 
0.1%
188845090 1
 
0.1%
12710107612 1
 
0.1%
138536981 1
 
0.1%
168570561985 1
 
0.1%
7790299738 1
 
0.1%
251366112 1
 
0.1%
146012952 1
 
0.1%
1301387790 1
 
0.1%
1217623436 1
 
0.1%
Other values (1526) 1526
99.3%
ValueCountFrequency (%)
0 2
0.1%
9500 1
0.1%
2019260 1
0.1%
11506780 1
0.1%
13582215 1
0.1%
14782350 1
0.1%
17481380 1
0.1%
20427690 1
0.1%
23365283 1
0.1%
47333543 1
0.1%
ValueCountFrequency (%)
416427980547 1
0.1%
384296854566 1
0.1%
352759911040 1
0.1%
351009805687 1
0.1%
328395737209 1
0.1%
295245600391 1
0.1%
276953512031 1
0.1%
275779418464 1
0.1%
249864997298 1
0.1%
237325433293 1
0.1%

결제취소건수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1182
Distinct (%)83.5%
Missing121
Missing (%)7.9%
Infinite0
Infinite (%)0.0%
Mean4126.3446
Minimum0
Maximum128058
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size13.6 KiB
2024-04-14T12:17:07.637738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile46.75
Q1347.5
median1441.5
Q34094
95-th percentile14641
Maximum128058
Range128058
Interquartile range (IQR)3746.5

Descriptive statistics

Standard deviation10345.021
Coefficient of variation (CV)2.5070668
Kurtosis73.947967
Mean4126.3446
Median Absolute Deviation (MAD)1272
Skewness7.7278788
Sum5842904
Variance1.0701947 × 108
MonotonicityNot monotonic
2024-04-14T12:17:07.749362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
62 7
 
0.5%
78 6
 
0.4%
38 5
 
0.3%
119 5
 
0.3%
191 5
 
0.3%
33 5
 
0.3%
83 4
 
0.3%
39 4
 
0.3%
220 4
 
0.3%
15 4
 
0.3%
Other values (1172) 1367
88.9%
(Missing) 121
 
7.9%
ValueCountFrequency (%)
0 2
0.1%
3 2
0.1%
5 1
 
0.1%
9 1
 
0.1%
11 2
0.1%
12 2
0.1%
14 3
0.2%
15 4
0.3%
16 1
 
0.1%
18 1
 
0.1%
ValueCountFrequency (%)
128058 1
0.1%
122292 1
0.1%
122230 1
0.1%
114080 1
0.1%
113560 1
0.1%
108124 1
0.1%
97369 1
0.1%
85449 1
0.1%
71239 1
0.1%
70314 1
0.1%

결제취소금액
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1415
Distinct (%)99.9%
Missing121
Missing (%)7.9%
Infinite0
Infinite (%)0.0%
Mean1.9278414 × 108
Minimum0
Maximum8.1148992 × 109
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size13.6 KiB
2024-04-14T12:17:07.857552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1763272.8
Q114949535
median61763438
Q31.688131 × 108
95-th percentile7.2766326 × 108
Maximum8.1148992 × 109
Range8.1148992 × 109
Interquartile range (IQR)1.5386357 × 108

Descriptive statistics

Standard deviation5.1199728 × 108
Coefficient of variation (CV)2.655806
Kurtosis103.69884
Mean1.9278414 × 108
Median Absolute Deviation (MAD)55369785
Skewness8.8020408
Sum2.7298235 × 1011
Variance2.6214121 × 1017
MonotonicityNot monotonic
2024-04-14T12:17:07.969850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2
 
0.1%
367760889 1
 
0.1%
80116648 1
 
0.1%
105293077 1
 
0.1%
119142090 1
 
0.1%
166163421 1
 
0.1%
200566004 1
 
0.1%
128092277 1
 
0.1%
188708409 1
 
0.1%
50952388 1
 
0.1%
Other values (1405) 1405
91.4%
(Missing) 121
 
7.9%
ValueCountFrequency (%)
0 2
0.1%
77500 1
0.1%
128130 1
0.1%
181750 1
0.1%
198100 1
0.1%
240800 1
0.1%
273101 1
0.1%
415270 1
0.1%
418270 1
0.1%
450686 1
0.1%
ValueCountFrequency (%)
8114899177 1
0.1%
7314959661 1
0.1%
6387112539 1
0.1%
6156731806 1
0.1%
4676144576 1
0.1%
3425844436 1
0.1%
3367664253 1
0.1%
3222364484 1
0.1%
3138046458 1
0.1%
2868409925 1
0.1%

1회평균결제금액
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1354
Distinct (%)95.6%
Missing121
Missing (%)7.9%
Infinite0
Infinite (%)0.0%
Mean23882.63
Minimum9500
Maximum96746
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.6 KiB
2024-04-14T12:17:08.096219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9500
5-th percentile16465.5
Q119505.75
median22945
Q326593.75
95-th percentile33649.25
Maximum96746
Range87246
Interquartile range (IQR)7088

Descriptive statistics

Standard deviation6667.5204
Coefficient of variation (CV)0.27917865
Kurtosis17.333376
Mean23882.63
Median Absolute Deviation (MAD)3573
Skewness2.8039534
Sum33817804
Variance44455829
MonotonicityNot monotonic
2024-04-14T12:17:08.199031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17524 3
 
0.2%
18201 3
 
0.2%
23485 3
 
0.2%
19701 2
 
0.1%
28622 2
 
0.1%
17211 2
 
0.1%
23657 2
 
0.1%
28862 2
 
0.1%
25383 2
 
0.1%
23646 2
 
0.1%
Other values (1344) 1393
90.6%
(Missing) 121
 
7.9%
ValueCountFrequency (%)
9500 1
0.1%
11191 1
0.1%
13044 1
0.1%
13330 1
0.1%
13753 1
0.1%
14047 1
0.1%
14129 1
0.1%
14141 1
0.1%
14224 1
0.1%
14280 1
0.1%
ValueCountFrequency (%)
96746 1
0.1%
72340 1
0.1%
64367 1
0.1%
63254 1
0.1%
61772 1
0.1%
61020 1
0.1%
60826 1
0.1%
59411 1
0.1%
56904 1
0.1%
55813 1
0.1%

Interactions

2024-04-14T12:17:05.750681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.093354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.647758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.005702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.371352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.827613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.343134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.724015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.084001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.451770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.893553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.411923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.788644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.152595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.522352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.966578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.487363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.860598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.224817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.597727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:06.042441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.576760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:04.939876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.303254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:17:05.680639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-14T12:17:08.276789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도시군명연령대성별결제건수결제금액결제취소건수결제취소금액1회평균결제금액
기준연도1.0000.1770.2490.2500.2130.1300.1700.1410.249
시군명0.1771.0000.5000.0000.2650.2280.2930.2830.608
연령대0.2490.5001.0000.9430.6100.6040.5940.5960.892
성별0.2500.0000.9431.0000.6070.5660.7340.5550.069
결제건수0.2130.2650.6100.6071.0000.8930.9060.8750.000
결제금액0.1300.2280.6040.5660.8931.0000.9680.9030.030
결제취소건수0.1700.2930.5940.7340.9060.9681.0000.9080.000
결제취소금액0.1410.2830.5960.5550.8750.9030.9081.0000.000
1회평균결제금액0.2490.6080.8920.0690.0000.0300.0000.0001.000
2024-04-14T12:17:08.395535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명성별연령대기준연도
시군명1.0000.0000.1350.088
성별0.0001.0000.7610.107
연령대0.1350.7611.0000.129
기준연도0.0880.1070.1291.000
2024-04-14T12:17:08.482483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결제건수결제금액결제취소건수결제취소금액1회평균결제금액기준연도시군명연령대성별
결제건수1.0000.9910.9790.9610.0910.0950.1010.2760.354
결제금액0.9911.0000.9670.9670.2060.0780.0810.2630.341
결제취소건수0.9790.9671.0000.9810.0750.1020.1060.2520.392
결제취소금액0.9610.9670.9811.0000.1950.0890.1130.2830.383
1회평균결제금액0.0910.2060.0750.1951.0000.1120.2740.6120.042
기준연도0.0950.0780.1020.0890.1121.0000.0880.1290.107
시군명0.1010.0810.1060.1130.2740.0881.0000.1350.000
연령대0.2760.2630.2520.2830.6120.1290.1351.0000.761
성별0.3540.3410.3920.3830.0420.1070.0000.7611.000

Missing values

2024-04-14T12:17:06.131613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-14T12:17:06.235214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-14T12:17:06.321597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준연도시군명연령대성별결제건수결제금액결제취소건수결제취소금액1회평균결제금액
02023가평군10840918884509046487184022457
12023가평군10658417660543568184139026823
22023가평군207036315450182557283755999921958
32023가평군205511813325066456812799634924176
42023가평군30110968268223496513828447631724171
52023가평군3099493256340391612305420782725765
62023가평군401673994188413440162910077027725021
72023가평군40189169478243263319097743707925281
82023가평군501902054879269120179610924430725653
92023가평군50170268478030854715878334567628075
기준연도시군명연령대성별결제건수결제금액결제취소건수결제취소금액1회평균결제금액
15272021화성시4034726588864431588130127121543719425526
15282021화성시50132268128902971561936337733977521852
15292021화성시50111641623546798507965633808854821091
15302021화성시604100338296674651340713001246220234
15312021화성시603920068729262930395014035350822268
15322021화성시709392420167286218783674469621472
15332021화성시709493621131369809483108209322259
15342021화성시8014820296063119128342420919977
15352021화성시8019625441991314191620807222522
15362021화성시기타-1245634298009462701155341538145823924