Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory498.0 KiB
Average record size in memory51.0 B

Variable types

Numeric3
Categorical1
Text1

Dataset

DescriptionBC카드 제공한 데이터의 통계 자료로 충남에 거주하는 자의 충남외 소비를 파악할 수 있습니다. 월별, 시군별, 가맹점시군구(충남외), 카드이용건수와 이용금액을 확인할 수 있습니다. 본 자료는 상업적 이용금지 및 재배포할 수 없습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2157

Alerts

전체이용건수 is highly overall correlated with 전체이용금액High correlation
전체이용금액 is highly overall correlated with 전체이용건수High correlation

Reproduction

Analysis started2024-01-09 22:47:22.689796
Analysis finished2024-01-09 22:47:24.053883
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연월
Real number (ℝ)

Distinct58
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202083.89
Minimum201901
Maximum202310
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:47:24.119522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201901
5-th percentile201903
Q1201911
median202103
Q3202207
95-th percentile202307
Maximum202310
Range409
Interquartile range (IQR)296

Descriptive statistics

Standard deviation143.79572
Coefficient of variation (CV)0.00071156449
Kurtosis-1.3290389
Mean202083.89
Median Absolute Deviation (MAD)106
Skewness0.15947261
Sum2.0208389 × 109
Variance20677.21
MonotonicityNot monotonic
2024-01-10T07:47:24.257554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201904 254
 
2.5%
201903 246
 
2.5%
201906 246
 
2.5%
201902 245
 
2.5%
201911 241
 
2.4%
201907 240
 
2.4%
201908 234
 
2.3%
201905 231
 
2.3%
201909 223
 
2.2%
201901 222
 
2.2%
Other values (48) 7618
76.2%
ValueCountFrequency (%)
201901 222
2.2%
201902 245
2.5%
201903 246
2.5%
201904 254
2.5%
201905 231
2.3%
201906 246
2.5%
201907 240
2.4%
201908 234
2.3%
201909 223
2.2%
201910 221
2.2%
ValueCountFrequency (%)
202310 147
1.5%
202309 142
1.4%
202308 154
1.5%
202307 186
1.9%
202306 178
1.8%
202305 177
1.8%
202304 166
1.7%
202303 135
1.4%
202302 166
1.7%
202301 164
1.6%

시군구명
Categorical

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
천안시 서북구
677 
공주시
675 
아산시
673 
천안시 동남구
 
663
보령시
 
656
Other values (11)
6656 

Length

Max length7
Median length3
Mean length3.536
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row당진시
2nd row아산시
3rd row논산시
4th row천안시 동남구
5th row보령시

Common Values

ValueCountFrequency (%)
천안시 서북구 677
 
6.8%
공주시 675
 
6.8%
아산시 673
 
6.7%
천안시 동남구 663
 
6.6%
보령시 656
 
6.6%
당진시 652
 
6.5%
서산시 651
 
6.5%
예산군 651
 
6.5%
홍성군 632
 
6.3%
계룡시 625
 
6.2%
Other values (6) 3445
34.4%

Length

2024-01-10T07:47:24.395607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
천안시 1340
 
11.8%
서북구 677
 
6.0%
공주시 675
 
6.0%
아산시 673
 
5.9%
동남구 663
 
5.8%
보령시 656
 
5.8%
당진시 652
 
5.7%
예산군 651
 
5.7%
서산시 651
 
5.7%
홍성군 632
 
5.6%
Other values (7) 4070
35.9%
Distinct212
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:47:24.660044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.6178
Min length1

Characters and Unicode

Total characters36178
Distinct characters139
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통영시
2nd row창원시 마산합포구
3rd row구리시
4th row남해군
5th row김해시
ValueCountFrequency (%)
청주시 226
 
2.0%
창원시 219
 
1.9%
수원시 169
 
1.5%
성남시 156
 
1.4%
고양시 148
 
1.3%
용인시 142
 
1.2%
전주시 114
 
1.0%
안양시 110
 
1.0%
남구 101
 
0.9%
안산시 97
 
0.8%
Other values (210) 9986
87.1%
2024-01-10T07:47:25.048966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4395
 
12.1%
4090
 
11.3%
3358
 
9.3%
1468
 
4.1%
1193
 
3.3%
982
 
2.7%
931
 
2.6%
893
 
2.5%
851
 
2.4%
848
 
2.3%
Other values (129) 17169
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34654
95.8%
Space Separator 1468
 
4.1%
Dash Punctuation 56
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4395
 
12.7%
4090
 
11.8%
3358
 
9.7%
1193
 
3.4%
982
 
2.8%
931
 
2.7%
893
 
2.6%
851
 
2.5%
848
 
2.4%
703
 
2.0%
Other values (127) 16410
47.4%
Space Separator
ValueCountFrequency (%)
1468
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34654
95.8%
Common 1524
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4395
 
12.7%
4090
 
11.8%
3358
 
9.7%
1193
 
3.4%
982
 
2.8%
931
 
2.7%
893
 
2.6%
851
 
2.5%
848
 
2.4%
703
 
2.0%
Other values (127) 16410
47.4%
Common
ValueCountFrequency (%)
1468
96.3%
- 56
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34654
95.8%
ASCII 1524
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4395
 
12.7%
4090
 
11.8%
3358
 
9.7%
1193
 
3.4%
982
 
2.8%
931
 
2.7%
893
 
2.6%
851
 
2.5%
848
 
2.4%
703
 
2.0%
Other values (127) 16410
47.4%
ASCII
ValueCountFrequency (%)
1468
96.3%
- 56
 
3.7%

전체이용건수
Real number (ℝ)

HIGH CORRELATION 

Distinct2207
Distinct (%)22.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1268.4743
Minimum3
Maximum222829
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:47:25.184385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile6
Q136
median133
Q3496.25
95-th percentile4008.1
Maximum222829
Range222826
Interquartile range (IQR)460.25

Descriptive statistics

Standard deviation7087.943
Coefficient of variation (CV)5.5877703
Kurtosis343.17188
Mean1268.4743
Median Absolute Deviation (MAD)118
Skewness15.921161
Sum12684743
Variance50238936
MonotonicityNot monotonic
2024-01-10T07:47:25.297526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 268
 
2.7%
4 121
 
1.2%
6 108
 
1.1%
7 104
 
1.0%
13 94
 
0.9%
8 93
 
0.9%
9 91
 
0.9%
10 88
 
0.9%
12 84
 
0.8%
16 81
 
0.8%
Other values (2197) 8868
88.7%
ValueCountFrequency (%)
3 268
2.7%
4 121
1.2%
5 67
 
0.7%
6 108
1.1%
7 104
 
1.0%
8 93
 
0.9%
9 91
 
0.9%
10 88
 
0.9%
11 70
 
0.7%
12 84
 
0.8%
ValueCountFrequency (%)
222829 1
< 0.1%
195742 1
< 0.1%
194369 1
< 0.1%
175555 1
< 0.1%
153584 1
< 0.1%
130762 1
< 0.1%
127220 1
< 0.1%
125902 1
< 0.1%
122307 1
< 0.1%
111761 1
< 0.1%

전체이용금액
Real number (ℝ)

HIGH CORRELATION 

Distinct9850
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56395512
Minimum2500
Maximum1.126971 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:47:25.426498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2500
5-th percentile111020
Q1971267.5
median3766826.5
Q315439652
95-th percentile1.7225814 × 108
Maximum1.126971 × 1010
Range1.1269707 × 1010
Interquartile range (IQR)14468384

Descriptive statistics

Standard deviation3.4883705 × 108
Coefficient of variation (CV)6.1855463
Kurtosis369.80513
Mean56395512
Median Absolute Deviation (MAD)3438016.5
Skewness16.818759
Sum5.6395512 × 1011
Variance1.2168729 × 1017
MonotonicityNot monotonic
2024-01-10T07:47:25.568830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
21400 4
 
< 0.1%
210000 4
 
< 0.1%
54000 4
 
< 0.1%
141000 3
 
< 0.1%
330000 3
 
< 0.1%
32700 3
 
< 0.1%
27500 3
 
< 0.1%
537000 3
 
< 0.1%
24000 3
 
< 0.1%
16000 3
 
< 0.1%
Other values (9840) 9967
99.7%
ValueCountFrequency (%)
2500 1
< 0.1%
2550 1
< 0.1%
4000 1
< 0.1%
4200 1
< 0.1%
6000 2
< 0.1%
6200 1
< 0.1%
6300 1
< 0.1%
6500 1
< 0.1%
7000 1
< 0.1%
7500 1
< 0.1%
ValueCountFrequency (%)
11269709645 1
< 0.1%
9659509551 1
< 0.1%
8411366314 1
< 0.1%
8381080940 1
< 0.1%
7223644956 1
< 0.1%
7214987018 1
< 0.1%
7183280089 1
< 0.1%
7144766337 1
< 0.1%
6996961188 1
< 0.1%
6793288488 1
< 0.1%

Interactions

2024-01-10T07:47:23.619129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.058747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.362254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.728640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.156992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.461415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.815075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.247002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:47:23.533312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:47:25.650821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연월시군구명전체이용건수전체이용금액
기준연월1.0000.0150.0090.000
시군구명0.0151.0000.1110.101
전체이용건수0.0090.1111.0000.943
전체이용금액0.0000.1010.9431.000
2024-01-10T07:47:26.017964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연월전체이용건수전체이용금액시군구명
기준연월1.000-0.039-0.0080.010
전체이용건수-0.0391.0000.9520.044
전체이용금액-0.0080.9521.0000.039
시군구명0.0100.0440.0391.000

Missing values

2024-01-10T07:47:23.930434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:47:24.015227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월시군구명가맹점시군구명전체이용건수전체이용금액
76635202204당진시통영시1679573450
75320202202아산시창원시 마산합포구1477214360
4155201906논산시구리시6054853080
3759201905천안시 동남구남해군1694849300
31760202205보령시김해시1524638691
71323202109서산시창원시 마산회원구27345210
49695201906계룡시인제군1191997900
8862201912계룡시홍천군1673348004
99003201910예산군강서구1295164954544
14418202007금산군강북구50323190
기준연월시군구명가맹점시군구명전체이용건수전체이용금액
52898201909홍성군창원시 진해구1673257220
99578201911보령시밀양시32277560
41788202306공주시무주군744029540
48079201904계룡시노원구1462805115
43466202308논산시남해군251029170
48812201904홍성군강서구1627249995644
37209202212당진시강북구2594273860
77597202205서천군여주시14352890
82949202212부여군강남구5372524070010
84146202301태안군청주시 상당구27899710