Overview

Dataset statistics

Number of variables6
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.0 KiB
Average record size in memory53.3 B

Variable types

Text1
Numeric5

Dataset

Description샘플 데이터
Author신한카드
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=318

Alerts

소액결제건수(MICRO_PYM) has 115 (23.0%) zerosZeros

Reproduction

Analysis started2024-04-20 14:17:35.368234
Analysis finished2024-04-20 14:17:41.546379
Duration6.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct63
Distinct (%)12.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2024-04-20T23:17:42.232199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters2500
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)1.4%

Sample

1st rowSS017
2nd rowSS013
3rd rowSS016
4th rowSS055
5th rowSS038
ValueCountFrequency (%)
ss016 23
 
4.6%
ss068 22
 
4.4%
ss066 19
 
3.8%
ss048 18
 
3.6%
ss055 17
 
3.4%
ss069 17
 
3.4%
ss030 16
 
3.2%
ss006 16
 
3.2%
ss008 15
 
3.0%
ss013 15
 
3.0%
Other values (53) 322
64.4%
2024-04-20T23:17:43.262180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
S 1000
40.0%
0 646
25.8%
6 162
 
6.5%
1 126
 
5.0%
4 126
 
5.0%
5 113
 
4.5%
3 95
 
3.8%
8 85
 
3.4%
2 61
 
2.4%
9 44
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1500
60.0%
Uppercase Letter 1000
40.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 646
43.1%
6 162
 
10.8%
1 126
 
8.4%
4 126
 
8.4%
5 113
 
7.5%
3 95
 
6.3%
8 85
 
5.7%
2 61
 
4.1%
9 44
 
2.9%
7 42
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
S 1000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1500
60.0%
Latin 1000
40.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 646
43.1%
6 162
 
10.8%
1 126
 
8.4%
4 126
 
8.4%
5 113
 
7.5%
3 95
 
6.3%
8 85
 
5.7%
2 61
 
4.1%
9 44
 
2.9%
7 42
 
2.8%
Latin
ValueCountFrequency (%)
S 1000
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2500
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S 1000
40.0%
0 646
25.8%
6 162
 
6.5%
1 126
 
5.0%
4 126
 
5.0%
5 113
 
4.5%
3 95
 
3.8%
8 85
 
3.4%
2 61
 
2.4%
9 44
 
1.8%

기준년월(YM)
Real number (ℝ)

Distinct67
Distinct (%)13.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201840.73
Minimum201601
Maximum202107
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2024-04-20T23:17:43.647063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201601
5-th percentile201604
Q1201707
median201811.5
Q3202002
95-th percentile202104
Maximum202107
Range506
Interquartile range (IQR)295

Descriptive statistics

Standard deviation159.91152
Coefficient of variation (CV)0.00079226586
Kurtosis-1.113749
Mean201840.73
Median Absolute Deviation (MAD)106
Skewness0.039627492
Sum1.0092036 × 108
Variance25571.693
MonotonicityNot monotonic
2024-04-20T23:17:43.995990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202101 15
 
3.0%
201812 13
 
2.6%
201902 13
 
2.6%
201708 12
 
2.4%
201903 12
 
2.4%
201805 12
 
2.4%
201707 12
 
2.4%
202009 12
 
2.4%
201603 11
 
2.2%
201901 11
 
2.2%
Other values (57) 377
75.4%
ValueCountFrequency (%)
201601 7
1.4%
201602 5
1.0%
201603 11
2.2%
201604 6
1.2%
201605 6
1.2%
201606 8
1.6%
201607 5
1.0%
201608 10
2.0%
201609 5
1.0%
201610 8
1.6%
ValueCountFrequency (%)
202107 10
2.0%
202106 4
 
0.8%
202105 9
1.8%
202104 6
 
1.2%
202103 8
1.6%
202102 3
 
0.6%
202101 15
3.0%
202012 9
1.8%
202011 6
 
1.2%
202010 8
1.6%

시간대구간(TIME)
Real number (ℝ)

Distinct6
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.704
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2024-04-20T23:17:44.401650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median4
Q35
95-th percentile6
Maximum6
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4284484
Coefficient of variation (CV)0.38565023
Kurtosis-0.8720676
Mean3.704
Median Absolute Deviation (MAD)1
Skewness-0.11519838
Sum1852
Variance2.0404649
MonotonicityNot monotonic
2024-04-20T23:17:44.721297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
4 118
23.6%
3 109
21.8%
5 103
20.6%
2 78
15.6%
6 58
11.6%
1 34
 
6.8%
ValueCountFrequency (%)
1 34
 
6.8%
2 78
15.6%
3 109
21.8%
4 118
23.6%
5 103
20.6%
6 58
11.6%
ValueCountFrequency (%)
6 58
11.6%
5 103
20.6%
4 118
23.6%
3 109
21.8%
2 78
15.6%
1 34
 
6.8%
Distinct496
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean205688.84
Minimum8529
Maximum502813
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2024-04-20T23:17:45.116961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8529
5-th percentile12907.1
Q1148305
median217353
Q3278712.75
95-th percentile416782.75
Maximum502813
Range494284
Interquartile range (IQR)130407.75

Descriptive statistics

Standard deviation129936.98
Coefficient of variation (CV)0.63171623
Kurtosis-0.78792106
Mean205688.84
Median Absolute Deviation (MAD)64891
Skewness0.035625156
Sum1.0284442 × 108
Variance1.6883619 × 1010
MonotonicityNot monotonic
2024-04-20T23:17:45.537874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
210343 2
 
0.4%
219289 2
 
0.4%
210784 2
 
0.4%
413323 2
 
0.4%
230873 1
 
0.2%
224245 1
 
0.2%
24568 1
 
0.2%
27268 1
 
0.2%
22652 1
 
0.2%
11063 1
 
0.2%
Other values (486) 486
97.2%
ValueCountFrequency (%)
8529 1
0.2%
8573 1
0.2%
8651 1
0.2%
9105 1
0.2%
9152 1
0.2%
9336 1
0.2%
9467 1
0.2%
9556 1
0.2%
9623 1
0.2%
9760 1
0.2%
ValueCountFrequency (%)
502813 1
0.2%
502754 1
0.2%
500680 1
0.2%
499436 1
0.2%
499426 1
0.2%
422985 1
0.2%
422018 1
0.2%
421912 1
0.2%
421697 1
0.2%
421579 1
0.2%
Distinct55
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41.166
Minimum5
Maximum573
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2024-04-20T23:17:45.938978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile5
Q15
median15
Q335
95-th percentile191.25
Maximum573
Range568
Interquartile range (IQR)30

Descriptive statistics

Standard deviation72.566896
Coefficient of variation (CV)1.7627871
Kurtosis15.655285
Mean41.166
Median Absolute Deviation (MAD)10
Skewness3.561874
Sum20583
Variance5265.9544
MonotonicityNot monotonic
2024-04-20T23:17:46.413516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 171
34.2%
10 77
15.4%
15 45
 
9.0%
30 28
 
5.6%
25 24
 
4.8%
20 18
 
3.6%
35 13
 
2.6%
40 10
 
2.0%
55 10
 
2.0%
45 8
 
1.6%
Other values (45) 96
19.2%
ValueCountFrequency (%)
5 171
34.2%
10 77
15.4%
15 45
 
9.0%
20 18
 
3.6%
25 24
 
4.8%
30 28
 
5.6%
35 13
 
2.6%
40 10
 
2.0%
45 8
 
1.6%
50 4
 
0.8%
ValueCountFrequency (%)
573 1
0.2%
538 1
0.2%
402 1
0.2%
397 1
0.2%
392 1
0.2%
372 1
0.2%
362 1
0.2%
322 1
0.2%
302 2
0.4%
287 1
0.2%

소액결제건수(MICRO_PYM)
Real number (ℝ)

ZEROS 

Distinct40
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.34
Minimum0
Maximum483
Zeros115
Zeros (%)23.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2024-04-20T23:17:46.725631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median10
Q325
95-th percentile96
Maximum483
Range483
Interquartile range (IQR)20

Descriptive statistics

Standard deviation52.024162
Coefficient of variation (CV)2.0530451
Kurtosis27.991625
Mean25.34
Median Absolute Deviation (MAD)10
Skewness4.7337291
Sum12670
Variance2706.5134
MonotonicityNot monotonic
2024-04-20T23:17:47.024033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
5 125
25.0%
0 115
23.0%
10 57
11.4%
15 33
 
6.6%
20 32
 
6.4%
25 25
 
5.0%
30 18
 
3.6%
35 15
 
3.0%
70 9
 
1.8%
55 9
 
1.8%
Other values (30) 62
12.4%
ValueCountFrequency (%)
0 115
23.0%
5 125
25.0%
10 57
11.4%
15 33
 
6.6%
20 32
 
6.4%
25 25
 
5.0%
30 18
 
3.6%
35 15
 
3.0%
40 8
 
1.6%
45 7
 
1.4%
ValueCountFrequency (%)
483 1
0.2%
402 1
0.2%
362 1
0.2%
357 1
0.2%
307 1
0.2%
292 1
0.2%
257 1
0.2%
226 1
0.2%
221 1
0.2%
211 1
0.2%

Interactions

2024-04-20T23:17:40.359745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:35.708703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.696836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:37.822137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:39.084224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:40.540098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.007993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.869820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:38.079010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:39.343770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:40.693921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.208394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:37.089411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:38.332055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:39.606555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:40.875272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.375412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:37.342744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:38.584566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:39.872072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:41.037894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:36.536863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:37.574081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:38.825620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T23:17:40.202304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-20T23:17:47.195275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울시민업종코드(UPJONG_CD)기준년월(YM)시간대구간(TIME)고객주소블록코드(BLOCK_CD)카드이용금액계(AMT_CORR)소액결제건수(MICRO_PYM)
서울시민업종코드(UPJONG_CD)1.0000.0000.0000.2110.2960.478
기준년월(YM)0.0001.0000.0280.0320.0000.248
시간대구간(TIME)0.0000.0281.0000.0000.1360.000
고객주소블록코드(BLOCK_CD)0.2110.0320.0001.0000.1610.137
카드이용금액계(AMT_CORR)0.2960.0000.1360.1611.0000.000
소액결제건수(MICRO_PYM)0.4780.2480.0000.1370.0001.000
2024-04-20T23:17:47.387464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년월(YM)시간대구간(TIME)고객주소블록코드(BLOCK_CD)카드이용금액계(AMT_CORR)소액결제건수(MICRO_PYM)
기준년월(YM)1.0000.008-0.0440.018-0.035
시간대구간(TIME)0.0081.0000.0350.003-0.052
고객주소블록코드(BLOCK_CD)-0.0440.0351.000-0.040-0.046
카드이용금액계(AMT_CORR)0.0180.003-0.0401.000-0.078
소액결제건수(MICRO_PYM)-0.035-0.052-0.046-0.0781.000

Missing values

2024-04-20T23:17:41.240131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-20T23:17:41.438745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

서울시민업종코드(UPJONG_CD)기준년월(YM)시간대구간(TIME)고객주소블록코드(BLOCK_CD)카드이용금액계(AMT_CORR)소액결제건수(MICRO_PYM)
0SS01720200661106325725
1SS01320160862161378610
2SS01620180232248193515
3SS055201906219869100
4SS038201612528382105
5SS04320160613629032035
6SS0542021016175211025
7SS0542020112152539755
8SS00620160821572281010
9SS008202102616708155
서울시민업종코드(UPJONG_CD)기준년월(YM)시간대구간(TIME)고객주소블록코드(BLOCK_CD)카드이용금액계(AMT_CORR)소액결제건수(MICRO_PYM)
490SS06820180613647745565
491SS00320170932204455
492SS015201701115279510
493SS0442020095203781045
494SS0032016024210795510
495SS017201911536673555
496SS01620180321557501030
497SS081202103336642655
498SS06920170732149357010
499SS0692019094225316305