Overview

Dataset statistics

Number of variables: 6
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.3 KiB
Average record size in memory: 54.3 B

Variable types

Numeric: 4
Categorical: 2

Alerts

High correlation: tco_btc_u_ct is highly overall correlated with tco_btc_u_am
High correlation: tco_btc_u_am is highly overall correlated with tco_btc_u_ct
Unique: tco_btc_u_am has unique values
Zeros: agegrp_dc has 7 (7.0%) zeros

Reproduction

Analysis started: 2023-12-11 22:42:57.559137
Analysis finished: 2023-12-11 22:43:00.670891
Duration: 3.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
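
A report like this one could be regenerated along the following lines. This is a minimal sketch, not the configuration that produced the report above: the function name `build_report`, the input path and the report title are placeholders, and the actual settings are in the config.json listed above. It assumes pandas and ydata-profiling are installed.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write the result as an HTML report.

    csv_path and out_path are hypothetical; the source data file is not
    named anywhere in this report.
    """
    # Imported inside the function so that merely defining it does not
    # require the third-party packages to be present.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings are used here; the original run may have differed.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(out_path)
```

Calling `build_report("data.csv")` would write `report.html` next to the script, where `data.csv` stands in for the unknown source file.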

Variables

crym
Real number (ℝ)

Distinct: 30
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 202007.24
Minimum: 201901
Maximum: 202109
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 201901
5-th percentile: 201904
Q1: 201911
Median: 202007
Q3: 202103
95-th percentile: 202107.05
Maximum: 202109
Range: 208
Interquartile range (IQR): 192
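
The range (208) and IQR (192) above are differences between raw integers. If crym encodes a year and month as YYYYMM, which is an assumption based on the observed values and is not stated in the report, those differences are not month counts. The sketch below converts such values to a running month index first; the helper names `month_index` and `months_between` are made up for this illustration.

```python
def month_index(yyyymm: int) -> int:
    """Map a YYYYMM integer to a running month count (year * 12 + month - 1)."""
    year, month = divmod(yyyymm, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM value: {yyyymm}")
    return year * 12 + (month - 1)


def months_between(start: int, end: int) -> int:
    """Number of months from start to end, both given as YYYYMM integers."""
    return month_index(end) - month_index(start)


span_months = months_between(201901, 202109)  # minimum to maximum -> 32
iqr_months = months_between(201911, 202103)   # Q1 to Q3 -> 16
print(span_months, iqr_months)
```

Under that assumption the data spans 33 calendar months (201901 through 202109 inclusive), and the 30 distinct values cover 30 of them.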

Descriptive statistics

Standard deviation: 81.358059
Coefficient of variation (CV): 0.00040274823
Kurtosis: -1.5188779
Mean: 202007.24
Median Absolute Deviation (MAD): 96
Skewness: -0.018637234
Sum: 20200724
Variance: 6619.1337
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=30)

Common values

Value              Count  Frequency (%)
202105             8      8.0%
202002             6      6.0%
202103             6      6.0%
202107             6      6.0%
202003             6      6.0%
201912             6      6.0%
202008             5      5.0%
201905             5      5.0%
201911             4      4.0%
202106             4      4.0%
Other values (20)  44     44.0%

Minimum 10 values

Value   Count  Frequency (%)
201901  1      1.0%
201902  2      2.0%
201903  1      1.0%
201904  4      4.0%
201905  5      5.0%
201906  2      2.0%
201907  1      1.0%
201908  4      4.0%
201910  3      3.0%
201911  4      4.0%

Maximum 10 values

Value   Count  Frequency (%)
202109  3      3.0%
202108  2      2.0%
202107  6      6.0%
202106  4      4.0%
202105  8      8.0%
202103  6      6.0%
202102  2      2.0%
202101  3      3.0%
202012  2      2.0%
202011  2      2.0%

tco_btc_nm
Categorical

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
백화점 (department store): 28
편의점 (convenience store): 25
할인점 (discount store): 24
슈퍼마켓 (supermarket): 23

Length

Max length: 4
Median length: 3
Mean length: 3.23
Min length: 3
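
The length statistics follow directly from the four category counts reported for tco_btc_nm. The sketch below recomputes them as a cross-check. It assumes length means the number of Unicode code points, with each Hangul syllable stored as one precomposed character (NFC); the report does not spell out its length definition.

```python
import unicodedata
from statistics import mean, median

# Category counts as reported for tco_btc_nm.
counts = {"백화점": 28, "편의점": 25, "할인점": 24, "슈퍼마켓": 23}

lengths = []
for value, n in counts.items():
    # Normalise to NFC so each syllable counts as a single code point.
    length = len(unicodedata.normalize("NFC", value))
    lengths.extend([length] * n)

mean_length = mean(lengths)      # (77 * 3 + 23 * 4) / 100
median_length = median(lengths)
print(min(lengths), median_length, mean_length, max(lengths))
```

Three of the four labels have three characters and one has four, so the mean is (77 × 3 + 23 × 4) / 100 = 3.23, matching the figure above.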

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 백화점
2nd row: 슈퍼마켓
3rd row: 슈퍼마켓
4th row: 슈퍼마켓
5th row: 편의점

Common Values

Value     Count  Frequency (%)
백화점    28     28.0%
편의점    25     25.0%
할인점    24     24.0%
슈퍼마켓  23     23.0%

Length

Histogram of lengths of the category


ma_fem_dc
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
2: 53
1: 47

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 2
3rd row: 2
4th row: 2
5th row: 1

Common Values

Value  Count  Frequency (%)
2      53     53.0%
1      47     47.0%

Length

Histogram of lengths of the category


agegrp_dc
Real number (ℝ)

ZEROS 

Distinct: 10
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46.5
Minimum: 0
Maximum: 90
Zeros: 7
Zeros (%): 7.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 20
Median: 40
Q3: 70
95-th percentile: 90
Maximum: 90
Range: 90
Interquartile range (IQR): 50

Descriptive statistics

Standard deviation: 29.176405
Coefficient of variation (CV): 0.62744958
Kurtosis: -1.2761346
Mean: 46.5
Median Absolute Deviation (MAD): 30
Skewness: 0.045722128
Sum: 4650
Variance: 851.26263
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=10)

Common values

Value  Count  Frequency (%)
40     13     13.0%
90     13     13.0%
10     12     12.0%
30     11     11.0%
80     11     11.0%
20     9      9.0%
70     9      9.0%
50     8      8.0%
0      7      7.0%
60     7      7.0%

Minimum 10 values

Value  Count  Frequency (%)
0      7      7.0%
10     12     12.0%
20     9      9.0%
30     11     11.0%
40     13     13.0%
50     8      8.0%
60     7      7.0%
70     9      9.0%
80     11     11.0%
90     13     13.0%

Maximum 10 values

Value  Count  Frequency (%)
90     13     13.0%
80     11     11.0%
70     9      9.0%
60     7      7.0%
50     8      8.0%
40     13     13.0%
30     11     11.0%
20     9      9.0%
10     12     12.0%
0      7      7.0%
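
Because agegrp_dc takes only ten distinct values, its descriptive statistics can be re-derived from the value counts alone. The sketch below does so with the Python standard library as a consistency check. It assumes the sample (n − 1) definition of variance, which is what `statistics.variance` computes; the report does not state which definition it uses.

```python
from statistics import mean, stdev, variance

# Value counts for agegrp_dc as listed in the report.
counts = {0: 7, 10: 12, 20: 9, 30: 11, 40: 13,
          50: 8, 60: 7, 70: 9, 80: 11, 90: 13}

# Expand the counts back into the 100 individual observations.
values = [value for value, n in counts.items() for _ in range(n)]

total = sum(values)
avg = mean(values)
var = variance(values)   # sample variance, divides by n - 1
sd = stdev(values)       # square root of the sample variance
cv = sd / avg            # coefficient of variation

print(total, avg, round(var, 5), round(sd, 6), round(cv, 8))
```

The recomputed values agree with the reported Sum (4650), Mean (46.5), Variance (851.26263), Standard deviation (29.176405) and CV (0.62744958), which suggests the report's variance is the n − 1 form.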

tco_btc_u_ct
Real number (ℝ)

HIGH CORRELATION 

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 360648.88
Minimum: 2
Maximum: 1884227
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 15.6
Q1: 851
Median: 75142
Q3: 722234.5
95-th percentile: 1342931
Maximum: 1884227
Range: 1884225
Interquartile range (IQR): 721383.5

Descriptive statistics

Standard deviation: 486558.64
Coefficient of variation (CV): 1.3491201
Kurtosis: 0.68031441
Mean: 360648.88
Median Absolute Deviation (MAD): 75057
Skewness: 1.2821826
Sum: 36064888
Variance: 2.3673931 × 10^11
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
7                  2      2.0%
36341              1      1.0%
93700              1      1.0%
819043             1      1.0%
262888             1      1.0%
172                1      1.0%
126211             1      1.0%
892884             1      1.0%
304                1      1.0%
351                1      1.0%
Other values (89)  89     89.0%

Minimum 10 values

Value  Count  Frequency (%)
2      1      1.0%
3      1      1.0%
7      2      2.0%
8      1      1.0%
16     1      1.0%
20     1      1.0%
150    1      1.0%
172    1      1.0%
186    1      1.0%
227    1      1.0%

Maximum 10 values

Value    Count  Frequency (%)
1884227  1      1.0%
1830950  1      1.0%
1509703  1      1.0%
1439825  1      1.0%
1425087  1      1.0%
1338607  1      1.0%
1233206  1      1.0%
1195632  1      1.0%
1172701  1      1.0%
1168581  1      1.0%

tco_btc_u_am
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2924248 × 10^10
Minimum: 64440
Maximum: 1.1167907 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 64440
5-th percentile: 316966.5
Q1: 29791869
Median: 3.0131374 × 10^9
Q3: 1.4503365 × 10^10
95-th percentile: 5.1153544 × 10^10
Maximum: 1.1167907 × 10^11
Range: 1.11679 × 10^11
Interquartile range (IQR): 1.4473573 × 10^10

Descriptive statistics

Standard deviation: 2.2287654 × 10^10
Coefficient of variation (CV): 1.7244837
Kurtosis: 7.0715301
Mean: 1.2924248 × 10^10
Median Absolute Deviation (MAD): 3.0108511 × 10^9
Skewness: 2.5870553
Sum: 1.2924248 × 10^12
Variance: 4.9673952 × 10^20
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
6281147892         1      1.0%
317680             1      1.0%
69380789624        1      1.0%
33293852352        1      1.0%
13416930           1      1.0%
13379343512        1      1.0%
6675718140         1      1.0%
17241830           1      1.0%
3281340            1      1.0%
1607607360         1      1.0%
Other values (90)  90     90.0%

Minimum 10 values

Value    Count  Frequency (%)
64440    1      1.0%
133840   1      1.0%
167350   1      1.0%
176980   1      1.0%
303410   1      1.0%
317680   1      1.0%
399360   1      1.0%
1080430  1      1.0%
2190940  1      1.0%
2381640  1      1.0%

Maximum 10 values

Value         Count  Frequency (%)
111679066260  1      1.0%
96783919915   1      1.0%
94230612454   1      1.0%
84731099846   1      1.0%
69380789624   1      1.0%
50194215215   1      1.0%
46650621371   1      1.0%
45116007528   1      1.0%
43280162754   1      1.0%
42741951411   1      1.0%

Interactions

2023-12-12T07:43:00.084219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:58.962502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.372803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.691678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:43:00.164026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.105449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.441447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.791606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:43:00.238370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.204299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.505375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.886476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:43:00.365022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.301617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.599713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:42:59.990564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

              crym    tco_btc_nm  ma_fem_dc  agegrp_dc  tco_btc_u_ct  tco_btc_u_am
crym          1.000   0.000       0.000      0.265      0.304         0.000
tco_btc_nm    0.000   1.000       0.000      0.125      0.000         0.319
ma_fem_dc     0.000   0.000       1.000      0.000      0.299         0.244
agegrp_dc     0.265   0.125       0.000      1.000      0.758         0.540
tco_btc_u_ct  0.304   0.000       0.299      0.758      1.000         0.727
tco_btc_u_am  0.000   0.319       0.244      0.540      0.727         1.000

              tco_btc_nm  ma_fem_dc
tco_btc_nm    1.000       0.000
ma_fem_dc     0.000       1.000

              crym    agegrp_dc  tco_btc_u_ct  tco_btc_u_am  tco_btc_nm  ma_fem_dc
crym          1.000   0.134      -0.076        -0.054        0.000       0.000
agegrp_dc     0.134   1.000      -0.013        0.039         0.066       0.000
tco_btc_u_ct  -0.076  -0.013     1.000         0.916         0.000       0.218
tco_btc_u_am  -0.054  0.039      0.916         1.000         0.201       0.236
tco_btc_nm    0.000   0.066      0.000         0.201         1.000       0.000
ma_fem_dc     0.000   0.000      0.218         0.236         0.000       1.000
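
The High correlation alert pairs tco_btc_u_ct with tco_btc_u_am. As a rough, independent check, the sketch below computes a Spearman rank correlation on the 20 rows shown in the Sample section. Several things here are assumptions: the report does not name the coefficient behind each matrix, so Spearman is chosen only for illustration; the (count, amount) pairs are a reading of the flattened sample rows; and the simple ranking helper assumes no tied values, which holds for these rows.

```python
# (tco_btc_u_ct, tco_btc_u_am) pairs read from the 20 sample rows.
pairs = [
    (36341, 6281147892), (20, 399360), (1091894, 17577063590),
    (718964, 13244677283), (1509703, 12172435983), (128408, 14676678107),
    (860332, 32532709185), (4771, 39100330), (40799, 328412874),
    (881914, 5524067099), (1830950, 36695974361), (301, 13650710),
    (227, 2190940), (524063, 3413521507), (420424, 50194215215),
    (15698, 596912799), (787012, 17882074170), (395541, 16461173760),
    (5321, 46377120), (1168581, 42741951411),
]


def ranks(xs):
    """Return 1-based ranks of xs; valid only when xs has no ties."""
    order = sorted(range(len(xs)), key=xs.__getitem__)
    result = [0] * len(xs)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(xs, ys):
    """Spearman's rho via the no-ties formula 1 - 6 * sum(d^2) / (n (n^2 - 1))."""
    n = len(xs)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


counts = [ct for ct, _ in pairs]
amounts = [am for _, am in pairs]
rho = spearman(counts, amounts)
print(round(rho, 3))
```

With only 20 rows the result will not match the matrix entries, which are computed over all 100 observations; it only shows the same strong positive association between transaction count and amount.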

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    crym    tco_btc_nm  ma_fem_dc  agegrp_dc  tco_btc_u_ct  tco_btc_u_am
0   202109  백화점      1          20         36341         6281147892
1   201910  슈퍼마켓    2          0          20            399360
2   201911  슈퍼마켓    2          30         1091894       17577063590
3   202109  슈퍼마켓    2          30         718964        13244677283
4   202011  편의점      1          40         1509703       12172435983
5   202106  백화점      1          60         128408        14676678107
6   202106  할인점      2          50         860332        32532709185
7   202103  편의점      1          80         4771          39100330
8   202007  편의점      2          70         40799         328412874
9   201911  편의점      2          50         881914        5524067099

Last rows

    crym    tco_btc_nm  ma_fem_dc  agegrp_dc  tco_btc_u_ct  tco_btc_u_am
90  202105  슈퍼마켓    2          40         1830950       36695974361
91  202105  할인점      2          10         301           13650710
92  202106  편의점      2          90         227           2190940
93  202011  편의점      1          20         524063        3413521507
94  202103  백화점      2          60         420424        50194215215
95  202107  할인점      2          80         15698         596912799
96  202102  슈퍼마켓    2          60         787012        17882074170
97  201904  할인점      1          50         395541        16461173760
98  202012  편의점      2          80         5321          46377120
99  201906  할인점      2          40         1168581       42741951411