Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory54.3 B

Variable types

Numeric4
Categorical2

Alerts

tco_btc_u_ct is highly overall correlated with tco_btc_u_amHigh correlation
tco_btc_u_am is highly overall correlated with tco_btc_u_ctHigh correlation
tco_btc_u_am has unique valuesUnique
agegrp_dc has 6 (6.0%) zerosZeros

Reproduction

Analysis started2023-12-11 22:34:06.298829
Analysis finished2023-12-11 22:34:08.207300
Duration1.91 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

crym
Real number (ℝ)

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202007.68
Minimum201901
Maximum202109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:08.257126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201901
5-th percentile201901.95
Q1201908.75
median202008
Q3202103
95-th percentile202108.05
Maximum202109
Range208
Interquartile range (IQR)194.25

Descriptive statistics

Standard deviation81.430751
Coefficient of variation (CV)0.00040310721
Kurtosis-1.4930435
Mean202007.68
Median Absolute Deviation (MAD)96
Skewness-0.051918613
Sum20200768
Variance6630.9673
MonotonicityNot monotonic
2023-12-12T07:34:08.357738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
202104 6
 
6.0%
202001 6
 
6.0%
202004 6
 
6.0%
202109 5
 
5.0%
201901 5
 
5.0%
201908 5
 
5.0%
202102 5
 
5.0%
202009 5
 
5.0%
202103 4
 
4.0%
201903 4
 
4.0%
Other values (21) 49
49.0%
ValueCountFrequency (%)
201901 5
5.0%
201902 2
 
2.0%
201903 4
4.0%
201904 3
3.0%
201905 1
 
1.0%
201906 4
4.0%
201907 1
 
1.0%
201908 5
5.0%
201909 2
 
2.0%
201910 3
3.0%
ValueCountFrequency (%)
202109 5
5.0%
202108 3
3.0%
202107 2
 
2.0%
202106 4
4.0%
202105 3
3.0%
202104 6
6.0%
202103 4
4.0%
202102 5
5.0%
202101 2
 
2.0%
202012 4
4.0%

tco_btc_nm
Categorical

Distinct6
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
양식
19 
한식
18 
패스트푸드점
16 
중식
16 
일식
16 

Length

Max length7
Median length2
Mean length3.39
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한식
2nd row한식
3rd row패스트푸드점
4th row중식
5th row패밀리레스토랑

Common Values

ValueCountFrequency (%)
양식 19
19.0%
한식 18
18.0%
패스트푸드점 16
16.0%
중식 16
16.0%
일식 16
16.0%
패밀리레스토랑 15
15.0%

Length

2023-12-12T07:34:08.480609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:34:08.563172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양식 19
19.0%
한식 18
18.0%
패스트푸드점 16
16.0%
중식 16
16.0%
일식 16
16.0%
패밀리레스토랑 15
15.0%

ma_fem_dc
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2
55 
1
45 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row1
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 55
55.0%
1 45
45.0%

Length

2023-12-12T07:34:08.653184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:34:08.719934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 55
55.0%
1 45
45.0%

agegrp_dc
Real number (ℝ)

ZEROS 

Distinct11
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.01
Minimum0
Maximum90
Zeros6
Zeros (%)6.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:08.783478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q120
median50
Q370
95-th percentile90
Maximum90
Range90
Interquartile range (IQR)50

Descriptive statistics

Standard deviation28.833375
Coefficient of variation (CV)0.6005702
Kurtosis-1.2077581
Mean48.01
Median Absolute Deviation (MAD)25
Skewness-0.0053748641
Sum4801
Variance831.36354
MonotonicityNot monotonic
2023-12-12T07:34:08.860091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
90 16
16.0%
20 12
12.0%
50 12
12.0%
30 11
11.0%
60 10
10.0%
70 9
9.0%
40 8
8.0%
10 8
8.0%
80 7
7.0%
0 6
 
6.0%
ValueCountFrequency (%)
0 6
6.0%
1 1
 
1.0%
10 8
8.0%
20 12
12.0%
30 11
11.0%
40 8
8.0%
50 12
12.0%
60 10
10.0%
70 9
9.0%
80 7
7.0%
ValueCountFrequency (%)
90 16
16.0%
80 7
7.0%
70 9
9.0%
60 10
10.0%
50 12
12.0%
40 8
8.0%
30 11
11.0%
20 12
12.0%
10 8
8.0%
1 1
 
1.0%

tco_btc_u_ct
Real number (ℝ)

HIGH CORRELATION 

Distinct94
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67453.52
Minimum1
Maximum1060537
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:08.956157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.95
Q1133
median10890.5
Q351715.5
95-th percentile291021.55
Maximum1060537
Range1060536
Interquartile range (IQR)51582.5

Descriptive statistics

Standard deviation179356.67
Coefficient of variation (CV)2.6589669
Kurtosis17.395881
Mean67453.52
Median Absolute Deviation (MAD)10871
Skewness4.1319748
Sum6745352
Variance3.2168817 × 1010
MonotonicityNot monotonic
2023-12-12T07:34:09.066788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 3
 
3.0%
8 2
 
2.0%
4 2
 
2.0%
1 2
 
2.0%
79 2
 
2.0%
246 1
 
1.0%
25905 1
 
1.0%
990 1
 
1.0%
63 1
 
1.0%
75928 1
 
1.0%
Other values (84) 84
84.0%
ValueCountFrequency (%)
1 2
2.0%
3 3
3.0%
4 2
2.0%
5 1
 
1.0%
6 1
 
1.0%
8 2
2.0%
9 1
 
1.0%
18 1
 
1.0%
21 1
 
1.0%
28 1
 
1.0%
ValueCountFrequency (%)
1060537 1
1.0%
941663 1
1.0%
750012 1
1.0%
652516 1
1.0%
601929 1
1.0%
274658 1
1.0%
166331 1
1.0%
155229 1
1.0%
127654 1
1.0%
123638 1
1.0%

tco_btc_u_am
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0653704 × 109
Minimum11300
Maximum3.5095694 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:09.176611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11300
5-th percentile87101.5
Q14422896.2
median2.8802816 × 108
Q31.1443574 × 109
95-th percentile1.0477941 × 1010
Maximum3.5095694 × 1010
Range3.5095683 × 1010
Interquartile range (IQR)1.1399345 × 109

Descriptive statistics

Standard deviation6.163912 × 109
Coefficient of variation (CV)2.9844099
Kurtosis18.152536
Mean2.0653704 × 109
Median Absolute Deviation (MAD)2.8766618 × 108
Skewness4.2640165
Sum2.0653704 × 1011
Variance3.7993811 × 1019
MonotonicityNot monotonic
2023-12-12T07:34:09.284303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10569700 1
 
1.0%
461281060 1
 
1.0%
32431865 1
 
1.0%
1710940 1
 
1.0%
2389736689 1
 
1.0%
1757686619 1
 
1.0%
388670 1
 
1.0%
2814924708 1
 
1.0%
62649448 1
 
1.0%
346393052 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
11300 1
1.0%
11900 1
1.0%
18300 1
1.0%
54700 1
1.0%
57300 1
1.0%
88670 1
1.0%
121000 1
1.0%
136000 1
1.0%
267510 1
1.0%
322000 1
1.0%
ValueCountFrequency (%)
35095694329 1
1.0%
34290593671 1
1.0%
25710094140 1
1.0%
21547254530 1
1.0%
20958438518 1
1.0%
9926335563 1
1.0%
5337501031 1
1.0%
3540284474 1
1.0%
2814924708 1
1.0%
2468140521 1
1.0%

Interactions

2023-12-12T07:34:07.681107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:06.924223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.187820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.437633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.877035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:06.986652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.249321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.496204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.936893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.044679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.308422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.553361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:08.001981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.111701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.368106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:07.614118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T07:34:09.371260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
crymtco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
crym1.0000.2250.1730.0000.0000.212
tco_btc_nm0.2251.0000.2960.1680.2270.139
ma_fem_dc0.1730.2961.0000.0000.1360.000
agegrp_dc0.0000.1680.0001.0000.3630.312
tco_btc_u_ct0.0000.2270.1360.3631.0000.973
tco_btc_u_am0.2120.1390.0000.3120.9731.000
2023-12-12T07:34:09.469271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ma_fem_dctco_btc_nm
ma_fem_dc1.0000.207
tco_btc_nm0.2071.000
2023-12-12T07:34:09.534056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
crymagegrp_dctco_btc_u_cttco_btc_u_amtco_btc_nmma_fem_dc
crym1.000-0.2260.0920.0880.1230.109
agegrp_dc-0.2261.000-0.199-0.1700.0810.000
tco_btc_u_ct0.092-0.1991.0000.9730.1240.096
tco_btc_u_am0.088-0.1700.9731.0000.0830.000
tco_btc_nm0.1230.0810.1240.0831.0000.207
ma_fem_dc0.1090.0000.0960.0000.2071.000

Missing values

2023-12-12T07:34:08.101270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T07:34:08.176184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

crymtco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
0202011한식29024610569700
1202107한식14075001225710094140
2201909패스트푸드점20318300
3202012중식22017873406857103
4201904패밀리레스토랑24021435616699189
5201906한식270639152333739304
6201906패밀리레스토랑2906267510
7202106패밀리레스토랑17058816440541
8202105패스트푸드점1301276541129891501
9202004패스트푸드점15079002733793500
crymtco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
90201903한식110166047893260
91202102패스트푸드점230105852977222517
92201907일식290381476800
93201901한식210196052190270
94202009중식2706534159002901
95202003일식250418741613375817
96202105일식16017227910181493
97201901패밀리레스토랑2801013313215
98202102양식22018377608221948
99201906양식250506781500078742