Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory55.3 B

Variable types

Numeric6

Alerts

book_mastr_seq_no is highly overall correlated with mxmm_lon_halflife_co and 2 other fieldsHigh correlation
mumm_lon_halflife_co is highly overall correlated with mxmm_lon_halflife_co and 1 other fieldsHigh correlation
mxmm_lon_halflife_co is highly overall correlated with book_mastr_seq_no and 2 other fieldsHigh correlation
avrg_lon_halflife_co is highly overall correlated with book_mastr_seq_no and 2 other fieldsHigh correlation
book_co is highly overall correlated with book_mastr_seq_noHigh correlation
book_mastr_seq_no has unique valuesUnique
isbn_no has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:06:09.726743
Analysis finished2023-12-10 10:06:17.312847
Duration7.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

book_mastr_seq_no
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4363870.7
Minimum116165
Maximum5606556
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:17.412889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum116165
5-th percentile116181.75
Q15577573.8
median5577868.5
Q35578026.8
95-th percentile5606524
Maximum5606556
Range5490391
Interquartile range (IQR)453

Descriptive statistics

Standard deviation2262191.4
Coefficient of variation (CV)0.51839102
Kurtosis-0.18600837
Mean4363870.7
Median Absolute Deviation (MAD)292.5
Skewness-1.3363938
Sum4.3638707 × 108
Variance5.1175099 × 1012
MonotonicityNot monotonic
2023-12-10T19:06:17.603599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5577578 1
 
1.0%
5577958 1
 
1.0%
5577998 1
 
1.0%
5606521 1
 
1.0%
5577997 1
 
1.0%
116198 1
 
1.0%
5577978 1
 
1.0%
5606518 1
 
1.0%
5577965 1
 
1.0%
116197 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
116165 1
1.0%
116168 1
1.0%
116171 1
1.0%
116175 1
1.0%
116177 1
1.0%
116182 1
1.0%
116186 1
1.0%
116188 1
1.0%
116190 1
1.0%
116192 1
1.0%
ValueCountFrequency (%)
5606556 1
1.0%
5606530 1
1.0%
5606529 1
1.0%
5606528 1
1.0%
5606525 1
1.0%
5606524 1
1.0%
5606521 1
1.0%
5606518 1
1.0%
5606509 1
1.0%
5606505 1
1.0%

isbn_no
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.2915642 × 1012
Minimum6.1953386 × 1010
Maximum9.7911962 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:17.849623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6.1953386 × 1010
5-th percentile8.5730973 × 1012
Q19.7804396 × 1012
median9.7883868 × 1012
Q39.7889755 × 1012
95-th percentile9.7897664 × 1012
Maximum9.7911962 × 1012
Range9.7292428 × 1012
Interquartile range (IQR)8.5358407 × 109

Descriptive statistics

Standard deviation1.9525914 × 1012
Coefficient of variation (CV)0.21014668
Kurtosis15.740984
Mean9.2915642 × 1012
Median Absolute Deviation (MAD)2.7767265 × 109
Skewness-4.1463503
Sum9.2915642 × 1014
Variance3.8126131 × 1024
MonotonicityNot monotonic
2023-12-10T19:06:18.117544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9788452147047 1
 
1.0%
9780876283059 1
 
1.0%
9780439645645 1
 
1.0%
9791130803315 1
 
1.0%
9780439645621 1
 
1.0%
9788975486074 1
 
1.0%
8992372078933 1
 
1.0%
9781195054572 1
 
1.0%
9781570827792 1
 
1.0%
9788975486265 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
61953385978 1
1.0%
805086838978 1
1.0%
1157522181979 1
1.0%
1159960712979 1
1.0%
1188660004979 1
1.0%
8961751891978 1
1.0%
8963642410978 1
1.0%
8975550753040 1
1.0%
8990982545978 1
1.0%
8990982715978 1
1.0%
ValueCountFrequency (%)
9791196214773 1
1.0%
9791130803315 1
1.0%
9789766439705 1
1.0%
9789766430382 1
1.0%
9789766430375 1
1.0%
9789766430368 1
1.0%
9789766422714 1
1.0%
9788997688858 1
1.0%
9788995763643 1
1.0%
9788991623344 1
1.0%

mumm_lon_halflife_co
Real number (ℝ)

HIGH CORRELATION 

Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3393.26
Minimum469
Maximum7117
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:18.340219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum469
5-th percentile565.8
Q11528
median3607.5
Q35142.5
95-th percentile5946.35
Maximum7117
Range6648
Interquartile range (IQR)3614.5

Descriptive statistics

Standard deviation1960.2068
Coefficient of variation (CV)0.57767657
Kurtosis-1.3816659
Mean3393.26
Median Absolute Deviation (MAD)1757.5
Skewness-0.033861372
Sum339326
Variance3842410.7
MonotonicityNot monotonic
2023-12-10T19:06:18.585530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2230 3
 
3.0%
562 2
 
2.0%
5114 2
 
2.0%
4872 1
 
1.0%
3736 1
 
1.0%
1661 1
 
1.0%
2627 1
 
1.0%
1051 1
 
1.0%
3853 1
 
1.0%
1225 1
 
1.0%
Other values (86) 86
86.0%
ValueCountFrequency (%)
469 1
1.0%
534 1
1.0%
558 1
1.0%
562 2
2.0%
566 1
1.0%
587 1
1.0%
641 1
1.0%
644 1
1.0%
677 1
1.0%
684 1
1.0%
ValueCountFrequency (%)
7117 1
1.0%
6995 1
1.0%
6383 1
1.0%
6219 1
1.0%
6086 1
1.0%
5939 1
1.0%
5938 1
1.0%
5930 1
1.0%
5915 1
1.0%
5866 1
1.0%

mxmm_lon_halflife_co
Real number (ℝ)

HIGH CORRELATION 

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4007.76
Minimum469
Maximum7117
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:18.861644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum469
5-th percentile638.3
Q12151.75
median4564
Q35672
95-th percentile6598.55
Maximum7117
Range6648
Interquartile range (IQR)3520.25

Descriptive statistics

Standard deviation2029.3767
Coefficient of variation (CV)0.50636183
Kurtosis-1.1872022
Mean4007.76
Median Absolute Deviation (MAD)1382
Skewness-0.45840965
Sum400776
Variance4118369.8
MonotonicityNot monotonic
2023-12-10T19:06:19.126614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5114 2
 
2.0%
2627 2
 
2.0%
5176 1
 
1.0%
1661 1
 
1.0%
5681 1
 
1.0%
3853 1
 
1.0%
1225 1
 
1.0%
5930 1
 
1.0%
2830 1
 
1.0%
4468 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
469 1
1.0%
558 1
1.0%
562 1
1.0%
566 1
1.0%
587 1
1.0%
641 1
1.0%
644 1
1.0%
677 1
1.0%
684 1
1.0%
693 1
1.0%
ValueCountFrequency (%)
7117 1
1.0%
6995 1
1.0%
6872 1
1.0%
6657 1
1.0%
6609 1
1.0%
6598 1
1.0%
6483 1
1.0%
6326 1
1.0%
6296 1
1.0%
6219 1
1.0%

avrg_lon_halflife_co
Real number (ℝ)

HIGH CORRELATION 

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3749.96
Minimum469
Maximum7117
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:19.461394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum469
5-th percentile638.3
Q12151.75
median4197.5
Q35175.5
95-th percentile6087
Maximum7117
Range6648
Interquartile range (IQR)3023.75

Descriptive statistics

Standard deviation1872.7373
Coefficient of variation (CV)0.49940195
Kurtosis-1.1011363
Mean3749.96
Median Absolute Deviation (MAD)1369
Skewness-0.39988905
Sum374996
Variance3507145.1
MonotonicityNot monotonic
2023-12-10T19:06:20.053778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5114 2
 
2.0%
5059 2
 
2.0%
4528 1
 
1.0%
2230 1
 
1.0%
1661 1
 
1.0%
2627 1
 
1.0%
2803 1
 
1.0%
3853 1
 
1.0%
1225 1
 
1.0%
5930 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
469 1
1.0%
558 1
1.0%
562 1
1.0%
566 1
1.0%
587 1
1.0%
641 1
1.0%
644 1
1.0%
677 1
1.0%
684 1
1.0%
693 1
1.0%
ValueCountFrequency (%)
7117 1
1.0%
6995 1
1.0%
6496 1
1.0%
6219 1
1.0%
6106 1
1.0%
6086 1
1.0%
5960 1
1.0%
5939 1
1.0%
5938 1
1.0%
5930 1
1.0%

book_co
Real number (ℝ)

HIGH CORRELATION 

Distinct17
Distinct (%)17.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.66
Minimum1
Maximum69
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:06:20.252800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile25.3
Maximum69
Range68
Interquartile range (IQR)1

Descriptive statistics

Standard deviation10.912156
Coefficient of variation (CV)2.3416644
Kurtosis17.121776
Mean4.66
Median Absolute Deviation (MAD)0
Skewness3.9970962
Sum466
Variance119.07515
MonotonicityNot monotonic
2023-12-10T19:06:20.458927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
1 66
66.0%
2 15
 
15.0%
3 3
 
3.0%
4 2
 
2.0%
11 2
 
2.0%
52 1
 
1.0%
5 1
 
1.0%
17 1
 
1.0%
8 1
 
1.0%
37 1
 
1.0%
Other values (7) 7
 
7.0%
ValueCountFrequency (%)
1 66
66.0%
2 15
 
15.0%
3 3
 
3.0%
4 2
 
2.0%
5 1
 
1.0%
8 1
 
1.0%
9 1
 
1.0%
10 1
 
1.0%
11 2
 
2.0%
17 1
 
1.0%
ValueCountFrequency (%)
69 1
1.0%
52 1
1.0%
45 1
1.0%
37 1
1.0%
31 1
1.0%
25 1
1.0%
23 1
1.0%
17 1
1.0%
11 2
2.0%
10 1
1.0%

Interactions

2023-12-10T19:06:16.100238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:10.127398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:11.751337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:12.790762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.790634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.702971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:16.282702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:10.446672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:11.910576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.014363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.930094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.886348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:16.433568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:10.608273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:12.063447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.151759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.068734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:15.038860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:16.595889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:10.824264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:12.256691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.307233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.235416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:15.252341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:16.738261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:11.370878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:12.451097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.456344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.372542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:15.618911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:16.889879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:11.573080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:12.610893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:13.636273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:14.538406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:06:15.864972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:06:20.618183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
book_mastr_seq_noisbn_nomumm_lon_halflife_comxmm_lon_halflife_coavrg_lon_halflife_cobook_co
book_mastr_seq_no1.0000.0000.6060.4850.5660.808
isbn_no0.0001.0000.0000.0750.0750.000
mumm_lon_halflife_co0.6060.0001.0000.9790.9860.000
mxmm_lon_halflife_co0.4850.0750.9791.0000.9870.284
avrg_lon_halflife_co0.5660.0750.9860.9871.0000.000
book_co0.8080.0000.0000.2840.0001.000
2023-12-10T19:06:20.804756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
book_mastr_seq_noisbn_nomumm_lon_halflife_comxmm_lon_halflife_coavrg_lon_halflife_cobook_co
book_mastr_seq_no1.000-0.404-0.368-0.746-0.588-0.501
isbn_no-0.4041.0000.0650.3340.2060.265
mumm_lon_halflife_co-0.3680.0651.0000.6680.912-0.215
mxmm_lon_halflife_co-0.7460.3340.6681.0000.8970.367
avrg_lon_halflife_co-0.5880.2060.9120.8971.0000.040
book_co-0.5010.265-0.2150.3670.0401.000

Missing values

2023-12-10T19:06:17.076707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:06:17.259612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

book_mastr_seq_noisbn_nomumm_lon_halflife_comxmm_lon_halflife_coavrg_lon_halflife_cobook_co
0557757897884521470475854606659602
1557756197883214180635938593859381
2557762297804351406705333533353331
311616597889754864562492500841839
4557765197818881824915603560356031
5560643997889012179071012101210121
6557768097805217754894700577351583
7557757397889217132095939593959391
8557769997807513724415554555455541
9560644997889349180204897489748971
book_mastr_seq_noisbn_nomumm_lon_halflife_comxmm_lon_halflife_coavrg_lon_halflife_cobook_co
90557801097807136522533313331333132
91116203978897278812622916326504069
92557801397889565506886086608660861
93560653097889976888587077077071
94557802097889820104053768376837681
9511620797889803668283396339633961
96557802597889883774514031403140311
97560655611575221819791683168316831
98557803297889593299923479349034852
9911620897889727834351522152215221