Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.3 KiB |
Average record size in memory | 44.3 B |
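The two memory figures above are related by simple division: average record size is total size over the number of observations. A minimal sketch of that arithmetic, assuming the unrounded total is close to 44.3 B × 100 (an inference; the report only displays rounded values):

```python
# Figures as displayed in the report (both are rounded for display).
n_observations = 100
avg_record_bytes = 44.3

# Assumption: the exact total is roughly average * count; the report shows only "4.3 KiB".
total_bytes = avg_record_bytes * n_observations
total_kib = total_bytes / 1024  # 1 KiB = 1024 bytes

print(f"{total_bytes:.0f} B is about {total_kib:.1f} KiB")
```

Multiplying 4.3 KiB by 1024 and dividing by 100 gives about 44.0 B rather than 44.3 B, which is consistent with the total having been rounded down from roughly 4.33 KiB.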
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
DateTime | 2 |
Dataset
Description | Sample |
---|---|
Author | National Library of Korea (국립중앙도서관) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=0b6a0c5f-7fb8-4533-ab98-5a4acce6bb87 |
anl_base_dt has constant value "2021-12-03" | Constant |
seq is highly overall correlated with species_master_seq and 1 other field | High correlation |
species_master_seq is highly overall correlated with seq and 1 other field | High correlation |
loan_count is highly overall correlated with seq and 1 other field | High correlation |
loan_count is highly imbalanced (80.6%) | Imbalance |
seq has unique values | Unique |
species_master_seq has unique values | Unique |
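The Constant and Unique alerts above both follow from distinct-value counts. A minimal sketch of that idea, using only the first ten sample rows shown at the end of this report; `classify` is a hypothetical helper written for illustration, not a ydata-profiling function, and the real tool applies its own thresholds to all 100 rows:

```python
# First ten sample rows copied from the report:
# (seq, species_master_seq, loan_count, anl_dt, anl_base_dt)
rows = [
    (595802919, 1916668, "4", "2021-02", "2021-12-03"),
    (595586420, 6351506, "1", "2021-03", "2021-12-03"),
    (595802921, 1916676, "4", "2021-02", "2021-12-03"),
    (595802922, 1916794, "4", "2021-02", "2021-12-03"),
    (595802923, 1916820, "4", "2021-02", "2021-12-03"),
    (595802924, 1917134, "4", "2021-02", "2021-12-03"),
    (595802925, 1917152, "4", "2021-02", "2021-12-03"),
    (595586421, 6351548, "1", "2021-03", "2021-12-03"),
    (595802927, 1917410, "4", "2021-02", "2021-12-03"),
    (595802928, 1917760, "4", "2021-02", "2021-12-03"),
]
columns = ["seq", "species_master_seq", "loan_count", "anl_dt", "anl_base_dt"]


def classify(values):
    """Hypothetical helper: label a column by its number of distinct values."""
    distinct = len(set(values))
    if distinct == 1:
        return "Constant"
    if distinct == len(values):
        return "Unique"
    return None


alerts = {name: classify([row[i] for row in rows]) for i, name in enumerate(columns)}
for name, alert in alerts.items():
    print(name, alert)
```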
Reproduction
Analysis started | 2023-12-10 10:20:41.421744 |
---|---|
Analysis finished | 2023-12-10 10:20:42.559608 |
Duration | 1.14 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
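The reported duration can be checked against the two timestamps. A short sketch using only the standard library and the values shown in the table above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 10:20:41.421744")
finished = datetime.fromisoformat("2023-12-10 10:20:42.559608")

# Subtracting two datetimes yields a timedelta; convert it to seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```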
seq
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.9579647 × 10^8 |
Minimum | 5.9558642 × 10^8 |
---|---|
Maximum | 5.9580302 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 5.9558642 × 10^8 |
---|---|
5-th percentile | 5.9580292 × 10^8 |
Q1 | 5.9580294 × 10^8 |
median | 5.9580297 × 10^8 |
Q3 | 5.9580299 × 10^8 |
95-th percentile | 5.9580301 × 10^8 |
Maximum | 5.9580302 × 10^8 |
Range | 216598 |
Interquartile range (IQR) | 50.5 |
Descriptive statistics
Standard deviation | 37126.539 |
---|---|
Coefficient of variation (CV) | 6.231413 × 10^-5 |
Kurtosis | 29.897737 |
Mean | 5.9579647 × 10^8 |
Median Absolute Deviation (MAD) | 25.5 |
Skewness | -5.5946443 |
Sum | 5.9579647 × 10^10 |
Variance | 1.3783799 × 10^9 |
Monotonicity | Not monotonic |
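Several of the descriptive statistics for seq are derived from one another: the coefficient of variation is the standard deviation over the mean, the variance is the standard deviation squared, and the sum is the mean times the number of observations. A minimal check using the rounded figures displayed above, so agreement is only expected to within display precision:

```python
import math

# Values as displayed in the report for seq.
n = 100
mean = 5.9579647e8
std = 37126.539

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the square of the standard deviation
total = mean * n       # sum of all observations

print(f"CV       = {cv:.6e}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.7e}")
print(math.isclose(cv, 6.231413e-5, rel_tol=1e-6))
```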
Value | Count | Frequency (%) |
595802919 | 1 | 1.0% |
595802983 | 1 | 1.0% |
595802993 | 1 | 1.0% |
595802992 | 1 | 1.0% |
595802991 | 1 | 1.0% |
595802990 | 1 | 1.0% |
595802989 | 1 | 1.0% |
595802988 | 1 | 1.0% |
595802987 | 1 | 1.0% |
595802986 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
595586420 | 1 | 1.0% |
595586421 | 1 | 1.0% |
595586422 | 1 | 1.0% |
595802919 | 1 | 1.0% |
595802921 | 1 | 1.0% |
595802922 | 1 | 1.0% |
595802923 | 1 | 1.0% |
595802924 | 1 | 1.0% |
595802925 | 1 | 1.0% |
595802927 | 1 | 1.0% |
Value | Count | Frequency (%) |
595803018 | 1 | 1.0% |
595803017 | 1 | 1.0% |
595803016 | 1 | 1.0% |
595803015 | 1 | 1.0% |
595803014 | 1 | 1.0% |
595803013 | 1 | 1.0% |
595803012 | 1 | 1.0% |
595803011 | 1 | 1.0% |
595803010 | 1 | 1.0% |
595803009 | 1 | 1.0% |
species_master_seq
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2056507.1 |
Minimum | 1916668 |
---|---|
Maximum | 6351576 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1916668 |
---|---|
5-th percentile | 1917151.1 |
Q1 | 1920863.2 |
median | 1924120.5 |
Q3 | 1926871.2 |
95-th percentile | 1929703.2 |
Maximum | 6351576 |
Range | 4434908 |
Interquartile range (IQR) | 6008 |
Descriptive statistics
Standard deviation | 759154.12 |
---|---|
Coefficient of variation (CV) | 0.36914734 |
Kurtosis | 29.89597 |
Mean | 2056507.1 |
Median Absolute Deviation (MAD) | 3223 |
Skewness | 5.5944045 |
Sum | 2.0565071 × 10^8 |
Variance | 5.7631497 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1916668 | 1 | 1.0% |
1926222 | 1 | 1.0% |
1926568 | 1 | 1.0% |
1926564 | 1 | 1.0% |
1926560 | 1 | 1.0% |
1926558 | 1 | 1.0% |
1926555 | 1 | 1.0% |
1926553 | 1 | 1.0% |
1926549 | 1 | 1.0% |
1926388 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
1916668 | 1 | 1.0% |
1916676 | 1 | 1.0% |
1916794 | 1 | 1.0% |
1916820 | 1 | 1.0% |
1917134 | 1 | 1.0% |
1917152 | 1 | 1.0% |
1917410 | 1 | 1.0% |
1917760 | 1 | 1.0% |
1917765 | 1 | 1.0% |
1917913 | 1 | 1.0% |
Value | Count | Frequency (%) |
6351576 | 1 | 1.0% |
6351548 | 1 | 1.0% |
6351506 | 1 | 1.0% |
1929714 | 1 | 1.0% |
1929707 | 1 | 1.0% |
1929703 | 1 | 1.0% |
1929699 | 1 | 1.0% |
1929687 | 1 | 1.0% |
1929665 | 1 | 1.0% |
1929409 | 1 | 1.0% |
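The range and interquartile range reported for species_master_seq follow directly from the quantile table, and the large gap between them hints at a few extreme values. The sketch below reproduces both figures and then applies Tukey's 1.5 × IQR upper fence to the ten largest values listed above. The fence rule is a choice made here for illustration; the report itself does not state any outlier criterion.

```python
# Quantile figures as displayed for species_master_seq.
minimum, maximum = 1916668, 6351576
q1, q3 = 1920863.2, 1926871.2

value_range = maximum - minimum   # reported as 4434908
iqr = q3 - q1                     # reported as 6008
upper_fence = q3 + 1.5 * iqr      # Tukey's rule (an assumption of this sketch)

# The ten largest values listed in the report.
largest = [6351576, 6351548, 6351506, 1929714, 1929707,
           1929703, 1929699, 1929687, 1929665, 1929409]
outliers = [v for v in largest if v > upper_fence]

print(value_range, round(iqr, 1), round(upper_fence, 1))
print(outliers)
```

Under this rule only the three values near 6.35 million fall above the fence, which is consistent with the strong positive skewness and high kurtosis reported for this variable.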
loan_count
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
4 | 97 |
---|---|
1 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 1 |
3rd row | 4 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
4 | 97 | 97.0% |
1 | 3 | 3.0% |
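The 80.6% imbalance figure for loan_count can be reproduced from the class counts, assuming the score is one minus the Shannon entropy of the class frequencies normalized by the log of the number of classes. That formula is an assumption about how the figure was derived, not something stated in this report, though it does yield the same value:

```python
import math

# Class counts for loan_count from the Common Values table.
counts = {"4": 97, "1": 3}
total = sum(counts.values())

# Shannon entropy (base 2) of the observed class distribution.
entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())

# Normalize by the maximum possible entropy for this many classes,
# then invert so that 0 means balanced and 1 means a single class.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{imbalance:.1%}")
```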
anl_dt
Date
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2021-02-01 00:00:00 |
---|---|
Maximum | 2021-03-01 00:00:00 |
anl_base_dt
Date
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2021-12-03 00:00:00 |
---|---|
Maximum | 2021-12-03 00:00:00 |
| seq | species_master_seq | loan_count | anl_dt |
---|---|---|---|---|
seq | 1.000 | 0.919 | 0.919 | 0.919 |
species_master_seq | 0.919 | 1.000 | 0.963 | 0.963 |
loan_count | 0.919 | 0.963 | 1.000 | 0.963 |
anl_dt | 0.919 | 0.963 | 0.963 | 1.000 |
| seq | species_master_seq | loan_count |
---|---|---|---|
seq | 1.000 | 0.825 | 0.826 |
species_master_seq | 0.825 | 1.000 | 0.826 |
loan_count | 0.826 | 0.826 | 1.000 |
| seq | species_master_seq | loan_count | anl_dt | anl_base_dt |
---|---|---|---|---|---|
0 | 595802919 | 1916668 | 4 | 2021-02 | 2021-12-03 |
1 | 595586420 | 6351506 | 1 | 2021-03 | 2021-12-03 |
2 | 595802921 | 1916676 | 4 | 2021-02 | 2021-12-03 |
3 | 595802922 | 1916794 | 4 | 2021-02 | 2021-12-03 |
4 | 595802923 | 1916820 | 4 | 2021-02 | 2021-12-03 |
5 | 595802924 | 1917134 | 4 | 2021-02 | 2021-12-03 |
6 | 595802925 | 1917152 | 4 | 2021-02 | 2021-12-03 |
7 | 595586421 | 6351548 | 1 | 2021-03 | 2021-12-03 |
8 | 595802927 | 1917410 | 4 | 2021-02 | 2021-12-03 |
9 | 595802928 | 1917760 | 4 | 2021-02 | 2021-12-03 |
| seq | species_master_seq | loan_count | anl_dt | anl_base_dt |
---|---|---|---|---|---|
90 | 595803009 | 1928958 | 4 | 2021-02 | 2021-12-03 |
91 | 595803010 | 1929037 | 4 | 2021-02 | 2021-12-03 |
92 | 595803011 | 1929358 | 4 | 2021-02 | 2021-12-03 |
93 | 595803012 | 1929409 | 4 | 2021-02 | 2021-12-03 |
94 | 595803013 | 1929665 | 4 | 2021-02 | 2021-12-03 |
95 | 595803014 | 1929687 | 4 | 2021-02 | 2021-12-03 |
96 | 595803015 | 1929699 | 4 | 2021-02 | 2021-12-03 |
97 | 595803016 | 1929703 | 4 | 2021-02 | 2021-12-03 |
98 | 595803017 | 1929707 | 4 | 2021-02 | 2021-12-03 |
99 | 595803018 | 1929714 | 4 | 2021-02 | 2021-12-03 |
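The sample rows above suggest why loan_count and anl_dt are flagged as highly correlated: each loan_count value appears with a single anl_dt value. A small cross-tabulation sketch over the 20 rows shown (the first ten and last ten only, so this illustrates the pattern rather than proving it for all 100 rows):

```python
from collections import Counter

# (loan_count, anl_dt) pairs copied from the first ten sample rows, in order.
head = (
    [("4", "2021-02"), ("1", "2021-03")]
    + [("4", "2021-02")] * 5
    + [("1", "2021-03")]
    + [("4", "2021-02")] * 2
)
# The last ten sample rows all share the same pair.
tail = [("4", "2021-02")] * 10

# Count how often each (loan_count, anl_dt) combination occurs.
crosstab = Counter(head + tail)
for (loan_count, anl_dt), n in sorted(crosstab.items()):
    print(f"loan_count={loan_count} anl_dt={anl_dt} rows={n}")
```

Only two of the four possible combinations occur in these rows, so within this sample one column fully determines the other.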