Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 200 |
Missing cells | 200 |
Missing cells (%) | 11.1% |
Duplicate rows | 10 |
Duplicate rows (%) | 5.0% |
Total size in memory | 15.6 KiB |
Average record size in memory | 79.7 B |
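The percentage rows in this table follow directly from the counts. A minimal sketch of that arithmetic, assuming each percentage is rounded to one decimal place; the helper name `pct` is illustrative and not part of ydata-profiling:

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 9
n_observations = 200
missing_cells = 200
duplicate_rows = 10


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


total_cells = n_variables * n_observations            # 9 * 200 = 1800 cells
missing_pct = pct(missing_cells, total_cells)         # share of empty cells
duplicate_pct = pct(duplicate_rows, n_observations)   # share of duplicate rows

print(f"Missing cells: {missing_pct}%")     # 11.1%
print(f"Duplicate rows: {duplicate_pct}%")  # 5.0%
```

All 200 missing cells come from the single SHARE_INFO column, which is empty in every row.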
Variable types
Categorical | 6 |
---|---|
Unsupported | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | (주)네스 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=NES00000000000000004 |
Alerts
YEAR has constant value "2020" | Constant |
MONTH has constant value "2" | Constant |
DATE has constant value "4" | Constant |
DAYS has constant value "TUE" | Constant |
TEL_NO has constant value "**********" | Constant |
Dataset has 10 (5.0%) duplicate rows | Duplicates |
SHARE_CNT is highly overall correlated with SAFE_CNT | High correlation |
SAFE_CNT is highly overall correlated with SHARE_CNT | High correlation |
SHARE_INFO has 200 (100.0%) missing values | Missing |
SHARE_INFO is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
SHARE_CNT has 140 (70.0%) zeros | Zeros |
SAFE_CNT has 139 (69.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:42:18.107327 |
---|---|
Analysis finished | 2023-12-10 06:42:19.170699 |
Duration | 1.06 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
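The reported duration is the difference between the two timestamps. A small sketch of that check using only the standard library, assuming both timestamps share the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
started = datetime.fromisoformat("2023-12-10 06:42:18.107327")
finished = datetime.fromisoformat("2023-12-10 06:42:19.170699")

duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")  # 1.06 seconds
```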
YEAR
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
2020 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 200 | 100.0% |
MONTH
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
2 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 200 | 100.0% |
DATE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
4 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 4 |
3rd row | 4 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
4 | 200 | 100.0% |
TIMES
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
2323 | 101 |
---|---|
2324 | 92 |
2322 | 5 |
2325 | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2322 |
---|---|
2nd row | 2322 |
3rd row | 2322 |
4th row | 2322 |
5th row | 2322 |
Common Values
Value | Count | Frequency (%) |
2323 | 101 | 50.5% |
2324 | 92 | 46.0% |
2322 | 5 | 2.5% |
2325 | 2 | 1.0% |
DAYS
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
TUE |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | TUE |
---|---|
2nd row | TUE |
3rd row | TUE |
4th row | TUE |
5th row | TUE |
Common Values
Value | Count | Frequency (%) |
TUE | 200 | 100.0% |
TEL_NO
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
********** |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ********** |
---|---|
2nd row | ********** |
3rd row | ********** |
4th row | ********** |
5th row | ********** |
Common Values
Value | Count | Frequency (%) |
********** | 200 | 100.0% |
SHARE_INFO
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 200 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.9 KiB |
SHARE_CNT
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 20 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25.05 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 140 |
Zeros (%) | 70.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 68 |
95-th percentile | 100 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 68 |
Descriptive statistics
Standard deviation | 41.13915 |
---|---|
Coefficient of variation (CV) | 1.6422814 |
Kurtosis | -0.64917467 |
Mean | 25.05 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.122706 |
Sum | 5010 |
Variance | 1692.4296 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 140 | 70.0% |
100 | 21 | 10.5% |
99 | 9 | 4.5% |
68 | 6 | 3.0% |
97 | 4 | 2.0% |
98 | 4 | 2.0% |
71 | 2 | 1.0% |
93 | 2 | 1.0% |
90 | 1 | 0.5% |
4 | 1 | 0.5% |
Other values (10) | 10 | 5.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 140 | 70.0% |
4 | 1 | 0.5% |
5 | 1 | 0.5% |
16 | 1 | 0.5% |
20 | 1 | 0.5% |
22 | 1 | 0.5% |
23 | 1 | 0.5% |
25 | 1 | 0.5% |
64 | 1 | 0.5% |
68 | 6 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
100 | 21 | 10.5% |
99 | 9 | 4.5% |
98 | 4 | 2.0% |
97 | 4 | 2.0% |
93 | 2 | 1.0% |
90 | 1 | 0.5% |
84 | 1 | 0.5% |
81 | 1 | 0.5% |
71 | 2 | 1.0% |
69 | 1 | 0.5% |
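The three SHARE_CNT value tables together account for all 200 observations and all 20 distinct values, so the summary statistics can be cross-checked by hand. The sketch below rebuilds the column from those counts and recomputes a few figures with Python's `statistics` module. It assumes the report uses the sample variance (n - 1 denominator) and linearly interpolated quantiles, which is what reproduces the printed numbers here; it is not ydata-profiling's own code.

```python
import statistics

# Value -> count, merged from the three SHARE_CNT tables above
# (common values, lowest values, highest values).
share_cnt_counts = {
    0: 140, 4: 1, 5: 1, 16: 1, 20: 1, 22: 1, 23: 1, 25: 1, 64: 1,
    68: 6, 69: 1, 71: 2, 81: 1, 84: 1, 90: 1, 93: 2, 97: 4, 98: 4,
    99: 9, 100: 21,
}

# Expand the counts into a sorted list with one entry per observation.
values = sorted(v for v, n in share_cnt_counts.items() for _ in range(n))

n_obs = len(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
# "inclusive" interpolates linearly between the min and max of the data.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(n_obs, len(share_cnt_counts))      # observations, distinct values
print(mean, round(variance, 4), round(std, 5), round(cv, 4))
print(q1, median, q3)
```

Because 70% of the observations are zero, Q1 and the median are both 0, and the median absolute deviation is 0 as well.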
SAFE_CNT
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 41 |
---|---|
Distinct (%) | 20.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8044.305 |
Minimum | 0 |
---|---|
Maximum | 93269 |
Zeros | 139 |
Zeros (%) | 69.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 623.75 |
95-th percentile | 53933.9 |
Maximum | 93269 |
Range | 93269 |
Interquartile range (IQR) | 623.75 |
Descriptive statistics
Standard deviation | 20044.355 |
---|---|
Coefficient of variation (CV) | 2.4917448 |
Kurtosis | 7.8626277 |
Mean | 8044.305 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.8697331 |
Sum | 1608861 |
Variance | 4.0177617 × 10⁸ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 139 | 69.5% |
93269 | 5 | 2.5% |
28882 | 4 | 2.0% |
51054 | 4 | 2.0% |
10295 | 3 | 1.5% |
1236 | 3 | 1.5% |
25641 | 2 | 1.0% |
2124 | 2 | 1.0% |
23257 | 2 | 1.0% |
965 | 2 | 1.0% |
Other values (31) | 34 | 17.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 139 | 69.5% |
1 | 1 | 0.5% |
5 | 1 | 0.5% |
9 | 1 | 0.5% |
57 | 1 | 0.5% |
145 | 1 | 0.5% |
164 | 1 | 0.5% |
176 | 1 | 0.5% |
263 | 1 | 0.5% |
459 | 1 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
93269 | 5 | 2.5% |
65599 | 2 | 1.0% |
65359 | 2 | 1.0% |
54255 | 1 | 0.5% |
53917 | 1 | 0.5% |
52265 | 1 | 0.5% |
51054 | 4 | 2.0% |
49464 | 1 | 0.5% |
41120 | 1 | 0.5% |
30077 | 1 | 0.5% |
| | TIMES | SHARE_CNT | SAFE_CNT |
---|---|---|---|
TIMES | 1.000 | 0.062 | 0.545 |
SHARE_CNT | 0.062 | 1.000 | 0.740 |
SAFE_CNT | 0.545 | 0.740 | 1.000 |
| | SHARE_CNT | SAFE_CNT | TIMES |
---|---|---|---|
SHARE_CNT | 1.000 | 0.962 | 0.083 |
SAFE_CNT | 0.962 | 1.000 | 0.265 |
TIMES | 0.083 | 0.265 | 1.000 |
First rows
| | YEAR | MONTH | DATE | TIMES | DAYS | TEL_NO | SHARE_INFO | SHARE_CNT | SAFE_CNT |
---|---|---|---|---|---|---|---|---|---|
0 | 2020 | 2 | 4 | 2322 | TUE | ********** | <NA> | 100 | 25641 |
1 | 2020 | 2 | 4 | 2322 | TUE | ********** | <NA> | 100 | 23257 |
2 | 2020 | 2 | 4 | 2322 | TUE | ********** | <NA> | 99 | 65359 |
3 | 2020 | 2 | 4 | 2322 | TUE | ********** | <NA> | 0 | 0 |
4 | 2020 | 2 | 4 | 2322 | TUE | ********** | <NA> | 0 | 0 |
5 | 2020 | 2 | 4 | 2323 | TUE | ********** | <NA> | 0 | 0 |
6 | 2020 | 2 | 4 | 2323 | TUE | ********** | <NA> | 25 | 164 |
7 | 2020 | 2 | 4 | 2323 | TUE | ********** | <NA> | 0 | 0 |
8 | 2020 | 2 | 4 | 2323 | TUE | ********** | <NA> | 0 | 0 |
9 | 2020 | 2 | 4 | 2323 | TUE | ********** | <NA> | 68 | 93269 |
Last rows
| | YEAR | MONTH | DATE | TIMES | DAYS | TEL_NO | SHARE_INFO | SHARE_CNT | SAFE_CNT |
---|---|---|---|---|---|---|---|---|---|
190 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
191 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
192 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
193 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 20 | 176 |
194 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 98 | 965 |
195 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
196 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
197 | 2020 | 2 | 4 | 2324 | TUE | ********** | <NA> | 0 | 0 |
198 | 2020 | 2 | 4 | 2325 | TUE | ********** | <NA> | 68 | 93269 |
199 | 2020 | 2 | 4 | 2325 | TUE | ********** | <NA> | 0 | 0 |
Most frequently occurring
| | YEAR | MONTH | DATE | TIMES | DAYS | TEL_NO | SHARE_CNT | SAFE_CNT | # duplicates |
---|---|---|---|---|---|---|---|---|---|
1 | 2020 | 2 | 4 | 2323 | TUE | ********** | 0 | 0 | 70 |
7 | 2020 | 2 | 4 | 2324 | TUE | ********** | 0 | 0 | 66 |
2 | 2020 | 2 | 4 | 2323 | TUE | ********** | 68 | 93269 | 3 |
4 | 2020 | 2 | 4 | 2323 | TUE | ********** | 97 | 51054 | 3 |
5 | 2020 | 2 | 4 | 2323 | TUE | ********** | 99 | 10295 | 3 |
6 | 2020 | 2 | 4 | 2323 | TUE | ********** | 100 | 28882 | 3 |
0 | 2020 | 2 | 4 | 2322 | TUE | ********** | 0 | 0 | 2 |
3 | 2020 | 2 | 4 | 2323 | TUE | ********** | 93 | 6654 | 2 |
8 | 2020 | 2 | 4 | 2324 | TUE | ********** | 100 | 1236 | 2 |
9 | 2020 | 2 | 4 | 2324 | TUE | ********** | 100 | 2124 | 2 |
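A duplicate table of this kind can be produced by counting identical rows. The sketch below does so with `collections.Counter` on rows 0-9 of the sample shown earlier, so its counts cover only those ten rows and not the full 200. It illustrates the idea and is not the profiler's implementation.

```python
from collections import Counter

# (TIMES, SHARE_CNT, SAFE_CNT) for rows 0-9 of the sample shown earlier.
# The remaining columns are constant or entirely missing, so they cannot
# distinguish one row from another and are left out here.
first_rows = [
    (2322, 100, 25641),
    (2322, 100, 23257),
    (2322, 99, 65359),
    (2322, 0, 0),
    (2322, 0, 0),
    (2323, 0, 0),
    (2323, 25, 164),
    (2323, 0, 0),
    (2323, 0, 0),
    (2323, 68, 93269),
]

row_counts = Counter(first_rows)
# Keep only rows that occur more than once, most frequent first.
duplicates = [(row, n) for row, n in row_counts.most_common() if n > 1]

for row, n in duplicates:
    print(row, n)
```

Within these ten rows only the all-zero rows repeat, which matches the full table, where the all-zero rows for TIMES 2323 and 2324 dominate the duplicate counts.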