Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 199 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 17 |
Duplicate rows (%) | 8.5% |
Total size in memory | 8.5 KiB |
Average record size in memory | 43.7 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 오픈메이트 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=OPMAPHUSDLPCFREE |
Alerts
Dataset has 17 (8.5%) duplicate rows | Duplicates |
4212 is highly overall correlated with 5 | High correlation |
5 is highly overall correlated with 4212 | High correlation |
Reproduction
Analysis started | 2023-12-10 06:48:27.833759 |
---|---|
Analysis finished | 2023-12-10 06:48:28.612371 |
Duration | 0.78 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
5
Categorical
HIGH CORRELATION
Distinct | 5 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5 |
---|---|
2nd row | 2 |
3rd row | 3 |
4th row | 3 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
6 | 57 | 28.6% |
5 | 49 | 24.6% |
2 | 37 | 18.6% |
1 | 32 | 16.1% |
3 | 24 | 12.1% |
2021
Categorical
Distinct | 8 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.5728643 |
Min length | 4 |
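The mean length can be cross-checked from the value counts reported for this variable: plain year labels such as `2021` are 4 characters long and half-year labels such as `2021H1` are 6. A minimal sketch of that arithmetic, using only counts that appear in this report (the variable names are illustrative):

```python
# Counts taken from this variable's value table:
# 142 plain-year labels (length 4) and 57 half-year labels (length 6).
counts_by_length = {
    4: 142,
    6: 12 + 10 + 9 + 8 + 8 + 5 + 5,
}

n_values = sum(counts_by_length.values())
total_chars = sum(length * count for length, count in counts_by_length.items())
mean_length = total_chars / n_values

print(n_values, total_chars, round(mean_length, 7))
```

The result agrees with the reported mean length and with the 199 observations in the dataset.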
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021 |
---|---|
2nd row | 2021 |
3rd row | 2021 |
4th row | 2021 |
5th row | 2021 |
Common Values
Value | Count | Frequency (%) |
2021 | 142 | 71.4% |
2021H1 | 12 | 6.0% |
2019H1 | 10 | 5.0% |
2020H2 | 9 | 4.5% |
2018H1 | 8 | 4.0% |
2020H1 | 8 | 4.0% |
2018H2 | 5 | 2.5% |
2019H2 | 5 | 2.5% |
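This variable mixes two granularities: whole years (`2021`) and half-years (`2021H1`). If the labels need to be compared on a common scale, one option is to split each label into a year and an optional half. The sketch below is an illustration only; `parse_period` and the regular expression are hypothetical helpers, not part of the dataset or of ydata-profiling.

```python
import re
from collections import Counter

# Hypothetical pattern: four digits, optionally followed by H1 or H2.
PERIOD_RE = re.compile(r"^(\d{4})(?:H([12]))?$")


def parse_period(label):
    """Split '2021' into (2021, None) and '2021H1' into (2021, 1)."""
    match = PERIOD_RE.match(label)
    if match is None:
        raise ValueError(f"unrecognised period label: {label!r}")
    year, half = match.groups()
    return int(year), (int(half) if half else None)


# Value counts copied from the Common Values table of this variable.
value_counts = {
    "2021": 142, "2021H1": 12, "2019H1": 10, "2020H2": 9,
    "2018H1": 8, "2020H1": 8, "2018H2": 5, "2019H2": 5,
}

# Aggregate the counts per calendar year, ignoring the half.
rows_per_year = Counter()
for label, count in value_counts.items():
    year, _half = parse_period(label)
    rows_per_year[year] += count

print(sorted(rows_per_year.items()))
```

Whether whole-year and half-year rows should be pooled this way depends on what the rows measure, which this report does not state.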
11305545
Text
Distinct | 126 |
---|---|
Distinct (%) | 63.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 13 |
---|---|
Median length | 8 |
Mean length | 6.5728643 |
Min length | 1 |
Characters and Unicode
Total characters | 1308 |
---|---|
Distinct characters | 15 |
Distinct categories | 3 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 106 ? |
---|---|
Unique (%) | 53.3% |
Sample
1st row | 11680720 |
---|---|
2nd row | 17032 |
3rd row | 1109075010013 |
4th row | 1116073030021 |
5th row | 11500590 |
Words
Value | Count | Frequency (%) |
a | 24 | 12.1% |
dd | 21 | 10.6% |
o | 12 | 6.0% |
11680531 | 3 | 1.5% |
11305660 | 3 | 1.5% |
11500520 | 2 | 1.0% |
11680750 | 2 | 1.0% |
11680640 | 2 | 1.0% |
11680510 | 2 | 1.0% |
11500593 | 2 | 1.0% |
Other values (116) | 126 | 63.3% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 308 | 23.5% |
1 | 236 | 18.0% |
5 | 125 | 9.6% |
6 | 110 | 8.4% |
3 | 86 | 6.6% |
2 | 78 | 6.0% |
4 | 68 | 5.2% |
8 | 61 | 4.7% |
7 | 54 | 4.1% |
D | 42 | 3.2% |
Other values (5) | 140 | 10.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1166 | 89.1% |
Uppercase Letter | 78 | 6.0% |
Other Letter | 64 | 4.9% |
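The three categories above correspond to the Unicode general categories `Nd` (Decimal Number), `Lu` (Uppercase Letter) and `Lo` (Other Letter). As a rough illustration of how such a breakdown can be obtained, the sketch below classifies the characters of a few values taken from the sample rows in this report using Python's standard `unicodedata` module. It does not reproduce the full column, and it is not claimed to be how ydata-profiling computes its table.

```python
import unicodedata
from collections import Counter

# A handful of values that appear in the sample rows of this report.
samples = ["11680720", "다사39005510", "DD", "A", "O"]

# Count characters by Unicode general category abbreviation.
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

# Long names for the abbreviations seen in this column.
LONG_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
}

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```

The Hangul syllables 다 and 사 fall under Other Letter because Hangul has no upper or lower case.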
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 308 | 26.4% |
1 | 236 | 20.2% |
5 | 125 | 10.7% |
6 | 110 | 9.4% |
3 | 86 | 7.4% |
2 | 78 | 6.7% |
4 | 68 | 5.8% |
8 | 61 | 5.2% |
7 | 54 | 4.6% |
9 | 40 | 3.4% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 42 | 53.8% |
A | 24 | 30.8% |
O | 12 | 15.4% |
Other Letter
Value | Count | Frequency (%) |
다 | 32 | 50.0% |
사 | 32 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1166 | 89.1% |
Latin | 78 | 6.0% |
Hangul | 64 | 4.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 308 | 26.4% |
1 | 236 | 20.2% |
5 | 125 | 10.7% |
6 | 110 | 9.4% |
3 | 86 | 7.4% |
2 | 78 | 6.7% |
4 | 68 | 5.8% |
8 | 61 | 5.2% |
7 | 54 | 4.6% |
9 | 40 | 3.4% |
Latin
Value | Count | Frequency (%) |
D | 42 | 53.8% |
A | 24 | 30.8% |
O | 12 | 15.4% |
Hangul
Value | Count | Frequency (%) |
다 | 32 | 50.0% |
사 | 32 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1244 | 95.1% |
Hangul | 64 | 4.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 308 | 24.8% |
1 | 236 | 19.0% |
5 | 125 | 10.0% |
6 | 110 | 8.8% |
3 | 86 | 6.9% |
2 | 78 | 6.3% |
4 | 68 | 5.5% |
8 | 61 | 4.9% |
7 | 54 | 4.3% |
D | 42 | 3.4% |
Other values (3) | 76 | 6.1% |
Hangul
Value | Count | Frequency (%) |
다 | 32 | 50.0% |
사 | 32 | 50.0% |
4212
Real number (ℝ)
HIGH CORRELATION
Distinct | 130 |
---|---|
Distinct (%) | 65.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1601.1809 |
Minimum | 1 |
---|---|
Maximum | 5011 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 16.5 |
median | 597 |
Q3 | 3033.5 |
95-th percentile | 4731.4 |
Maximum | 5011 |
Range | 5010 |
Interquartile range (IQR) | 3017 |
Descriptive statistics
Standard deviation | 1810.7965 |
---|---|
Coefficient of variation (CV) | 1.1309131 |
Kurtosis | -1.2241557 |
Mean | 1601.1809 |
Median Absolute Deviation (MAD) | 595 |
Skewness | 0.63991714 |
Sum | 318635 |
Variance | 3278984 |
Monotonicity | Not monotonic |
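Several of the descriptive statistics above follow directly from others in the same report: the coefficient of variation is the standard deviation divided by the mean, the interquartile range is Q3 minus Q1, and the variance is the square of the standard deviation. A minimal consistency check using the reported (rounded) figures:

```python
# Figures copied from the statistics reported for this variable.
mean = 1601.1809
std = 1810.7965
q1, q3 = 16.5, 3033.5

cv = std / mean        # coefficient of variation
iqr = q3 - q1          # interquartile range
variance = std ** 2    # variance from the standard deviation

print(round(cv, 5), iqr, round(variance))
```

A CV above 1 together with a median (597) far below the mean reflects the strong right skew of this variable. Small differences in the last digits are expected because the inputs are already rounded.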
Common Values
Value | Count | Frequency (%) |
1 | 12 | 6.0% |
2 | 7 | 3.5% |
8 | 5 | 2.5% |
9 | 4 | 2.0% |
27 | 4 | 2.0% |
3 | 4 | 2.0% |
13 | 3 | 1.5% |
2716 | 3 | 1.5% |
6 | 3 | 1.5% |
1114 | 3 | 1.5% |
Other values (120) | 151 | 75.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 12 | 6.0% |
2 | 7 | 3.5% |
3 | 4 | 2.0% |
4 | 2 | 1.0% |
5 | 1 | 0.5% |
6 | 3 | 1.5% |
7 | 1 | 0.5% |
8 | 5 | 2.5% |
9 | 4 | 2.0% |
10 | 2 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
5011 | 1 | 0.5% |
4872 | 1 | 0.5% |
4836 | 3 | 1.5% |
4812 | 1 | 0.5% |
4792 | 1 | 0.5% |
4776 | 1 | 0.5% |
4753 | 2 | 1.0% |
4729 | 3 | 1.5% |
4723 | 1 | 0.5% |
4615 | 1 | 0.5% |
9.29
Real number (ℝ)
Distinct | 118 |
---|---|
Distinct (%) | 59.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.551106 |
Minimum | 0 |
---|---|
Maximum | 66.04 |
Zeros | 1 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 10.4 |
Q3 | 14.51 |
95-th percentile | 19.585 |
Maximum | 66.04 |
Range | 66.04 |
Interquartile range (IQR) | 12.51 |
Descriptive statistics
Standard deviation | 9.046495 |
---|---|
Coefficient of variation (CV) | 0.85739784 |
Kurtosis | 12.959039 |
Mean | 10.551106 |
Median Absolute Deviation (MAD) | 5.46 |
Skewness | 2.6923948 |
Sum | 2099.67 |
Variance | 81.839073 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
2.0 | 33 | 16.6% |
1.0 | 24 | 12.1% |
17.04 | 3 | 1.5% |
13.17 | 3 | 1.5% |
9.47 | 3 | 1.5% |
16.36 | 3 | 1.5% |
8.93 | 2 | 1.0% |
54.89 | 2 | 1.0% |
13.34 | 2 | 1.0% |
9.9 | 2 | 1.0% |
Other values (108) | 122 | 61.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0 | 1 | 0.5% |
1.0 | 24 | 12.1% |
2.0 | 33 | 16.6% |
4.85 | 1 | 0.5% |
4.92 | 1 | 0.5% |
5.14 | 1 | 0.5% |
5.28 | 1 | 0.5% |
6.78 | 1 | 0.5% |
8.02 | 1 | 0.5% |
8.15 | 1 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
66.04 | 1 | 0.5% |
54.89 | 2 | 1.0% |
51.47 | 1 | 0.5% |
23.16 | 1 | 0.5% |
23.04 | 1 | 0.5% |
22.19 | 1 | 0.5% |
21.95 | 1 | 0.5% |
21.65 | 1 | 0.5% |
19.81 | 1 | 0.5% |
19.56 | 1 | 0.5% |
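The high skewness (2.69) and kurtosis (12.96) of this variable come from a small number of large values: there is a clear gap between 23.16 and 51.47. One common way to flag such values is Tukey's upper fence, Q3 + 1.5 × IQR. The sketch below applies it to the largest values listed for this variable. This rule is a choice made for illustration and is not something the report itself computes.

```python
# Quartiles reported for this variable.
q1, q3 = 2.0, 14.51
iqr = q3 - q1
upper_fence = q3 + 1.5 * iqr  # Tukey's rule (an assumption, not from the report)

# (value, count) pairs: the ten largest values listed for this variable.
largest_values = [
    (66.04, 1), (54.89, 2), (51.47, 1), (23.16, 1), (23.04, 1),
    (22.19, 1), (21.95, 1), (21.65, 1), (19.81, 1), (19.56, 1),
]

flagged = [(value, count) for value, count in largest_values if value > upper_fence]
n_flagged_rows = sum(count for _value, count in flagged)

print(round(upper_fence, 3), flagged, n_flagged_rows)
```

Under this rule only the values above 50 are flagged, covering 4 of the 199 rows.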
Variable | 5 | 2021 | 4212 | 9.29 |
---|---|---|---|---|
5 | 1.000 | 0.649 | 0.873 | 0.640 |
2021 | 0.649 | 1.000 | 0.585 | 0.559 |
4212 | 0.873 | 0.585 | 1.000 | 0.545 |
9.29 | 0.640 | 0.559 | 0.545 | 1.000 |
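The "High correlation" alerts in this report name only the pair 4212 and 5, which has the largest off-diagonal coefficient in the matrix above (0.873). The sketch below shows how pairs can be picked out of such a matrix with a cut-off. The threshold of 0.8 is a hypothetical value chosen so that the result matches the alerts; the report does not state the threshold that was actually used.

```python
labels = ["5", "2021", "4212", "9.29"]

# Coefficients copied from the matrix above (symmetric, unit diagonal).
matrix = [
    [1.000, 0.649, 0.873, 0.640],
    [0.649, 1.000, 0.585, 0.559],
    [0.873, 0.585, 1.000, 0.545],
    [0.640, 0.559, 0.545, 1.000],
]

THRESHOLD = 0.8  # hypothetical cut-off, not taken from the report

# Visit each unordered pair once (upper triangle, excluding the diagonal).
high_pairs = [
    (labels[i], labels[j], matrix[i][j])
    for i in range(len(labels))
    for j in range(i + 1, len(labels))
    if abs(matrix[i][j]) >= THRESHOLD
]

print(high_pairs)
```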
Variable | 2021 | 5 |
---|---|---|
2021 | 1.000 | 0.468 |
5 | 0.468 | 1.000 |
Variable | 4212 | 9.29 | 5 | 2021 |
---|---|---|---|---|
4212 | 1.000 | -0.473 | 0.539 | 0.326 |
9.29 | -0.473 | 1.000 | 0.479 | 0.341 |
5 | 0.539 | 0.479 | 1.000 | 0.468 |
2021 | 0.326 | 0.341 | 0.468 | 1.000 |
First rows
Index | 5 | 2021 | 11305545 | 4212 | 9.29 |
---|---|---|---|---|---|
0 | 5 | 2021 | 11680720 | 445 | 8.74 |
1 | 2 | 2021 | 17032 | 9 | 18.22 |
2 | 3 | 2021 | 1109075010013 | 95 | 10.61 |
3 | 3 | 2021 | 1116073030021 | 13 | 17.26 |
4 | 5 | 2021 | 11500590 | 3044 | 12.7 |
5 | 6 | 2018H2 | A | 2811 | 2.0 |
6 | 2 | 2021 | 15286 | 12 | 12.95 |
7 | 1 | 2021 | 다사39005510 | 1 | 4.92 |
8 | 5 | 2021 | 11500615 | 1670 | 23.16 |
9 | 1 | 2021 | 다사59204670 | 35 | 22.19 |
Last rows
Index | 5 | 2021 | 11305545 | 4212 | 9.29 |
---|---|---|---|---|---|
189 | 1 | 2021 | 다사43504860 | 42 | 16.81 |
190 | 5 | 2021 | 11305595 | 2792 | 10.02 |
191 | 5 | 2021 | 11680730 | 978 | 13.17 |
192 | 3 | 2021 | 1123075010002 | 9 | 17.23 |
193 | 2 | 2021 | 264902 | 6 | 11.63 |
194 | 6 | 2021H1 | DD | 1165 | 2.0 |
195 | 1 | 2021 | 다사60604160 | 1 | 0.0 |
196 | 1 | 2021 | 다사56306120 | 1 | 5.28 |
197 | 5 | 2021 | 11305534 | 4532 | 9.89 |
198 | 6 | 2021H1 | O | 1114 | 2.0 |
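The column names in these tables (`5`, `2021`, `11305545`, `4212`, `9.29`) look like data values rather than field names. One possible explanation, which this report cannot confirm, is that the source file has no header row and its first record was read as the header. The sketch below shows the difference with Python's standard `csv` module. The comma delimiter and the field names are hypothetical placeholders; the first line reuses the column labels above and the next two are rows 0 and 1 of the sample.

```python
import csv
import io

# Assumed raw layout: no header line, comma-separated (both are assumptions).
raw = (
    "5,2021,11305545,4212,9.29\n"
    "5,2021,11680720,445,8.74\n"
    "2,2021,17032,9,18.22\n"
)

# Hypothetical field names, for illustration only.
fieldnames = ["group_code", "period", "area_code", "count", "value"]

# Default behaviour: the first line is consumed as the header.
header_guessed = list(csv.DictReader(io.StringIO(raw)))

# Explicit field names: every line is kept as a data record.
header_explicit = list(csv.DictReader(io.StringIO(raw), fieldnames=fieldnames))

print(len(header_guessed), len(header_explicit))
print(header_explicit[0]["area_code"])
```

If that explanation holds, the source file would contain 200 records rather than the 199 observations profiled here.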
Most frequently occurring
Index | 5 | 2021 | 11305545 | 4212 | 9.29 | # duplicates |
---|---|---|---|---|---|---|
3 | 5 | 2021 | 11305660 | 4836 | 9.47 | 3 |
10 | 5 | 2021 | 11680531 | 2716 | 17.04 | 3 |
0 | 3 | 2021 | 1123076020001 | 27 | 8.88 | 2 |
1 | 5 | 2021 | 11305595 | 2792 | 10.02 | 2 |
2 | 5 | 2021 | 11305625 | 2336 | 9.9 | 2 |
4 | 5 | 2021 | 11500520 | 879 | 13.34 | 2 |
5 | 5 | 2021 | 11500540 | 4170 | 16.66 | 2 |
6 | 5 | 2021 | 11500590 | 3044 | 12.7 | 2 |
7 | 5 | 2021 | 11500593 | 2099 | 14.6 | 2 |
8 | 5 | 2021 | 11500611 | 1616 | 11.9 | 2 |
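A table like the one above can be produced by grouping identical rows and keeping the groups that occur more than once. The sketch below does this with `collections.Counter` on a small subset built from three of the listed groups plus one non-repeated row from the sample. It also shows that "number of duplicates" can mean different things; the report does not say which definition its headline figure of 17 uses.

```python
from collections import Counter

# Three duplicate groups from the table above, with their listed counts.
groups = {
    ("5", "2021", "11305660", "4836", "9.47"): 3,
    ("5", "2021", "11680531", "2716", "17.04"): 3,
    ("3", "2021", "1123076020001", "27", "8.88"): 2,
}
rows = [row for row, n in groups.items() for _ in range(n)]
rows.append(("1", "2021", "다사60604160", "1", "0.0"))  # a row that occurs once

counts = Counter(rows)
duplicated = {row: n for row, n in counts.items() if n > 1}

n_groups = len(duplicated)                          # distinct repeated rows
rows_in_groups = sum(duplicated.values())           # all rows with a twin
extra_copies = sum(n - 1 for n in duplicated.values())  # rows dropped by dedup

print(n_groups, rows_in_groups, extra_copies)
```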