Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 399 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.9 KiB |
Average record size in memory | 43.3 B |
Variable types
DateTime | 1 |
---|---|
Numeric | 3 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | ㈜케이티 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KT1COMMERCEKEYWORD |
Reproduction
Analysis started | 2023-12-10 06:41:43.507966 |
---|---|
Analysis finished | 2023-12-10 06:41:45.630166 |
Duration | 2.12 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
2019-07-23
Date
Distinct | 20 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.2 KiB |
Minimum | 2019-07-23 00:00:00 |
---|---|
Maximum | 2019-08-17 00:00:00 |
1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 20 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.52381 |
Minimum | 1 |
---|---|
Maximum | 20 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 6 |
median | 11 |
Q3 | 15.5 |
95-th percentile | 19.1 |
Maximum | 20 |
Range | 19 |
Interquartile range (IQR) | 9.5 |
Descriptive statistics
Standard deviation | 5.7610553 |
---|---|
Coefficient of variation (CV) | 0.5474306 |
Kurtosis | -1.2046542 |
Mean | 10.52381 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.0011379893 |
Sum | 4199 |
Variance | 33.189758 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 20 | 5.0% |
3 | 20 | 5.0% |
20 | 20 | 5.0% |
19 | 20 | 5.0% |
18 | 20 | 5.0% |
17 | 20 | 5.0% |
16 | 20 | 5.0% |
15 | 20 | 5.0% |
14 | 20 | 5.0% |
13 | 20 | 5.0% |
Other values (10) | 199 |
Value | Count | Frequency (%) |
1 | 19 | |
2 | 20 | |
3 | 20 | |
4 | 20 | |
5 | 20 | |
6 | 20 | |
7 | 20 | |
8 | 20 | |
9 | 20 | |
10 | 20 |
Value | Count | Frequency (%) |
20 | 20 | |
19 | 20 | |
18 | 20 | |
17 | 20 | |
16 | 20 | |
15 | 20 | |
14 | 20 | |
13 | 20 | |
12 | 20 | |
11 | 20 |
원피스
Text
Distinct | 147 |
---|---|
Distinct (%) | 36.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.2 KiB |
Value | Count | Frequency (%) |
원피스 | 19 | 4.5% |
나이키 | 19 | 4.5% |
선풍기 | 18 | 4.3% |
에어프라이어 | 18 | 4.3% |
블라우스 | 17 | 4.0% |
에어팟 | 17 | 4.0% |
데싱디바 | 14 | 3.3% |
가방 | 13 | 3.1% |
x1 | 13 | 3.1% |
여성샌들 | 9 | 2.1% |
Other values (149) | 264 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 103 | 6.3% |
어 | 73 | 4.4% |
이 | 69 | 4.2% |
에 | 56 | 3.4% |
라 | 52 | 3.2% |
원 | 50 | 3.0% |
기 | 41 | 2.5% |
피 | 41 | 2.5% |
가 | 31 | 1.9% |
프 | 26 | 1.6% |
Other values (258) | 1103 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1500 | |
Lowercase Letter | 35 | 2.1% |
Decimal Number | 34 | 2.1% |
Other Punctuation | 26 | 1.6% |
Space Separator | 24 | 1.5% |
Uppercase Letter | 24 | 1.5% |
Dash Punctuation | 1 | 0.1% |
Math Symbol | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 103 | 6.9% |
어 | 73 | 4.9% |
이 | 69 | 4.6% |
에 | 56 | 3.7% |
라 | 52 | 3.5% |
원 | 50 | 3.3% |
기 | 41 | 2.7% |
피 | 41 | 2.7% |
가 | 31 | 2.1% |
프 | 26 | 1.7% |
Other values (226) | 958 |
Lowercase Letter
Value | Count | Frequency (%) |
v | 6 | |
i | 4 | |
t | 4 | |
e | 4 | |
p | 3 | |
c | 2 | 5.7% |
f | 2 | 5.7% |
a | 2 | 5.7% |
h | 2 | 5.7% |
g | 2 | 5.7% |
Other values (4) | 4 |
Uppercase Letter
Value | Count | Frequency (%) |
X | 13 | |
M | 2 | 8.3% |
G | 2 | 8.3% |
A | 2 | 8.3% |
S | 1 | 4.2% |
T | 1 | 4.2% |
C | 1 | 4.2% |
H | 1 | 4.2% |
D | 1 | 4.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 21 | |
0 | 6 | 17.6% |
2 | 5 | 14.7% |
9 | 1 | 2.9% |
4 | 1 | 2.9% |
Other Punctuation
Value | Count | Frequency (%) |
" | 26 |
Space Separator
Value | Count | Frequency (%) |
24 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1500 | |
Common | 86 | 5.2% |
Latin | 59 | 3.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 103 | 6.9% |
어 | 73 | 4.9% |
이 | 69 | 4.6% |
에 | 56 | 3.7% |
라 | 52 | 3.5% |
원 | 50 | 3.3% |
기 | 41 | 2.7% |
피 | 41 | 2.7% |
가 | 31 | 2.1% |
프 | 26 | 1.7% |
Other values (226) | 958 |
Latin
Value | Count | Frequency (%) |
X | 13 | |
v | 6 | 10.2% |
i | 4 | 6.8% |
t | 4 | 6.8% |
e | 4 | 6.8% |
p | 3 | 5.1% |
c | 2 | 3.4% |
f | 2 | 3.4% |
a | 2 | 3.4% |
h | 2 | 3.4% |
Other values (13) | 17 |
Common
Value | Count | Frequency (%) |
" | 26 | |
24 | ||
1 | 21 | |
0 | 6 | 7.0% |
2 | 5 | 5.8% |
9 | 1 | 1.2% |
- | 1 | 1.2% |
+ | 1 | 1.2% |
4 | 1 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1500 | |
ASCII | 145 | 8.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 103 | 6.9% |
어 | 73 | 4.9% |
이 | 69 | 4.6% |
에 | 56 | 3.7% |
라 | 52 | 3.5% |
원 | 50 | 3.3% |
기 | 41 | 2.7% |
피 | 41 | 2.7% |
가 | 31 | 2.1% |
프 | 26 | 1.7% |
Other values (226) | 958 |
ASCII
Value | Count | Frequency (%) |
" | 26 | |
24 | ||
1 | 21 | |
X | 13 | |
0 | 6 | 4.1% |
v | 6 | 4.1% |
2 | 5 | 3.4% |
i | 4 | 2.8% |
t | 4 | 2.8% |
e | 4 | 2.8% |
Other values (22) | 32 |
4
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0902256 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 2 |
Q3 | 2 |
95-th percentile | 4 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.98321578 |
---|---|
Coefficient of variation (CV) | 0.4703874 |
Kurtosis | 3.113008 |
Mean | 2.0902256 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.539763 |
Sum | 834 |
Variance | 0.96671327 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 223 | |
1 | 97 | |
3 | 46 | 11.5% |
4 | 16 | 4.0% |
5 | 13 | 3.3% |
6 | 4 | 1.0% |
Value | Count | Frequency (%) |
1 | 97 | |
2 | 223 | |
3 | 46 | 11.5% |
4 | 16 | 4.0% |
5 | 13 | 3.3% |
6 | 4 | 1.0% |
Value | Count | Frequency (%) |
6 | 4 | 1.0% |
5 | 13 | 3.3% |
4 | 16 | 4.0% |
3 | 46 | 11.5% |
2 | 223 | |
1 | 97 |
3.75
Real number (ℝ)
Distinct | 134 |
---|---|
Distinct (%) | 33.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.53162 |
Minimum | 1 |
---|---|
Maximum | 62.5 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 10 |
Q3 | 28 |
95-th percentile | 46 |
Maximum | 62.5 |
Range | 61.5 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 14.965298 |
---|---|
Coefficient of variation (CV) | 0.96353744 |
Kurtosis | -0.47559235 |
Mean | 15.53162 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 0.80908282 |
Sum | 6197.1165 |
Variance | 223.96014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.0 | 92 | 23.1% |
4.5 | 10 | 2.5% |
5.5 | 10 | 2.5% |
2.0 | 10 | 2.5% |
4.0 | 10 | 2.5% |
8.0 | 7 | 1.8% |
6.5 | 6 | 1.5% |
30.0 | 6 | 1.5% |
8.5 | 5 | 1.3% |
18.0 | 5 | 1.3% |
Other values (124) | 238 |
Value | Count | Frequency (%) |
1.0 | 92 | |
1.2 | 1 | 0.3% |
1.25 | 1 | 0.3% |
1.6 | 1 | 0.3% |
1.6667 | 1 | 0.3% |
1.75 | 2 | 0.5% |
1.8 | 1 | 0.3% |
2.0 | 10 | 2.5% |
2.2 | 1 | 0.3% |
2.3333 | 2 | 0.5% |
Value | Count | Frequency (%) |
62.5 | 1 | 0.3% |
53.5 | 2 | 0.5% |
50.5 | 1 | 0.3% |
50.0 | 3 | |
49.5 | 1 | 0.3% |
49.0 | 2 | 0.5% |
48.5 | 1 | 0.3% |
47.5 | 3 | |
46.5 | 5 | |
46.0 | 3 |
2019-07-23 | 1 | 4 | 3.75 | |
---|---|---|---|---|
2019-07-23 | 1.000 | 0.000 | 0.238 | 0.000 |
1 | 0.000 | 1.000 | 0.759 | 0.574 |
4 | 0.238 | 0.759 | 1.000 | 0.514 |
3.75 | 0.000 | 0.574 | 0.514 | 1.000 |
1 | 4 | 3.75 | |
---|---|---|---|
1 | 1.000 | -0.806 | -0.104 |
4 | -0.806 | 1.000 | 0.496 |
3.75 | -0.104 | 0.496 | 1.000 |
2019-07-23 | 1 | 원피스 | 4 | 3.75 | |
---|---|---|---|---|---|
0 | 2019-07-23 | 2 | 선풍기 | 4 | 8.5 |
1 | 2019-07-23 | 3 | 에어팟 | 4 | 17.5 |
2 | 2019-07-23 | 4 | 시서스 | 3 | 3.6667 |
3 | 2019-07-23 | 5 | 쿨매트 | 3 | 31.0 |
4 | 2019-07-23 | 6 | 플레이도우 | 2 | 4.5 |
5 | 2019-07-23 | 7 | 나이키 | 2 | 5.5 |
6 | 2019-07-23 | 8 | 블라우스 | 2 | 12.5 |
7 | 2019-07-23 | 9 | 크록스 | 2 | 17.0 |
8 | 2019-07-23 | 10 | 아쿠아슈즈 | 2 | 28.5 |
9 | 2019-07-23 | 11 | 에어프라이어 | 2 | 41.0 |
2019-07-23 | 1 | 원피스 | 4 | 3.75 | |
---|---|---|---|---|---|
389 | 2019-08-17 | 11 | vip데이 | 1 | 1.0 |
390 | 2019-08-17 | 12 | X1 | 1 | 1.0 |
391 | 2019-08-17 | 13 | 노트10케이스 | 1 | 1.0 |
392 | 2019-08-17 | 14 | 모유 유산균 | 1 | 1.0 |
393 | 2019-08-17 | 15 | 셀럽샵 | 1 | 1.0 |
394 | 2019-08-17 | 16 | 슬릿팬츠 | 1 | 1.0 |
395 | 2019-08-17 | 17 | 아이즈원 | 1 | 1.0 |
396 | 2019-08-17 | 18 | 침대 | 1 | 1.0 |
397 | 2019-08-17 | 19 | "TS샴푸" | 1 | 2.0 |
398 | 2019-08-17 | 20 | A+G 엣지 | 1 | 2.0 |