Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 325 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 24 |
Duplicate rows (%) | 7.4% |
Total size in memory | 22.0 KiB |
Average record size in memory | 69.4 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 한국인터넷진흥원 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000024 |
Reproduction
Analysis started | 2023-12-10 06:22:02.781642 |
---|---|
Analysis finished | 2023-12-10 06:22:04.199110 |
Duration | 1.42 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
수신년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
2018 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2018 |
---|---|
2nd row | 2018 |
3rd row | 2018 |
4th row | 2018 |
5th row | 2018 |
Common Values
Value | Count | Frequency (%) |
2018 | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2018 | 325 |
수신월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
4 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 4 |
3rd row | 4 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
4 | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4 | 325 |
수신일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
24 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 24 |
---|---|
2nd row | 24 |
3rd row | 24 |
4th row | 24 |
5th row | 24 |
Common Values
Value | Count | Frequency (%) |
24 | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
24 | 325 |
수신시분초
Real number (ℝ)
Distinct | 227 |
---|---|
Distinct (%) | 69.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 137908.92 |
Minimum | 92300 |
---|---|
Maximum | 212800 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 92300 |
---|---|
5-th percentile | 100900 |
Q1 | 113700 |
median | 135500 |
Q3 | 154400 |
95-th percentile | 183520 |
Maximum | 212800 |
Range | 120500 |
Interquartile range (IQR) | 40700 |
Descriptive statistics
Standard deviation | 26358.749 |
---|---|
Coefficient of variation (CV) | 0.19113157 |
Kurtosis | -0.62524771 |
Mean | 137908.92 |
Median Absolute Deviation (MAD) | 20800 |
Skewness | 0.28871967 |
Sum | 44820400 |
Variance | 6.9478365 × 108 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
112200 | 5 | 1.5% |
111900 | 4 | 1.2% |
140700 | 4 | 1.2% |
134100 | 3 | 0.9% |
105800 | 3 | 0.9% |
133600 | 3 | 0.9% |
133800 | 3 | 0.9% |
135300 | 3 | 0.9% |
135500 | 3 | 0.9% |
131600 | 3 | 0.9% |
Other values (217) | 291 |
Value | Count | Frequency (%) |
92300 | 1 | 0.3% |
92400 | 1 | 0.3% |
92600 | 1 | 0.3% |
92800 | 1 | 0.3% |
93000 | 1 | 0.3% |
94000 | 1 | 0.3% |
94200 | 1 | 0.3% |
94500 | 1 | 0.3% |
94700 | 1 | 0.3% |
94800 | 3 |
Value | Count | Frequency (%) |
212800 | 1 | |
203500 | 1 | |
202800 | 1 | |
195600 | 1 | |
191300 | 1 | |
190600 | 1 | |
190400 | 2 | |
190300 | 1 | |
190200 | 1 | |
185900 | 1 |
연결시간
Real number (ℝ)
Distinct | 29 |
---|---|
Distinct (%) | 8.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.975385 |
Minimum | 20 |
---|---|
Maximum | 61 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 20 |
---|---|
5-th percentile | 20 |
Q1 | 21 |
median | 25 |
Q3 | 30 |
95-th percentile | 61 |
Maximum | 61 |
Range | 41 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 13.733779 |
---|---|
Coefficient of variation (CV) | 0.45816856 |
Kurtosis | 0.83522698 |
Mean | 29.975385 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 1.5391093 |
Sum | 9742 |
Variance | 188.61668 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20 | 71 | |
25 | 53 | |
21 | 42 | |
61 | 38 | |
26 | 21 | 6.5% |
24 | 15 | 4.6% |
22 | 14 | 4.3% |
30 | 13 | 4.0% |
60 | 8 | 2.5% |
29 | 7 | 2.2% |
Other values (19) | 43 |
Value | Count | Frequency (%) |
20 | 71 | |
21 | 42 | |
22 | 14 | 4.3% |
23 | 5 | 1.5% |
24 | 15 | 4.6% |
25 | 53 | |
26 | 21 | 6.5% |
27 | 4 | 1.2% |
28 | 3 | 0.9% |
29 | 7 | 2.2% |
Value | Count | Frequency (%) |
61 | 38 | |
60 | 8 | 2.5% |
54 | 1 | 0.3% |
49 | 1 | 0.3% |
47 | 1 | 0.3% |
46 | 1 | 0.3% |
44 | 3 | 0.9% |
43 | 4 | 1.2% |
42 | 1 | 0.3% |
40 | 1 | 0.3% |
발신번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
*********** |
---|
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 11 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *********** |
---|---|
2nd row | *********** |
3rd row | *********** |
4th row | *********** |
5th row | *********** |
Common Values
Value | Count | Frequency (%) |
*********** | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
325 |
수신번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
*********** |
---|
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 11 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *********** |
---|---|
2nd row | *********** |
3rd row | *********** |
4th row | *********** |
5th row | *********** |
Common Values
Value | Count | Frequency (%) |
*********** | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
325 |
스팸내용
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
컨텐츠 비공개 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 컨텐츠 비공개 |
---|---|
2nd row | 컨텐츠 비공개 |
3rd row | 컨텐츠 비공개 |
4th row | 컨텐츠 비공개 |
5th row | 컨텐츠 비공개 |
Common Values
Value | Count | Frequency (%) |
컨텐츠 비공개 | 325 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
컨텐츠 | 325 | |
비공개 | 325 |
수신시분초 | 연결시간 | |
---|---|---|
수신시분초 | 1.000 | 0.699 |
연결시간 | 0.699 | 1.000 |
수신시분초 | 연결시간 | |
---|---|---|
수신시분초 | 1.000 | 0.356 |
연결시간 | 0.356 | 1.000 |
수신년도 | 수신월 | 수신일 | 수신시분초 | 연결시간 | 발신번호 | 수신번호 | 스팸내용 | |
---|---|---|---|---|---|---|---|---|
0 | 2018 | 4 | 24 | 92300 | 20 | *********** | *********** | 컨텐츠 비공개 |
1 | 2018 | 4 | 24 | 92400 | 32 | *********** | *********** | 컨텐츠 비공개 |
2 | 2018 | 4 | 24 | 92600 | 20 | *********** | *********** | 컨텐츠 비공개 |
3 | 2018 | 4 | 24 | 92800 | 20 | *********** | *********** | 컨텐츠 비공개 |
4 | 2018 | 4 | 24 | 93000 | 20 | *********** | *********** | 컨텐츠 비공개 |
5 | 2018 | 4 | 24 | 94000 | 20 | *********** | *********** | 컨텐츠 비공개 |
6 | 2018 | 4 | 24 | 94200 | 20 | *********** | *********** | 컨텐츠 비공개 |
7 | 2018 | 4 | 24 | 94500 | 43 | *********** | *********** | 컨텐츠 비공개 |
8 | 2018 | 4 | 24 | 94700 | 20 | *********** | *********** | 컨텐츠 비공개 |
9 | 2018 | 4 | 24 | 94800 | 20 | *********** | *********** | 컨텐츠 비공개 |
수신년도 | 수신월 | 수신일 | 수신시분초 | 연결시간 | 발신번호 | 수신번호 | 스팸내용 | |
---|---|---|---|---|---|---|---|---|
315 | 2018 | 4 | 24 | 190200 | 61 | *********** | *********** | 컨텐츠 비공개 |
316 | 2018 | 4 | 24 | 190300 | 60 | *********** | *********** | 컨텐츠 비공개 |
317 | 2018 | 4 | 24 | 190400 | 61 | *********** | *********** | 컨텐츠 비공개 |
318 | 2018 | 4 | 24 | 190400 | 61 | *********** | *********** | 컨텐츠 비공개 |
319 | 2018 | 4 | 24 | 190600 | 61 | *********** | *********** | 컨텐츠 비공개 |
320 | 2018 | 4 | 24 | 191300 | 61 | *********** | *********** | 컨텐츠 비공개 |
321 | 2018 | 4 | 24 | 195600 | 26 | *********** | *********** | 컨텐츠 비공개 |
322 | 2018 | 4 | 24 | 202800 | 33 | *********** | *********** | 컨텐츠 비공개 |
323 | 2018 | 4 | 24 | 203500 | 20 | *********** | *********** | 컨텐츠 비공개 |
324 | 2018 | 4 | 24 | 212800 | 25 | *********** | *********** | 컨텐츠 비공개 |
Most frequently occurring
수신년도 | 수신월 | 수신일 | 수신시분초 | 연결시간 | 발신번호 | 수신번호 | 스팸내용 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|
0 | 2018 | 4 | 24 | 94800 | 20 | *********** | *********** | 컨텐츠 비공개 | 3 |
1 | 2018 | 4 | 24 | 105800 | 20 | *********** | *********** | 컨텐츠 비공개 | 3 |
3 | 2018 | 4 | 24 | 112200 | 25 | *********** | *********** | 컨텐츠 비공개 | 3 |
22 | 2018 | 4 | 24 | 183600 | 61 | *********** | *********** | 컨텐츠 비공개 | 3 |
2 | 2018 | 4 | 24 | 111900 | 25 | *********** | *********** | 컨텐츠 비공개 | 2 |
4 | 2018 | 4 | 24 | 112800 | 20 | *********** | *********** | 컨텐츠 비공개 | 2 |
5 | 2018 | 4 | 24 | 132800 | 25 | *********** | *********** | 컨텐츠 비공개 | 2 |
6 | 2018 | 4 | 24 | 133000 | 25 | *********** | *********** | 컨텐츠 비공개 | 2 |
7 | 2018 | 4 | 24 | 133500 | 25 | *********** | *********** | 컨텐츠 비공개 | 2 |
8 | 2018 | 4 | 24 | 133800 | 25 | *********** | *********** | 컨텐츠 비공개 | 2 |