Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 187 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 23 |
Duplicate rows (%) | 12.3% |
Total size in memory | 11.1 KiB |
Average record size in memory | 60.7 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국인터넷진흥원 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000022 |
발신번호 has constant value "" | Constant |
수신번호 has constant value "" | Constant |
스팸내용 has constant value "" | Constant |
Dataset has 23 (12.3%) duplicate rows | Duplicates |
수신월 is highly overall correlated with 수신시분초 and 2 other fields | High correlation |
수신일 is highly overall correlated with 수신시분초 and 2 other fields | High correlation |
수신년도 is highly overall correlated with 수신시분초 and 2 other fields | High correlation |
수신시분초 is highly overall correlated with 수신년도 and 2 other fields | High correlation |
Reproduction
Analysis started | 2023-12-10 06:46:19.473153 |
---|---|
Analysis finished | 2023-12-10 06:46:20.007976 |
Duration | 0.53 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
수신년도
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
2018 | |
---|---|
2017 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2017 |
---|---|
2nd row | 2017 |
3rd row | 2017 |
4th row | 2017 |
5th row | 2017 |
Common Values
Value | Count | Frequency (%) |
2018 | 151 | |
2017 | 36 | 19.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2018 | 151 | |
2017 | 36 | 19.3% |
수신월
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
2 | |
---|---|
1 | |
11 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.1925134 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 |
---|---|
2nd row | 11 |
3rd row | 11 |
4th row | 11 |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
2 | 99 | |
1 | 52 | |
11 | 36 | 19.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 99 | |
1 | 52 | |
11 | 36 | 19.3% |
수신일
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
24 | |
---|---|
28 | |
23 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 23 |
---|---|
2nd row | 23 |
3rd row | 23 |
4th row | 23 |
5th row | 23 |
Common Values
Value | Count | Frequency (%) |
24 | 99 | |
28 | 52 | |
23 | 36 | 19.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
24 | 99 | |
28 | 52 | |
23 | 36 | 19.3% |
수신시분초
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 145 |
---|---|
Distinct (%) | 77.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 116984.49 |
Minimum | 100 |
---|---|
Maximum | 233100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.8 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 72020 |
Q1 | 95500 |
median | 113000 |
Q3 | 132400 |
95-th percentile | 182800 |
Maximum | 233100 |
Range | 233000 |
Interquartile range (IQR) | 36900 |
Descriptive statistics
Standard deviation | 38267.901 |
---|---|
Coefficient of variation (CV) | 0.32711944 |
Kurtosis | 2.0455243 |
Mean | 116984.49 |
Median Absolute Deviation (MAD) | 18500 |
Skewness | 0.035261028 |
Sum | 21876100 |
Variance | 1.4644323 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
182800 | 6 | 3.2% |
110100 | 3 | 1.6% |
102100 | 3 | 1.6% |
120100 | 3 | 1.6% |
130000 | 3 | 1.6% |
113000 | 2 | 1.1% |
103600 | 2 | 1.1% |
100 | 2 | 1.1% |
150600 | 2 | 1.1% |
121500 | 2 | 1.1% |
Other values (135) | 159 |
Value | Count | Frequency (%) |
100 | 2 | |
1200 | 1 | |
2200 | 1 | |
2400 | 1 | |
32100 | 1 | |
35700 | 1 | |
52200 | 1 | |
70300 | 1 | |
70400 | 1 | |
75800 | 1 |
Value | Count | Frequency (%) |
233100 | 1 | 0.5% |
230400 | 1 | 0.5% |
225500 | 1 | 0.5% |
205000 | 1 | 0.5% |
201500 | 1 | 0.5% |
195600 | 1 | 0.5% |
182800 | 6 | |
182700 | 1 | 0.5% |
182500 | 2 | 1.1% |
181800 | 1 | 0.5% |
발신번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
*********** |
---|
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 11 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *********** |
---|---|
2nd row | *********** |
3rd row | *********** |
4th row | *********** |
5th row | *********** |
Common Values
Value | Count | Frequency (%) |
*********** | 187 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
187 |
수신번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
*********** |
---|
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 11 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | *********** |
---|---|
2nd row | *********** |
3rd row | *********** |
4th row | *********** |
5th row | *********** |
Common Values
Value | Count | Frequency (%) |
*********** | 187 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
187 |
스팸내용
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
컨텐츠 비공개 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 컨텐츠 비공개 |
---|---|
2nd row | 컨텐츠 비공개 |
3rd row | 컨텐츠 비공개 |
4th row | 컨텐츠 비공개 |
5th row | 컨텐츠 비공개 |
Common Values
Value | Count | Frequency (%) |
컨텐츠 비공개 | 187 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
컨텐츠 | 187 | |
비공개 | 187 |
수신년도 | 수신월 | 수신일 | 수신시분초 | |
---|---|---|---|---|
수신년도 | 1.000 | 1.000 | 1.000 | 0.844 |
수신월 | 1.000 | 1.000 | 1.000 | 0.756 |
수신일 | 1.000 | 1.000 | 1.000 | 0.756 |
수신시분초 | 0.844 | 0.756 | 0.756 | 1.000 |
수신월 | 수신일 | 수신년도 | |
---|---|---|---|
수신월 | 1.000 | 1.000 | 0.997 |
수신일 | 1.000 | 1.000 | 0.997 |
수신년도 | 0.997 | 0.997 | 1.000 |
수신시분초 | 수신년도 | 수신월 | 수신일 | |
---|---|---|---|---|
수신시분초 | 1.000 | 0.661 | 0.617 | 0.617 |
수신년도 | 0.661 | 1.000 | 0.997 | 0.997 |
수신월 | 0.617 | 0.997 | 1.000 | 1.000 |
수신일 | 0.617 | 0.997 | 1.000 | 1.000 |
수신년도 | 수신월 | 수신일 | 수신시분초 | 발신번호 | 수신번호 | 스팸내용 | |
---|---|---|---|---|---|---|---|
0 | 2017 | 11 | 23 | 32100 | *********** | *********** | 컨텐츠 비공개 |
1 | 2017 | 11 | 23 | 82400 | *********** | *********** | 컨텐츠 비공개 |
2 | 2017 | 11 | 23 | 91400 | *********** | *********** | 컨텐츠 비공개 |
3 | 2017 | 11 | 23 | 92200 | *********** | *********** | 컨텐츠 비공개 |
4 | 2017 | 11 | 23 | 92300 | *********** | *********** | 컨텐츠 비공개 |
5 | 2017 | 11 | 23 | 92500 | *********** | *********** | 컨텐츠 비공개 |
6 | 2017 | 11 | 23 | 92500 | *********** | *********** | 컨텐츠 비공개 |
7 | 2017 | 11 | 23 | 93100 | *********** | *********** | 컨텐츠 비공개 |
8 | 2017 | 11 | 23 | 93200 | *********** | *********** | 컨텐츠 비공개 |
9 | 2017 | 11 | 23 | 93800 | *********** | *********** | 컨텐츠 비공개 |
수신년도 | 수신월 | 수신일 | 수신시분초 | 발신번호 | 수신번호 | 스팸내용 | |
---|---|---|---|---|---|---|---|
177 | 2018 | 2 | 24 | 131300 | *********** | *********** | 컨텐츠 비공개 |
178 | 2018 | 2 | 24 | 131700 | *********** | *********** | 컨텐츠 비공개 |
179 | 2018 | 2 | 24 | 131900 | *********** | *********** | 컨텐츠 비공개 |
180 | 2018 | 2 | 24 | 132000 | *********** | *********** | 컨텐츠 비공개 |
181 | 2018 | 2 | 24 | 132200 | *********** | *********** | 컨텐츠 비공개 |
182 | 2018 | 2 | 24 | 133000 | *********** | *********** | 컨텐츠 비공개 |
183 | 2018 | 2 | 24 | 133400 | *********** | *********** | 컨텐츠 비공개 |
184 | 2018 | 2 | 24 | 133600 | *********** | *********** | 컨텐츠 비공개 |
185 | 2018 | 2 | 24 | 134000 | *********** | *********** | 컨텐츠 비공개 |
186 | 2018 | 2 | 24 | 135200 | *********** | *********** | 컨텐츠 비공개 |
Most frequently occurring
수신년도 | 수신월 | 수신일 | 수신시분초 | 발신번호 | 수신번호 | 스팸내용 | # duplicates | |
---|---|---|---|---|---|---|---|---|
3 | 2017 | 11 | 23 | 182800 | *********** | *********** | 컨텐츠 비공개 | 6 |
17 | 2018 | 2 | 24 | 110100 | *********** | *********** | 컨텐츠 비공개 | 3 |
19 | 2018 | 2 | 24 | 120100 | *********** | *********** | 컨텐츠 비공개 | 3 |
21 | 2018 | 2 | 24 | 130000 | *********** | *********** | 컨텐츠 비공개 | 3 |
0 | 2017 | 11 | 23 | 92500 | *********** | *********** | 컨텐츠 비공개 | 2 |
1 | 2017 | 11 | 23 | 171900 | *********** | *********** | 컨텐츠 비공개 | 2 |
2 | 2017 | 11 | 23 | 182500 | *********** | *********** | 컨텐츠 비공개 | 2 |
4 | 2018 | 1 | 28 | 115000 | *********** | *********** | 컨텐츠 비공개 | 2 |
5 | 2018 | 1 | 28 | 122000 | *********** | *********** | 컨텐츠 비공개 | 2 |
6 | 2018 | 1 | 28 | 140000 | *********** | *********** | 컨텐츠 비공개 | 2 |