Overview

Dataset statistics

Number of variables: 8
Number of observations: 325
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 24
Duplicate rows (%): 7.4%
Total size in memory: 22.0 KiB
Average record size in memory: 69.4 B
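
The percentage and per-record figures follow from the raw counts. A minimal sketch of that arithmetic, assuming the total memory size is exactly the rounded 22.0 KiB shown (so the derived average can differ from the reported one in the last digit):

```python
# Figures copied from the dataset statistics above.
n_observations = 325
n_duplicate_rows = 24
total_size_kib = 22.0  # rounded in the report, so derived values are approximate

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = n_duplicate_rows / n_observations * 100

# Average bytes per record (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The derived average comes out near 69.3 B; the small gap to the reported 69.4 B is consistent with the total size having been rounded to one decimal place.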

Variable types

Categorical: 6
Numeric: 2

Dataset

Description: Sample
Author: 한국인터넷진흥원
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000024

Alerts

수신년도 has constant value "2018" (Constant)
수신월 has constant value "4" (Constant)
수신일 has constant value "24" (Constant)
발신번호 has constant value "***********" (Constant)
수신번호 has constant value "***********" (Constant)
스팸내용 has constant value "컨텐츠 비공개" (Constant)
Dataset has 24 (7.4%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-10 06:22:02.781642
Analysis finished: 2023-12-10 06:22:04.199110
Duration: 1.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
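
A report of this kind can be regenerated along the following lines. This is only a sketch: the helper name `build_profile_report`, the CSV input, the report title, and the output file name are assumptions, and the settings stored in config.json are not reproduced here. It relies on the third-party packages pandas and ydata-profiling.

```python
def build_profile_report(csv_path, output_path="report.html"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the function can be defined even when the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # assumed input format
    report = ProfileReport(df, title="Profiling report")  # title is an assumption
    report.to_file(output_path)
    return output_path
```

Calling `build_profile_report("data.csv")` with a hypothetical file path would write the HTML report next to the script.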

Variables

수신년도 (reception year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
2018: 325

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2018
2nd row: 2018
3rd row: 2018
4th row: 2018
5th row: 2018

Common Values

Value  Count  Frequency (%)
2018   325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value  Count  Frequency (%)
2018   325    100.0%

수신월 (reception month)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
4: 325

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4
2nd row: 4
3rd row: 4
4th row: 4
5th row: 4

Common Values

Value  Count  Frequency (%)
4      325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value  Count  Frequency (%)
4      325    100.0%

수신일 (reception day)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
24: 325

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 24
2nd row: 24
3rd row: 24
4th row: 24
5th row: 24

Common Values

Value  Count  Frequency (%)
24     325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value  Count  Frequency (%)
24     325    100.0%

수신시분초 (reception time: hour, minute, second)
Real number (ℝ)

Distinct: 227
Distinct (%): 69.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 137908.92
Minimum: 92300
Maximum: 212800
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 92300
5-th percentile: 100900
Q1: 113700
Median: 135500
Q3: 154400
95-th percentile: 183520
Maximum: 212800
Range: 120500
Interquartile range (IQR): 40700

Descriptive statistics

Standard deviation: 26358.749
Coefficient of variation (CV): 0.19113157
Kurtosis: -0.62524771
Mean: 137908.92
Median Absolute Deviation (MAD): 20800
Skewness: 0.28871967
Sum: 44820400
Variance: 6.9478365 × 10^8
Monotonicity: Increasing
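
Several of these statistics are tied together by simple identities (mean = sum / n, range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared). The sketch below checks the reported figures for 수신시분초 against each other; the tolerance is an assumption chosen to absorb rounding in the printed values.

```python
import math

# Values copied from the statistics for 수신시분초.
n = 325
total = 44820400
mean = 137908.92
std = 26358.749
minimum, maximum = 92300, 212800
q1, q3 = 113700, 154400

# Each entry pairs a value derived from other figures with the reported one.
checks = {
    "mean": (total / n, mean),
    "range": (maximum - minimum, 120500),
    "iqr": (q3 - q1, 40700),
    "cv": (std / mean, 0.19113157),
    "variance": (std ** 2, 6.9478365e8),
}

for name, (derived, reported) in checks.items():
    # Relative tolerance allows for rounding in the printed figures.
    ok = math.isclose(derived, reported, rel_tol=1e-6)
    print(f"{name}: derived={derived:.8g} reported={reported:.8g} ok={ok}")
```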
[Figure: Histogram with fixed size bins (bins=50)]

Value               Count  Frequency (%)
112200              5      1.5%
111900              4      1.2%
140700              4      1.2%
134100              3      0.9%
105800              3      0.9%
133600              3      0.9%
133800              3      0.9%
135300              3      0.9%
135500              3      0.9%
131600              3      0.9%
Other values (217)  291    89.5%

Value   Count  Frequency (%)
92300   1      0.3%
92400   1      0.3%
92600   1      0.3%
92800   1      0.3%
93000   1      0.3%
94000   1      0.3%
94200   1      0.3%
94500   1      0.3%
94700   1      0.3%
94800   3      0.9%

Value   Count  Frequency (%)
212800  1      0.3%
203500  1      0.3%
202800  1      0.3%
195600  1      0.3%
191300  1      0.3%
190600  1      0.3%
190400  2      0.6%
190300  1      0.3%
190200  1      0.3%
185900  1      0.3%
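
The column name and its value range (92300 to 212800) suggest that 수신시분초 packs a time of day into an integer as HHMMSS. That encoding is an assumption, not something the report states. Under it, the values can be decoded as sketched below; `hhmmss_to_time` is a hypothetical helper.

```python
from datetime import time


def hhmmss_to_time(value: int) -> time:
    """Interpret an integer such as 92300 as 09:23:00 (assumed HHMMSS encoding)."""
    hours, remainder = divmod(value, 10000)
    minutes, seconds = divmod(remainder, 100)
    return time(hours, minutes, seconds)


# Minimum, median and maximum values reported for the column.
for raw in (92300, 135500, 212800):
    print(raw, "->", hhmmss_to_time(raw).isoformat())
```

If the assumption holds, statistics computed on the raw integers (mean, standard deviation, histogram bins) do not correspond to arithmetic on clock times, because the integer jumps by 4000 between HH:59:59 and the next hour.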

연결시간 (connection duration)
Real number (ℝ)

Distinct: 29
Distinct (%): 8.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.975385
Minimum: 20
Maximum: 61
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 20
5-th percentile: 20
Q1: 21
Median: 25
Q3: 30
95-th percentile: 61
Maximum: 61
Range: 41
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 13.733779
Coefficient of variation (CV): 0.45816856
Kurtosis: 0.83522698
Mean: 29.975385
Median Absolute Deviation (MAD): 4
Skewness: 1.5391093
Sum: 9742
Variance: 188.61668
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=29)]

Value              Count  Frequency (%)
20                 71     21.8%
25                 53     16.3%
21                 42     12.9%
61                 38     11.7%
26                 21     6.5%
24                 15     4.6%
22                 14     4.3%
30                 13     4.0%
60                 8      2.5%
29                 7      2.2%
Other values (19)  43     13.2%
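
The frequency column and the "Other values" row can be recomputed from the counts alone. A minimal sketch using the ten most common 연결시간 values listed above and the 325 observations from the overview:

```python
# Counts copied from the common-values table for 연결시간.
n_observations = 325
top_counts = {20: 71, 25: 53, 21: 42, 61: 38, 26: 21,
              24: 15, 22: 14, 30: 13, 60: 8, 29: 7}

# Frequency (%) is each count divided by the number of observations.
for value, count in top_counts.items():
    print(f"{value}\t{count}\t{count / n_observations * 100:.1f}%")

# Whatever is not in the top ten falls under "Other values".
other = n_observations - sum(top_counts.values())
print(f"Other values\t{other}\t{other / n_observations * 100:.1f}%")
```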
Value  Count  Frequency (%)
20     71     21.8%
21     42     12.9%
22     14     4.3%
23     5      1.5%
24     15     4.6%
25     53     16.3%
26     21     6.5%
27     4      1.2%
28     3      0.9%
29     7      2.2%

Value  Count  Frequency (%)
61     38     11.7%
60     8      2.5%
54     1      0.3%
49     1      0.3%
47     1      0.3%
46     1      0.3%
44     3      0.9%
43     4      1.2%
42     1      0.3%
40     1      0.3%

발신번호 (sender number)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
***********: 325

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ***********
2nd row: ***********
3rd row: ***********
4th row: ***********
5th row: ***********

Common Values

Value        Count  Frequency (%)
***********  325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value    Count  Frequency (%)
(empty)  325    100.0%

수신번호 (recipient number)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
***********: 325

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ***********
2nd row: ***********
3rd row: ***********
4th row: ***********
5th row: ***********

Common Values

Value        Count  Frequency (%)
***********  325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value    Count  Frequency (%)
(empty)  325    100.0%

스팸내용 (spam content)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
컨텐츠 비공개 ("content not disclosed"): 325

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 컨텐츠 비공개
2nd row: 컨텐츠 비공개
3rd row: 컨텐츠 비공개
4th row: 컨텐츠 비공개
5th row: 컨텐츠 비공개

Common Values

Value          Count  Frequency (%)
컨텐츠 비공개  325    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count  Frequency (%)
컨텐츠  325    50.0%
비공개  325    50.0%

Interactions

[Figures: four interaction plots]

Correlations

            수신시분초  연결시간
수신시분초  1.000       0.699
연결시간    0.699       1.000

            수신시분초  연결시간
수신시분초  1.000       0.356
연결시간    0.356       1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Index  수신년도  수신월  수신일  수신시분초  연결시간  발신번호     수신번호     스팸내용
0      2018      4       24      92300       20        ***********  ***********  컨텐츠 비공개
1      2018      4       24      92400       32        ***********  ***********  컨텐츠 비공개
2      2018      4       24      92600       20        ***********  ***********  컨텐츠 비공개
3      2018      4       24      92800       20        ***********  ***********  컨텐츠 비공개
4      2018      4       24      93000       20        ***********  ***********  컨텐츠 비공개
5      2018      4       24      94000       20        ***********  ***********  컨텐츠 비공개
6      2018      4       24      94200       20        ***********  ***********  컨텐츠 비공개
7      2018      4       24      94500       43        ***********  ***********  컨텐츠 비공개
8      2018      4       24      94700       20        ***********  ***********  컨텐츠 비공개
9      2018      4       24      94800       20        ***********  ***********  컨텐츠 비공개

Index  수신년도  수신월  수신일  수신시분초  연결시간  발신번호     수신번호     스팸내용
315    2018      4       24      190200      61        ***********  ***********  컨텐츠 비공개
316    2018      4       24      190300      60        ***********  ***********  컨텐츠 비공개
317    2018      4       24      190400      61        ***********  ***********  컨텐츠 비공개
318    2018      4       24      190400      61        ***********  ***********  컨텐츠 비공개
319    2018      4       24      190600      61        ***********  ***********  컨텐츠 비공개
320    2018      4       24      191300      61        ***********  ***********  컨텐츠 비공개
321    2018      4       24      195600      26        ***********  ***********  컨텐츠 비공개
322    2018      4       24      202800      33        ***********  ***********  컨텐츠 비공개
323    2018      4       24      203500      20        ***********  ***********  컨텐츠 비공개
324    2018      4       24      212800      25        ***********  ***********  컨텐츠 비공개

Duplicate rows

Most frequently occurring

Index  수신년도  수신월  수신일  수신시분초  연결시간  발신번호     수신번호     스팸내용       # duplicates
0      2018      4       24      94800       20        ***********  ***********  컨텐츠 비공개  3
1      2018      4       24      105800      20        ***********  ***********  컨텐츠 비공개  3
3      2018      4       24      112200      25        ***********  ***********  컨텐츠 비공개  3
22     2018      4       24      183600      61        ***********  ***********  컨텐츠 비공개  3
2      2018      4       24      111900      25        ***********  ***********  컨텐츠 비공개  2
4      2018      4       24      112800      20        ***********  ***********  컨텐츠 비공개  2
5      2018      4       24      132800      25        ***********  ***********  컨텐츠 비공개  2
6      2018      4       24      133000      25        ***********  ***********  컨텐츠 비공개  2
7      2018      4       24      133500      25        ***********  ***********  컨텐츠 비공개  2
8      2018      4       24      133800      25        ***********  ***********  컨텐츠 비공개  2
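
Because six of the eight columns are constant, a row is identified by its 수신시분초 and 연결시간 values alone, so duplicate rows can be found by counting repeated pairs. The sketch below illustrates the idea on the last ten rows of the Sample section (indices 315 to 324); it is one way to group duplicates, not necessarily how the report computes them.

```python
from collections import Counter

# (수신시분초, 연결시간) pairs from the last ten sample rows (indices 315-324).
# The other six columns are constant, so these two fields identify a row.
rows = [
    (190200, 61), (190300, 60), (190400, 61), (190400, 61), (190600, 61),
    (191300, 61), (195600, 26), (202800, 33), (203500, 20), (212800, 25),
]

# Count how often each distinct row occurs, then keep those seen more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

# List duplicated rows, most frequent first.
for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(row, n)
```

In this excerpt only the pair (190400, 61) repeats, at indices 317 and 318.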