Overview

Dataset statistics

Number of variables: 13
Number of observations: 187
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 20.6 KiB
Average record size in memory: 112.7 B

Variable types

Categorical: 9
Numeric: 4

Dataset

Description: Sample
Author: 한국인터넷진흥원 (Korea Internet & Security Agency, KISA)
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000023

Alerts

수신년도 has constant value "2019" (Constant)
스팸내용 has constant value "컨텐츠 비공개" (Constant)
통신사업자코드 has constant value "-" (Constant)
기기명 has constant value "-" (Constant)
신고년도 has constant value "2019" (Constant)
발신번호 has constant value "***********" (Constant)
수신번호 has constant value "***********" (Constant)
수신일 is highly overall correlated with 수신시분초 and 3 other fields (High correlation)
수신월 is highly overall correlated with 수신시분초 and 3 other fields (High correlation)
수신시분초 is highly overall correlated with 신고시분초 and 2 other fields (High correlation)
신고월 is highly overall correlated with 신고일 and 2 other fields (High correlation)
신고일 is highly overall correlated with 신고월 and 2 other fields (High correlation)
신고시분초 is highly overall correlated with 수신시분초 (High correlation)

Reproduction

Analysis started: 2023-12-10 06:36:09.947045
Analysis finished: 2023-12-10 06:36:13.687503
Duration: 3.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

수신년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value  Count  Frequency (%)
2019   187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

수신월
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 6
2nd row: 6
3rd row: 6
4th row: 6
5th row: 6

Common Values

Value  Count  Frequency (%)
9      92     49.2%
7      54     28.9%
8      22     11.8%
6      19     10.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

스팸내용
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 컨텐츠 비공개
2nd row: 컨텐츠 비공개
3rd row: 컨텐츠 비공개
4th row: 컨텐츠 비공개
5th row: 컨텐츠 비공개

Common Values

Value          Count  Frequency (%)
컨텐츠 비공개  187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value   Count  Frequency (%)
컨텐츠  187    50.0%
비공개  187    50.0%

통신사업자코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: -
2nd row: -
3rd row: -
4th row: -
5th row: -

Common Values

Value  Count  Frequency (%)
-      187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value    Count  Frequency (%)
(blank)  187    100.0%

기기명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: -
2nd row: -
3rd row: -
4th row: -
5th row: -

Common Values

Value  Count  Frequency (%)
-      187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value    Count  Frequency (%)
(blank)  187    100.0%

수신일
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 0.5%

Sample

1st row: 28
2nd row: 28
3rd row: 28
4th row: 28
5th row: 28

Common Values

Value  Count  Frequency (%)
24     92     49.2%
25     53     28.3%
19     22     11.8%
28     19     10.2%
13     1      0.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

수신시분초
Real number (ℝ)

HIGH CORRELATION 

Distinct: 154
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 129964.06
Minimum: 80419
Maximum: 213845
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 80419
5-th percentile: 82005
Q1: 95645.5
median: 123802
Q3: 163158
95-th percentile: 182929.3
Maximum: 213845
Range: 133426
Interquartile range (IQR): 67512.5

Descriptive statistics

Standard deviation: 35048.033
Coefficient of variation (CV): 0.26967481
Kurtosis: -1.2935595
Mean: 129964.06
Median Absolute Deviation (MAD): 29920
Skewness: 0.097925665
Sum: 24303279
Variance: 1.2283646 × 10^9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value               Count  Frequency (%)
113726              4      2.1%
83453               3      1.6%
170705              2      1.1%
153716              2      1.1%
152954              2      1.1%
153002              2      1.1%
153004              2      1.1%
163732              2      1.1%
123806              2      1.1%
123803              2      1.1%
Other values (144)  164    87.7%

Minimum values

Value  Count  Frequency (%)
80419  1      0.5%
80425  1      0.5%
81412  1      0.5%
81449  1      0.5%
81503  2      1.1%
81535  1      0.5%
81946  1      0.5%
81950  1      0.5%
82005  2      1.1%
82718  1      0.5%

Maximum values

Value   Count  Frequency (%)
213845  1      0.5%
192443  1      0.5%
191523  2      1.1%
191522  1      0.5%
191520  1      0.5%
182943  1      0.5%
182942  1      0.5%
182935  1      0.5%
182932  1      0.5%
182923  1      0.5%

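The derived entries in the 수신시분초 tables (Range, IQR, CV, Variance, Mean) follow from the primary ones by their standard definitions. The following is a minimal sketch that recomputes them from the figures reported above; the function name `derived_stats` and its argument names are illustrative and not part of the report, and small differences are expected because the report rounds its values.

```python
def derived_stats(minimum, maximum, q1, q3, mean, std, total, n):
    """Recompute derived statistics from the primary ones (illustrative helper)."""
    return {
        "range": maximum - minimum,     # Range = Maximum - Minimum
        "iqr": q3 - q1,                 # Interquartile range = Q3 - Q1
        "cv": std / mean,               # Coefficient of variation = std / mean
        "variance": std ** 2,           # Variance = std squared
        "mean_from_sum": total / n,     # Mean = Sum / number of observations
    }

# Values copied from the 수신시분초 tables in this report.
stats = derived_stats(
    minimum=80419, maximum=213845,
    q1=95645.5, q3=163158,
    mean=129964.06, std=35048.033,
    total=24303279, n=187,
)
for name, value in stats.items():
    print(f"{name}: {value}")
```

The recomputed range (133426) and IQR (67512.5) match the report exactly; the CV, variance, and mean agree with the reported values up to rounding.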
신고년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value  Count  Frequency (%)
2019   187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

신고월
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.0748663
Minimum: 6
Maximum: 11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 6
5-th percentile: 6
Q1: 7
median: 9
Q3: 9
95-th percentile: 9
Maximum: 11
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.0899143
Coefficient of variation (CV): 0.13497615
Kurtosis: -1.0130076
Mean: 8.0748663
Median Absolute Deviation (MAD): 1
Skewness: -0.3765232
Sum: 1510
Variance: 1.1879133
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value  Count  Frequency (%)
9      91     48.7%
7      54     28.9%
8      24     12.8%
6      15     8.0%
10     2      1.1%
11     1      0.5%

Minimum values

Value  Count  Frequency (%)
6      15     8.0%
7      54     28.9%
8      24     12.8%
9      91     48.7%
10     2      1.1%
11     1      0.5%

Maximum values

Value  Count  Frequency (%)
11     1      0.5%
10     2      1.1%
9      91     48.7%
8      24     12.8%
7      54     28.9%
6      15     8.0%

신고일
Real number (ℝ)

HIGH CORRELATION 

Distinct: 15
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23.57754
Minimum: 1
Maximum: 29
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 19
Q1: 24
median: 24
Q3: 25
95-th percentile: 28
Maximum: 29
Range: 28
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 3.9237529
Coefficient of variation (CV): 0.1664191
Kurtosis: 15.657335
Mean: 23.57754
Median Absolute Deviation (MAD): 1
Skewness: -3.354592
Sum: 4409
Variance: 15.395837
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=15)]

Common Values

Value             Count  Frequency (%)
24                86     46.0%
25                39     20.9%
19                21     11.2%
28                16     8.6%
26                13     7.0%
27                2      1.1%
1                 2      1.1%
11                1      0.5%
15                1      0.5%
29                1      0.5%
Other values (5)  5      2.7%

Minimum values

Value  Count  Frequency (%)
1      2      1.1%
2      1      0.5%
11     1      0.5%
13     1      0.5%
15     1      0.5%
17     1      0.5%
19     21     11.2%
20     1      0.5%
22     1      0.5%
24     86     46.0%

Maximum values

Value  Count  Frequency (%)
29     1      0.5%
28     16     8.6%
27     2      1.1%
26     13     7.0%
25     39     20.9%
24     86     46.0%
22     1      0.5%
20     1      0.5%
19     21     11.2%
17     1      0.5%

신고시분초
Real number (ℝ)

HIGH CORRELATION 

Distinct: 185
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 136510.32
Minimum: 65314
Maximum: 235138
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 65314
5-th percentile: 82771.7
Q1: 103837.5
median: 132041
Q3: 165309
95-th percentile: 193302.9
Maximum: 235138
Range: 169824
Interquartile range (IQR): 61471.5

Descriptive statistics

Standard deviation: 38066.73
Coefficient of variation (CV): 0.27885607
Kurtosis: -0.89398524
Mean: 136510.32
Median Absolute Deviation (MAD): 31900
Skewness: 0.17043985
Sum: 25527429
Variance: 1.4490759 × 10^9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value               Count  Frequency (%)
92918               2      1.1%
113744              2      1.1%
123244              1      0.5%
175706              1      0.5%
113822              1      0.5%
114500              1      0.5%
115533              1      0.5%
185317              1      0.5%
114544              1      0.5%
181908              1      0.5%
Other values (175)  175    93.6%

Minimum values

Value  Count  Frequency (%)
65314  1      0.5%
71042  1      0.5%
71824  1      0.5%
72333  1      0.5%
75401  1      0.5%
80918  1      0.5%
82000  1      0.5%
82101  1      0.5%
82632  1      0.5%
82751  1      0.5%

Maximum values

Value   Count  Frequency (%)
235138  1      0.5%
231548  1      0.5%
225617  1      0.5%
211412  1      0.5%
201136  1      0.5%
201134  1      0.5%
194023  1      0.5%
193759  1      0.5%
193606  1      0.5%
193344  1      0.5%

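The column names 수신시분초 and 신고시분초 (roughly "receipt time" and "report time" as hour-minute-second), together with their value ranges, suggest that both columns hold times packed into integers as HHMMSS, for example 83453 for 08:34:53. That encoding is an assumption; the report itself only treats the columns as real numbers. If it holds, statistics such as the mean are not meaningful times of day, because the encoding is not linear in seconds. The sketch below decodes such values and computes the delay between receipt and report for row 0 of the sample; `decode_timestamp` is a hypothetical helper, not part of the dataset or of ydata-profiling.

```python
from datetime import datetime, timedelta

def decode_timestamp(year: int, month: int, day: int, hhmmss: int) -> datetime:
    """Combine date parts with an integer assumed to be HHMMSS-encoded."""
    hours, rest = divmod(hhmmss, 10000)   # leading digits are the hour
    minutes, seconds = divmod(rest, 100)  # then two digits each for min and sec
    return datetime(year, month, day, hours, minutes, seconds)

# Row 0 of the sample: received 2019-6-28 at 83453, reported 2019-6-28 at 143326.
received = decode_timestamp(2019, 6, 28, 83453)
reported = decode_timestamp(2019, 6, 28, 143326)
delay = reported - received

print(received.time(), reported.time(), delay)  # 08:34:53 14:33:26 5:58:33
```

Decoding to real timestamps before profiling would let the time columns be summarised in seconds or as durations rather than as packed integers.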
발신번호
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ***********
2nd row: ***********
3rd row: ***********
4th row: ***********
5th row: ***********

Common Values

Value        Count  Frequency (%)
***********  187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value    Count  Frequency (%)
(blank)  187    100.0%

수신번호
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ***********
2nd row: ***********
3rd row: ***********
4th row: ***********
5th row: ***********

Common Values

Value        Count  Frequency (%)
***********  187    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value    Count  Frequency (%)
(blank)  187    100.0%

Interactions

[Figures: 16 interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

            수신월      수신일      수신시분초  신고월      신고일      신고시분초
수신월      1.000       1.000       0.787       0.991       0.958       0.544
수신일      1.000       1.000       0.863       0.880       0.887       0.604
수신시분초  0.787       0.863       1.000       0.658       0.460       0.870
신고월      0.991       0.880       0.658       1.000       0.877       0.456
신고일      0.958       0.887       0.460       0.877       1.000       0.360
신고시분초  0.544       0.604       0.870       0.456       0.360       1.000

[Figure: correlation heatmap]

            수신일      수신월
수신일      1.000       0.997
수신월      0.997       1.000

[Figure: correlation heatmap]

            수신시분초  신고월      신고일      신고시분초  수신월      수신일
수신시분초  1.000       0.004       0.257       0.678       0.594       0.523
신고월      0.004       1.000       -0.618      0.012       0.933       0.806
신고일      0.257       -0.618      1.000       0.065       0.721       0.714
신고시분초  0.678       0.012       0.065       1.000       0.353       0.291
수신월      0.594       0.933       0.721       0.353       1.000       0.997
수신일      0.523       0.806       0.714       0.291       0.997       1.000

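The 0.997 between 수신일 and 수신월 in the two-variable matrix can be reproduced with a bias-corrected Cramér's V, a common association measure for two categorical columns. The sketch below rests on two assumptions that the report does not state: that this matrix is in fact a bias-corrected Cramér's V, and that the joint counts are the ones implied by the marginal counts (each 수신일 value occurring in a single 수신월). The function name `cramers_v_corrected` is illustrative.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = sum(
        (obs - row_totals[i] * col_totals[j] / n) ** 2
        / (row_totals[i] * col_totals[j] / n)
        for i, row in enumerate(table)
        for j, obs in enumerate(row)
    )
    r, c = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Bias correction applied to phi^2 and to the table dimensions.
    phi2_corr = max(0.0, phi2 - (r - 1) * (c - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    c_corr = c - (c - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, c_corr - 1))

# Assumed joint counts. Rows: 수신월 6, 7, 8, 9; columns: 수신일 13, 19, 24, 25, 28.
table = [
    [0, 0, 0, 0, 19],
    [1, 0, 0, 53, 0],
    [0, 22, 0, 0, 0],
    [0, 0, 92, 0, 0],
]
v = cramers_v_corrected(table)
print(round(v, 3))  # 0.997
```

The agreement with the reported value is consistent with these assumptions but does not confirm them. Under them, the near-perfect association simply reflects that the 187 records cluster on a handful of distinct dates.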
Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     수신년도  수신월  스팸내용       통신사업자코드  기기명  수신일  수신시분초  신고년도  신고월  신고일  신고시분초  발신번호     수신번호
0    2019      6       컨텐츠 비공개  -               -       28      83453       2019      6       28      143326      ***********  ***********
1    2019      6       컨텐츠 비공개  -               -       28      83453       2019      6       28      85015       ***********  ***********
2    2019      6       컨텐츠 비공개  -               -       28      83453       2019      6       28      91600       ***********  ***********
3    2019      6       컨텐츠 비공개  -               -       28      83454       2019      6       28      90252       ***********  ***********
4    2019      6       컨텐츠 비공개  -               -       28      83454       2019      6       28      165618      ***********  ***********
5    2019      6       컨텐츠 비공개  -               -       28      83457       2019      6       28      85527       ***********  ***********
6    2019      6       컨텐츠 비공개  -               -       28      83500       2019      6       28      103342      ***********  ***********
7    2019      6       컨텐츠 비공개  -               -       28      83502       2019      6       28      83957       ***********  ***********
8    2019      6       컨텐츠 비공개  -               -       28      83502       2019      6       28      84552       ***********  ***********
9    2019      6       컨텐츠 비공개  -               -       28      83504       2019      6       28      95324       ***********  ***********

Last rows

     수신년도  수신월  스팸내용       통신사업자코드  기기명  수신일  수신시분초  신고년도  신고월  신고일  신고시분초  발신번호     수신번호
177  2019      9       컨텐츠 비공개  -               -       24      182920      2019      9       24      183246      ***********  ***********
178  2019      9       컨텐츠 비공개  -               -       24      182923      2019      9       24      182936      ***********  ***********
179  2019      9       컨텐츠 비공개  -               -       24      182932      2019      9       24      191942      ***********  ***********
180  2019      9       컨텐츠 비공개  -               -       24      182935      2019      9       24      190114      ***********  ***********
181  2019      9       컨텐츠 비공개  -               -       24      182942      2019      9       24      191307      ***********  ***********
182  2019      9       컨텐츠 비공개  -               -       24      182943      2019      9       24      183534      ***********  ***********
183  2019      9       컨텐츠 비공개  -               -       24      191520      2019      9       24      191946      ***********  ***********
184  2019      9       컨텐츠 비공개  -               -       24      191522      2019      9       24      191540      ***********  ***********
185  2019      9       컨텐츠 비공개  -               -       24      191523      2019      9       28      201136      ***********  ***********
186  2019      9       컨텐츠 비공개  -               -       24      191523      2019      9       27      92918       ***********  ***********