Overview

Dataset statistics

Number of variables: 9
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 4
Duplicate rows (%): 4.0%
Total size in memory: 7.2 KiB
Average record size in memory: 73.3 B

Variable types

Categorical: 7
DateTime: 2

Dataset

Description: Sample data
Author: 지란지교시큐리티
URL: https://www.findatamall.or.kr/market/dataProdDetail?gdsSn=28&gdsSeCd=GENERAL&gdsVer=1

Alerts

등록일 has constant value "" [Constant]
Dataset has 4 (4.0%) duplicate rows [Duplicates]
제목 is highly overall correlated with 구분 and 5 other fields [High correlation]
발신자주소 is highly overall correlated with 구분 and 5 other fields [High correlation]
크기 is highly overall correlated with 구분 and 5 other fields [High correlation]
발신자IP is highly overall correlated with 구분 and 5 other fields [High correlation]
구분 is highly overall correlated with 제목 and 5 other fields [High correlation]
발신국적 is highly overall correlated with 구분 and 5 other fields [High correlation]
첨부유무 is highly overall correlated with 구분 and 5 other fields [High correlation]

Reproduction

Analysis started: 2024-03-03 20:04:21.001109
Analysis finished: 2024-03-03 20:04:21.821427
Duration: 0.82 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
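
A report like this one could be regenerated along the following lines. This is a minimal sketch, not the original invocation: the function name `build_report`, the report title, and any file paths are hypothetical, and the `author` key in the `dataset` dictionary is an assumption inferred from the Author label in the Dataset section. Check the ydata-profiling documentation for the metadata keys supported by your version.

```python
# Metadata shown in the report's "Dataset" section.
DATASET_METADATA = {
    "description": "샘플 데이터",  # "Sample data"
    "author": "지란지교시큐리티",  # key name is an assumption
    "url": "https://www.findatamall.or.kr/market/dataProdDetail?gdsSn=28&gdsSeCd=GENERAL&gdsVer=1",
}


def build_report(csv_path: str, html_path: str) -> None:
    """Profile the CSV at csv_path and write an HTML report to html_path."""
    # Imported lazily so the metadata above can be inspected without
    # pandas or ydata-profiling installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(
        df,
        title="Sample data profile",  # hypothetical title
        dataset=DATASET_METADATA,
    )
    profile.to_file(html_path)
```

Calling `build_report("sample.csv", "report.html")` with paths of your own would then write the HTML report; both file names here are placeholders.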

Variables

구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
스팸 | 75
바이러스 | 25

Length

Max length: 4
Median length: 2
Mean length: 2.5
Min length: 2
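
The Length figures for 구분 follow directly from the value counts: 75 values of two characters and 25 values of four characters. The sketch below recomputes them with the Python standard library, assuming length is measured in Unicode code points; the variable names are illustrative.

```python
from statistics import mean, median

# Value counts for 구분 as reported in this section.
counts = {"스팸": 75, "바이러스": 25}

# Expand the counts into one string length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```

The mean works out to (75 × 2 + 25 × 4) / 100 = 2.5, matching the reported value.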

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 스팸
2nd row: 스팸
3rd row: 스팸
4th row: 바이러스
5th row: 스팸

Common Values

Value | Count | Frequency (%)
스팸 | 75 | 75.0%
바이러스 | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
스팸 | 75 | 75.0%
바이러스 | 25 | 25.0%

등록일
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
Minimum: 2019-07-15 00:00:00
Maximum: 2019-07-15 00:00:00
Histogram with fixed size bins (bins=1)

수신 일시
Date

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
Minimum: 2019-07-15 13:26:00
Maximum: 2019-07-15 17:40:00
Histogram with fixed size bins (bins=4)

제목
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | 25
dreaming of f#cking tonight? | 25
thiszero@jiran.com 服务器升级 | 25
Export Invoice 04 | 25

Length

Max length: 29
Median length: 26
Mean length: 24.5
Min length: 17

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다
2nd row: dreaming of f#cking tonight?
3rd row: thiszero@jiran.com 服务器升级
4th row: Export Invoice 04
5th row: 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다

Common Values

Value | Count | Frequency (%)
우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | 25 | 25.0%
dreaming of f#cking tonight? | 25 | 25.0%
thiszero@jiran.com 服务器升级 | 25 | 25.0%
Export Invoice 04 | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
우리는 | 25 | 5.9%
of | 25 | 5.9%
invoice | 25 | 5.9%
export | 25 | 5.9%
服务器升级 | 25 | 5.9%
thiszero@jiran.com | 25 | 5.9%
tonight | 25 | 5.9%
f#cking | 25 | 5.9%
dreaming | 25 | 5.9%
귀하의 | 25 | 5.9%
Other values (7) | 175 | 41.2%

발신자주소
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
kir458688@kagoya.net | 25
deagmund@mediterranean-real-estate.com | 25
info@accountupdate.com | 25
cerda@kiopo.gq | 25

Length

Max length: 38
Median length: 21
Mean length: 23.5
Min length: 14

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: kir458688@kagoya.net
2nd row: deagmund@mediterranean-real-estate.com
3rd row: info@accountupdate.com
4th row: cerda@kiopo.gq
5th row: kir458688@kagoya.net

Common Values

Value | Count | Frequency (%)
kir458688@kagoya.net | 25 | 25.0%
deagmund@mediterranean-real-estate.com | 25 | 25.0%
info@accountupdate.com | 25 | 25.0%
cerda@kiopo.gq | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
kir458688@kagoya.net | 25 | 25.0%
deagmund@mediterranean-real-estate.com | 25 | 25.0%
info@accountupdate.com | 25 | 25.0%
cerda@kiopo.gq | 25 | 25.0%

발신자IP
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
203.142.200.81 | 25
46.225.240.122 | 25
74.118.138.168 | 25
89.36.213.148 | 25

Length

Max length: 14
Median length: 14
Mean length: 13.75
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 203.142.200.81
2nd row: 46.225.240.122
3rd row: 74.118.138.168
4th row: 89.36.213.148
5th row: 203.142.200.81

Common Values

Value | Count | Frequency (%)
203.142.200.81 | 25 | 25.0%
46.225.240.122 | 25 | 25.0%
74.118.138.168 | 25 | 25.0%
89.36.213.148 | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
203.142.200.81 | 25 | 25.0%
46.225.240.122 | 25 | 25.0%
74.118.138.168 | 25 | 25.0%
89.36.213.148 | 25 | 25.0%

발신국적
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
JP | 25
IR | 25
us | 25
US | 25

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: JP
2nd row: IR
3rd row: us
4th row: US
5th row: JP

Common Values

Value | Count | Frequency (%)
JP | 25 | 25.0%
IR | 25 | 25.0%
us | 25 | 25.0%
US | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
us | 50 | 50.0%
jp | 25 | 25.0%
ir | 25 | 25.0%
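
The first Common Values table for 발신국적 lists `us` and `US` as separate categories, while the second table folds case and reports `us` with a count of 50. A minimal sketch of merging the two spellings before profiling, assuming upper case is the intended convention (the report itself does not say):

```python
from collections import Counter

# Raw 발신국적 value counts as reported in the Common Values table.
raw_counts = {"JP": 25, "IR": 25, "us": 25, "US": 25}

# Fold every code to upper case and merge the counts.
normalized = Counter()
for code, n in raw_counts.items():
    normalized[code.strip().upper()] += n

print(dict(normalized))
```

After normalization the column would have three distinct values instead of four.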

크기
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
5.3 KB (5,425 Byte(s)) | 25
244.9 KB (250,806 Byte(s)) | 25
6.7 KB (6,839 Byte(s)) | 25
439.8 KB (450,335 Byte(s)) | 25

Length

Max length: 26
Median length: 24
Mean length: 24
Min length: 22

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5.3 KB (5,425 Byte(s))
2nd row: 244.9 KB (250,806 Byte(s))
3rd row: 6.7 KB (6,839 Byte(s))
4th row: 439.8 KB (450,335 Byte(s))
5th row: 5.3 KB (5,425 Byte(s))

Common Values

Value | Count | Frequency (%)
5.3 KB (5,425 Byte(s)) | 25 | 25.0%
244.9 KB (250,806 Byte(s)) | 25 | 25.0%
6.7 KB (6,839 Byte(s)) | 25 | 25.0%
439.8 KB (450,335 Byte(s)) | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
kb | 100 | 25.0%
byte(s | 100 | 25.0%
5.3 | 25 | 6.2%
5,425 | 25 | 6.2%
244.9 | 25 | 6.2%
250,806 | 25 | 6.2%
6.7 | 25 | 6.2%
6,839 | 25 | 6.2%
439.8 | 25 | 6.2%
450,335 | 25 | 6.2%
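
크기 is profiled as Categorical because each value is a formatted string such as `5.3 KB (5,425 Byte(s))`. The sketch below extracts the exact byte count so the column could be treated as numeric. The helper name `parse_size_bytes` and the regular expression are illustrative assumptions based only on the four values shown in this section.

```python
import re

# Matches the parenthesized byte count, e.g. "(5,425 Byte(s))".
_BYTES_RE = re.compile(r"\(([\d,]+)\s*Byte\(s\)\)")


def parse_size_bytes(text: str) -> int:
    """Return the byte count embedded in a 크기 string."""
    match = _BYTES_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized size string: {text!r}")
    # Drop the thousands separators before converting.
    return int(match.group(1).replace(",", ""))


sizes = [
    "5.3 KB (5,425 Byte(s))",
    "244.9 KB (250,806 Byte(s))",
    "6.7 KB (6,839 Byte(s))",
    "439.8 KB (450,335 Byte(s))",
]
parsed = [parse_size_bytes(s) for s in sizes]
print(parsed)  # [5425, 250806, 6839, 450335]
```

Strings in any other format raise `ValueError` rather than being silently coerced.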

첨부유무
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
(blank) | 75
Payment-Exp004.xlsx.z (321.4 K) | 25

Length

Max length: 31
Median length: 1
Mean length: 8.5
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: Payment-Exp004.xlsx.z (321.4 K)
5th row: (blank)

Common Values

Value | Count | Frequency (%)
(blank) | 75 | 75.0%
Payment-Exp004.xlsx.z (321.4 K) | 25 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
(blank) | 75 | 50.0%
payment-exp004.xlsx.z | 25 | 16.7%
321.4 | 25 | 16.7%
k | 25 | 16.7%

Correlations

Variable | 구분 | 수신 일시 | 제목 | 발신자주소 | 발신자IP | 발신국적 | 크기 | 첨부유무
구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999
수신 일시 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
제목 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
발신자주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
발신자IP | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
발신국적 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
크기 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
첨부유무 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 제목 | 발신자주소 | 크기 | 발신자IP | 구분 | 발신국적 | 첨부유무
제목 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.990
발신자주소 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.990
크기 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.990
발신자IP | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.990
구분 | 0.990 | 0.990 | 0.990 | 0.990 | 1.000 | 0.990 | 0.973
발신국적 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.990
첨부유무 | 0.990 | 0.990 | 0.990 | 0.990 | 0.973 | 0.990 | 1.000

Variable | 구분 | 제목 | 발신자주소 | 발신자IP | 발신국적 | 크기 | 첨부유무
구분 | 1.000 | 0.990 | 0.990 | 0.990 | 0.990 | 0.990 | 0.973
제목 | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990
발신자주소 | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990
발신자IP | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990
발신국적 | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990
크기 | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990
첨부유무 | 0.973 | 0.990 | 0.990 | 0.990 | 0.990 | 0.990 | 1.000
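
The near-1 values reflect that the columns vary in lockstep: the dataset contains only four distinct rows, so knowing one field fixes all the others. As an illustration only, the sketch below computes an uncorrected Cramér's V for two such columns. The report does not say which association measure each matrix uses, and values such as 0.990 and 0.973 suggest a different measure or a bias correction, so this is not a reproduction of the report's computation. The short subject labels are placeholders standing in for the four 제목 values.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, x_n in x_totals.items():
        for y, y_n in y_totals.items():
            expected = x_n * y_n / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# 제목 and 발신자IP repeat in lockstep: 4 distinct pairs, 25 rows each.
subjects = ["parcel", "dreaming", "upgrade", "invoice"] * 25
ips = ["203.142.200.81", "46.225.240.122", "74.118.138.168", "89.36.213.148"] * 25
print(cramers_v(subjects, ips))  # 1.0
```

With a one-to-one mapping between categories, the chi-squared statistic reaches its maximum of n × (k − 1) = 100 × 3, which gives V = 1.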

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 구분 | 등록일 | 수신 일시 | 제목 | 발신자주소 | 발신자IP | 발신국적 | 크기 | 첨부유무
0 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank)
1 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank)
2 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank)
3 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K)
4 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank)
5 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank)
6 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank)
7 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K)
8 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank)
9 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank)

Last rows

Index | 구분 | 등록일 | 수신 일시 | 제목 | 발신자주소 | 발신자IP | 발신국적 | 크기 | 첨부유무
90 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank)
91 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K)
92 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank)
93 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank)
94 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank)
95 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K)
96 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank)
97 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank)
98 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank)
99 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K)

Duplicate rows

Most frequently occurring

Index | 구분 | 등록일 | 수신 일시 | 제목 | 발신자주소 | 발신자IP | 발신국적 | 크기 | 첨부유무 | # duplicates
0 | 바이러스 | 2019.07.15 | 2019-07-15 13:26 | Export Invoice 04 | cerda@kiopo.gq | 89.36.213.148 | US | 439.8 KB (450,335 Byte(s)) | Payment-Exp004.xlsx.z (321.4 K) | 25
1 | 스팸 | 2019.07.15 | 2019-07-15 17:27 | 우리는 귀하의 소포를 수취인에게 배달 할 수 없습니다 | kir458688@kagoya.net | 203.142.200.81 | JP | 5.3 KB (5,425 Byte(s)) | (blank) | 25
2 | 스팸 | 2019.07.15 | 2019-07-15 17:29 | dreaming of f#cking tonight? | deagmund@mediterranean-real-estate.com | 46.225.240.122 | IR | 244.9 KB (250,806 Byte(s)) | (blank) | 25
3 | 스팸 | 2019.07.15 | 2019-07-15 17:40 | thiszero@jiran.com 服务器升级 | info@accountupdate.com | 74.118.138.168 | us | 6.7 KB (6,839 Byte(s)) | (blank) | 25
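
The overview reports 4 duplicate rows (4.0%), while the table above shows four row patterns that each occur 25 times. Those figures agree if the headline number counts distinct duplicated patterns rather than redundant rows. That reading is an inference from the numbers in this report, not something the report states. The sketch below computes both counts, using the 수신 일시 times as stand-ins for full rows.

```python
from collections import Counter

# One key per observation; each of the four row patterns occurs 25 times.
rows = ["13:26", "17:27", "17:29", "17:40"] * 25

pattern_counts = Counter(rows)

# Distinct patterns that occur more than once.
duplicated_patterns = sum(1 for n in pattern_counts.values() if n > 1)
# Rows that would be dropped when keeping one copy of each pattern.
redundant_rows = sum(n - 1 for n in pattern_counts.values())

print(duplicated_patterns, redundant_rows)  # 4 96
```

Under this reading, deduplicating the dataset would leave 4 of the 100 observations.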