Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 128 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.2 KiB |
Average record size in memory | 50.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 1 |
Categorical | 3 |
DateTime | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 지란지교시큐리티 |
URL | https://www.findatamall.or.kr/market/dataProdDetail?gdsSn=27&gdsSeCd=GENERAL&gdsVer=1 |
Reproduction
Analysis started | 2024-03-03 10:03:54.668427 |
---|---|
Analysis finished | 2024-03-03 10:03:55.833672 |
Duration | 1.17 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
필터ID
Real number (ℝ)
UNIQUE
 
Distinct | 128 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9446924 |
Minimum | 9428137 |
---|---|
Maximum | 9447169 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 9428137 |
---|---|
5-th percentile | 9447048.3 |
Q1 | 9447073.8 |
median | 9447105.5 |
Q3 | 9447137.2 |
95-th percentile | 9447162.7 |
Maximum | 9447169 |
Range | 19032 |
Interquartile range (IQR) | 63.5 |
Descriptive statistics
Standard deviation | 1716.8674 |
---|---|
Coefficient of variation (CV) | 0.00018173825 |
Kurtosis | 115.56187 |
Mean | 9446924 |
Median Absolute Deviation (MAD) | 32 |
Skewness | -10.590766 |
Sum | 1.2092063 × 109 |
Variance | 2947633.6 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9447169 | 1 | 0.8% |
9447108 | 1 | 0.8% |
9447069 | 1 | 0.8% |
9447076 | 1 | 0.8% |
9447077 | 1 | 0.8% |
9447078 | 1 | 0.8% |
9447079 | 1 | 0.8% |
9447084 | 1 | 0.8% |
9447083 | 1 | 0.8% |
9447082 | 1 | 0.8% |
Other values (118) | 118 |
Value | Count | Frequency (%) |
9428137 | 1 | |
9442795 | 1 | |
9446969 | 1 | |
9447042 | 1 | |
9447043 | 1 | |
9447047 | 1 | |
9447048 | 1 | |
9447049 | 1 | |
9447050 | 1 | |
9447051 | 1 |
Value | Count | Frequency (%) |
9447169 | 1 | |
9447168 | 1 | |
9447167 | 1 | |
9447166 | 1 | |
9447165 | 1 | |
9447164 | 1 | |
9447163 | 1 | |
9447162 | 1 | |
9447161 | 1 | |
9447160 | 1 |
필터링 값
Text
UNIQUE
 
Distinct | 128 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Length
Max length | 50 |
---|---|
Median length | 34 |
Mean length | 18.421875 |
Min length | 10 |
Characters and Unicode
Total characters | 2358 |
---|---|
Distinct characters | 73 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 128 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 1GtdpkxN7izueE1696LQUSRB72Mh71BeNh |
---|---|
2nd row | mrallison60@gmail.com |
3rd row | 2748482748&extra=&&&484827&&&vxc.azurewebsites.net |
4th row | hronicealt1d8f.top |
5th row | esliston.xyz |
Value | Count | Frequency (%) |
1gtdpkxn7izuee1696lqusrb72mh71benh | 1 | 0.7% |
wqk4gsm74w.biz | 1 | 0.7% |
pasteascript.com/home | 1 | 0.7% |
y6hitbsfc3.biz | 1 | 0.7% |
gruyerec7nsgvday.onion.pet | 1 | 0.7% |
glasees.duckdns.org | 1 | 0.7% |
kidocx.xyz | 1 | 0.7% |
newyork-defense-lawyer.com/wp-http | 1 | 0.7% |
0jl5l1rntu.biz | 1 | 0.7% |
8b0p0zldx4.biz | 1 | 0.7% |
Other values (127) | 127 |
Most occurring characters
Value | Count | Frequency (%) |
. | 214 | 9.1% |
o | 142 | 6.0% |
i | 139 | 5.9% |
a | 122 | 5.2% |
c | 121 | 5.1% |
e | 121 | 5.1% |
l | 100 | 4.2% |
n | 86 | 3.6% |
m | 84 | 3.6% |
t | 82 | 3.5% |
Other values (63) | 1147 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1787 | |
Other Punctuation | 263 | 11.2% |
Decimal Number | 244 | 10.3% |
Uppercase Letter | 26 | 1.1% |
Dash Punctuation | 16 | 0.7% |
Space Separator | 9 | 0.4% |
Other Letter | 7 | 0.3% |
Connector Punctuation | 3 | 0.1% |
Open Punctuation | 1 | < 0.1% |
Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 142 | 7.9% |
i | 139 | 7.8% |
a | 122 | 6.8% |
c | 121 | 6.8% |
e | 121 | 6.8% |
l | 100 | 5.6% |
n | 86 | 4.8% |
m | 84 | 4.7% |
t | 82 | 4.6% |
r | 81 | 4.5% |
Other values (16) | 709 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 3 | 11.5% |
A | 2 | 7.7% |
N | 2 | 7.7% |
C | 2 | 7.7% |
G | 2 | 7.7% |
U | 2 | 7.7% |
S | 2 | 7.7% |
K | 1 | 3.8% |
I | 1 | 3.8% |
H | 1 | 3.8% |
Other values (8) | 8 |
Decimal Number
Value | Count | Frequency (%) |
7 | 31 | |
8 | 28 | |
1 | 28 | |
5 | 26 | |
4 | 25 | |
0 | 24 | |
2 | 24 | |
6 | 23 | |
9 | 21 | |
3 | 14 |
Other Letter
Value | Count | Frequency (%) |
이 | 1 | |
해 | 1 | |
요 | 1 | |
필 | 1 | |
움 | 1 | |
도 | 1 | |
네 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 214 | |
/ | 21 | 8.0% |
@ | 17 | 6.5% |
& | 7 | 2.7% |
: | 3 | 1.1% |
! | 1 | 0.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16 |
Space Separator
Value | Count | Frequency (%) |
9 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1813 | |
Common | 538 | 22.8% |
Hangul | 7 | 0.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 142 | 7.8% |
i | 139 | 7.7% |
a | 122 | 6.7% |
c | 121 | 6.7% |
e | 121 | 6.7% |
l | 100 | 5.5% |
n | 86 | 4.7% |
m | 84 | 4.6% |
t | 82 | 4.5% |
r | 81 | 4.5% |
Other values (34) | 735 |
Common
Value | Count | Frequency (%) |
. | 214 | |
7 | 31 | 5.8% |
8 | 28 | 5.2% |
1 | 28 | 5.2% |
5 | 26 | 4.8% |
4 | 25 | 4.6% |
0 | 24 | 4.5% |
2 | 24 | 4.5% |
6 | 23 | 4.3% |
9 | 21 | 3.9% |
Other values (12) | 94 |
Hangul
Value | Count | Frequency (%) |
이 | 1 | |
해 | 1 | |
요 | 1 | |
필 | 1 | |
움 | 1 | |
도 | 1 | |
네 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2351 | |
Hangul | 7 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 214 | 9.1% |
o | 142 | 6.0% |
i | 139 | 5.9% |
a | 122 | 5.2% |
c | 121 | 5.1% |
e | 121 | 5.1% |
l | 100 | 4.3% |
n | 86 | 3.7% |
m | 84 | 3.6% |
t | 82 | 3.5% |
Other values (56) | 1140 |
Hangul
Value | Count | Frequency (%) |
이 | 1 | |
해 | 1 | |
요 | 1 | |
필 | 1 | |
움 | 1 | |
도 | 1 | |
네 | 1 |
필터링 대상
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
본문의 URL | |
---|---|
본문 | |
보내는 사람 전체 | 5 |
보내는 메일 서버 Reply to | 4 |
제목 | 1 |
Length
Max length | 18 |
---|---|
Median length | 7 |
Mean length | 6.40625 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | 본문 |
---|---|
2nd row | 본문 |
3rd row | 본문 |
4th row | 본문의 URL |
5th row | 본문의 URL |
Common Values
Value | Count | Frequency (%) |
본문의 URL | 92 | |
본문 | 25 | 19.5% |
보내는 사람 전체 | 5 | 3.9% |
보내는 메일 서버 Reply to | 4 | 3.1% |
제목 | 1 | 0.8% |
첨부파일 이름 | 1 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
본문의 | 92 | |
url | 92 | |
본문 | 25 | 10.1% |
보내는 | 9 | 3.6% |
사람 | 5 | 2.0% |
전체 | 5 | 2.0% |
메일 | 4 | 1.6% |
서버 | 4 | 1.6% |
reply | 4 | 1.6% |
to | 4 | 1.6% |
Other values (3) | 3 | 1.2% |
필터링 조건
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
포함하면 | |
---|---|
일치하면 | 4 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 포함하면 |
---|---|
2nd row | 포함하면 |
3rd row | 포함하면 |
4th row | 포함하면 |
5th row | 포함하면 |
Common Values
Value | Count | Frequency (%) |
포함하면 | 124 | |
일치하면 | 4 | 3.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
포함하면 | 124 | |
일치하면 | 4 | 3.1% |
분류
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
성인(A) | |
---|---|
피싱(X) | |
홍보(P) | |
홍보(P), 피싱(X) | 1 |
Length
Max length | 12 |
---|---|
Median length | 5 |
Mean length | 5.0546875 |
Min length | 5 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | 피싱(X) |
---|---|
2nd row | 홍보(P) |
3rd row | 피싱(X) |
4th row | 홍보(P) |
5th row | 홍보(P) |
Common Values
Value | Count | Frequency (%) |
성인(A) | 65 | |
피싱(X) | 42 | |
홍보(P) | 20 | 15.6% |
홍보(P), 피싱(X) | 1 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
성인(a | 65 | |
피싱(x | 43 | |
홍보(p | 21 | 16.3% |
수정시간
Date
Distinct | 75 |
---|---|
Distinct (%) | 58.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Minimum | 2019-07-15 15:44:39 |
---|---|
Maximum | 2019-07-15 18:04:42 |
필터ID | 필터링 대상 | 필터링 조건 | 분류 | 수정시간 | |
---|---|---|---|---|---|
필터ID | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 |
필터링 대상 | 0.000 | 1.000 | 0.930 | 0.605 | 1.000 |
필터링 조건 | 0.000 | 0.930 | 1.000 | 0.390 | 1.000 |
분류 | 0.000 | 0.605 | 0.390 | 1.000 | 0.996 |
수정시간 | 1.000 | 1.000 | 1.000 | 0.996 | 1.000 |
필터링 조건 | 분류 | 필터링 대상 | |
---|---|---|---|
필터링 조건 | 1.000 | 0.259 | 0.752 |
분류 | 0.259 | 1.000 | 0.432 |
필터링 대상 | 0.752 | 0.432 | 1.000 |
필터ID | 필터링 대상 | 필터링 조건 | 분류 | |
---|---|---|---|---|
필터ID | 1.000 | 0.000 | 0.000 | 0.000 |
필터링 대상 | 0.000 | 1.000 | 0.752 | 0.432 |
필터링 조건 | 0.000 | 0.752 | 1.000 | 0.259 |
분류 | 0.000 | 0.432 | 0.259 | 1.000 |
필터ID | 필터링 값 | 필터링 대상 | 필터링 조건 | 분류 | 수정시간 | |
---|---|---|---|---|---|---|
0 | 9447169 | 1GtdpkxN7izueE1696LQUSRB72Mh71BeNh | 본문 | 포함하면 | 피싱(X) | 2019-07-15 18:04:42 |
1 | 9447168 | mrallison60@gmail.com | 본문 | 포함하면 | 홍보(P) | 2019-07-15 18:04:02 |
2 | 9447167 | 2748482748&extra=&&&484827&&&vxc.azurewebsites.net | 본문 | 포함하면 | 피싱(X) | 2019-07-15 18:03:43 |
3 | 9447166 | hronicealt1d8f.top | 본문의 URL | 포함하면 | 홍보(P) | 2019-07-15 18:02:53 |
4 | 9447165 | esliston.xyz | 본문의 URL | 포함하면 | 홍보(P) | 2019-07-15 18:02:12 |
5 | 9447164 | safra.nationalbank.enquiaries@gmail.com | 본문 | 포함하면 | 홍보(P) | 2019-07-15 17:56:44 |
6 | 9447163 | wo7783@gmail.com | 보내는 메일 서버 Reply to | 일치하면 | 홍보(P) | 2019-07-15 17:55:21 |
7 | 9447162 | chonghinatakaitu56@gmail.com | 본문 | 포함하면 | 홍보(P) | 2019-07-15 17:53:38 |
8 | 9447161 | contabilidadeatual.com | 본문의 URL | 포함하면 | 피싱(X) | 2019-07-15 17:50:18 |
9 | 9447160 | bannerman_jp@yahoo.com | 보내는 사람 전체 | 포함하면 | 홍보(P) | 2019-07-15 17:48:57 |
필터ID | 필터링 값 | 필터링 대상 | 필터링 조건 | 분류 | 수정시간 | |
---|---|---|---|---|---|---|
118 | 9447050 | .swing-particular.com | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
119 | 9447051 | .fza0yslzpz7.cloud | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
120 | 9447052 | .ggut7reuar.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
121 | 9447053 | .4lrj9zaex1.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
122 | 9447054 | .2qqo9eic76.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
123 | 9447055 | .bcgkwtnauigv.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:54:11 |
124 | 9447048 | HGCK181160287-CIFA (1).lzh | 첨부파일 이름 | 일치하면 | 피싱(X) | 2019-07-15 15:48:02 |
125 | 9447047 | /secnote24.club | 본문 | 포함하면 | 피싱(X) | 2019-07-15 15:45:52 |
126 | 9447042 | .ijv4l4yjv7.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:44:39 |
127 | 9447043 | .i1pr5jctqy.biz | 본문의 URL | 포함하면 | 성인(A) | 2019-07-15 15:44:39 |