Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 155 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.6 KiB |
Average record size in memory | 76.9 B |
Variable types
Categorical | 7 |
---|---|
Numeric | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국인터넷진흥원 (Korea Internet & Security Agency, KISA) |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000021 |
Alerts
수신년도 has constant value "2017" | Constant |
수신월 has constant value "1" | Constant |
이메일제목명 has constant value "컨텐츠 비공개" | Constant |
이메일내용 has constant value "-" | Constant |
발송국가명 is highly overall correlated with 수신일 and 1 other fields | High correlation |
인터넷회사명 is highly overall correlated with 수신일 and 1 other fields | High correlation |
수신일 is highly overall correlated with 발송국가명 and 1 other fields | High correlation |
수신일 is highly imbalanced (61.8%) | Imbalance |
발송IP주소 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 06:30:07.805480 |
---|---|
Analysis finished | 2023-12-10 06:30:08.651211 |
Duration | 0.85 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
수신년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
2017 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2017 |
---|---|
2nd row | 2017 |
3rd row | 2017 |
4th row | 2017 |
5th row | 2017 |
Common Values
Value | Count | Frequency (%) |
2017 | 155 | 100.0% |
Words
Value | Count | Frequency (%) |
2017 | 155 | 100.0% |
수신월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 155 | 100.0% |
Words
Value | Count | Frequency (%) |
1 | 155 | 100.0% |
수신일
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
1 | 138 |
---|---|
10 | 10 |
5 | 7 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.0645161 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 138 | 89.0% |
10 | 10 | 6.5% |
5 | 7 | 4.5% |
Words
Value | Count | Frequency (%) |
1 | 138 | 89.0% |
10 | 10 | 6.5% |
5 | 7 | 4.5% |
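The 61.8% imbalance reported for 수신일 is consistent with one minus the normalized Shannon entropy of the three value counts (138, 10, 7). The sketch below assumes that this is how the score is derived; `imbalance_score` is a hypothetical helper name, not a library function.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class distribution and k is the number of classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    k = len(counts)
    if k <= 1:
        # Convention chosen for this sketch only: a single class has no spread to measure.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(k)

# Value counts for 수신일 as listed in this section.
day_counts = {"1": 138, "10": 10, "5": 7}
score = imbalance_score(list(day_counts.values()))
print(f"{score:.1%}")
```

A perfectly even split such as `imbalance_score([50, 50, 50])` gives a score of 0 up to floating-point rounding, which is why a single dominant day pushes the score well above the midpoint.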
수신시분초
Real number (ℝ)
Distinct | 151 |
---|---|
Distinct (%) | 97.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 145250.89 |
Minimum | 41147 |
---|---|
Maximum | 233645 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 41147 |
---|---|
5-th percentile | 61169.3 |
Q1 | 97872 |
median | 150148 |
Q3 | 192502.5 |
95-th percentile | 224413 |
Maximum | 233645 |
Range | 192498 |
Interquartile range (IQR) | 94630.5 |
Descriptive statistics
Standard deviation | 53458.191 |
---|---|
Coefficient of variation (CV) | 0.36804037 |
Kurtosis | -1.1476444 |
Mean | 145250.89 |
Median Absolute Deviation (MAD) | 43673 |
Skewness | -0.057833688 |
Sum | 22513888 |
Variance | 2.8577782 × 10⁹ |
Monotonicity | Not monotonic |
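The quantile and descriptive statistics above can be cross-checked against each other with a few lines of arithmetic. This is only a consistency check on the rounded figures printed in the report, not a recomputation from the raw data.

```python
# Figures copied from the report for 수신시분초 (rounded as printed).
n = 155
total = 22513888
std = 53458.191
q1, q3 = 97872, 192502.5
minimum, maximum = 41147, 233645

mean = total / n               # Sum / number of observations
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum

print(f"mean={mean:.2f} cv={cv:.6f} variance={variance:.4e}")
print(f"iqr={iqr} range={value_range}")
```

Small differences in the last printed digits are expected because the inputs are already rounded.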
Common values
Value | Count | Frequency (%) |
74522 | 2 | 1.3% |
74843 | 2 | 1.3% |
94119 | 2 | 1.3% |
74520 | 2 | 1.3% |
205933 | 1 | 0.6% |
214553 | 1 | 0.6% |
213814 | 1 | 0.6% |
202819 | 1 | 0.6% |
123347 | 1 | 0.6% |
123415 | 1 | 0.6% |
Other values (141) | 141 | 91.0% |
Minimum 10 values
Value | Count | Frequency (%) |
41147 | 1 | 0.6% |
50327 | 1 | 0.6% |
53245 | 1 | 0.6% |
53317 | 1 | 0.6% |
53713 | 1 | 0.6% |
60520 | 1 | 0.6% |
60830 | 1 | 0.6% |
60834 | 1 | 0.6% |
61313 | 1 | 0.6% |
62454 | 1 | 0.6% |
Maximum 10 values
Value | Count | Frequency (%) |
233645 | 1 | 0.6% |
232431 | 1 | 0.6% |
231330 | 1 | 0.6% |
231321 | 1 | 0.6% |
231131 | 1 | 0.6% |
230724 | 1 | 0.6% |
230659 | 1 | 0.6% |
225540 | 1 | 0.6% |
223930 | 1 | 0.6% |
223901 | 1 | 0.6% |
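수신시분초 is profiled as a real number, but its values look like clock times packed into HHMMSS integers (the column name reads roughly as "reception hour-minute-second"). Under that assumption, which the report itself does not confirm, statistics such as the mean are taken over the packed integers rather than over actual times. The sketch below decodes the values; both helper names are hypothetical.

```python
from datetime import time

def hhmmss_to_time(value: int) -> time:
    """Decode an integer such as 41147 into time(4, 11, 47).
    Assumes the HHMMSS packing described above; raises ValueError
    if the digits do not form a valid clock time."""
    hours, remainder = divmod(value, 10_000)
    minutes, seconds = divmod(remainder, 100)
    return time(hours, minutes, seconds)

def seconds_since_midnight(value: int) -> int:
    """Convert a packed HHMMSS integer to a linear scale suitable for averaging."""
    t = hhmmss_to_time(value)
    return t.hour * 3600 + t.minute * 60 + t.second

# Minimum and maximum values reported for 수신시분초.
for packed in (41147, 233645):
    print(packed, "->", hhmmss_to_time(packed).isoformat())
```

If the assumption holds, converting to seconds since midnight before computing means or correlations avoids the gaps in the packed encoding (for example, no valid value lies between 45959 and 50000).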
발송국가명
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
CN | 89 |
---|---|
- | 62 |
US | 4 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.6 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
CN | 89 | 57.4% |
- | 62 | 40.0% |
US | 4 | 2.6% |
Words
Value | Count | Frequency (%) |
cn | 89 | 57.4% |
 | 62 | 40.0% |
us | 4 | 2.6% |
발송IP주소
Text
UNIQUE
 
Distinct | 155 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 11.529032 |
Min length | 10 |
Characters and Unicode
Total characters | 1787 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
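The two character categories reported here correspond to Unicode general categories, which Python exposes through the standard `unicodedata` module. The sketch below classifies the characters of the five sample addresses shown in this section; the mapping from short codes to the long names used in the report is written out by hand and covers only the two categories that occur.

```python
import unicodedata
from collections import Counter

# The five sample rows of 발송IP주소 (one octet is masked with "*" in the data).
samples = [
    "103.*.21.180",
    "103.*.21.182",
    "103.*.21.183",
    "103.*.21.184",
    "103.*.21.189",
]

# Hand-written labels for the two general categories seen in this column.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Po": "Other Punctuation"}

text = "".join(samples)
categories = Counter(unicodedata.category(ch) for ch in text)

for code, count in categories.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Both "." and the "*" mask fall into Other Punctuation, which is why the column has 12 distinct characters (ten digits plus two punctuation marks) but only two categories.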
Unique
Unique | 155 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 103.*.21.180 |
---|---|
2nd row | 103.*.21.182 |
3rd row | 103.*.21.183 |
4th row | 103.*.21.184 |
5th row | 103.*.21.189 |
Words
Value | Count | Frequency (%) |
103.*.21.180 | 1 | 0.6% |
119.*.78.62 | 1 | 0.6% |
119.*.79.223 | 1 | 0.6% |
119.*.78.233 | 1 | 0.6% |
119.*.78.24 | 1 | 0.6% |
119.*.78.245 | 1 | 0.6% |
119.*.78.31 | 1 | 0.6% |
119.*.78.54 | 1 | 0.6% |
119.*.78.6 | 1 | 0.6% |
119.*.78.88 | 1 | 0.6% |
Other values (145) | 145 | 93.5% |
Most occurring characters
Value | Count | Frequency (%) |
. | 465 | 26.0% |
1 | 337 | 18.9% |
* | 155 | 8.7% |
2 | 139 | 7.8% |
3 | 135 | 7.6% |
7 | 131 | 7.3% |
9 | 124 | 6.9% |
0 | 99 | 5.5% |
6 | 76 | 4.3% |
8 | 57 | 3.2% |
Other values (2) | 69 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1167 | 65.3% |
Other Punctuation | 620 | 34.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 337 | 28.9% |
2 | 139 | 11.9% |
3 | 135 | 11.6% |
7 | 131 | 11.2% |
9 | 124 | 10.6% |
0 | 99 | 8.5% |
6 | 76 | 6.5% |
8 | 57 | 4.9% |
4 | 37 | 3.2% |
5 | 32 | 2.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 465 | 75.0% |
* | 155 | 25.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1787 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 465 | 26.0% |
1 | 337 | 18.9% |
* | 155 | 8.7% |
2 | 139 | 7.8% |
3 | 135 | 7.6% |
7 | 131 | 7.3% |
9 | 124 | 6.9% |
0 | 99 | 5.5% |
6 | 76 | 4.3% |
8 | 57 | 3.2% |
Other values (2) | 69 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1787 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 465 | 26.0% |
1 | 337 | 18.9% |
* | 155 | 8.7% |
2 | 139 | 7.8% |
3 | 135 | 7.6% |
7 | 131 | 7.3% |
9 | 124 | 6.9% |
0 | 99 | 5.5% |
6 | 76 | 4.3% |
8 | 57 | 3.2% |
Other values (2) | 69 | 3.9% |
이메일제목명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
컨텐츠 비공개 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 컨텐츠 비공개 |
---|---|
2nd row | 컨텐츠 비공개 |
3rd row | 컨텐츠 비공개 |
4th row | 컨텐츠 비공개 |
5th row | 컨텐츠 비공개 |
Common Values
Value | Count | Frequency (%) |
컨텐츠 비공개 | 155 | 100.0% |
Words
Value | Count | Frequency (%) |
컨텐츠 | 155 | 50.0% |
비공개 | 155 | 50.0% |
이메일내용
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
- |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 155 | 100.0% |
Words
Value | Count | Frequency (%) |
 | 155 | 100.0% |
인터넷회사명
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
CHINANET HUBEI PROVINCE NETWORK | 79 |
---|---|
- | 62 |
JIANGSU GROUP CO. NANJING JIANGSU PROVINCE | 10 |
E.I. DU PONT DE NEMOURS AND CO. INC. | 4 |
Length
Max length | 42 |
---|---|
Median length | 31 |
Mean length | 19.83871 |
Min length | 1 |
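The length statistics above follow directly from the four distinct values of 인터넷회사명 and their counts as reported in this section. A minimal sketch that rebuilds them from those counts (not from the raw rows):

```python
from statistics import mean, median

# Distinct values of 인터넷회사명 with the counts given in this report.
value_counts = {
    "CHINANET HUBEI PROVINCE NETWORK": 79,
    "-": 62,
    "JIANGSU GROUP CO. NANJING JIANGSU PROVINCE": 10,
    "E.I. DU PONT DE NEMOURS AND CO. INC.": 4,
}

# Expand to one string length per observation (155 in total).
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

print("max", max(lengths))
print("median", median(lengths))
print("mean", round(mean(lengths), 5))
print("min", min(lengths))
```

The mean of about 19.8 sits well below the median of 31 because the 62 placeholder "-" entries each contribute a length of 1.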
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
CHINANET HUBEI PROVINCE NETWORK | 79 | 51.0% |
- | 62 | 40.0% |
JIANGSU GROUP CO. NANJING JIANGSU PROVINCE | 10 | 6.5% |
E.I. DU PONT DE NEMOURS AND CO. INC. | 4 | 2.6% |
Words
Value | Count | Frequency (%) |
province | 89 | 18.9% |
chinanet | 79 | 16.8% |
hubei | 79 | 16.8% |
network | 79 | 16.8% |
 | 62 | 13.2% |
jiangsu | 20 | 4.3% |
co | 14 | 3.0% |
group | 10 | 2.1% |
nanjing | 10 | 2.1% |
e.i | 4 | 0.9% |
Other values (6) | 24 | 5.1% |
Correlations
Variable | 수신일 | 수신시분초 | 발송국가명 | 인터넷회사명 |
---|---|---|---|---|
수신일 | 1.000 | 0.167 | 0.860 | 0.817 |
수신시분초 | 0.167 | 1.000 | 0.487 | 0.461 |
발송국가명 | 0.860 | 0.487 | 1.000 | 1.000 |
인터넷회사명 | 0.817 | 0.461 | 1.000 | 1.000 |
Variable | 발송국가명 | 인터넷회사명 | 수신일 |
---|---|---|---|
발송국가명 | 1.000 | 0.997 | 0.548 |
인터넷회사명 | 0.997 | 1.000 | 0.880 |
수신일 | 0.548 | 0.880 | 1.000 |
Variable | 수신시분초 | 수신일 | 발송국가명 | 인터넷회사명 |
---|---|---|---|---|
수신시분초 | 1.000 | 0.127 | 0.346 | 0.301 |
수신일 | 0.127 | 1.000 | 0.548 | 0.880 |
발송국가명 | 0.346 | 0.548 | 1.000 | 0.997 |
인터넷회사명 | 0.301 | 0.880 | 0.997 | 1.000 |
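The 0.997 association between 발송국가명 and 인터넷회사명 is consistent with a bias-corrected Cramér's V. The sketch below reproduces it under one assumption that the report does not state directly: the joint counts are inferred from the marginal counts (all 89 CN rows split as 79 + 10 across the two Chinese networks, the 62 "-" countries pair with the "-" company, and the 4 US rows pair with the E.I. DU PONT entry). The function name is hypothetical.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic from observed and expected counts.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(table), len(table[0])
    phi2 = chi2 / n
    # Small-sample bias correction applied to phi^2 and to the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

# Rows: CN, -, US. Columns: CHINANET, -, JIANGSU, E.I. DU PONT.
# ASSUMPTION: joint counts inferred from the marginal counts in this report.
assumed_table = [
    [79, 0, 10, 0],
    [0, 62, 0, 0],
    [0, 0, 0, 4],
]
v = cramers_v_corrected(assumed_table)
print(round(v, 3))
```

Without the correction the same table gives exactly 1, since the company name fully determines the country; the correction is what pulls the value slightly below 1 for 155 rows.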
First rows
Index | 수신년도 | 수신월 | 수신일 | 수신시분초 | 발송국가명 | 발송IP주소 | 이메일제목명 | 이메일내용 | 인터넷회사명 |
---|---|---|---|---|---|---|---|---|---|
0 | 2017 | 1 | 1 | 205933 | - | 103.*.21.180 | 컨텐츠 비공개 | - | - |
1 | 2017 | 1 | 1 | 60834 | - | 103.*.21.182 | 컨텐츠 비공개 | - | - |
2 | 2017 | 1 | 1 | 153935 | - | 103.*.21.183 | 컨텐츠 비공개 | - | - |
3 | 2017 | 1 | 1 | 153740 | - | 103.*.21.184 | 컨텐츠 비공개 | - | - |
4 | 2017 | 1 | 1 | 102401 | - | 103.*.21.189 | 컨텐츠 비공개 | - | - |
5 | 2017 | 1 | 1 | 153916 | - | 103.*.21.190 | 컨텐츠 비공개 | - | - |
6 | 2017 | 1 | 1 | 60830 | - | 103.*.21.191 | 컨텐츠 비공개 | - | - |
7 | 2017 | 1 | 1 | 205923 | - | 103.*.21.192 | 컨텐츠 비공개 | - | - |
8 | 2017 | 1 | 1 | 102308 | - | 103.*.21.66 | 컨텐츠 비공개 | - | - |
9 | 2017 | 1 | 1 | 195253 | - | 103.*.21.69 | 컨텐츠 비공개 | - | - |
Last rows
Index | 수신년도 | 수신월 | 수신일 | 수신시분초 | 발송국가명 | 발송IP주소 | 이메일제목명 | 이메일내용 | 인터넷회사명 |
---|---|---|---|---|---|---|---|---|---|
145 | 2017 | 1 | 10 | 135713 | CN | 112.*.75.171 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
146 | 2017 | 1 | 10 | 222217 | CN | 112.*.75.187 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
147 | 2017 | 1 | 10 | 165315 | CN | 112.*.75.26 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
148 | 2017 | 1 | 10 | 221834 | CN | 112.*.75.46 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
149 | 2017 | 1 | 10 | 184752 | CN | 112.*.76.162 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
150 | 2017 | 1 | 10 | 193821 | CN | 112.*.76.30 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
151 | 2017 | 1 | 10 | 215324 | CN | 112.*.76.4 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
152 | 2017 | 1 | 10 | 185347 | CN | 112.*.76.62 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
153 | 2017 | 1 | 10 | 223053 | CN | 112.*.76.97 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |
154 | 2017 | 1 | 10 | 214150 | CN | 112.*.77.130 | 컨텐츠 비공개 | - | JIANGSU GROUP CO. NANJING JIANGSU PROVINCE |