Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 153 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 6 |
Duplicate rows (%) | 3.9% |
Total size in memory | 7.9 KiB |
Average record size in memory | 52.9 B |
Variable types
Categorical | 4 |
---|---|
Numeric | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국인터넷진흥원 (Korea Internet & Security Agency, KISA) |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000004 |
Alerts
생성년도 has constant value "2019" | Constant |
Dataset has 6 (3.9%) duplicate rows | Duplicates |
생성월 is highly overall correlated with 생성시분초 and 1 other fields | High correlation |
생성일 is highly overall correlated with 생성시분초 and 1 other fields | High correlation |
생성시분초 is highly overall correlated with 생성월 and 2 other fields | High correlation |
URL is highly overall correlated with 생성시분초 | High correlation |
URL is highly imbalanced (91.5%) | Imbalance |
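The 91.5% imbalance figure for URL can be reproduced from that column's value counts (150, 1, 1, 1). The sketch below assumes the score is one minus the Shannon entropy of the class frequencies divided by the maximum entropy for that number of classes. The function name `imbalance_score` is illustrative and is not taken from the profiling library.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max for a list of class counts (0 = balanced, near 1 = one class dominates)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is not scored as imbalanced
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Value counts of the URL column as reported: "-" (150) and three unique URLs.
url_counts = [150, 1, 1, 1]
score = imbalance_score(url_counts)
print(f"{score:.1%}")  # 91.5%
```

A perfectly balanced column scores 0 under this definition, so a score above 0.9 indicates that almost all rows share one value.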
Reproduction
Analysis started | 2023-12-10 06:36:31.023536 |
---|---|
Analysis finished | 2023-12-10 06:36:31.737540 |
Duration | 0.71 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
생성년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2019 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 153 | 100.0% |
생성월
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5 |
---|---|
2nd row | 5 |
3rd row | 5 |
4th row | 5 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
5 | 107 | 69.9% |
7 | 46 | 30.1% |
생성일
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 23 |
---|---|
2nd row | 23 |
3rd row | 23 |
4th row | 23 |
5th row | 23 |
Common Values
Value | Count | Frequency (%) |
22 | 75 | 49.0% |
10 | 46 | 30.1% |
23 | 32 | 20.9% |
생성시분초
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 15.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 92179.085 |
Minimum | 3600 |
---|---|
Maximum | 223700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 3600 |
---|---|
5-th percentile | 31000 |
Q1 | 51800 |
median | 112500 |
Q3 | 120000 |
95-th percentile | 132580 |
Maximum | 223700 |
Range | 220100 |
Interquartile range (IQR) | 68200 |
Descriptive statistics
Standard deviation | 42063.418 |
---|---|
Coefficient of variation (CV) | 0.4563228 |
Kurtosis | -0.38277847 |
Mean | 92179.085 |
Median Absolute Deviation (MAD) | 18000 |
Skewness | 0.026893349 |
Sum | 14103400 |
Variance | 1.7693311 × 10^9 |
Monotonicity | Not monotonic |
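Several of the figures above are derived from others in the same tables, which gives a quick consistency check. The sketch below recomputes them from the reported values only; it does not have access to the underlying rows.

```python
# Figures copied from the report for 생성시분초.
minimum, maximum = 3600, 223700
q1, q3 = 51800, 120000
mean, std = 92179.085, 42063.418
total, n = 14103400, 153

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr)           # 220100 68200
print(round(cv, 5))
print(f"{variance:.4e}")
print(round(total / n, 3))        # Sum divided by the number of observations
```

The recomputed values agree with the reported range, IQR, CV, variance, and mean up to rounding.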
Common Values
Value | Count | Frequency (%) |
130500 | 28 | 18.3% |
112500 | 20 | 13.1% |
50600 | 20 | 13.1% |
51800 | 19 | 12.4% |
120000 | 16 | 10.5% |
114000 | 16 | 10.5% |
35100 | 6 | 3.9% |
53300 | 4 | 2.6% |
31000 | 3 | 2.0% |
24300 | 2 | 1.3% |
Other values (13) | 19 | 12.4% |
Minimum 10 values
Value | Count | Frequency (%) |
3600 | 1 | 0.7% |
3800 | 1 | 0.7% |
23700 | 1 | 0.7% |
24300 | 2 | 1.3% |
25000 | 2 | 1.3% |
31000 | 3 | 2.0% |
35100 | 6 | 3.9% |
50600 | 20 | 13.1% |
51800 | 19 | 12.4% |
53300 | 4 | 2.6% |
Maximum 10 values
Value | Count | Frequency (%) |
223700 | 2 | 1.3% |
182100 | 1 | 0.7% |
144500 | 1 | 0.7% |
140900 | 2 | 1.3% |
135700 | 2 | 1.3% |
130500 | 28 | 18.3% |
120000 | 16 | 10.5% |
114300 | 2 | 1.3% |
114000 | 16 | 10.5% |
112500 | 20 | 13.1% |
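The column name 생성시분초 translates to "creation hour-minute-second", and values such as 120000 and 223700 look like HHMMSS clock times stored as integers with leading zeros dropped (3600 would then be 00:36:00). That reading is an assumption, not something the report states. Under it, the mean and standard deviation above are not meaningful as durations, and a timestamp can be reassembled from the four date and time columns as sketched below. The helper name `to_timestamp` is hypothetical.

```python
from datetime import datetime

def to_timestamp(year, month, day, hhmmss):
    """Combine the split date columns into a datetime.

    Assumes hhmmss is an HHMMSS clock time whose leading zeros were lost
    when the column was read as a number (an interpretation, not stated
    in the report).
    """
    digits = f"{int(hhmmss):06d}"  # 3600 -> "003600"
    hour, minute, second = int(digits[:2]), int(digits[2:4]), int(digits[4:])
    return datetime(int(year), int(month), int(day), hour, minute, second)

# Rows 0 and 2 of the sample, plus the column minimum (its date is hypothetical).
print(to_timestamp("2019", "5", "23", 120000))  # 2019-05-23 12:00:00
print(to_timestamp("2019", "5", "23", 223700))  # 2019-05-23 22:37:00
print(to_timestamp("2019", "5", "22", 3600))    # 2019-05-22 00:36:00
```

`datetime` raises `ValueError` for out-of-range fields, so a value such as 126500 would surface immediately if the HHMMSS assumption were wrong for some rows.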
IP주소
Text
Distinct | 131 |
---|---|
Distinct (%) | 85.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 11.705882 |
Min length | 9 |
Characters and Unicode
Total characters | 1791 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 113 |
---|---|
Unique (%) | 73.9% |
Sample
1st row | 51.*.16.36 |
---|---|
2nd row | 50.*.202.39 |
3rd row | 110.*.111.190 |
4th row | 202.*.239.17 |
5th row | 34.*.102.38 |
Value | Count | Frequency (%) |
175.*.163.169 | 4 | 2.6% |
23.*.239.12 | 3 | 2.0% |
50.*.202.39 | 3 | 2.0% |
192.*.78.25 | 2 | 1.3% |
104.*.115.34 | 2 | 1.3% |
62.*.70.146 | 2 | 1.3% |
86.*.200.105 | 2 | 1.3% |
185.*.136.222 | 2 | 1.3% |
181.*.254.21 | 2 | 1.3% |
37.*.33.242 | 2 | 1.3% |
Other values (121) | 129 | 84.3% |
Most occurring characters
Value | Count | Frequency (%) |
. | 459 | 25.6% |
1 | 287 | 16.0% |
2 | 172 | 9.6% |
* | 153 | 8.5% |
3 | 111 | 6.2% |
9 | 96 | 5.4% |
8 | 95 | 5.3% |
0 | 91 | 5.1% |
7 | 88 | 4.9% |
5 | 85 | 4.7% |
Other values (2) | 154 | 8.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1179 | 65.8% |
Other Punctuation | 612 | 34.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 287 | 24.3% |
2 | 172 | 14.6% |
3 | 111 | 9.4% |
9 | 96 | 8.1% |
8 | 95 | 8.1% |
0 | 91 | 7.7% |
7 | 88 | 7.5% |
5 | 85 | 7.2% |
4 | 85 | 7.2% |
6 | 69 | 5.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 459 | 75.0% |
* | 153 | 25.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1791 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 459 | 25.6% |
1 | 287 | 16.0% |
2 | 172 | 9.6% |
* | 153 | 8.5% |
3 | 111 | 6.2% |
9 | 96 | 5.4% |
8 | 95 | 5.3% |
0 | 91 | 5.1% |
7 | 88 | 4.9% |
5 | 85 | 4.7% |
Other values (2) | 154 | 8.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1791 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 459 | 25.6% |
1 | 287 | 16.0% |
2 | 172 | 9.6% |
* | 153 | 8.5% |
3 | 111 | 6.2% |
9 | 96 | 5.4% |
8 | 95 | 5.3% |
0 | 91 | 5.1% |
7 | 88 | 4.9% |
5 | 85 | 4.7% |
Other values (2) | 154 | 8.6% |
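The character statistics are consistent with every IP주소 value being a dotted IPv4 address whose second octet is replaced by `*`: 153 rows give 153 asterisks and 3 × 153 = 459 dots. The sketch below checks values against that pattern and tallies characters for the five sample rows. The regular expression and the name `is_masked_ipv4` are illustrative assumptions, not part of the dataset's documentation.

```python
import re
from collections import Counter

# Dotted quad with the second octet masked, e.g. "51.*.16.36".
MASKED_IPV4 = re.compile(r"(\d{1,3})\.\*\.(\d{1,3})\.(\d{1,3})")

def is_masked_ipv4(value):
    """True if value looks like a.*.c.d with each numeric octet in 0-255."""
    match = MASKED_IPV4.fullmatch(value)
    return bool(match) and all(0 <= int(octet) <= 255 for octet in match.groups())

sample = ["51.*.16.36", "50.*.202.39", "110.*.111.190", "202.*.239.17", "34.*.102.38"]
print(all(is_masked_ipv4(ip) for ip in sample))  # True

# Per-character counts, the same kind of tally as "Most occurring characters".
chars = Counter("".join(sample))
print(chars["."], chars["*"])  # 15 5
```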
URL
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 43 |
---|---|
Median length | 1 |
Mean length | 1.7843137 |
Min length | 1 |
Unique
Unique | 3 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 150 | 98.0% |
hxxp://versuvius.ru/phazzy/Panel/fre.php | 1 | 0.7% |
hxxp://37.49.230.231/285217/logs/fre.php | 1 | 0.7% |
hxxp://5.8.88.176/es2cdNybX27IOKuk.conf.php | 1 | 0.7% |
Variable | 생성월 | 생성일 | 생성시분초 | URL |
---|---|---|---|---|
생성월 | 1.000 | 1.000 | 0.646 | 0.249 |
생성일 | 1.000 | 1.000 | 0.922 | 0.065 |
생성시분초 | 0.646 | 0.922 | 1.000 | 0.761 |
URL | 0.249 | 0.065 | 0.761 | 1.000 |
Variable | 생성월 | 생성일 | URL |
---|---|---|---|
생성월 | 1.000 | 0.997 | 0.164 |
생성일 | 0.997 | 1.000 | 0.060 |
URL | 0.164 | 0.060 | 1.000 |
Variable | 생성시분초 | 생성월 | 생성일 | URL |
---|---|---|---|---|
생성시분초 | 1.000 | 0.630 | 0.652 | 0.598 |
생성월 | 0.630 | 1.000 | 0.997 | 0.164 |
생성일 | 0.652 | 0.997 | 1.000 | 0.060 |
URL | 0.598 | 0.164 | 0.060 | 1.000 |
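The report does not say which association measure each matrix uses. One measure that yields exactly 1.000 when one categorical column fully determines another, as 생성일 appears to determine 생성월 here, is the uncorrected Cramér's V. The sketch below computes it from a contingency table. The cell counts are inferred from the value tables, and the pairing of days with months is an assumption based on the sample rows.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Rows: 생성월 = 5, 7. Columns: 생성일 = 22, 23, 10.
# Assumption: days 22 and 23 occur only in month 5 (75 + 32 = 107 rows)
# and day 10 only in month 7 (46 rows).
month_by_day = [
    [75, 32, 0],
    [0, 0, 46],
]
print(round(cramers_v(month_by_day), 3))  # 1.0
```

A bias-corrected variant of the same statistic gives slightly lower values on small samples, which may explain figures such as 0.997 elsewhere, but that too is a guess.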
First rows
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | IP주소 | URL |
---|---|---|---|---|---|---|
0 | 2019 | 5 | 23 | 120000 | 51.*.16.36 | - |
1 | 2019 | 5 | 23 | 25000 | 50.*.202.39 | - |
2 | 2019 | 5 | 23 | 223700 | 110.*.111.190 | - |
3 | 2019 | 5 | 23 | 120000 | 202.*.239.17 | - |
4 | 2019 | 5 | 23 | 35100 | 34.*.102.38 | - |
5 | 2019 | 5 | 23 | 120000 | 192.*.78.25 | - |
6 | 2019 | 5 | 23 | 31000 | 103.*.72.54 | - |
7 | 2019 | 5 | 23 | 223700 | 104.*.110.190 | - |
8 | 2019 | 5 | 23 | 120000 | 185.*.61.161 | - |
9 | 2019 | 5 | 23 | 35100 | 18.*.215.84 | - |
Last rows
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | IP주소 | URL |
---|---|---|---|---|---|---|
143 | 2019 | 7 | 10 | 112500 | 178.*.83.248 | - |
144 | 2019 | 7 | 10 | 114000 | 79.*.23.90 | - |
145 | 2019 | 7 | 10 | 112500 | 104.*.84.171 | - |
146 | 2019 | 7 | 10 | 114300 | 198.*.117.212 | - |
147 | 2019 | 7 | 10 | 114000 | 184.*.221.60 | - |
148 | 2019 | 7 | 10 | 114000 | 184.*.131.241 | - |
149 | 2019 | 7 | 10 | 112500 | 104.*.18.74 | - |
150 | 2019 | 7 | 10 | 112500 | 104.*.27.170 | - |
151 | 2019 | 7 | 10 | 114000 | 23.*.239.12 | - |
152 | 2019 | 7 | 10 | 82900 | 5.*.88.176 | hxxp://5.8.88.176/es2cdNybX27IOKuk.conf.php |
Most frequently occurring
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | IP주소 | URL | # duplicates |
---|---|---|---|---|---|---|---|
0 | 2019 | 5 | 22 | 53300 | 104.*.114.34 | - | 2 |
1 | 2019 | 5 | 22 | 53300 | 104.*.115.34 | - | 2 |
2 | 2019 | 5 | 22 | 140900 | 175.*.163.169 | - | 2 |
3 | 2019 | 5 | 23 | 120000 | 192.*.78.25 | - | 2 |
4 | 2019 | 7 | 10 | 112500 | 148.*.235.217 | - | 2 |
5 | 2019 | 7 | 10 | 112500 | 185.*.136.222 | - | 2 |
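A table like the one above can be built by counting identical rows. The following is a minimal sketch using only the standard library, with a handful of rows copied from the report. The helper name `duplicate_groups` is hypothetical, and how the profiler itself counts duplicates is not confirmed here.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return {row: count} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: c for row, c in counts.items() if c > 1}

# A few rows in the report's column order; the repeated row is taken from
# the table above, the other two from the first rows of the sample.
rows = [
    ("2019", "5", "23", 120000, "192.*.78.25", "-"),
    ("2019", "5", "23", 120000, "51.*.16.36", "-"),
    ("2019", "5", "23", 120000, "192.*.78.25", "-"),
    ("2019", "5", "23", 25000, "50.*.202.39", "-"),
]
groups = duplicate_groups(rows)
print(groups)

# Share reported for the full dataset: 6 duplicate rows out of 153.
print(f"{6 / 153:.1%}")  # 3.9%
```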