Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 33 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 1 |
Duplicate rows (%) | 3.0% |
Total size in memory | 1.5 KiB |
Average record size in memory | 48.0 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 3 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국인터넷진흥원 (Korea Internet & Security Agency, KISA) |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000010 |
Alerts
생성년도 has constant value "2019" | Constant |
Dataset has 1 (3.0%) duplicate rows | Duplicates |
생성월 is highly overall correlated with 생성시분초 | High correlation |
생성시분초 is highly overall correlated with 생성월 | High correlation |
생성시분초 has 24 (72.7%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:27:52.037694 |
---|---|
Analysis finished | 2023-12-10 06:27:53.890159 |
Duration | 1.85 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
생성년도 (creation year)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 396.0 B |
2019 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2019 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 33 | 100.0% |
생성월 (creation month)
Real number (ℝ)
HIGH CORRELATION
Distinct | 10 |
---|---|
Distinct (%) | 30.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.0909091 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 429.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 4 |
median | 6 |
Q3 | 7 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 3.2051096 |
---|---|
Coefficient of variation (CV) | 0.52621202 |
Kurtosis | -0.63640527 |
Mean | 6.0909091 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.50045096 |
Sum | 201 |
Variance | 10.272727 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
6 | 6 | 18.2% |
7 | 5 | 15.2% |
4 | 5 | 15.2% |
12 | 4 | 12.1% |
3 | 4 | 12.1% |
2 | 3 | 9.1% |
10 | 2 | 6.1% |
9 | 2 | 6.1% |
1 | 1 | 3.0% |
5 | 1 | 3.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 3.0% |
2 | 3 | 9.1% |
3 | 4 | 12.1% |
4 | 5 | 15.2% |
5 | 1 | 3.0% |
6 | 6 | 18.2% |
7 | 5 | 15.2% |
9 | 2 | 6.1% |
10 | 2 | 6.1% |
12 | 4 | 12.1% |
Maximum 10 values
Value | Count | Frequency (%) |
12 | 4 | 12.1% |
10 | 2 | 6.1% |
9 | 2 | 6.1% |
7 | 5 | 15.2% |
6 | 6 | 18.2% |
5 | 1 | 3.0% |
4 | 5 | 15.2% |
3 | 4 | 12.1% |
2 | 3 | 9.1% |
1 | 1 | 3.0% |
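The summary figures for 생성월 can be cross-checked from the frequency table alone. The sketch below expands the value counts listed above back into 33 observations and recomputes the mean, variance, standard deviation, coefficient of variation, median, and MAD with Python's standard `statistics` module. It assumes the report prints the sample variance (dividing by n - 1), which is the reading that reproduces 10.272727; the variable names are illustrative and not part of the report.

```python
import statistics

# Value -> count for 생성월, copied from the frequency table above.
month_counts = {6: 6, 7: 5, 4: 5, 12: 4, 3: 4, 2: 3, 10: 2, 9: 2, 1: 1, 5: 1}

# Expand the counts back into the individual observations.
months = sorted(v for v, c in month_counts.items() for _ in range(c))

mean = statistics.mean(months)
variance = statistics.variance(months)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(months)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(months)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(m - median) for m in months)

print("n =", len(months), "sum =", sum(months))
print("mean =", round(mean, 7), "variance =", round(variance, 6))
print("std =", round(std_dev, 7), "cv =", round(cv, 8))
print("median =", median, "MAD =", mad)
```

If the recomputed values drift from the report, the first thing to check is whether a population variance (dividing by n) was used instead.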
생성일 (creation day)
Real number (ℝ)
Distinct | 13 |
---|---|
Distinct (%) | 39.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.727273 |
Minimum | 1 |
---|---|
Maximum | 28 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 429.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.6 |
Q1 | 17 |
median | 18 |
Q3 | 25 |
95-th percentile | 28 |
Maximum | 28 |
Range | 27 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 7.0899256 |
---|---|
Coefficient of variation (CV) | 0.35939715 |
Kurtosis | 1.599316 |
Mean | 19.727273 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -1.2450191 |
Sum | 651 |
Variance | 50.267045 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
17 | 7 | 21.2% |
24 | 6 | 18.2% |
18 | 5 | 15.2% |
28 | 4 | 12.1% |
25 | 3 | 9.1% |
16 | 1 | 3.0% |
1 | 1 | 3.0% |
26 | 1 | 3.0% |
3 | 1 | 3.0% |
21 | 1 | 3.0% |
Other values (3) | 3 | 9.1% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 3.0% |
2 | 1 | 3.0% |
3 | 1 | 3.0% |
15 | 1 | 3.0% |
16 | 1 | 3.0% |
17 | 7 | 21.2% |
18 | 5 | 15.2% |
21 | 1 | 3.0% |
24 | 6 | 18.2% |
25 | 3 | 9.1% |
Maximum 10 values
Value | Count | Frequency (%) |
28 | 4 | 12.1% |
27 | 1 | 3.0% |
26 | 1 | 3.0% |
25 | 3 | 9.1% |
24 | 6 | 18.2% |
21 | 1 | 3.0% |
18 | 5 | 15.2% |
17 | 7 | 21.2% |
16 | 1 | 3.0% |
15 | 1 | 3.0% |
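The fractional 5-th percentile (2.6) in the quantile statistics for 생성일 comes from interpolating between neighbouring sorted values. The sketch below rebuilds the 33 observations from the two extreme-value tables above, which together cover all 13 distinct values, and applies linear interpolation between order statistics. That interpolation rule is an assumption (it is the usual NumPy/pandas default, and it reproduces the printed figures); `percentile` is a hypothetical helper written for this example.

```python
import math

# Value -> count for 생성일, merged from the two extreme-value tables above.
day_counts = {1: 1, 2: 1, 3: 1, 15: 1, 16: 1, 17: 7, 18: 5,
              21: 1, 24: 6, 25: 3, 26: 1, 27: 1, 28: 4}
days = sorted(v for v, c in day_counts.items() for _ in range(c))


def percentile(sorted_values, p):
    """Linear-interpolation percentile for p in [0, 1] on pre-sorted data."""
    pos = p * (len(sorted_values) - 1)        # fractional index into the data
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)  # clamp at the last element
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


for label, p in [("5%", 0.05), ("Q1", 0.25), ("median", 0.50),
                 ("Q3", 0.75), ("95%", 0.95)]:
    print(label, round(percentile(days, p), 4))

iqr = percentile(days, 0.75) - percentile(days, 0.25)
print("IQR", round(iqr, 4))
```

For the 5-th percentile the fractional index is 0.05 × 32 = 1.6, so the result lies 60% of the way from the second sorted value (2) to the third (3).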
생성시분초 (creation time: hour, minute, second)
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 7 |
---|---|
Distinct (%) | 21.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33089.061 |
Minimum | 0 |
---|---|
Maximum | 161131 |
Zeros | 24 |
Zeros (%) | 72.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 429.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 101648 |
95-th percentile | 151411 |
Maximum | 161131 |
Range | 161131 |
Interquartile range (IQR) | 101648 |
Descriptive statistics
Standard deviation | 56309.5 |
---|---|
Coefficient of variation (CV) | 1.7017558 |
Kurtosis | -0.12043012 |
Mean | 33089.061 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.2585116 |
Sum | 1091939 |
Variance | 3.1707598 × 10^9 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 24 | 72.7% |
101648 | 2 | 6.1% |
151411 | 2 | 6.1% |
104927 | 2 | 6.1% |
161131 | 1 | 3.0% |
104214 | 1 | 3.0% |
110622 | 1 | 3.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 24 | 72.7% |
101648 | 2 | 6.1% |
104214 | 1 | 3.0% |
104927 | 2 | 6.1% |
110622 | 1 | 3.0% |
151411 | 2 | 6.1% |
161131 | 1 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
161131 | 1 | 3.0% |
151411 | 2 | 6.1% |
110622 | 1 | 3.0% |
104927 | 2 | 6.1% |
104214 | 1 | 3.0% |
101648 | 2 | 6.1% |
0 | 24 | 72.7% |
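The column name 생성시분초 reads as "creation hour-minute-second", and every nonzero value above fits an HHMMSS pattern (for example 161131 as 16:11:31). If that reading is right, the mean, variance, and correlations reported for this column are arithmetic on an encoded clock time rather than on a true quantity. The source does not state the encoding, so the following is only a sketch under that assumption; `decode_hhmmss` is a hypothetical helper.

```python
from datetime import time


def decode_hhmmss(value):
    """Split an integer such as 161131 into hour, minute and second."""
    hours, remainder = divmod(value, 10_000)
    minutes, seconds = divmod(remainder, 100)
    # time() raises ValueError if any component is out of range.
    return time(hours, minutes, seconds)


# The seven distinct values reported for 생성시분초.
distinct_values = [0, 101648, 104214, 104927, 110622, 151411, 161131]
for raw in distinct_values:
    print(raw, decode_hhmmss(raw).isoformat())
```

Under this decoding the 24 zeros map to 00:00:00. The report alone cannot tell whether those are genuine midnight timestamps or records with no time captured.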
분석보고서명 (analysis report name)
Text
Distinct | 32 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 396.0 B |
Length
Max length | 121 |
---|---|
Median length | 42 |
Mean length | 33.969697 |
Min length | 18 |
Characters and Unicode
Total characters | 1121 |
---|---|
Distinct characters | 187 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 31 |
---|---|
Unique (%) | 93.9% |
Sample
1st row | 2018년_하반기_악성코드_은닉사이트_탐지_동향_보고서.pdf |
---|---|
2nd row | _KISA__갠드크랩_분석_스폐셜_리포트.pdf |
3rd row | 20190226_Clop랜섬웨어_유포에_따른_감염_주의.pdf |
4th row | 사이버보안_빅데이터_활용사례(연구-국민대-윤명근_교수).pdf |
5th row | 사이버보안_빅데이터_활용사례(기업-두산디지털이노베이션BU-김민교_대리).pdf |
Value | Count | Frequency (%) |
공급망_공격_사례_분석_및_대응_방안.pdf | 2 | 6.1% |
2018년_하반기_악성코드_은닉사이트_탐지_동향_보고서.pdf | 1 | 3.0% |
3._머신러닝_기반의_보안데이터_분석_연구.pdf | 1 | 3.0% |
4._operation_kitty_phishing.pdf | 1 | 3.0% |
3._system_anomaly_analysis___detection.pdf | 1 | 3.0% |
2._threat_intelligence_수집방법_및_활용사례.pdf | 1 | 3.0% |
1._사이버보안빅데이터센터_소개.pdf | 1 | 3.0% |
ad(active_directory)_관리자가_피해야_할_6가지_ad_운영_사례.pdf | 1 | 3.0% |
2019년_1분기_사이버_위협_동향_보고서.pdf | 1 | 3.0% |
kisa_technical_report__analysis_on_cases_of_distribution_of_internal_network_ransomware_through_exploiting_ad_server.pdf | 1 | 3.0% |
Other values (22) | 22 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
_ | 178 | 15.9% |
. | 44 | 3.9% |
p | 39 | 3.5% |
f | 35 | 3.1% |
d | 33 | 2.9% |
e | 29 | 2.6% |
2 | 22 | 2.0% |
이 | 22 | 2.0% |
사 | 21 | 1.9% |
r | 19 | 1.7% |
Other values (177) | 679 | 60.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 413 | 36.8% |
Lowercase Letter | 300 | 26.8% |
Connector Punctuation | 178 | 15.9% |
Uppercase Letter | 97 | 8.7% |
Decimal Number | 69 | 6.2% |
Other Punctuation | 44 | 3.9% |
Close Punctuation | 7 | 0.6% |
Open Punctuation | 7 | 0.6% |
Dash Punctuation | 6 | 0.5% |
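The category names in the table above are Unicode general categories. As a rough illustration of how such counts arise, the sketch below classifies each character of a file name with Python's standard `unicodedata.category` and maps the two-letter codes to the long names used in this report. It is not the profiler's own implementation; `CATEGORY_NAMES` and `category_counts` are names invented for this example, and only the nine categories that appear in this report are mapped.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code for any unmapped category.
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}


# Two file names taken from the sample rows of this report.
for name in ["4._OPERATION_KITTY_PHISHING.pdf",
             "1._사이버보안빅데이터센터_소개.pdf"]:
    print(name, category_counts(name))
```

Hangul syllables fall under Other Letter and the underscore under Connector Punctuation, which is why those two categories dominate a column of Korean file names joined with underscores.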
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 22 | 5.3% |
사 | 21 | 5.1% |
보 | 14 | 3.4% |
안 | 12 | 2.9% |
기 | 11 | 2.7% |
분 | 11 | 2.7% |
터 | 9 | 2.2% |
버 | 9 | 2.2% |
석 | 8 | 1.9% |
례 | 8 | 1.9% |
Other values (121) | 288 | 69.7% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 39 | 13.0% |
f | 35 | 11.7% |
d | 33 | 11.0% |
e | 29 | 9.7% |
r | 19 | 6.3% |
t | 19 | 6.3% |
o | 16 | 5.3% |
i | 15 | 5.0% |
n | 14 | 4.7% |
s | 13 | 4.3% |
Other values (13) | 68 | 22.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 18 | 18.6% |
I | 16 | 16.5% |
S | 13 | 13.4% |
K | 8 | 8.2% |
D | 8 | 8.2% |
T | 6 | 6.2% |
R | 6 | 6.2% |
C | 5 | 5.2% |
N | 3 | 3.1% |
H | 2 | 2.1% |
Other values (8) | 12 | 12.4% |
Decimal Number
Value | Count | Frequency (%) |
2 | 22 | 31.9% |
0 | 15 | 21.7% |
1 | 11 | 15.9% |
9 | 7 | 10.1% |
3 | 4 | 5.8% |
6 | 3 | 4.3% |
5 | 2 | 2.9% |
4 | 2 | 2.9% |
7 | 2 | 2.9% |
8 | 1 | 1.4% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 178 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 44 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 7 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 413 | 36.8% |
Latin | 397 | 35.4% |
Common | 311 | 27.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 22 | 5.3% |
사 | 21 | 5.1% |
보 | 14 | 3.4% |
안 | 12 | 2.9% |
기 | 11 | 2.7% |
분 | 11 | 2.7% |
터 | 9 | 2.2% |
버 | 9 | 2.2% |
석 | 8 | 1.9% |
례 | 8 | 1.9% |
Other values (121) | 288 | 69.7% |
Latin
Value | Count | Frequency (%) |
p | 39 | 9.8% |
f | 35 | 8.8% |
d | 33 | 8.3% |
e | 29 | 7.3% |
r | 19 | 4.8% |
t | 19 | 4.8% |
A | 18 | 4.5% |
o | 16 | 4.0% |
I | 16 | 4.0% |
i | 15 | 3.8% |
Other values (31) | 158 | 39.8% |
Common
Value | Count | Frequency (%) |
_ | 178 | 57.2% |
. | 44 | 14.1% |
2 | 22 | 7.1% |
0 | 15 | 4.8% |
1 | 11 | 3.5% |
9 | 7 | 2.3% |
) | 7 | 2.3% |
( | 7 | 2.3% |
- | 6 | 1.9% |
3 | 4 | 1.3% |
Other values (5) | 10 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 708 | 63.2% |
Hangul | 413 | 36.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 178 | 25.1% |
. | 44 | 6.2% |
p | 39 | 5.5% |
f | 35 | 4.9% |
d | 33 | 4.7% |
e | 29 | 4.1% |
2 | 22 | 3.1% |
r | 19 | 2.7% |
t | 19 | 2.7% |
A | 18 | 2.5% |
Other values (46) | 272 | 38.4% |
Hangul
Value | Count | Frequency (%) |
이 | 22 | 5.3% |
사 | 21 | 5.1% |
보 | 14 | 3.4% |
안 | 12 | 2.9% |
기 | 11 | 2.7% |
분 | 11 | 2.7% |
터 | 9 | 2.2% |
버 | 9 | 2.2% |
석 | 8 | 1.9% |
례 | 8 | 1.9% |
Other values (121) | 288 | 69.7% |
Correlations
Variable | 생성월 | 생성일 | 생성시분초 | 분석보고서명 |
---|---|---|---|---|
생성월 | 1.000 | 0.912 | 0.932 | 1.000 |
생성일 | 0.912 | 1.000 | 0.578 | 1.000 |
생성시분초 | 0.932 | 0.578 | 1.000 | 1.000 |
분석보고서명 | 1.000 | 1.000 | 1.000 | 1.000 |
Variable | 생성월 | 생성일 | 생성시분초 |
---|---|---|---|
생성월 | 1.000 | -0.155 | 0.547 |
생성일 | -0.155 | 1.000 | -0.185 |
생성시분초 | 0.547 | -0.185 | 1.000 |
First rows
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | 분석보고서명 |
---|---|---|---|---|---|
0 | 2019 | 1 | 16 | 161131 | 2018년_하반기_악성코드_은닉사이트_탐지_동향_보고서.pdf |
1 | 2019 | 2 | 1 | 0 | _KISA__갠드크랩_분석_스폐셜_리포트.pdf |
2 | 2019 | 2 | 26 | 0 | 20190226_Clop랜섬웨어_유포에_따른_감염_주의.pdf |
3 | 2019 | 12 | 18 | 101648 | 사이버보안_빅데이터_활용사례(연구-국민대-윤명근_교수).pdf |
4 | 2019 | 12 | 18 | 101648 | 사이버보안_빅데이터_활용사례(기업-두산디지털이노베이션BU-김민교_대리).pdf |
5 | 2019 | 12 | 17 | 151411 | KISA-포스터(2020년_7대_사이버_공격_전망).pdf |
6 | 2019 | 12 | 17 | 151411 | KISA-발표자료(2020년_7대_사이버_공격_전망).pdf |
7 | 2019 | 10 | 25 | 104927 | 2019년_3분기_사이버_위협_동향_보고서.pdf |
8 | 2019 | 10 | 25 | 104927 | KISA_Cyber_Security_Issue_Report__Q3_2019.pdf |
9 | 2019 | 9 | 25 | 104214 | 2._머신러닝을_활용한_피싱_사이트_탐지_방안.pdf |
Last rows
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | 분석보고서명 |
---|---|---|---|---|---|
23 | 2019 | 4 | 17 | 0 | AD_악용_랜섬웨어_유포사례_분석.pdf |
24 | 2019 | 4 | 17 | 0 | _KISA_Technical_Report.pdf |
25 | 2019 | 4 | 17 | 0 | _KISA_Technical_Report__Analysis_on_Cases_of_Distribution_of_Internal_Network_Ransomware_through_Exploiting_AD_Server.pdf |
26 | 2019 | 4 | 15 | 0 | 2019년_1분기_사이버_위협_동향_보고서.pdf |
27 | 2019 | 4 | 2 | 0 | AD(Active_Directory)_관리자가_피해야_할_6가지_AD_운영_사례.pdf |
28 | 2019 | 3 | 28 | 0 | 1._사이버보안빅데이터센터_소개.pdf |
29 | 2019 | 3 | 28 | 0 | 2._Threat_Intelligence_수집방법_및_활용사례.pdf |
30 | 2019 | 3 | 28 | 0 | 3._System_Anomaly_Analysis___Detection.pdf |
31 | 2019 | 3 | 28 | 0 | 4._OPERATION_KITTY_PHISHING.pdf |
32 | 2019 | 2 | 27 | 0 | AD_관리자_계정_탈취를_통한_기업_내부망_장악_사례와_대응방안.pdf |
Most frequently occurring
Index | 생성년도 | 생성월 | 생성일 | 생성시분초 | 분석보고서명 | # duplicates |
---|---|---|---|---|---|---|
0 | 2019 | 7 | 18 | 0 | 공급망_공격_사례_분석_및_대응_방안.pdf | 2 |
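The duplicate figures can be illustrated by counting identical rows. The sketch below uses two rows from the sample tables plus the repeated record shown above, stored as plain tuples. It is a stand-in for whatever the profiler does internally. One record appearing twice gives one duplicate group and one surplus copy, so either way of counting agrees with the reported 1 duplicate row out of 33 (3.0%).

```python
from collections import Counter

# (생성년도, 생성월, 생성일, 생성시분초, 분석보고서명) tuples. The first two rows
# come from the sample tables; the last two are the repeated record above.
rows = [
    ("2019", 1, 16, 161131, "2018년_하반기_악성코드_은닉사이트_탐지_동향_보고서.pdf"),
    ("2019", 3, 28, 0, "4._OPERATION_KITTY_PHISHING.pdf"),
    ("2019", 7, 18, 0, "공급망_공격_사례_분석_및_대응_방안.pdf"),
    ("2019", 7, 18, 0, "공급망_공격_사례_분석_및_대응_방안.pdf"),
]

counts = Counter(rows)

# Records that occur more than once, with their occurrence count
# (the "# duplicates" column above).
repeated = {row: n for row, n in counts.items() if n > 1}

# Surplus copies: rows that would be dropped if only first occurrences stayed.
surplus = sum(n - 1 for n in counts.values())

TOTAL_OBSERVATIONS = 33  # number of observations in the full dataset
print("duplicate groups:", len(repeated))
print("surplus copies:", surplus)
print("share of dataset:", f"{surplus / TOTAL_OBSERVATIONS:.1%}")
```

Because every column must match for a row to count, two reports published at the same moment under different file names are not flagged here.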