Overview

Dataset statistics

Number of variables3
Number of observations151
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory25.9 B

Variable types

Text2
Categorical1

Dataset

DescriptionSample
Author한국인터넷진흥원
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KIS00000000000000015

Alerts

생성년도 has constant value ""Constant
해시코드값 has unique valuesUnique
악성코드 has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:23:18.945872
Analysis finished2023-12-10 06:23:19.351053
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시코드값
Text

UNIQUE 

Distinct151
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-10T15:23:19.737798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters9664
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)100.0%

Sample

1st rowC5773996C67DAE123F54DBB974488C5B197647093525C28C6922175246D58041
2nd row4F0233B00365437F298EA43AFEB8497D54CA489028B72A8EAE47A5761F491E36
3rd row01B3FA1094D23123C7E5DFD22E8AA1B27D6FB9CE8D1EFCE1D950F27BC9938B38
4th rowFA0E01F919DBBF03DAED1D748A5789116CAF3004E2B9738FCC3CD3D5D1F44E0C
5th row8ACCAA0B357AC8E2185D101C34BBA4021BFB26E7E586A13F10992541DB4E5206
ValueCountFrequency (%)
c5773996c67dae123f54dbb974488c5b197647093525c28c6922175246d58041 1
 
0.7%
23ba4e72010f3755701b16560dce098eb05179687dbf322f7bffb9dbdd54df29 1
 
0.7%
c433a15a015af23cbef29dc32df5aaf3133a670c8634005728ff820e587ec5cf 1
 
0.7%
0abe121b77d574010285bda25fcd8ee54e024d371bd4fcd7fe3d4b1ac4e8437f 1
 
0.7%
9d907b7cd0096cd728472badaa51bedfe1d31846b65025d2a9f13ea96c36ef21 1
 
0.7%
d6a6bea686b52382182b5381f3f79a48335ddf544b5c89c65eccb5e03557d325 1
 
0.7%
6e64fd50cb9d38dceb3e6100794f3c11e3b8a458f6265b80101f1d976ad5564f 1
 
0.7%
bba1e0d25795b39f4c38cd1f2e928f0ab3611a1533e462263a8925ac1d22dd87 1
 
0.7%
c6f855877ef419237bf2135cd95a7f6b93db0d458c0b7103ff6926e91f8bacaf 1
 
0.7%
48bef03846767aa65331f48f4fe9c126e0756e65b8c7c60458c60a74c2abc586 1
 
0.7%
Other values (141) 141
93.4%
2023-12-10T15:23:20.498475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 640
 
6.6%
1 640
 
6.6%
D 626
 
6.5%
F 620
 
6.4%
4 617
 
6.4%
9 613
 
6.3%
B 604
 
6.2%
E 597
 
6.2%
5 595
 
6.2%
7 595
 
6.2%
Other values (6) 3517
36.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5995
62.0%
Uppercase Letter 3669
38.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 640
10.7%
4 617
10.3%
9 613
10.2%
5 595
9.9%
7 595
9.9%
6 589
9.8%
8 589
9.8%
0 589
9.8%
3 584
9.7%
2 584
9.7%
Uppercase Letter
ValueCountFrequency (%)
C 640
17.4%
D 626
17.1%
F 620
16.9%
B 604
16.5%
E 597
16.3%
A 582
15.9%

Most occurring scripts

ValueCountFrequency (%)
Common 5995
62.0%
Latin 3669
38.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 640
10.7%
4 617
10.3%
9 613
10.2%
5 595
9.9%
7 595
9.9%
6 589
9.8%
8 589
9.8%
0 589
9.8%
3 584
9.7%
2 584
9.7%
Latin
ValueCountFrequency (%)
C 640
17.4%
D 626
17.1%
F 620
16.9%
B 604
16.5%
E 597
16.3%
A 582
15.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9664
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 640
 
6.6%
1 640
 
6.6%
D 626
 
6.5%
F 620
 
6.4%
4 617
 
6.4%
9 613
 
6.3%
B 604
 
6.2%
E 597
 
6.2%
5 595
 
6.2%
7 595
 
6.2%
Other values (6) 3517
36.4%

생성년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2010
151 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2010
2nd row2010
3rd row2010
4th row2010
5th row2010

Common Values

ValueCountFrequency (%)
2010 151
100.0%

Length

2023-12-10T15:23:20.770770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:23:20.941223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2010 151
100.0%

악성코드
Text

UNIQUE 

Distinct151
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-10T15:23:21.323122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length27.662252
Min length17

Characters and Unicode

Total characters4177
Distinct characters56
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)100.0%

Sample

1st rowTrojan-Downloader.Win32.Small.ael
2nd rowTrojan.Win32.VB.cvr
3rd rowTrojan-PSW.Win32.OnLineGames.ajpd
4th rowBackdoor.Win32.Intruder.10.d
5th rowTrojan-Downloader.Win32.Agent.ahjg
ValueCountFrequency (%)
trojan-downloader.win32.small.ael 1
 
0.7%
trojan.win32.vapsup.ffv 1
 
0.7%
trojan.win32.startpage.ml 1
 
0.7%
trojan-proxy.win32.puma.afs 1
 
0.7%
trojan-downloader.win32.delf.mdj 1
 
0.7%
trojan-gamethief.win32.onlinegames.jtz 1
 
0.7%
trojan-downloader.win32.adload.afj 1
 
0.7%
trojan.win32.obfuscated.akx 1
 
0.7%
trojan-proxy.win32.raznew.a 1
 
0.7%
trojan.win32.small.xya 1
 
0.7%
Other values (141) 141
93.4%
2023-12-10T15:23:21.989066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 453
 
10.8%
n 386
 
9.2%
a 308
 
7.4%
o 303
 
7.3%
i 240
 
5.7%
r 231
 
5.5%
W 175
 
4.2%
e 165
 
4.0%
2 151
 
3.6%
3 150
 
3.6%
Other values (46) 1615
38.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2694
64.5%
Uppercase Letter 642
 
15.4%
Other Punctuation 453
 
10.8%
Decimal Number 311
 
7.4%
Dash Punctuation 77
 
1.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 386
14.3%
a 308
11.4%
o 303
11.2%
i 240
 
8.9%
r 231
 
8.6%
e 165
 
6.1%
j 124
 
4.6%
d 109
 
4.0%
c 76
 
2.8%
l 68
 
2.5%
Other values (16) 684
25.4%
Uppercase Letter
ValueCountFrequency (%)
W 175
27.3%
T 119
18.5%
B 64
 
10.0%
P 35
 
5.5%
S 35
 
5.5%
D 32
 
5.0%
G 31
 
4.8%
A 25
 
3.9%
L 22
 
3.4%
O 18
 
2.8%
Other values (13) 86
13.4%
Decimal Number
ValueCountFrequency (%)
2 151
48.6%
3 150
48.2%
1 5
 
1.6%
0 3
 
1.0%
5 2
 
0.6%
Other Punctuation
ValueCountFrequency (%)
. 453
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3336
79.9%
Common 841
 
20.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 386
 
11.6%
a 308
 
9.2%
o 303
 
9.1%
i 240
 
7.2%
r 231
 
6.9%
W 175
 
5.2%
e 165
 
4.9%
j 124
 
3.7%
T 119
 
3.6%
d 109
 
3.3%
Other values (39) 1176
35.3%
Common
ValueCountFrequency (%)
. 453
53.9%
2 151
 
18.0%
3 150
 
17.8%
- 77
 
9.2%
1 5
 
0.6%
0 3
 
0.4%
5 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4177
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 453
 
10.8%
n 386
 
9.2%
a 308
 
7.4%
o 303
 
7.3%
i 240
 
5.7%
r 231
 
5.5%
W 175
 
4.2%
e 165
 
4.0%
2 151
 
3.6%
3 150
 
3.6%
Other values (46) 1615
38.7%

Missing values

2023-12-10T15:23:19.112725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:23:19.263739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시코드값생성년도악성코드
0C5773996C67DAE123F54DBB974488C5B197647093525C28C6922175246D580412010Trojan-Downloader.Win32.Small.ael
14F0233B00365437F298EA43AFEB8497D54CA489028B72A8EAE47A5761F491E362010Trojan.Win32.VB.cvr
201B3FA1094D23123C7E5DFD22E8AA1B27D6FB9CE8D1EFCE1D950F27BC9938B382010Trojan-PSW.Win32.OnLineGames.ajpd
3FA0E01F919DBBF03DAED1D748A5789116CAF3004E2B9738FCC3CD3D5D1F44E0C2010Backdoor.Win32.Intruder.10.d
48ACCAA0B357AC8E2185D101C34BBA4021BFB26E7E586A13F10992541DB4E52062010Trojan-Downloader.Win32.Agent.ahjg
5874FF7949FD392B127422A328E5DC24B07FE9087065B80F8CD8D8A1B316373602010Trojan-PSW.Win32.ZRM
63429E8395F6D027E38CBDB59588C05104B5EE0AE043E166D54BB3495F14F2B742010Trojan-Spy.Win32.Banker.nim
7770E29034D03CDFA71E05A53540774D2F470AC168F94283E2C1AA7A9DB292DF42010Trojan-Downloader.Win32.Banload.qsa
88163459090EB0F014153528968EE3FC3254A52BCB3079A7EF40E9744BD15D35C2010Backdoor.Win32.PcClient.jud
9EB490C7E8513329433AF65EC53A7E42A894E6CAA6232CCE6601AF0613B9263B42010Trojan-PSW.Win32.QQPass.axc
해시코드값생성년도악성코드
141A760E4244277E5A81999C2ECC31F5DCB26934454E2E08B66193471648529F3792010Backdoor.Win32.Hupigon.ezza
142FC151A5F1C02C643C2AF5A635635F5874383EF6852204F0458DB809948815AC12010Trojan-PSW.Win32.Magania.twf
143FB41F392A6BB4A6A7F2720A6E392B1B37EE7914B70BF7B176F1D4B178697F79E2010Trojan-GameThief.Win32.OnLineGames.civ
144BD31F673023128B424B57CC2E0FA5C2F38CF579DE2B4C11BF9C9F6F0B0ADB17F2010Trojan-Downloader.Win32.Zlob.ajq
14529E2A763D241A7D4AD4AFFA4160D584CBCAA8EB2405E570AC693CCB475C439522010Trojan-PSW.Win32.OnLineGames.andi
1464CAC7D90FA708A99FEAB013EE016BC12332047DFB8F7D438C86E789FAD98ADD92010Trojan-Downloader.Win32.Agent.adwa
147E09AB16C9BD9AECCFA5762A26ACC5F233E5C0B3DD547C4E7813C0A274B0DA4742010Backdoor.Win32.Hupigon.dzhu
14860382931471A9F72CED73532F61ED3E14D9F96434C91498A7C5C7C24EE7B9BEC2010Worm.Win32.AutoRun.xak
1496C5FE116CE27D037D86F018D6372EF98C46F34DBD67A8072DF95615E6B0B1EAE2010Trojan-GameThief.Win32.Lmir.agw
150AE415613A9F89E8911D7317390D4A411473E5801AA96DA90349DC37CCD2AAB902010Trojan-Spy.Win32.Zbot.cfd