Skip to content

Collected dataset of DGA domains and Alexa domains. Used for experiments in automatic DGA domains classification.

Notifications You must be signed in to change notification settings

chrmor/DGA_domains_dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

DGA_domains_dataset

We extracted a set of domains generated by 25 different DGA families from the Netlab Opendata Project repository (data.netlab.360.com/dga) and used Alexa as an authoritative source for benign domain names. In total, we collected 675.000 domain names. 50% of the domains are DGA generated and 50% are Alexa domains. The 337.500 malicious DNs are equally distributed among the 25 DGA families, thus we have 13.500 domains for each family. Among such DGAs, 13 are time dependent algorithms and 12 are time independent ones.

You are free to download and use the data for reseaerch purposes. If you use data in scientific publications please cite the original paper:

Cucchiarelli, A., Morbidoni, C., Spalazzi, L., Baldi, M., Algorithmically generated malicious domain names detection based on n-grams features (2021) Expert Systems with Applications, 170, art. no. 114551

https://doi.org/10.1016/j.eswa.2020.114551

About

Collected dataset of DGA domains and Alexa domains. Used for experiments in automatic DGA domains classification.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages