GitHub - hh2010/malware_classify: Malware Classification Project

Malware Classification Project

Overview

This is the GitHub repository for the Malware Classification project I did for Metis. I used scikit-learn to train classification models (Random Forests, Logistic Regression, and SVM) using bi-directional packet flows. In my Jupyter notebook for the project, you will see how I trained the model and pickled them in the live folder.

The training data was gathered from the Malware Capture Facility Project (MCFP) from the Czech Technical University (CTU). Right now, the model seems to work well on classifying the Neris malware strain, but has not been tested on any other types. As I move on to other projects, I am very interested if people can test this model and generalize it to more malware types.

Packet Flows

Packet flows are created using Argus and a packet capture file which you can generate using Wireshark or tcpdump.

Check out the Argus documentation and you will find that it is easy to generate packet flow records using Argus and Radium.

Make sure you use the configuration files in the live folder when generating your flow records for testing.

Live Test Example

Once you have generated the packet flow CSV using Argus and the corresponding configuration files found in the live folder, use the command below to output the results of the classification model to your console.

./live_test.py --input_csv live/live.csv

Contact

If you have any questions or comments on this project, you can find me below:

E-mail: hasan.haq@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
live		live
ppt		ppt
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
live_test.py		live_test.py
malware_training.ipynb		malware_training.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

live

live

ppt

ppt

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

live_test.py

live_test.py

malware_training.ipynb

malware_training.ipynb

requirements.txt

requirements.txt

Repository files navigation

Malware Classification Project

Overview

Packet Flows

Live Test Example

Contact

About

Releases

Packages

Languages

hh2010/malware_classify

Folders and files

Latest commit

History

Repository files navigation

Malware Classification Project

Overview

Packet Flows

Live Test Example

Contact

About

Resources

Stars

Watchers

Forks

Languages