Skip to content

R script to classify trojan vs. benign traffic utilizing various machine learning models

License

Notifications You must be signed in to change notification settings

pburkepro/trojan-traffic-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Trojan Traffic Classification

Software used:

GitHub R package version

Data Source:

https://www.kaggle.com/datasets/subhajournal/trojan-detection

Findings:

From the three models investigated, along with a previously completed k-NN model, we can say that a decision tree with cost matrix would be the best model to classify Trojan web traffic from benign traffic. With only a few truly significant variables, the model does not need to be complicated, and as shown above complexity will actually reduce the effectiveness of a model using this data. Additionally, if we are developing this model for a hypothetical business use-case, determining Trojan attacks is much more important than reducing false positives. As such, the 100% effectiveness rate in determining true Trojan traffic is a tremendous positive with the decision tree, especially since it does not come at too high (~ 5% false positive) a cost with regards to false positives. As such, the final decision of using a decision tree with a cost matrix not only makes sense with regards to consideration of accuracy, but also of any potential real world usecases.

About

R script to classify trojan vs. benign traffic utilizing various machine learning models

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages