Skip to content
Hacker Forum Classification for Threat Intelligence
Jupyter Notebook
Branch: master
Clone or download
Latest commit 495be3d Aug 20, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
CrackingArena.rar
DataLabeling.ipynb
PreProcessing.ipynb
README.md
TextClassification.ipynb
dl_model_accuracy_score_cmp.png
dl_model_f1_score_cmp.png
dl_model_precision_score_cmp.png
dl_model_recall_score_cmp.png
label_cmp.png
ml_model_accuracy_score_cmp.png
ml_model_f1_score_cmp.png
ml_model_precision_score_cmp.png
ml_model_recall_score_cmp.png
security word vectors

README.md

Hacker Forum Exploit and Classification for Proactive Cyber Threat Intelligence

This is a research project that utilizes hacker forum data for proactive cyber threat intelligence. This research paper employs state-of-the-art machine learning and deep learning approach to automatically classify hacker forum data into predefined categories and develop interactive visualizations enabling CTI practitioners to explore collected data for proactive and timely CTI. The results from this research shows that among all the models, deep learning model RNN GRU gives the best classification results with 99.025% accuracy and 96.56% precision.

Tools & Libraries Used

  • Python 2.7
  • Sklearn
  • Pandas
  • Keras
  • Seaborn
  • Numpy
  • NLTK
  • Anaconda

Dataset Used

Data Visualizations

Label Classification

ML Models Accuracy Score

DL Models Accuracy Score

Contributors

You can’t perform that action at this time.