Skip to content

Author Obfuscation Implementation of paper "Avengers Ensemble".

Notifications You must be signed in to change notification settings

shivamsharma00/Avengers_ensemble

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Avengers Ensemble: Advanced Authorship Obfuscation

Welcome to the Avengers Ensemble repository, home to cutting-edge authorship obfuscation techniques using MUTANT-X. Elevate your text security with our implementation that surpasses the capabilities of existing methods.

Project Overview

Dive into the realm of Mutant-X authorship obfuscation, inspired by the groundbreaking research paper, "Mutant-X: Efficient Authorship Obfuscation Techniques." Our approach refines the original concept by introducing the Ensemble Vote classifier and harnessing the power of writeprint features to enhance SVM classifiers' training.

Key Features

  • Robust ensemble attribution classifier with a 10-SVM classifier pipeline.
  • Ensemble Vote classifier dynamically optimizes the ensemble by excluding low-accuracy classifiers.
  • Writeprint features, including:
  • Average character per word
  • Letter frequency
  • Key letter bigrams and trigrams
  • Digits and characters percentages
  • Frequency of digits and word lengths
  • Integration of Word2vec model (gensim) for efficient word neighbor computations.
  • MUTANT-X classifier with SVC, trained on the Amazon review dataset as an internal attribution classifier.
  • Local dictionary for storing computed neighbors, enhancing computation efficiency.
  • Achieved an outstanding average METEOR score of 0.5 on the Amazon review dataset, surpassing the implemented paper's results.

Usage Guide

Empower your projects with Mutant-X authorship obfuscation:

  1. Clone Repository: Copy this GitHub repository to your local machine.
  2. Install Dependencies: Install the necessary dependencies listed in the requirements.txt file.
  3. Prepare Input: Create the input text file you wish to obfuscate.
  4. Run Script: Execute the main.py script with the input text file as a command-line argument.
  5. Obfuscated Output: Retrieve the obfuscated text, saved in a file with the original name appended by "_obfuscated."

Conclusion

Revolutionize your text security with Mutant-X authorship obfuscation. Our project excels by combining Ensemble Vote classification, advanced writeprint features, and a meticulously trained MUTANT-X classifier. Notably, the local dictionary optimizes computation time, making the obfuscation process swift and effective. Experience superior results on the Amazon review dataset, marking a significant improvement over the original paper's implementation. Join us in shaping the future of authorship obfuscation!

About

Author Obfuscation Implementation of paper "Avengers Ensemble".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages