Polisis_Benchmark

Reproducing state-of-the-art results

This repo is our effort to reproduce Polisis results for privacy policy classification based on their paper: https://arxiv.org/abs/1802.02561

Setup instructions

Setup a virtual environment using any tool (e.g., conda) and activate it: conda -n privacy_policy python=3.6 source activate privacy_policy
Install dependecies from the requirement file: pip install -r requirement.txt
install NLTK tokenizer: python -m nltk.downloader punkt

To run the experiment: python -u cnn_multi_label_classifier.py

Parameters can be found in args.py

Important Note: By default the code will use GloVe embeddings. Due to licesing the in-domain embeddings can be provided only upon request.