STATE TWITTER TROLL DETECTION USING TRANSFORMERS

With the 2020 US election around the corner, electoral interference by state actors via social media and other online means is back in the spotlight. Can a fine-tuned transformer model do a better job of detecting state troll tweets than "classic" machine learning approaches? That is what we'll try to assess in this series of notebooks.

REPO STRUCTURE:

1. DATA FOLDER

Five CSV files used by the notebooks in this series. Note that the raw troll tweet files from Twitter are not included here.

2. NOTEBOOKS FOLDER

Notebooks 1.0 - 1.2: Data collection, cleaning and preparation. Optional if you just want to experiment with the final dataset.
Notebooks 2.0 - 2.1: Fine-tuning DistilBERT on a custom dataset, followed by detailed testing on an unseen validation dataset and on a fresh dataset of state troll tweets from Iran.
Notebooks 3.0 - 3.1: Creating and testing optimised logistic regression and XGB models on the same datasets used to assess the fine-tuned DistilBERT model.
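
Comparing the classic models with the fine-tuned DistilBERT model only makes sense if every model is scored the same way on the same datasets. The snippet below is a minimal sketch of that scoring step in plain Python. It assumes binary labels where 1 means "state troll tweet" and 0 means anything else; that encoding, the function name `binary_metrics` and the sample predictions are all made up for illustration and are not taken from the notebooks.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1, assuming 1 = troll and 0 = not troll."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # trolls caught
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # trolls missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # correctly ignored

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels and predictions, for illustration only.
y_true = [1, 1, 1, 0, 0, 0]
predictions = {
    "model_a": [1, 1, 0, 0, 0, 1],
    "model_b": [1, 1, 1, 0, 0, 0],
}

for name, y_pred in predictions.items():
    print(name, binary_metrics(y_true, y_pred))
```

Reporting precision and recall alongside accuracy matters here because missing a troll tweet and flagging a genuine one are different kinds of error.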

3. APP FOLDER

app.py plus folders for "static" and "template": a simple app for use on a local machine to demonstrate how a state troll tweet detector can be used in deployment. Unfortunately, free hosting accounts can't accommodate the disk size required for PyTorch and the fine-tuned model, so I've not deployed this online.
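
To give a rough idea of what such an app does with a submitted tweet, here is a framework-agnostic sketch of the step between the form and the template. It is not the code in app.py: `handle_submission`, the `predict_fn` callback, the 0.5 threshold and the wording of the verdicts are all hypothetical, and the stand-in detector simply returns a fixed number.

```python
def handle_submission(text, predict_fn, threshold=0.5):
    """Turn submitted text into a result dict that a template could render.

    predict_fn stands in for the real detector: it takes a string and
    returns an estimated probability (0 to 1) that the tweet is a troll tweet.
    """
    cleaned = text.strip()
    if not cleaned:
        # Nothing to classify, so report the problem instead of calling the model.
        return {"ok": False, "message": "Please enter a tweet."}

    probability = float(predict_fn(cleaned))
    verdict = "likely state troll" if probability >= threshold else "not flagged"
    return {
        "ok": True,
        "tweet": cleaned,
        "troll_probability": round(probability, 3),
        "verdict": verdict,
    }


def fake_detector(text):
    """Stand-in for the fine-tuned model; always returns the same score."""
    return 0.9


result = handle_submission("  Some tweet text to check  ", fake_detector)
print(result["verdict"], result["troll_probability"])
```

Keeping this logic in a plain function means it can be exercised without starting a web server or loading the model.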

4. TROLL_DETECT FOLDER

Fine-tuned DistilBERT model from Colab notebook 2.0. Too big for GitHub; download it from Dropbox instead.
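
Once the folder has been downloaded, loading it might look like the sketch below. It assumes the model was saved in Hugging Face `save_pretrained` format together with its tokenizer, and that it sits in a local folder named `troll_detect`; neither assumption is confirmed here, and the helper names `find_model_dir` and `load_detector` are hypothetical.

```python
from pathlib import Path


def find_model_dir(base="troll_detect"):
    """Return the folder if it looks like a saved Hugging Face model.

    Assumption: the folder was written with save_pretrained(), which
    stores a config.json next to the weights.
    """
    path = Path(base)
    if not (path / "config.json").is_file():
        raise FileNotFoundError(
            f"No config.json in {path}; download the model folder first."
        )
    return path


def load_detector(model_dir):
    """Build a text-classification pipeline from the local model folder."""
    # Imported lazily so the path check above works without transformers installed.
    from transformers import pipeline

    return pipeline(
        "text-classification",
        model=str(model_dir),
        tokenizer=str(model_dir),  # assumes the tokenizer was saved alongside the model
    )
```

With `transformers` and PyTorch installed, `detector = load_detector(find_model_dir())` followed by `detector("some tweet text")` should return a list of dictionaries with `label` and `score` keys. What each label means depends on how the model was fine-tuned.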

5. PKL FOLDER

Pickled logistic regression model from notebook 3.0
Pickled XGB model from notebook 3.1
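
Loading either pickled model back into Python can be sketched as follows. The helper name `load_pickled_model` is hypothetical and the actual file names inside the folder are not listed here, so substitute the real ones.

```python
import pickle
from pathlib import Path


def load_pickled_model(path):
    """Load a pickled object from disk.

    Only unpickle files from a source you trust: pickle can execute
    arbitrary code while loading.
    """
    with Path(path).open("rb") as fh:
        return pickle.load(fh)
```

For example, `model = load_pickled_model("pkl/your_model_file.pkl")`, where the file name is a placeholder. Unpickling these models requires scikit-learn (and xgboost for the XGB model) to be installed, ideally in versions close to the ones used to create the files.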

Repository: chuachinhon/transformers_state_trolls_cch