To reproduce, create anaconda environment: conda create --name test python=3.7.5
Then install all required modules from requirements.txt using command: pip install -r requirements.txt
Preprocessing class is at src.preprocessing.Preprocessor.py
Script that generates test_proc.tsv is at tests.draft_tests.py