This repository has been archived by the owner on Jan 29, 2020. It is now read-only.

bentrevett/bag-of-tricks-for-efficient-text-classification


Bag of Tricks for Efficient Text Classification

Implementation of Bag of Tricks for Efficient Text Classification in PyTorch using TorchText
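The model in the paper is essentially a linear classifier over averaged word (and n-gram) embeddings. A minimal PyTorch sketch of that idea, assuming a batch-first tensor of token indices (the class and argument names here are illustrative, not necessarily those used in this repository):

```python
import torch
import torch.nn as nn

class FastText(nn.Module):
    """Bag-of-embeddings classifier: embed, average over the sequence, project."""
    def __init__(self, vocab_size, embedding_dim, output_dim, pad_idx=1):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embedding_dim, padding_idx=pad_idx)
        self.fc = nn.Linear(embedding_dim, output_dim)

    def forward(self, text):
        # text: [batch size, seq len]
        embedded = self.embedding(text)   # [batch size, seq len, embedding dim]
        pooled = embedded.mean(dim=1)     # average embeddings across the sequence
        return self.fc(pooled)            # [batch size, output dim]
```

Because the only learned components are an embedding table and one linear layer, both training and inference are fast even with large vocabularies.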

Things you can try:

  • Use n-grams by setting N_GRAMS > 1. Note: this slows down pre-processing.
  • Reduce the vocabulary size by setting VOCAB_MAX_SIZE or by increasing VOCAB_MIN_FREQ.
  • Train on truncated sequences by setting MAX_LENGTH.
  • Change the tokenizer to a built-in one, such as the spaCy tokenizer, by setting TOKENIZER = 'spacy'. Note: this slows down pre-processing considerably.
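Appending n-grams to each example's token list is the paper's central trick, and explains why N_GRAMS > 1 slows pre-processing. A minimal sketch of what that expansion could look like (make_ngrams is a hypothetical helper for illustration; the repository's actual pre-processing may differ):

```python
def make_ngrams(tokens, n_grams):
    """Append all 2-grams up to n_grams-grams (joined with spaces) to the
    original token list, so they share one embedding table with the unigrams."""
    result = list(tokens)
    for n in range(2, n_grams + 1):
        for i in range(len(tokens) - n + 1):
            result.append(' '.join(tokens[i:i + n]))
    return result
```

For example, `make_ngrams(['the', 'cat', 'sat'], 2)` yields the three unigrams followed by `'the cat'` and `'cat sat'`. The extra tokens enlarge the vocabulary, which is why capping it with VOCAB_MAX_SIZE or VOCAB_MIN_FREQ matters more when n-grams are enabled.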

