GitHub - Matthewdowney18/reddit: classifacation and generation experiments on the reddit dataset

reddit classification and generation experiments on the reddit dataset

Steps for training:

1: clean up the data with make_data.py which creates a new csv.

2: if there are pretrained embeddings, run the new csv through filter embeddings to create embedding matrix. make sure to have the same parameters on this file, and the model.

3: now, it is ready to train. you can train the auto encoder, or the classifier. it saves the model which is the same for both, so it is possible to train the auto encoder, and then train that model for classification. Of course the models can be trained multiple times for the same thing. Be weary, because the hyperparameters must be the same in order for you to do this.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.idea		.idea
classification		classification
generation		generation
README.md		README.md
metadata.json		metadata.json
train_classifier.py		train_classifier.py
train_generator.py		train_generator.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

Matthewdowney18/reddit

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages