Cabbagito/Generating-South-Park-Episodes


Generating South Park Episodes


Summary

I fine-tuned the GPT-2 model on South Park episode scripts. The model was trained with the Hugging Face transformers library, and the data comes from this Kaggle dataset.


Modules

The repo contains four modules:

  • Data Preprocessing: Contains the code that preprocesses the dataset and creates the SouthPark_Data_test.pkl and SouthPark_Data_train.pkl files.
  • Train: This module contains the code that trains the model.
  • Testing: Computes the ROUGE-1, ROUGE-2, and ROUGE-L scores of the model on the test set.
  • Inference: This module is used to generate episodes.
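
For readers unfamiliar with the metrics reported by the Testing module, here is a minimal pure-Python sketch of ROUGE-1, ROUGE-2, and ROUGE-L as F1 scores. It is not the code used in this repo: the function names (`rouge_n`, `rouge_l`) and the whitespace tokenization are assumptions made for illustration, and a dedicated ROUGE package may apply stemming or other normalization that changes the numbers.

```python
from collections import Counter


def _ngrams(tokens, n):
    """Count the n-grams in a token list (empty if the list is shorter than n)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def _f1(overlap, ref_total, cand_total):
    """Combine precision (overlap / candidate) and recall (overlap / reference)."""
    if overlap == 0 or ref_total == 0 or cand_total == 0:
        return 0.0
    precision = overlap / cand_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(reference, candidate, n):
    """ROUGE-N F1: clipped n-gram overlap between reference and candidate text."""
    ref = _ngrams(reference.lower().split(), n)
    cand = _ngrams(candidate.lower().split(), n)
    overlap = sum((ref & cand).values())  # Counter '&' keeps the minimum counts
    return _f1(overlap, sum(ref.values()), sum(cand.values()))


def _lcs_length(a, b):
    """Length of the longest common subsequence, using one DP row at a time."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(reference, candidate):
    """ROUGE-L F1: based on the longest common subsequence of tokens."""
    ref = reference.lower().split()
    cand = candidate.lower().split()
    return _f1(_lcs_length(ref, cand), len(ref), len(cand))


if __name__ == "__main__":
    reference = "oh my god they killed kenny"
    generated = "oh my god you killed kenny"
    print("ROUGE-1:", rouge_n(reference, generated, 1))
    print("ROUGE-2:", rouge_n(reference, generated, 2))
    print("ROUGE-L:", rouge_l(reference, generated))
```

In this toy pair, five of six unigrams and three of five bigrams match, so ROUGE-2 is noticeably lower than ROUGE-1; the same pattern is typical for generated dialogue.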

Models

The models can be found here. Once downloaded, place the model folders you want to use in the /AI folder. When calling the get_model function, pass the name of the model folder as the checkpoint parameter.
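
The folder layout described above can be checked with a small helper before a model is loaded. This is only a sketch under stated assumptions: `resolve_checkpoint` and its `models_root` parameter are hypothetical names that are not part of this repo, and the default of `AI` assumes the models folder is relative to the working directory. The actual signature of `get_model` is not shown here, so the sketch does not call it.

```python
from pathlib import Path


def resolve_checkpoint(checkpoint, models_root="AI"):
    """Return the path of a downloaded model folder, or raise a helpful error.

    Hypothetical helper: `checkpoint` is the folder name you would pass as
    the checkpoint parameter, `models_root` is where the folders were placed.
    """
    root = Path(models_root)
    if not root.is_dir():
        raise FileNotFoundError(f"Models folder not found: {root}")

    path = root / checkpoint
    if not path.is_dir():
        # List what is actually there so a typo in the name is easy to spot.
        available = sorted(p.name for p in root.iterdir() if p.is_dir())
        raise FileNotFoundError(
            f"No model folder named {checkpoint!r} in {root}. "
            f"Available: {available or 'none'}"
        )
    return path
```

If the downloaded folders are in the format written by the transformers `save_pretrained` method (an assumption), the returned path could also be passed directly to `GPT2LMHeadModel.from_pretrained`, which accepts a local directory.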


Poster

Poster