NLU Project

Update: The paper is out on arXiv: https://arxiv.org/abs/2310.00892

All of the supervised learning code is on the master branch. To run RL training, please refer to the feature/ppo branch.

Code adapted from:
https://github.com/facebookresearch/ParlAI/tree/main/parlai
https://github.com/lvwerra/trl
https://github.com/francoisstamant/lyrics-generation-with-GPT2