This is code + flask app for finetuning RedPajama 3B model on a Kaggle dataset of 200000 dadjokes from Reddit. The app runs inference on a checkpoint to generate a joke and then renders it in Joe Biden's voice.
Download the dataset here
Then run create_dataset.py
in redpajama_lora_finetune/data/ folder to create train.jsonl
and eval.jsonl
pip install -r requirements.txt
To run finetuning CUDA_SET_VISIBLE_DEVICES=0 python train.py
Access the app here
This was built at Any Thing But Wrappers Hackathon.