Some helpers and examples for creating an LLM fine-tuning dataset
Based on the tutorial video here: https://www.youtube.com/watch?v=pCX_3p40Efc
- 1.Decompress-Bigquery.py: Decompresses the gzipped JSON files from the BigQuery export of Reddit comments (sketch below).
- 2.load_subreddits.py: Builds large pickle files containing the comments from the subreddits you target (sketch below).
- 3.Build_training_data.py: Loads the dataframes from step 2 and builds conversation chains (chain-building sketch below). Run as many instances simultaneously as you like to speed this up, or rework it into a properly multi-processed script.
- 4.make_train_json.py: Creates the JSON file that you can upload straight to Hugging Face as a dataset to fine-tune on (sketch below).
- 5.QLoRA-Fine-Tune.ipynb: Fine-tunes on your data (setup sketch below). Modified from: https://colab.research.google.com/drive/1PEQyJO1-f6j0S_XJ8DV50NkpzasXkrzd?usp=sharing#scrollTo=x-xPb-_qB0dz
- 6.Testing-fine-tune.ipynb: Tests the fine-tuned model on some pre-made prompts (generation sketch below). To-do: add some multi-speaker prompt examples.
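
A minimal sketch of step 1, assuming the BigQuery export lands as a folder of `*.json.gz` files; the paths here are hypothetical:

```python
import glob
import gzip
import shutil

# Stream-decompress each gzipped export file next to the original,
# without loading whole files into memory.
for gz_path in glob.glob("bigquery_export/*.json.gz"):
    out_path = gz_path[:-3]  # strip the ".gz" suffix
    with gzip.open(gz_path, "rb") as src, open(out_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    print(f"decompressed {gz_path} -> {out_path}")
```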
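
A minimal sketch of step 2, assuming newline-delimited JSON comment records with the usual Reddit fields (`subreddit`, `body`, `id`, `parent_id`); the subreddit list and file names are hypothetical:

```python
import glob
import json
import pandas as pd

TARGET_SUBREDDITS = {"AskReddit", "explainlikeimfive"}  # hypothetical targets

rows = []
for path in glob.glob("bigquery_export/*.json"):
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            comment = json.loads(line)
            if comment.get("subreddit") in TARGET_SUBREDDITS:
                rows.append(comment)

# One large dataframe of just the targeted subreddits, pickled for step 3.
pd.DataFrame(rows).to_pickle("comments_targeted.pkl")
```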
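
A minimal sketch of the idea behind step 3: index comments by `id`, then follow `parent_id` links upward to reconstruct reply chains. Field names follow the standard Reddit comment schema; the depth limit and file names are hypothetical:

```python
import pandas as pd

df = pd.read_pickle("comments_targeted.pkl")
by_id = {row["id"]: row for row in df.to_dict("records")}

def build_chain(comment, max_depth=6):
    """Walk from a comment up through its ancestors, returning bodies oldest-first."""
    chain = [comment["body"]]
    parent_id = comment.get("parent_id", "")
    # "t1_" parents are comments; "t3_" parents are the submission itself, so stop there.
    while parent_id.startswith("t1_") and len(chain) < max_depth:
        parent = by_id.get(parent_id[3:])
        if parent is None:  # parent may live in a file we didn't load
            break
        chain.append(parent["body"])
        parent_id = parent.get("parent_id", "")
    return list(reversed(chain))

# Keep only chains with at least one reply.
chains = [chain for chain in (build_chain(c) for c in by_id.values()) if len(chain) > 1]
```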
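
A minimal sketch of step 4, assuming a single-`text`-field record format that the Hugging Face `datasets` JSON loader accepts; the speaker tags and prompt template are assumptions and may differ from what the tutorial uses:

```python
import json

# `chains` is the list of oldest-first conversation chains produced in step 3.
chains = [["What is QLoRA?", "Low-rank adapters trained on top of a 4-bit quantized base model."]]

records = []
for chain in chains:
    text = ""
    for i, message in enumerate(chain):
        # Alternate speakers through the chain; tags here are hypothetical.
        speaker = "### Human:" if i % 2 == 0 else "### Assistant:"
        text += f"{speaker} {message}\n"
    records.append({"text": text.strip()})

with open("train.json", "w", encoding="utf-8") as f:
    json.dump(records, f, ensure_ascii=False, indent=2)
```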
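
A minimal QLoRA setup sketch for step 5, loading a 4-bit base model with bitsandbytes and attaching LoRA adapters with peft. The base model name, target modules, and hyperparameters are assumptions, not necessarily what the notebook uses:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_model = "huggyllama/llama-7b"  # hypothetical base model

# Load the base model quantized to 4-bit NF4.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Attach LoRA adapters; only these small matrices get trained.
model = prepare_model_for_kbit_training(model)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # which modules to adapt varies by architecture
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

From here the dataset produced in step 4 can be tokenized and passed to a standard `transformers` `Trainer`.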
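
A minimal generation sketch for step 6, loading a saved LoRA adapter on top of the quantized base model; the adapter directory and prompt format are hypothetical:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_model = "huggyllama/llama-7b"    # hypothetical base model
adapter_dir = "qlora-reddit-adapter"  # hypothetical fine-tuned adapter directory

bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)
base = AutoModelForCausalLM.from_pretrained(
    base_model, quantization_config=bnb_config, device_map="auto"
)
model = PeftModel.from_pretrained(base, adapter_dir)
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Prompt in the same format the training data used.
prompt = "### Human: What's the best way to learn Python?\n### Assistant:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```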