Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions examples/sentiment/scripts/gpt-neox-20b_peft/README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@

# Fine-tuning 20B LLMs with RL on a 24GB consumer GPU
The scripts in the section detail the fine-tuning a 20b LLM in 8-bit, in order to generate positive imdb reviews. You
can find out more in our [blogpost](LINK_TODO).
can find out more in our [blogpost](https://huggingface.co/blog/trl-peft).

Overall there were three key steps and training scripts:

1. **cm_finetune_peft_imdb.py** - Fine tuning a Low Rank Adapter on a frozen 8-bit model for text generation on the imdb dataset.
2. **merge_peft_adapter.py** - Merging of the adapter layers into the base model’s weights and storing these on the hub.
3. **gpt-neo-20b_sentiment_peft.py** - Sentiment fine-tuning of a Low Rank Adapter to create positive reviews.
3. **gpt-neo-20b_sentiment_peft.py** - Sentiment fine-tuning of a Low Rank Adapter to create positive reviews.