
Fine-tuning Gemma using QLoRA and Supervised Fine-tuning

This repository contains a comprehensive notebook and tutorial on how to fine-tune the gemma-7b-it model using QLoRA and Supervised Fine-tuning.

Overview

This project demonstrates the steps required to fine-tune the Gemma model for tasks like code generation. We use QLoRA (4-bit quantization with trainable LoRA adapters) to reduce memory usage and the SFTTrainer from the trl library for supervised fine-tuning.
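As a rough illustration of how these pieces fit together, the sketch below loads a 4-bit quantized base model and defines the LoRA adapters. The checkpoint name, target modules, and hyperparameters are illustrative assumptions and may differ from the exact settings in the notebook.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig

# Assumed checkpoint for illustration; swap in "google/gemma-7b-it"
# for the larger variant if you have an A100.
model_id = "google/gemma-2b-it"

# QLoRA step 1: load the base model in 4-bit NF4 so it fits in a single GPU.
# float16 compute keeps the config compatible with a free Colab T4.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map={"": 0},
)

# QLoRA step 2: attach small trainable LoRA adapters;
# the quantized base weights stay frozen during training.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```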

Notebook

The notebook is available on my GitHub: gemma-Instruct-2b-Finetuning-on-alpaca.ipynb

Prerequisites

Before running the notebook, ensure you have the following:

1. GPU:
   • gemma-2b can be fine-tuned on a T4 GPU (free on Google Colab).
   • gemma-7b requires an A100 GPU.

2. Python Packages: Install the necessary packages with the following command:

   !pip3 install -q -U bitsandbytes==0.42.0 peft==0.8.2 trl==0.7.10 accelerate==0.27.1 datasets==2.17.0 transformers==4.38.0
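With the packages in place, the fine-tuning run follows the SFTTrainer pattern sketched below. The tatsu-lab/alpaca dataset ID, the Alpaca prompt template, and the training hyperparameters are illustrative assumptions rather than the notebook's exact values; it reuses the model, tokenizer, and lora_config from the earlier sketch.

```python
from datasets import load_dataset
from transformers import TrainingArguments
from trl import SFTTrainer

# Alpaca-style instruction data; the exact dataset ID is an assumption.
dataset = load_dataset("tatsu-lab/alpaca", split="train")

def formatting_func(examples):
    # Fold instruction/input/output columns into single prompt strings (batched).
    texts = []
    for instruction, inp, output in zip(
        examples["instruction"], examples["input"], examples["output"]
    ):
        texts.append(
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{inp}\n\n"
            f"### Response:\n{output}"
        )
    return texts

trainer = SFTTrainer(
    model=model,                     # 4-bit model from the earlier sketch
    tokenizer=tokenizer,
    train_dataset=dataset,
    peft_config=lora_config,         # LoRA adapters from the earlier sketch
    formatting_func=formatting_func,
    max_seq_length=512,
    args=TrainingArguments(
        output_dir="gemma-alpaca-sft",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,
        max_steps=100,
        fp16=True,
        logging_steps=10,
    ),
)

trainer.train()
```

Because only the LoRA adapter weights are updated while the quantized base model stays frozen, memory use remains low enough for the GPUs listed above.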
