This repository contains the code and data used to fine-tune the Llama2-Chat model on the OpenOrca dataset with QLoRA (Quantized Low-Rank Adaptation), a 4-bit quantization PEFT (parameter-efficient fine-tuning) technique.
The OpenOrca-Clean dataset is a refined version derived from the original OpenOrca dataset.
The Llama2-OpenOrca-Clean dataset is tailored specifically for fine-tuning the Llama2-Chat model. It is derived from the OpenOrca-Clean dataset, further adapted to fit the Llama 2 prompt template. The dataset comprises a single column labeled "text," structured in the following format:
![Screenshot 2024-04-06 115410](https://private-user-images.githubusercontent.com/111623667/320187060-b9f35f1d-5bcf-4d26-9e34-efbd4df7f744.png)
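As a rough sketch, a row can be assembled into the "text" column with a helper like the one below. The column names (`system_prompt`, `question`, `response`) follow the OpenOrca schema, and the template is the standard Llama 2 chat prompt format; adjust both to match the actual dataset fields.

```python
# Assemble one OpenOrca-style row into the single "text" column,
# using the standard Llama 2 chat prompt template.
def format_example(system_prompt: str, question: str, response: str) -> str:
    return (
        "<s>[INST] <<SYS>>\n"
        f"{system_prompt}\n"
        "<</SYS>>\n\n"
        f"{question} [/INST] {response} </s>"
    )

row = {
    "system_prompt": "You are a helpful assistant.",
    "question": "What is QLoRA?",
    "response": "QLoRA fine-tunes 4-bit quantized models with LoRA adapters.",
}
text = format_example(row["system_prompt"], row["question"], row["response"])
print(text)
```

Mapping this function over the OpenOrca-Clean rows yields the one-column dataset described above.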
- Base Model: Llama-2-7B-Chat-hf
- Fine-tuning Technique: 4-bit quantization using QLoRA PEFT
- Dataset Used: Llama2-OpenOrca-Clean
The fine-tuning process trains the Llama2-Chat model with 4-bit quantization using the QLoRA technique: the base model's weights are quantized to 4 bits and frozen, while small low-rank adapter matrices are trained on top. This keeps the parameter representation compact and minimizes memory and computational overhead.
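To make the quantization step concrete, here is a toy symmetric absmax 4-bit quantizer in plain Python. This is a simplified sketch of the general idea only; QLoRA actually uses the non-uniform NF4 data type (handled in practice by `BitsAndBytesConfig(load_in_4bit=True)` in the Hugging Face stack), not this uniform scheme.

```python
# Toy symmetric absmax 4-bit quantization: map each weight to one of
# 16 signed integer levels (-8..7) plus a per-tensor scale factor.
# Illustrative only -- QLoRA's NF4 uses a normal-distribution-aware codebook.
def quantize_4bit(weights):
    scale = max(abs(w) for w in weights) / 7.0  # absmax maps to level 7
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize_4bit(q, scale):
    return [v * scale for v in q]

weights = [0.42, -1.31, 0.07, 0.88, -0.55]
q, scale = quantize_4bit(weights)
restored = dequantize_4bit(q, scale)
max_err = max(abs(w - r) for w, r in zip(weights, restored))
print(q, round(max_err, 4))
```

Each weight now needs only 4 bits instead of 16 or 32, at the cost of a small rounding error bounded by half the scale; the trainable LoRA adapters then compensate for this lost precision during fine-tuning.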
## Llama-2-7B-Chat-OpenOrca
Our latest model, fine-tuned on 1,000 examples from the Llama2-OpenOrca-Clean dataset using 4-bit QLoRA, is now available.