
LLaMA-2 QLoRA experiment


I. Introduction

  • I built this project to instruction-tune the LLaMA-2 model, applying memory-saving training techniques such as QLoRA, DDP, and half-precision (see the sketch after this list).
  • You can run it in a Kaggle or Colab notebook.
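A minimal sketch of a QLoRA setup, assuming the Hugging Face transformers/peft/bitsandbytes stack; the hyperparameters, module names, and Hub ID below are illustrative, not necessarily what this repo uses:

  import torch
  from transformers import AutoModelForCausalLM, BitsAndBytesConfig
  from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

  # Quantize the frozen base weights to 4-bit and compute in half precision.
  bnb_config = BitsAndBytesConfig(
      load_in_4bit=True,                     # the "Q" in QLoRA
      bnb_4bit_quant_type="nf4",             # NormalFloat4 quantization
      bnb_4bit_compute_dtype=torch.float16,  # half-precision compute
      bnb_4bit_use_double_quant=True,        # also quantize the quantization constants
  )
  model = AutoModelForCausalLM.from_pretrained(
      "meta-llama/Llama-2-7b-hf",            # assumed Hub ID for LLaMA-2 7B
      quantization_config=bnb_config,
      device_map="auto",
  )
  model = prepare_model_for_kbit_training(model)

  # Train small low-rank adapters on top of the frozen 4-bit base model.
  lora_config = LoraConfig(
      r=16, lora_alpha=32, lora_dropout=0.05,  # illustrative values
      target_modules=["q_proj", "v_proj"],     # attention projections
      task_type="CAUSAL_LM",
  )
  model = get_peft_model(model, lora_config)   # only the adapters are trainable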

II. Dataset

The dataset I used is Bactrian-X, which includes 54 languages. However, I only worked with the Vietnamese subset.
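To make this concrete, a hedged sketch of loading only the Vietnamese split with the Hugging Face datasets library (the MBZUAI/Bactrian-X Hub ID and the "vi" config name are my assumptions, not necessarily what this repo uses):

  from datasets import load_dataset

  # Load only the Vietnamese subset of Bactrian-X (assumed Hub ID and config).
  dataset = load_dataset("MBZUAI/Bactrian-X", "vi")
  print(dataset["train"][0])  # instruction / input / output fields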

III. Model

I use the LLaMA-2 7B model for my experiments. If your hardware has more memory, you can experiment with the larger LLaMA-2 variants, such as LLaMA-2 13B or LLaMA-2 70B.
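For reference, these are the checkpoint IDs you would swap in (assuming the Hugging Face Hub naming; access to the LLaMA-2 weights must first be requested from Meta on the Hub):

  model_checkpoint = "meta-llama/Llama-2-7b-hf"     # used in this project (assumed ID)
  # model_checkpoint = "meta-llama/Llama-2-13b-hf"  # larger variants, if memory allows
  # model_checkpoint = "meta-llama/Llama-2-70b-hf"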

IV. How to use

First, !git clone this repo, then install the dependencies with the command !pip install --upgrade -r requirements.txt.
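In a Kaggle or Colab notebook, the setup looks like this (the %cd into the default clone directory is an assumption about the layout):

  !git clone https://github.com/longday1102/VietAI-experiment-LLaMA2.git
  %cd VietAI-experiment-LLaMA2
  !pip install --upgrade -r requirements.txt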

  1. Train:
    • You can use the script in my notebook to train from scratch. This is the path to the checkpoint after I trained the model for more than 1 epoch: checkpoint-1
    • Note: run.py exposes several arguments that you can adjust as needed. If you stop training and later want to continue (for example, because performance is not yet what you expected), pass the adapter model path to the model_weight_path argument and the state checkpoint path to the state_checkpoint argument in the script.
  2. Inference:
    Inference template (a filled-in example follows below):
    from inference import Inference
    infer = Inference(model_checkpoint = "{your_llama2-version}", model_weight_path = "{your_model_adapter_weight_path}")
    instruction = "{your_instruction}"
    input = "{your_input}"  # or None if there is no input
    print(infer(instruction = instruction, input = input)["response"])
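For instance, a filled-in call might look like this (the base checkpoint ID and the prompt are placeholders of my choosing; checkpoint-1 is the trained adapter linked above):

  from inference import Inference

  infer = Inference(
      model_checkpoint = "meta-llama/Llama-2-7b-hf",  # assumed base-model Hub ID
      model_weight_path = "checkpoint-1",             # trained adapter from step 1
  )
  instruction = "Giải thích ngắn gọn QLoRA là gì."    # "Briefly explain what QLoRA is."
  print(infer(instruction = instruction, input = None)["response"])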

Thank you so much for checking out this project! 😊