
Fine-tune LLaMA 2 (7B) with LoRA on meta-math/MetaMathQA Dataset

Fine-tuning and inference code for the MetaMath dataset

Python 3.9+

Model Details

MetaMath-Fine-Tune-with-LoRA is trained to reason about and answer mathematical problems on the meta-math/MetaMathQA dataset. We used meta-llama/Llama-2-7b-hf as the base model and fine-tuned it with LoRA.
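
The exact LoRA hyperparameters are not listed here; the following is a minimal sketch of such a setup with Hugging Face peft, where the rank, alpha, dropout, and target modules are illustrative assumptions rather than the values used for the released adapters.

from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load the base model
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

# Illustrative LoRA configuration (assumed values, not the repo's exact settings)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)

# Wrap the base model so only the low-rank adapter matrices are trainable
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()

Training then proceeds with the standard causal language modeling objective on the MetaMathQA question-answer pairs.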

Model Description

Results on GSM8K

Epoch | Accuracy on the GSM8K test set | Model Link
1     | 0.609                          | 🤗 Hugging Face
2     | 0.635                          | 🤗 Hugging Face
3     | 0.641                          | 🤗 Hugging Face
4     | 0.641                          | 🤗 Hugging Face

Deployment

from transformers import AutoModelForCausalLM

# Load the base model, then attach the pre-trained LoRA adapter
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
model.load_adapter("shuyuej/metamath_lora_llama2_7b_4_epoch")
model.enable_adapters()
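
For reference, a minimal inference sketch follows; the prompt format and generation settings here are assumptions for illustration, not the repository's evaluation setup.

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

# Illustrative math prompt; the prompt template used for GSM8K evaluation may differ
prompt = "Question: Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))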
