
fine-tuning-large-language-models-on-limited-hardware

NYU Tandon, ECE-GY 9143: High Performance Machine Learning, End Semester Project

Project:

Optimize domain adaptation in natural language processing, i.e., fine-tuning large language models for a particular domain on limited hardware. This is achieved using 8-bit quantization, LoRA, and other techniques, as sketched below.
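
As an illustration of the two named techniques (this is a minimal sketch, not the repository's exact training code), the snippet below loads a causal LM in 8-bit via bitsandbytes and attaches LoRA adapters with Hugging Face transformers and peft. The model name, rank, and target modules are illustrative assumptions:

```python
# Minimal sketch: 8-bit quantization + LoRA adapters for fine-tuning
# a causal LM on limited hardware. Hyperparameters are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_name = "gpt2-xl"  # assumption: any Hugging Face causal LM works here

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    load_in_8bit=True,   # 8-bit quantization (requires bitsandbytes)
    device_map="auto",
)

# Prepare the quantized model for training (casts layer norms, enables
# gradients on the input embeddings).
model = prepare_model_for_kbit_training(model)

# LoRA: train small low-rank adapter matrices instead of the full weights.
lora_config = LoraConfig(
    r=8,                        # adapter rank
    lora_alpha=16,              # scaling factor
    target_modules=["c_attn"],  # GPT-2's fused QKV attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a small fraction is trainable
```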

Repository:

  • Fine_Tuning.ipynb: Main notebook for training and evaluation
  • train_v4.py: The same code as the notebook in Python-script form, for submitting an sbatch job
  • Inference.ipynb: Inference notebook for generating text from a saved model
  • shell scripts/data_download.sh: Script for downloading the data
  • shell scripts/run_train_job.sh: Script for running the training job (automatically runs train_v4.py)
  • gpt2_logs: Training and validation logs for the GPT-2 fine-tuning run
  • opt_logs: Training and validation logs for the OPT fine-tuning run
  • sbatch_job_log: Logs from the sbatch job for the GPT-2 fine-tuning run

How to run the code:

  • Create an sbatch job that runs run_train_job.sh (16 cores, 60 GB RAM, 1 RTX GPU); see the sketch below
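
A hypothetical sbatch wrapper matching those resources might look like the following; the GPU name (rtx8000), time limit, and log path are assumptions that depend on the cluster, while the resource flags are standard Slurm directives:

```bash
#!/bin/bash
# Hypothetical Slurm job matching the resources listed above; adjust the
# GPU type and time limit to your cluster.
#SBATCH --job-name=finetune-llm
#SBATCH --cpus-per-task=16
#SBATCH --mem=60GB
#SBATCH --gres=gpu:rtx8000:1
#SBATCH --time=24:00:00
#SBATCH --output=sbatch_job_log.out

bash "shell scripts/run_train_job.sh"  # in turn runs train_v4.py
```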

Results:

Achieved a perplexity of 7.24 when fine-tuning the GPT-2 model (1.5B parameters) on the NIH grants dataset for 17 epochs. Qualitatively, the generated text is plausibly similar to an NIH grant.
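
For reference, perplexity is the exponential of the mean cross-entropy loss on held-out data. A minimal sketch of how such a figure is typically computed with a Hugging Face causal LM (hypothetical names, not the repository's exact evaluation loop, and assuming roughly equal token counts per batch):

```python
# Minimal sketch: perplexity = exp(mean next-token cross-entropy loss).
import math
import torch

@torch.no_grad()
def perplexity(model, dataloader, device="cuda"):
    model.eval()
    total_loss, num_batches = 0.0, 0
    for batch in dataloader:
        input_ids = batch["input_ids"].to(device)
        # With labels == input_ids, HF causal LMs shift internally and
        # return the mean next-token cross-entropy loss for the batch.
        loss = model(input_ids=input_ids, labels=input_ids).loss
        total_loss += loss.item()
        num_batches += 1
    return math.exp(total_loss / num_batches)
```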
