README

Overview

This repository contains code for fine-tuning and merging deep learning models using LoRA (Low-Rank Adaptation). The repository includes:

Baseline model training and evaluation (baseline_model).
LoRA-based model configuration, training, and evaluation (lora_manual, lora_model, etc.).
Command-line usage through main.py with multiple configurable steps.

Dependencies

Python 3.11
Torch 2.1.2
Transformers
BitsAndBytes
PEFT
WANDB
And other libraries as listed in pyproject.toml or poetry.lock.

Installation

Clone the repository.
From the repository root, install dependencies (for example, using Poetry):
```
poetry install
```
Authenticate with Weights & Biases if needed:
```
wandb login
```

Usage

Most functionality is accessed through main.py with various steps:

Train a LoRA or baseline model:

python main.py --step train --type lora --epochs 5

Perform a custom sweep with Weights & Biases:

python main.py --step sweep_manual --type lora

Run the baseline model:
```
python main.py --step BM --epochs 3
```

Print trainable parameters of a LoRA model:

python main.py --step print_params --r 16 --lora_alpha 2.0 ...

Arguments like --learning_rate, --batch_size, etc., can be passed to customize training.

Makefile Usage

Below are commands defined in the makefile to help manage and run experiments:

make init
Installs dependencies, initializes Poetry, and prompts for Wandb auth.
make init_no_wandb
Same as above but skips Wandb auth.
make run_all_experiments
Submits multiple training jobs for different model variations (baseline, LoRA, etc.) on HPC.

For more details, check the makefile at the root of this repo.

File Structure (Key Parts)

src/model/lora_manual.py: Contains the primary LoRATrainer class and get_lora_config function.
src/config/model_config.py: Sets up different LoRA configurations (e.g., "lora", "Q_lora", etc.).
main.py: Orchestrates model training, evaluation, and inference steps via CLI arguments.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
data		data
figures		figures
hpc/bash		hpc/bash
models		models
profiling		profiling
src		src
.gitignore		.gitignore
__init__.py		__init__.py
main.py		main.py
makefile		makefile
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

README

Overview

Dependencies

Installation

Usage

Makefile Usage

File Structure (Key Parts)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

MathiasDamsgaard/Deep_learning_finetuning_and_merge

Folders and files

Latest commit

History

Repository files navigation

README

Overview

Dependencies

Installation

Usage

Makefile Usage

File Structure (Key Parts)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages