Pytorch based Finetuning LLMs (PEFT) #78

Merged
iswaryaalex merged 25 commits into main from iswarya/pytorch-finetuning
Mar 11, 2026

Conversation

iswaryaalex (Collaborator) commented Feb 16, 2026

This PR brings in PyTorch fine-tuning playbooks covering SFT full fine-tuning and PEFT techniques like LoRA and QLoRA, using PyTorch with the ROCm backend.

  • Added 3 scripts in assets: train_full_finetuning, train_lora.py, and train_qlora.py (see the LoRA sketch after this list)
  • Added a README that walks through the different techniques: overview, comparison, details, how to fine-tune, how to customize, and finally how to use the fine-tuned models/adapters
  • Note: for QLoRA/LoRA with bitsandbytes, the bnb package is not supported on Windows; it only works on Linux
  • Tested on STX Halo (Windows and Linux) with ROCm 7.11.0 and PyTorch
  • Added CI/CD tests for the playbook README. Note: the entire fine-tuning is not run during testing; the tests cover module imports, initializations, etc.
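
For context, here is a minimal sketch of the kind of LoRA setup the scripts implement (the model name, target modules, and hyperparameters below are illustrative assumptions, not the playbook's actual values):

```python
# Minimal LoRA sketch (illustrative; not the exact contents of train_lora.py).
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "google/gemma-3-4b-it"  # assumed; gated, license must be accepted on HF
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Freeze the base weights and inject small trainable low-rank adapters.
lora_config = LoraConfig(
    r=16,                                 # adapter rank
    lora_alpha=32,                        # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # assumed attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of params are trainable
```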

@iswaryaalex iswaryaalex marked this pull request as draft February 16, 2026 22:10
@iswaryaalex iswaryaalex marked this pull request as ready for review February 17, 2026 05:50
@iswaryaalex iswaryaalex self-assigned this Feb 17, 2026
adamlam2-amd (Collaborator) commented Mar 5, 2026

Nice, overall this looks decent. A couple of thoughts:

  1. Why did we choose Gemma3-4B? It's also a gated model, so users will need to go to Hugging Face to accept Google's license; this is not yet made clear.
  2. We should make it clearer that bitsandbytes isn't supported on Windows.
  3. The LoRA and QLoRA explanations are a bit weak; I don't think typical users will be able to follow them. If we want to explain them, we should be more thorough.
  4. To compensate for the extra length above, we can save space in the hyperparameter tuning section. Instead of showing the effect of each value set higher or lower, just tell readers to adjust the values themselves. Essentially, be more concise here.
  5. Is the command rocm-smi? I think amd-smi might be the right one.
  6. Wandb can be optional, since it's a fairly significant addition (see the sketch after this list).
  7. Lastly, it would be good to add screenshots wherever you think is best, maybe of the result of the fine-tuning?
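
For point 6, one pattern that could work: only enable the wandb reporter when the package is actually installed. A sketch, assuming the scripts use transformers' TrainingArguments:

```python
# Sketch: make wandb opt-in based on availability (assumes TrainingArguments).
import importlib.util

from transformers import TrainingArguments

report_to = "wandb" if importlib.util.find_spec("wandb") else "none"
args = TrainingArguments(output_dir="outputs", report_to=report_to)
```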

danielholanda (Collaborator) left a comment

Reviewed and tested on Windows! Worked great.

Please wait for Adam's review before merging.

Comment threads: playbooks/supplemental/pytorch-finetuning/README.md (5 threads), playbooks/supplemental/pytorch-finetuning/playbook.json (1 thread, outdated)

adamlam2-amd (Collaborator) left a comment

Please address Daniel's comments as well as the comment I left above.

Otherwise, most of it looks good. Will retest when QA sees it.

iswaryaalex (Collaborator, Author) commented

@danielholanda Adding a hidden test that does a single run of LoRA (model loading, dataset loading, and one iteration) with a timeout; a sketch of the idea is below.
[screenshot: output of the local test run]
Ran the test locally; looks good.
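
Roughly, the test looks like this (a sketch; build_lora_trainer is a hypothetical helper standing in for the real setup, and the timeout marker needs the pytest-timeout plugin):

```python
# Smoke-test sketch: load model + dataset, run exactly one training iteration.
import pytest

@pytest.mark.timeout(600)  # fail instead of hanging CI (needs pytest-timeout)
def test_lora_single_iteration():
    trainer = build_lora_trainer(max_steps=1)  # hypothetical helper
    result = trainer.train()
    assert result.training_loss is not None
```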

iswaryaalex (Collaborator, Author) commented

@adamlam2-amd Thanks for the review. My key changes are:

  • Added a test for a single run of LoRA
  • Added specific instructions for loading gated models like Gemma (we based everything on this model for stability; others were iffy). Hugging Face install and authentication steps are also added (see the sketch after this list)
  • Added a "Dataset example" section covering what the format means, how to use it, and what we accomplish with fine-tuning
  • Added more details on LoRA/QLoRA
  • Made the hyperparameter tuning section more concise
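
The authentication steps boil down to something like this (a sketch; the token is a placeholder, and Google's license still has to be accepted on the Gemma model page first):

```python
# Sketch: authenticate with Hugging Face to download a gated model like Gemma.
from huggingface_hub import login

login(token="hf_...")  # placeholder token; or run `huggingface-cli login` instead
```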

danielholanda (Collaborator) commented

@adamlam2-amd Anything else you would like to see here before approving?

adamlam2-amd (Collaborator) left a comment

lgtm

iswaryaalex (Collaborator, Author) commented

Alright! Merging

@iswaryaalex iswaryaalex merged commit e53108c into main Mar 11, 2026
6 checks passed