
Conversation

@agunapal (Contributor) commented Mar 11, 2025

Finetune llama-guard with torchtune

This PR includes a notebook that does the following:

  • Shows how to fine-tune llama-guard with torchtune
  • Shows how to fine-tune with a custom dataset and a custom prompt template in torchtune (see the dataset sketch after this list)
  • Shows how to fine-tune llama-guard to return multiple PII violations
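
A minimal, hypothetical sketch of what a custom multi-label PII dataset for this kind of fine-tune could look like; the category codes, prompt wording, and file name below are illustrative and are not taken from the notebook:

```python
# Hypothetical sketch: build a JSONL dataset of Llama-Guard-style prompts whose
# expected completions list *multiple* PII categories per sample. Category
# codes, prompt wording, and the output file name are illustrative only.
import json

PII_CATEGORIES = {
    "P1": "Email address",
    "P2": "Phone number",
    "P3": "Credit card number",
}

def build_prompt(user_text: str) -> str:
    categories = "\n".join(f"{code}: {name}" for code, name in PII_CATEGORIES.items())
    return (
        "Task: Check whether the 'User' message below contains personally "
        "identifiable information (PII) matching the categories.\n"
        "<BEGIN PII CATEGORIES>\n"
        f"{categories}\n"
        "<END PII CATEGORIES>\n"
        "<BEGIN CONVERSATION>\n"
        f"User: {user_text}\n"
        "<END CONVERSATION>\n"
        "Answer 'safe' or 'unsafe', followed by the violated category codes."
    )

samples = [
    {
        # The completion lists every violated category, not just the first one.
        "input": build_prompt("Reach me at jane@example.com or 555-0100."),
        "output": "unsafe\nP1,P2",
    },
    {
        "input": build_prompt("What time does the store open?"),
        "output": "safe",
    },
]

with open("pii_guard_train.jsonl", "w") as f:
    for sample in samples:
        f.write(json.dumps(sample) + "\n")
```

A JSONL file like this can then be wired into torchtune as a custom dataset with a matching prompt template; the exact builder and config keys depend on the torchtune version used in the notebook.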

Fixes # (issue)

Feature/Issue validation/testing

Please describe the tests you ran to verify your changes and summarize the relevant results. Provide instructions so they can be reproduced, and list any relevant details of your test configuration.

  • Test A
    Logs for Test A

  • Test B
    Logs for Test B

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Thanks for contributing 🎉!

@albertodepaola albertodepaola self-requested a review March 12, 2025 16:34
@IgorKasianenko (Contributor) commented Mar 18, 2025

Thanks for the PR!

  1. Could you please add a torchtune install step to the notebook? I was missing torchao and torchtune.
  2. What is the memory requirement? I am running out of memory (OOM) with batch_size=64; maybe it is worth setting it to 16 in the tune command (see the config sketch after this list).
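
For reference, the missing packages can be installed with `pip install torchtune torchao`. Below is a minimal sketch of lowering the batch size before launching the run, assuming the fine-tuning config is a torchtune-style YAML file with a top-level `batch_size` key; the file names are hypothetical, and torchtune's CLI also supports `key=value` config overrides on the `tune run` command line:

```python
# Sketch only: lower batch_size in a copy of the fine-tuning config.
# "llama_guard_lora.yaml" and the key layout are assumptions, not the
# notebook's actual config.
from omegaconf import OmegaConf

cfg = OmegaConf.load("llama_guard_lora.yaml")   # hypothetical config path
cfg.batch_size = 16                             # 64 runs OOM on the reviewer's GPUs
OmegaConf.save(cfg, "llama_guard_lora_bs16.yaml")
```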

@IgorKasianenko (Contributor) commented:

@agunapal Thanks for the notebook, it works end to end. The only issue is that fine-tuning on 8 NVIDIA L4 GPUs doesn't fit batch size 64, and batch size 16 for 10 epochs gives slightly worse performance: an F1 score of 44.49% vs 45.18% in your example.
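
For context, here is a sketch of one way such an F1 score can be computed, assuming the evaluation compares predicted vs. reference PII category sets per sample; the actual evaluation code lives in the notebook, and the labels below are made up:

```python
# Illustrative only: micro-averaged F1 over multi-label PII predictions.
from sklearn.metrics import f1_score
from sklearn.preprocessing import MultiLabelBinarizer

references  = [{"P1", "P2"}, set(), {"P3"}]   # ground-truth category sets
predictions = [{"P1"},       set(), {"P3"}]   # model outputs parsed into sets

mlb = MultiLabelBinarizer(classes=["P1", "P2", "P3"])
y_true = mlb.fit_transform(references)
y_pred = mlb.transform(predictions)
print(f"micro F1: {f1_score(y_true, y_pred, average='micro'):.2%}")
```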

@IgorKasianenko IgorKasianenko self-assigned this Mar 31, 2025
@IgorKasianenko (Contributor) commented:

Looks good. Please clear the cell outputs for the install and training cells and I'll merge.
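
If it helps, cell outputs can be cleared from the command line with `jupyter nbconvert --clear-output --inplace <notebook>.ipynb`, or programmatically with a small script like the sketch below (the notebook file name is a placeholder):

```python
# Sketch: strip outputs and execution counts from every code cell.
# The file name is hypothetical; point it at the actual notebook.
import nbformat

NB_PATH = "finetune_llama_guard.ipynb"

nb = nbformat.read(NB_PATH, as_version=4)
for cell in nb.cells:
    if cell.cell_type == "code":
        cell["outputs"] = []
        cell["execution_count"] = None
nbformat.write(nb, NB_PATH)
```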

@IgorKasianenko (Contributor) left a review comment

LGTM

@IgorKasianenko IgorKasianenko merged commit 80149c2 into meta-llama:main Apr 1, 2025
4 checks passed
