feat: implement NEFTune noisy embeddings for instruction fine-tuning by stanley1208 · Pull Request #1686 · NVIDIA-NeMo/Automodel

stanley1208 · 2026-04-06T08:58:09Z

What does this PR do?

Implement NEFTune (Noisy Embeddings Fine-Tuning) as a training component with recipe config support.

What is NEFTune?

According to the NEFTune paper, adding uniform random noise to token embeddings during fine-tuning improves instruction-following quality, often by a large margin, with no additional compute or data overhead.

Implementation

New component: nemo_automodel/components/training/neftune.py

NEFTune class with activate(model) / deactivate(model) methods
Uses register_forward_hook on the model's embedding layer
Adds uniform noise to the embedding output, scaled by alpha / sqrt(seq_len * hidden_dim)
Noise only applied during training (checks module.training)
_get_input_embeddings() helper finds embeddings via HF method or common attribute names
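
The hook mechanism listed above can be sketched roughly as follows. This is a minimal illustration, not the code from neftune.py: the names NEFTuneSketch, find_input_embeddings and ToyModel are hypothetical, the fallback attribute names are assumptions, and treating a non-positive alpha as "no noise" is also an assumption, since the PR does not state how negative values are handled.

```python
import math

import torch
import torch.nn as nn


def find_input_embeddings(model: nn.Module) -> nn.Module:
    """Hypothetical helper: try the HF-style getter, then common attribute names."""
    getter = getattr(model, "get_input_embeddings", None)
    if callable(getter):
        return getter()
    for name in ("embed_tokens", "wte"):  # assumed attribute names
        module = getattr(model, name, None)
        if isinstance(module, nn.Embedding):
            return module
    raise ValueError("could not locate an input embedding layer")


class NEFTuneSketch:
    """Illustrative stand-in for a NEFTune component (not the PR's class)."""

    def __init__(self, noise_alpha: float = 5.0):
        self.noise_alpha = noise_alpha
        self._handle = None

    @property
    def is_active(self) -> bool:
        return self._handle is not None

    def _hook(self, module, inputs, output):
        # Returning None from a forward hook leaves the output unchanged.
        # Assumption: non-positive alpha simply disables the noise.
        if not module.training or self.noise_alpha <= 0:
            return None
        seq_len, hidden_dim = output.size(-2), output.size(-1)
        scale = self.noise_alpha / math.sqrt(seq_len * hidden_dim)
        noise = torch.empty_like(output).uniform_(-scale, scale)
        return output + noise

    def activate(self, model: nn.Module) -> None:
        if self.is_active:  # a second activate() must not stack hooks
            return
        embeddings = find_input_embeddings(model)
        self._handle = embeddings.register_forward_hook(self._hook)

    def deactivate(self, model: nn.Module) -> None:
        if self._handle is not None:
            self._handle.remove()
            self._handle = None


class ToyModel(nn.Module):
    """Tiny model used only to exercise the sketch."""

    def __init__(self, vocab_size: int = 10, hidden_dim: int = 8):
        super().__init__()
        self.embed_tokens = nn.Embedding(vocab_size, hidden_dim)

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        return self.embed_tokens(input_ids)


if __name__ == "__main__":
    model = ToyModel()
    neftune = NEFTuneSketch(noise_alpha=5.0)
    neftune.activate(model)
    model.train()
    ids = torch.tensor([[1, 2, 3, 4]])
    print(model(ids).shape)  # noisy embeddings, same shape as the clean ones
    neftune.deactivate(model)
```

Using a forward hook keeps the model's own code untouched, and removing the hook handle restores the original behaviour, which is what makes a deactivate step straightforward.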

Recipe integration: nemo_automodel/recipes/llm/train_ft.py

Adds an optional neftune config section, handled in setup()
NEFTune is activated automatically after the model is built and before the training loop starts

Usage in YAML config: Add neftune.noise_alpha: 5.0 to your training recipe.
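
As a rough illustration of the usage line above, the dotted key neftune.noise_alpha would correspond to a nested section like the fragment below. Only the neftune section and its noise_alpha key come from this PR; the rest of the recipe is omitted, and the comment is an explanatory assumption.

```yaml
# Fragment of a training recipe; all other sections are unchanged and omitted.
neftune:
  noise_alpha: 5.0  # noise scale alpha; omit the section to train without NEFTune (assumed, since the section is optional)
```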

Tests

tests/unit_tests/training/test_neftune.py

8 unit tests covering: noise during training, no noise during eval, deactivation, zero alpha, negative alpha, double activate, is_active property
3 tests for _get_input_embeddings helper

…1686)

* feat: implement NEFTune noisy embeddings for instruction fine-tuning
  Signed-off-by: stanley1208 <stanley.mei08@gmail.com>
  Made-with: Cursor
* fix: correct test_noise_applied_during_training to compare clean vs noisy output
  Signed-off-by: stanley1208 <stanley.mei08@gmail.com>
  Made-with: Cursor
* feat: add example YAML config for NEFTune fine-tuning
  Signed-off-by: stanley1208 <stanley.mei08@gmail.com>
  Made-with: Cursor

---------

Signed-off-by: stanley1208 <stanley.mei08@gmail.com>

feat: implement NEFTune noisy embeddings for instruction fine-tuning

a6f221b

Signed-off-by: stanley1208 <stanley.mei08@gmail.com> Made-with: Cursor

stanley1208 requested review from HuiyingLi, ZhiyuLi-Nvidia, adil-a, akoumpa, hemildesai and pthombre as code owners April 6, 2026 08:58

github-actions Bot added the community-request label Apr 6, 2026

fix: correct test_noise_applied_during_training to compare clean vs noisy output

cffa75e

Signed-off-by: stanley1208 <stanley.mei08@gmail.com> Made-with: Cursor

stanley1208 mentioned this pull request Apr 6, 2026

Implement NEFTune #1221

Closed

copy-pr-bot Bot temporarily deployed to test April 6, 2026 19:21 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci April 6, 2026 19:21 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci April 6, 2026 19:43 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci April 6, 2026 19:48 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci April 6, 2026 20:03 Inactive

feat: add example YAML config for NEFTune fine-tuning

e45a194

Signed-off-by: stanley1208 <stanley.mei08@gmail.com> Made-with: Cursor

akoumpa reviewed Apr 6, 2026

Comment thread examples/llm_finetune/llama3_2/llama3_2_1b_squad_neftune.yaml

akoumpa approved these changes Apr 6, 2026

akoumpa merged commit 1c3944a into NVIDIA-NeMo:main Apr 6, 2026
4 checks passed

akoumpa merged 3 commits into NVIDIA-NeMo:main from stanley1208:feat/implement-neftune
