
cp: fix: Coerce plain-dict backend to BackendConfig in model init (1784) into r0.4.0#1803

Merged
akoumpa merged 2 commits into r0.4.0 from cherry-pick-1784-r0.4.0 on Apr 19, 2026

Conversation

@svcnvidia-nemo-ci
Contributor

beep boop [🤖]: Hi @adil-a 👋,

we've cherry-picked #1784 into r0.4.0 for you! 🚀

Please review and approve this cherry-pick at your convenience!

* fix: Coerce plain-dict backend to BackendConfig in model init

When backend is specified via CLI override (e.g. --model.backend.attn
sdpa) without a _target_ key in the YAML, the config system passes it
as a plain dict. This causes an AttributeError in model constructors that
access backend.rms_norm, backend.linear, etc.

Convert the dict to BackendConfig(**dict) in _init_model, which is
the single gateway between the config system and all model constructors.
This fixes the issue for all 17+ custom model implementations.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* test: Add unit tests for dict-to-BackendConfig coercion

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* style: Remove unused pytest import

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* fix(test): Use environment-aware BackendConfig defaults in assertion

BackendConfig defaults for attn/linear depend on TE availability,
so hardcoding "torch" fails on GPU CI where TE is present. Compare
against BackendConfig() defaults instead.
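The test pattern described above can be sketched as below. This is an illustrative, hypothetical reconstruction: the TE probe, the field names, and `coerce` are assumptions, not the project's actual test code.

```python
from dataclasses import dataclass

# Probe for Transformer Engine, as the commit describes: defaults
# differ depending on whether TE is importable in this environment.
try:
    import transformer_engine  # noqa: F401
    _HAS_TE = True
except ImportError:
    _HAS_TE = False


@dataclass
class BackendConfig:
    # Hypothetical environment-dependent defaults.
    attn: str = "te" if _HAS_TE else "torch"
    linear: str = "te" if _HAS_TE else "torch"


def coerce(backend: dict) -> BackendConfig:
    return BackendConfig(**backend)


# Fragile assertion: hardcodes the CPU-only default, so it fails on
# GPU CI where TE is present:
#     assert coerce({}).attn == "torch"

# Robust assertion: compare against this environment's own defaults.
assert coerce({}) == BackendConfig()
```

Comparing against `BackendConfig()` keeps the test green on both CPU-only and TE-enabled runners, since both sides pick up the same environment-dependent defaults.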

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* fix(test): Compare inputs_embeds generate against input_ids generate

The old test compared cached generate(inputs_embeds) against manual
uncached decode (use_cache=False). Mamba uses different CUDA kernels
for cached vs uncached paths, causing bf16 divergence. Compare both
generate() paths instead, which both use cached kernels.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

---------

Signed-off-by: adil-a <adil.asif2000@hotmail.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@svcnvidia-nemo-ci
Contributor Author

/ok to test 36d1923

@copy-pr-bot

copy-pr-bot Bot commented Apr 13, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa
Contributor

akoumpa commented Apr 19, 2026

/ok to test 44979df

@akoumpa akoumpa merged commit 38da59e into r0.4.0 Apr 19, 2026
52 of 54 checks passed
@akoumpa akoumpa deleted the cherry-pick-1784-r0.4.0 branch April 19, 2026 20:22

Labels

cherry-pick, Run CICD, Trigger Testing CICD


3 participants