
Fix Unsloth autocast dtype for bf16 models #663

Merged
vivekkalyan merged 2 commits into main from fix/infer-unsloth-autocast-dtype on April 27, 2026

Conversation

@vivekkalyan (Collaborator)

Summary

PipelineTrainer + LocalBackend can load bf16 base models like Llama 3.1 in 16-bit mode while ACCELERATE_MIXED_PRECISION is unset. We were defaulting the Unsloth logprob path to fp16 in that case, which can hit Half vs BFloat16 matmul mismatches.
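
As a minimal illustration of that failure class (a standalone repro of the dtype mismatch, not the actual Unsloth call path):

```python
import torch

a = torch.randn(2, 2, dtype=torch.float16)   # what the old fp16 default produced
b = torch.randn(2, 2, dtype=torch.bfloat16)  # what a bf16 checkpoint like Llama 3.1 loads as
torch.matmul(a, b)  # RuntimeError: mat1 and mat2 must have the same dtype
```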

This change keeps explicit mixed-precision settings authoritative; when the env var is unset, it infers the autocast dtype from the loaded model parameters. Unknown mixed-precision or model-dtype states fail early instead of silently falling through.
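
A minimal sketch of that resolution order (hypothetical helper name; the actual function and names in the diff may differ):

```python
import os

import torch


def infer_autocast_dtype(model: torch.nn.Module) -> torch.dtype:
    """Resolve the autocast dtype for the Unsloth logprob path (sketch)."""
    # Explicit mixed-precision settings stay authoritative.
    mixed = os.environ.get("ACCELERATE_MIXED_PRECISION")
    if mixed == "bf16":
        return torch.bfloat16
    if mixed == "fp16":
        return torch.float16
    if mixed is not None:
        # Fail early on mixed-precision states this sketch does not model.
        raise ValueError(f"Unhandled ACCELERATE_MIXED_PRECISION value: {mixed!r}")

    # Env var unset: infer from the loaded model's parameter dtypes.
    param_dtypes = {p.dtype for p in model.parameters() if p.is_floating_point()}
    if param_dtypes == {torch.bfloat16}:
        return torch.bfloat16
    if param_dtypes == {torch.float16}:
        return torch.float16
    # Unknown or mixed parameter dtypes fail early instead of silently
    # falling through to fp16.
    raise ValueError(f"Cannot infer autocast dtype from parameters: {param_dtypes}")
```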

Validation

  • .venv/bin/pytest tests/unit/test_unsloth_autocast_dtype.py -q
  • Sky 2x H200 smoke test: Llama 3.1 + PipelineTrainer + dedicated LocalBackend, with load_in_4bit=False, load_in_16bit=True, and ACCELERATE_MIXED_PRECISION unset; reached step 2 and reloaded adapters for steps 1 and 2.

vivekkalyan requested a review from arcticfly on April 27, 2026 at 21:27
@arcticfly (Collaborator) left a comment


Thanks @vivekkalyan!

vivekkalyan merged commit 5cfe180 into main on April 27, 2026
4 checks passed
vivekkalyan deleted the fix/infer-unsloth-autocast-dtype branch on April 27, 2026 at 22:11