Fix GRU hidden=None handling for biased new-gate path by ssmall256 · Pull Request #3250 · ml-explore/mlx

ssmall256 · 2026-03-13T22:09:35Z

Proposed changes

Fix nn.GRU so hidden=None is treated the same as an explicit zero initial state.

Previously, the hidden=None path skipped the hidden-side new-gate contribution at the first timestep, including bhn, so gru(x) could differ from gru(x, hidden=zeros) when bias=True.

This change initializes a zero hidden state before the loop and adds a regression test covering batched and unbatched inputs.

Closes #3249

Tests

PYTHONPATH=python pytest python/tests/test_nn.py

Checklist

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

ssmall256 · 2026-03-13T22:16:14Z

I checked prior MLX issues and PRs for duplicates before opening this. I did not find an existing report or fix for this specific hidden=None / first-step bhn behavior. The closest prior GRU-related change I found was #952, but that patch addressed a different GRU bug in the hidden-state update and did not change the hidden=None first-timestep path.

angeloskath · 2026-03-16T18:09:38Z

Hi @ssmall256 , thanks for the fix! I hope it's ok but I will merge #3252 instead because it is a bit more concise and efficient. The efficiency comes from avoiding all the computations with 0s when None is passed.

Thanks again for the fix!

Fix GRU handling of hidden=None

28b75a9

angeloskath closed this Mar 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix GRU hidden=None handling for biased new-gate path#3250

Fix GRU hidden=None handling for biased new-gate path#3250
ssmall256 wants to merge 1 commit intoml-explore:mainfrom
ssmall256:fix-gru-hidden-none-bias

ssmall256 commented Mar 13, 2026

Uh oh!

ssmall256 commented Mar 13, 2026 •

edited

Loading

Uh oh!

angeloskath commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ssmall256 commented Mar 13, 2026

Proposed changes

Tests

Checklist

Uh oh!

ssmall256 commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

angeloskath commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ssmall256 commented Mar 13, 2026 •

edited

Loading