Fix batch_size=1 token_buffer truncation in sampler #963
Open
Cascoopman wants to merge 1 commit into google:main from
Conversation
Fixes google#809

When using the generic sampler with batch_size=1, the token_buffer was being truncated instead of properly populated during the while_loop. This was caused by using Python integers for decoding_step indexing, which can cause JAX tracing issues.

Changes:
- Initialize decoding_step as jnp.int32() in init_sample_state
- Add explicit jnp.asarray() conversion in _sample() and _sample_step() to ensure consistent JAX tracing behavior
- Add test cases for batch_size=1 to prevent regression

The Gemma-specific sampler already had this fix at line 440, but the generic sampler was missing it. This fix aligns the generic sampler with the Gemma sampler implementation.
Summary
Fixes #809
When using the generic sampler with `batch_size=1`, the `token_buffer` was being truncated instead of properly populated during the `while_loop`. This made the sampler unusable when `TRAIN_MICRO_BATCH_SIZE=1`, yet it worked fine for larger batches.

Root Cause
The issue was caused by using Python integers for `decoding_step` indexing inside `jax.lax.while_loop`. When JAX traces through a `while_loop`, using Python integers for dynamic indexing can cause issues with how the traced computation is specialized. With `batch_size=1`, JAX may incorrectly optimize or trace the array update operations (`.at[:, decoding_step + 1].set(...)`).

The Gemma-specific sampler (`tunix/tunix/models/gemma/sampler.py`) already had this fix at line 440, but the generic sampler was missing this conversion.
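To illustrate the pattern, here is a minimal sketch of a decode loop in the spirit of the sampler, not the actual tunix code: the carried step index is initialized with `jnp.asarray(..., dtype=jnp.int32)` so that indexing inside the `while_loop` is always a traced int32 rather than a raw Python int. The "sampled" token is a stand-in (just `step + 1`) and all names are illustrative.

```python
import jax
import jax.numpy as jnp

def decode(num_input_tokens: int, total_len: int = 8) -> jnp.ndarray:
    # Toy stand-in for the sampler's token buffer (batch_size=1).
    token_buffer = jnp.zeros((1, total_len), dtype=jnp.int32)
    # The fix: carry decoding_step as a traced jnp.int32 rather than a
    # Python int, so the dynamic index stays consistent under tracing.
    decoding_step = jnp.asarray(num_input_tokens - 1, dtype=jnp.int32)

    def cond_fn(state):
        step, _ = state
        return step < total_len - 1

    def body_fn(state):
        step, buf = state
        next_token = step + 1  # stand-in for a real sampling step
        buf = buf.at[:, step + 1].set(next_token)
        return step + 1, buf

    _, buffer = jax.lax.while_loop(
        cond_fn, body_fn, (decoding_step, token_buffer)
    )
    return buffer
```

With `num_input_tokens=1`, the loop writes every position after the prompt, so the returned buffer is fully populated rather than truncated.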
Changes

`tunix/generate/sampler.py`:
- Initialize `decoding_step` as `jnp.int32(num_input_tokens - 1)` in `init_sample_state`
- Add explicit `jnp.asarray()` conversion in the `_sample()` method
- Add explicit `jnp.asarray()` conversion in the `_sample_step()` method, with a comment referencing the issue

`tests/generate/sampler_test.py`:
- Add `test_batch_size_one()` test case
- Add `test_batch_size_one_with_echo()` test case

Testing
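The shape of the regression check can be sketched as follows. This is a hedged stand-in, not the actual test code: `fill_buffer` plays the role of the sampler's decode loop, and the assertions mirror what the new tests verify, namely that the buffer comes back fully written at `batch_size=1` just as it does for larger batches.

```python
import jax
import jax.numpy as jnp

def fill_buffer(batch_size: int, total_len: int = 6) -> jnp.ndarray:
    # Toy decode loop: writes position s + 1 on each iteration.
    buf = jnp.zeros((batch_size, total_len), dtype=jnp.int32)
    step = jnp.asarray(0, dtype=jnp.int32)  # traced int32 index (the fix)

    def cond_fn(carry):
        s, _ = carry
        return s < total_len - 1

    def body_fn(carry):
        s, b = carry
        return s + 1, b.at[:, s + 1].set(s + 1)

    _, buf = jax.lax.while_loop(cond_fn, body_fn, (step, buf))
    return buf

# Before the fix, the batch_size=1 path could come back truncated;
# both batch sizes should now yield a fully written buffer.
for bs in (1, 2):
    out = fill_buffer(bs)
    assert out.shape == (bs, 6)
    assert int(out[0, -1]) != 0  # last position was actually written
```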
Related

This fix aligns the generic sampler with the Gemma sampler implementation, which already included this conversion.