[tx] Support `top_k` sampling #680

agolajko · 2025-11-18T02:17:20Z

Implements the top_k part of the sampling API requested in #533

Tests

test_top_k_filtering in test_generator.py: tests the core top_k filtering logic
test_sample_top_k in test_api.py: checks that the API can be called with the top_k parameter
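
For readers unfamiliar with top_k filtering, here is a minimal sort-based sketch of the behavior the filtering test exercises. The helper name `top_k_filter` and the input logits `[1.0, 2.0, 3.0, 4.0, 5.0]` with `k=2` are assumptions chosen to match the assertions quoted in the review below; this is not the code from the PR.

```python
import jax.numpy as jnp


def top_k_filter(logits: jnp.ndarray, k: int) -> jnp.ndarray:
    """Hypothetical helper: keep the k largest logits, set the rest to -inf."""
    if k <= 0 or k >= logits.shape[-1]:
        return logits  # nothing to filter
    # The k-th largest value along the last axis acts as the cutoff.
    threshold = jnp.sort(logits, axis=-1)[..., -k, None]
    # Values tied with the cutoff are kept, so more than k may survive.
    return jnp.where(logits < threshold, -jnp.inf, logits)


logits = jnp.array([1.0, 2.0, 3.0, 4.0, 5.0])  # assumed test input
filtered = top_k_filter(logits, k=2)
expected = jnp.array([-jnp.inf, -jnp.inf, -jnp.inf, 4.0, 5.0])
assert jnp.array_equal(filtered, expected)
```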

Discussed with @pcmoritz on Slack

agolajko · 2025-11-18T02:19:56Z

@gemini-code-assist review

gemini-code-assist

Code Review

This pull request successfully implements top_k sampling, including the core logic and API integration. The changes are well-tested. I've provided a few suggestions to enhance performance and improve the readability and robustness of the tests.

skyrl-tx/tests/tinker/test_api.py

skyrl-tx/tests/tinker/test_engine.py

skyrl-tx/tx/utils/generator.py

gemini-code-assist

Code Review

This pull request implements top_k sampling. The changes look good and include necessary logic in the generation pipeline and new tests. I've provided a few suggestions to improve code maintainability and performance. Specifically, I've suggested refactoring duplicated test code, simplifying assertions, and optimizing the apply_top_k function for better performance.

skyrl-tx/tests/tinker/test_api.py

gemini-code-assist · 2025-11-18T02:22:07Z

skyrl-tx/tests/tinker/test_engine.py

+    # Values below threshold should be -inf
+    assert jnp.isinf(filtered[0]) and filtered[0] < 0
+    assert jnp.isinf(filtered[1]) and filtered[1] < 0
+    assert jnp.isinf(filtered[2]) and filtered[2] < 0
+    # Top 2 values should be unchanged
+    assert filtered[3] == 4.0
+    assert filtered[4] == 5.0


The assertions to check the filtered logits can be simplified by comparing against an expected array. This makes the test more concise and easier to read.

Suggested change

-    # Values below threshold should be -inf
-    assert jnp.isinf(filtered[0]) and filtered[0] < 0
-    assert jnp.isinf(filtered[1]) and filtered[1] < 0
-    assert jnp.isinf(filtered[2]) and filtered[2] < 0
-    # Top 2 values should be unchanged
-    assert filtered[3] == 4.0
-    assert filtered[4] == 5.0
+    expected = jnp.array([-jnp.inf, -jnp.inf, -jnp.inf, 4.0, 5.0])
+    assert jnp.array_equal(filtered, expected)

Ditto above

skyrl-tx/tx/utils/generator.py

agolajko · 2025-12-01T18:25:17Z

@tyler-griggs lmk if you have any comments for top_k

Rebased to most recent main

agolajko · 2025-12-08T23:37:34Z

@pcmoritz @tyler-griggs lmk if you have thoughts, happy to rebase again to most recent before you merge

tyler-griggs

Thanks @agolajko !!

skyrl-tx/tx/tinker/types.py

skyrl-tx/tests/tinker/test_engine.py

tyler-griggs · 2025-12-27T19:57:38Z

skyrl-tx/tests/tinker/test_engine.py

+    # Values below threshold should be -inf
+    assert jnp.isinf(filtered[0]) and filtered[0] < 0
+    assert jnp.isinf(filtered[1]) and filtered[1] < 0
+    assert jnp.isinf(filtered[2]) and filtered[2] < 0
+    # Top 2 values should be unchanged
+    assert filtered[3] == 4.0
+    assert filtered[4] == 5.0


Ditto above

skyrl-tx/tests/tinker/test_api.py

skyrl-tx/tx/utils/generator.py

Resolve merge conflicts to combine:
- top_k sampling support from feature branch
- stop_strings support from main branch

Both features are now available in SamplingParams.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-authored-by: Tyler Griggs <131809874+tyler-griggs@users.noreply.github.com>

pcmoritz · 2025-12-29T19:56:51Z

/gemini review

gemini-code-assist

Code Review

This pull request successfully implements top_k sampling. The changes are well-structured, touching the API, type definitions, and the core generator logic. The implementation of apply_top_k_batch is efficient and JIT-friendly. The new tests in test_api.py and test_generator.py provide good coverage for the new functionality.

I have a few suggestions to improve the assertions in the tests to make them more robust and concise. Overall, this is a solid contribution.

skyrl-tx/tests/tinker/test_api.py

skyrl-tx/tests/utils/test_generator.py

pcmoritz

Thanks a lot for implementing this @agolajko , I implemented @tyler-griggs 's suggestion of a fast path if there is no top_k filtering, and also used jax.lax.top_k so we don't need to do the sorting :)
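
A rough sketch of what a batched filter with a fast path and `jax.lax.top_k` could look like. The function name `apply_top_k_batch_sketch`, the convention that `top_k <= 0` disables filtering for a row, and the static `max_top_k` argument are assumptions for illustration; the merged code may differ, in particular in how it handles ties.

```python
from functools import partial

import jax
import jax.numpy as jnp


@partial(jax.jit, static_argnames=("max_top_k",))
def apply_top_k_batch_sketch(logits, top_k, max_top_k):
    """logits: [B, V]; top_k: [B] ints (<= 0 means no filtering);
    max_top_k: static Python int >= max(top_k), at most V."""
    if max_top_k <= 0:
        return logits  # fast path: no row requests top_k filtering
    # Only the max_top_k largest values per row are needed, no full sort.
    top_values, _ = jax.lax.top_k(logits, max_top_k)  # [B, max_top_k], descending
    # Position of each row's k-th largest value inside top_values.
    idx = jnp.clip(top_k, 1, max_top_k) - 1
    threshold = jnp.take_along_axis(top_values, idx[:, None], axis=-1)  # [B, 1]
    # Rows with top_k <= 0 keep everything; ties with the threshold are kept.
    keep = (logits >= threshold) | (top_k[:, None] <= 0)
    return jnp.where(keep, logits, -jnp.inf)


logits = jnp.array([[1.0, 2.0, 3.0, 4.0, 5.0], [5.0, 4.0, 3.0, 2.0, 1.0]])
out = apply_top_k_batch_sketch(logits, jnp.array([2, 0]), max_top_k=2)
# Row 0 keeps only 4.0 and 5.0; row 1 is returned unchanged.
```

Because `max_top_k` is a static argument here, every new value triggers a fresh compilation.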

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

pcmoritz · 2025-12-29T21:08:54Z

I'll merge this now, but there are some improvements that could be made as a follow up:

Bin max_top_k to avoid JIT recompilation (and we could also restrict top_k to be smaller than some reasonable value)
Do sampling only among the top_k / max_top_k values, this will be more performant but requires reorganizing the code
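
Both follow-ups could be sketched roughly as below. Everything here is an assumption for illustration: the helper names `bin_max_top_k` and `sample_among_top_k`, power-of-two binning, and the cap of 1024 are not taken from the PR.

```python
import jax
import jax.numpy as jnp


def bin_max_top_k(max_top_k: int, cap: int = 1024) -> int:
    """Round up to the next power of two (capped) so that a jitted function
    taking this as a static argument sees only a few distinct values."""
    if max_top_k <= 0:
        return 0
    return min(1 << (max_top_k - 1).bit_length(), cap)


def sample_among_top_k(key, logits, top_k, max_top_k):
    """Sample one token id per row from that row's top_k candidates only.

    logits: [B, V]; top_k: [B] ints with 1 <= top_k <= max_top_k;
    max_top_k: Python int, at most V.
    """
    top_values, top_indices = jax.lax.top_k(logits, max_top_k)  # [B, K]
    # Mask out candidates ranked beyond each row's own top_k.
    rank = jnp.arange(max_top_k)[None, :]
    masked = jnp.where(rank < top_k[:, None], top_values, -jnp.inf)
    # Sample over K candidates instead of the full vocabulary.
    choice = jax.random.categorical(key, masked, axis=-1)  # [B]
    # Map the candidate position back to a vocabulary index.
    return jnp.take_along_axis(top_indices, choice[:, None], axis=-1)[:, 0]


logits = jnp.array([[1.0, 2.0, 3.0, 4.0, 5.0], [5.0, 4.0, 3.0, 2.0, 1.0]])
top_k = jnp.array([1, 2])
tokens = sample_among_top_k(
    jax.random.PRNGKey(0), logits, top_k, max_top_k=bin_max_top_k(2)
)
# Row 0 has top_k=1, so it always yields index 4; row 1 yields 0 or 1.
```

Note that a cap below a requested top_k would silently truncate that request, so the cap would have to be enforced when validating the sampling parameters.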

gemini-code-assist bot reviewed Nov 18, 2025

gemini-code-assist bot reviewed Nov 18, 2025

pcmoritz self-assigned this Nov 18, 2025

pcmoritz added the tx label Nov 18, 2025

tyler-griggs changed the title ~~[tx] top_k re #533~~ [tx] Support top_k sampling Nov 30, 2025

agolajko force-pushed the feature/implement-top-k branch from 9bf5c08 to bbc5056 Compare December 1, 2025 18:23

agolajko added 9 commits December 2, 2025 07:01

changes for top_k

ee5d226

add test

fcbe5ce

added type for top_k

804803f

fixed top_k typo

49dd8af

apply_top_k fix

59728e4

apply_top_k fix again

e8a0f1a

apply_top_k fix again 2

cbe6217

apply_top_k fix again 3

3427278

apply_top_k test_engine

1766982

agolajko force-pushed the feature/implement-top-k branch from bbc5056 to 1766982 Compare December 2, 2025 15:41

sriranganm mentioned this pull request Dec 5, 2025

[tx] Add top_p sampling feature and basic test implementations #742

Closed

tyler-griggs reviewed Dec 27, 2025

pcmoritz and others added 8 commits December 29, 2025 11:01

update

a0a9c85

Update skyrl-tx/tx/tinker/types.py

e3bc932

Co-authored-by: Tyler Griggs <131809874+tyler-griggs@users.noreply.github.com>

update test

c5136d2

Merge branch 'feature/implement-top-k' of github.com:agolajko/SkyRL into feature/implement-top-k

3725109

move test

be7e0e8

update

aae5000

simplify

9dc1c26

gemini-code-assist bot reviewed Dec 29, 2025

black

1e1807a

pcmoritz approved these changes Dec 29, 2025

pcmoritz and others added 6 commits December 29, 2025 12:00

Update skyrl-tx/tests/tinker/test_api.py

22d102c

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update skyrl-tx/tests/utils/test_generator.py

73c5be9

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update skyrl-tx/tests/utils/test_generator.py

d7a4913

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

bladk

a0913ce

Fix ties

42f8612

add test for ties

8a3355f

pcmoritz merged commit 465c3b6 into NovaSky-AI:main Dec 29, 2025
4 checks passed
