
Conversation

@wang2yn84 (Collaborator) commented Oct 13, 2025

The weight mapping currently lives in the model file. As we add more backends or more modes, that file will keep growing. More importantly, the weight mapping should not sit in the model file at all; it is an extra feature we added to the model for RL.

This PR does the following:

  1. Separates the mapping functions into their own file outside of model.py.
  2. Uses a mixin to inject the API into the model, so people can still call llama3_model.to_hf_mapping as before (see the first sketch after this list).
  3. Moves the LoRA config out of the mapping config; it now sits directly in the vLLM config. The mapping config is meant for weight sync from the actor to the rollout model, while the LoRA config is highly customizable and applies to rollout-engine initialization.
  4. Moves MappingConfig out of vllm_sampler into a separate file, together with the mixin setup. If we add more rollout engines such as SGLang, they can reuse the same MappingConfig and mixin class.
  5. Provides the option for users to override the mapping function from outside, e.g. from MaxText (see the second sketch below).
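A minimal sketch of the mixin approach, for illustration only: `MappingMixin`, `_TO_HF_MAPPING`, and the `MappingConfig` field are assumed names, not the exact Tunix API; only `llama3_model.to_hf_mapping` comes from this description.

```python
# mapping.py -- a minimal sketch; names below are assumptions, not the
# exact Tunix API.
import dataclasses


@dataclasses.dataclass
class MappingConfig:
  """Describes weight sync from the actor model to the rollout model."""
  to_hf_mappings: dict[str, str] | None = None


class MappingMixin:
  """Injects the weight-mapping API into a model class.

  The mapping tables live beside this mixin instead of inside model.py,
  so model files stay small as more backends and modes are added.
  """

  # Each model's mapping module supplies the actual table (hypothetical).
  _TO_HF_MAPPING: dict[str, str] = {}

  def to_hf_mapping(self) -> dict[str, str]:
    return dict(self._TO_HF_MAPPING)


# model.py -- the model only inherits the mixin; no mapping table inline.
class Llama3Model(MappingMixin):
  _TO_HF_MAPPING = {
      "layers.{i}.attn.q_proj.kernel":
          "model.layers.{i}.self_attn.q_proj.weight",
  }


llama3_model = Llama3Model()
print(llama3_model.to_hf_mapping())  # callable exactly as before the refactor
```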
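And a sketch of the override hook for an external project such as MaxText; everything here besides the idea of replacing the injected mapping function is a hypothetical illustration building on the classes above.

```python
# maxtext_integration.py -- hypothetical; shows overriding the injected
# mapping function from outside the model file.
def maxtext_to_hf_mapping(self) -> dict[str, str]:
  # Illustrative MaxText-style parameter names only.
  return {
      "decoder.layers.{i}.self_attention.query.kernel":
          "model.layers.{i}.self_attn.q_proj.weight",
  }


# Swap the mixin-provided method on the model class without touching
# model.py; existing instances pick up the override immediately.
Llama3Model.to_hf_mapping = maxtext_to_hf_mapping
```

Keeping the override at the class level means the rollout engine never needs to know which project supplied the mapping.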

Reference

Colab Notebook

Checklist

  • I have added all the necessary unit tests for my change.
  • I have verified that my change does not break existing code and all unit tests pass.
  • I have added all appropriate doc-strings/documentation.
  • My PR is based on the latest changes of the main branch (if unsure, rebase the code).
  • I have signed the Contributor License Agreement.
  • I have followed Contribution Guidelines.

@wang2yn84 mentioned this pull request Oct 13, 2025
copybara-service bot merged commit 4858530 into main Oct 13, 2025
7 checks passed
jxiong21029 pushed a commit to PLAN-Lab/tunix that referenced this pull request Nov 1, 2025
