-
Notifications
You must be signed in to change notification settings - Fork 662
[Cherry-Pick][RL] Support Rollout Routing Replay #5166
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: feature/experimental_feature_20250908
Are you sure you want to change the base?
[Cherry-Pick][RL] Support Rollout Routing Replay #5166
Conversation
|
Thanks for your contribution! |
| moba_attention_config = self.create_moba_attention_config() | ||
| eplb_cfg = self.create_eplb_config() | ||
| routing_replay_config = self.create_routing_repaly_config() | ||
| print("after create {routing_replay_config}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete debug code
|
|
||
| @abstractmethod | ||
| def apply_ep_decode( | ||
| self, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
为啥 ep decode 没加 hook
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
develop 分支的 xpu backend 迁移到哪里了?
Motivation
In RL training tasks, we have enabled recording of routing information during the inference process and directly utilized the experts selected during inference in the training process to alleviate the consistency issue between training and inference.

Modifications
Add
RoutingReplayManagerandRoutingStoreclass to support routing replayRoutingStore: The client used for interacting with Routing Store has two routing storage modes based on File System and RDMA(WIP).RoutingReplayManager: Implement request level routing table management within FastDeploy.Usage or Command
Add a startup switch for route replay:
Accuracy Tests
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.