-
Notifications
You must be signed in to change notification settings - Fork 699
Questions on Multimodal and Agentic RL Support in Slime Roadmap #142
Copy link
Copy link
Open
Description
Hi Slime Team (cc @zhuzilin ),
I have recently studied the Slime framework's source code and was impressed by its elegant and lightweight design. I am from the Xiaohongshu team, and we focus more on large-scale multimodal model and Agentic RL scenarios. I have several questions regarding the future roadmap of Slime and would appreciate your insights:
- Are there any plans to support RL scenarios with large-scale multimodal foundation models?
- Has there been any internal validation or experience with Agentic RL training in Slime?
- Is there a roadmap for scaling to massive-size multimodal models and Agentic RL? Specifically, have you considered design strategies for rollouts addressing long-tail issues in such scenarios?
I
f possible, could you share some information relevant to these points? From an RL framework technology perspective, we are very interested in collaborating and advancing together with the Slime community.
Looking forward to your sharing and suggestions.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels