Questions on Multimodal and Agentic RL Support in Slime Roadmap

Hi Slime Team (cc @zhuzilin ),

I have recently studied the Slime framework's source code and was impressed by its elegant and lightweight design. I am from the Xiaohongshu team, and we focus more on large-scale multimodal model and Agentic RL scenarios. I have several questions regarding the future roadmap of Slime and would appreciate your insights:

1. Are there any plans to support RL scenarios with large-scale multimodal foundation models?
2. Has there been any internal validation or experience with Agentic RL training in Slime?
3. Is there a roadmap for scaling to massive-size multimodal models and Agentic RL? Specifically, have you considered design strategies for rollouts addressing long-tail issues in such scenarios?
I
f possible, could you share some information relevant to these points? From an RL framework technology perspective, we are very interested in collaborating and advancing together with the Slime community.

Looking forward to your sharing and suggestions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions on Multimodal and Agentic RL Support in Slime Roadmap #142

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions on Multimodal and Agentic RL Support in Slime Roadmap #142

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions