@slin000111
Collaborator

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support

PR information

Support GRPO in the web UI.

Experiment results

Paste your experiment results here (if needed).

return gr.update(open=True), gr.update(visible=True)

@classmethod
def train(cls, *args):
Collaborator

Can part of this be reused from the training code?

Collaborator Author

It now reuses llm_train.
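
For readers following the thread, here is a minimal sketch of the kind of reuse discussed here: a shared `train` classmethod on a base tab, with a subclass overriding only what differs. All class, method, and field names below (`BaseTrainTab`, `GrpoTab`, `collect_args`, `fields`, `num_generations`) are hypothetical and are not taken from the actual llm_train code.

```python
class BaseTrainTab:
    """Hypothetical stand-in for the existing training tab."""

    task = 'train'

    @classmethod
    def fields(cls):
        # Names of the UI inputs, in the order their values arrive.
        return ['model', 'dataset']

    @classmethod
    def collect_args(cls, *args):
        # Shared step: pair each UI value with its field name.
        return dict(zip(cls.fields(), args))

    @classmethod
    def train(cls, *args):
        # Shared entry point. `cls` resolves to the subclass when the
        # method is called on it, so overrides are picked up here.
        return {'task': cls.task, **cls.collect_args(*args)}


class GrpoTab(BaseTrainTab):
    """Hypothetical GRPO tab: overrides only what differs."""

    task = 'grpo'

    @classmethod
    def fields(cls):
        # Extend the inherited field list instead of redefining it.
        return super().fields() + ['num_generations']


result = GrpoTab.train('some-model', 'some-dataset', 8)
```

Because `train` is a classmethod that only calls through `cls`, the subclass does not need its own copy of the method.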

},
'rlhf_type': {
'label': {
'zh': '人类对齐算法类型',
Collaborator

For GRPO, isn't this unnecessary now?

Collaborator Author

Deleted.

}
},
'train_stage': {
'label': {
Collaborator

What is this for?

Collaborator Author

Same as rlhf_type above; deleted.

},
'clear_cache': {
'value': {
'zh': '删除训练记录',
Collaborator

Can this inherit from llm_train?

Collaborator Author

It now inherits from llm_train.model.Model.
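
As an illustration of why inheriting helps with locale entries such as `clear_cache`, here is a rough sketch in which a subclass copies the parent's locale dictionary and adds only its own keys. `BaseModel`, `GrpoModel`, the `text` helper, the `num_generations` key, and the English strings are all assumptions made for this example; they are not the real llm_train.model.Model API.

```python
class BaseModel:
    """Hypothetical stand-in for the parent UI component."""

    locale_dict = {
        'model': {'label': {'en': 'Model'}},
        # English text is illustrative only.
        'clear_cache': {'value': {'en': 'Delete training records'}},
    }

    @classmethod
    def text(cls, key, field='label', lang='en'):
        # Look up a localized string for a UI element.
        return cls.locale_dict[key][field][lang]


class GrpoModel(BaseModel):
    """Hypothetical GRPO component that reuses the parent's entries."""

    # Copy the parent's entries, then add only the GRPO-specific one.
    # Building a new dict leaves BaseModel.locale_dict unchanged.
    locale_dict = {
        **BaseModel.locale_dict,
        'num_generations': {'label': {'en': 'Number of generations'}},
    }


inherited = GrpoModel.text('clear_cache', field='value')
added = GrpoModel.text('num_generations')
```

With this layout, shared entries are defined once in the parent, and the GRPO component stays limited to what is specific to it.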

SWIFT_TYPE_KEY = 'swift_type'
DEFAULT_ADAPTER = 'default'

DEFAULT_SYSTEM = ('A conversation between User and Assistant. The user asks a question, and the Assistant solves it. '
Collaborator

Don't put DEFAULT_SYSTEM here; put it in the UI code instead.

Collaborator Author

Removed.

@slin000111 slin000111 merged commit 08c7596 into modelscope:main May 22, 2025
1 of 3 checks passed

3 participants