lxline (Collaborator) commented Jan 23, 2025

PR type

  • New Feature

PR information

Write the detailed information that belongs to this PR.

Experiment results

Paste your experiment results here (if needed).

if not child.terminated:
    self.active_children.append(child)

def collect(self):
Collaborator review comment:

Documentation:
sample.md
采样.md
强化微调.md
reinforce_fine_tuning.md

split_dataset(ds, device_count, dataset_dir)

ts = time.time()
client_sample(server_model, orm, dataset_dir, 0, device_count, output_dir)
Collaborator review comment:

Please also add a shell script for the training process, so that developers can reproduce it easily.

tastelikefeet merged commit 8f0630e into modelscope:main on Feb 8, 2025
2 checks passed
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Feb 10, 2025
…ple_multi_modal

* commit 'a4d751356b36917e8d0e21c9e170418d8f35bd09':
  fix windows url (modelscope#3041)
  MCTS Sampler (modelscope#2967)
  fix docs
  support mistralai/Mistral-Small-24B-Instruct-2501 (modelscope#3030)

# Conflicts:
#	swift/llm/sampling/utils.py
#	swift/plugin/orm.py
#	swift/plugin/prm.py
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Feb 10, 2025
…edding

* commit '646023dcae858f0fa388f7663a217790604339fa':
  Support sample multi modal models (modelscope#3048)
  fix windows url (modelscope#3041)
  MCTS Sampler (modelscope#2967)
  fix docs
  support mistralai/Mistral-Small-24B-Instruct-2501 (modelscope#3030)

# Conflicts:
#	swift/plugin/prm.py

4 participants