support truncation_strategy split & cached_dataset (qwen3.5) by Jintao-Huang · Pull Request #9211 · modelscope/ms-swift

Jintao-Huang · 2026-04-25T14:29:10Z

No description provided.

gemini-code-assist

Code Review

This pull request enables the use of the split truncation strategy in conjunction with cached datasets during pre-training. The changes include documentation updates in both Chinese and English, an updated example script demonstrating the correct workflow, and a modification in sft.py to relax restrictions on multimodal models for this strategy. Feedback was provided to ensure the Chinese documentation is consistent with the English version by adding a warning that the split strategy must be explicitly set during both the dataset export and training phases when using cached datasets.
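For readers unfamiliar with the term, the sketch below shows the general idea behind a split truncation strategy: an over-length token sequence is cut into consecutive pieces of at most the maximum length, so no tokens are dropped. It is only an illustration, assuming plain fixed-size chunking. The function name `split_into_chunks` and its parameters are hypothetical and are not taken from ms-swift, whose actual implementation may differ.

```python
from typing import Iterator, List


def split_into_chunks(input_ids: List[int], max_length: int) -> Iterator[List[int]]:
    """Yield consecutive slices of input_ids, each at most max_length tokens long.

    Hypothetical helper for illustration only; not part of ms-swift.
    """
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    # Step through the sequence in strides of max_length; the last slice may be shorter.
    for start in range(0, len(input_ids), max_length):
        yield input_ids[start:start + max_length]


# A 10-token sample with a 4-token limit becomes three training samples.
tokens = list(range(10))
chunks = list(split_into_chunks(tokens, max_length=4))
```

Because chunk boundaries depend on the maximum length, a cached dataset built this way is tied to the parameters used when it was exported, which is presumably why the review asks for the strategy to be set explicitly at both export and training time.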

Jintao-Huang · 2026-04-25T14:47:33Z

/gemini review

gemini-code-assist

Code Review

This pull request enables compatibility between the split truncation strategy and cached_dataset for pre-training. It updates the documentation to specify that when using this combination, input_ids are stored and consistent parameters must be used across export and training. The changes also include updates to the validation logic in sft.py and the example pre-training script. I have no feedback to provide.

Jintao-Huang added 3 commits April 25, 2026 22:27

support truncation_strategy split & cached_dataset (qwen3.5)

2fe25c3

fix

24edadc

fix

2ba3a37

gemini-code-assist Bot reviewed Apr 25, 2026

Comment thread docs/source/Instruction/Command-line-parameters.md Outdated

tastelikefeet approved these changes Apr 25, 2026

update

df8e839

hjh0119 approved these changes Apr 25, 2026

update

215a3c1

gemini-code-assist Bot reviewed Apr 25, 2026

Jintao-Huang merged commit 68eae8c into modelscope:main Apr 25, 2026
1 of 3 checks passed

Jintao-Huang merged 5 commits into modelscope:main from Jintao-Huang:support_split_cached_dataset

3 participants
