Release v1.5.0 · modelscope/swift

English Version

New features:

Support multi-line inference
Support multi node training
Add benchmarks
Support UI training, start by swift web-ui
Support VLLM inference
Support RLHF(DPO) training

New tuners:

SCEdit, adopted by TongYi Lab, uses lesser memory but produces better performance than LoRA, and can be used to replace ControlNet in a series of scenarios like Pose control/In-Painting/Out-Paining/Label-removing, etc.

New models:

SUS series models
Mixtral-MoE series models
deepseek series models
phi2-3b
cogagent-chat/cogagent-vqa
codegeex2-6b

New datasets:

Datasets used in RLHF:

hh-rlhf
stack-exchange-paired

中文版

SWIFT本月新版本已经发布！

新特性:

支持多行输入推理
支持多卡训练
添加了模型训练的benchmarks
支持界面训练和推理，通过swift web-ui开启
支持VLLM推理
支持RLHF(DPO)训练

新tuners:

SCEdit: 通义实验室自研的优秀U-Net微调框架，显存占用远小于LoRA，效果较LoRA更好，且可以替代实现ControlNet的效果，实现In-Painting/Out-Paining/标签去除/Pose控制等能力

新模型：

SUS系列模型
Mixtral-MoE系列模型
deepseek系列模型
phi2-3b
cogagent-chat/cogagent-vqa
codegeex2-6b

新数据集:

用于RLHF的数据集：
hh-rlhf
stack-exchange-paired

What's Changed

update multi-line input (infer) by @Jintao-Huang in #196
Fix model saving in new format by @tastelikefeet in #198
Fix compatible error by @tastelikefeet in #201
Fix bug 1206 by @Jintao-Huang in #202
fix fp16 & full bug by @Jintao-Huang in #203
Fix qwen-audio inference bug by @Jintao-Huang in #204
Support multi node by @Jintao-Huang in #205
fix typo bug by @Jintao-Huang in #206
Support sus by @Jintao-Huang in #207
Support cpu by @Jintao-Huang in #208
Add Feat: Freeze Parameters, disable_tqdm by @Jintao-Huang in #210
update dataset by @Jintao-Huang in #212
Support lazy_tokenize, preprocess_num_proc by @Jintao-Huang in #211
Support Mixtral MoE by @tastelikefeet in #217
Add benchmark by @Jintao-Huang in #213
support ui training by @tastelikefeet in #219
Fix transformers 4.36 by @Jintao-Huang in #218
Update mixtral-7b-moe by @Jintao-Huang in #221
Compatible with peft>=0.7.0 by @tastelikefeet in #220
fix dtype='fp16' sft bug by @Jintao-Huang in #227
fix generation_config warning by @Jintao-Huang in #224
Fix merge_lora & model_cache_dir bug by @Jintao-Huang in #229
fix lazy_tokenize bug by @Jintao-Huang in #228
Add inference UI and refactor machenism by @tastelikefeet in #230
Support deepseek by @Jintao-Huang in #223
relax version restriction by @tastelikefeet in #232
fix bug 1218 by @Jintao-Huang in #235
support deployment by @Jintao-Huang in #231
update docs by @Jintao-Huang in #238
Refactor some code by @tastelikefeet in #237
fix typo bug by @Jintao-Huang in #239
update readme & phi2-3b by @Jintao-Huang in #241
Fix argument 1220 by @Jintao-Huang in #242
Support CogAgent by @tastelikefeet in #243
fix infer by @tastelikefeet in #244
Support more peft tuners by @tastelikefeet in #245
Fix copying additional files by @tastelikefeet in #247
Add sft for codegeex2 by @tastelikefeet in #248
fix issue #249 by @tastelikefeet in #250
Feat/scedit by @jiangzeyinzi in #253
Update 1228 by @Jintao-Huang in #254
fix unicode error by @tastelikefeet in #259
Update readme for SCEdit by @tastelikefeet in #258
DPO by @tastelikefeet in #255
update self-cognition by @Jintao-Huang in #261
Fix/1229 by @tastelikefeet in #260
fix trainer init by @tastelikefeet in #262
fix bugs by @tastelikefeet in #263
fix import by @tastelikefeet in #265
Fix import by @tastelikefeet in #266
update perf by @Jintao-Huang in #264
fix bug by @tastelikefeet in #267
Support win32 by @tastelikefeet in #268

Full Changelog: v1.4.0...v1.5.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.5.0

English Version

中文版

What's Changed

Contributors