Support agent training, etc. #352

tastelikefeet · 2024-01-30T06:07:59Z

Support the sft_beta argument in DPOTrainer, which may stabilize model generation.
Support AdaLoRA and IA3
Support agent training
Support dataset mixture
Support two new datasets: ms-agent and ms-bench
Support merging LoRA for AnimateDiff
Fix some bugs in web-ui
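
The description does not spell out how sft_beta enters the loss. As a rough sketch, assuming it acts as an interpolation weight between the DPO preference loss and a plain SFT (negative log-likelihood) loss on the chosen response, the combination could look like the following. The function names, the default values, and the exact weighting rule are illustrative assumptions, not the trainer's actual code.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Sigmoid DPO loss for one preference pair, given sequence log-probs."""
    # How much more the policy prefers "chosen" over "rejected" than the reference does.
    logits = ((policy_chosen_logp - policy_rejected_logp)
              - (ref_chosen_logp - ref_rejected_logp))
    x = beta * logits
    # -log(sigmoid(x)), written in a numerically stable form.
    if x > 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


def combined_loss(dpo, sft_nll, sft_beta=0.1):
    """Hypothetical mixing rule: (1 - sft_beta) * DPO loss + sft_beta * SFT loss."""
    if not 0.0 <= sft_beta < 1.0:
        raise ValueError("sft_beta is assumed to lie in [0, 1)")
    return (1.0 - sft_beta) * dpo + sft_beta * sft_nll
```

Under this assumption, sft_beta = 0 reduces to plain DPO, while larger values pull the policy toward the chosen responses, which is one plausible reason the description says the option may stabilize generation.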

swift/llm/agent/models/xverse.py

swift/llm/utils/argument.py

swift/llm/utils/preprocess.py

swift/llm/utils/template.py

…t_module
* commit '12e731a713c3f2d7f8c02038bf477e593e5f1240':
  Removing eos_token when doing inference. (modelscope#351)
  Update internlm2 math (modelscope#349)
  Fix template encode bug (modelscope#348)
  fix yi-vl finetune error (modelscope#347)
  Support yi vl (modelscope#345)
  update codefuse series (modelscope#343)
  Update orion 14b (modelscope#341)
  update default_lr; fix do_sample in vllm (modelscope#336)
  Fix ui (modelscope#335)
# Conflicts:
#   swift/llm/infer.py
#   swift/llm/utils/argument.py
#   swift/llm/utils/template.py
#   swift/llm/utils/utils.py
#   swift/trainers/dpo_trainers.py

yingdachen · 2024-01-31T01:49:08Z

Please also update the PR title and commit message.

…t_module
* commit '0872f43dfe1ef8ef2312c3ee59ae705c9228e807':
  update openbmb sh (modelscope#361)
  fix test_run.py (modelscope#360)
  Fix issue 342 (modelscope#359)
  fix template_encode bug (modelscope#358)
  fix baichuan2 bug (modelscope#357)
  update qwen2 (modelscope#355)
# Conflicts:
#   swift/llm/utils/utils.py

…t_module
* commit '545f17e1d6585fbc0dd0ea45adde06b6c4ade2d0':
  support openbmb minicpm (modelscope#364)
  support dpo cli and add examples controlnet and dreambooth (modelscope#344)
  Fix openbmb model name (modelscope#362)
# Conflicts:
#   README.md
#   README_CN.md

tastelikefeet added 19 commits January 20, 2024 17:53

support ia3 & adalora

f7958dd

fix ui bug

ffc5db2

Merge commit '8a284293b19bd5c1faf945f48a08215231d22d26' into feat/pef…

08d85ff

…t_module
* commit '8a284293b19bd5c1faf945f48a08215231d22d26':
  Support xverse-13b-256k (modelscope#332)
  Feat/update course 0120 (modelscope#331)
  fix doc (modelscope#330)

wip

7f0be7d

update script

bfc1151

wip

88e159d

fix import

82f7fa0

support qwen

a0bc0ce

fix

ceea0b6

wip

43b45e8

fix

3484a10

revert code

24fe767

wip

586dfa6

wip

3a092af

wip

b5c522d

fix

7f66f5c

add supported models

6c64502

fix

8518eba

fix

3d9f898

Jintao-Huang reviewed Jan 30, 2024

tastelikefeet added 5 commits January 30, 2024 20:46

pre-commit passed

7ba25b4

wip

d598705

fix

a9de540

fix dpo trainer

3824dca

tastelikefeet force-pushed the feat/peft_module branch from b3bfd8a to 3824dca January 30, 2024 15:46

tastelikefeet added 4 commits January 30, 2024 23:56

fix

8c37a35

fix

fb5a87f

temp dataset

54ecf9f

Revert "temp dataset"

edd4c33

This reverts commit 54ecf9f.

tastelikefeet added 12 commits January 31, 2024 10:36

Merge commit '82a5ae72a4da990cfa11f7df80f6536f1de9a589' into feat/pef…

b792755

…t_module
* commit '82a5ae72a4da990cfa11f7df80f6536f1de9a589':
  Support internlm xcomposer2 (modelscope#354)
  Support zero3 (modelscope#353)

fix

8b6de8b

fix

19927f8

fix issue-342

21eb7f6

fix

5b85649

fix

2f01af9

fix

d1485eb

fix

3e13971

Revert "Revert "temp dataset""

9b4da18

This reverts commit edd4c33.

Revert "Revert "Revert "temp dataset"""

6359ece

This reverts commit 9b4da18.

pre commit passed

4ab10e1

fix doc

a39078a

Jintao-Huang approved these changes Feb 1, 2024

tastelikefeet added 12 commits February 1, 2024 12:19

fix ui

10de9bc

add agent doc

5568d19

fix doc

f86ade8

pre-commit passed

790f549

fix

8a4957d

fix loss_scale

e82be60

fix

b0e20b2

no message

4e23541

remove useless param

b943ea0

fix bug

5b9e0c1

update shell

aea3382

tastelikefeet changed the title from "[WIP] many features" to "Support agent training, etc." Feb 1, 2024

tastelikefeet merged commit 50eb82b into modelscope:main Feb 1, 2024
2 checks passed

tastelikefeet mentioned this pull request Feb 2, 2024

Can animatediff support merge_lora_and_save? #339

Closed
