
[Model Support] Qwen2 update #849

Merged
merged 1 commit into from
Jun 7, 2024

Conversation

wheresmyhair
Collaborator

@wheresmyhair wheresmyhair commented Jun 7, 2024

Description

Support Qwen2 models.
Only the docs are updated, since the pipeline requirements are the same as for Qwen1.5.

Pipeline Tests

  1. Full-Finetune
  2. LoRA

Known Issue

Note: This isn't a bug in LMFlow, but we will add a logger notification ASAP to warn users when they might trigger it.

If you run LoRA (or any other PEFT tuning that uses the peft library) first and save the model to a directory, say A, and then run another finetuning job that specifies the same output_dir A, the pipeline will fail to update the model card, since Qwen2ForCausalLM doesn't have the attribute .create_or_update_model_card(). This does not affect model saving, however.

Bug logic:

  1. Finetuning with PeftTrainer and saving produces a model card with library_name = 'peft'.
  2. When any subsequent finetune uses the same output_dir, then at
    https://github.com/huggingface/transformers/blob/bdf36dcd48106a4a0278ed7f3cc26cd65ab7b066/src/transformers/trainer.py#L4114, os.path.exists(model_card_filepath) is True, so is_peft_library becomes True.
  3. At https://github.com/huggingface/transformers/blob/bdf36dcd48106a4a0278ed7f3cc26cd65ab7b066/src/transformers/trainer.py#L4143, transformers calls .create_or_update_model_card(), which Qwen2ForCausalLM doesn't have.
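The failure mode above can be sketched with a minimal, self-contained reproduction. This is not LMFlow or transformers code; `save_and_update_card` is a hypothetical stand-in for the trainer's model-card logic, and the `Qwen2ForCausalLM` class here is a bare stub that, like the real class, lacks `create_or_update_model_card()`:

```python
import os
import tempfile

class Qwen2ForCausalLM:
    """Stub standing in for the real model class: no create_or_update_model_card()."""
    pass

def save_and_update_card(model, output_dir):
    """Hypothetical sketch of the trainer's model-card branch described above."""
    card_path = os.path.join(output_dir, "README.md")
    # A model card left behind by a prior PEFT run makes the trainer
    # take the peft branch on the next run in the same output_dir.
    is_peft_library = os.path.exists(card_path)
    if is_peft_library:
        # Raises AttributeError for plain (non-peft) model classes.
        model.create_or_update_model_card(output_dir)
    return is_peft_library

# Fresh output_dir: no stale model card, peft branch not taken, no error.
with tempfile.TemporaryDirectory() as fresh_dir:
    assert save_and_update_card(Qwen2ForCausalLM(), fresh_dir) is False

# Reused output_dir: simulate the card a LoRA run leaves behind.
with tempfile.TemporaryDirectory() as reused_dir:
    with open(os.path.join(reused_dir, "README.md"), "w") as f:
        f.write("---\nlibrary_name: peft\n---\n")
    try:
        save_and_update_card(Qwen2ForCausalLM(), reused_dir)
    except AttributeError as e:
        print("reproduced:", e)
```

Running this prints the AttributeError only in the reused-directory case, which is why a fresh output_dir per job sidesteps the issue.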


We strongly recommend using a different output_dir for every finetuning job to avoid unexpected issues like the one above.

@wheresmyhair wheresmyhair marked this pull request as ready for review June 7, 2024 04:26
Contributor

@research4pan research4pan left a comment


LGTM

@research4pan research4pan merged commit be54124 into main Jun 7, 2024
2 checks passed