模型训练出错 #41

svjack · 2023-06-13T02:11:36Z

使用 textgen/examples/llama/training_llama_demo.py
微调模型：https://huggingface.co/shibing624/chinese-llama-plus-13b-hf
使用示例数据集data/zh_csc_train.tsv
有下面的错误
assertion srcindex < srcselectdimsize failed.

该用模型如：https://huggingface.co/shibing624/chinese-alpaca-plus-13b-hf
则正常。

The text was updated successfully, but these errors were encountered:

shibing624 · 2023-06-13T06:36:04Z

嗯，我留意到了此问题，llama-plus-13b 本地直接预测也会出此错误，alpaca不会。

可能是transformers升级导致的问题，还在排查。

训练13b，可以用其他模型替代，如alpaca-13b, ziya-13b

svjack · 2023-06-13T07:35:05Z

llama model predict 方法感觉对外暴露的(不由default arg指定的)GenerationConfig 参数感觉有点少
**kwargs 应该考虑重载generation_config 会不会更好一些呢？

shibing624 · 2023-06-13T07:42:08Z

有 kwargs: https://github.com/shibing624/textgen/blob/main/textgen/llama/llama_model.py#L519

svjack · 2023-06-13T07:48:03Z

有 kwargs: https://github.com/shibing624/textgen/blob/main/textgen/llama/llama_model.py#L519

像这种参数怎么改呢？

repetition_penalty=self.args.repetition_penalty,
length_penalty=self.args.length_penalty,

svjack · 2023-06-13T07:51:29Z

huggingface/transformers#24104

shibing624 · 2023-06-13T08:28:07Z

这样写：https://github.com/shibing624/textgen/blob/main/examples/llama/training_llama_demo.py#L53 写进model_args 就可以，会自动覆盖默认的参数。

svjack · 2023-06-13T08:30:54Z

这样写：https://github.com/shibing624/textgen/blob/main/examples/llama/training_llama_demo.py#L53 写进model_args 就可以，会自动覆盖默认的参数。

感觉这里面的一些参数不应该在初始化时指定而应该在生成时是动态的

shibing624 · 2023-06-13T09:05:51Z

初始化时指定的是默认的，生成时指定可以覆盖默认的，类似于max_length 参数，其他的参数也会覆盖默认的，这个我加下。

shibing624 · 2023-06-13T09:36:43Z

done

svjack added the bug Something isn't working label Jun 13, 2023

shibing624 closed this as completed in ee420bc Jun 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

模型训练出错 #41

模型训练出错 #41

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

svjack commented Jun 13, 2023

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023 •

edited

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

shibing624 commented Jun 13, 2023

模型训练出错 #41

模型训练出错 #41

Comments

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

svjack commented Jun 13, 2023

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023 • edited

svjack commented Jun 13, 2023

shibing624 commented Jun 13, 2023

shibing624 commented Jun 13, 2023

shibing624 commented Jun 13, 2023 •

edited