Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

可以基于你发布的hf的模型再训练吗? #497

Open
hudaoling opened this issue Jun 4, 2024 · 5 comments
Open

可以基于你发布的hf的模型再训练吗? #497

hudaoling opened this issue Jun 4, 2024 · 5 comments
Labels
question Further information is requested

Comments

@hudaoling
Copy link

你好,请问下,可以基于你发布的模型,只用自己的数据再训练吗?
image

1w条样本领域内的样本句子,进行数据增强(替换谐音词,英文单词增删改字母),生成了11w增强样本你,
采用这种方式,训练下来感觉train样本纠错还勉强能看,迁移到测试集上以后就比较差,好纠结啊。

@hudaoling hudaoling added the question Further information is requested label Jun 4, 2024
@shibing624
Copy link
Owner

可以再训练;建议融合我的训练集从头训练。

@Jamie2898
Copy link

Jamie2898 commented Jun 25, 2024

如果从头微调macbert,是不是直接把train_macbert4csc.yml文件里的BERT_CKPT改为hfl/chinese-macbert-base就行?
另外MacBERT的输入长度限制是512,对吗?

@shibing624
Copy link
Owner

可以

@Jamie2898
Copy link

谢谢。另外,找了一些公开数据集,有的case没有错误,也就json中没有wrong_ids,这种数据放进训练集模型可以跑吗?会对模型效果产生负面影响吗?

@shibing624
Copy link
Owner

可以放

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants