请问finetune时应如何设置token type id? #48

AOZMH · 2020-08-09T09:19:31Z

在Bert中若处理输入为两个句子的相关任务（例如语义相似性打分等），常使用token_type_embedding对两个句子分别加上不同的embedding；这一做法只需要在transformers的API中设置token_type_id（一个句子全为0，另一个全为1）即可实现。
然而，electra的预训练好像取消了NSP任务，相应的也就没有训练这个句子embedding（抱歉我不是很确定，只是看了一下论文好像没写这一点😂），所以我想请教一下使用token_type_id这一做法是否在electra中也可以通用呢？如果不行，对于两个句子的输入，推荐的处理方法是什么呢？谢谢！

ymcui · 2020-08-09T09:31:38Z

TF版：https://github.com/ymcui/Chinese-ELECTRA/blob/master/finetune/classification/classification_tasks.py#L65
PT版：https://huggingface.co/transformers/model_doc/electra.html#electraforsequenceclassification

见TF版，实际上与BERT没有什么差别，还是要设置token_type_id，PT版详见transformers库中的API。

AOZMH · 2020-08-09T09:34:48Z

明白谢谢回复！不过有一点好奇这个token_type_id对应的embedding是在哪个阶段预训练的呢？

ymcui · 2020-08-09T09:42:08Z

阅读这里的预训练代码：https://github.com/ymcui/Chinese-ELECTRA/blob/master/build_pretraining_dataset.py#L57
事实上是有两个segment的。

AOZMH · 2020-08-10T00:08:22Z

明白，谢谢帮助！

AOZMH closed this as completed Aug 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

请问finetune时应如何设置token type id? #48

请问finetune时应如何设置token type id? #48

AOZMH commented Aug 9, 2020

ymcui commented Aug 9, 2020

AOZMH commented Aug 9, 2020

ymcui commented Aug 9, 2020

AOZMH commented Aug 10, 2020

请问finetune时应如何设置token type id? #48

请问finetune时应如何设置token type id? #48

Comments

AOZMH commented Aug 9, 2020

ymcui commented Aug 9, 2020

AOZMH commented Aug 9, 2020

ymcui commented Aug 9, 2020

AOZMH commented Aug 10, 2020