Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] 论文中数据增强方法的一些疑惑? #43

Closed
wangyuxinwhy opened this issue Feb 9, 2021 · 2 comments
Closed

[Question] 论文中数据增强方法的一些疑惑? #43

wangyuxinwhy opened this issue Feb 9, 2021 · 2 comments
Labels

Comments

@wangyuxinwhy
Copy link

论文中数据增强在小样本数据集上取得了大幅的性能提升,但是数据增强具体的方案没有细谈。我根据论文中的描述,不知道按照以下理解是否正确:

论文中的数据增强是指,通过添加 训练数据集外 相似的文本数据,让教师网络和学生网络在这些样本中通过 中间层 hidden or att 的匹配 loss 进行学习,而忽略掉 logits,因此数据增强这部分其实不需要任何标签。

感谢 HFL 能一直提供如此之多的高质量开源项目,实实在在的为中文 NLP 带来了巨大的积极影响!

@airaria
Copy link
Owner

airaria commented Feb 10, 2021

差不多是这样,
不过中间层的hidden or att 以及最后的logits都是用的,
通过teacher给增强数据打logits标签,可以参见我在这里的回复
##37 (comment)

@stale
Copy link

stale bot commented Feb 15, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Feb 15, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants