feat(transformers): Add DeBERTa model implementation to transformers … #5414
base: develop
Thanks for your contribution!
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## develop #5414 +/- ##
===========================================
+ Coverage 54.61% 63.20% +8.59%
===========================================
Files 480 512 +32
Lines 67855 72473 +4618
===========================================
+ Hits 37056 45807 +8751
+ Misses 30799 26666 -4133
debertav2 -> deberta_v2
Please add the modeling and Tokenizer implementations, and support from_pretrained calls through AutoModel and AutoTokenizer.
@@ -0,0 +1,431 @@
# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.
Documentation is missing. Please add instructions for running this script and the concrete experimental results.
dev_ds_matched = dev_ds_matched.map(trans_func, lazy=True)
dev_ds_mismatched = dev_ds_mismatched.map(trans_func, lazy=True)
dev_batch_sampler_matched = paddle.io.BatchSampler(
    dev_ds_matched, batch_size=self.args.per_device_eval_batch_size * 2, shuffle=False
Why is the batch_size here self.args.per_device_eval_batch_size * 2?
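For context on the question above, here is a minimal sketch of what doubling the evaluation batch size changes. It is plain Python rather than the PR's `paddle.io.BatchSampler` call, and `iter_batches`, the dataset size, and the batch size are hypothetical values chosen for illustration. With `shuffle=False` the examples and their order stay the same. Only the number of batches drops, and each batch takes more memory.

```python
def iter_batches(items, batch_size):
    """Yield consecutive slices of `items` in order, like a sampler with shuffle=False."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


# Hypothetical values, not taken from the PR.
per_device_eval_batch_size = 32
dev_examples = list(range(1000))

base = list(iter_batches(dev_examples, per_device_eval_batch_size))
doubled = list(iter_batches(dev_examples, per_device_eval_batch_size * 2))

# Same examples in the same order, roughly half as many batches.
print(len(base), len(doubled))
```

If the doubling is deliberate (evaluation keeps no gradients, so larger batches often fit), a short comment in the script would answer the question. Otherwise the factor can be dropped so the argument means what its name says.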
@@ -0,0 +1,13 @@
# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.
Please add unit tests for the deberta tokenizer, plus the related unit tests for deberta_v2.
PRETRAINED_VOCAB_FILES_MAP = {
    "vocab_file": {
This key should be sentencepiece_model_file, not vocab_file.
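A minimal sketch of why the key name matters, assuming the tokenizer looks up each pretrained file under the same key it uses for the constructor argument. Everything here is hypothetical and is not the PR's code or the library's API: `RESOURCE_FILES_NAMES`, `resolve_resource_files`, the model name, and the URL are illustrative only.

```python
# Hypothetical names, model name, and URL; an illustration, not the PR's code.
RESOURCE_FILES_NAMES = {"sentencepiece_model_file": "spm.model"}

PRETRAINED_RESOURCE_FILES_MAP = {
    "sentencepiece_model_file": {
        "example-deberta-v2": "https://example.com/example-deberta-v2/spm.model",
    },
}


def resolve_resource_files(model_name, files_names, files_map):
    """Return {constructor_argument: location} for one pretrained model name.

    Raises KeyError if a key declared in `files_names` is missing from `files_map`,
    which is what happens when the map is keyed by "vocab_file" instead.
    """
    resolved = {}
    for key in files_names:
        if key not in files_map:
            raise KeyError(f"no pretrained files registered under {key!r}")
        resolved[key] = files_map[key][model_name]
    return resolved


print(resolve_resource_files("example-deberta-v2", RESOURCE_FILES_NAMES, PRETRAINED_RESOURCE_FILES_MAP))
```

With the map keyed by `"vocab_file"`, the same call raises `KeyError`, because the SentencePiece model is never found under the name the tokenizer asks for.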
This Pull Request is stale because it has been open for 60 days with no activity.
…library
PR types
PR changes
Description