feat(transformers): Add DeBERTa model implementation to transformers … #5414

Zhiyuan-Fan · 2023-03-25T05:59:42Z

…library

PR types

PR changes

Description

…library

paddle-bot · 2023-03-25T05:59:47Z

Thanks for your contribution!

codecov · 2023-03-25T06:31:17Z

Codecov Report

Attention: Patch coverage is 65.82880% with 503 lines in your changes are missing coverage. Please review.

Project coverage is 63.20%. Comparing base (f54e80d) to head (b6f945a).
Report is 1268 commits behind head on develop.

❗ Current head b6f945a differs from pull request most recent head 2741703. Consider uploading reports for the commit 2741703 to get more accurate results

Files	Patch %	Lines
paddlenlp/transformers/deberta_v2/modeling.py	66.50%	205 Missing ⚠️
paddlenlp/transformers/deberta/modeling.py	78.44%	125 Missing ⚠️
paddlenlp/transformers/deberta/tokenizer.py	22.53%	110 Missing ⚠️
paddlenlp/transformers/deberta/converter.py	0.00%	31 Missing ⚠️
paddlenlp/transformers/deberta_v2/converter.py	0.00%	31 Missing ⚠️
paddlenlp/transformers/deberta_v2/configuration.py	97.36%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #5414      +/-   ##
===========================================
+ Coverage    54.61%   63.20%   +8.59%     
===========================================
  Files          480      512      +32     
  Lines        67855    72473    +4618     
===========================================
+ Hits         37056    45807    +8751     
+ Misses       30799    26666    -4133

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

linjieccc

devertav2 -> deverta_v2

需要补充下modeling和Tokenizer的实现，并且支持AutoModel及AutoTokenizer的from_pretrained调用

linjieccc · 2023-05-26T05:43:27Z

examples/language_model/deberta/glue/run_glue.py

@@ -0,0 +1,431 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


缺少文档，需要补充执行方式及具体的实验结果

linjieccc · 2023-05-26T05:44:20Z

examples/language_model/deberta/glue/run_glue.py

+            dev_ds_matched = dev_ds_matched.map(trans_func, lazy=True)
+            dev_ds_mismatched = dev_ds_mismatched.map(trans_func, lazy=True)
+            dev_batch_sampler_matched = paddle.io.BatchSampler(
+                dev_ds_matched, batch_size=self.args.per_device_eval_batch_size * 2, shuffle=False


这里的batch_size为什么是self.args.per_device_eval_batch_size * 2？

linjieccc · 2023-05-26T05:47:13Z

tests/transformers/deberta/test_tokenizer.py

@@ -0,0 +1,13 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


补充deberta tokenizer单测及deberta_v2的相关单测

sijunhe · 2023-06-05T06:37:56Z

paddlenlp/transformers/deberta_v2/tokenizer.py

+
+
+PRETRAINED_VOCAB_FILES_MAP = {
+    "vocab_file": {


这个key应该是sentencepiece_model_file, 不是vocab_file

github-actions · 2023-08-05T00:16:31Z

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动，被标记为stale。

github-actions · 2023-10-29T00:16:38Z

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动，被标记为stale。

github-actions · 2024-05-08T00:15:36Z

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动，被标记为stale。

feat(transformers): Add DeBERTa model implementation to transformers …

4761b27

…library

paddle-bot bot added contributor status: proposed labels Mar 25, 2023

Zhiyuan-Fan added 9 commits April 3, 2023 18:51

feat: change package

933cd4c

feat: model full

4912206

feat: model test

503d3f7

fix: import error

ded924c

fix: config bug

5a5fe46

fix: modify list

68108b2

fix: module bug

deab7a1

fix: bugs

eb5c752

feat: add training code

799e523

Ligoml mentioned this pull request May 23, 2023

【PaddlePaddle Hackathon 第四期】任务总览 PaddlePaddle/Paddle#51281

Closed

linjieccc reviewed May 26, 2023

View reviewed changes

Zhiyuan-Fan added 13 commits June 1, 2023 13:34

feat: add config

871672a

add position embeddings

a38cb3d

feat test

fb1e194

feat embeddings

9305fce

feat embeddings

682ea30

feat embeddings

13f82bc

feat embeddings

a02d9fe

feat embeddings

255da72

fix bugs

6e7d0cb

add tokenizer test

6155e89

modify tokenizer test

7cacc92

modify tokenizer test

8204aba

modify tokenizer testing

3d70e57

Zhiyuan-Fan added 3 commits June 3, 2023 22:04

v2 complete testing

ea93234

v2 complete testing

b07467f

v2 complete testing

b6f945a

Zhiyuan-Fan requested review from sijunhe and linjieccc June 3, 2023 17:03

v2 tokenizer

48f7720

Zhiyuan-Fan marked this pull request as draft June 4, 2023 00:47

Zhiyuan-Fan marked this pull request as ready for review June 4, 2023 00:48

Zhiyuan-Fan added 8 commits June 4, 2023 10:46

v1 tokenizer

17261bb

offset mapping

5a9af9a

v2 tokenizer test

f46b41f

fix

4d489d2

fix

261783e

fix

d1c6ed5

fix

5aee411

tokenizer v2 spiece

f1c1ad7

sijunhe reviewed Jun 5, 2023

View reviewed changes

Zhiyuan-Fan added 2 commits June 5, 2023 15:34

fix

a82a992

key words

2741703

github-actions bot added the stale label Aug 5, 2023

sijunhe removed the status: proposed label Aug 28, 2023

github-actions bot removed the stale label Aug 29, 2023

github-actions bot added the stale label Oct 29, 2023

paddle-bot bot assigned wawltor Mar 8, 2024

github-actions bot removed the stale label Mar 9, 2024

w5688414 mentioned this pull request Apr 3, 2024

Add DeBERTa model #8227

Merged

github-actions bot added the stale label May 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(transformers): Add DeBERTa model implementation to transformers … #5414

feat(transformers): Add DeBERTa model implementation to transformers … #5414

Zhiyuan-Fan commented Mar 25, 2023

paddle-bot bot commented Mar 25, 2023

codecov bot commented Mar 25, 2023 •

edited

linjieccc left a comment

linjieccc May 26, 2023

linjieccc May 26, 2023

linjieccc May 26, 2023

sijunhe Jun 5, 2023

github-actions bot commented Aug 5, 2023

github-actions bot commented Oct 29, 2023

github-actions bot commented May 8, 2024

		@@ -0,0 +1,431 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.

		@@ -0,0 +1,13 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.



		PRETRAINED_VOCAB_FILES_MAP = {
		"vocab_file": {

feat(transformers): Add DeBERTa model implementation to transformers … #5414

Are you sure you want to change the base?

feat(transformers): Add DeBERTa model implementation to transformers … #5414

Conversation

Zhiyuan-Fan commented Mar 25, 2023

PR types

PR changes

Description

paddle-bot bot commented Mar 25, 2023

codecov bot commented Mar 25, 2023 • edited

Codecov Report

linjieccc left a comment

Choose a reason for hiding this comment

linjieccc May 26, 2023

Choose a reason for hiding this comment

linjieccc May 26, 2023

Choose a reason for hiding this comment

linjieccc May 26, 2023

Choose a reason for hiding this comment

sijunhe Jun 5, 2023

Choose a reason for hiding this comment

github-actions bot commented Aug 5, 2023

github-actions bot commented Oct 29, 2023

github-actions bot commented May 8, 2024

codecov bot commented Mar 25, 2023 •

edited