[utc] add small scale models #4716

Merged: 2 commits, Feb 10, 2023

Conversation

LemonNoel
Contributor

@LemonNoel LemonNoel commented Feb 9, 2023

PR types

Others

PR changes

Others

Description

  • Add small-scale UTC models, including utc-xbase, utc-base, utc-medium, utc-micro, utc-mini, utc-nano and utc-pico.
  • Update relevant documents.
  • Fix the vocab.txt of utc-large.

@paddle-bot

paddle-bot bot commented Feb 9, 2023

Thanks for your contribution!

@LemonNoel LemonNoel self-assigned this Feb 9, 2023
"utc-pico": {
"task_class": ZeroShotTextClassificationTask,
"task_flag": "zero_shot_text_classification-utc-pico",
},
},
"default": {"model": "utc-large"},
Contributor Author

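To stay compatible with the examples used in the promotional materials, the default model remains utc-large for now; everywhere else has been changed to utc-base.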

Collaborator

Does the user have to specify the model? Are users even aware of utc-large? If not, the default can be switched.

Contributor Author

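The default model does not require any user input. On the same cases, the small models give results that differ from those of utc-large.

TODO: construct new examples based on uie-base to replace the current ones.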
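
The default-model behavior discussed in this thread can be sketched as follows. This is a minimal, self-contained illustration and not the actual PaddleNLP code: the `REGISTRY` dict, the `resolve_model` helper, and the stand-in `ZeroShotTextClassificationTask` class are hypothetical, and the `task_flag` values for models other than utc-pico are assumed to follow the same naming pattern as the one shown in the diff.

```python
# Hypothetical stand-in for the real task class, which lives in PaddleNLP.
class ZeroShotTextClassificationTask:
    pass


# Model names taken from the PR description, plus the existing utc-large.
MODEL_NAMES = [
    "utc-large",
    "utc-xbase",
    "utc-base",
    "utc-medium",
    "utc-micro",
    "utc-mini",
    "utc-nano",
    "utc-pico",
]

# Assumed registry layout, modeled on the fragment shown in the diff.
REGISTRY = {
    "zero_shot_text_classification": {
        "models": {
            name: {
                "task_class": ZeroShotTextClassificationTask,
                # Assumption: every flag follows the utc-pico pattern.
                "task_flag": f"zero_shot_text_classification-{name}",
            }
            for name in MODEL_NAMES
        },
        # The default stays utc-large, as stated in the thread.
        "default": {"model": "utc-large"},
    }
}


def resolve_model(task, model=None):
    """Return (model_name, model_config), using the default when model is None."""
    entry = REGISTRY[task]
    name = model if model is not None else entry["default"]["model"]
    if name not in entry["models"]:
        raise ValueError(f"Unknown model {name!r} for task {task!r}")
    return name, entry["models"][name]


if __name__ == "__main__":
    # No model given: the default entry decides.
    print(resolve_model("zero_shot_text_classification")[0])
    # Explicit choice of one of the new small models.
    print(resolve_model("zero_shot_text_classification", "utc-pico")[1]["task_flag"])
```

Under this layout, switching the default to a smaller model would be a one-line change to the `"default"` entry, and callers that pass no model would pick it up automatically. That is why the differing results between the small models and utc-large matter for the existing examples.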

@codecov

codecov bot commented Feb 9, 2023

Codecov Report

Merging #4716 (fed92fe) into develop (3ab7c61) will increase coverage by 0.28%.
The diff coverage is 100.00%.

@@             Coverage Diff             @@
##           develop    #4716      +/-   ##
===========================================
+ Coverage    42.97%   43.25%   +0.28%     
===========================================
  Files          434      435       +1     
  Lines        61956    61966      +10     
===========================================
+ Hits         26626    26805     +179     
+ Misses       35330    35161     -169     
Impacted Files                                         Coverage Δ
paddlenlp/taskflow/taskflow.py                         76.74% <ø> (ø)
paddlenlp/transformers/ernie/configuration.py          100.00% <ø> (ø)
paddlenlp/transformers/ernie/tokenizer.py              41.93% <ø> (ø)
...addlenlp/taskflow/zero_shot_text_classification.py  18.25% <100.00%> (ø)
paddlenlp/utils/downloader.py                          62.55% <0.00%> (-0.45%) ⬇️
paddlenlp/transformers/__init__.py                     100.00% <0.00%> (ø)
paddlenlp/transformers/bert/modeling.py                90.28% <0.00%> (ø)
...addlenlp/transformers/chinesebert/configuration.py  100.00% <0.00%> (ø)
...addlenlp/experimental/autonlp/auto_trainer_base.py  89.51% <0.00%> (+0.44%) ⬆️
...dlenlp/experimental/autonlp/text_classification.py  98.12% <0.00%> (+1.97%) ⬆️
... and 2 more

Collaborator

@ZHUI ZHUI left a comment

LGTM

@LemonNoel LemonNoel merged commit 9505bc8 into PaddlePaddle:develop Feb 10, 2023
@LemonNoel LemonNoel deleted the utc-series branch February 10, 2023 06:18
2 participants