-
Notifications
You must be signed in to change notification settings - Fork 26.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BartForCausalLM analogs to ProphetNetForCausalLM
#9128
Conversation
Hy @sadakmed, let me know if you need help on the issue or if you don't find the time to tackle it. I'll then just make it open to the "public" again :-) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's great to see progress here! Good job. Note that BartForCausalLM
should have the exact same functionality in terms of computing the loss as the BartForConditionalGeneration
model => this means we can copy the way BartForConditionalGeneration
computes the loss into BartForCausalLM
.
Let me know if you need help or are stuck :-)
@patrickvonplaten The loss function it what I stuck on, thank you very much for your guidance.
for sure I will ;-) |
Hey @sadakmed, Do you have an update on the PR? It's been three weeks now and it would be great to merge this soon. Sorry, we're very fast-moving in this lib and other community contributors have started asking for this feature. By next week, I'll probably have to take a look myself or redistribute the issue. |
Hi @patrickvonplaten my apologies, Could you Please see it now, lemme know if anything is missing. |
Hi @patrickvonplaten, working on the test 'BartStandaloneCausalLM': I dont know if the |
Hey @sadakmed, Thanks a lot for you additions here :-) |
of course, I'm working on it, |
Hi @patrickvonplaten, I just pushed the test, could you please check it out! |
@patrickvonplaten could you check please! |
UPDATE: @LysandreJik @sgugger |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is fantastic work! Thanks a lot!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Impressive work, great work @sadakmed and @patrickvonplaten!
Very cool that you completed the templates as well.
There seems to be a few issues with the "copied by" scripts in the templates, but the test still passes, which is weird.
...ecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
Outdated
Show resolved
Hide resolved
...ecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
Show resolved
Hide resolved
...ecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
Outdated
Show resolved
Hide resolved
...ecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
Outdated
Show resolved
Hide resolved
Great job @sadakmed |
wouldn't happen without you, thank you very much. see u in the next PR ;) |
What does this PR do?
Implementing BartForCausalLM anologs for ProphetNetForCausalLM
Fixes #9066