add pegasus support #11

HenryDashwood · 2020-09-04T17:55:26Z

Pegasus has the same parameter groups in Bart so the splitter works for both.

When Pegasus decodes it leaves these <n> symbols in, so I added a line the take them out.

review-notebook-app · 2020-09-04T17:55:30Z

Check out this pull request on

Review Jupyter notebook visual diffs & provide feedback on notebooks.

Powered by ReviewNB

HenryDashwood · 2020-09-04T18:02:51Z

See #10

ohmeow · 2020-09-05T20:45:29Z

Thanks @HenryDashwood. Going to work thru this after the fastcore PR. Was going to start looking at Pegasus support this weekend after adding in tests for the question answering bits but I'm glad to see you beat me to it :)

ohmeow · 2020-09-08T21:28:52Z

Hey @HenryDashwood ... how are you testing this with Pegasus? That model is a beast! Training on colab right now but can only get away with a batch size = 2 and a max_length=256 (for the text to summarize).

HenryDashwood · 2020-09-08T22:10:25Z

Yeah it’s a bit of a nightmare to work with. My advice, if you haven’t already seen it and need more memory, is https://datacrunch.io/products/

I’ve moved all the GPU work I do in and outside of work over to there since it’s shockingly cheap compared to Amazon who we were using before https://aws.amazon.com/sagemaker/pricing/

Obviously not free though. Let me know if you have any tests you would like run and I’ll see what I can do!

ohmeow · 2020-09-09T17:51:43Z

Ah, never heard of it ... pricing looks nice.

You guys just using it for training I assume? Are you just running everything on the instance itself or are you training through something like Docker? Back in the old days I remember it being easiest just spinning up an EC2 instance, running a bash script that did all my installs, and training directly on it ... anyways, curious to know how you all are using it. Thanks for the info.

btw, added T5 and Pegasus support to library now. I couldn't run any of my tests for Pegasus cuz I'm running everything on my local 1080 TI. Anyhow, if you check the docs you'll see how I have the tests set up ... if you get a chance, yah, it would be great if you could test Pegasus and lmk how things go. I have tests for both the data and modeling so curious to know if the tokenization is right for the architecture and if it trains :). Lmk if you can. Thanks.

HenryDashwood · 2020-09-09T18:08:54Z

Will do!

Re DataCrunch. Yeah we only use it for training. All our models still get served on cpus.

They actually offer a one click to start up Jupyter notebook image with things like Fastai preinstalled. That's how I came across them. However I prefer to spin up a plain Ubuntu image, ssh in, run a shell script to set it up the way I like, and then do all my work through the terminal and VSCode. VSCode's notebook and ssh support is amazing now and DataCrunch is what finally gave me a reason to mould my workflow around it. I'm thinking of writing a blog or making a video about it because it's so much better than anything I've seen anywhere else.

ohmeow · 2020-09-10T18:19:50Z

Yah go for it with the blog post/video. I've been switching between using VSCode myself and pure Jupyter. I'm a .NET and Rails developer by trade, and even though I work on a Mac, I'm able to use VSCode for just about everything outside of managing/working with our SQL Servers (haven't found anything better that SQL Server Mgmt. Studio on windows for that). Anyways, thanks for the reply. Always interested to see how folks are training these models which are getting bigger and bigger it seems.

…

-wg

On Wed, Sep 9, 2020 at 11:09 AM Henry Dashwood ***@***.***> wrote: Will do! Re DataCrunch. Yeah we only use it for training. All our models still get served on cpus. They actually offer a one click to start up Jupyter notebook image with things like Fastai preinstalled. That's how I came across them. However I prefer to spin up a plain Ubuntu image, ssh in, and then do all my work through the terminal and VSCode. VSCode's notebook and ssh support is amazing now and DataCrunch is what finally gave me a reason to mould my workflow around it. I'm thinking of writing a blog or making a video about it because it's so much better than anything I've seen anywhere else. — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#11 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAADNMB42GBVLTPQQ72OAMLSE7AELANCNFSM4QZAW5BQ> .

add pegasus support

2b465fc

HenryDashwood mentioned this pull request Sep 4, 2020

Latest fastcore/fastai breaks blurr #10

Closed

Merge branch 'master' into master

77b4e5e

ohmeow merged commit 13452dc into ohmeow:master Sep 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add pegasus support #11

add pegasus support #11

HenryDashwood commented Sep 4, 2020

review-notebook-app bot commented Sep 4, 2020

HenryDashwood commented Sep 4, 2020

ohmeow commented Sep 5, 2020

ohmeow commented Sep 8, 2020

HenryDashwood commented Sep 8, 2020

ohmeow commented Sep 9, 2020 •

edited

HenryDashwood commented Sep 9, 2020 •

edited

ohmeow commented Sep 10, 2020 via email

add pegasus support #11

add pegasus support #11

Conversation

HenryDashwood commented Sep 4, 2020

review-notebook-app bot commented Sep 4, 2020

HenryDashwood commented Sep 4, 2020

ohmeow commented Sep 5, 2020

ohmeow commented Sep 8, 2020

HenryDashwood commented Sep 8, 2020

ohmeow commented Sep 9, 2020 • edited

HenryDashwood commented Sep 9, 2020 • edited

ohmeow commented Sep 10, 2020 via email

ohmeow commented Sep 9, 2020 •

edited

HenryDashwood commented Sep 9, 2020 •

edited