
A major refactor to sacrifice some performance for flexibility and simplicity #304

zhuzilin opened this issue Feb 28, 2022 · 1 comment

@zhuzilin (Collaborator)

Currently, PatrickStar can train the largest pretrained models with the lowest hardware requirements, i.e. GPU and CPU memory. However, this comes at a price: the design is specialized for the vanilla BERT and GPT structures and the Adam optimizer. This makes it hard for users to apply PatrickStar to their latest research projects, and hard for us to handle the edge cases needed for compatibility with popular NLP repos.

Therefore, we have decided to refactor PatrickStar to make it simple and flexible. After all, rather than breaking records, we would prefer to build a handy tool for the NLP community :)

Here are some of the changes we are making now:

  • Stop reusing the parameter chunks for gradients: This will increase memory usage, but it lets PatrickStar support more network structures.
  • Stop managing the optimizer states with chunks: This should allow us to use PyTorch native or third-party optimizers directly (see the sketch after this list).
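To illustrate what the second item aims to enable, here is a minimal sketch in plain PyTorch. The PatrickStar wrapping is omitted, and how the refactored API will expose the model's parameters is an assumption on our side, not the final design; the point is only that, once optimizer states are no longer chunk-managed, any `torch.optim` or third-party optimizer can be constructed on the parameters as usual:

```python
import torch

# `model` stands in for any network wrapped by PatrickStar; the exact way the
# refactored API exposes it is an assumption here, not the finalized design.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
)

# With optimizer states no longer managed by chunks, a native PyTorch
# optimizer (or a third-party one) can be plugged in directly.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=0.01)

for step in range(10):
    x = torch.randn(8, 1024)
    loss = model(x).pow(2).mean()
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```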

We will try to make the new design as performant and efficient as the old one. However, if what you need is the extreme performance reported in the paper, please refer to release v0.4.6.

@zhuzilin pinned this issue Feb 28, 2022
@Jack47 commented Feb 28, 2022

Sounds cool, we really need these two features!
