Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[doc] performance and parallelism updates #14391

Merged
merged 3 commits into from
Nov 15, 2021

Conversation

stas00
Copy link
Contributor

@stas00 stas00 commented Nov 14, 2021

This PR:

  • updates the performance doc to break down all the memory used by the model (+ mentioned bitsandbytes which saves 3/4 optim memory)
  • updates the parallelism doc to introduce Varuna and expand on Sagemaker model parallelism solutions - both published a paper just recently

@sgugger

Copy link
Collaborator

@sgugger sgugger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for this update!

docs/source/parallelism.md Outdated Show resolved Hide resolved
@stas00 stas00 merged commit 29dfb2d into huggingface:master Nov 15, 2021
@stas00 stas00 deleted the parallel-doc-update branch November 15, 2021 01:19
Albertobegue pushed a commit to Albertobegue/transformers that referenced this pull request Jan 27, 2022
* [doc] performance and parallelism doc update

* improve

* improve
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants