-
Notifications
You must be signed in to change notification settings - Fork 689
Fix Cuda out of memory issue for eager runner #6866
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix Cuda out of memory issue for eager runner #6866
Conversation
This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6866
Note: Links to docs will display an error until the docs builds have been completed. ❗ 2 Active SEVsThere are 2 currently active SEVs. If your PR is affected, please view them below:
✅ No FailuresAs of commit 5c9dbfb with merge base e95f171 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D65962743 |
This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D65962743 |
This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D65962743 |
This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D65962743 |
This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D65962743 |
3f0c204
into
gh/helunwencser/76/base
Pull Request resolved: #6866 This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. ghstack-source-id: 254139542 Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/) Co-authored-by: Lunwen He <lwhecser@gmail.com>
Pull Request resolved: pytorch/executorch#6866 This PR updates the eager runner to disable grad and save memory usage. It also update the prompt format to not include bos. ghstack-source-id: 253958277 Differential Revision: [D65962743](https://our.internmc.facebook.com/intern/diff/D65962743/)
Stack from ghstack (oldest at bottom):
This PR updates the eager runner to disable grad and save memory usage.
It also update the prompt format to not include bos.
Differential Revision: D65962743