
Conversation

@kwen2501 (Contributor) commented on Aug 23, 2024:

Stack from ghstack (oldest at bottom):

Reverts #130998 because FakeTensor + real device suffice to work around the autocast issue in HF.
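Roughly, the device dependence in question looks like the sketch below (a paraphrase for illustration, not the actual HF source; rope_like is a made-up stand-in): autocast keys off the device type of the tensors it sees, so tracing needs tensors that report a real device rather than "meta".

import torch

def rope_like(x: torch.Tensor) -> torch.Tensor:
    # HF's rotary-embedding code disables autocast around its computation,
    # keyed on the device type of the input tensor.
    with torch.autocast(device_type=x.device.type, enabled=False):
        return x.float().cos()

rope_like(torch.randn(4, device="cpu"))     # fine: "cpu" is a real device type
# rope_like(torch.empty(4, device="meta"))  # would likely fail: "meta" is not an accepted autocast device type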

cc @XilunWu @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o

@pytorch-bot (bot) commented on Aug 23, 2024:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134299

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit c568145 with merge base e000cf0:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot bot added the oncall: distributed label on Aug 23, 2024
kwen2501 added a commit that referenced this pull request Aug 23, 2024
ghstack-source-id: f08adf7
Pull Request resolved: #134299
kwen2501 requested a review from lessw2020 on August 23, 2024 01:03
@lessw2020 (Contributor) left a comment:

The only issue with going back is that HF may break now, since they still have autocast in their rope embeddings? That said, it's probably better if this is fixed in export directly.
Otherwise, LGTM!

@kwen2501 (Contributor Author):

It is okay if HF still has autocast in its rope embeddings.
Users just need to do this before tracing:

# Cast the model to FakeTensor with a real device (moving it off the meta device),
# because there is autocast code in llama. Autocast dispatches based on the device
# of the tensor, so we need to give it a real device instead of the meta device.
with fake_mode:
    llama.to_empty(device="cuda")
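For reference, a fuller self-contained sketch of that setup; the FakeTensorMode construction, the HF config/model loading, and the checkpoint name are illustrative assumptions, not code from this PR:

import torch
from torch._subclasses.fake_tensor import FakeTensorMode
from transformers import AutoConfig, AutoModelForCausalLM

# Build the model on the meta device so no real weights are allocated.
config = AutoConfig.from_pretrained("meta-llama/Llama-2-7b-hf")  # illustrative checkpoint
with torch.device("meta"):
    llama = AutoModelForCausalLM.from_config(config)

# Re-materialize the parameters as FakeTensors that report a real device
# ("cuda"), so device-dependent code paths such as autocast see a real
# device type; no GPU memory is actually allocated under fake_mode.
# allow_non_fake_inputs may or may not be needed depending on the torch version.
fake_mode = FakeTensorMode(allow_non_fake_inputs=True)
with fake_mode:
    llama.to_empty(device="cuda")

# llama can now be traced/exported as usual.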

@kwen2501 (Contributor Author):

Okay, I see what you mean, @lessw2020.
I will hold the merge until we figure out a preference.

@kwen2501 (Contributor Author) commented on Sep 4, 2024:

@pytorchbot merge

pytorch-bot bot added the ciflow/trunk label on Sep 4, 2024
@pytorchmergebot (Collaborator):

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


kwen2501 added the topic: not user facing label on Sep 4, 2024
@kwen2501 (Contributor Author) commented on Sep 4, 2024:

@pytorchbot merge

@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced debugging: check the merge workflow status here.

@kwen2501 (Contributor Author) commented on Sep 4, 2024:

@pytorchbot --help

@pytorch-bot (bot) commented on Sep 4, 2024:

PyTorchBot Help

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci,cherry-pick,close} ...

In order to invoke the bot on your PR, include a line that starts with
@pytorchbot anywhere in a comment. That line will form the command; no
multi-line commands are allowed. Some commands may be used on issues as specified below.

Example:
    Some extra context, blah blah, wow this PR looks awesome

    @pytorchbot merge

optional arguments:
  -h, --help            Show this help message and exit.

command:
  {merge,revert,rebase,label,drci,cherry-pick,close}
    merge               Merge a PR
    revert              Revert a PR
    rebase              Rebase a PR
    label               Add label to a PR
    drci                Update Dr. CI
    cherry-pick         Cherry pick a PR onto a release branch
    close               Close a PR

Merge

usage: @pytorchbot merge [-f MESSAGE | -i] [-ic] [-r [{viable/strict,main}]]

Merge an accepted PR, subject to the rules in .github/merge_rules.json.
By default, this will wait for all required checks (lint, pull) to succeed before merging.

optional arguments:
  -f MESSAGE, --force MESSAGE
                        Merge without checking anything. This requires a reason for auditing purposes, for example:
                        @pytorchbot merge -f 'Minor update to fix lint. Expecting all PR tests to pass'
                        
                        Please use `-f` as last resort, prefer `--ignore-current` to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.
  -i, --ignore-current  Merge while ignoring the currently failing jobs.  Behaves like -f if there are no pending jobs.
  -ic                   Old flag for --ignore-current. Deprecated in favor of -i.
  -r [{viable/strict,main}], --rebase [{viable/strict,main}]
                        Rebase the PR to re run checks before merging.  Accepts viable/strict or main as branch options and will default to viable/strict if not specified.

Revert

usage: @pytorchbot revert -m MESSAGE -c
                          {nosignal,ignoredsignal,landrace,weird,ghfirst}

Revert a merged PR. This requires that you are a Meta employee.

Example:
  @pytorchbot revert -m="This is breaking tests on trunk. hud.pytorch.org/" -c=nosignal

optional arguments:
  -m MESSAGE, --message MESSAGE
                        The reason you are reverting, will be put in the commit message. Must be longer than 3 words.
  -c {nosignal,ignoredsignal,landrace,weird,ghfirst}, --classification {nosignal,ignoredsignal,landrace,weird,ghfirst}
                        A machine-friendly classification of the revert reason.

Rebase

usage: @pytorchbot rebase [-s | -b BRANCH]

Rebase a PR. Rebasing defaults to the stable viable/strict branch of pytorch.
Repeat contributors may use this command to rebase their PRs.

optional arguments:
  -s, --stable          [DEPRECATED] Rebase onto viable/strict
  -b BRANCH, --branch BRANCH
                        Branch you would like to rebase to

Label

usage: @pytorchbot label labels [labels ...]

Adds label to a PR or Issue [Can be used on Issues]

positional arguments:
  labels  Labels to add to given Pull Request or Issue [Can be used on Issues]

Dr CI

usage: @pytorchbot drci 

Update Dr. CI. Updates the Dr. CI comment on the PR in case it's gotten out of sync with actual CI results.

cherry-pick

usage: @pytorchbot cherry-pick --onto ONTO [--fixes FIXES] -c
                               {regression,critical,fixnewfeature,docs,release}

Cherry pick a pull request onto a release branch for inclusion in a release

optional arguments:
  --onto ONTO           Branch you would like to cherry pick onto (Example: release/2.1)
  --fixes FIXES         Link to the issue that your PR fixes (Example: https://github.com/pytorch/pytorch/issues/110666)
  -c {regression,critical,fixnewfeature,docs,release}, --classification {regression,critical,fixnewfeature,docs,release}
                        A machine-friendly classification of the cherry-pick reason.

Close

usage: @pytorchbot close

Close a PR [Can be used on issues]

@kwen2501 (Contributor Author) commented on Sep 4, 2024:

@pytorchbot rebase -s

@pytorchmergebot (Collaborator):

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot (Collaborator):

Successfully rebased gh/kwen2501/49/orig onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/134299)

pytorchmergebot pushed a commit that referenced this pull request Sep 4, 2024
ghstack-source-id: 408cd1b
Pull Request resolved: #134299
@kwen2501 (Contributor Author) commented on Sep 4, 2024:

@pytorchbot merge

@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced debugging: check the merge workflow status here.

Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Sep 20, 2024
Reverts pytorch#130998 because FakeTensor + real device suffice to work around the autocast issue in HF.

Pull Request resolved: pytorch#134299
Approved by: https://github.com/lessw2020
github-actions bot deleted the gh/kwen2501/49/head branch on October 5, 2024 02:06

Labels: ciflow/trunk, Merged, oncall: distributed, topic: not user facing
