Unify llama generate utils by guangyey · Pull Request #3318 · pytorch/ao

guangyey · 2025-11-10T06:11:27Z

Motivation

Unify llama generate utils via torch.accelerator APIs.

cc @albanD

pytorch-bot · 2025-11-10T06:11:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3318

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Some B200 runners are down due to network issues

✅ You can merge normally! (1 Unrelated Failure)

As of commit b7156ec with merge base 415e0e8:

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Run Regression Tests / test-nightly (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://downloa... / linux-job (gh) (trunk failure)
test/dtypes/test_nf4.py::TestComm::test_comm

This comment was automatically generated by Dr. CI and updates every 15 minutes.

albanD

Sounds good to me, but I'll let an ao maintainer accept the PR.

jerryzh168

Looks good, as long as it works.

jerryzh168 · 2025-11-11T21:48:48Z

torchao/_models/llama/generate.py

-    if torch.xpu.is_available()
-    else "cpu"
-)
+default_device = acc.type if (acc := torch.accelerator.current_accelerator(True)) else "cpu"


Maybe spell out the argument of current_accelerator by keyword to be clearer: https://docs.pytorch.org/docs/stable/generated/torch.accelerator.current_accelerator.html

@jerryzh168 Done.
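
For readers following the thread, the suggestion amounts to passing the flag by keyword rather than as a bare `True`. A minimal sketch, assuming a PyTorch version where `torch.accelerator.current_accelerator` accepts a `check_available` keyword; the helper name `pick_default_device` is hypothetical and is not part of this PR:

```python
def pick_default_device() -> str:
    """Return the accelerator type (e.g. "cuda" or "xpu"), or "cpu" as a fallback.

    Hypothetical helper for illustration only; not the code merged in this PR.
    """
    try:
        import torch

        # check_available=True asks PyTorch to return None when the accelerator
        # is not usable at runtime (assumption: a PyTorch release that supports
        # this keyword).
        acc = torch.accelerator.current_accelerator(check_available=True)
    except (ImportError, AttributeError, TypeError):
        # torch is not installed, or is too old to provide torch.accelerator
        # or the check_available keyword.
        return "cpu"
    return acc.type if acc is not None else "cpu"


default_device = pick_default_device()
print(default_device)
```

Naming the keyword makes it clear at the call site that the boolean controls a runtime availability check, which a positional `True` does not convey.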

guangyey · 2025-11-18T07:05:54Z

@pytorchbot label "topic: improvement"

guangyey · 2025-11-18T07:10:06Z

The CI workflow needs to be triggered.

guangyey · 2025-12-19T03:10:14Z

@jerryzh168 I have already rebased this PR onto the latest main. Could you help re-trigger the CI?

guangyey · 2025-12-23T09:24:11Z

@pytorchbot merge

pytorchmergebot · 2025-12-23T09:24:40Z

Merge failed

Reason: 1 mandatory check(s) are pending/not yet run. The first few are:

Facebook CLA Check

Dig deeper by viewing the pending checks on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: superuser

jerryzh168 · 2025-12-23T18:26:10Z

@guangyey We can't use pytorchbot to merge PRs in this repo. We just click the merge button.

pytorch-bot · 2025-12-24T02:05:32Z

To add the ciflow label ciflow/xpu please first approve the workflows that are awaiting approval (scroll to the bottom of this page).

This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.

pytorch-bot · 2025-12-24T02:06:46Z

To add the ciflow label ciflow/xpu please first approve the workflows that are awaiting approval (scroll to the bottom of this page).

This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.

guangyey · 2025-12-24T02:26:34Z

Thanks, @jerryzh168. I’ll reach out to the Intel team to re-trigger the CI and land the PR once all checks are green.

guangyey · 2025-12-29T05:36:42Z

The failure is unrelated to this PR. The same failure appears in https://github.com/pytorch/ao/actions/runs/20529844203/job/58979473220, from #3548, which has already landed.

File "/opt/conda/envs/venv/lib/python3.10/site-packages/torchao/dtypes/nf4tensor.py", line 940, in __torch_dispatch__
    raise NotImplementedError(
NotImplementedError: NF4Tensor dispatch: attempting to run _c10d_functional._wrap_tensor_autograd.default, this is not supported

To execute this test, run the following from the base repo dir:
    python test/dtypes/test_nf4.py TestComm.test_comm

liangan1 · 2025-12-29T05:39:04Z

Double-checked: the unit test failure is not related to this PR. Merged.

guangyey · 2025-12-29T05:39:46Z

Thanks for your help!

meta-cla bot added the CLA Signed label Nov 10, 2025

albanD reviewed Nov 11, 2025

jerryzh168 approved these changes Nov 11, 2025

jerryzh168 reviewed Nov 11, 2025

guangyey force-pushed the guangyey/1st branch from 57478ea to 09e0994 on November 12, 2025 01:57

pytorch-bot bot added the topic: improvement label Nov 18, 2025

guangyey force-pushed the guangyey/1st branch from 09e0994 to 85bf902 on November 18, 2025 07:07

guangyey force-pushed the guangyey/1st branch from 85bf902 to 6dd4c33 on December 19, 2025 03:08

guangyey force-pushed the guangyey/1st branch from 6dd4c33 to 810e31a on December 19, 2025 04:31

pytorchmergebot added the merging label Dec 23, 2025

pytorchmergebot removed the merging label Dec 23, 2025

guangyey force-pushed the guangyey/1st branch from 810e31a to 1e180ae on December 24, 2025 02:01

xiaowangintel added the ciflow/xpu label Dec 24, 2025

pytorch-bot bot removed the ciflow/xpu label Dec 24, 2025

xiaowangintel added the ciflow/xpu label Dec 24, 2025

pytorch-bot bot removed the ciflow/xpu label Dec 24, 2025

guangyey force-pushed the guangyey/1st branch from 06471e4 to b7156ec on December 29, 2025 02:04

liangan1 merged commit f981d92 into pytorch:main Dec 29, 2025
20 of 21 checks passed

Unify llama generate utils

b7156ec

6 participants
