Qualcomm AI Engine Direct - Fixed the order of the transforms for llama #5221
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5221
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit dfd80b0 with merge base 657789e.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @cccclai,

I have a small request. I would like to add a document that explains how to export llama with Qualcomm AI Engine Direct, including steps like downloading SpinQuant and setting num_sharding. However, I'm not sure where the best place to save this file would be. Could you please advise?
Force-pushed from 999a992 to 0e23b45 (Compare)
```python
if self._generate_full_logits:
    return torch.cat(result_logits, dim=1)
else:
    return torch.stack(result_logits, dim=1)
```
hmm what's the difference between these?
Because the shape of the function output should be (batch, seq, vocab_size).
If _generate_full_logits, the shape of each result logit in result_logits is (batch, seq, vocab_size), so we can just use cat along dim=1.
If not _generate_full_logits, the shape of each result logit in result_logits is (batch, vocab_size), so we need to use stack to get one more dimension and produce (batch, seq, vocab_size).
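A quick shape check illustrating the difference (batch, sequence, and vocab sizes here are made up for illustration, not taken from the model):

```python
import torch

batch, vocab_size = 1, 32000

# Full-logits case: each chunk already has shape (batch, seq, vocab_size),
# so cat along dim=1 joins the sequence dimension.
full = [torch.randn(batch, 8, vocab_size), torch.randn(batch, 8, vocab_size)]
print(torch.cat(full, dim=1).shape)    # torch.Size([1, 16, 32000])

# Last-token-only case: each chunk has shape (batch, vocab_size),
# so stack along dim=1 adds the missing seq dimension.
last = [torch.randn(batch, vocab_size), torch.randn(batch, vocab_size)]
print(torch.stack(last, dim=1).shape)  # torch.Size([1, 2, 32000])
```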
cccclai left a comment:
Looks good
CI needs to be fixed.
Force-pushed from 807d763 to dfd80b0 (Compare)
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
We need to apply the R1 and R2 rotations before converting linear to conv.
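A minimal sketch of why the order matters; the pass names below are hypothetical placeholders, not the actual executorch/Qualcomm APIs:

```python
import torch.nn as nn

def apply_spin_quant_r1_r2(model: nn.Module) -> nn.Module:
    # Hypothetical: would fold the R1/R2 rotation matrices into nn.Linear weights.
    return model

def convert_linear_to_conv2d(model: nn.Module) -> nn.Module:
    # Hypothetical: would replace each nn.Linear with an equivalent 1x1 nn.Conv2d.
    return model

# Order matters: the R1/R2 rotations match on nn.Linear modules, so they must
# run before the Linear-to-Conv rewrite, otherwise there are no Linear modules
# left to rotate.
transforms = [apply_spin_quant_r1_r2, convert_linear_to_conv2d]

model = nn.Linear(16, 16)
for t in transforms:
    model = t(model)
```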