add eval for attention sink #7150

pytorchbot · 2024-12-02T23:16:21Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #7070
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/helunwencser/80/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/helunwencser/80/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/helunwencser/78/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/helunwencser/80/orig
@diff-train-skip-merge

Pull Request resolved: #7070 This PR adds the function to evaluate the model's perplexity when AttentionSink is enabled. This is mostly copied from https://github.com/mit-han-lab/streaming-llm/blob/main/examples/eval_long_ppl.py which is used by the AttentionSink paper to evaluate the model's perplexity when AttentionSink is enabled. ghstack-source-id: 256108079 @exported-using-ghexport Differential Revision: [D66474732](https://our.internmc.facebook.com/intern/diff/D66474732/)

pytorch-bot · 2024-12-02T23:16:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7150

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b6df0a9 with merge base ddec0c7 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

This reverts commit 5cd90a1.

* Transform model to be able to use Attention Sink Pull Request resolved: #6700 This PR adds necessary functions for transforming the model to be able to use Attention Sink. ghstack-source-id: 256108077 @exported-using-ghexport Differential Revision: [D65571289](https://our.internmc.facebook.com/intern/diff/D65571289/) * Update eager runner to support AttentionSink Pull Request resolved: #6921 This PR updates the eager runner to support AttentionSink. It also fixes issues in the `chat_completion` function to properly handle the position id. ghstack-source-id: 256108078 Differential Revision: [D66076486](https://our.internmc.facebook.com/intern/diff/D66076486/) * add eval for attention sink (#7150) Pull Request resolved: #7070 This PR adds the function to evaluate the model's perplexity when AttentionSink is enabled. This is mostly copied from https://github.com/mit-han-lab/streaming-llm/blob/main/examples/eval_long_ppl.py which is used by the AttentionSink paper to evaluate the model's perplexity when AttentionSink is enabled. ghstack-source-id: 256108079 @exported-using-ghexport Differential Revision: [D66474732](https://our.internmc.facebook.com/intern/diff/D66474732/) Co-authored-by: Lunwen He <lwhecser@gmail.com> --------- Co-authored-by: Lunwen He <lwhecser@gmail.com>

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 2, 2024

helunwencser merged commit 5cd90a1 into gh/helunwencser/78/orig Dec 3, 2024
40 checks passed

helunwencser deleted the gh/helunwencser/80/orig branch December 3, 2024 01:24

helunwencser added a commit that referenced this pull request Dec 3, 2024

Revert "add eval for attention sink (#7150)"

704cd3b

This reverts commit 5cd90a1.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add eval for attention sink #7150

add eval for attention sink #7150

Uh oh!

pytorchbot commented Dec 2, 2024

Uh oh!

pytorch-bot bot commented Dec 2, 2024 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add eval for attention sink #7150

add eval for attention sink #7150

Uh oh!

Conversation

pytorchbot commented Dec 2, 2024

Uh oh!

pytorch-bot bot commented Dec 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7150

✅ No Failures

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Dec 2, 2024 •

edited

Loading