[llava][16/N] Extract out prefill logic into a new class #4585
Conversation
Depending on whether parallel or sequential prefill is chosen, `prefill()` calls `TextDecoderRunner.step()` to prefill the prompt tokens into the LLM.

Differential Revision: [D60927756](https://our.internmc.facebook.com/intern/diff/D60927756)

[ghstack-poisoned]
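To make the described split concrete, here is a minimal sketch of how such a prefiller class could dispatch between the two strategies. The `TextPrefiller` name, the `TextDecoderRunner` stand-in interface, and all signatures below are assumptions for illustration, not the PR's verbatim code:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Stand-in for the existing TextDecoderRunner (assumed shape): step()
// runs the decoder over `tokens` starting at `start_pos`, updates the
// KV cache, and returns the next predicted token.
class TextDecoderRunner {
 public:
  virtual ~TextDecoderRunner() = default;
  virtual uint64_t step(
      const std::vector<uint64_t>& tokens,
      int64_t start_pos) = 0;
};

// Hypothetical prefiller class extracted out of the runner. Names and
// signatures here are illustrative assumptions, not the PR's actual API.
class TextPrefiller {
 public:
  TextPrefiller(TextDecoderRunner* runner, bool parallel_prefill)
      : runner_(runner), parallel_prefill_(parallel_prefill) {}

  // Feeds the prompt tokens to the LLM and returns the first token
  // generated after the prompt.
  uint64_t prefill(
      const std::vector<uint64_t>& prompt_tokens,
      int64_t start_pos) {
    if (parallel_prefill_) {
      // Parallel prefill: a single step() over the whole prompt fills
      // the KV cache for every position in one forward pass.
      return runner_->step(prompt_tokens, start_pos);
    }
    // Sequential prefill: one step() per token, advancing the position
    // each time; slower, but works when the exported model only accepts
    // one token per forward pass.
    uint64_t cur_token = 0;
    for (std::size_t i = 0; i < prompt_tokens.size(); ++i) {
      cur_token = runner_->step(
          {prompt_tokens[i]}, start_pos + static_cast<int64_t>(i));
    }
    return cur_token;
  }

 private:
  TextDecoderRunner* runner_;
  bool parallel_prefill_;
};
```

Keeping both strategies behind one `prefill()` entry point lets the caller opt into parallel prefill when the exported model supports multi-token input, and fall back to token-by-token prefill otherwise.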
🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4585
Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures as of commit c1d970b with merge base 92edd04.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
ghstack-source-id: 1af03e2
Pull Request resolved: pytorch/executorch#4585
Stack from ghstack (oldest at bottom):
Depending on whether parallel or sequential prefill is chosen, `prefill()` calls `TextDecoderRunner.step()` to prefill the prompt tokens into the LLM.

Differential Revision: D60927756