Adding autocomplete to vllm model.py #20
Conversation
LGTM, other than the ask to make the stream input optional.
Force-pushed from 11d5c96 to 70b4547.
@tanmayv25 should we merge this?
I was unable to get a successful run using this change with Olga's pipeline. I am in the middle of investigating what is not working.
It seems there was a setup problem. Tested successfully with job id 73883232.
Does autocomplete work? It doesn't seem to be called anywhere in the code besides where the method is defined.
The auto-complete feature is currently available only on Triton's main branch, which tracks development for a future release. The auto-complete functionality will be available with the 23.11 release, which is scheduled for the end of the month.
Moving default parameters from config to the auto_complete function. Also tests the Python backend's set_model_transaction_policy.