Add vllm chat completion endpoints and fix inspect_history #811

noobHappylife · 2024-04-13T04:58:56Z

Related to issue #778 #784

added vllm chat completion endpoints (support system_prompt)
fixed an issue with .copy() with vllm client module. (need to include model, url, port into kwargs)
fix inspect_histrory not respecting n/skip as mentioned in Inspect_history only shows the last history regardless of n/skip #784

arnavsinghvi11 · 2024-04-15T02:24:25Z

dsp/modules/hf_client.py

        self.kwargs |= kwargs
+        # kwargs needs to have model, port and url for the lm.copy() to work properly
+        self.kwargs.update({
+            'model': model,


model is in fact included in the kwargs already, inherited through HFModel-> LM so we can remove this.

is there a use case for including the port and URL in the LM kwargs? seems like they are only used for setting the URL(s) and not in the model payload? feel free to remove if not.

Noted on the model part. For url and port, basically is when calling lm.copy() (ex: while compiling a program with bootstrap), it is missing the url and port kwargs. As shown as below. Thus, requiring the port and url in the kwargs for the copy() to work properly.

in BootstrapFewShot._bootstrap_one_example(self, example, round_idx) 148 with dsp.settings.context(trace=[], **self.teacher_settings): 149 lm = dsp.settings.lm --> 150 lm = lm.copy(temperature=0.7 + 0.001 * round_idx) if round_idx > 0 else lm 151 new_settings = dict(lm=lm) if round_idx > 0 else {} 153 with dsp.settings.context(**new_settings): File [~/micromamba/envs/yh_hf/lib/python3.10/site-packages/dsp/modules/lm.py:108](http://192.168.77.2:8892/lab/tree/~/micromamba/envs/yh_hf/lib/python3.10/site-packages/dsp/modules/lm.py#line=107), in LM.copy(self, **kwargs) 105 kwargs = {**self.kwargs, **kwargs} 106 model = kwargs.pop("model") --> 108 return self.__class__(model=model, **kwargs) TypeError: HFClientVLLM.__init__() missing 1 required positional argument: 'port'

arnavsinghvi11 · 2024-04-15T02:27:29Z

dsp/modules/hf_client.py

-

+        # get model_type
+        model_type = kwargs.get("model_type",None)


let's actually move this to the initialization layer so the user can specify the model_type when configuring the vLLM client, not during generation time.

OK, will include into the initialization layer.

arnavsinghvi11 · 2024-04-15T02:30:17Z

dsp/modules/lm.py

-        for idx, (prompt, choices) in enumerate(reversed(printed)):
-            printing_value = ""
-
+        for idx, (prompt, choices) in enumerate(printed):


is there a reason for this change? seems to have the same effect originally?

currently failing these tests

I've included some of my testing with the inspect_history in the issue #784. Can you take a look, or I'm misunderstanding how it should works?

Ah thanks for highlighting. Just realized that this is a bug where the printing_value is getting reset everytime in the loop. That should be all the error is and the skip condition is fine as is.

Thank you. Let me know what else is needed to change. I've updated to the vllm initialization part, and fix the error, so the test should pass now.

yeah can you revert the changes made to inspect_history here and just move the printing_value outside the for loop?

ok, updated and tested it works. Thank you. Now inspect_history is able to view n-number of histories with skip.

good catch folks!

thanks for it!

arnavsinghvi11 · 2024-04-15T02:37:47Z

Thanks @noobHappylife for the additions to vLLM. left a couple comments and should be good to merge following those comments and running ruff check . --fix-only .

arnavsinghvi11 · 2024-04-16T17:50:25Z

Thanks @noobHappylife !

added vllm chat completion endpoints and fix inspect_history

0b7289f

arnavsinghvi11 requested changes Apr 15, 2024

View reviewed changes

update vllm initializtion and fix inspect history

ca4efef

noobHappylife force-pushed the vllm-chat branch from 1e30f67 to ca4efef Compare April 16, 2024 03:17

arnavsinghvi11 merged commit 715e847 into stanfordnlp:main Apr 16, 2024

Add vllm chat completion endpoints and fix inspect_history #811

Add vllm chat completion endpoints and fix inspect_history #811

Uh oh!

Conversation

noobHappylife commented Apr 13, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

arnavsinghvi11 Apr 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

arnavsinghvi11 commented Apr 15, 2024

Uh oh!

arnavsinghvi11 commented Apr 16, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

arnavsinghvi11 Apr 15, 2024 •

edited

Loading