Add Text Generation Inference with JSON output #235

joaomsimoes · 2024-06-14T12:35:40Z

Added the option to use Text Generation Inference (TGI) from Hugging Face with JSON-formatted output, which makes the output more predictable. TGI support is also useful for anyone who already runs TGI for other purposes.

MaartenGr

Thank you for this PR and your work on it! I left a comment and it seems that the docstrings are not yet updated to match the current implementation. Also, it would be great if you could add a small section on how this works in the docs, see https://github.com/MaartenGr/KeyBERT/blob/master/docs/guides/llms.md

MaartenGr · 2024-06-17T09:37:20Z

keybert/llm/_textgenerationinference.py

+    ```
+    """
+    def __init__(self,
+                 url: str,


I propose to pass the entire InferenceClient rather than just the URL since not all its parameters are exposed at the moment. Moreover, it would then follow the same structure as is done with OpenAI in this repo.

I updated the code. TextGenerationInference now accepts an InferenceClient when it is constructed. I also added a json_schema parameter in case someone needs a different output format.
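To illustrate the design discussed above, here is a minimal offline sketch of a wrapper that receives a ready-made client plus an optional JSON schema. It is not the merged implementation: `StubClient`, `KeywordGenerator`, `DEFAULT_SCHEMA`, and the prompt text are hypothetical names made up for this example, and the `grammar={"type": "json", "value": ...}` argument is an assumption about how a schema is handed to `InferenceClient.text_generation`.

```python
import json
from typing import Any, Dict, List, Optional

# Hypothetical default schema: an object holding a list of keyword strings.
DEFAULT_SCHEMA: Dict[str, Any] = {
    "type": "object",
    "properties": {"keywords": {"type": "array", "items": {"type": "string"}}},
    "required": ["keywords"],
}


class StubClient:
    """Stand-in for an inference client so the sketch runs without a server."""

    def __init__(self) -> None:
        self.calls: List[Dict[str, Any]] = []

    def text_generation(self, prompt: str, **kwargs: Any) -> str:
        # Record the call and return a canned JSON string.
        self.calls.append({"prompt": prompt, **kwargs})
        return json.dumps({"keywords": ["keyword extraction", "json output"]})


class KeywordGenerator:
    """Hypothetical wrapper: takes a configured client rather than a URL."""

    def __init__(self, client: Any, json_schema: Optional[Dict[str, Any]] = None) -> None:
        self.client = client
        self.json_schema = json_schema or DEFAULT_SCHEMA

    def extract_keywords(self, document: str) -> List[str]:
        prompt = f"Extract keywords from the following text as JSON.\n\n{document}"
        # Ask the server to constrain generation to the schema (assumed format).
        raw = self.client.text_generation(
            prompt, grammar={"type": "json", "value": self.json_schema}
        )
        # Because the output is JSON, it can be parsed instead of pattern-matched.
        return json.loads(raw)["keywords"]


client = StubClient()
generator = KeywordGenerator(client)
keywords = generator.extract_keywords("KeyBERT can use an LLM for keyword extraction.")
```

Because the wrapper only stores the client it is given, any option the client supports can be configured by the caller before it is passed in, which is the point of the review suggestion.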


joaomsimoes · 2024-06-21T11:39:02Z

I also added inference_kwargs to the extract_keywords function. I think it is good to be able to control temperature and max new tokens at inference time, in case two different tasks need different settings.
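As a rough sketch of the per-call control described here, the snippet below forwards a dictionary of generation settings to the client on each call. `RecordingClient` and `generate` are hypothetical helpers written for this example and are not code from the PR; `temperature` and `max_new_tokens` are used as the setting names on the assumption that the client accepts them as keyword arguments.

```python
from typing import Any, Dict, List, Optional


class RecordingClient:
    """Offline stand-in for an inference client; it only records what it receives."""

    def __init__(self) -> None:
        self.calls: List[Dict[str, Any]] = []

    def text_generation(self, prompt: str, **kwargs: Any) -> str:
        self.calls.append(kwargs)
        return '{"keywords": []}'


def generate(
    client: Any, prompt: str, inference_kwargs: Optional[Dict[str, Any]] = None
) -> str:
    # Default to None and copy, so the caller's dict is never shared or mutated.
    kwargs = dict(inference_kwargs or {})
    return client.text_generation(prompt, **kwargs)


client = RecordingClient()
# Short, low-temperature output for one task...
generate(client, "Extract keywords.", {"temperature": 0.1, "max_new_tokens": 64})
# ...and longer, more varied output for another, using the same client.
generate(client, "Suggest related topics.", {"temperature": 0.9, "max_new_tokens": 256})
```

Passing the settings per call rather than at construction time means one configured client can serve tasks with different generation requirements.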

MaartenGr · 2024-06-23T06:56:00Z

Awesome, everything looks good to me! Thank you for the work on this, it is highly appreciated 😄

joaomsimoes added 6 commits June 14, 2024 14:51

added tgi huggingface

239fc43

added tgi huggingface

02be03f

added tgi huggingface

7f525c4

added tgi huggingface

cdbc7ae

added tgi huggingface

420b4c9

added tgi huggingface

d246fe0

MaartenGr reviewed Jun 17, 2024


joaomsimoes added 2 commits June 21, 2024 14:31

passing InferenceClient and json schema when constructing TextGenerat…

c5ec185

…ionInference. Updated documentation

added inference kwargs to control temperature, max new tokens, etc...

dfb8ba9

MaartenGr merged commit 09ca938 into MaartenGr:master Jun 23, 2024
5 checks passed
