Harrison/add huggingface hub #23
Conversation
repo_id = values.get("repo_id", DEFAULT_REPO_ID)
values["client"] = InferenceApi(
    repo_id=repo_id,
    token=os.environ["HUGGINGFACEHUB_API_TOKEN"],
These parameters seem like nice ones to include as init args, with optional env vars as backups.
Also, the relationship feels a bit convoluted at first glance.
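For illustration, a minimal sketch of that pattern - explicit init args with an environment variable as the fallback. The class name, arg names, and default repo id here are hypothetical, not the PR's actual API:

import os
from typing import Optional

DEFAULT_REPO_ID = "gpt2"  # placeholder default, assumed for this sketch


class HuggingFaceHubConfig:
    def __init__(self, repo_id: str = DEFAULT_REPO_ID, token: Optional[str] = None) -> None:
        # Prefer the explicit init arg; fall back to the env var otherwise.
        self.repo_id = repo_id
        self.token = token or os.environ["HUGGINGFACEHUB_API_TOKEN"]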
what in particular seems convoluted?
I think the main thing is that the client is created in a validation function - but that seems out of scope of this PR given the existing structure
ah yeah i agree, it is a bit weird. may look into post_init
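For context, a rough sketch of the post-init alternative mentioned here: client construction moves into a dedicated dataclass __post_init__ hook instead of happening as a side effect of a validation function. This is a hypothetical shape under an assumed default repo id, not code from the PR:

import os
from dataclasses import dataclass, field
from typing import Any

from huggingface_hub.inference_api import InferenceApi

DEFAULT_REPO_ID = "gpt2"  # assumed default, not confirmed by this diff


@dataclass
class HuggingFaceHubClient:
    repo_id: str = DEFAULT_REPO_ID
    client: Any = field(init=False)

    def __post_init__(self) -> None:
        # Construction happens in an explicit post-init hook rather than
        # inside a validator.
        self.client = InferenceApi(
            repo_id=self.repo_id,
            token=os.environ["HUGGINGFACEHUB_API_TOKEN"],
        )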
if "error" in response: | ||
raise ValueError(f"Error raised by inference API: {response['error']}") | ||
text = response[0]["generated_text"][len(prompt) :] | ||
if stop is not None: |
Could we share this across models?
Yeah, it's a bit hacky, but I don't see better support via InferenceAPI, and ultimately it just results in a bit of wasted computation.
what do you mean by share across models? it's already factored out and used in cohere as well (cohere is a bit different - you can pass stop words but they are included at the end of the prompt)
Yeah this is sufficient - not enough examples to merit more
from typing import List


def enforce_stop_tokens(text: str, stop: List[str]) -> str:
Double checking - the prompt isn't returned as part of the generated text, right?
no it shouldn't be
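For context, one plausible body for this helper - truncating the text at the first occurrence of any stop sequence via a regex split. The regex approach is an assumption; the visible diff only shows the signature:

import re
from typing import List


def enforce_stop_tokens(text: str, stop: List[str]) -> str:
    """Cut the text off at the first occurrence of any stop sequence."""
    # Split on an alternation of the escaped stop strings; keeping the first
    # piece drops everything from the first stop token onward.
    return re.split("|".join(re.escape(s) for s in stop), text)[0]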
…lback_docs Update examples to use inheritable callbacks
Add support for huggingface hub
I could not find a good way to enforce stop tokens over the huggingface hub api - hopefully that can be cleaned up in the future.
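To make that limitation concrete, a hedged end-to-end sketch stitched together from the diff hunks above: the hub API exposes no stop parameter, so the prompt echo is sliced off and stop tokens are enforced client-side. The repo id and stop list are illustrative only, and enforce_stop_tokens is the helper sketched earlier:

import os

from huggingface_hub.inference_api import InferenceApi

# repo_id and stop list are illustrative; enforce_stop_tokens is the helper
# sketched above.
client = InferenceApi(repo_id="gpt2", token=os.environ["HUGGINGFACEHUB_API_TOKEN"])
prompt = "Q: What is the capital of France?\nA:"
response = client(inputs=prompt)
if "error" in response:
    raise ValueError(f"Error raised by inference API: {response['error']}")
# The hub echoes the prompt, so slice it off the front of the generation.
text = response[0]["generated_text"][len(prompt):]
# No server-side stop support, so truncate locally.
text = enforce_stop_tokens(text, stop=["\n"])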