Community: Fuse HuggingFace Endpoint-related classes into one #17254

aymeric-roucher · 2024-02-08T18:44:04Z

Description

Fuse HuggingFace Endpoint-related classes into one:

Are fused into

HuggingFaceEndpoint

Issue

The deduplication of classes was creating a lack of clarity, and additional effort to develop classes leads to issues like this hack.

Dependancies

None, this removes dependancies.

Twitter handle

If you want to post about this: @AymericRoucher

vercel · 2024-02-08T18:44:29Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
langchain	🛑 Canceled (Inspect)			Feb 19, 2024 6:36pm

aymeric-roucher · 2024-02-09T17:09:36Z

@andrewrreed @Jofthomas if you want to take a look at the PR!

aymeric-roucher · 2024-02-12T08:54:35Z

@baskaryan @efriis what do you think: should I remove the huggingface_hub.py and
huggingface_text_gen_inference.py scripts?

baskaryan · 2024-02-12T23:14:55Z

@baskaryan @efriis what do you think: should I remove the huggingface_hub.py and huggingface_text_gen_inference.py scripts?

no we should leave them for backwards compatibility but mark them as deprecated. here's an example of how to do that for a class

langchain/libs/community/langchain_community/storage/upstash_redis.py

Line 123 in 54fa78c

@deprecated("0.0.335", alternative="UpstashRedisByteStore")

aymeric-roucher · 2024-02-13T09:50:03Z

Thank you @baskaryan !
I've emptied both classes and marked them with:
@deprecated("0.1.7", alternative="HuggingFaceEndppoint")

➡️ For the release number, I just went to the root repo and did current version+1, could you confirm it is the right process?

libs/community/langchain_community/llms/huggingface_text_gen_inference.py

baskaryan · 2024-02-13T18:39:31Z

libs/community/langchain_community/llms/huggingface_text_gen_inference.py

 logger = logging.getLogger(__name__)

-
+@deprecated("0.1.7", alternative="HuggingFaceEndppoint") 


baskaryan

one last comment then lgtm

baskaryan · 2024-02-14T18:43:25Z

libs/community/langchain_community/chat_models/huggingface.py

@@ -41,7 +38,7 @@ class ChatHuggingFace(BaseChatModel):
    Adapted from: https://python.langchain.com/docs/integrations/chat/llama2_chat
    """

-    llm: Union[HuggingFaceTextGenInference, HuggingFaceEndpoint, HuggingFaceHub]


this is also backwards incompatible, can we revert?

Only problem is when I add this line back, it seems to confuse the deprecation manager:
Now running this script

from langchain_community.llms import HuggingFaceEndpoint from langchain.chat_models import ChatHuggingFace llm = HuggingFaceEndpoint(repo_id="mistralai/Mixtral-8x7B-Instruct-v0.1") agent = ChatHuggingFace(llm=llm)

Triggers this error:

--------------------------------------------------------------------------- NotImplementedError Traceback (most recent call last) [/home/ubuntu/benchmark_agents/test_frameworks.ipynb](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/test_frameworks.ipynb) Cell 1 line 5 [2](vscode-notebook-cell://ssh-remote%2Bec2/home/ubuntu/benchmark_agents/test_frameworks.ipynb#X56sdnNjb2RlLXJlbW90ZQ%3D%3D?line=1) from langchain.chat_models import ChatHuggingFace [4](vscode-notebook-cell://ssh-remote%2Bec2/home/ubuntu/benchmark_agents/test_frameworks.ipynb#X56sdnNjb2RlLXJlbW90ZQ%3D%3D?line=3) llm = HuggingFaceEndpoint(repo_id="mistralai/Mixtral-8x7B-Instruct-v0.1") ----> [5](vscode-notebook-cell://ssh-remote%2Bec2/home/ubuntu/benchmark_agents/test_frameworks.ipynb#X56sdnNjb2RlLXJlbW90ZQ%3D%3D?line=4) agent = ChatHuggingFace(llm=llm) File [~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:50](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:50), in ChatHuggingFace.__init__(self, **kwargs) [49](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:49) def __init__(self, **kwargs: Any): ---> [50](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:50) super().__init__(**kwargs) [52](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:52) from transformers import AutoTokenizer [54](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/langchain_fuse_hf_endpoints/libs/community/langchain_community/chat_models/huggingface.py:54) self._resolve_model_id() File [~/venv/agents/lib/python3.10/site-packages/langchain_core/load/serializable.py:107](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/load/serializable.py:107), in Serializable.__init__(self, **kwargs) [106](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/load/serializable.py:106) def __init__(self, **kwargs: Any) -> None: --> [107](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/load/serializable.py:107) super().__init__(**kwargs) [108](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/load/serializable.py:108) self._lc_kwargs = kwargs File [~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:339](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:339), in BaseModel.__init__(__pydantic_self__, **data) [333](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:333) """ [334](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:334) Create a new model by parsing and validating input data from keyword arguments. [335](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:335) [336](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:336) Raises ValidationError if the input data cannot be parsed to form a valid model. [337](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:337) """ [338](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/pydantic/v1/main.py:338) # Uses something other than `self` the first arg to allow "self" as a settable attribute ... [339](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/_api/deprecation.py:339) ) [340](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/_api/deprecation.py:340) else: [341](https://vscode-remote+ssh-002dremote-002bec2.vscode-resource.vscode-cdn.net/home/ubuntu/benchmark_agents/~/venv/agents/lib/python3.10/site-packages/langchain_core/_api/deprecation.py:341) removal = f"in {removal}" NotImplementedError: Need to determine which default deprecation schedule to use. within ?? minor releases

The error seems to be triggered here and seems due to the fact that I have not setup neither pending nor removal argument.
@baskaryan : do you know the proper configuration I should use in my @deprecated decorator?

ah yea let's add a removal argument, can set removal="0.2.0"

…_endpoints

aymeric-roucher · 2024-02-19T12:40:11Z

@baskaryan seems like we're good to merge?

aymeric-roucher · 2024-02-20T09:11:59Z

@baskaryan thanks a lot for this review!

I added follow-up on a very strange issue here: the order of the type hints in llm: Union[HuggingFaceTextGenInference, HuggingFaceEndpoint, HuggingFaceHub] has an impact on the way the program runs! I don't know how to solve this yet.

…ain-ai#17254) ## Description Fuse HuggingFace Endpoint-related classes into one: - [HuggingFaceHub](https://github.com/langchain-ai/langchain/blob/5ceaf784f324064b868a3cfed3fab7554173e7b3/libs/community/langchain_community/llms/huggingface_hub.py) - [HuggingFaceTextGenInference](https://github.com/langchain-ai/langchain/blob/5ceaf784f324064b868a3cfed3fab7554173e7b3/libs/community/langchain_community/llms/huggingface_text_gen_inference.py) - and [HuggingFaceEndpoint](https://github.com/langchain-ai/langchain/blob/5ceaf784f324064b868a3cfed3fab7554173e7b3/libs/community/langchain_community/llms/huggingface_endpoint.py) Are fused into - HuggingFaceEndpoint ## Issue The deduplication of classes was creating a lack of clarity, and additional effort to develop classes leads to issues like [this hack](https://github.com/langchain-ai/langchain/blob/5ceaf784f324064b868a3cfed3fab7554173e7b3/libs/community/langchain_community/llms/huggingface_endpoint.py#L159). ## Dependancies None, this removes dependancies. ## Twitter handle If you want to post about this: @AymericRoucher --------- Co-authored-by: Bagatur <baskaryan@gmail.com>

aymeric-roucher added 8 commits February 9, 2024 17:01

Fuse HF endpoints into one class

d49c57a

Improve validation and parameter layout

8c5e332

Add task support

2317ed3

Formatting

7a0b2ad

Change tests

9e48e57

Fix linting

4e970b7

Remove deprecated HD Endpoint classes

566e60c

Update doc

5b02bc6

aymeric-roucher force-pushed the master branch from 2dcee49 to 5b02bc6 Compare February 9, 2024 17:03

Correct line lengths

a55b398

aymeric-roucher force-pushed the master branch from edf5b20 to a55b398 Compare February 9, 2024 17:11

aymeric-roucher changed the title ~~[Draft] Community: Fuse HuggingFace Endpoint-related classes into one~~ Community: Fuse HuggingFace Endpoint-related classes into one Feb 9, 2024

vercel bot deployed to Preview February 9, 2024 17:19 View deployment

aymeric-roucher force-pushed the master branch from 747272b to 36f5074 Compare February 13, 2024 09:43

Deprecate HuggingFaceHub and HuggingfaceTextGenPipeline classes

e53c11f

aymeric-roucher force-pushed the master branch from 36f5074 to e53c11f Compare February 13, 2024 09:48

vercel bot deployed to Preview February 13, 2024 09:56 View deployment

baskaryan reviewed Feb 13, 2024

View reviewed changes

vercel bot deployed to Preview February 14, 2024 18:35 View deployment

fmt

159a9ef

baskaryan approved these changes Feb 14, 2024

View reviewed changes

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Feb 14, 2024

vercel bot deployed to Preview February 14, 2024 18:51 View deployment

Revert type hint change

961e623

vercel bot had a problem deploying to Preview February 15, 2024 08:55 Failure

Merge branch 'master' of github.com:aymeric-roucher/langchain_fuse_hf…

0c6411e

…_endpoints

aymeric-roucher force-pushed the master branch from 2eaead5 to 0c6411e Compare February 15, 2024 08:58

vercel bot had a problem deploying to Preview February 15, 2024 09:02 Failure

Formatting

478034c

vercel bot deployed to Preview February 15, 2024 09:25 View deployment

baskaryan mentioned this pull request Feb 15, 2024

community[patch]: Fix HuggingFace LLM to not repeat the prompt as part of the result #17363

Closed

This was referenced Feb 15, 2024

Issue in replicating results or running benchmark.ipynb aymeric-roucher/benchmark_agents#2

Closed

Not able to reproduce result with benchmark.ipynb aymeric-roucher/benchmark_agents#1

Closed

aymeric-roucher added 2 commits February 16, 2024 16:31

Add removal argument to decorator in huggingface_hub.py

f601292

Add removal argument in decorator of huggingface_text_gen_inference.py

58f32c8

vercel bot deployed to Preview February 16, 2024 15:54 View deployment

fmt

3f3e491

vercel bot deployed to Preview February 18, 2024 18:18 View deployment

Merge branch 'master' into master

a53fbab

vercel bot deployed to Preview February 19, 2024 11:18 View deployment

fmt

3f5b768

baskaryan merged commit 0d29476 into langchain-ai:master Feb 19, 2024
59 of 60 checks passed

vercel bot temporarily deployed to Preview February 19, 2024 18:36 Inactive

aymeric-roucher mentioned this pull request Mar 5, 2024

LangChain: Fix ChatHuggingFace by simplifying possible classes #18570

Closed

martj001 mentioned this pull request Apr 11, 2024

Handling huggingfacehub_api_token=None for HuggingFaceEndpoint #20342

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Community: Fuse HuggingFace Endpoint-related classes into one #17254

Community: Fuse HuggingFace Endpoint-related classes into one #17254

aymeric-roucher commented Feb 8, 2024

vercel bot commented Feb 8, 2024 •

edited

aymeric-roucher commented Feb 9, 2024

aymeric-roucher commented Feb 12, 2024

baskaryan commented Feb 12, 2024

aymeric-roucher commented Feb 13, 2024

baskaryan Feb 13, 2024

baskaryan left a comment

baskaryan Feb 14, 2024

aymeric-roucher Feb 15, 2024

aymeric-roucher Feb 15, 2024

baskaryan Feb 15, 2024 •

edited

aymeric-roucher commented Feb 19, 2024

aymeric-roucher commented Feb 20, 2024

		logger = logging.getLogger(__name__)


		@deprecated("0.1.7", alternative="HuggingFaceEndppoint")

Community: Fuse HuggingFace Endpoint-related classes into one #17254

Community: Fuse HuggingFace Endpoint-related classes into one #17254

Conversation

aymeric-roucher commented Feb 8, 2024

Description

Issue

Dependancies

Twitter handle

vercel bot commented Feb 8, 2024 • edited

aymeric-roucher commented Feb 9, 2024

aymeric-roucher commented Feb 12, 2024

baskaryan commented Feb 12, 2024

aymeric-roucher commented Feb 13, 2024

baskaryan Feb 13, 2024

Choose a reason for hiding this comment

baskaryan left a comment

Choose a reason for hiding this comment

baskaryan Feb 14, 2024

Choose a reason for hiding this comment

aymeric-roucher Feb 15, 2024

Choose a reason for hiding this comment

aymeric-roucher Feb 15, 2024

Choose a reason for hiding this comment

baskaryan Feb 15, 2024 • edited

Choose a reason for hiding this comment

aymeric-roucher commented Feb 19, 2024

aymeric-roucher commented Feb 20, 2024

vercel bot commented Feb 8, 2024 •

edited

baskaryan Feb 15, 2024 •

edited