community: Batching added in embed_documents of HuggingFaceInferenceAPIEmbeddings #16457

abhishek9998 · 2024-01-23T15:59:22Z

Description: Batching added in embed_documents of HuggingFaceInferenceAPIEmbeddings
Issue: HuggingFaceInferenceAPIEmbeddings getting 413 request code because of not batching mechanism like SentenceTransformer #16443
Dependencies: tqdm
Twitter handle: @Abhishingadiya

hwchase17 · 2024-01-30T03:55:41Z

libs/community/langchain_community/embeddings/huggingface.py

-        return response.json()
+        all_embeddings = []
+        length_sorted_idx = np.argsort([-self._text_length(sen) for sen in texts])
+        sentences_sorted = [texts[idx] for idx in length_sorted_idx]


where is this used?

Example Code

embeddings = HuggingFaceInferenceAPIEmbeddings(
    api_key=inference_api_key,
    api_url=api_url,
    model_name="bge-large-en-v1.5",
)
pinecone.init(api_key=os.getenv("PINECONE_API_KEY"), environment=environment)

loader = PyPDFDirectoryLoader("data")
docs = loader.load()
text_splitter = RecursiveCharacterTextSplitter(chunk_size=2000, chunk_overlap=200)
chunks = text_splitter.split_documents(docs)

vectordb = Pinecone.from_documents(
    chunks, embeddings, index_name=index_name, namespace=namespace
)

This code snippet gets a 413 response from the following request in huggingface.py:

response = requests.post(
    self._api_url,
    headers=self._headers,
    json={
        "inputs": texts,
        "options": {"wait_for_model": True, "use_cache": True},
    },
)
return response.json()

We should support a batch size here, as the local model embedding does.
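
A minimal sketch of what batching could look like here, not the code from this PR. The function name `embed_in_batches`, the `embed_batch` callable (standing in for one POST to the inference endpoint), and the default `batch_size` of 32 are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def embed_in_batches(
    texts: Sequence[str],
    embed_batch: Callable[[List[str]], List[Vector]],
    batch_size: int = 32,  # assumed default; the real server limit may differ
) -> List[Vector]:
    """Embed texts in fixed-size batches instead of one large request.

    embed_batch is a hypothetical callable that sends one batch to the
    embedding endpoint and returns one vector per input text.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")

    embeddings: List[Vector] = []
    for start in range(0, len(texts), batch_size):
        batch = list(texts[start : start + batch_size])
        vectors = embed_batch(batch)
        # Guard against a response that does not line up with the inputs.
        if len(vectors) != len(batch):
            raise ValueError(
                f"expected {len(batch)} vectors, got {len(vectors)}"
            )
        embeddings.extend(vectors)
    return embeddings
```

With the 420 chunks described below and a batch size of 32, this would issue 14 smaller requests (13 of 32 texts and one of 4) rather than a single request carrying everything.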

Description

I am trying to use Pinecone with Hugging Face inference as the embedding model. I have 420 chunks in total, and it tries to process all of them in one request.
Also, embedding_chunk_size cannot be passed through the Pinecone.from_documents() method.

removed extra parameter.

Added batching mechanism in HuggingFaceInferenceAPIEmbeddings to support max_client_batch_size from server side.
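
On the length sorting raised in the review: sorting texts longest first, as the diff does, puts texts of similar size in the same batch, but the results then have to be returned in the caller's original order. The sketch below shows that general pattern in plain Python. It is an illustration under assumptions, not the PR's implementation: `embed_sorted_by_length` and the `embed_batch` callable (one request per batch) are hypothetical names, and `len` stands in for the diff's `_text_length`.

```python
from typing import Callable, List, Optional, Sequence

Vector = List[float]


def embed_sorted_by_length(
    texts: Sequence[str],
    embed_batch: Callable[[List[str]], List[Vector]],
    batch_size: int = 32,  # assumed value, not taken from the PR
) -> List[Vector]:
    """Embed texts longest-first in batches, then restore input order."""
    # Indices of texts ordered from longest to shortest, the plain-Python
    # equivalent of argsort over negated lengths.
    order = sorted(range(len(texts)), key=lambda i: -len(texts[i]))
    sorted_texts = [texts[i] for i in order]

    sorted_vectors: List[Vector] = []
    for start in range(0, len(sorted_texts), batch_size):
        sorted_vectors.extend(embed_batch(sorted_texts[start : start + batch_size]))

    # Undo the sort: the vector at sorted position `pos` belongs to the
    # text that was originally at index `order[pos]`.
    result: List[Optional[Vector]] = [None] * len(texts)
    for pos, original_index in enumerate(order):
        result[original_index] = sorted_vectors[pos]
    return [vec for vec in result if vec is not None]
```

Without the final reordering step, the returned embeddings would be paired with the wrong documents in whatever vector store consumes them.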

abhishek9998 · 2024-05-08T19:17:59Z

Any update, @baskaryan, @hwchase17?
PS: I am using a custom branch of langchain in production because HuggingFaceInferenceAPIEmbeddings does not support batching.
Same issue: https://discuss.huggingface.co/t/batch-size-limit-32/82471

abhishek9998 added 3 commits January 23, 2024 17:50

Update huggingface.py

55aa112

Update huggingface.py

962498d

Update huggingface.py

41391c0

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Jan 23, 2024

dosubot bot added Ɑ: embeddings Related to text embedding models module 🤖:improvement Medium size change to existing code to handle new use-cases labels Jan 23, 2024

Merge branch 'master' into master

10ff259

hwchase17 reviewed Jan 30, 2024

Merge branch 'langchain-ai:master' into master

3a66dce

hwchase17 closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

baskaryan closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

baskaryan closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

abhishek9998 and others added 6 commits January 31, 2024 10:18

Merge branch 'langchain-ai:master' into master

4dd9161

Update huggingface.py

1fad58c

removed extra parameter.

Merge branch 'langchain-ai:master' into master

36d2bbb

Update huggingface.py

c18b129

Added batching mechanism in HuggingFaceInferenceAPIEmbeddings to support max_client_batch_size from server side.

Merge branch 'master' into abhishek9998/master

69681f2

fmt

0bd5fc7

Merge branch 'langchain-ai:master' into master

8e6d47c
