# Add New Integration for DeepInfra Embedding Model #1

## Conversation
I don't see the code that does the auto-batching (the one you added for LangChain). Other than that it looks like a standalone project, which is probably how this is organized. Looks good otherwise.
```python
async def _aget_query_embedding(self, query: str) -> List[float]:
    """
    Async get query embedding.
```
Async here? We do some sharing of sync and async code by computing the headers, URL, and body in helper methods and calling those from both the sync and async request/HTTP client APIs.

Also implement both sync and async. And what is the difference between text and query? Some models do make a difference (normally you prefix the text with
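A minimal sketch of the sharing pattern described above (class and method names are hypothetical, not the PR's actual code): compute the URL, headers, and body once in a shared helper, and let both the sync and async paths call it. To keep the sketch stdlib-only, the async variant just offloads the blocking call to a thread; a real integration would use an async HTTP client such as aiohttp, and the endpoint URL shown is an assumption.

```python
import asyncio
import json
import urllib.request
from typing import Dict, List, Tuple


class DeepInfraEmbeddingModel:
    def __init__(self, api_token: str, model_id: str) -> None:
        self._api_token = api_token
        self._model_id = model_id

    def _request_params(self, texts: List[str]) -> Tuple[str, Dict[str, str], bytes]:
        # Shared by the sync and async paths: one place to build the request.
        url = f"https://api.deepinfra.com/v1/inference/{self._model_id}"
        headers = {
            "Authorization": f"Bearer {self._api_token}",
            "Content-Type": "application/json",
        }
        body = json.dumps({"inputs": texts}).encode()
        return url, headers, body

    def get_text_embeddings(self, texts: List[str]) -> List[List[float]]:
        url, headers, body = self._request_params(texts)
        req = urllib.request.Request(url, data=body, headers=headers)
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)["embeddings"]

    async def aget_text_embeddings(self, texts: List[str]) -> List[List[float]]:
        # Stand-in for a real async HTTP client: same shared params, different transport.
        return await asyncio.to_thread(self.get_text_embeddings, texts)
```

Only `_request_params` knows how to build a request, so the two transports cannot drift apart.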
```
@@ -0,0 +1,3 @@
poetry_requirements(
    name="poetry",
```
Are you sure you need this? If you already have `poetry_requirements`, that would require poetry to read it, no?
```python
from dotenv import load_dotenv, find_dotenv
from llama_index.embeddings.deepinfra import DeepInfraEmbeddingModel

_ = load_dotenv(find_dotenv())
```
Maybe add a comment to explain what exactly you are loading; it's not obvious to me.
```python
print(response)
```

### Use with text prefix
There should be one example that passes both query and text prefixes, with a short description of why that is useful. I don't see how having two examples is better.
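The single combined example might look something like the following. This is a self-contained sketch with a stub embedding function rather than the package's real API, and the `query_prefix`/`text_prefix` parameter names are assumptions. The "why" is that asymmetric retrieval models (E5-style, for instance) are trained with distinct `"query: "` and `"passage: "` markers, so queries and documents must be prefixed differently before embedding:

```python
from typing import Callable, List


class PrefixedEmbedder:
    """Sketch: apply different prefixes to queries vs. documents."""

    def __init__(
        self,
        embed_fn: Callable[[str], List[float]],
        query_prefix: str = "query: ",
        text_prefix: str = "passage: ",
    ) -> None:
        self._embed_fn = embed_fn
        self._query_prefix = query_prefix
        self._text_prefix = text_prefix

    def get_query_embedding(self, query: str) -> List[float]:
        return self._embed_fn(self._query_prefix + query)

    def get_text_embedding(self, text: str) -> List[float]:
        return self._embed_fn(self._text_prefix + text)


# Stub embedding function so the example runs offline; records what it was given.
seen: List[str] = []

def fake_embed(text: str) -> List[float]:
    seen.append(text)
    return [float(len(text))]


embedder = PrefixedEmbedder(fake_embed)
embedder.get_query_embedding("where is the Eiffel Tower?")
embedder.get_text_embedding("The Eiffel Tower is in Paris.")
print(seen)
```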
""" | ||
Add query prefix to queries. | ||
""" | ||
return [self._query_prefix + query for query in queries] |
You can do `[self._query_prefix + query for query in queries] if self._query_prefix else queries` so it doesn't slow things down when there is no query prefix.
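As a standalone function (the PR version is a method using `self._query_prefix`; the free-function form here is just for illustration), the suggestion avoids building a new list entirely when no prefix is set:

```python
from typing import List


def maybe_add_query_prefix(queries: List[str], query_prefix: str = "") -> List[str]:
    """Prefix each query, but skip the copy when no prefix is configured."""
    return [query_prefix + q for q in queries] if query_prefix else queries


queries = ["llama index", "deepinfra embeddings"]
print(maybe_add_query_prefix(queries, "query: "))
print(maybe_add_query_prefix(queries) is queries)  # True: same list, no copy
```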
```python
print(response)
```

### Asynchronous requests
I guess if all the others have these verbose examples then fine, but the setup is the same, only the method call is different, and even that is the API of `BaseEmbedding` (i.e. nothing DeepInfra-specific). So you decide whether to keep this or shorten it a bit (maybe a single setup + multiple function calls).
Dealt with it. Decreased the code repetition.
""" | ||
Chunk items into batches of size MAX_BATCH_SIZE. | ||
""" | ||
return [items[i : i + MAX_BATCH_SIZE] for i in range(0, len(items), MAX_BATCH_SIZE)] |
`MAX_BATCH_SIZE` can be a constructor arg, and this function can be a method.
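A sketch of the suggested refactor (the default value is a placeholder, not DeepInfra's actual limit):

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


class DeepInfraEmbeddingModel:
    # Placeholder default; the real cap would come from DeepInfra's backend limits.
    DEFAULT_MAX_BATCH_SIZE = 1024

    def __init__(self, max_batch_size: int = DEFAULT_MAX_BATCH_SIZE) -> None:
        self._max_batch_size = max_batch_size

    def _chunk(self, items: Sequence[T]) -> List[Sequence[T]]:
        """Chunk items into batches of at most self._max_batch_size."""
        n = self._max_batch_size
        return [items[i : i + n] for i in range(0, len(items), n)]


model = DeepInfraEmbeddingModel(max_batch_size=2)
print(model._chunk([1, 2, 3, 4, 5]))  # [[1, 2], [3, 4], [5]]
```

Making it a method keeps the batch size configurable per instance while still defaulting to whatever the backend allows.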
Isn't this parameter related to DeepInfra's backend?
Looks good, send it upstream!
### Description
This pull request introduces a new integration for the DeepInfra Inference API.

### Motivation and Context
This enables users to access DeepInfra's models through llama_index.
### Fixes

### New Package?
If yes, I have filled in the `tool.llamahub` section in the `pyproject.toml` and provided a detailed `README.md` for my new integration.

### Version Bump?

### Type of Change

### How Has This Been Tested?

### Suggested Checklist:
- I ran `make format; make lint` to appease the lint gods