Added support for LlamaIndex Retrievers as a DSPy Retriever Module #911

no-dice-io · 2024-04-26T16:07:06Z

Given that Llama Index has some of the most wide and robust support for various retrieval abstractions and methods, adding an interface from LlamaIndex into DSPy would allow many to leverage the various retrieval classes open in LlamaIndex.

https://docs.llamaindex.ai/en/stable/api_reference/retrievers/

compatibility.

logan-markewich · 2024-04-26T19:39:49Z

dspy/retrieve/llama_index_rm.py

+    @property
+    def similarity_top_k(self) -> int:
+        """Return similarity top k of retriever."""
+        return self.retriever.similarity_top_k


not every retriever in llama-index will have this property 👀 It kind of depends. The BaseRetriever class is fairly generic, and just exposes retrieve()/aretrieve() methods

Ooh thank you for pointing this out - I'll make an adjustment shortly!

Thanks @logan-markewich for looking into this!

I believe I've resolved this - please review!

I think this looks good, but the output should be typed as Optional[int] though

Kind of curious what the motivation is to expose this on the class? Is this something that dspy can optimize later?

I don't recall if it can be optimized, but its a parameter that is often used with their forward method so I was looking for a way to pass it to the underlying LI retriever.

ammirsm

Thanks for the contribution and help!

The PR overal LGTM, couple small stuffs which needs attention and we can approve and merge it.

ammirsm · 2024-04-26T23:27:44Z

pyproject.toml

 psycopg2 = { version = "^2.9.9", optional = true }
 pgvector = { version = "^0.2.5", optional = true }
 structlog = "^24.1.0"
+llama-index = "^0.10.30"


don't you think it should be optional?

I know it is causing lots of confusion but for the dependencies right now we use 3 different places... I have a PR to fix that in a short but if it is going to be merge before that you need to add your dependencies in requirements.txt (which use for building our package), here that you added it, and dependencies in the pyproject.toml ...

more info:
#819

I'm open to making the dependency optional - but wouldn't my tests fail as you pointed out if we run that route?

I think the best way is to tag those tests to just be ran when the llama-index is installed and we fix the CI based on that.

I'll go with that. I'll make the adjustment today or tomorrow. I'll only run those tests when llamaindex is present.

ammirsm · 2024-04-26T23:28:44Z

tests/retrieve/test_llama_index_rm.py

+
+
+def test_lirm_as_rm(rag_setup):
+    """Test the retriever as retriever method"""


I think this test will fail in the CI if we make that dependency optional...

dspy/retrieve/llama_index_rm.py

properties. Updated requirements.txt.

ammirsm · 2024-05-06T18:09:05Z

There are some merge conflicts in pyproject and poetry and I think after that we should be good to go.

no-dice-io · 2024-05-10T00:51:05Z

There are some merge conflicts in pyproject and poetry and I think after that we should be good to go.

I believe I've resolved this now!

ammirsm

LGTM.

arnavsinghvi11 · 2024-05-21T04:53:37Z

Thanks all!

no-dice-io added 4 commits April 22, 2024 19:52

feat(llama_index_rm): Llama Index retreivers

16c464d

compatibility.

feat(llamaindex): added llamaindex support

b3b47eb

feat(llamaindex): added llamaindex rm

1bebd46

feat(llamaindex): refactored tests

d630d8e

logan-markewich reviewed Apr 26, 2024

View reviewed changes

ammirsm requested changes Apr 26, 2024

View reviewed changes

no-dice-io added 3 commits April 28, 2024 14:53

refactor(llama_index_rm): per pr comments

a2f2bd2

Merge branch 'stanfordnlp:main' into li-bridge

f070e05

refactor(llama_index_rm): simplified class

5d1989a

properties. Updated requirements.txt.

no-dice-io requested review from ammirsm and logan-markewich April 29, 2024 15:51

no-dice-io added 2 commits May 10, 2024 00:35

fix(conflicts): resolved li dependency conflicts

216546b

fix(dependencies): updated LI tests to be optional

269d9e8

no-dice-io added 2 commits May 9, 2024 20:52

Merge branch 'main' into li-bridge

259843d

Merge branch 'main' into li-bridge

9ab749e

ammirsm approved these changes May 13, 2024

View reviewed changes

arnavsinghvi11 merged commit 5db3e34 into stanfordnlp:main May 21, 2024



		def test_lirm_as_rm(rag_setup):
		"""Test the retriever as retriever method"""

Added support for LlamaIndex Retrievers as a DSPy Retriever Module #911

Added support for LlamaIndex Retrievers as a DSPy Retriever Module #911

Uh oh!

Conversation

no-dice-io commented Apr 26, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ammirsm left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ammirsm commented May 6, 2024

Uh oh!

no-dice-io commented May 10, 2024

Uh oh!

ammirsm left a comment

Choose a reason for hiding this comment

Uh oh!

arnavsinghvi11 commented May 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants