
Added RAG settings to settings.py, vector_store and chat_service to a… #1771

Merged
merged 3 commits into from Mar 20, 2024

Conversation

icsy7867
Contributor

Since I was out of town and there were some rather large changes to the code, I am opening a new PR for cleanliness.

#1715 was the original.

In the settings.yaml file I added two new settings. Both of these are important to use together IMO.

rag:
  similarity_top_k: 2
  # Controls how many "top" documents the RAG returns to use in the context.
  #similarity_value: 0.45
  # Disabled by default. If you enable this setting, the RAG will only use documents that meet a certain similarity score.

similarity_top_k controls how many of the "top" results are returned by the RAG pipeline. Since I am ingesting lots of information for our organization from many different areas, sometimes I might have 5, 6, or 7 relevant sources.

However, the risk of increasing this value is that you pull in a lot of junk that is only related by a keyword or two. That is where similarity_value comes into play; it is used in the chat_service.py file.

This score requires a document to match the query at a certain level, or it is thrown out. In my test cases 0.40 works well to weed out some of the lower-end junk that shows up when you increase similarity_top_k to a higher value.
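
For illustration, here is a minimal sketch (not the exact chat_service.py code from this PR) of how such a score cutoff can be applied with llama_index's SimilarityPostprocessor, assuming llama-index >= 0.10 import paths; the node texts and scores are made up:

from llama_index.core.postprocessor import SimilarityPostprocessor
from llama_index.core.schema import NodeWithScore, TextNode

# Two retrieved chunks with illustrative similarity scores.
nodes = [
    NodeWithScore(node=TextNode(text="directly relevant chunk"), score=0.62),
    NodeWithScore(node=TextNode(text="keyword-only match"), score=0.31),
]

# Drop anything scoring below the configured cutoff (similarity_value).
postprocessor = SimilarityPostprocessor(similarity_cutoff=0.40)
filtered = postprocessor.postprocess_nodes(nodes)  # keeps only the 0.62 node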

Collaborator

@imartinez imartinez left a comment

I've added two alternatives to an issue I found. I'd really advise going with the second option, even if it means changing a bit more of your PR. Otherwise, looking great!

@@ -135,6 +135,7 @@ def get_retriever(
similarity_top_k: int = 2,
) -> VectorIndexRetriever:
# This way we support qdrant (using doc_ids) and the rest (using filters)
similarity_top_k = self.settings.rag.similarity_top_k
Collaborator

You are effectively making the similarity_top_k function param useless, because you are overriding it. I'd suggest respecting it if set. Something like this should work:

def get_retriever(
    self,
    index: VectorStoreIndex,
    context_filter: ContextFilter | None = None,
    similarity_top_k: int | None = None,
) -> VectorIndexRetriever:
    # This way we support qdrant (using doc_ids) and the rest (using filters)
    similarity_top_k = similarity_top_k or self.settings.rag.similarity_top_k

There is an alternative which I like more, because it is more scalable as the project evolves than the current approach of impacting get_retriever for the whole application:

  • change settings.rag.similarity_top_k to settings.context_chat_rag.similarity_top_k
  • in chat_service.py, when get_retriever is called, pass the similarity_top_k param, reading it from that setting

That way the setting only applies to the contextual chat use case, and whenever we add more use cases that require a different number of retrieved top_k results, we can fine-tune it with separate settings.
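
To make the second option concrete, here is a rough sketch of what the chat_service.py call site could look like (attribute names are assumptions based on the discussion, not the merged code):

# Hypothetical sketch: read the setting at the call site instead of inside
# get_retriever, so other callers keep the function's default behaviour.
vector_index_retriever = self.vector_store_component.get_retriever(
    index=self.index,
    context_filter=context_filter,
    similarity_top_k=self.settings.rag.similarity_top_k,
)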

Contributor Author

I have changed the code, except I left the settings value as "rag" instead of context_chat_rag. I basically ran out of time and needed to switch gears. But I can change that settings name to context_chat_rag if you prefer.

Collaborator

@imartinez imartinez left a comment

I think it is perfectly fine to call it RagSettings, given that the only "RAG" in PrivateGPT at the moment is the contextual chat.

@imartinez imartinez merged commit 087cb0b into zylon-ai:main Mar 20, 2024
6 checks passed