
ConversationalRetrievalChain doesn't return score with sources #5067

Closed
2 of 14 tasks
zigax1 opened this issue May 21, 2023 · 21 comments · Fixed by #5655

Comments

@zigax1

zigax1 commented May 21, 2023

System Info

LangChain v0.0.171
ChromaDB v0.3.22
Python v3.10.11

Who can help?

No response

Information

  • The official example notebooks/scripts
  • My own modified scripts

Related Components

  • LLMs/Chat Models
  • Embedding Models
  • Prompts / Prompt Templates / Prompt Selectors
  • Output Parsers
  • Document Loaders
  • Vector Stores / Retrievers
  • Memory
  • Agents / Agent Executors
  • Tools / Toolkits
  • Chains
  • Callbacks/Tracing
  • Async

Reproduction

This is my code:

def askQuestion(self, collection_id, question):
    collection_name = "collection-" + str(collection_id)
    self.llm = ChatOpenAI(
        model_name=self.model_name,
        temperature=self.temperature,
        openai_api_key=os.environ.get('OPENAI_API_KEY'),
        streaming=True,
        callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]),
    )
    self.memory = ConversationBufferMemory(
        memory_key="chat_history", return_messages=True, output_key='answer'
    )

    chroma_Vectorstore = Chroma(
        collection_name=collection_name,
        embedding_function=self.embeddingsOpenAi,
        client=self.chroma_client,
    )

    self.chain = ConversationalRetrievalChain.from_llm(
        self.llm,
        chroma_Vectorstore.as_retriever(),
        return_source_documents=True,
        verbose=VERBOSE,
        memory=self.memory,
    )

    result = self.chain({"question": question})
    print(result)

    res_dict = {
        "answer": result["answer"],
    }

    res_dict["source_documents"] = []

    for source in result["source_documents"]:
        res_dict["source_documents"].append({
            "page_content": source.page_content,
            "metadata": source.metadata,
        })

    return res_dict

Expected behavior

When I print the result directly after result = self.chain({"question": question}), I get the sources, metadata, kwargs, question, and chat_history displayed.

I see here: https://github.com/hwchase17/langchain/blob/0c3de0a0b32fadb8caf3e6d803287229409f9da9/langchain/vectorstores/chroma.py#L165 and in line 182 of the official source code that similarity_search_with_score() is being called by default.

How can I also display the score, then?

@AvikantSrivastava

Hey @zigax1, if you look at the return line of the similarity_search function, you'll see that only the doc is being returned:

 return [doc for doc, _ in docs_and_scores] 

https://github.com/hwchase17/langchain/blob/0c3de0a0b32fadb8caf3e6d803287229409f9da9/langchain/vectorstores/chroma.py#L183

Try using the similarity_search_with_score function instead.

@zigax1
Author

zigax1 commented May 21, 2023

@AvikantSrivastava Thank you for the answer, I really appreciate it.
Can you please be more specific about where I should use similarity_search_with_score? I am not even calling similarity_search directly.

I am using ConversationalRetrievalChain, which has it integrated. The code snippet is above.

Thank you

@AvikantSrivastava

I checked the implementation of VectorStoreRetriever.

The chain internally calls VectorStoreRetriever.get_relevant_documents, and the default value for search_type is 'similarity', which produces no scores.
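
To make the dropped-score behavior concrete, here is a minimal pure-Python sketch of the pattern (stand-in data, not the actual LangChain classes): the store produces (doc, score) pairs, and the retriever's 'similarity' path keeps only the docs.

```python
# Sketch of how the retriever discards scores. The data here is a stand-in;
# the real vector store returns (Document, score) pairs.
def similarity_search_with_score(query):
    return [("doc about cats", 0.31), ("doc about dogs", 0.87)]

def similarity_search(query):
    docs_and_scores = similarity_search_with_score(query)
    # The underscore throws the score away, so the chain never sees it.
    return [doc for doc, _ in docs_and_scores]

print(similarity_search("pets"))  # ['doc about cats', 'doc about dogs']
```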

@AvikantSrivastava

AvikantSrivastava commented May 21, 2023

@hwchase17
@agola11
@vowelparrot

Should we add a flag to ConversationalRetrievalChain, such as similarity=True, to get similarity scores from the chain?
I can raise a PR; can you assign me this issue?

@zigax1
Author

zigax1 commented May 21, 2023

That would be awesome

@vowelparrot
Contributor

Thanks for raising this issue!

@zigax1 what do you wish to do with the scores once they're available?

@zigax1
Author

zigax1 commented May 22, 2023

@vowelparrot, I am using ConversationalRetrievalChain to chat over multiple files (some of them PDF, some docx, txt, ...). It is very important for me to also return the source, so I can easily tell which file the answer came from.
This works great if I ask a question related to the content of one of the documents. But if the question is very different, like 'What is the distance from the moon to Mars?' or 'Write me a song about .....', the model still gives me a source, and that source is simply the first embedding in the database. Technically this is not 'wrong', because it still is a source, but its distance from the query vector is very, very large.

This is where the score would work wonders: I could then easily filter out low-scoring sources before displaying them.
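
As a sketch of that filtering step (the field names and the higher-is-better, 0-to-1 score convention are assumptions here):

```python
# Sketch: drop sources whose relevance score falls below a threshold.
SCORE_THRESHOLD = 0.5  # assumed convention: 1.0 = most similar

sources = [
    {"file": "report.pdf", "score": 0.82},
    {"file": "notes.txt", "score": 0.12},  # off-topic nearest neighbor
]

relevant = [s for s in sources if s["score"] >= SCORE_THRESHOLD]
print(relevant)  # [{'file': 'report.pdf', 'score': 0.82}]
```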

I hope this answer is good enough for you to approve the PR for @AvikantSrivastava to integrate this functionality.

Wish all of you a nice day.

@KeshavSingh29

KeshavSingh29 commented May 23, 2023

@vowelparrot Something similar to the issue here.
The following function is currently not implemented, which makes it difficult to use a vector store as a retriever when we want to return docs with relevance scores.
Any plans to implement it?

langchain/vectorstores/base.py

    def _similarity_search_with_relevance_scores(
        self,
        query: str,
        k: int = 4,
        **kwargs: Any,
    ) -> List[Tuple[Document, float]]:
        """Return docs and relevance scores, normalized on a scale from 0 to 1.

        0 is dissimilar, 1 is most similar.
        """
        raise NotImplementedError

@vowelparrot
Contributor

@KeshavSingh29 great question - the short answer is we're working with the maintainers of the vector stores towards that goal.

The longer answer is that each of the vector stores use different distance or similarity functions to compute scores (that also frequently are sensitive to the embeddings you're using). Sometimes it's a distance metric (lower number means more similar), while (less often) it's a similarity metric (higher number is more similar), and the results are usually unbounded.

We could add a default implementation that tries to capture the most common use cases, but we have refrained from putting a 'one size fits all' normalization function in the base class, to avoid providing erroneous results.
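
To illustrate why one normalization cannot fit all stores: cosine distance and Euclidean distance over unit-normalized embeddings need different rescalings to land on a 0-to-1 relevance scale. A sketch (these formulas follow common conventions and are assumptions, not LangChain's canonical implementations):

```python
import math

def cosine_relevance(distance: float) -> float:
    # Cosine distance for typical embeddings stays in [0, 1],
    # so flipping it gives a 0-to-1 relevance score.
    return 1.0 - distance

def euclidean_relevance(distance: float) -> float:
    # Between unit vectors that are at worst orthogonal, Euclidean
    # distance lies in [0, sqrt(2)], so rescale before flipping.
    return 1.0 - distance / math.sqrt(2)

print(cosine_relevance(0.0))              # 1.0 (identical vectors)
print(euclidean_relevance(math.sqrt(2)))  # 0.0 (orthogonal vectors)
```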

@zigax1
Author

zigax1 commented May 23, 2023

Thanks for raising this issue!

@zigax1 what do you wish to do with the scores once they're available?

@vowelparrot What are your thoughts on my answer? Will there be an upcoming change for ConversationalRetrievalChain to also return the score with the sources?

Thank you for the answer.

@zigax1
Author

zigax1 commented May 26, 2023

@AvikantSrivastava do you have any news about opening a PR? Is a fix in progress, or not yet?

Thanks

hwchase17 pushed a commit that referenced this issue Jun 3, 2023
Fixes #5067

Verified the following code now works correctly:
```
db = Chroma(persist_directory=index_directory(index_name), embedding_function=embeddings)
retriever = db.as_retriever(search_type="similarity_score_threshold", search_kwargs={"score_threshold": 0.4})
docs = retriever.get_relevant_documents(query)
```
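
The search_type="similarity_score_threshold" mode effectively filters the (doc, relevance) pairs before returning the documents; a minimal pure-Python sketch of that behavior (stand-in data, not the LangChain internals):

```python
def search_with_score_threshold(docs_and_scores, score_threshold):
    # Keep only documents whose relevance score meets the threshold,
    # then drop the scores, mirroring what the retriever returns.
    return [doc for doc, score in docs_and_scores if score >= score_threshold]

hits = [("doc A", 0.9), ("doc B", 0.35), ("doc C", 0.6)]
print(search_with_score_threshold(hits, 0.4))  # ['doc A', 'doc C']
```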
Undertone0809 pushed a commit to Undertone0809/langchain that referenced this issue Jun 19, 2023
@jezza7770

Probably a dumb question, but which version of which package do I need for this to work? I just did a pip install yesterday and it's still not returning scores.

@paluchasz

It still doesn't seem to work for me either.

@khaledalarja

This issue should be reopened; it is still not solved. PR #5655 is not related to the behavior of ConversationalRetrievalChain; it is specific to Chroma. And that is exactly the problem here: the scores are discarded with the _.

@DmitryKatson

Is there really no way to return source documents with scores now, with ConversationalRetrievalChain.from_llm?

@prasoons075

Is this resolved? I am trying to get a score when using ConversationalRetrievalChain with the FAISS vector store.

@naarkhoo
Contributor

naarkhoo commented Sep 15, 2023

Looks like this still only returns the doc, but not the score.

@baswenneker

Here's my hacky solution. It's a retriever that saves the score as part of the documents metadata. I just took the original _get_relevant_documents and rewrote the part @khaledalarja pointed us at in his comment.

class MyVectorStoreRetriever(VectorStoreRetriever):
    # See https://github.com/langchain-ai/langchain/blob/61dd92f8215daef3d9cf1734b0d1f8c70c1571c3/libs/langchain/langchain/vectorstores/base.py#L500
    def _get_relevant_documents(
        self, query: str, *, run_manager: CallbackManagerForRetrieverRun
    ) -> List[Document]:
        docs_and_similarities = (
            self.vectorstore.similarity_search_with_relevance_scores(
                query, **self.search_kwargs
            )
        )

        # Make the score part of the document metadata
        for doc, similarity in docs_and_similarities:
            doc.metadata["score"] = similarity

        docs = [doc for doc, _ in docs_and_similarities]
        return docs

Instead of doing vectordb.as_retriever I instantiate the retriever by doing:

retriever = MyVectorStoreRetriever(
    vectorstore=vectordb,
    search_type="similarity_score_threshold",
    search_kwargs={"score_threshold": similarity_threshold, "k": 3},
)

Good luck :)

@matardy

matardy commented Oct 16, 2023

Here's my hacky solution. It's a retriever that saves the score as part of the documents metadata. [...]

Thanks, this works totally fine!

@Benvorth

Benvorth commented Oct 31, 2023

Took me a bit to find the correct imports:

from langchain.schema.vectorstore import VectorStoreRetriever
from langchain.callbacks.manager import CallbackManagerForRetrieverRun
from langchain.schema.document import Document
from typing import List
...

@glaucoleme

Here's my hacky solution. It's a retriever that saves the score as part of the documents metadata. [...]

Also worked for me. This could be considered the final solution to this issue.
