Change MultiQuery Prompt, Add Hybrid Search (BM25 + Embedding), Cohere Reranker & LLM Chain Filter #247

davidgxue · 2024-01-04T00:42:27Z

Description

Changed MultiQueryRetriever's prompt and parameters so that we now keep the user's original query plus 2 reworded queries (previously, 3 different reworded queries)
Added Hybrid Search with alpha = 0.5 (equal weight between BM25/TF-IDF and the embedding model), retrieving 100 documents for each prompt
Added Cohere Reranker (keeping 10 documents), applied after the documents from the multi-query retriever's 3 prompts are combined
Added LLM Chain Filter (using GPT 3.5) to double-check that each of the remaining 10 documents is relevant to the question asked
- This essentially uses GPT 3.5 to ask whether document X is relevant to the question (with a YES/NO response)
- Added our own custom boolean parser, as the default one in LangChain can be too strict and raises errors in specific cases
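
To make the order of the stages above easier to follow, here is a minimal sketch of the flow in plain Python. It is not the code in api/ask_astro/chains/answer_question.py: the function name `retrieve_and_filter` and the four callables (`reword`, `hybrid_search`, `rerank`, `is_relevant`) are hypothetical stand-ins for the MultiQueryRetriever, the Weaviate hybrid search, the Cohere reranker and the LLM chain filter. De-duplicating the merged results and reranking against the original question are assumptions of the sketch.

```python
from typing import Callable

Doc = str  # stand-in for a retrieved document; the real pipeline uses richer objects


def retrieve_and_filter(
    question: str,
    reword: Callable[[str, int], list[str]],                 # hypothetical: returns n reworded queries
    hybrid_search: Callable[[str, float, int], list[Doc]],   # hypothetical: (query, alpha, limit) -> docs
    rerank: Callable[[str, list[Doc], int], list[Doc]],      # hypothetical: (query, docs, top_n) -> docs
    is_relevant: Callable[[str, Doc], bool],                 # hypothetical: yes/no relevance check
    alpha: float = 0.5,
    per_query_limit: int = 100,
    rerank_top_n: int = 10,
) -> list[Doc]:
    # Keep the user's original question and add two reworded variants.
    queries = [question] + reword(question, 2)

    # Run hybrid search once per query and merge the results.
    # Dropping duplicates while preserving order is an assumption of this sketch.
    combined: list[Doc] = []
    seen: set[Doc] = set()
    for query in queries:
        for doc in hybrid_search(query, alpha, per_query_limit):
            if doc not in seen:
                seen.add(doc)
                combined.append(doc)

    # Rerank the merged pool and keep only the top N documents.
    top_docs = rerank(question, combined, rerank_top_n)

    # Final pass: keep only the documents judged relevant to the question.
    return [doc for doc in top_docs if is_relevant(question, doc)]
```

With the defaults above, up to 3 x 100 documents enter the reranker, at most 10 leave it, and the relevance check can only shrink that set further.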

Technical Changes

Added api/ask_astro/chains/custom_llm_filter_prompt.py because the default boolean parser in the LangChain library requires that GPT-3.5 output either "YES" or "NO", and raises an error otherwise. I essentially took their boolean parser from here and re-implemented it so that: 1. we only check whether YES/NO is CONTAINED in the response (as the LLM can sometimes output "YES."); 2. if neither is present, we don't raise an error and just return True (in our use case, this means we don't filter the doc out and still consider it relevant).
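
The two rules above can be sketched roughly as follows. This is an illustration only, written as a plain class so it runs without LangChain installed; the parser in this PR subclasses `BaseOutputParser[bool]`. The class name `LenientBooleanParser`, the `true_val`/`false_val` attribute names, and the choice to check the true value before the false value are assumptions of the sketch.

```python
class LenientBooleanParser:
    """Illustrative stand-in for a lenient YES/NO parser (not the PR's class)."""

    def __init__(self, true_val: str = "YES", false_val: str = "NO") -> None:
        self.true_val = true_val
        self.false_val = false_val

    def parse(self, text: str) -> bool:
        cleaned = text.strip().upper()
        # Rule 1: only require that the value is CONTAINED in the response,
        # so outputs such as "YES." or "Yes, it is relevant" are accepted.
        if self.true_val.upper() in cleaned:
            return True
        if self.false_val.upper() in cleaned:
            return False
        # Rule 2: neither value found, so do not raise; treat the
        # document as relevant and keep it.
        return True


parser = LenientBooleanParser()
print(parser.parse("YES."))      # True
print(parser.parse("no"))        # False
print(parser.parse("Unclear"))   # True
```

One consequence of substring matching is that a response such as "I don't know" also contains "NO" and would be parsed as False under this sketch.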
Main changes are in api/ask_astro/chains/answer_question.py
Added cohere package in api/pyproject.toml

Notes

A more extensive evaluation and comparison on a full Q&A dataset is still needed to determine improvements (and potential regressions).
My initial test/evaluation on the same 74 questions that my teammates have used in the past can be found on LangSmith here

cloudflare-pages · 2024-01-04T23:59:20Z

Deploying with Cloudflare Pages

Latest commit:	`992beb5`
Status:	✅ Deploy successful!
Preview URL:	https://1fdf887b.ask-astro.pages.dev
Branch Preview URL:	https://hybrid-search-reword-and-rer.ask-astro.pages.dev


davidgxue · 2024-01-05T07:51:33Z

api/ask_astro/chains/custom_llm_filter_prompt.py

+from langchain_core.prompts import PromptTemplate
+
+
+class CustomBooleanOutputParser(BaseOutputParser[bool]):


Note: this is a modified version of the LangChain implementation. The original code looks like this: https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/output_parsers/boolean.py

I implemented this parser because of an unfixed issue in LangChain (langchain-ai/langchain#11408), where their check on the YES/NO is far too strict and throws unwanted errors at runtime.

sunank200

@vatsrahul1001 did you get a chance to test this? which weaviate index should be used @davidgxue ?

vatsrahul1001 · 2024-01-08T12:31:51Z

@sunank200 no, I have not tested this end to end. Also, as per Steven's comment, is the readthedocs stuff withdrawn?

sunank200 · 2024-01-08T15:35:16Z

@vatsrahul1001 yes. The readthedocs from astro-sdk is not there in database now

vatsrahul1001 · 2024-01-08T15:54:04Z

Ok, I will test it tomorrow then

davidgxue · 2024-01-09T04:23:29Z

Yes, I checked with Rahul and he will get onto checking the response quality today!

vatsrahul1001 · 2024-01-09T07:14:37Z

@davidgxue, I have completed the testing and observed an overall improvement in the quality of responses. Even for basic Astro SDK questions, responses have improved, even without the Astro SDK docs. However, for questions in Ask Astro that were specifically designed from the docs, the responses have degraded, which is as expected.

Results

sunank200

LGTM

davidgxue added 3 commits January 2, 2024 14:24

Add hybrid search and cohere reranker

bc71397

Merge branch 'main' into hybrid_search_reword_and_rerank

1441a86

Add LLM Chain Filter + Shift Where Rerank Takes Place

3e757f1

davidgxue added this to the Phase 2.5 - Enhanced Community Release milestone Jan 4, 2024

davidgxue self-assigned this Jan 4, 2024

davidgxue linked an issue Jan 4, 2024 that may be closed by this pull request

Improve quality of responses #213

Closed

davidgxue marked this pull request as ready for review January 4, 2024 02:29

davidgxue requested review from Lee-W, pankajastro and sunank200 as code owners January 4, 2024 02:29

davidgxue changed the title ~~Change MultiQuery Prompt, Add Hybrid Search (BM25 + Embedding), Cohere Reranker and LLM Chain Filter~~ Change MultiQuery Prompt, Add Hybrid Search (BM25 + Embedding), Cohere Reranker & LLM Chain Filter Jan 4, 2024

Add Customer Boolean LLM Parser to Bypass Error with LLM Chain Filter

00306ca

davidgxue added 2 commits January 4, 2024 22:12

Add comment

1c6764e

Remove comment

a5eb57d

davidgxue commented Jan 5, 2024


sunank200 reviewed Jan 8, 2024


davidgxue added 3 commits January 8, 2024 17:06

Add env var create schema if missing for weaviate

ce1f61a

Add Weaviate config attr for create schema if missing

ca74cdd

Merge branch 'main' into hybrid_search_reword_and_rerank

992beb5

This was linked to issues Jan 9, 2024

Research: Improve the ranking of the sources #133

Closed

Document sources are wrong in the answers #80

Closed

vatsrahul1001 mentioned this pull request Jan 9, 2024

[QA] Change MultiQuery Prompt, Add Hybrid Search (BM25 + Embedding), Cohere Reranker & LLM Chain Filter #253

Closed

sunank200 approved these changes Jan 9, 2024


davidgxue mentioned this pull request Jan 10, 2024

Research: Improve the ranking of the sources #133

Closed

davidgxue merged commit c680cc9 into main Jan 10, 2024
8 checks passed

davidgxue deleted the hybrid_search_reword_and_rerank branch January 10, 2024 01:58

This was referenced Jan 11, 2024

clean html extractor #237

Merged

Test and evaluate the quality of response after single HTML ingestion #187

Closed

davidgxue mentioned this pull request Jan 17, 2024

Ask Astro || Sources references documents are not showing in Top 3 for questions #169

Closed
