# GraphRAG

## Import packages

In [65]:
! pip install nltk numpy pandas unidecode scikit-learn tqdm llm-blender rouge-score xmltodict arxiv biopython
! pip install langchain langchain-core langchain-community langchain_experimental langchain-openai langchain-chroma langchain_mistralai langgraph langchainhub

huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
	- Avoid using `tokenizers` before the fork if possible
	- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)




huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
	- Avoid using `tokenizers` before the fork if possible
	- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)




In [66]:
import os
import re
import nltk
import string
import numpy as np
import pandas as pd
from unidecode import unidecode
from sklearn.metrics.pairwise import cosine_similarity
from tqdm import tqdm
from pathlib import Path
import pickle
from rouge_score import rouge_scorer
import json
import llm_blender
from operator import itemgetter
import operator
from dotenv import load_dotenv
from getpass import getpass
from typing import List, Annotated
from typing_extensions import TypedDict
from pydantic import BaseModel, Field
from Bio import Entrez, SeqIO

from langchain_core.callbacks import CallbackManagerForRetrieverRun
from langchain_core.documents import Document
from langchain_core.retrievers import BaseRetriever
from langchain.schema import Document
from langchain_community.document_loaders import PDFMinerLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_chroma import Chroma
from langchain_community.embeddings import OllamaEmbeddings
from langchain_mistralai.chat_models import ChatMistralAI
from langchain_openai import ChatOpenAI
from langchain.embeddings.cache import CacheBackedEmbeddings
from langchain.storage import LocalFileStore
from langchain_community.llms import Ollama
from langgraph.graph import START, END, StateGraph
from langchain_core.output_parsers import PydanticOutputParser
from langchain.output_parsers import RetryOutputParser
from langchain_core.prompts import PromptTemplate
from langchain_core.runnables import RunnableLambda, RunnableParallel
from langchain import hub
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_community.retrievers import PubMedRetriever, ArxivRetriever
from langchain_community.tools.tavily_search import TavilySearchResults

## Disable warnings

In [67]:
import warnings
warnings.filterwarnings('ignore')

## Setup environment variables

You have to define the following environment variables in the `.env` file, terminal environment, or input field within this Jupyter notebook:
1. MISTRAL_API_KEY
2. OPENAI_API_KEY
3. OPENAI_PROXY
4. TAVILY_API_KEY
5. ENTREZ_EMAIL

## Import packages

In [68]:
env_variables = [
  'MISTRAL_API_KEY',
  'OPENAI_API_KEY',
  'OPENAI_PROXY',
  'TAVILY_API_KEY',
  'ENTREZ_EMAIL',
]

load_dotenv()

for key in env_variables:
  value = os.getenv(key)

  if value is None:
    value = getpass(key)

  os.environ[key] = value

## Setup metrics

### Download NLTK dictionaries

These dictionaries are needed for further text preprocessing.

In [69]:
dict_ids = [
  'punkt_tab',
  'punkt',
  'stopwords',
  'wordnet',
]

for dict_id in dict_ids:
  nltk.download(dict_id, quiet=True)

### Text preprocessing

Define a function for text preprocessing, which is an important step before calculating any metrics. This preprocessing function will help in cleaning the text data, making it ready for further analysis. The preprocessing involves several steps:
1. Lowercasing
2. Stopwords removal
3. Lemmatization
4. Remove accents from characters

In [70]:
lemmatizer = nltk.stem.WordNetLemmatizer()

def preprocess(corpus: str) -> str:
  corpus = corpus.lower()
  stopset = nltk.corpus.stopwords.words('english') + nltk.corpus.stopwords.words('russian') + list(string.punctuation)
  tokens = nltk.word_tokenize(corpus)
  tokens = [t for t in tokens if t not in stopset]
  tokens = [lemmatizer.lemmatize(t) for t in tokens]
  corpus = ' '.join(tokens)
  corpus = unidecode(corpus)
  return corpus

### Embedding Initialization

Here we are initializing the Llama 3 embeddings model. The `OllamaEmbeddings` class is a component of the Ollama library, a set of pre-trained language models. This model is capable of embedding corpora of any length into a 4096-dimensional vector.

The use of `OllamaEmbeddings` requires the installation of a local Ollama server, which can be found at https://ollama.com.

In [71]:
embeddings = OllamaEmbeddings(model='llama3.1')
store = LocalFileStore("./.embeddings_cache")

cached_embeddings = CacheBackedEmbeddings.from_bytes_store(
  embeddings,
  store,
  namespace=embeddings.model,
)

### Average embeddings cosine similarity metric

This function calculates the average cosine similarity between expected answers and LLM predicted answers using their respective embeddings. Cosine similarity is a measure of similarity between two non-zero vectors of an inner product space that measures the cosine of the angle between them:

$$
K(a, b) = \frac{\sum \limits_{i=1}^n a_i b_i}{\sqrt{\sum \limits_{i=1}^n a_i^2} \cdot \sqrt{\sum \limits_{i=1}^n b_i^2}}
$$

In [72]:
def embeddings_cosine_sim_metric(expected_answers: list[str], predicted_answers: list[str]) -> float:
  results = []

  for expected_answer, predicted_answer in zip(expected_answers, predicted_answers):
    expected_answer = preprocess(expected_answer)
    predicted_answer = preprocess(predicted_answer)

    expected_embedding = np.array(cached_embeddings.embed_query(expected_answer))
    predicted_embedding = np.array(cached_embeddings.embed_query(predicted_answer))

    sim = cosine_similarity(
      expected_embedding.reshape(1, -1),
      predicted_embedding.reshape(1, -1),
    )[0][0]

    results.append(sim)

  return np.mean(results)

In [73]:
smoothie_f = nltk.translate.bleu_score.SmoothingFunction().method4

def bleu_metric(expected_answers, predicted_answers):
  scores = []

  for expected_answer, predicted_answer in zip(expected_answers, predicted_answers):
    expected_answer = preprocess(expected_answer)
    predicted_answer = preprocess(predicted_answer)

    predicted_tokens = nltk.word_tokenize(predicted_answer)
    expected_tokens = [nltk.word_tokenize(expected_answer)]

    score = nltk.translate.bleu_score.sentence_bleu(
      expected_tokens,
      predicted_tokens,
      smoothing_function=smoothie_f,
    )

    scores.append(score)

  return np.mean(scores)

In [74]:
rogue_1_scorer = rouge_scorer.RougeScorer(['rouge1'], use_stemmer=True)

def rogue_1_metric(expected_answers, predicted_answers):
  scores = []

  for expected_answer, predicted_answer in zip(expected_answers, predicted_answers):
    expected_answer = preprocess(expected_answer)
    predicted_answer = preprocess(predicted_answer)

    result = rogue_1_scorer.score(expected_answer, predicted_answer)

    scores.append(result['rouge1'])

  return np.mean(scores)

In [75]:
rogue_l_scorer = rouge_scorer.RougeScorer(['rougeL'], use_stemmer=True)

def rogue_l_metric(expected_answers, predicted_answers):
  scores = []

  for expected_answer, predicted_answer in zip(expected_answers, predicted_answers):
    expected_answer = preprocess(expected_answer)
    predicted_answer = preprocess(predicted_answer)

    result = rogue_l_scorer.score(expected_answer, predicted_answer)

    scores.append(result['rougeL'])

  return np.mean(scores)

## Load documents

In [76]:
docs_dir = Path('./docs')
docs_cache_dir = Path('./.docs_cache')
raw_docs_pkl_path = docs_cache_dir / 'parsed_docs_cache.pkl'

docs = None

if os.path.exists(raw_docs_pkl_path):
  with open(raw_docs_pkl_path, 'rb') as f:
    docs = pickle.load(f)
else:
  docs = []
  for file in docs_dir.iterdir():
    docs.extend(PDFMinerLoader(file).load())

  with open(raw_docs_pkl_path, 'wb') as f:
    pickle.dump(docs, f)

## Split documents

In [77]:
splitted_docs_pkl_path = docs_cache_dir / 'splitted_docs_cache.pkl'

if os.path.exists(splitted_docs_pkl_path):
  with open(splitted_docs_pkl_path, 'rb') as f:
    splitted_docs = pickle.load(f)
else:
  text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=750,
    chunk_overlap=250,
    length_function=len,
    is_separator_regex=False,
  )
  splitted_docs = text_splitter.create_documents([doc.page_content for doc in docs])

  with open(splitted_docs_pkl_path, 'wb') as f:
    pickle.dump(splitted_docs, f)

len(splitted_docs)

17443

## Setup vector store

In [78]:
vector_store = Chroma(
  embedding_function=cached_embeddings,
  persist_directory='./chroma_db'
)
retriever = vector_store.as_retriever()

## Define JSON extractor

In [79]:
def extract_json(response):
  json_pattern = r'\{.*?\}'
  match = re.search(json_pattern, response, re.DOTALL)

  if match:
    return match.group().strip().replace('\\\\', '\\')

  return response

## Build LLM

In [80]:
llm = Ollama(model='llama3.1', temperature=0)

## Build chains

### Route chain

In [81]:
class RouteQuery(BaseModel):
  sources: List[str] = Field(
    description='Given a user question select the retrieval methods you consider the most appropriate for addressing this question. You may also return an empty array if no methods are required.',
  )

route_parser = PydanticOutputParser(pydantic_object=RouteQuery)
route_retry_parser = RetryOutputParser.from_llm(
  parser=route_parser,
  llm=llm,
  max_retries=3,
)

route_template = """
You are an expert at selecting retrieval methods.
Given a user question select the retrieval methods you consider the most appropriate for addressing user question.
You may also return an empty array if no methods are required.

Possible retrieval methods:
1. The "vectorstore" retriever contains documents related to neurobiology and medicine. Use the vectorstore for questions on these topics.
2. The "pubmed" retriever contains biomedical literature and research articles. It is particularly useful for answering detailed questions about medical research, clinical studies, and scientific discoveries.
3. The "arxiv" retriever contains preprints of research papers across various scientific fields, including physics, mathematics, computer science, and biology. Use the arxiv for questions on recent scientific research and theoretical studies in these areas.
4. The "ncbi_protein" retriever contains protein sequence and functional information. Use the NCBI protein DB for questions related to protein sequences, structures, and functions.

{format_instructions}

User question:
{question}
"""
route_prompt = PromptTemplate(
  template=route_template,
  input_variables=['question'],
  partial_variables={'format_instructions': route_parser.get_format_instructions()},
)

question_router = RunnableParallel(
  completion=route_prompt | llm | extract_json, prompt_value=route_prompt
) | RunnableLambda(lambda x: route_retry_parser.parse_with_prompt(**x))
print(question_router.invoke({'question': 'Who will the Bears draft first in the NFL draft?'}))
print(question_router.invoke({'question': 'What are the functions of the oculomotor nerve?'}))

sources=[]
sources=['vectorstore', 'ncbi_protein']


### Grade documents chain

In [82]:
class GradeDocuments(BaseModel):
  binary_score: str = Field(description="Documents are relevant to the question, 'yes' or 'no'")

docs_grade_parser = PydanticOutputParser(pydantic_object=GradeDocuments)
docs_grade_retry_parser = RetryOutputParser.from_llm(
  parser=docs_grade_parser,
  llm=llm,
  max_retries=3,
)

docs_grade_template = """
You are a grader assessing relevance of a retrieved document to a user question.
If the document contains keyword(s) or semantic meaning related to the question, grade it as relevant.
Give a binary score 'yes' or 'no' score to indicate whether the document is relevant to the question.

{format_instructions}

User question:
{question}

Retrieved document:
{document}
"""
docs_grade_prompt = PromptTemplate(
  template=docs_grade_template,
  input_variables=['document', 'question'],
  partial_variables={'format_instructions': docs_grade_parser.get_format_instructions()},
)

docs_grade_grader = RunnableParallel(
  completion=docs_grade_prompt | llm | extract_json, prompt_value=docs_grade_prompt
) | RunnableLambda(lambda x: docs_grade_retry_parser.parse_with_prompt(**x))
docs_grade_grader.invoke({'question': 'What is the color of the sky?', 'document': 'The color of the sky is blue'})

GradeDocuments(binary_score='yes')

### Hallucinations chain

In [83]:
class GradeHallucinations(BaseModel):
  binary_score: str = Field(description="Answer is grounded in the facts, 'yes' or 'no'")

hallucination_parser = PydanticOutputParser(pydantic_object=GradeHallucinations)
hallucination_retry_parser = RetryOutputParser.from_llm(
  parser=hallucination_parser,
  llm=llm,
  max_retries=3,
)

hallucination_template = """
You are a grader assessing whether an LLM generation is grounded in / supported by a set of retrieved facts. \n
Give a binary score 'yes' or 'no'. 'Yes' means that the answer is grounded in / supported by the set of facts."

{format_instructions}

Set of facts:
{documents}

LLM generation:
{generation}
"""
hallucination_prompt = PromptTemplate(
  template=hallucination_template,
  input_variables=['question', 'generation'],
  partial_variables={'format_instructions': hallucination_parser.get_format_instructions()},
)

hallucination_grader = RunnableParallel(
  completion=hallucination_prompt | llm | extract_json, prompt_value=hallucination_prompt
) | RunnableLambda(lambda x: hallucination_retry_parser.parse_with_prompt(**x))
print(hallucination_grader.invoke({'documents': ['Sky is blue'], 'generation': 'The color of the sky is blue'}))

binary_score='yes'


### Answer grade chain

In [84]:
class GradeAnswer(BaseModel):
  binary_score: str = Field(description="Answer addresses the question, 'yes' or 'no'")

grade_parser = PydanticOutputParser(pydantic_object=GradeAnswer)
grade_retry_parser = RetryOutputParser.from_llm(
  parser=grade_parser,
  llm=llm,
  max_retries=3,
)

grade_template = """
You are a grader assessing whether an answer addresses / resolves a question. \n
Give a binary score 'yes' or 'no'. 'yes' means that the answer resolves the question.

User question:
{question}

LLM generation:
{generation}

{format_instructions}
"""
grade_prompt = PromptTemplate(
  template=grade_template,
  input_variables=['question', 'generation'],
  partial_variables={'format_instructions': grade_parser.get_format_instructions()},
)

answer_grader = RunnableParallel(
  completion=grade_prompt | llm | extract_json, prompt_value=grade_prompt
) | RunnableLambda(lambda x: grade_retry_parser.parse_with_prompt(**x))
print(answer_grader.invoke({"question": "What is the order of the cranial nerves?", 'generation': 'I do not know.'}))

binary_score='no'


### HyDE chain

In [85]:
hyde_template = """
Please write a scientific paper passage to answer the question

Question: {question}

Passage:
"""
hyde_prompt = ChatPromptTemplate.from_template(hyde_template)
hyde_chain = hyde_prompt | llm | StrOutputParser()

hyde_chain.invoke({"question": 'What is the order of the cranial nerves ?'})

"Here's a scientific paper-style passage answering the question:\n\n**Title:** The Cranial Nerve Order: A Review and Classification\n\n**Abstract:**\n\nThe cranial nerves are a complex group of 12 pairs of nerves that arise directly from the brain, playing crucial roles in various physiological functions. Despite their importance, the order of these nerves has been a subject of interest for centuries. This review aims to provide an overview of the classification and nomenclature of the cranial nerves, highlighting their distinct characteristics and functions.\n\n**Introduction:**\n\nThe cranial nerves are a unique group of nerves that emerge from the brain, serving as the primary means of communication between the central nervous system and various peripheral structures. The order of these nerves has been a topic of debate among neuroscientists, with different classification systems proposed over the years. In this review, we will present an updated classification scheme for the crania

### Step-back

In [86]:
step_back_template = """
You are an AI assistant tasked with generating broader, more general queries to improve context retrieval in a RAG system.
Given the original query, generate a step-back query that is more general and can help retrieve relevant background information.

Original query: {question}

Step-back query:
"""
step_back_prompt = ChatPromptTemplate.from_template(step_back_template)
step_back_chain = step_back_prompt | llm | StrOutputParser()

step_back_chain.invoke({"question": 'What is Benedict’s syndrome?'})

'What is the definition of a neurological or medical syndrome?'

### Query Rewriting

In [87]:
rewrite_query_template = """
You are an AI assistant tasked with reformulating user queries to improve retrieval in a RAG system.
Given the original query, rewrite it to be more specific, detailed, and likely to retrieve relevant information.

Original query: {question}

Rewritten query:
"""
rewrite_query_prompt = ChatPromptTemplate.from_template(rewrite_query_template)
rewrite_query_chain = rewrite_query_prompt | llm | StrOutputParser()

rewrite_query_chain.invoke({"question": 'What is the order of the cranial nerves?'})

'Here\'s a rewritten query that\'s more specific, detailed, and likely to retrieve relevant information:\n\n"What is the correct anatomical order of the 12 pairs of cranial nerves, including their names (I-XII), functions, and any notable characteristics or associations with specific brain regions?"\n\nThis revised query adds more specificity by:\n\n* Specifying the number of cranial nerves (12)\n* Including their names (I-XII) to ensure accurate retrieval\n* Mentioning their functions to provide context for the information being sought\n* Asking about notable characteristics or associations, which may help retrieve additional relevant details\n\nBy making these changes, the rewritten query is more likely to retrieve precise and detailed information from a RAG system, rather than just a general answer.'

### Decomposition

In [88]:
class DecompositionAnswer(BaseModel):
  subqueries: List[str] = Field(description="Given the original query, decompose it into 2-4 simpler sub-queries as json array of strings")

decomposition_parser = PydanticOutputParser(pydantic_object=DecompositionAnswer)
decomposition_retry_parser = RetryOutputParser.from_llm(
  parser=decomposition_parser,
  llm=llm,
  max_retries=3,
)

decomposition_template = """
You are an AI assistant tasked with breaking down complex queries into simpler sub-queries for a RAG system.
Given the original query, decompose it into 2-4 simpler sub-queries that, when answered together, would provide a comprehensive response to the original query.

Original query: {question}

example: What are the impacts of climate change on the environment?

Sub-queries:
1. What are the impacts of climate change on biodiversity?
2. How does climate change affect the oceans?
3. What are the effects of climate change on agriculture?
4. What are the impacts of climate change on human health?

{format_instructions}
"""
decomposition_prompt = PromptTemplate(
  template=decomposition_template,
  input_variables=['question'],
  partial_variables={'format_instructions': decomposition_parser.get_format_instructions()},
)

decomposition_chain = RunnableParallel(
  completion=decomposition_prompt | llm | extract_json, prompt_value=decomposition_prompt
) | RunnableLambda(lambda x: decomposition_retry_parser.parse_with_prompt(**x))
print(decomposition_chain.invoke({"question": "What is Benedict’s syndrome?"}))

subqueries=["What are the symptoms of Benedict's syndrome?", "What are the causes of Benedict's syndrome?", "How is Benedict's syndrome diagnosed?", "What are the treatment options for Benedict's syndrome?"]


### RAG chain

In [89]:
blender = llm_blender.Blender()
blender.loadranker('llm-blender/PairRM', device='cuda')
blender.loadfuser('llm-blender/gen_fuser_3b', device='cuda')



Successfully loaded ranker from  /home/super-pc2/.cache/huggingface/hub/llm-blender/PairRM


In [90]:
prompt = hub.pull('rlm/rag-prompt')

llama_llm = Ollama(model='llama3.1', temperature=0)
mistral_llm = ChatMistralAI(model='mistral-large-latest', temperature=0)
gpt_llm = ChatOpenAI(model='gpt-4o-mini', temperature=0)

llama_chain = prompt | llama_llm | StrOutputParser()
mistral_chain = prompt | mistral_llm | StrOutputParser()
gpt_chain = prompt | gpt_llm | StrOutputParser()

def fuse_generations(dict):
  question = dict['question']

  llama_res = dict['llama_res']
  mistral_res = dict['mistral_res']
  gpt_res = dict['gpt_res']
  answers = [llama_res, mistral_res, gpt_res]

  fuse_generations, ranks = blender.rank_and_fuse(
    [question],
    [answers],
    instructions=[''],
    return_scores=False,
    batch_size=2,
    top_k=3
  )
  return fuse_generations[0]

rag_chain = (
  {
    'llama_res': llama_chain,
    'mistral_res': mistral_chain,
    'gpt_res': gpt_chain,
    'question': itemgetter('question')
  }
  | RunnableLambda(fuse_generations)
)

rag_chain.invoke({"context": '', "question": 'What is the order of the cranial nerves?'})

Ranking candidates: 100%|██████████| 1/1 [00:00<00:00,  3.05it/s]
Fusing candidates:   0%|          | 0/1 [00:00<?, ?it/s]2024-11-27 22:47:06.580994: I tensorflow/core/util/port.cc:153] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.
2024-11-27 22:47:06.669319: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:485] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
2024-11-27 22:47:06.704018: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:8454] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2024-11-27 22:47:06.714629: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1452] Unable to register cuBLAS factory: Attempting to register factory for 

'The order of the cranial nerves is: I. Olfactory II. Optic III. Oculomotor IV. Trochlear V. Trigeminal VI. Abducens VII. Facial VIII. Auditory (or vestibulocochlear) nerve IX. Glossopharyngeal X. Vagus'

### Web Search Chain

In [91]:
web_search_tool = TavilySearchResults(k=5)

### PubMed Retriever

In [92]:
pub_med_retriever = PubMedRetriever()
pub_med_retriever.invoke('What is the order of the cranial nerves?')

[]

### Arxiv Retriever

In [93]:
arxiv_retriever = ArxivRetriever(load_max_docs=3, get_ful_documents=True)
arxiv_retriever.invoke('What is the order of the cranial nerves?')

[Document(metadata={'Entry ID': 'http://arxiv.org/abs/1912.10601v2', 'Published': datetime.date(2021, 3, 13), 'Title': 'Optimized Cranial Bandeau Remodeling', 'Authors': 'James Drake, Marina Drygala, Ricardo Fukasawa, Jochen Koenemann, Andre Linhares, Thomas Looi, John Phillips, David Qian, Nikoo Saber, Justin Toth, Chris Woodbeck, Jessie Yeung'}, page_content="Craniosynostosis, a condition affecting 1 in 2000 infants, is caused by\npremature fusing of cranial vault sutures, and manifests itself in abnormal\nskull growth patterns. Left untreated, the condition may lead to severe\ndevelopmental impairment. Standard practice is to apply corrective cranial\nbandeau remodeling surgery in the first year of the infant's life. The most\nfrequent type of surgery involves the removal of the so-called fronto-orbital\nbar from the patient's forehead and the cutting of well-placed incisions to\nreshape the skull in order to obtain the desired result. In this paper, we\npropose a precise optimizati

### NCBI Protein DB retriever

In [123]:
class NCBIProteinRetriever(BaseRetriever):
  k: int

  def __init__(self, k: int):
    super().__init__(k=k)

    self.k = k

    entrez_email = os.getenv('ENTREZ_EMAIL')
    if entrez_email == None:
      raise ValueError('ENTREZ_EMAIL is not defined')
    Entrez.email = entrez_email

  def _search_protein(self, query):
    handle = Entrez.esearch(db='protein', term=query, retmax=self.k)
    record = Entrez.read(handle)
    handle.close()
    return record['IdList']

  def _fetch_protein(self, protein_id):
    handle = Entrez.efetch(db='protein', id=protein_id, rettype='gb', retmode='text')
    record = SeqIO.read(handle, 'genbank')
    handle.close()
    return record

  def _get_relevant_documents(self, query: str, *, run_manager: CallbackManagerForRetrieverRun) -> List[Document]:
    protein_ids = self._search_protein(query)
    docs = []

    for protein_id in protein_ids:
      protein_record = self._fetch_protein(protein_id)
      molecule_type = protein_record.annotations.get("molecule_type", "N/A")
      organism = protein_record.annotations.get("organism", "N/A")
      comment = protein_record.annotations.get("comment", "N/A")
      page_content = (
        f'Protein ID: {protein_id}\n'
        f'Type: {molecule_type}\n'
        f'Name: {protein_record.name}\n'
        f'Organism: {organism}\n'
        f'Description: {protein_record.description}\n'
        f'Comment: {comment}\n'
        f'Sequence: {protein_record.seq}'
      )
      doc = Document(page_content=page_content)
      docs.append(doc)

    return docs

In [124]:
ncbi_protein_retriever = NCBIProteinRetriever(k=3)
ncbi_protein_retriever.invoke('1PPF_I')

[Document(metadata={}, page_content='Protein ID: 494468\nType: protein\nName: 1PPF_I\nOrganism: Meleagris gallopavo\nDescription: Chain I, Turkey Ovomucoid Inhibitor (omtky3)\nComment: X-Ray Crystal Structure Of The Complex Of Human Leukocyte Elastase\n(Pmn Elastase) And The Third Domain Of The Turkey Ovomucoid\nInhibitor.\nSequence: LAAVSVDCSEYPKPACTLEYRPLCGSDNKTYGNKCNFCNAVVESNGTLTLSHFGKC')]

### NCBI Protein DB chain

In [96]:
class NCBIProteinDBAnswer(BaseModel):
  query: str = Field(description='Given the original query, please formulate a concise and precise query for the NCBI protein database.')

ncbi_protein_db_parser = PydanticOutputParser(pydantic_object=NCBIProteinDBAnswer)
ncbi_protein_db_retry_parser = RetryOutputParser.from_llm(
  parser=ncbi_protein_db_parser,
  llm=llm,
  max_retries=3,
)

ncbi_protein_db_template = """
As an expert in bioinformatics and user query optimization for biological databases, your task is to transform user questions into precise and effective queries suitable for the NCBI protein database.
Create a refined query that captures the core intent while optimizing for search within the NCBI protein database.

Original query: {question}

{format_instructions}
"""
ncbi_protein_db_prompt = PromptTemplate(
  template=ncbi_protein_db_template,
  input_variables=['question'],
  partial_variables={'format_instructions': ncbi_protein_db_parser.get_format_instructions()},
)

query_extractor = lambda res: res.query

ncbi_protein_db_chain = RunnableParallel(
  completion=ncbi_protein_db_prompt | llm | extract_json, prompt_value=ncbi_protein_db_prompt
) | RunnableLambda(lambda x: ncbi_protein_db_retry_parser.parse_with_prompt(**x)) | query_extractor | ncbi_protein_retriever
print(ncbi_protein_db_chain.invoke({"question": "What is a protein sequence of turkey?"}))

[Document(metadata={}, page_content='Protein ID: 2860245381\nName: WP_405022447\nDescription: right-handed parallel beta-helix repeat-containing protein [Methanobrevibacter smithii]\nSequence: MGNNAAGKPTGQNTFAGNVISNAFIGIQAQGDVKEVIAINNTFINVKTGIDVYSNVADGGIVVKGNSINASNIGILLKKGYAIIENNTINADSYGIQFTSADSKNSIVDNNVIISGKDYAISVAGTNTSITDNYIISKDYYGNGAVTSKSNDTIIENNTPAGASINTDISASINKNATIKIDVLPFDANGNVTIKFNGKSETVSFNASQTIVYDLGVLGIGDYEVTVIYNGNAKYNATNITKTFSIGKISDYNVTLNTTDVVAGENSTLVIILPEDATGVVNVTVGKDSYKANVTDGVASVKINSLIAGDYKVNVTYSGDKTYEVSNNVFNLAVNPMKVNLNISDVVMFYRDGTRMVAILTDIKGNPITNATVYFTINGKTYARTTDTNGTASLAINLISKIYNATILYNGSDIYDKLSKNITVTVNPTILANDTVLMYMDGTVFVAKFLDKTGKALTNASVKFNINGVFYTRITDNDGVAKLNIRLRPGSYILTAYNNVTGEEKGFDITVKSLIVANDLTKYYLNATRFEATIYNKNGTLAINKNVTFNINGVFYTRQTNENGVVGLNINLRPGNYIITTMFDGLAIGNNINVLPTLVTNDLSMKYLDGSKFTAQTLDGQGNPLANQNVSFNINGVLYHRVTDKDGMASLNIRLMSGDYIITSYWNDFQVGNTVKIA'), Document(metadata={}, page_content='Protein ID: 2860245380\nName: WP_405022446\nDescription: glycine betaine ABC transporter 

## Build graph app

In [126]:
class GraphState(TypedDict):
  question: str

  specialized_srcs: List[str]

  step_back_query: str
  rewritten_query: str
  subqueries: List[str]

  generated_docs: List[str]

  documents: Annotated[list, operator.add]

  web_search: str

  generation: str
  generations_num: int

def determine_specialized_srcs(state):
  print('---DETERMINE SPECIALIZED SOURCES---')

  question = state['question']

  try:
    res = question_router.invoke({'question': question})
    srcs = [src.strip().lower() for src in res.sources]
  except:
    srcs = []

  return {'specialized_srcs': srcs}

def route_question(state):
  print('---ROUTE QUESTION---')

  sources = state['specialized_srcs']

  if len(sources) == 0:
    print('---ROUTE QUESTION TO WEB SEARCH---')
    return 'websearch'
  else:
    print(f'---ROUTE QUESTION TO SPECIALIZED SOURCES: {", ".join([source.upper() for source in sources])}---')
    return 'specialized_srcs'

def generate_step_back_query(state):
  print('---GENERATE STEP-BACK QUERY---')

  question = state['question']

  step_back_query = step_back_chain.invoke({'question': question})

  return {'step_back_query': step_back_query}

def generate_rewritten_query(state):
  print('---GENERATE REWRITTEN QUERY---')

  question = state['question']

  rewritten_query = rewrite_query_chain.invoke({'question': question})

  return {'rewritten_query': rewritten_query}

def generate_subqueries(state):
  print('---GENERATE SUBQUERIES---')

  question = state['question']

  try:
    decomposition_answer = decomposition_chain.invoke({'question': question})
    subqueries = decomposition_answer.subqueries
    # Limit to a maximum of four subqueries
    subqueries = subqueries[:4]
  except:
    subqueries = []

  print(f'---FINAL SUBQUERIES NUMBER: {len(subqueries)}---')

  return {'subqueries': subqueries}

def generate_hyde_docs(state):
  print('---GENERATE HYDE DOCUMENTS---')

  question = state['question']
  step_back_query = state['step_back_query']
  rewritten_query = state['rewritten_query']
  subqueries = state['subqueries']

  queries = [question, step_back_query, rewritten_query, *subqueries]
  generated_docs = []

  for query in queries:
    generated_doc = hyde_chain.invoke({'question': query})
    generated_docs.append(generated_doc)

  return {'question': question, 'generated_docs': generated_docs}

def vector_store_retriever_node(state):
  generated_docs = state['generated_docs']
  specialized_srcs = state['specialized_srcs']

  if 'vectorstore' not in specialized_srcs:
    return {'documents': []}

  print('---RETRIEVE FROM VECTOR STORE---')

  documents = []

  for generated_doc in generated_docs:
    documents.extend(retriever.invoke(generated_doc))

  unique_documents = []
  seen_contents = set()

  for document in documents:
    if document.page_content in seen_contents:
      continue

    unique_documents.append(document)
    seen_contents.add(document.page_content)

  return {'documents': unique_documents}

def pub_med_retriever_node(state):
  specialized_srcs = state['specialized_srcs']

  if 'pubmed' not in specialized_srcs:
    return {'documents': []}

  print('---RETRIEVE FROM PUBMED---')

  question = state['question']
  step_back_query = state['step_back_query']
  rewritten_query = state['rewritten_query']
  subqueries = state['subqueries']

  queries = [question, step_back_query, rewritten_query, *subqueries]
  documents = []

  for query in queries:
    try:
      documents.extend(pub_med_retriever.invoke(query))
    except:
      pass

  unique_documents = []
  seen_contents = set()

  for document in documents:
    if document.page_content in seen_contents:
      continue

    unique_documents.append(document)
    seen_contents.add(document.page_content)

  return {'documents': unique_documents}

def arxiv_retriever_node(state):
  specialized_srcs = state['specialized_srcs']

  if 'arxiv' not in specialized_srcs:
    return {'documents': []}

  print('---RETRIEVE FROM ARXIV---')

  question = state['question']
  step_back_query = state['step_back_query']
  rewritten_query = state['rewritten_query']
  subqueries = state['subqueries']

  queries = [question, step_back_query, rewritten_query, *subqueries]
  documents = []

  for query in queries:
    try:
      documents.extend(arxiv_retriever.invoke(query))
    except:
      pass

  unique_documents = []
  seen_contents = set()

  for document in documents:
    if document.page_content in seen_contents:
      continue

    unique_documents.append(document)
    seen_contents.add(document.page_content)

  return {'documents': unique_documents}

def ncbi_protein_db_retriever_node(state):
  specialized_srcs = state['specialized_srcs']

  if 'ncbi_protein' not in specialized_srcs:
    return {'documents': []}

  print('---RETRIEVE FROM NCBI PROTEIN DB---')

  question = state['question']
  step_back_query = state['step_back_query']
  rewritten_query = state['rewritten_query']
  subqueries = state['subqueries']

  queries = [question, step_back_query, rewritten_query, *subqueries]
  documents = []

  for query in queries:
    try:
      documents.extend(ncbi_protein_db_chain.invoke(query))
    except:
      pass

  unique_documents = []
  seen_contents = set()

  for document in documents:
    if document.page_content in seen_contents:
      continue

    unique_documents.append(document)
    seen_contents.add(document.page_content)

  return {'documents': unique_documents}

def grade_documents(state):
  print('---CHECK DOCUMENT RELEVANCE TO QUESTION---')

  question = state['question']
  documents = state['documents']

  # Score each doc
  filtered_docs = []
  web_search = 'No'
  for index, d in enumerate(documents):
    print(f'---GRADE DOCUMENT ({index + 1}/{len(documents)})---')
    try:
      score = docs_grade_grader.invoke({'question': question, 'document': d.page_content})
      grade = score.binary_score
    except:
      grade = 'No'
    # Document relevant
    if grade.lower() == 'yes':
      print('---GRADE: DOCUMENT RELEVANT---')
      filtered_docs.append(d)
    # Document not relevant
    else:
      print('---GRADE: DOCUMENT NOT RELEVANT---')
      # We do not include the document in filtered_docs
      # We set a flag to indicate that we want to run web search
      web_search = 'Yes'
      continue

  print(f'---FINAL DOCUMENTS NUMBER: {len(filtered_docs)}---')

  return {
    'question': question,
    'documents': filtered_docs,
    'web_search': web_search,
  }

def decide_to_generate(state):
  print('---ASSESS GRADED DOCUMENTS---')

  web_search = state['web_search']

  if web_search == 'Yes':
    # Some documents have been filtered check_relevance
    # We will re-generate a new query
    print('---DECISION: SOME DOCUMENTS ARE NOT RELEVANT TO QUESTION, INCLUDE WEB SEARCH---')
    return 'websearch'
  else:
    # We have relevant documents, so generate answer
    print('---DECISION: GENERATE---')
    return 'generate'

def web_search(state):
  print('---WEB SEARCH---')

  question = state['question']
  documents = state.get('documents')

  docs = web_search_tool.invoke({'query': question})
  web_results = '\n'.join([d['content'] for d in docs])
  web_results = Document(page_content=web_results)
  if documents is not None:
    documents.append(web_results)
  else:
    documents = [web_results]

  return {
    'question': question,
    'documents': documents,
  }

def generate(state):
  print('---GENERATE---')

  question = state['question']
  documents = state['documents']
  generations_num = state.get('generations_num', 0)

  # RAG generation
  generation = rag_chain.invoke({'context': documents, 'question': question})
  return {
    'question': question,
    'documents': documents,
    'generation': generation,
    'generations_num': generations_num + 1,
  }

def grade_generation(state):
  print('---CHECK HALLUCINATIONS---')

  question = state['question']
  documents = state['documents']
  generation = state['generation']
  generations_num = state['generations_num']

  if generations_num >= 2:
    return 'useful'

  try:
    score = hallucination_grader.invoke({'documents': documents, 'generation': generation})
    grade = score.binary_score
  except:
    grade = 'no'

  # Check hallucination
  if grade == 'yes':
    print('---DECISION: GENERATION IS GROUNDED IN DOCUMENTS---')
    # Check question-answering
    print('---GRADE GENERATION vs QUESTION---')
    try:
      score = answer_grader.invoke({'question': question,'generation': generation})
      grade = score.binary_score
    except:
      grade = 'no'

    if grade == 'yes':
      print('---DECISION: GENERATION ADDRESSES QUESTION---')
      return 'useful'
    else:
      print('---DECISION: GENERATION DOES NOT ADDRESS QUESTION---')
      return 'not useful'
  else:
    print('---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---')
    return 'not supported'

In [127]:
workflow = StateGraph(GraphState)

In [128]:
workflow.add_node('determine_specialized_srcs', determine_specialized_srcs)

workflow.add_node('generate_step_back_query', generate_step_back_query)
workflow.add_node('generate_rewritten_query', generate_rewritten_query)
workflow.add_node('generate_subqueries', generate_subqueries)

workflow.add_node('generate_hyde_docs', generate_hyde_docs)

workflow.add_node('vector_store_retriever', vector_store_retriever_node)
workflow.add_node('pub_med_retriever', pub_med_retriever_node)
workflow.add_node('arxiv_retriever', arxiv_retriever_node)
workflow.add_node('ncbi_protein_db_retriever', ncbi_protein_db_retriever_node)

workflow.add_node('websearch', web_search)
workflow.add_node('generate', generate)
workflow.add_node('grade_documents', grade_documents)

<langgraph.graph.state.StateGraph at 0x712e0f6b8fa0>

In [129]:
workflow.add_edge(START, 'determine_specialized_srcs')
workflow.add_conditional_edges(
  'determine_specialized_srcs',
  route_question,
  {
    'websearch': 'websearch',
    'specialized_srcs': 'generate_step_back_query',
  },
)

workflow.add_edge('generate_step_back_query', 'generate_rewritten_query')
workflow.add_edge('generate_rewritten_query', 'generate_subqueries')
workflow.add_edge('generate_subqueries', 'generate_hyde_docs')

workflow.add_edge('generate_hyde_docs', 'vector_store_retriever')
workflow.add_edge('generate_hyde_docs', 'pub_med_retriever')
workflow.add_edge('generate_hyde_docs', 'arxiv_retriever')
workflow.add_edge('generate_hyde_docs', 'ncbi_protein_db_retriever')

workflow.add_edge('vector_store_retriever', 'grade_documents')
workflow.add_edge('pub_med_retriever', 'grade_documents')
workflow.add_edge('arxiv_retriever', 'grade_documents')
workflow.add_edge('ncbi_protein_db_retriever', 'grade_documents')

workflow.add_conditional_edges(
  'grade_documents',
  decide_to_generate,
  {
    'websearch': 'websearch',
    'generate': 'generate',
  },
)
workflow.add_edge('websearch', 'generate')
workflow.add_conditional_edges(
  'generate',
  grade_generation,
  {
    'not supported': 'generate',
    'useful': END,
    'not useful': 'websearch',
  },
)

<langgraph.graph.state.StateGraph at 0x712e0f6b8fa0>

In [130]:
app = workflow.compile()

In [134]:
app.invoke({"question": 'Translate 1PPF_I protein sequence'})

---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: NCBI_PROTEIN---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 4---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM NCBI PROTEIN DB---
---CHECK DOCUMENT RELEVANCE TO QUESTION---
---GRADE DOCUMENT (1/1)---
---GRADE: DOCUMENT RELEVANT---
---FINAL DOCUMENTS NUMBER: 1---
---ASSESS GRADED DOCUMENTS---
---DECISION: GENERATE---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.42it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.85it/s]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS GROUNDED IN DOCUMENTS---
---GRADE GENERATION vs QUESTION---
---DECISION: GENERATION DOES NOT ADDRESS QUESTION---
---WEB SEARCH---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.76it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.34it/s]

---CHECK HALLUCINATIONS---





{'question': 'Translate 1PPF_I protein sequence',
 'specialized_srcs': ['ncbi_protein'],
 'step_back_query': "A great example of how to create a step-back query!\n\nThe original query is quite specific and focused on a particular task (translating a protein sequence). To generate a more general step-back query, I'll try to broaden the context and focus on related concepts.\n\nHere's my attempt:\n\nStep-back query: What are the key considerations for working with protein sequences?\n\nThis query aims to retrieve background information on the general topic of protein sequences, including their structure, function, and manipulation. This can help provide a broader understanding of the original query's context, such as the importance of sequence translation in bioinformatics or the tools available for analyzing protein sequences.\n\nBy asking this more general question, I hope to retrieve relevant information that can inform and improve the performance of the RAG system when faced with sim

## Evaluate RAG

### Load QA dataset

In [None]:
qa_df = pd.read_csv('brainscape.csv')
qa_df

Unnamed: 0,question,answer
0,What are the afferent cranial nerve nuclei?,Trigeminal sensory nucleus- fibres carry gener...
1,What is the order of the cranial nerves ?,1-olfactory\n2-optic\n3-oculomotor\n4-trochlea...
2,What are the efferent cranial nerve nuclei?,Edinger-westphal nucleus\nOculomotor nucleus\n...
3,Which nuclei share the embryo logical origin -...,Oculomotor nucleus Trochlear nucleus Abducens ...
4,Which nuclei share the embryo logical origin- ...,Trigeminal motor nucleus Facial motor nucleus ...
...,...,...
1047,What is the purpose of gephyrin in the glycine...,Involved in anchoring the receptor to a specif...
1048,What is the glycine receptor involved in ?,Reflex response\nCauses reciprocal inhibition ...
1049,What happens in hyperperplexia ?,It’s an exaggerated reflex Often caused by a m...
1050,What is hyperperplexia treated with ?,Benzodiazepine


### Load cached RAGs responses

In [None]:
cache_path = Path('cache.json')

if not os.path.exists(cache_path):
  data = {}
  with open(cache_path, 'w') as file:
    json.dump(data, file)

with open(cache_path, 'r') as f:
  cache = json.load(f)

len(cache.keys())

770

In [None]:
questions = list(qa_df['question'].tolist())
expected_answers = list(qa_df['answer'].tolist())
predicted_answers = []

for index, question in tqdm(enumerate(questions)):
  if not question in cache:
    cache[question] = app.invoke({'question': question})['generation']

  predicted_answers.append(cache[question])

  with open(cache_path, 'w') as f:
    json.dump(cache, f)

cos_score = embeddings_cosine_sim_metric(expected_answers, predicted_answers)
bleu_score = bleu_metric(expected_answers, predicted_answers)
rogue_1_score = rogue_1_metric(expected_answers, predicted_answers)
rogue_l_score = rogue_l_metric(expected_answers, predicted_answers)

cos_score, bleu_score, rogue_1_score, rogue_l_score

615it [00:00, 1213.20it/s]

---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO WEB SEARCH---
---WEB SEARCH---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.08it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.06s/it]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.34it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.07s/it]
649it [00:23, 11.79it/s]  

---CHECK HALLUCINATIONS---
---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: VECTORSTORE, PUBMED---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 4---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM VECTOR STORE------RETRIEVE FROM PUBMED---

vector store documents len 22
pubmed documents len 12
---CHECK DOCUMENT RELEVANCE TO QUESTION---
final documents len 34
---GRADE DOCUMENT (1/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (2/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (3/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (4/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (5/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (6/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (7/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (8/34)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMEN

Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 11.90it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.07it/s]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 10.59it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.20s/it]
652it [02:09,  1.59it/s]

---CHECK HALLUCINATIONS---
---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: VECTORSTORE---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 4---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM VECTOR STORE---
vector store documents len 19
---CHECK DOCUMENT RELEVANCE TO QUESTION---
final documents len 19
---GRADE DOCUMENT (1/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (2/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (3/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (4/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (5/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (6/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (7/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (8/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (9/19)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOC

Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 11.76it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.68s/it]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 11.52it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.65s/it]
653it [03:28,  1.20s/it]

---CHECK HALLUCINATIONS---
---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: VECTORSTORE---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 4---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM VECTOR STORE---
vector store documents len 22
---CHECK DOCUMENT RELEVANCE TO QUESTION---
final documents len 22
---GRADE DOCUMENT (1/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (2/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (3/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (4/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (5/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (6/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (7/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (8/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (9/22)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOC

Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.15it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.20it/s]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.03it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.15it/s]
654it [04:51,  2.05s/it]

---CHECK HALLUCINATIONS---
---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: VECTORSTORE---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 3---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM VECTOR STORE---
vector store documents len 15
---CHECK DOCUMENT RELEVANCE TO QUESTION---
final documents len 15
---GRADE DOCUMENT (1/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (2/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (3/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (4/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (5/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (6/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (7/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (8/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOCUMENT (9/15)---
---GRADE: DOCUMENT NOT RELEVANT---
---GRADE DOC

Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.41it/s]
Fusing candidates: 100%|██████████| 1/1 [00:00<00:00,  1.63it/s]


---CHECK HALLUCINATIONS---
---DECISION: GENERATION IS NOT GROUNDED IN DOCUMENTS, RE-TRY---
---GENERATE---


Ranking candidates: 100%|██████████| 1/1 [00:00<00:00, 12.64it/s]
Fusing candidates: 100%|██████████| 1/1 [00:01<00:00,  1.14s/it]
655it [05:47,  2.83s/it]

---CHECK HALLUCINATIONS---
---DETERMINE SPECIALIZED SOURCES---
---ROUTE QUESTION---
---ROUTE QUESTION TO SPECIALIZED SOURCES: VECTORSTORE, PUBMED---
---GENERATE STEP-BACK QUERY---
---GENERATE REWRITTEN QUERY---
---GENERATE SUBQUERIES---
---FINAL SUBQUERIES NUMBER: 4---
---GENERATE HYDE DOCUMENTS---
---RETRIEVE FROM VECTOR STORE---
---RETRIEVE FROM PUBMED---
vector store documents len 20


660it [06:29,  1.70it/s]

pubmed documents len 3





KeyboardInterrupt: 