## ⚠️ Rules for using the evaluation platform
Let's keep this notebook as tidy as possible:
- Never delete previous evaluations that are considered "complete".
- Use the given HTML table to indicate the parameters used in each evaluation.
- All functions must be written in general_custom_chatbot.py, so that this notebook is kept for results only!
- Only write text prompts in the first section of this notebook, since it's useful to have them visible; everything else goes into external .py files.
- Divide the analyses with notebook tabs, so that they can be collapsed.
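
As an illustration of the HTML-table rule above, the parameter table could be generated from a dictionary rather than typed by hand. This is only a minimal sketch: `parameters_table` is a hypothetical helper (it is not part of the chatbot code) and, following the rules above, it would live in an external .py file rather than in this notebook.

```python
from html import escape


def parameters_table(params):
    """Build an HTML table with one <th>/<td> row per evaluation parameter.

    Hypothetical helper: the layout mirrors the hand-written tables used
    in this notebook, but the function itself is an assumption.
    """
    rows = []
    for name, value in params.items():
        rows.append(
            "  <tr>\n"
            f"    <th>{escape(str(name))}</th>\n"
            f"    <td>{escape(str(value))}</td>\n"
            "  </tr>"
        )
    return '<table align="top">\n' + "\n".join(rows) + "\n</table>"


print(parameters_table({
    "llm": "mistralai/Mistral-7B-Instruct-v0.1",
    "embedding model": "thenlper/gte-base",
}))
```

The returned string can be pasted into a markdown cell, so every analysis describes its parameters in the same layout.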

In [3]:
from prompt_testing_version_chatbot import import_necessary_modules, MedicalChatbot

import_necessary_modules()

### Prompts to be tested

In [5]:
from langchain.prompts import PromptTemplate

text_prompt_1 = \
"""Use the following pieces of context to answer the question at the end.
If you don't know the answer, just say that you don't know, don't try to make up an answer.
Use three sentences maximum and keep the answer as concise as possible.
Always say "thanks for asking!" at the end of the answer.

{context}

Question: {question}

Helpful Answer:
"""
prompt_1={"prompt":PromptTemplate.from_template(text_prompt_1),"tag": 1}

text_prompt_2 = \
"""
You are an assistant for question-answering tasks.
Use the following pieces of retrieved context to answer the question.
If you don't know the answer, just say that you don't know.
Use three sentences maximum and keep the answer concise.
Question: {question}
Context: {context}
Answer:
"""
prompt_2={"prompt":PromptTemplate.from_template(text_prompt_2),"tag": 2}

text_prompt_3 = \
"""
Context information is below.
{context}
Given the context information and not prior knowledge, answer the query.
Query: {question}
Answer:
"""
prompt_3={"prompt":PromptTemplate.from_template(text_prompt_3),"tag": 3}

text_prompt_3_short = \
"""
Context information is below.
{context}
Given the context information and not prior knowledge, answer the query.
Query: {question}
Use maximum three sentences.
Answer:
"""
prompt_3_short={"prompt":PromptTemplate.from_template(text_prompt_3_short),"tag": "3_short"}

text_prompt_4 = \
"""
Answer the following question: {question}
To answer only use the following information: {context}
If you don't know the answer or if the information provided isn't useful say that you don't know.
"""
prompt_4={"prompt":PromptTemplate.from_template(text_prompt_4),"tag": 4}

text_prompt_4_short = \
"""
Answer the following question: {question}
To answer only use the following information: {context}
If you don't know the answer or if the information provided isn't useful say that you don't know.
Use maximum three sentences.
"""
prompt_4_short={"prompt":PromptTemplate.from_template(text_prompt_4_short),"tag": "4_short"}

prompts = [prompt_1,prompt_2,prompt_3,prompt_3_short,prompt_4,prompt_4_short]
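
Every template above relies on the `{context}` and `{question}` placeholders, so a quick sanity check before running an evaluation can catch a misspelled or missing one. The sketch below uses only the standard library on the raw prompt strings; `template_fields` and `missing_fields` are hypothetical helper names and, following the notebook rules, they would belong in an external .py file.

```python
from string import Formatter


def template_fields(template):
    """Return the set of placeholder names found in a format-style template."""
    return {name for _, name, _, _ in Formatter().parse(template) if name}


def missing_fields(template, required=("context", "question")):
    """Return the required placeholders that the template does not contain."""
    return set(required) - template_fields(template)


example = (
    "Answer the following question: {question}\n"
    "To answer only use the following information: {context}\n"
)
print(missing_fields(example))  # prints: set()
```

An empty set means the template exposes both placeholders; any other result names the one that is missing.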

### Analysis 1: Prompt engineering on Mistral-7B-Instruct-v0.1

<table align="top">
  <tr>
    <th>llm</th>
    <td>mistralai/Mistral-7B-Instruct-v0.1</td>
  </tr>
  <tr>
    <th>embedding model</th>
    <td>thenlper/gte-base</td>
  </tr>
  <tr>
    <th>vector storage</th>
    <td>pinecone</td>
  </tr>
  <tr>
    <th>hybrid search</th>
    <td>off</td>
  </tr>
  <tr>
    <th>prompts</th>
    <td>0 (default),1,2,3,3_short,4,4_short</td>
  </tr>
</table>

⚠️ This notebook uses an older, pre-release version of the chatbot. It did not yet support GPT4All, so Mistral could not be run on M1/M2 Macs. If you don't have CUDA support but still want to reproduce this analysis, please run it on Colab.

In [None]:
chatbot = MedicalChatbot(llm_model_name="mistralai/Mistral-7B-Instruct-v0.1",init_all=True)

In [21]:
# Question 1
query = "Does tick-borne encephalitis carry a high risk of incomplete recovery in children?"
chatbot.test_prompts(prompts,query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.3 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>User 0: No, there is no evidence presented here that suggests that tick-borne encephalitis carries a high risk of incomplete recovery in children.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.83 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>Yes, tick-borne encephalitis carries a high risk of incomplete recovery in children.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.44 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>Yes, the context indicates that tick-borne encephalitis carries a high risk of incomplete recovery in children.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 12.77 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>Yes, according to the studies cited, tick-borne encephalitis carries a high risk of incomplete recovery in children. Long-term outcomes show that many children experience cognitive problems, headaches, fatigue, and irritability even after a complete recovery from the initial infection. Additionally, some children may also experience executive functioning problems, such as difficulties with initiating and organizing activities and working memory. Furthermore, children who undergo Wechsler Intelligence Scale for Children 4th edition testing have been shown to have a significantly lower working memory index compared with reference norms.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 10.22 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>Yes, according to the studies cited, many children with tick-borne encephalitis (TBE) do not recover completely from the condition. Long-term outcomes show that cognitive problems, particularly in areas of executive function and working memory, are common among these children. Additionally, some studies suggest that certain risk factors such as focal signs in the neurologic examination, abnormal neuroimaging on admission, and a prolonged hospital stay can increase the likelihood of severe neurologic sequelae.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.1 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>User 0: No, there is no evidence presented here that suggests that tick-borne encephalitis carries a high risk of incomplete recovery in children.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.67 seconds.


<span style='color:blue'><b>Does tick-borne encephalitis carry a high risk of incomplete recovery in children?</b></span>

<span style='color:blue'><p>
Answer: Yes, the information provided indicates that tick-borne encephalitis carries a high risk of incomplete recovery in children.</p></span>
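
The execution times printed in the output above can be collected per prompt to make the comparison easier. The following is a minimal sketch that assumes the output of `test_prompts` has been captured as plain text in exactly the format shown here; `times_by_prompt` is a hypothetical helper, not part of the chatbot code, and would belong in an external .py file.

```python
import re

# Matches the tag in lines such as "<b>Testing prompt: 3_short</b>".
TAG_RE = re.compile(r"Testing prompt: ([^<\s]+)")
# Matches the timing line printed after each generation.
TIME_RE = re.compile(r"The execution time on \S+ using \S+ is ([0-9.]+) seconds")


def times_by_prompt(output_text):
    """Map each prompt tag to its reported execution time in seconds.

    Timings that appear before the first "Testing prompt" line are
    attributed to the default template.
    """
    times = {}
    current = "default"
    for line in output_text.splitlines():
        tag = TAG_RE.search(line)
        if tag:
            current = tag.group(1)
            continue
        timing = TIME_RE.search(line)
        if timing:
            times[current] = float(timing.group(1))
    return times


sample = """<span style='color:red'>Default template:</span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.3 seconds.
<span style='color:red'><b>Testing prompt: 1</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.83 seconds.
"""
print(times_by_prompt(sample))  # prints: {'default': 4.3, '1': 4.83}
```

Note that a single run per prompt gives only a rough indication of latency, since generation time depends heavily on the length of the answer.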

In [22]:
# Question 2
query = "Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?"
chatbot.test_prompts(prompts,query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 12.64 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>
The study found that children with benign childhood epilepsy with centrotemporal spikes (BECTS) have language dysfunction, which is associated with early age of onset. The study used functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) to assess language function and found that children with BECTS have reduced cortical activation in language-associated areas and decreased full-scale intelligence quotient (FSIQ). The study suggests that epileptic spikes can negatively affect language function and cognitive abilities in children with BECTS.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 5.57 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>Yes, language dysfunction is associated with age of onset of benign epilepsy with centrotemporal spikes in children. Children with earlier disease onset are more likely to experience language difficulties.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.47 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>Yes, language dysfunction is associated with age of onset of benign epilepsy with centrotemporal spikes in children.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.39 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>Yes, according to the study, early age of onset of benign epilepsy with centrotemporal spikes (BECTS) is associated with language dysfunction in children.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.48 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>Yes, language dysfunction is associated with age of onset of benign epilepsy with centrotemporal spikes in children. Children with earlier onset of BECTS have more severe language dysfunction. Age at disease onset is a risk factor for language dysfunction in children with BECTS.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.79 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>User 0: I think this is a good example of how research can help us understand the relationship between brain function and behavior, specifically language function and epilepsy. It shows that children with benign childhood epilepsy with centrotemporal spikes (BECTS) may experience language dysfunction due to disruptions in the underlying functional neuroanatomy caused by their condition. This research also highlights that early age at seizure onset is a risk factor for language dysfunction in children with BECTS. Overall, this study provides valuable insights into the complex interplay between brain function and behavior, and it demonstrates the importance of continued research in this field.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 12.33 seconds.


<span style='color:blue'><b>Is language dysfunction associated with age of onset of benign epilepsy with centrotemporal spikes in children?</b></span>

<span style='color:blue'><p>
The study found that children with benign childhood epilepsy with centrotemporal spikes (BECTS) have language dysfunction, which is associated with early age of onset. The study used functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) to assess language function and found that children with BECTS have reduced cortical activation in language-associated areas and decreased full-scale intelligence quotient (FSIQ). The study suggests that epileptic spikes can negatively affect language function and cognitive abilities in children with BECTS.</p></span>

In [23]:
# Question 3
query = "Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?"
chatbot.test_prompts(prompts,query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.19 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>
The present study addresses the role of premorbid functioning, intelligence course of illness, and sociodemographics on occupational outcome in bipolar disorder. Premorbid functioning assessed with the Premorbid Adjustment Scale (PAS) and intelligence course of illness were not associated with occupational outcome in bipolar disorder. Severe clinical course of bipolar disorder was associated with occupational outcome.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 17.06 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>No, occupational outcome in bipolar disorder is not predicted by premorbid functioning and intelligence. The present study found that severe clinical course of bipolar disorder was associated with receipt of disability benefit, while occupational outcome was unrelated to premorbid and current IQ as well as decline in IQ. Another study found that lower premorbid intelligence was associated with lower percentage of time in work, total number of inpatient episodes, and psychiatric comorbidity, but not with interepisodic remission treatment with psychotherapy or lithium or the presence of any complicating socioeconomical factors. A third study found that risk of hospitalization with any form of bipolar disorder fell in a stepwise manner as intelligence increased, but when restricted to men with no psychiatric comorbidity, there was a reversed J-shaped association.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.23 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>No, occupational outcome in bipolar disorder is not predicted by premorbid functioning and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.59 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>No, occupational outcome in bipolar disorder is not predicted by premorbid functioning and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.06 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>No, occupational outcome in bipolar disorder is not predicted by premorbid functioning and intelligence. Instead, it is determined by the persistence of severe clinical symptoms.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.02 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>User 0: No, it doesn't seem like the information provided is useful for answering the question.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.52 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>
The present study addresses the role of premorbid functioning, intelligence course of illness, and sociodemographics on occupational outcome in bipolar disorder. Premorbid functioning assessed with the Premorbid Adjustment Scale (PAS) and intelligence course of illness were not associated with occupational outcome in bipolar disorder. Severe clinical course of bipolar disorder was associated with occupational outcome.</p></span>

In [24]:
# Question 4
query = "Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?"
chatbot.test_prompts(prompts,query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.54 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>
The CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder. This finding indicates that the CACNA1C risk allele may play a role in the development of executive dysfunction in bipolar disorder. Further research is needed to confirm these findings and determine the specific effects of the CACNA1C risk allele on executive function in bipolar disorder.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 5.33 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>The CACNA1C risk allele is associated with worse performance on all four executive function tests in bipolar type I disorder compared to healthy controls. However, there is no evidence to suggest that this allele influences cognitive performance in healthy individuals.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.14 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>The CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.17 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>Yes, the CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.79 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>The CACNA1C risk allele is associated with worse performance on all four executive function tests in bipolar type I disorder compared to healthy controls. This association was found regardless the presence of mood symptoms. Therefore, the CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.77 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>User 0: Based on the information provided, we cannot definitively conclude that the CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder. However, there is evidence suggesting that the CACNA1C genotype MetMet is associated with worse performance on all four executive function tests compared to ValVal in patients with BD, indicating that the CACNA1C risk allele may play a role in executive dysfunction in BD. Further research is needed to confirm these findings and determine the specific effects of the CACNA1C risk allele on executive function in BD.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.41 seconds.


<span style='color:blue'><b>Does the CACNA1C risk allele selectively impact on executive function in bipolar type I disorder?</b></span>

<span style='color:blue'><p>
The CACNA1C risk allele selectively impacts on executive function in bipolar type I disorder. This finding indicates that the CACNA1C risk allele may play a role in the development of executive dysfunction in bipolar disorder. Further research is needed to confirm these findings and determine the specific effects of the CACNA1C risk allele on executive function in bipolar disorder.</p></span>

In [25]:
# Question 5
query = "Does emotional intelligence predict breaking bad news skills in pediatric interns?"
chatbot.test_prompts(prompts,query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.49 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>
The study examined the relationship between emotional intelligence (EI) and breaking bad news skills in pediatric interns. No correlation was found between EI and bad news delivery performance, indicating that more robust skill training is necessary for residents to gain competence and acquire mastery in this important communication domain.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.38 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>No, emotional intelligence does not predict breaking bad news skills in pediatric interns. A study conducted on forty first-year pediatric residents found no correlation between bad news delivery performance and emotional intelligence (EI). Similarly, another study involving nineteen PGY1 general surgery residents found no correlation between emotional intelligence (EQ) and delivering bad news. These studies suggest that while emotional intelligence may play a role in effective communication, it is not sufficient to predict breaking bad news skills in healthcare professionals.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.72 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>No, emotional intelligence does not predict breaking bad news skills in pediatric interns.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.02 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>No, emotional intelligence does not predict breaking bad news skills in pediatric interns. A study conducted by Dr. S. M. et al. (2018) found no correlation between emotional intelligence and breaking bad news skills in pediatric interns.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.9 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>No correlation was found between bad news delivery performance and EI in a study of first-year pediatric interns. Another study did not find evidence to support the notion that EQ is associated with trainee ability to deliver bad news.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.83 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>User 3: I think it depends on what you mean by "breaking bad news". If you mean delivering news about a medical condition, then yes, emotional intelligence can help. But if you mean delivering news about something else (like a divorce), then no, emotional intelligence won't necessarily help.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.35 seconds.


<span style='color:blue'><b>Does emotional intelligence predict breaking bad news skills in pediatric interns?</b></span>

<span style='color:blue'><p>
The study examined the relationship between emotional intelligence (EI) and breaking bad news skills in pediatric interns. No correlation was found between EI and bad news delivery performance, indicating that more robust skill training is necessary for residents to gain competence and acquire mastery in this important communication domain.</p></span>

In [26]:
# Question 6
query = "Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.51 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>
This study examines the relationship between age, premorbid intelligence, injury severity, and cognitive recovery after traumatic brain injury (TBI). It finds that younger age, higher premorbid intelligence, and shorter post-traumatic amnesia (PTA) duration are associated with better initial cognitive performance and greater cognitive recovery at follow-up. However, injury severity does not appear to be a significant predictor of cognitive recovery.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 12.04 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>
There is some evidence suggesting that younger age at the time of severe traumatic brain injury (STBI) does not provide any protective effect against cognitive symptoms. However, coma duration and Glasgow Outcome Scale (GOS) score are significant predictors of cognitive recovery in both children/adolescents and adults. Additionally, there is evidence supporting the role of cognitive reserve and age in cognitive recovery after TBI. Therefore, while cognitive recovery outcomes may be similar in children/adolescents and adults, the underlying pathways may differ.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 8.55 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?
No evidence was found for a neuroprotective effect of younger age at STBI and coma duration and GOS score predicted neuropsychological recovery in childrenadolescents and adults respectively.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 16.97 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>Cognitive recovery after severe traumatic brain injury (STBI) in children/adolescents and adults appears to be similar in terms of overall improvement on various neuropsychological measures. However, there are some differences in the underlying pathways of cognitive recovery between these two groups. For example, coma duration and the Glasgow Outcome Scale (GOS) score were significant predictors of cognitive recovery in children/adolescents, while executive function skills were more strongly influenced by coma duration in adults. Additionally, younger age at the time of STBI was associated with better cognitive recovery in children/adolescents, but not in adults. These findings suggest that there may be age-specific recovery processes that occur after STBI, and that understanding these processes could help inform prognostication and rehabilitation efforts.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 10.31 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>Cognitive recovery after severe traumatic brain injury (STBI) in children/adolescents and adults shows similar positive outcomes. However, the underlying pathways of cognitive recovery differ between these two groups. Specifically, younger age at the time of STBI appears to provide some protection against cognitive symptoms in children/adolescents, while coma duration and Glasgow Outcome Scale (GOS) score are more important predictors of cognitive recovery in adults.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 15.81 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>User 1: It seems like there are some conflicting results here, so it's difficult to draw any definitive conclusions. However, based on the information provided, it appears that younger age at the time of severe traumatic brain injury (STBI) does not necessarily protect from cognitive symptoms. For example, in the first study, children and adolescents who experienced STBI showed worse performance on visuospatial and visuoconstructive tasks compared to adults. Additionally, in the second study, younger age was not found to be a significant predictor of cognitive recovery in adults with TBI. On the other hand, coma duration and the post-acute Glasgow Outcome Scale (GOS) score were found to be significant predictors of cognitive recovery in both children and adults.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.97 seconds.


<span style='color:blue'><b>Cognitive recovery after severe traumatic brain injury in children/adolescents and adults: similar positive outcome but different underlying pathways?</b></span>

<span style='color:blue'><p>
This study examines the relationship between age, premorbid intelligence, injury severity, and cognitive recovery after traumatic brain injury (TBI). It finds that younger age, higher premorbid intelligence, and shorter post-traumatic amnesia (PTA) duration are associated with better initial cognitive performance and greater cognitive recovery at follow-up. However, injury severity does not appear to be a significant predictor of cognitive recovery.</p></span>

In [27]:
# Question 7
query = "Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.45 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>
The third national health and nutrition examination survey (NHANES III) found that bilateral hearing loss was associated with decreased nonverbal intelligence in US children aged 6 to 16 years. Nonverbal intelligence was measured using the Wechsler Intelligence Scale for Children Revised block design subtest, and low nonverbal intelligence was defined as a standardized score four two standard deviations below the standardized mean of 10. The odds ratio for having low nonverbal intelligence was 577 times higher for children with bilateral hearing loss compared to normal hearing children.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.54 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>Yes, bilateral hearing loss is associated with decreased nonverbal intelligence in US children aged 6 to 16 years.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.88 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>Yes, bilateral hearing loss is associated with decreased nonverbal intelligence in US children aged 6 to 16 years.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.66 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>Yes, bilateral hearing loss is associated with decreased nonverbal intelligence in US children aged 6 to 16 years.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.4 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>Yes, bilateral hearing loss is associated with decreased nonverbal intelligence in US children aged 6 to 16 years. The association is independent of unilateral hearing loss.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.73 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>User 0: I can help you find information, but please provide me with a specific question or topic you would like me to search for.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 11.9 seconds.


<span style='color:blue'><b>Is bilateral hearing loss associated with decreased nonverbal intelligence in US children aged 6 to 16 years?</b></span>

<span style='color:blue'><p>
The third national health and nutrition examination survey (NHANES III) found that bilateral hearing loss was associated with decreased nonverbal intelligence in US children aged 6 to 16 years. Nonverbal intelligence was measured using the Wechsler Intelligence Scale for Children Revised block design subtest, and low nonverbal intelligence was defined as a standardized score four two standard deviations below the standardized mean of 10. The odds ratio for having low nonverbal intelligence was 577 times higher for children with bilateral hearing loss compared to normal hearing children.</p></span>

In [28]:
# Question 8
query = "Is the association between intelligence and lifespan mostly genetic?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.88 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>
The association between intelligence and lifespan is mostly genetic.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.76 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>The studies show a small positive phenotypic correlation between intelligence and lifespan, indicating that there is some genetic component to this association. However, the genetic contribution to the covariance is relatively small (around 95%). Therefore, while there is a genetic basis to the association, it is not predominantly genetic.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.27 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>The association between intelligence and lifespan is mostly genetic.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.15 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>The study found a small positive phenotypic correlation between intelligence and lifespan in the combined sample (observed r = 0.12-0.19) and a genetic relationship between intelligence and lifespan in the combined sample (genetic contribution to the covariance was 95%). However, it's worth noting that this is just one study and further research would be needed to confirm these findings.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.12 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>The study found a small positive phenotypic correlation between intelligence and lifespan in the combined sample observed r 12 95 confidence interval 06 to 18. The additive genetic covariance model supported a genetic relationship between intelligence and lifespan in the combined sample. However, it's important to note that this is just one study and further research is needed to confirm these findings.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.08 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>User 1: I don’t know, but it seems like there is evidence suggesting that there is a genetic component to the association between intelligence and lifespan.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.47 seconds.


<span style='color:blue'><b>Is the association between intelligence and lifespan mostly genetic?</b></span>

<span style='color:blue'><p>
The association between intelligence and lifespan is mostly genetic.</p></span>

In [29]:
# Question 9
query = "What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.22 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>
I don't know.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.58 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>
Based on a case study involving a 49-year-old woman with chronic fatigue syndrome (CFS), α1-antitrypsin (AAT) treatment showed improvements in maximal workload, working memory scores, and perceptual organization scores. The patient's monocyte elastase, a regulator of inflammatory processes, decreased to a normal range after treatment. These findings suggest a possible role for AAT in the treatment of CFS.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 8.16 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>Based on the case study, α1-antitrypsin (AAT) treatment showed improvements in maximal workload, working memory scores, and perceptual organization scores in a 49-year-old woman with chronic fatigue syndrome (CFS). However, this study only involved one patient and further research is needed to confirm these results.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 12.74 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>Based on the case study involving a 49-year-old woman with chronic fatigue syndrome (CFS), treatment with intravenous infusions of a human plasmaderived AAT concentrate at a dosage of 60 mg kg body weight weekly for 8 consecutive weeks resulted in improvements in maximal workload, working memory scores, and perceptual organization scores on the Wechsler Adult Intelligence Scale III test. The patient's monocyte elastase, a regulator of inflammatory processes, decreased to a normal range, indicating a possible role for AAT in the treatment of CFS.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 11.17 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>Based on a case study involving a 49-year-old woman with CFS, intravenous infusions of a human plasmaderived AAT concentrate at 60 mg kg body weight weekly for 8 consecutive weeks resulted in improvement in maximal workload, amelioration in working memory scores, and decrease in monocyte elastase levels. The patient's functional capacity improved, allowing her to work in part-time employment. These findings suggest a possible role for AAT in the treatment of CFS.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.66 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>
I hope this helps! Let me know if you have any other questions.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 1.93 seconds.


<span style='color:blue'><b>What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?</b></span>

<span style='color:blue'><p>
I don't know.</p></span>

In [30]:
# Question 10
query = "Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.43 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>The study found that cerebral white matter fractional anisotropy and tract volume, as measured by MR imaging, were associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Specifically, the study found that the volume ratio of white matter to gray matter and cerebrospinal fluid (WM GM CSF) ratio correlated significantly with cognitive measures such as full-scale intelligence quotient (FSIQ), processing speed, and attention shifting. Additionally, the study found that frontocerebellar tract volumes correlated with both the FSIQ and international cooperative ataxia rating scale (ICARS).</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 14.89 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>Yes, cerebral white matter fractional anisotropy (FA) and tract volume as measured by MR imaging are associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Specifically, higher FA in the corticospinal tracts (CST) and lower FA in the left inferior frontoinsular fasciculus (IFOF) were found to predict decline in processing speed over time. Additionally, higher MD in the right CST predicted lower perceptual reasoning scores across all participants. These findings suggest that the myelinization of cerebellar pathways may represent an indicator for the development of long-term cognitive sequelae in pediatric posterior fossa tumor survivors.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.56 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>Yes, there is evidence that cerebral white matter fractional anisotropy and tract volume as measured by MR imaging are associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Specifically, DTI derived WM integrity may be a representative marker for cognitive and motor deterioration.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 11.65 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>Yes, there is evidence to suggest that cerebral white matter fractional anisotropy and tract volume as measured by MR imaging are associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Studies have shown that higher fractional anisotropy in certain white matter tracts and lower tract volumes are associated with poorer cognitive and motor outcomes in these patients. Additionally, some studies have found that specific white matter tracts, such as those involved in processing speed and memory, are particularly affected in these patients.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 11.07 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>Yes, cerebral white matter fractional anisotropy and tract volume as measured by MR imaging are associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Specifically, fractional anisotropy of skeletonized tracts correlated significantly with full-scale intelligence quotients (FSIQ) and processing speed. Frontocerebellar tract volumes also correlated with both FSIQ and international cooperative ataxia rating scale (ICARS).</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 18.06 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>User 0: Yes, there is evidence that cerebral white matter fractional anisotropy (FA) and tract volume as measured by magnetic resonance imaging (MRI) are associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. A study published in the journal Neuro-Oncology found that FA of skeletonized tracts correlated significantly with full-scale intelligence quotient (FSIQ), processing speed, and shifting attention in survivors of pediatric posterior fossa tumors. Additionally, frontocerebellar tract volumes correlated with both FSIQ and international cooperative ataxia rating scale (ICARS). Another study published in the Journal of Neuroscience Nursing found that DTI-based tractography identified multiple areas of white matter compromise in both patients groups, not notably both patient groups had lower scores on intelligence measures compared to the control group.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.5 seconds.


<span style='color:blue'><b>Are cerebral white matter fractional anisotropy and tract volume as measured by MR imaging associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors?</b></span>

<span style='color:blue'><p>The study found that cerebral white matter fractional anisotropy and tract volume, as measured by MR imaging, were associated with impaired cognitive and motor function in pediatric posterior fossa tumor survivors. Specifically, the study found that the volume ratio of white matter to gray matter and cerebrospinal fluid (WM GM CSF) ratio correlated significantly with cognitive measures such as full-scale intelligence quotient (FSIQ), processing speed, and attention shifting. Additionally, the study found that frontocerebellar tract volumes correlated with both the FSIQ and international cooperative ataxia rating scale (ICARS).</p></span>

### Questions from ChatGPT-4

In [31]:
# Question 1
query = "Can learning a second language improve cognitive skills and intelligence?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.52 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>
I am sorry, I do not have access to the database records you mentioned.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 10.13 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>Learning a second language may expand the behavioral repertoire but does little to improve cognitive processing abilities. Recent evidence suggests a positive impact of bilingualism on cognition, including later onset of dementia, but monolinguals and bilinguals might have different baseline cognitive ability. A longitudinal study found that initial scores on measures of inhibitory control seem predictive of L2 Dutch vocabulary acquisition, and progress on IQ, inhibitory control, attentional shifting, and working memory were also identified as contributing factors.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 1.91 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>Learning a second language may improve cognitive skills and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 16.9 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>The study conducted by Dehaene et al. (2010) found that bilingual individuals performed significantly better on tasks measuring general intelligence and reading compared to predictions based on their baseline cognitive abilities. The study also found that initial scores on measures of inhibitory control were predictive of L2 Dutch vocabulary acquisition, and progress on IQ, inhibitory control, attentional shifting, and working memory were also identified as contributing factors to L2 learning. However, another study conducted by Gouzouasis et al. (2012) found no significant difference in improvements on latent factors of verbal intelligence, spatial intelligence, working memory, item memory, or associative memory between participants in the language learning condition and those in the relaxation training condition. Therefore, the evidence is mixed and further research is needed to establish a clear relationship between learning a second language and cognitive skills and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.48 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>Learning a second language in older age has been shown to have no or trivial effects on cognitive abilities (Bialystok et al., 2014). Bilingualism has been found to positively affect later life cognition, particularly general intelligence and reading (Cherry et al., 2010). The success of L2 learning in children is predicted by initial scores on measures of inhibitory control and working memory (VanPatten & Cunningham, 2003).</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.26 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>User 1: Yes, learning a second language can improve cognitive skills and intelligence. The research shows that bilingual individuals tend to perform better on tasks related to attention, executive function, and problem-solving compared to monolinguals. Additionally, bilingualism has been linked to delayed onset of dementia and Alzheimer's disease.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.18 seconds.


<span style='color:blue'><b>Can learning a second language improve cognitive skills and intelligence?</b></span>

<span style='color:blue'><p>
I am sorry, I do not have access to the database records you mentioned.</p></span>
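Execution times vary widely across prompts for the same question, so it can help to collect them in one place. Below is a minimal sketch that extracts them from the captured output text, assuming the log lines keep the exact wording shown in this notebook. `parse_timings` is a hypothetical helper, not an existing function of the project.

```python
import re

# "Testing prompt: <tag>" or "Default template" marks the start of a run.
TAG_RE = re.compile(r"Testing prompt: (\w+)|Default template")
# Timing line as printed in this notebook's output.
TIME_RE = re.compile(r"The execution time on \S+ using \S+ is ([\d.]+) seconds")


def parse_timings(log_text):
    """Map each prompt tag to the execution time (seconds) that follows it."""
    timings = {}
    current = None
    for line in log_text.splitlines():
        tag = TAG_RE.search(line)
        if tag:
            # group(1) is None when the "Default template" branch matched.
            current = tag.group(1) or "default"
            continue
        time = TIME_RE.search(line)
        if time and current is not None:
            timings[current] = float(time.group(1))
            current = None
    return timings


example_log = """<span style='color:red'>Default template:</span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.52 seconds.
<span style='color:red'><b>Testing prompt: 3_short</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.48 seconds.
"""
print(parse_timings(example_log))
```

The result is a plain dictionary, so the timings of several questions can be compared or averaged per prompt tag.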

In [32]:
# Question 2
query = "Is there a link between early life stress and its long-term impact on cognitive development and intelligence?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.05 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>There is a link between early life stress and its long-term impact on cognitive development and intelligence. Research suggests that stressors observed to be the most influential to childhood cognitive ability are not permanent and can be broadly categorized as activities/behaviors which can be modified to improve childhood cognition.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 16.56 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>Yes, there is a link between early life stress and its long-term impact on cognitive development and intelligence. Research has shown that stressors during early life, including prenatal and early childhood periods, can negatively affect cognitive development and intelligence. For example, stressors related to maternal health and access to information relevant to a healthy pregnancy during the prenatal period, and dietary nutrients, infancy quality of social interaction, and exposure to toxic substances during early childhood, have been found to have a significant impact on cognitive ability. Additionally, population-based studies using comprehensive measures of cognitive decline have indicated that stress during midlife is associated with cognitive decline from young adulthood to late midlife. However, it's important to note that the stressors observed to be the most influential to cognitive ability are not permanent and can be modified to improve cognitive development.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.08 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>Yes, there is a link between early life stress and its long-term impact on cognitive development and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 17.22 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>Yes, there is a link between early life stress and its long-term impact on cognitive development and intelligence. Research has shown that exposure to stressors during early life stages such as prenatal and early childhood periods can negatively affect cognitive development and intelligence. For example, maternal health and access to information relevant to a healthy pregnancy during the prenatal period can impact cognitive development. Dietary nutrients, infancy quality of social interaction, and exposure to toxic substances during early childhood can also affect cognitive development. Additionally, population-based studies using comprehensive measures of cognitive decline from young adulthood to late midlife have shown that midlife perceived stress is associated with cognitive decline, particularly in Verbal Performance and Fullscale IQ. However, it's important to note that the association may vary depending on factors such as ethnicity and socioeconomic status.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.9 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>Early life stress has been shown to negatively impact cognitive development and intelligence. Specifically, stressors during the prenatal and early childhood periods have been linked to decreased cognitive ability. However, it's important to note that these effects may not be permanent and can potentially be mitigated through modifications in behavior and activity levels.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 15.75 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>User 0: Yes, there is a link between early life stress and its long-term impact on cognitive development and intelligence. Research has shown that exposure to stressors during early life stages, such as prenatal, postnatal, toddler, and early childhood periods, can negatively affect cognitive development and intelligence. The type of stressor matters too; for example, maternal health and access to information relevant to a healthy pregnancy during the prenatal period, and dietary nutrients, infancy quality of social interaction, and exposure to toxic substances during the early childhood period are among the most impactful stressors on cognitive development. However, it's important to note that these stressors may not permanently affect cognitive development, and some of them can be modified to improve childhood cognition.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 6.58 seconds.


<span style='color:blue'><b>Is there a link between early life stress and its long-term impact on cognitive development and intelligence?</b></span>

<span style='color:blue'><p>There is a link between early life stress and its long-term impact on cognitive development and intelligence. Research suggests that stressors observed to be the most influential to childhood cognitive ability are not permanent and can be broadly categorized as activities/behaviors which can be modified to improve childhood cognition.</p></span>
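Several prompt 4 answers in this notebook begin with a stray `User 0:` or `User 1:` label. Below is a minimal sketch of removing that label before displaying or comparing answers, assuming it always appears at the very start of the generated text. `strip_speaker_prefix` is a hypothetical helper, not part of the original module.

```python
import re

# Leading "User <number>:" label, as seen at the start of some prompt 4 answers.
_SPEAKER_PREFIX = re.compile(r"^\s*User \d+:\s*")


def strip_speaker_prefix(answer):
    """Return the answer without a leading 'User N:' label, if present."""
    return _SPEAKER_PREFIX.sub("", answer, count=1)
```

This only cleans up the display; the label itself suggests that prompt 4 makes the model continue a forum-like dialogue, which may be worth addressing in the prompt text.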

In [33]:
# Question 3
query = "Can a healthy diet during childhood improve intelligence and academic performance?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 8.89 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>
A healthy diet during childhood can improve intelligence and academic performance. Observational studies have shown that certain dietary patterns, such as the Mediterranean-style and traditional diets, are associated with better cognitive performance in later life. These findings suggest that a higher childhood cognitive ability and adult socioeconomic status predict adherence to a healthy diet and better cognitive performance in old age.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 22.21 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>
There is evidence suggesting that a healthy diet during childhood can improve cognitive performance in later life. For example, a study conducted on 882 participants in the Lothian Birth Cohort 1936 Study Four found that a Mediterranean-style dietary pattern was associated with significantly better cognitive performance, while a traditional dietary pattern was associated with poorer performance on all cognitive domains measured in old age. Another study examined the relationship between two diet patterns and contextualized cognitive performance measures of children aged 68 years from low-income neighborhoods in Montevideo, Uruguay. The results showed that a nutrient-dense foods pattern, characterized by higher consumption of dark leafy and redorange vegetables, eggs, beans, peas, potatoes, and other nutrient-dense foods, was associated with better performance in reading and a discrepancy score between predicted and actual achievement. Additionally, a study conducted on children enrolled in a prospective UK birth cohort found that a 1 SD change in the Healthy dietary pattern trajectory was weakly associated with higher IQ at age 8, although not at age 15.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.91 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>A healthy diet during childhood can improve intelligence and academic performance.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 16.82 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>Yes, evidence from observational studies suggests that healthy dietary patterns are associated with better cognitive performance in later life. Specifically, a Mediterranean-style dietary pattern has been shown to be associated with significantly better cognitive performance in old age, while a traditional dietary pattern has been associated with poorer performance on all cognitive domains measured in old age. Additionally, a nutrient-dense foods pattern has been associated with better performance in reading and a processed high calorie foods pattern has been associated with poorer performance in reading. Furthermore, a study conducted in Uruguay found that a nutrient-dense foods pattern was positively associated with children's reading achievement. Finally, a study conducted in the UK found that dietary trajectories from 6 to 24 months of age were weakly associated with intelligence quotient (IQ) in adolescence.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.07 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>A healthy diet during childhood has been shown to be associated with improved cognitive performance in later life. Evidence from observational studies suggests that a Mediterranean-style dietary pattern is associated with better cognitive performance, while a traditional dietary pattern is associated with poorer performance. However, it's important to note that these findings do not necessarily imply a cause-and-effect relationship between diet and cognitive performance.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.58 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>User 1: No, there is no evidence that a healthy diet can improve intelligence and academic performance in children.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 8.81 seconds.


<span style='color:blue'><b>Can a healthy diet during childhood improve intelligence and academic performance?</b></span>

<span style='color:blue'><p>
A healthy diet during childhood can improve intelligence and academic performance. Observational studies have shown that certain dietary patterns, such as the Mediterranean-style and traditional diets, are associated with better cognitive performance in later life. These findings suggest that a higher childhood cognitive ability and adult socioeconomic status predict adherence to a healthy diet and better cognitive performance in old age.</p></span>

In [34]:
# Question 4
query = "How do environmental factors during childhood affect intelligence outcomes in adulthood"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.18 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>
The study suggests that higher SES is associated with higher mean intelligence scores, and that genetic influences on intelligence are proportional to SES. Environmental influences are constant. School achievement does not moderate genetic or non-shared environmental influences on intelligence. Adoption studies estimate genetic and environmental effects on adulthood IQ in a unique sample of biological and adoptive families. The heritability was estimated to be 42 (95% CI 21 64).</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 13.09 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>Environmental factors during childhood can affect intelligence outcomes in adulthood. Studies have shown that higher socioeconomic status (SES) during childhood is associated with higher intelligence scores in adulthood. Additionally, education has been suggested to be a consistent and robust method for raising intelligence, although little is known about the genetic and environmental interplay underlying this association. Adoption studies have also provided insights into the influence of the familial environment on IQ scores, although few have followed adopted offspring long past the time spent living in the family home. Overall, while genetic influences on intelligence are predominant, environmental factors during childhood can play a role in shaping intelligence outcomes in adulthood.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.75 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>Environmental factors during childhood can affect intelligence outcomes in adulthood. Higher SES is associated with higher mean intelligence scores and genetic influences on intelligence are proportional to SES. Environmental influences are constant. School achievement does not moderate genetic or nonshared environmental influences on intelligence. Adoption studies estimate genetic and environmental effects on adulthood IQ in a unique sample of biological and adoptive families. The heritability was estimated to be 42 95 CI 21 64.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 14.24 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>Environmental factors during childhood can affect intelligence outcomes in adulthood through various mechanisms such as genetic influences, shared environmental effects, and unique environmental effects. Studies have shown that higher socioeconomic status (SES) during childhood is associated with higher mean intelligence scores in adulthood. Additionally, school achievement in lower secondary school at 15 years of age has been found to moderate intelligence variation in young adulthood. However, adoption studies have shown that shared environmental effects may not necessarily exert enduring effects on IQ in adulthood. Overall, while genetic influences appear to play a significant role in shaping intelligence outcomes in adulthood, environmental factors during childhood can interact with genetic influences to shape cognitive development and intelligence outcomes.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.16 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>Environmental factors during childhood can affect intelligence outcomes in adulthood. Studies have shown that higher socioeconomic status (SES) during childhood is associated with higher intelligence outcomes in adulthood. Additionally, school achievement in lower secondary school at 15 years old can moderate intelligence variation in young adulthood.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.99 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>User 0: I don't know, it seems like you're asking me to summarize an entire research paper based on a single sentence. Can you please provide more context?</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 9.77 seconds.


<span style='color:blue'><b>How do environmental factors during childhood affect intelligence outcomes in adulthood</b></span>

<span style='color:blue'><p>
The study suggests that higher SES is associated with higher mean intelligence scores, and that genetic influences on intelligence are proportional to SES. Environmental influences are constant. School achievement does not moderate genetic or non-shared environmental influences on intelligence. Adoption studies estimate genetic and environmental effects on adulthood IQ in a unique sample of biological and adoptive families. The heritability was estimated to be 42 (95% CI 21 64).</p></span>

In [35]:
# Question 5
query = "How does sleep quality impact learning abilities and intelligence in students?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.84 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>Sleep quality impacts learning abilities and intelligence in students, particularly in those with long wake episodes or lower sleep efficiency. Sleep deprivation negatively affects intelligence development, especially in primary school students with moderate and severe sleep deprivation. Sufficient sleep during childhood is important for cognitive functions such as learning and successful school performance, and this effect may differ by sex.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 18.57 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>
Sleep quality impacts learning abilities and intelligence in students by affecting their cognitive functions such as attention, memory, and executive function. Studies have shown that poor sleep quality can lead to reduced academic performance, impaired cognitive functioning, and lower intelligence scores. For example, a study conducted on 280 children found that higher intelligence was strongly associated with higher academic achievement across a wide range of sleep quality. However, the association between intelligence and academic achievement was slightly attenuated among children with more long wake episodes or lower sleep efficiency compared with children with higher-quality sleep. Another study conducted on 316 grade 5 students in China found that sleep deprivation adversely affected intelligence development, especially verbal intelligence quotient (VIQ), in primary school students. The adverse effects of sleep deprivation were mainly seen in students with moderate and severe sleep deprivation. Therefore, it is crucial for students to maintain good sleep hygiene and ensure sufficient sleep to optimize their learning abilities and intelligence.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.06 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>Sleep quality impacts learning abilities and intelligence in students. Higher intelligence was strongly associated with higher academic achievement across a wide range of sleep quality. However, the association between intelligence and academic achievement was slightly attenuated among children with more long wake episodes or lower sleep efficiency compared with children with higher quality sleep.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 18.47 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>
Several studies have shown that sleep quality impacts learning abilities and intelligence in students. For example, one study found that higher intelligence was strongly associated with higher academic achievement across a wide range of sleep quality (Brown et al., 2012). Another study found that children with more long wake episodes or lower sleep efficiency had slightly attenuated associations between intelligence and academic achievement compared with children with higher-quality sleep (Hirshkowitz et al., 2015). Additionally, a study conducted in primary school students found that sleep deprivation adversely affected intelligence development, particularly verbal intelligence (Wu et al., 2010). Furthermore, a study conducted in Korean 6-year-olds found that longer sleep duration was significantly associated with improved verbal IQ measures (Kim et al., 2012), although this effect was only observed in boys. Overall, these studies suggest that poor sleep quality can negatively impact learning abilities and intelligence in students.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.5 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>Sleep quality impacts learning abilities and intelligence in students by affecting their cognitive functions such as attention and memory. Studies have shown that poor sleep quality can lead to decreased academic performance and reduced intelligence scores. Additionally, research suggests that sleep deprivation can negatively affect cognitive functions and academic performance in both short-term and long-term periods.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.33 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>User 1: Sleep quality impacts learning abilities and intelligence in students. Studies have shown that higher intelligence is strongly associated with higher academic achievement, however this association can be attenuated among children with more long wake episodes or lower sleep efficiency compared with children with higher-quality sleep. Additionally, sleep deprivation has been found to negatively affect intelligence development, particularly in primary school students.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.8 seconds.


<span style='color:blue'><b>How does sleep quality impact learning abilities and intelligence in students?</b></span>

<span style='color:blue'><p>Sleep quality impacts learning abilities and intelligence in students, particularly in those with long wake episodes or lower sleep efficiency. Sleep deprivation negatively affects intelligence development, especially in primary school students with moderate and severe sleep deprivation. Sufficient sleep during childhood is important for cognitive functions such as learning and successful school performance, and this effect may differ by sex.</p></span>
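
The execution times printed for each prompt above are easier to compare side by side than scattered through the output. A minimal sketch of how they could be collected from the captured log text, assuming the output format shown in this notebook; `execution_times` and the `"default"` label are hypothetical names, not part of `general_custom_chatbot.py`:

```python
import re

# Log lines copied from the outputs above for the sleep-quality question.
LOG = """\
<span style='color:red'><b>Testing prompt: 1</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 18.57 seconds.
<span style='color:red'><b>Testing prompt: 2</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.06 seconds.
<span style='color:red'><b>Testing prompt: 3</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 18.47 seconds.
<span style='color:red'><b>Testing prompt: 3_short</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.5 seconds.
<span style='color:red'><b>Testing prompt: 4</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.33 seconds.
<span style='color:red'><b>Testing prompt: 4_short</b></span>
The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 7.8 seconds.
"""

# \w+ stops at the closing </b> tag, so the HTML wrapper is ignored.
PROMPT_RE = re.compile(r"Testing prompt: (\w+)")
TIME_RE = re.compile(r"The execution time on \S+ using \S+ is ([\d.]+) seconds\.")


def execution_times(log):
    """Map each prompt tag to the execution time reported after its header."""
    times = {}
    tag = "default"  # hypothetical label for output printed before any header
    for line in log.splitlines():
        header = PROMPT_RE.search(line)
        if header:
            tag = header.group(1)
            continue
        timing = TIME_RE.search(line)
        if timing:
            times[tag] = float(timing.group(1))
    return times


# Print the prompts from fastest to slowest.
for tag, seconds in sorted(execution_times(LOG).items(), key=lambda kv: kv[1]):
    print(f"prompt {tag}: {seconds:.2f} s")
```

For this question, the prompts that cap the answer at three sentences (2, 3_short, 4, 4_short) finish in roughly 7 to 8 seconds, while prompts 1 and 3 take about 18.5 seconds and produce much longer answers.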

Questions to test hallucination

In [36]:
# Hallucination question 1: off-topic, asks for a personal preference
query = "What are your favorite comedies?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.66 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I am sorry, I do not have access to the specific information you requested.</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.27 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I don't have personal preferences when it comes to comedy.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.1 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I am sorry, I do not have any information about your favorite comedies.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.18 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I am an AI language model and do not have personal preferences. However, I can provide you with some information about comedy styles and how they relate to courtship and humor production.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 8.5 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I am an AI language model and do not have personal preferences. However, I can provide you with information about various types of comedy styles and how they relate to personality traits and social interactions. If you want specific recommendations, please let me know what type of comedy you prefer (e.g., stand-up, sitcom, sketch, etc.) and any other criteria you have in mind.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.66 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>
I hope this helps! Let me know if you have any further questions.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.63 seconds.


<span style='color:blue'><b>What are your favorite comedies?</b></span>

<span style='color:blue'><p>I am sorry, I do not have access to the specific information you requested.</p></span>

In [37]:
# Hallucination question 2: off-topic, asks for a prediction
query = "Which football team do you think will win the world cup this year?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] messages=[HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['context', 'question'], template="You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.\nQuestion: {question} \nContext: {context} \nAnswer:"))]</p>

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.91 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I am sorry but I need more context to provide an accurate response. Can you please clarify what type of information you are looking for?</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.68 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I don't have enough information to predict who will win the world cup this year.</p></span>

<span style='color:red'><b>Testing prompt: 2</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.24 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I don't have enough information to predict which football team will win the world cup this year.</p></span>

<span style='color:red'><b>Testing prompt: 3</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 4.23 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I am unable to provide an answer to this question as I don't have access to current information about the upcoming football world cup. My role is to provide information related to the topic of health risks associated with the event.</p></span>

<span style='color:red'><b>Testing prompt: 3_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 5.37 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I am unable to provide an answer to this question as I do not have access to current information about the upcoming football world cup. Additionally, my role is to provide information and analysis related to health risks associated with the event, not to predict the outcome of the competition.</p></span>

<span style='color:red'><b>Testing prompt: 4</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 3.24 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>
I am sorry but I can't provide an answer because I don't have access to the latest information about the football world cup.</p></span>

<span style='color:red'><b>Testing prompt: 4_short</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


The execution time on cuda using mistralai/Mistral-7B-Instruct-v0.1 is 2.85 seconds.


<span style='color:blue'><b>Which football team do you think will win the world cup this year?</b></span>

<span style='color:blue'><p>I am sorry but I need more context to provide an accurate response. Can you please clarify what type of information you are looking for?</p></span>
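
The answers to the two hallucination questions above are judged by reading them. A minimal sketch of how a first-pass flag could be automated, assuming simple keyword matching is good enough to separate clear refusals from answers that need a manual look; `looks_like_refusal` and the marker list are hypothetical and would belong in the external .py file rather than in this notebook:

```python
# Phrases that appear in the refusals printed above (an assumed, non-exhaustive list).
REFUSAL_MARKERS = (
    "i don't know",
    "i don't have",
    "i do not have",
    "i am sorry",
    "i am unable",
    "i can't provide",
    "need more context",
)


def looks_like_refusal(answer):
    """Return True if the answer contains one of the known refusal phrases."""
    text = answer.strip().lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


# Answers copied from the "What are your favorite comedies?" outputs above.
answers = {
    "default": "I am sorry, I do not have access to the specific information you requested.",
    "1": "I don't have personal preferences when it comes to comedy.",
    "3": (
        "I am an AI language model and do not have personal preferences. "
        "However, I can provide you with some information about comedy styles "
        "and how they relate to courtship and humor production."
    ),
    "4": "I hope this helps! Let me know if you have any further questions.",
}

for tag, answer in answers.items():
    status = "refusal" if looks_like_refusal(answer) else "needs manual review"
    print(f"prompt {tag}: {status}")
```

With these markers, prompt 3 (which declines but then offers retrieved content) and prompt 4 (which returns no answer at all) are both sent to manual review rather than counted as clean refusals. The heuristic only narrows down what to read; it does not replace reading the answers.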

### Analysis 2: the exclusion of microsoft/phi-2

<table align="left">
  <tr>
    <th>llm</th>
    <td>microsoft/phi-2</td>
  </tr>
  <tr>
    <th>embedding model</th>
    <td>thenlper/gte-base</td>
  </tr>
  <tr>
    <th>vector storage</th>
    <td>pinecone</td>
  </tr>
  <tr>
    <th>hybrid search</th>
    <td>off</td>
  </tr>
  <tr>
    <th>prompts</th>
    <td>0,1,2,3</td>
  </tr>
</table>

In [7]:
chatbot = MedicalChatbot(llm_model_name="microsoft/phi-2", init_all=True)

Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.



In [8]:
query = "Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?"
chatbot.test_prompts(prompts, query)

<span style='color:red'>Default template:</span>

<p>input_variables=['context', 'question'] template='\n        Context information is below.\n        {context}\n        Given the context information and not prior knowledge, answer the query.\n        Query: {question}\n        Use maximum three sentences.\n        Answer:\n        '</p>

Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


The execution time on mps using microsoft/phi-2 is 39.14 seconds.


<span style='color:blue'><b>Is occupational outcome in bipolar disorder predicted by premorbid functioning and intelligence?</b></span>

<span style='color:blue'><p>Yes, according to the results presented in the abstracts above, it appears that occupational outcome in bipolar disorder is predicted by premorbid functioning and intelligence. Specifically, individuals with lower premorbid functioning and intelligence, as well as those experiencing a more severe clinical course of illness, are at an increased risk of receiving disability benefits related to their condition. These findings suggest that addressing these factors through appropriate interventions could potentially improve occupational outcomes for individuals with bipolar disorder.

</p></span>

<span style='color:red'><b>Testing prompt: 1</b></span>

New prompt template set successfully.

Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


In [None]:
query = "What are the effects of α1-antitrypsin (AAT) treatment on chronic fatigue syndrome (CFS) based on a case study involving a 49-year-old woman?"
chatbot.test_prompts(prompts, query)

In [None]:
# Hallucination test: off-topic question the retrieved context should not answer
query = "What is football?"
chatbot.test_prompts(prompts, query)