In [None]:
# Make sure to set your OpenAI API key in the .env file
import openai
import os
from dotenv import load_dotenv

load_dotenv()
openai.api_key = os.getenv("OPENAI_API_KEY")

Steps:

1. Break the problem down into sub-problems
2. Write one prompt per sub-problem
3. Evaluate performance on each sub-problem
4. Compose your successful prompts
5. ? 

__Task:__

__Automatic reference section from a markdown essay.__ 

Breaking it down

1. Identify all the citations with the respective source
2. For each citation produce the correct reference bullet point
3. Convert to a markdown format with the url hyper link syntax: `[]()`
4. Add the reference section at the end

Now, why separate out the second part and write Python code for it rather than ask the model? Because that part is deterministic: 
we save tokens and reduce complexity, so the model can focus solely on correctly extracting the citations.
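
As a rough illustration of why a deterministic step needs no model call, the sketch below appends a reference section to an essay using plain string handling. The function name `append_reference_section`, the default `# References:` header, and the sample strings are assumptions made for this example, not part of the notebook.

```python
def append_reference_section(essay: str, bullets: str, header: str = "# References:") -> str:
    """Return the essay with a reference section appended at the end."""
    # Trim surrounding whitespace so the section is separated from the
    # essay body by exactly one blank line.
    return f"{essay.rstrip()}\n\n{header}\n{bullets.strip()}\n"


# Made-up sample data, only to show the shape of the result.
sample_essay = "Smith et al. (2023) studied LLMs (https://example.org/llms)."
sample_bullets = "- [Smith et al. (2023)](https://example.org/llms)"
print(append_reference_section(sample_essay, sample_bullets))
```

No tokens are spent here, and the result is the same on every run, which is the point of keeping this step out of the prompt.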

Example draft essay:

```
In the evolving field of artificial intelligence, recent studies have highlighted the profound impact of large language models (LLMs) on natural language processing capabilities. Smith et al. (2023) in their groundbreaking research, presented in the Journal of AI Research, discussed how the integration of LLMs has revolutionized machine translation, making it significantly more accurate and context-aware (https://www.journalofairesearch.org/integration-of-llms). This leap in technology underscores the need for continuous innovation and ethical considerations in AI development. Additionally, the work by Davis and O'Neil (2024) sheds light on the ethical implications of LLMs in content generation, revealing potential biases that could perpetuate misinformation if not addressed properly (https://www.ethicsinaijournal.org/llm-content-generation-implications).

Conversely, initiatives to make LLMs more transparent and accountable have been gaining traction. The OpenAI team's recent publication (2025) in the AI Transparency Review highlights the successful implementation of new algorithms that enhance the interpretability of LLM decisions, thereby making them more reliable and trustworthy (https://www.aitransparencyreview.org/enhancing-llm-interpretability). According to Anderson and Yamamoto (2025), these advancements not only contribute to the reliability of AI systems but also foster a deeper understanding among users, facilitating a more informed and responsible use of AI technologies (https://www.aiuserinsights.org/understanding-ai-decisions). This body of work underscores the dynamic nature of AI research and the collaborative effort required to harness the full potential of LLMs while mitigating associated risks.
```

In [2]:
from openai import OpenAI

client = OpenAI()

def get_response(prompt_question):
    response = client.chat.completions.create(
        model="gpt-3.5-turbo-0125",
        messages=[{"role": "system", "content": "You are a helpful research and programming assistant"},
                  {"role": "user", "content": prompt_question}]
    )
    
    return response.choices[0].message.content

In [4]:
instruction = "Extract all the citations in the following draft essay with their respective source:"

In [15]:
input_data = """'''\nIn the evolving field of artificial intelligence, recent studies have highlighted the profound impact of large language models (LLMs) on natural language processing capabilities. Smith et al. (2023) in their groundbreaking research, presented in the Journal of AI Research, discussed how the integration of LLMs has revolutionized machine translation, making it significantly more accurate and context-aware (https://www.journalofairesearch.org/integration-of-llms). This leap in technology underscores the need for continuous innovation and ethical considerations in AI development. Additionally, the work by Davis and O'Neil (2024) sheds light on the ethical implications of LLMs in content generation, revealing potential biases that could perpetuate misinformation if not addressed properly (https://www.ethicsinaijournal.org/llm-content-generation-implications).

Conversely, initiatives to make LLMs more transparent and accountable have been gaining traction. The OpenAI team's recent publication (2025) in the AI Transparency Review highlights the successful implementation of new algorithms that enhance the interpretability of LLM decisions, thereby making them more reliable and trustworthy (https://www.aitransparencyreview.org/enhancing-llm-interpretability). According to Anderson and Yamamoto (2025), these advancements not only contribute to the reliability of AI systems but also foster a deeper understanding among users, facilitating a more informed and responsible use of AI technologies (https://www.aiuserinsights.org/understanding-ai-decisions). This body of work underscores the dynamic nature of AI research and the collaborative effort required to harness the full potential of LLMs while mitigating associated risks. '''\n"""

In [6]:
context = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."
output_indicator = "Your output should be just bullet points containing all the citations and their sources. Output: \n\n"

In [17]:
from IPython.display import Markdown
prompt = f"{context}\n\n{instruction}\n\n{input_data}\n\n{output_indicator}"
Markdown(prompt)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Extract all the citations in the following draft essay with their respective source:

'''
In the evolving field of artificial intelligence, recent studies have highlighted the profound impact of large language models (LLMs) on natural language processing capabilities. Smith et al. (2023) in their groundbreaking research, presented in the Journal of AI Research, discussed how the integration of LLMs has revolutionized machine translation, making it significantly more accurate and context-aware (https://www.journalofairesearch.org/integration-of-llms). This leap in technology underscores the need for continuous innovation and ethical considerations in AI development. Additionally, the work by Davis and O'Neil (2024) sheds light on the ethical implications of LLMs in content generation, revealing potential biases that could perpetuate misinformation if not addressed properly (https://www.ethicsinaijournal.org/llm-content-generation-implications).

Conversely, initiatives to make LLMs more transparent and accountable have been gaining traction. The OpenAI team's recent publication (2025) in the AI Transparency Review highlights the successful implementation of new algorithms that enhance the interpretability of LLM decisions, thereby making them more reliable and trustworthy (https://www.aitransparencyreview.org/enhancing-llm-interpretability). According to Anderson and Yamamoto (2025), these advancements not only contribute to the reliability of AI systems but also foster a deeper understanding among users, facilitating a more informed and responsible use of AI technologies (https://www.aiuserinsights.org/understanding-ai-decisions). This body of work underscores the dynamic nature of AI research and the collaborative effort required to harness the full potential of LLMs while mitigating associated risks. '''


Your output should be just bullet points containing all the citations and their sources. Output: 
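
The prompt above is assembled from four parts: context, instruction, input data, and output indicator. Since the same f-string pattern recurs throughout this notebook, a small helper might make the composition explicit. This is only a sketch; `build_prompt` is a hypothetical name and is not used elsewhere in the notebook.

```python
def build_prompt(context: str, instruction: str, input_data: str, output_indicator: str = "") -> str:
    """Join the prompt components with blank lines, skipping any empty part."""
    parts = [context, instruction, input_data, output_indicator]
    return "\n\n".join(part for part in parts if part)


# Made-up sample values, only to show the resulting layout.
print(build_prompt(
    "You are a helpful research assistant.",
    "Extract all the citations in the following draft essay:",
    "'''<essay text>'''",
    "Your output should be just bullet points. Output:",
))
```

Skipping empty parts means a prompt without an output indicator does not end in dangling blank lines.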



In [19]:
output = get_response(prompt)

Markdown(output)

- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms
- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications
- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability
- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions

In [21]:
def add_reference_header(output_citation_bullets):
    return "# References:\n" + output_citation_bullets

output_reference_section = add_reference_header(output)

Markdown(output_reference_section)

# References:
- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms
- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications
- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability
- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions

__Sub-task 3__

In [22]:
output_reference_section

"# References:\n- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms\n- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications\n- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability\n- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions"

In [23]:
context = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction = "Convert the following bullet points into markdown format:"

In [25]:
input_data = output_reference_section
input_data

"# References:\n- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms\n- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications\n- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability\n- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions"

In [27]:
prompt = f"{context}\n\n{instruction}\n\n{input_data}"

In [28]:
Markdown(prompt)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Convert the following bullet points into markdown format:

# References:
- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms
- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications
- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability
- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions

In [29]:
output = get_response(prompt)

Markdown(output) 

```markdown
# References:
- Smith et al. (2023) - Journal of AI Research - [Integration of LLMs](https://www.journalofairesearch.org/integration-of-llms)
- Davis and O'Neil (2024) - Ethics in AI Journal - [LLM Content Generation Implications](https://www.ethicsinaijournal.org/llm-content-generation-implications)
- OpenAI team (2025) - AI Transparency Review - [Enhancing LLM Interpretability](https://www.aitransparencyreview.org/enhancing-llm-interpretability)
- Anderson and Yamamoto (2025) - AI User Insights - [Understanding AI Decisions](https://www.aiuserinsights.org/understanding-ai-decisions)
```

# References:
- Smith et al. (2023) - Journal of AI Research - [Integration of LLMs](https://www.journalofairesearch.org/integration-of-llms)
- Davis and O'Neil (2024) - Ethics in AI Journal - [LLM Content Generation Implications](https://www.ethicsinaijournal.org/llm-content-generation-implications)
- OpenAI team (2025) - AI Transparency Review - [Enhancing LLM Interpretability](https://www.aitransparencyreview.org/enhancing-llm-interpretability)
- Anderson and Yamamoto (2025) - AI User Insights - [Understanding AI Decisions](https://www.aiuserinsights.org/understanding-ai-decisions)

The output here looks off: each reference now has three parts instead of just two, the sentence description and the URL source.

In [30]:
context = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction = "Convert the following bullet points into markdown format:"

In [31]:
input_data

"# References:\n- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms\n- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications\n- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability\n- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions"

In [32]:
output_indicator = "Your output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this: '''[<anchor text>](<url source>)'''. Output:"

In [33]:
prompt = f"{context}\n\n{instruction}\n\n{input_data}\n\n{output_indicator}"

Markdown(prompt)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Convert the following bullet points into markdown format:

# References:
- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms
- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications
- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability
- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions

Your output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this: '''[<anchor text>](<url source>)'''. Output:

In [34]:
output = get_response(prompt)

Markdown(output)

## References:
- [Smith et al. (2023)](https://www.journalofairesearch.org/integration-of-llms) - Journal of AI Research 
- [Davis and O'Neil (2024)](https://www.ethicsinaijournal.org/llm-content-generation-implications) - Ethics in AI Journal 
- [OpenAI team (2025)](https://www.aitransparencyreview.org/enhancing-llm-interpretability) - AI Transparency Review 
- [Anderson and Yamamoto (2025)](https://www.aiuserinsights.org/understanding-ai-decisions) - AI User Insights 

In [35]:
output

"## References:\n- [Smith et al. (2023)](https://www.journalofairesearch.org/integration-of-llms) - Journal of AI Research \n- [Davis and O'Neil (2024)](https://www.ethicsinaijournal.org/llm-content-generation-implications) - Ethics in AI Journal \n- [OpenAI team (2025)](https://www.aitransparencyreview.org/enhancing-llm-interpretability) - AI Transparency Review \n- [Anderson and Yamamoto (2025)](https://www.aiuserinsights.org/understanding-ai-decisions) - AI User Insights "

In [40]:
output_indicator = "Your output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this:\n\n'''- [<all the anchor text>](<url source>)\n\n- [<all the anchor text>](<url source>) ...'''. Output:"

In [43]:
prompt = f"{context}\n\n{instruction}\n\n{input_data}\n\n{output_indicator}"

prompt

"You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.\n\nConvert the following bullet points into markdown format:\n\n# References:\n- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms\n- Davis and O'Neil (2024) - Ethics in AI Journal - https://www.ethicsinaijournal.org/llm-content-generation-implications\n- OpenAI team (2025) - AI Transparency Review - https://www.aitransparencyreview.org/enhancing-llm-interpretability\n- Anderson and Yamamoto (2025) - AI User Insights - https://www.aiuserinsights.org/understanding-ai-decisions\n\nYour output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this:\n\n'''- [<all the anchor text>](<url source>)\n\n- [<all the anchor text>](<url source>) ...'''. Output:"

In [44]:
output = get_response(prompt)

Markdown(output)

- [Smith et al. (2023)](https://www.journalofairesearch.org/integration-of-llms)
- [Davis and O'Neil (2024)](https://www.ethicsinaijournal.org/llm-content-generation-implications)
- [OpenAI team (2025)](https://www.aitransparencyreview.org/enhancing-llm-interpretability)
- [Anderson and Yamamoto (2025)](https://www.aiuserinsights.org/understanding-ai-decisions)

In [45]:
output

"- [Smith et al. (2023)](https://www.journalofairesearch.org/integration-of-llms)\n- [Davis and O'Neil (2024)](https://www.ethicsinaijournal.org/llm-content-generation-implications)\n- [OpenAI team (2025)](https://www.aitransparencyreview.org/enhancing-llm-interpretability)\n- [Anderson and Yamamoto (2025)](https://www.aiuserinsights.org/understanding-ai-decisions)"

Perfect!
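
Because the extracted bullets follow a regular `- <description> - <url>` shape, this conversion could also be done without the model. The sketch below is an alternative to the prompt-based approach used above, not the method this notebook follows. It assumes each bullet ends with a single URL preceded by a `-` or `:` separator; `bullets_to_links` and `BULLET_RE` are hypothetical names.

```python
import re

# A bullet is "- <text> <separator> <url>", where the separator is "-" or ":".
BULLET_RE = re.compile(r"^-\s+(?P<text>.+?)\s*[-:]\s*(?P<url>https?://\S+)\s*$")


def bullets_to_links(bullets: str) -> str:
    """Rewrite '- text - url' bullets as '- [text](url)'; leave other lines untouched."""
    converted = []
    for line in bullets.splitlines():
        match = BULLET_RE.match(line.strip())
        if match:
            converted.append(f"- [{match['text']}]({match['url']})")
        else:
            # Headers and lines without a URL pass through unchanged.
            converted.append(line)
    return "\n".join(converted)


sample = "- Smith et al. (2023) - Journal of AI Research - https://www.journalofairesearch.org/integration-of-llms"
print(bullets_to_links(sample))
```

Unlike the prompt, this keeps the whole description as the anchor text, so the journal name is preserved. The trade-off is that it breaks as soon as the model's bullet format drifts from the assumed pattern.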

For the final sub-task we won't use ChatGPT, because adding the header takes nothing more than a simple function call.

In [46]:
def add_reference_header(output_citation_bullets):
    return "# References:\n" + output_citation_bullets

output_reference_section = add_reference_header(output)
Markdown(output_reference_section)

# References:
- [Smith et al. (2023)](https://www.journalofairesearch.org/integration-of-llms)
- [Davis and O'Neil (2024)](https://www.ethicsinaijournal.org/llm-content-generation-implications)
- [OpenAI team (2025)](https://www.aitransparencyreview.org/enhancing-llm-interpretability)
- [Anderson and Yamamoto (2025)](https://www.aiuserinsights.org/understanding-ai-decisions)

Now, let's put everything together!

In [47]:
context1_2 = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction1_2 = """Extract all the citations in the following draft essay with their respective source and output a bullet list with all the citations and their sources"""

output_indicator1_2 = "Your output should be just bullet points containing all the citations and their sources. Output:"

# I generated this mock draft with GPT-4 using the web interface
input_data1_2 = """The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).

In efforts to make AI tools in healthcare more transparent and accountable, the recent initiative by the AI Health Transparency Board (2026) aims to establish standards for AI applications in medicine (https://www.aihealthtransparency.org/standards). These standards are designed to ensure that AI health tools are reliable and their functions are clear to both healthcare providers and patients, fostering trust and understanding in AI-driven processes (https://www.aihealthtrust.org/ai-clarity).

Furthermore, the collaboration between AI developers and healthcare professionals is crucial for optimizing the benefits of AI in healthcare, as discussed by Martinez and Liu (2027) in *AI and Healthcare Collaboration Review*. Their research indicates that cooperative development leads to more effective and user-friendly AI solutions that are better aligned with clinical needs (https://www.aihealthcollaboration.org/effective-partnerships).

This collective body of research and initiatives highlights the dynamic and evolving nature of AI in healthcare, emphasizing the crucial balance between innovation, ethics, and user-centric development to fully realize the potential of AI in improving health outcomes and patient care."""

In [53]:
prompt1_2 = f"{context1_2}\n\n{instruction1_2}\n\n{input_data1_2}\n\n{output_indicator1_2}"

In [54]:
output1_2 = get_response(prompt1_2)
Markdown(output1_2)

- Johnson et al. (2024) - *Journal of Medical AI Research* - https://www.medicalairesearch.org/ai-in-diagnostics
- Thompson and colleagues (2023) - *Journal of Personalized Medicine* - https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai
- Gupta and Singh (2025) - *AI Ethics Journal* - https://www.aiethicsjournal.org/unbiased-ai-healthcare
- AI Health Transparency Board (2026) - https://www.aihealthtransparency.org/standards
- Martinez and Liu (2027) - *AI and Healthcare Collaboration Review* - https://www.aihealthcollaboration.org/effective-partnerships

In [55]:
output1_2

'- Johnson et al. (2024) - *Journal of Medical AI Research* - https://www.medicalairesearch.org/ai-in-diagnostics\n- Thompson and colleagues (2023) - *Journal of Personalized Medicine* - https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai\n- Gupta and Singh (2025) - *AI Ethics Journal* - https://www.aiethicsjournal.org/unbiased-ai-healthcare\n- AI Health Transparency Board (2026) - https://www.aihealthtransparency.org/standards\n- Martinez and Liu (2027) - *AI and Healthcare Collaboration Review* - https://www.aihealthcollaboration.org/effective-partnerships'

Problem here:

ChatGPT did not identify all the URLs! Fortunately, checking for this particular case is easy to automate.
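
One way such a check could look: collect every URL in the essay with a regular expression and report those absent from the model's output. This is a minimal sketch. `missing_urls` and `URL_RE` are hypothetical names, and the pattern assumes that URLs contain neither whitespace nor a closing parenthesis, which holds for the mock drafts in this notebook but not for URLs in general.

```python
import re
from typing import List

# Match http(s) URLs up to the next whitespace or closing parenthesis.
URL_RE = re.compile(r"https?://[^\s)]+")


def missing_urls(essay: str, model_output: str) -> List[str]:
    """Return the essay URLs that do not appear in the model output, in essay order."""
    found = set(URL_RE.findall(model_output))
    missing: List[str] = []
    for url in URL_RE.findall(essay):
        if url not in found and url not in missing:
            missing.append(url)
    return missing


# Made-up sample data: the second URL is dropped from the output.
sample_essay = "First claim (https://a.org/x). Second claim (https://b.org/y)."
sample_output = "- First claim - https://a.org/x"
print(missing_urls(sample_essay, sample_output))
```

Called as `missing_urls(input_data1_2, output1_2)` on the healthcare draft, this should flag the two source URLs that have no named author in the text (`https://www.healthprivacy.org/ai-ethics` and `https://www.aihealthtrust.org/ai-clarity`), both of which are absent from the bullet list above.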

In [57]:
context1_2 = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction1_2 = """Extract all the citations in the following draft essay with their respective source."""

output_indicator1_2 = "Your output should be the bullet points containing all the citations and their sources. Output:"

# I generated this mock draft with GPT-4 using the web interface
input_data1_2 = """'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).

In efforts to make AI tools in healthcare more transparent and accountable, the recent initiative by the AI Health Transparency Board (2026) aims to establish standards for AI applications in medicine (https://www.aihealthtransparency.org/standards). These standards are designed to ensure that AI health tools are reliable and their functions are clear to both healthcare providers and patients, fostering trust and understanding in AI-driven processes (https://www.aihealthtrust.org/ai-clarity).

Furthermore, the collaboration between AI developers and healthcare professionals is crucial for optimizing the benefits of AI in healthcare, as discussed by Martinez and Liu (2027) in *AI and Healthcare Collaboration Review*. Their research indicates that cooperative development leads to more effective and user-friendly AI solutions that are better aligned with clinical needs (https://www.aihealthcollaboration.org/effective-partnerships).

This collective body of research and initiatives highlights the dynamic and evolving nature of AI in healthcare, emphasizing the crucial balance between innovation, ethics, and user-centric development to fully realize the potential of AI in improving health outcomes and patient care.'''"""

In [59]:
prompt1_2 = f"{context1_2}\n\n{instruction1_2}\n\n{input_data1_2}\n\n{output_indicator1_2}"

Markdown(prompt1_2)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Extract all the citations in the following draft essay with their respective source.

'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).

In efforts to make AI tools in healthcare more transparent and accountable, the recent initiative by the AI Health Transparency Board (2026) aims to establish standards for AI applications in medicine (https://www.aihealthtransparency.org/standards). These standards are designed to ensure that AI health tools are reliable and their functions are clear to both healthcare providers and patients, fostering trust and understanding in AI-driven processes (https://www.aihealthtrust.org/ai-clarity).

Furthermore, the collaboration between AI developers and healthcare professionals is crucial for optimizing the benefits of AI in healthcare, as discussed by Martinez and Liu (2027) in *AI and Healthcare Collaboration Review*. Their research indicates that cooperative development leads to more effective and user-friendly AI solutions that are better aligned with clinical needs (https://www.aihealthcollaboration.org/effective-partnerships).

This collective body of research and initiatives highlights the dynamic and evolving nature of AI in healthcare, emphasizing the crucial balance between innovation, ethics, and user-centric development to fully realize the potential of AI in improving health outcomes and patient care.'''

Your output should be the bullet points containing all the citations and their sources. Output:

In [60]:
output1_2 = get_response(prompt1_2)
Markdown(output1_2)

- Johnson et al. (2024) in the *Journal of Medical AI Research*: https://www.medicalairesearch.org/ai-in-diagnostics
- Thompson and colleagues (2023) in the *Journal of Personalized Medicine*: https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai
- Gupta and Singh (2025) in the *AI Ethics Journal*: https://www.aiethicsjournal.org/unbiased-ai-healthcare
- AI Health Transparency Board (2026): https://www.aihealthtransparency.org/standards
- Martinez and Liu (2027) in *AI and Healthcare Collaboration Review*: https://www.aihealthcollaboration.org/effective-partnerships

In [61]:
output1_2

'- Johnson et al. (2024) in the *Journal of Medical AI Research*: https://www.medicalairesearch.org/ai-in-diagnostics\n- Thompson and colleagues (2023) in the *Journal of Personalized Medicine*: https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai\n- Gupta and Singh (2025) in the *AI Ethics Journal*: https://www.aiethicsjournal.org/unbiased-ai-healthcare\n- AI Health Transparency Board (2026): https://www.aihealthtransparency.org/standards\n- Martinez and Liu (2027) in *AI and Healthcare Collaboration Review*: https://www.aihealthcollaboration.org/effective-partnerships'

In [62]:
def get_response(prompt_question, system_message="You are a helpful research and programming assistant"):
    response = client.chat.completions.create(
        model="gpt-3.5-turbo-0125",
        messages=[{"role": "system", "content": system_message},
                  {"role": "user", "content": prompt_question}]
    )
    
    return response.choices[0].message.content

In [75]:
context1_2 = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction1_2 = """Extract all the citations in the following draft essay with their respective source."""

output_indicator1_2 = "Your output should be the bullet points containing all the citations and their sources. Output:"

# Here we cut the input data in half to see whether the extraction improves.
input_data1_2 = """'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).'''"""

prompt1_2 = f"{context1_2}\n\n{instruction1_2}\n\n{input_data1_2}\n\n{output_indicator1_2}"

Markdown(prompt1_2)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Extract all the citations in the following draft essay with their respective source.

'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).'''

Your output should be the bullet points containing all the citations and their sources. Output:

In [73]:
output1_2 = get_response(prompt1_2)
Markdown(output1_2)

- Johnson et al. (2024) in the *Journal of Medical AI Research* - https://www.medicalairesearch.org/ai-in-diagnostics
- Thompson and colleagues (2023) in the *Journal of Personalized Medicine* - https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai
- Gupta and Singh (2025) in the *AI Ethics Journal* - https://www.aiethicsjournal.org/unbiased-ai-healthcare

In [74]:
output1_2

'- Johnson et al. (2024) in the *Journal of Medical AI Research* - https://www.medicalairesearch.org/ai-in-diagnostics\n- Thompson and colleagues (2023) in the *Journal of Personalized Medicine* - https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai\n- Gupta and Singh (2025) in the *AI Ethics Journal* - https://www.aiethicsjournal.org/unbiased-ai-healthcare'
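
The output above lists three links, while the draft passed to the model contains four: the uncited `https://www.healthprivacy.org/ai-ethics` is missing. This kind of omission can be caught without reading the output by hand. The following is a minimal sketch, assuming URLs in the draft are wrapped in parentheses as in the examples here; `URL_PATTERN` and `missing_urls` are hypothetical names, not part of the original notebook.

```python
import re
from typing import List

# Hypothetical helper, not part of the original notebook.
# Matches http(s) URLs and stops at whitespace or a closing parenthesis/bracket.
URL_PATTERN = re.compile(r"https?://[^\s)\]]+")


def missing_urls(draft: str, extracted: str) -> List[str]:
    """Return the URLs found in the draft that do not appear in the extracted bullets."""
    extracted_urls = set(URL_PATTERN.findall(extracted))
    missing = []
    for url in URL_PATTERN.findall(draft):
        # Keep draft order and skip duplicates.
        if url not in extracted_urls and url not in missing:
            missing.append(url)
    return missing
```

Calling `missing_urls(input_data1_2, output1_2)` on this first attempt should flag the health privacy link, which gives a concrete signal for evaluating the next prompt revision.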

In [76]:
context1_2 = "You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports."

instruction1_2 = """Extract all the citations in the following draft essay with their respective source."""

output_indicator1_2 = "Your output should be the bullet points containing all the citations and their urls sources. Make sure to extract all the links in the draft. Output:"

# Use only the first half of the essay so it is easier to tell whether a prompt change improves the result.
input_data1_2 = """'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).'''"""

prompt1_2 = f"{context1_2}\n\n{instruction1_2}\n\n{input_data1_2}\n\n{output_indicator1_2}"

Markdown(prompt1_2)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Extract all the citations in the following draft essay with their respective source.

'''The rapid progression of artificial intelligence (AI) has brought significant transformations to numerous sectors, with healthcare being a prominent field experiencing revolutionary changes. In a recent publication by Johnson et al. (2024) in the *Journal of Medical AI Research*, the integration of AI in diagnostic processes has notably increased the accuracy and speed of medical diagnostics (https://www.medicalairesearch.org/ai-in-diagnostics). This evolution in medical technology emphasizes the potential of AI to enhance patient outcomes and streamline healthcare services.

Moreover, the use of AI in personalized medicine is becoming increasingly significant, as highlighted by Thompson and colleagues (2023) in their study published in the *Journal of Personalized Medicine*. They discuss how machine learning models are used to tailor treatments based on individual genetic profiles, significantly improving treatment efficacy (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai). The implications of such advancements underscore the need for robust data protection regulations and ethical guidelines to manage patient data sensitively and securely (https://www.healthprivacy.org/ai-ethics).

The challenge of ensuring fairness and eliminating bias in AI algorithms is also critical, especially in high-stakes fields like healthcare. Research by Gupta and Singh (2025) presented in the *AI Ethics Journal* examines the latent biases in AI systems that can lead to disparities in patient care, stressing the importance of developing unbiased AI tools (https://www.aiethicsjournal.org/unbiased-ai-healthcare).'''

Your output should be the bullet points containing all the citations and their urls sources. Make sure to extract all the links in the draft. Output:

In [77]:
output1_2 = get_response(prompt1_2)

Markdown(output1_2)

- Johnson et al. (2024) in the *Journal of Medical AI Research* (https://www.medicalairesearch.org/ai-in-diagnostics)
- Thompson and colleagues (2023) in the *Journal of Personalized Medicine* (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)
- Gupta and Singh (2025) in the *AI Ethics Journal* (https://www.aiethicsjournal.org/unbiased-ai-healthcare)
- https://www.healthprivacy.org/ai-ethics

In [78]:
output1_2

'- Johnson et al. (2024) in the *Journal of Medical AI Research* (https://www.medicalairesearch.org/ai-in-diagnostics)\n- Thompson and colleagues (2023) in the *Journal of Personalized Medicine* (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)\n- Gupta and Singh (2025) in the *AI Ethics Journal* (https://www.aiethicsjournal.org/unbiased-ai-healthcare)\n- https://www.healthprivacy.org/ai-ethics'

In [80]:
context3 = context1_2

instruction3 = "Convert the following bullet points into markdown format:"

output_indicator3 = "Your output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this:\n\n'''- [<all the anchor text>](<url source>)\n\n- [<all the anchor text>](<url source>) ...'''. Output:"

input_data3 = "'''" + output1_2 + "'''"

prompt3 = f"{context3}\n\n{instruction3}\n\n{input_data3}\n\n{output_indicator3}"

Markdown(prompt3)

You are a helpful research assistant specialized in extracting useful information from content and helping with research essays and reports.

Convert the following bullet points into markdown format:

'''- Johnson et al. (2024) in the *Journal of Medical AI Research* (https://www.medicalairesearch.org/ai-in-diagnostics)
- Thompson and colleagues (2023) in the *Journal of Personalized Medicine* (https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)
- Gupta and Singh (2025) in the *AI Ethics Journal* (https://www.aiethicsjournal.org/unbiased-ai-healthcare)
- https://www.healthprivacy.org/ai-ethics'''

Your output should be the bullet point list with the source references as the hyperlinks and the descriptions as the anchor text like this:

'''- [<all the anchor text>](<url source>)

- [<all the anchor text>](<url source>) ...'''. Output:

In [81]:
output3 = get_response(prompt3)

Markdown(output3)

- [Johnson et al. (2024) in the *Journal of Medical AI Research](https://www.medicalairesearch.org/ai-in-diagnostics)
- [Thompson and colleagues (2023) in the *Journal of Personalized Medicine*](https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)
- [Gupta and Singh (2025) in the *AI Ethics Journal*](https://www.aiethicsjournal.org/unbiased-ai-healthcare)
- [Health Privacy](https://www.healthprivacy.org/ai-ethics)

In [82]:
output3

'- [Johnson et al. (2024) in the *Journal of Medical AI Research](https://www.medicalairesearch.org/ai-in-diagnostics)\n- [Thompson and colleagues (2023) in the *Journal of Personalized Medicine*](https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)\n- [Gupta and Singh (2025) in the *AI Ethics Journal*](https://www.aiethicsjournal.org/unbiased-ai-healthcare)\n- [Health Privacy](https://www.healthprivacy.org/ai-ethics)'
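
Because the bullets returned by the extraction step follow a regular `- text (url)` shape, this conversion could also be done deterministically instead of with a second model call. The following is a minimal sketch under that assumption; `bullet_to_markdown` and `bullets_to_markdown` are hypothetical helpers, and falling back to the URL host as anchor text for bare links is an arbitrary choice (the model chose "Health Privacy" instead).

```python
import re
from urllib.parse import urlparse

# Hypothetical helpers, not part of the original notebook.
URL_PATTERN = re.compile(r"https?://[^\s)]+")


def bullet_to_markdown(line: str) -> str:
    """Turn '- text (url)', '- text - url', '- text: url' or '- url' into '- [text](url)'."""
    match = URL_PATTERN.search(line)
    if match is None:
        return line  # No URL found: leave the line untouched.
    url = match.group(0)
    # Drop the leading bullet marker and any separator left before the URL.
    text = line[:match.start()].lstrip("- ").rstrip(" :-(")
    if not text:
        text = urlparse(url).netloc  # Bare link: use the host name as anchor text.
    return f"- [{text}]({url})"


def bullets_to_markdown(bullets: str) -> str:
    """Convert every non-empty bullet line in the extracted output."""
    return "\n".join(bullet_to_markdown(line) for line in bullets.splitlines() if line.strip())
```

A deterministic converter like this costs no tokens and cannot drop characters from the anchor text, but it only works as long as the extraction prompt keeps producing one of the bullet shapes it expects.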

Putting everything together!

In [83]:
output4 = add_reference_header(output3)

Markdown(output4)

# References:
- [Johnson et al. (2024) in the *Journal of Medical AI Research](https://www.medicalairesearch.org/ai-in-diagnostics)
- [Thompson and colleagues (2023) in the *Journal of Personalized Medicine*](https://www.journalofpersonalizedmedicine.org/tailoring-treatments-with-ai)
- [Gupta and Singh (2025) in the *AI Ethics Journal*](https://www.aiethicsjournal.org/unbiased-ai-healthcare)
- [Health Privacy](https://www.healthprivacy.org/ai-ethics)

In [84]:
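def create_reference_section(input_data):
    """Extract all the citations in the input data and output a markdown-formatted reference section."""
    # Rebuild each prompt around the new input; prompt1_2 and prompt3 still embed the earlier example essay.
    prompt_refs = f"{context1_2}\n\n{instruction1_2}\n\n'''{input_data}'''\n\n{output_indicator1_2}"
    output_refs = get_response(prompt_refs)
    prompt_markdown = f"{context3}\n\n{instruction3}\n\n'''{output_refs}'''\n\n{output_indicator3}"
    output_refs_markdown = get_response(prompt_markdown)
    output_refs_markdown_with_header = add_reference_header(output_refs_markdown)
    return output_refs_markdown_with_header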
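
When composing prompts like this, it can help to check the chaining logic without spending API calls. The following is a minimal sketch of one way to do that, by passing the response function in as an argument so a fake can stand in for the model; `build_prompt` and `run_pipeline` are hypothetical names, and the four-part prompt layout is assumed from the cells above.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical helpers, not part of the original notebook.
Step = Tuple[str, str, str]  # (context, instruction, output_indicator)


def build_prompt(context: str, instruction: str, input_data: str, output_indicator: str) -> str:
    """Assemble a prompt in the context / instruction / input / output-indicator layout."""
    return f"{context}\n\n{instruction}\n\n'''{input_data}'''\n\n{output_indicator}"


def run_pipeline(draft: str, respond: Callable[[str], str], steps: Sequence[Step]) -> str:
    """Run the draft through each step, feeding every output into the next prompt."""
    data = draft
    for context, instruction, output_indicator in steps:
        data = respond(build_prompt(context, instruction, data, output_indicator))
    return data
```

With the real model, `respond` would be `get_response`; in a test it can be any function that records the prompts it receives and returns canned text.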

In [85]:
input_data_test = """
The field of artificial intelligence (AI) continues to evolve rapidly, with recent advancements significantly enhancing various capabilities, particularly in natural language processing (NLP). A pivotal study by Brown et al. (2023) published in the AI Advancements Journal highlights the development of advanced algorithms that improve the efficiency and accuracy of language models (https://www.aiadvancementsjournal.org/advanced-algorithms). These improvements have opened new avenues for real-time translation services and more nuanced human-computer interactions, setting a new standard in technological integration.

Furthermore, the implications of AI in decision-making processes have prompted a considerable amount of ethical scrutiny. A report by Thompson and Lee (2024) in the AI Ethics Quarterly discusses how AI systems are increasingly used in judicial and healthcare settings, raising concerns about fairness and transparency (https://www.aiethicsquarterly.org/ai-judicial-use). The potential for AI to impact significant societal decisions necessitates ongoing research to ensure ethical deployment, as explored by Morales and Kumar (2024) who emphasize the need for robust frameworks to manage AI's societal impacts (https://www.aisocietalimpact.org/frameworks).

In response to these challenges, initiatives aiming to enhance the accountability of AI systems have gained momentum. Notably, an initiative detailed by Zhao and Michelson (2025) in the AI Accountability Forum describes efforts to implement standards that ensure AI systems are auditable and explainable, enhancing public trust in AI technologies (https://www.aiaccountabilityforum.org/standards). This initiative is part of a broader movement to secure a responsible future for AI, as discussed by Patel and Singh (2025), who highlight collaborative efforts to harmonize international AI regulations (https://www.globalairegulations.org/harmonization). This collective endeavor is critical to harnessing AI's full potential while safeguarding against its risks.
"""

Markdown(input_data_test)


The field of artificial intelligence (AI) continues to evolve rapidly, with recent advancements significantly enhancing various capabilities, particularly in natural language processing (NLP). A pivotal study by Brown et al. (2023) published in the AI Advancements Journal highlights the development of advanced algorithms that improve the efficiency and accuracy of language models (https://www.aiadvancementsjournal.org/advanced-algorithms). These improvements have opened new avenues for real-time translation services and more nuanced human-computer interactions, setting a new standard in technological integration.

Furthermore, the implications of AI in decision-making processes have prompted a considerable amount of ethical scrutiny. A report by Thompson and Lee (2024) in the AI Ethics Quarterly discusses how AI systems are increasingly used in judicial and healthcare settings, raising concerns about fairness and transparency (https://www.aiethicsquarterly.org/ai-judicial-use). The potential for AI to impact significant societal decisions necessitates ongoing research to ensure ethical deployment, as explored by Morales and Kumar (2024) who emphasize the need for robust frameworks to manage AI's societal impacts (https://www.aisocietalimpact.org/frameworks).

In response to these challenges, initiatives aiming to enhance the accountability of AI systems have gained momentum. Notably, an initiative detailed by Zhao and Michelson (2025) in the AI Accountability Forum describes efforts to implement standards that ensure AI systems are auditable and explainable, enhancing public trust in AI technologies (https://www.aiaccountabilityforum.org/standards). This initiative is part of a broader movement to secure a responsible future for AI, as discussed by Patel and Singh (2025), who highlight collaborative efforts to harmonize international AI regulations (https://www.globalairegulations.org/harmonization). This collective endeavor is critical to harnessing AI's full potential while safeguarding against its risks.


In [86]:
output_refs_section = create_reference_section(input_data_test)

Markdown(output_refs_section)

# References:
- [Brown et al. (2023), *AI Advancements Journal*](https://www.aiadvancementsjournal.org/advanced-algorithms)

- [Thompson and Lee (2024), *AI Ethics Quarterly*](https://www.aiethicsquarterly.org/ai-judicial-use)

- [Morales and Kumar (2024)](https://www.aisocietalimpact.org/frameworks)

- [Zhao and Michelson (2025), *AI Accountability Forum*](https://www.aiaccountabilityforum.org/standards)

- [Patel and Singh (2025)](https://www.globalairegulations.org/harmonization)
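
A final check on the composed output is to confirm that every link in the generated section really comes from the draft, since the second model call could in principle alter or invent a URL. The following is a minimal sketch; `LINK_PATTERN`, `parse_references` and `unexpected_references` are hypothetical names, and the check assumes URLs are copied verbatim from the draft.

```python
import re
from typing import List, Tuple

# Hypothetical helpers, not part of the original notebook.
# Matches markdown links of the form [anchor text](http...).
LINK_PATTERN = re.compile(r"\[([^\]]+)\]\((https?://[^\s)]+)\)")


def parse_references(reference_section: str) -> List[Tuple[str, str]]:
    """Return (anchor text, url) pairs for every markdown link in the section."""
    return LINK_PATTERN.findall(reference_section)


def unexpected_references(draft: str, reference_section: str) -> List[Tuple[str, str]]:
    """Return the references whose URL does not literally appear in the draft."""
    return [(text, url) for text, url in parse_references(reference_section) if url not in draft]
```

An empty list from `unexpected_references(input_data_test, output_refs_section)` would indicate that no link was introduced or modified along the way.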