SourceCheckup

This is the repo for the data and code used in the paper:

How well do LLMs cite relevant medical references? An evaluation framework and analyses

Overview

SourceCheckup is a tool for verifying whether statements are supported by the sources cited for them. The script takes a citation URL or a direct question as input, queries an AI model to generate questions and verify responses, and writes the results to a CSV file. It also reports the fraction of statements supported by at least one citation.

Data

The data are contained in the following Google Drive: link

The files are organized as follows:

Questions: The 1200 questions used in this analysis. The original documents from Mayo Clinic, UpToDate, and r/AskDocs used to generate each question are not included here due to terms of use for each site. However, the questions generated from each page are available in questions.csv.
Responses: The responses for each of the seven models are provided within each CSV in the Responses folder.
Parsed Statements: Each response is parsed for medically relevant statements, which are included as a list within a column.
Fact-Citation Pairs: The facts are paired with citations provided to back each response.
Expert Annotations: The question-annotation pairings from medical experts.
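
Because the parsed statements are stored as a list inside a single CSV column, they come back as plain strings when the file is loaded with pandas. The sketch below shows one way to recover the lists. The column names (question, response, statements) and the sample rows are hypothetical, and it assumes the lists were saved as Python literals; if they were saved as JSON, json.loads would be the better fit.

```python
import ast
import io

import pandas as pd

# Hypothetical stand-in for a response CSV. The column names and the rows
# are assumptions for illustration, not the actual layout of the data files.
csv_text = """question,response,statements
"What causes anemia?","Example response one.","['Iron deficiency can cause anemia.', 'Anemia can cause fatigue.']"
"What is a fever?","Example response two.","['A fever is a raised body temperature.']"
"""

df = pd.read_csv(io.StringIO(csv_text))

# A list written to CSV is read back as a string, so parse it into a list.
df["statements"] = df["statements"].apply(ast.literal_eval)

# Expand to one row per (question, statement) pair for downstream checks.
exploded = df.explode("statements", ignore_index=True)
print(exploded[["question", "statements"]])
```

Exploding to one row per statement makes it straightforward to join each statement against its citations later.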

Modules

utils.py

Contains utility functions used in the main script:

extract_contents_from_url(citation_url): Extracts contents from the given URL.
GPTWrapper: A wrapper for querying the AI model with specified prompts and settings.

run.py

Main script that processes the given citation URL or question, verifies the information, and outputs the results.

Inputs and Outputs

Inputs

citation_url: URL to the citation to be processed.
question: Direct question to be processed.
output_file: Filename for the output CSV (default: "example_output.csv").
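
The inputs above could be wired up with argparse roughly as follows. This is a minimal sketch, not the code in run.py: the --citation_url and --question flags appear in the usage examples in this README, while the --output_file flag name, the help strings, and the rule that at least one of the first two must be given are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the three inputs described in this README."""
    parser = argparse.ArgumentParser(description="SourceCheckup input sketch")
    parser.add_argument("--citation_url", default=None,
                        help="URL to the citation to be processed")
    parser.add_argument("--question", default=None,
                        help="Direct question to be processed")
    # Flag name assumed from the input name; the default is from the README.
    parser.add_argument("--output_file", default="example_output.csv",
                        help="Filename for the output CSV")
    return parser


def parse_inputs(argv=None) -> argparse.Namespace:
    """Parse arguments; require a URL or a question (an assumed rule)."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.citation_url is None and args.question is None:
        parser.error("provide --citation_url or --question")
    return args


if __name__ == "__main__":
    example = parse_inputs(["--question", "What is a fever?"])
    print(example.output_file)
```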

Outputs

CSV file containing the decision matrix with statements, citation URLs, decisions, and reasons.
Prints the fraction of unique statements supported by at least one citation.
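
The printed summary can be reproduced from a decision matrix along these lines. This is a sketch under assumptions: the column names (statement, citation_url, supported) and the boolean encoding of the decision are hypothetical and may differ from the CSV that run.py writes.

```python
import pandas as pd

# Hypothetical decision matrix: one row per (statement, citation) pair.
decision_matrix = pd.DataFrame({
    "statement": ["A", "A", "B", "C"],
    "citation_url": ["https://example.org/1", "https://example.org/2",
                     "https://example.org/1", "https://example.org/2"],
    "supported": [True, False, False, True],
})


def fraction_supported(matrix: pd.DataFrame) -> float:
    """Fraction of unique statements supported by at least one citation."""
    # A statement counts as supported if any of its citation rows is True.
    per_statement = matrix.groupby("statement")["supported"].any()
    return float(per_statement.mean())


fraction = fraction_supported(decision_matrix)
print(f"Fraction of statements supported: {fraction:.2f}")
```

In this toy matrix, statements A and C each have a supporting citation and B has none, so two of the three unique statements count as supported.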

Running the Script

Prerequisites

Python 3.x
Install required libraries: pandas (argparse and json are part of the Python standard library and need no separate installation)

Command Line Usage

Provide a Citation URL

python run.py --citation_url "https://my.clevelandclinic.org/health/diseases/8541-thyroid-disease"

Provide a Question

python run.py --question "What is the correct dosage for acetaminophen for infants?"
