Skip to content

YiminZhao97/PanCanBench

Repository files navigation

PanCanBench

PanCanBench is a benchmark of 282 de-identified authentic pancreatic cancer patient questions paired with 3,130expert-designed rubrics for evaluating large language models(LLM).

Data Availability

The question-rubrics are available on

Citation

About

PanCanBench is a clinically-grounded benchmark designed to evaluate the utility and safety of Large Language Models (LLMs) in responding to real-world patient inquiries about pancreatic cancer.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages