BioContextAD

Biomarker-Guided Context Engineering for Alzheimer's Disease Early Screening

⚠️ Disclaimer: This repository is a research prototype for academic exploration only. It is not a clinical diagnostic tool and should not be used for medical decision-making.

Overview

BioContextAD is a proof-of-concept framework that integrates biomarker-guided context engineering with large language models (LLMs) for Alzheimer's disease (AD) early screening. Rather than relying on unconstrained LLM inference, the system routes queries through a structured biomarker-aware pipeline to improve evidence grounding and safety.

Pipeline Architecture

flowchart LR
    A[AD Query / Case] --> B[BioRouter]
    B --> C[Biomarker-guided RAG]
    C --> D[Evidence Graph]
    D --> E[Abstention Module]
    E --> F[Multi-teacher Consistency]
    F --> G[Automated Evaluation\nMetrics / Report]

    style A fill:#f0f4ff,stroke:#4a6fa5
    style B fill:#dbeafe,stroke:#2563eb
    style C fill:#dcfce7,stroke:#16a34a
    style D fill:#fef9c3,stroke:#ca8a04
    style E fill:#fce7f3,stroke:#db2777
    style F fill:#ede9fe,stroke:#7c3aed
    style G fill:#f1f5f9,stroke:#64748b

Core Modules

Module	Role	Key Metric
BioRouter	Query classification across AD pathological axes (A/T/N/I/V + OTHER)	Macro-F1
Biomarker-guided RAG	Evidence retrieval anchored to AD biomarker categories	Evidence Relevance Score
Abstention Module	Safety control for unanswerable or unsafe queries	Abstention F1
Evidence Graph	Lightweight knowledge graph linking biomarkers and findings	Node/Edge Coverage
Multi-teacher Consistency	Cross-model agreement for answer reliability	Agreement Rate (κ)
Evaluation Pipeline	Automated metrics, error case analysis, weekly report	—

AD Biomarker Axes (NIA-AA ATNIV Framework)

The routing and retrieval system is anchored to five pathological axes:

A — Amyloid (Aβ42/Aβ40, CSF/PET/plasma)
T — Tau (p-tau181/217/231, NFT)
N — Neurodegeneration (NfL, GFAP, MRI atrophy)
I — Inflammation / Immunity (microglia, astrocyte, neuroinflammation)
V — Vascular contribution (vascular dysfunction, WMH)
OTHER — Risk factors, cognitive scales, APOE ε4

Quick Start

# 1. Clone
git clone https://github.com/ShengAnlin/BioContextAD.git
cd BioContextAD

# 2. Install dependencies
conda env create -f environment.yml
conda activate biocontextad
# or: pip install -r requirements.txt

# 3. Configure API keys
cp .env.example .env
# Edit .env and fill in your API keys

# 4. Run the full pipeline
bash scripts/run_all.sh

Results will be saved to results/. A Markdown summary report is generated at results/weekly_report.md.

Repository Structure

BioContextAD/
├── configs/
│   ├── models.yaml          # Model role assignments
│   └── axes.yaml            # AD pathological axis definitions
├── data/
│   ├── eval_questions.jsonl # Evaluation questions (seed set)
│   └── evidence_pairs.jsonl # (claim, evidence) pairs for ranking
├── prompts/
│   ├── router_prompt.md     # BioRouter system prompt
│   ├── rag_prompt.md        # Biomarker-guided RAG prompt
│   ├── abstention_prompt.md # Abstention/safety prompt
│   └── evidence_prompt.md   # Evidence extraction prompt
├── src/
│   ├── llm_client.py        # Unified LLM interface with caching & retry
│   ├── run_e1.py            # Experiment 1: BioRouter evaluation
│   ├── run_e3.py            # Experiment 3: Evidence ranking
│   ├── metrics.py           # Macro-F1, Fleiss' κ, confusion matrix
│   └── report.py            # Automated 6-section weekly report
├── notebooks/
│   └── exploration.ipynb    # EDA and result visualization
├── docs/
│   └── architecture.md      # Detailed pipeline documentation
├── scripts/
│   └── run_all.sh           # End-to-end pipeline runner
├── results/                 # Output directory (gitignored except .gitkeep)
├── environment.yml
├── requirements.txt
└── .env.example

Experimental Setup

Phase 1 (Dry-run)

Experiment	Cases	Models	Primary Metric
E1: BioRouter	30–50	DeepSeek-V4-Flash / Qwen3.5-27B	Macro-F1
E2: Abstention	20	Claude / DeepSeek-V4-Pro	Abstention F1
E3: RAG	30–50	Claude / GPT / Baichuan-M3	Evidence Relevance
E4: Multi-teacher	20–30 × 3	Claude + Baichuan-M3 + Qwen	Agreement Rate (κ)

Ablation Conditions

Condition	Description
Full pipeline	BioRouter + Biomarker RAG + Abstention
No-router	Vanilla RAG without axis routing
No-biomarker	Generic RAG without biomarker anchoring
No-abstention	Pipeline without safety/uncertainty gate
Vanilla RAG	Standard RAG baseline

Technical Design

Unified LLM Interface

All API calls go through a single call_llm(model_role, prompt, temperature) interface:

Caching: Results stored at results/raw/{task}/{sample_id}_{model}.json
Retry: Exponential backoff with 3 retries
Fallback: Configurable fallback model per role
Logging: All failures logged to logs/errors.log

API Role Assignments

Role	Model
Reasoning / Writing	Claude / GPT-4.1
Medical Teacher	Baichuan-M3 / DeepSeek-V4-Pro
Batch Baseline	DeepSeek-V4-Flash / Qwen3.5-27B
Structured Output	Qwen3.6-Plus
Long-doc / Report	Kimi-K2.6 / MiniMax-M2.5

Paper Contributions

This work makes three core contributions:

A biomarker-guided context engineering framework for AD early screening that anchors LLM retrieval to the NIA-AA ATNIV pathological axes.
A BioRouter + Abstention mechanism for routing AD queries across pathological categories and constraining unsafe or unanswerable inference.
An evidence-grounded evaluation pipeline integrating RAG, lightweight knowledge graph construction, and multi-teacher consistency scoring.

Citation

If you find this work useful, please cite:

@misc{sheng2026biocontextad,
  title  = {BioContextAD: Biomarker-Guided Context Engineering for Alzheimer's Disease Early Screening},
  author = {Sheng, Anlin},
  year   = {2026},
  url    = {https://github.com/ShengAnlin/BioContextAD}
}

License

This project is licensed under the MIT License. See LICENSE for details.

Research use only. This framework is intended for academic research and is not validated for clinical use. All outputs should be interpreted by qualified medical professionals.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BioContextAD

Overview

Pipeline Architecture

Core Modules

AD Biomarker Axes (NIA-AA ATNIV Framework)

Quick Start

Repository Structure

Experimental Setup

Phase 1 (Dry-run)

Ablation Conditions

Technical Design

Unified LLM Interface

API Role Assignments

Paper Contributions

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
configs		configs
data		data
docs		docs
notebooks		notebooks
prompts		prompts
results		results
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

BioContextAD

Overview

Pipeline Architecture

Core Modules

AD Biomarker Axes (NIA-AA ATNIV Framework)

Quick Start

Repository Structure

Experimental Setup

Phase 1 (Dry-run)

Ablation Conditions

Technical Design

Unified LLM Interface

API Role Assignments

Paper Contributions

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages