@GOATnote-Inc

GOATnote

Popular repositories

  1. scribegoat2 Public

    Open-source medical LLM safety evaluation pipeline with reproducible benchmarks and high-risk clinical failure analysis.

    Python · 4 stars · 1 fork

  2. lostbench Public

    Standalone benchmark for multi-turn safety persistence in medical LLM conversations. Measures recommendation monotonicity under sustained patient pressure.

    Python

  3. openem-corpus Public

    The AI-native emergency medicine knowledge base. Agent-compiled, physician-verified, grep-friendly.

    Python

  4. safeshift Public

    Does making the model faster make it less safe? Safety degradation benchmarking under inference optimization.

    Python

  5. radslice Public

    Multimodal radiology LLM benchmark across CT, MRI, X-ray, and ultrasound.

    Python

  6. healthcraft Public

    HEALTHCRAFT RL training environment: adapts the CORECRAFT architecture to emergency medicine.

    Python

Repositories

Showing 7 of 7 repositories
