Skip to content

TurnGate Project: Detecting Malicious Intent

Official organization for research paper One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue — a novel turn-level monitor that identifies the earliest turn where multi-turn interactions become sufficient for harm, providing a robust defense against state-of-the-art adaptive attackers such as the CKA-Agent.

Links

Popular repositories Loading

  1. turn-gate.github.io turn-gate.github.io Public

    HTML

  2. .github .github Public

  3. TurnGate TurnGate Public

    Forked from Graph-COM/TurnGate

    Python

Repositories

Showing 3 of 3 repositories

Top languages

Loading…

Most used topics

Loading…