Anthropic API Incorrectly Flags Question About Data Quarantine Logic as Potential Malware Evasion #5342

@charles-dyfis-net

Bug Description
An Acceptable Use Policy (AUP) violation was incorrectly detected on a question about debugging code that deliberately "quarantines" inert data samples: when processing a sample fails, the code moves it into a holdout set and reintroduces it later. I presume that whatever model performed the check misinterpreted this as a request related to building evasion tools for malware.
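
For readers unfamiliar with the pattern, here is a minimal sketch of the kind of quarantine logic described above. It is illustrative only and is not the code from the flagged session; the function name `process_with_quarantine`, the `process` callback, and the `max_rounds` parameter are all hypothetical.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

T = TypeVar("T")


def process_with_quarantine(
    samples: Iterable[T],
    process: Callable[[T], None],  # hypothetical per-sample processing step
    max_rounds: int = 3,           # hypothetical retry limit
) -> Tuple[List[T], List[T]]:
    """Process samples, holding out failures and retrying them in later rounds.

    Returns (processed, still_quarantined).
    """
    pending = list(samples)
    processed: List[T] = []
    for _ in range(max_rounds):
        quarantined: List[T] = []  # holdout set for this round
        for sample in pending:
            try:
                process(sample)
                processed.append(sample)
            except Exception:
                # Processing failed: set the sample aside instead of aborting.
                quarantined.append(sample)
        if not quarantined:
            return processed, []
        # Reintroduce the held-out samples on the next round.
        pending = quarantined
    return processed, pending
```

Nothing here touches executables or detection systems: "quarantine" only means deferring inert data records that failed a processing step so they can be retried later.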

Environment Info

  • Platform: darwin
  • Terminal: ghostty
  • Version: 1.0.70
  • Feedback ID: 1db608b4-8dff-4640-bf28-9a07702b1202

Errors

[]
