How often do frontier LLMs (GPT-5, Claude 4.7, Gemini 3, Llama 4) hallucinate trademark clearance? 500 names × 10 categories. Dataset (CC-BY-4.0) + scorer (MIT) + paper.
benchmark evaluation gemini llama gpt uspto trademark ai-safety claude hallucination llm brand-naming etymolt
-
Updated
May 16, 2026 - Python