langchain-ai · lnhsingh · Nov 3, 2025 · Nov 3, 2025
@@ -110,7 +110,7 @@ Learn about [how to define an LLM-as-a-judge evaluator](/langsmith/llm-as-judge)
 
 ### Pairwise
 
-Pairwise evaluators allow you to compare the outputs of two versions of an application. Think [LMSYS Chatbot Arena](https://chat.lmsys.org/) - this is the same concept, but applied to AI applications more generally, not just models! This can use either a heuristic ("which response is longer"), an LLM (with a specific pairwise prompt), or human (asking them to manually annotate examples).
+Pairwise evaluators allow you to compare the outputs of two versions of an application. This can use either a heuristic ("which response is longer"), an LLM (with a specific pairwise prompt), or human (asking them to manually annotate examples).
 
 **When should you use pairwise evaluation?**