#
balderdash
Here are 2 public repositories matching this topic...
A framework using the game Balderdash to evaluate creativity and logical reasoning in Large Language Models (LLMs). Multiple LLMs generate fictitious definitions to deceive others and identify correct ones, analyzing creativity, deception, and performance.
nlp
natural-language-processing
multi-agent-simulation
balderdash
large-language-models
llms
llms-benchmarking
-
Updated
Jul 1, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to the balderdash topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the balderdash topic, visit your repo's landing page and select "manage topics."