Banishing LLM Hallucinations Requires Rethinking Generalization

Johnny Li, Saksham Consul, Eda Zhou, James Wong, Naila Farooqui, Nithyashree Manohar, Zhuxiaona (Nina) Wei, Tian Wu, Ben Echols, Sharon Zhou, and Gregory Diamos

Despite their powerful chat, coding, and reasoning abilities, Large Language Models (LLMs) frequently hallucinate. Conventional wisdom suggests that hallucinations are a consequence of a balance between creativity and factuality, which can be mitigated, but not eliminated, by grounding the LLM in external knowledge sources. Through extensive systematic experiments, we show that these traditional approaches fail to explain why LLMs hallucinate in practice. Specifically, we show that LLMs augmented with a Mixture of Millions of Memory Experts (MoME) can easily memorize large datasets of random numbers. We corroborate these experimental findings with a theoretical construction showing that simple neural networks trained to predict the next token hallucinate when the training loss is above a threshold, as it usually is in practice when training on internet-scale data. We interpret our findings by comparing against traditional retrieval methods for mitigating hallucinations. We use our findings to design a first-generation model for removing hallucinations, Lamini-1, which stores facts in a massive mixture of millions of memory experts that are retrieved dynamically.
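This repository does not include an implementation, so the snippet below is only a minimal PyTorch sketch of the general idea described in the abstract: a large bank of small memory experts from which a few are retrieved per token and mixed back into the hidden state. The class name `MemoryExpertLayer`, the expert count, the low-rank expert parameterization, and the top-k key-matching routing are illustrative assumptions, not the authors' architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MemoryExpertLayer(nn.Module):
    """Illustrative sketch (not the paper's code): retrieve a few memory
    experts from a large bank and mix their outputs into the hidden state."""

    def __init__(self, hidden_dim: int, num_experts: int = 4096,
                 expert_rank: int = 8, top_k: int = 4):
        super().__init__()
        self.top_k = top_k
        # One key per expert; hidden states are matched against these keys
        # to decide which experts to retrieve for each token.
        self.expert_keys = nn.Parameter(torch.randn(num_experts, hidden_dim) * 0.02)
        # Each expert is a low-rank update: down-projection then up-projection.
        self.down = nn.Parameter(torch.randn(num_experts, hidden_dim, expert_rank) * 0.02)
        self.up = nn.Parameter(torch.zeros(num_experts, expert_rank, hidden_dim))

    def forward(self, hidden: torch.Tensor) -> torch.Tensor:
        # hidden: (batch, seq, hidden_dim)
        scores = hidden @ self.expert_keys.T             # (B, S, num_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)   # (B, S, k)
        weights = F.softmax(weights, dim=-1)

        # Gather the selected experts' low-rank weights.
        down = self.down[idx]                            # (B, S, k, H, r)
        up = self.up[idx]                                # (B, S, k, r, H)

        # Apply each selected expert to the token's hidden state.
        h = hidden.unsqueeze(2).unsqueeze(3)             # (B, S, 1, 1, H)
        expert_out = ((h @ down) @ up).squeeze(3)        # (B, S, k, H)

        # Mix expert outputs by their routing weights and add residually.
        mixed = (weights.unsqueeze(-1) * expert_out).sum(dim=2)
        return hidden + mixed


# Example usage with made-up sizes.
layer = MemoryExpertLayer(hidden_dim=256)
x = torch.randn(2, 16, 256)
print(layer(x).shape)  # torch.Size([2, 16, 256])
```

Only the selected experts contribute to each token, so the bank can grow very large while the per-token compute stays roughly constant; how the experts are trained and indexed in Lamini-1 is described in the paper, not here.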
