
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A” #1059

AkihikoWatanabe opened this issue Oct 9, 2023 · 2 comments



AkihikoWatanabe commented Oct 9, 2023

https://bit.ly/3Rw6kk4

AkihikoWatanabe commented Oct 9, 2023

Shows that training an LLM on sentences of the form “A is B” does not generalize to the reverse direction, “B is A”.

Author's tweet: https://x.com/owainevans_uk/status/1705285631520407821?s=46&t=Y6UuIHB0Lv0IpmFAjlc2-Q

AkihikoWatanabe changed the title from “Studying Large Language Model Generalization with Influence Functions, Roger Grosse+, N/A, arXiv'23” to “The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”” on Oct 9, 2023

AkihikoWatanabe commented Oct 9, 2023

Finetuned GPT-3 and LLaMA on facts of the form “A is B”, then tested them by asking questions that require producing the fact in the reverse direction, “B is A”; accuracy was close to 0%.
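A minimal sketch of that evaluation setup (the fact pairs, templates, and the `generate` stand-in below are illustrative placeholders, not the paper's actual dataset or harness):

```python
# Illustrative "A is B" facts (made-up name/description pairs).
facts = [
    ("Daphne Barrington", "the director of 'A Journey Through Time'"),
    ("Uriah Hawthorne", "the composer of 'Abyssal Melodies'"),
]

# Forward (training) direction: "A is B".
train_texts = [f"{name} is {desc}." for name, desc in facts]

# Reverse (test) direction: prompt with B, expect the model to produce A.
eval_prompts = [f"Who is {desc}?" for _, desc in facts]
eval_answers = [name for name, _ in facts]

def exact_match_accuracy(generate):
    """Fraction of reverse-direction prompts whose generation contains
    the correct name. `generate` is a stand-in for querying the model
    finetuned on `train_texts`."""
    hits = sum(
        answer.lower() in generate(prompt).lower()
        for prompt, answer in zip(eval_prompts, eval_answers)
    )
    return hits / len(eval_prompts)
```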

Moreover, not only is accuracy low: the log-likelihood the models assign to the correct fact is no different from that assigned to a random fact, and this holds across all model sizes.
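A sketch of how such a log-likelihood comparison can be computed with a HuggingFace causal LM (the gpt2 checkpoint and the names are stand-ins; the paper evaluates finetuned GPT-3 and LLaMA models):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Stand-in checkpoint; substitute the model finetuned on the "A is B" facts.
tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

def completion_logprob(prompt: str, completion: str) -> float:
    """Sum of log-probabilities of the `completion` tokens given `prompt`."""
    prompt_ids = tok(prompt, return_tensors="pt").input_ids
    full_ids = tok(prompt + completion, return_tensors="pt").input_ids
    with torch.no_grad():
        logprobs = torch.log_softmax(model(full_ids).logits, dim=-1)
    # The logits at position i predict the token at position i + 1.
    total = 0.0
    for pos in range(prompt_ids.shape[1], full_ids.shape[1]):
        total += logprobs[0, pos - 1, full_ids[0, pos]].item()
    return total

# The finding above: in the reverse direction, the correct name scores
# no higher than a random one.
prompt = "The director of 'A Journey Through Time' is"
print(completion_logprob(prompt, " Daphne Barrington"))  # correct name
print(completion_logprob(prompt, " John Smith"))         # random name
```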

From this it follows that the Reversal Curse cannot be solved simply by scaling up model size.
