Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 #1247

AkihikoWatanabe · 2024-03-05T09:16:36Z

URL

https://arxiv.org/abs/2402.10200

Affiliations

Xuezhi Wang, N/A
Denny Zhou, N/A

Abstract

In enhancing the reasoning capabilities of large language models (LLMs),prior research primarily focuses on specific prompting techniques such asfew-shot or zero-shot chain-of-thought (CoT) prompting. These methods, whileeffective, often involve manually intensive prompt engineering. Our study takesa novel approach by asking: Can LLMs reason effectively without prompting? Ourfindings reveal that, intriguingly, CoT reasoning paths can be elicited frompre-trained LLMs by simply altering the \textit{decoding} process. Rather thanconventional greedy decoding, we investigate the top-$k$ alternative tokens,uncovering that CoT paths are frequently inherent in these sequences. Thisapproach not only bypasses the confounders of prompting but also allows us toassess the LLMs' \textit{intrinsic} reasoning abilities. Moreover, we observethat the presence of a CoT in the decoding path correlates with a higherconfidence in the model's decoded answer. This confidence metric effectivelydifferentiates between CoT and non-CoT paths. Extensive empirical studies onvarious reasoning benchmarks show that the proposed CoT-decoding substantiallyoutperforms the standard greedy decoding.

Translation (by gpt-3.5-turbo)

大規模言語モデル（LLMs）の推論能力を向上させるために、従来の研究では、few-shotやzero-shotのchain-of-thought（CoT）プロンプティングなど、特定のプロンプティング技術に焦点を当ててきた。
これらの方法は効果的ではあるが、しばしば手動でのプロンプトエンジニアリングが必要となる。
本研究は、LLMsがプロンプトなしでも効果的に推論できるかどうかという新しいアプローチを取っている。
我々の研究結果は、興味深いことに、事前学習されたLLMsからCoT推論パスを単純に\textit{デコーディング}プロセスを変更することで引き出すことができることを示している。
従来の貪欲なデコーディングではなく、上位-$k$の代替トークンを調査することで、これらのシーケンスにCoTパスが頻繁に内在していることが明らかになっている。
このアプローチは、プロンプティングの混乱を回避するだけでなく、LLMsの\textit{固有の}推論能力を評価することも可能となる。
さらに、デコーディングパスにCoTが存在することは、モデルのデコードされた回答に対する信頼度が高いことと相関していることが観察されている。
この信頼度メトリクスは、CoTパスと非CoTパスを効果的に区別する。
様々な推論ベンチマークにおける包括的な実験研究は、提案されたCoTデコーディングが標準の貪欲なデコーディングを大幅に上回ることを示している。

Summary (by gpt-3.5-turbo)

LLMsの推論能力を向上させるための新しいアプローチに焦点を当てた研究が行われている。この研究では、LLMsがプロンプトなしで効果的に推論できるかどうかを検証し、CoT推論パスをデコーディングプロセスを変更することで引き出す方法を提案している。提案手法は、従来の貪欲なデコーディングではなく、代替トークンを調査することでCoTパスを見つけることができることを示しており、様々な推論ベンチマークで有効性を示している。

AkihikoWatanabe · 2024-03-05T09:32:57Z

以前にCoTを内部的に自動的に実施されるように事前学習段階で学習する、といった話があったと思うが、この研究はデコーディング方法を変更することで、promptingで明示的にinstructionを実施せずとも、CoTを実現するもの、ということだと思われる。

AkihikoWatanabe added the Pocket label Mar 5, 2024

AkihikoWatanabe changed the title あ Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 Mar 5, 2024

AkihikoWatanabe added NLP LanguageModel CoT Prompting labels Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 #1247

Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 #1247

AkihikoWatanabe commented Mar 5, 2024 •

edited

AkihikoWatanabe commented Mar 5, 2024

Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 #1247

Chain-of-Thought Reasoning Without Prompting, Xuezhi Wang+, N/A, arXiv'24 #1247

Comments

AkihikoWatanabe commented Mar 5, 2024 • edited

URL

Affiliations

Abstract

Translation (by gpt-3.5-turbo)

Summary (by gpt-3.5-turbo)

AkihikoWatanabe commented Mar 5, 2024

AkihikoWatanabe commented Mar 5, 2024 •

edited