You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Success and failure are simply defined as whether the LLM is able to reach correct answers after the ReAct procedure (measured by the final response).
Therefore, the LM can reach correct answers for the wrong ReAct trajectory. We would like to distinguish such cases from those successful cases where the trajectories are also correct, therefore report "false positive" in the table.
Hallucination can happen anytime, both in success and failure cases (even the model hallucinates at some point, it may still be able to reach the correct answer for the wrong reason). Therefore in both modes we report the hallucination traces.
I am impressed with your research. Thank you for your good research.
But I have a question and would like to ask.
According to Table 2 of the paper, success and failure modes are divided.
Thanks!
The text was updated successfully, but these errors were encountered: