Paper, table2 #13

serotoninpm · 2023-06-28T01:51:30Z

I am impressed with your research. Thank you for the good work.

I have a question I would like to ask.

Table 2 of the paper divides the results into success modes and failure modes.

What is the definition of a success mode and a failure mode?
If a success mode is a successful case, it should not include false positives, because a false positive means predicting something wrong as right.
Also, "hallucinated reasoning trace or facts" appears under both the success mode and the failure mode. Why is that?

Thanks!

ysymyth · 2023-07-02T16:18:39Z

Success and failure are defined simply by whether the LLM reaches the correct answer after the ReAct procedure (measured by the final response).
As a result, the LLM can reach a correct answer via an incorrect ReAct trajectory. We want to distinguish such cases from successes whose trajectories are also correct, so we report "false positive" in the table.
Hallucination can happen at any time, in both success and failure cases (even if the model hallucinates at some point, it may still reach the correct answer for the wrong reason). Therefore, we report hallucinated traces in both modes.
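
To make the labeling concrete, here is a minimal sketch of the rule described above. It is not code from the paper or the repository: the `Episode` class, its field names, and the `label` function are hypothetical, and it assumes a human annotator has already judged each trajectory.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    # All three fields are hypothetical annotations, not names from the paper.
    answer_correct: bool      # final response matches the gold answer
    trajectory_correct: bool  # the ReAct trace was judged sound
    hallucinated: bool        # the trace contains a hallucinated thought or fact


def label(ep: Episode) -> tuple[str, list[str]]:
    """Return (mode, tags) following the definitions in this thread."""
    # The mode depends only on the final answer.
    mode = "success" if ep.answer_correct else "failure"
    tags = []
    # Right answer reached through a wrong trajectory: still a success,
    # but flagged as a false positive.
    if ep.answer_correct and not ep.trajectory_correct:
        tags.append("false positive")
    # Hallucination is tracked independently of the mode.
    if ep.hallucinated:
        tags.append("hallucination")
    return mode, tags
```

Under these assumptions, `label(Episode(True, False, True))` gives a success tagged as both a false positive and a hallucination, while `label(Episode(False, False, True))` gives a failure tagged as a hallucination. That is why the same hallucination description can show up under both modes.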

serotoninpm · 2023-07-04T04:35:09Z

Thank you for your detailed and kind explanation!

ysymyth closed this as completed Jul 4, 2023
