Questions on Table 3 (AlfWorld) #19

guosyjlu · 2023-10-08T08:04:12Z

Hi,
Thanks for your great work! I have a question on Table 3, where results of Act and ReAct are reported as avg/best of 6. I am wondering where does 6 come from, given that the decoding strategy is greedy.
Thank you!

ysymyth · 2023-10-25T20:26:58Z

Hi @guosyjlu , as stated in the paper, "For robustness, we construct 6 prompts for each task type through each permutation of 2 annotated trajectories from the 3 we annotate." So 6 trials come from different selections of prompting examples!

ysymyth closed this as completed Oct 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions on Table 3 (AlfWorld) #19

Questions on Table 3 (AlfWorld) #19

guosyjlu commented Oct 8, 2023

ysymyth commented Oct 25, 2023

Questions on Table 3 (AlfWorld) #19

Questions on Table 3 (AlfWorld) #19

Comments

guosyjlu commented Oct 8, 2023

ysymyth commented Oct 25, 2023