Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alfworld GPT-3 Results #11

Closed
gautierdag opened this issue Jun 2, 2023 · 3 comments
Closed

Alfworld GPT-3 Results #11

gautierdag opened this issue Jun 2, 2023 · 3 comments

Comments

@gautierdag
Copy link

Hi,
I wondered if you had more details or numbers from your GPT-3 results on Alfworld? For instance, do you have the splits of accuracy across the different subtasks (as in Table 3 in the paper)?

I would try to reproduce it, but I reckon the total cost would be > $100 and would like to avoid it if possible.

@ysymyth
Copy link
Owner

ysymyth commented Jun 11, 2023

Hi, at the end of 134 instances, the six category

prefixes = {
    'pick_and_place': 'put',
    'pick_clean_then_place': 'clean',
    'pick_heat_then_place': 'heat',
    'pick_cool_then_place': 'cool',
    'look_at_obj': 'examine',
    'pick_two_obj': 'puttwo'
}

has the final result

134 r 0 rs [19, 19, 7, 17, 16, 8] cnts [24, 31, 23, 21, 18, 17] sum(rs)/sum(cnts) 0.6417910447761194

e.g. put tasks are 19/24 correct.

@ysymyth
Copy link
Owner

ysymyth commented Jun 11, 2023

A more complete trajectory is at https://gist.github.com/ysymyth/01045e5b65651eccd63a5a46964b8216

@ysymyth ysymyth closed this as completed Jun 11, 2023
@gautierdag
Copy link
Author

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants