Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

higher accuracies for experiment two with modified prompt #4

Open
wants to merge 8 commits into
base: main
Choose a base branch
from

Conversation

sradc
Copy link

@sradc sradc commented Sep 24, 2023

Hi there, super interesting work.

Looks like the accuracy of gpt-4 might be higher with this prompt.

@lukasberglund
Copy link
Owner

lukasberglund commented Sep 24, 2023

Hi @sradc, thanks for pointing this out and glad you enjoyed our paper. What prompt did you use to achieve this accuracy?

@sradc
Copy link
Author

sradc commented Sep 24, 2023

Hey, the notebook is included in this MR, (and the predictions themselves). Will include below for convenience:

PROMPT_TEMPLATE = """
This is a quiz related to celebrities, and their families.
Here are some example question and answers:

Q: A parent of X is Fahimeh Rahim Nia. Who is X?
Golshifteh Farahani

Q: A parent of X is Timothy Christopher Mara. Who is X?
Kate Mara

Q: A parent of X is Samira Calle. Who is X?
Sasha Calle

Q: A parent of X is Fiona Biggar. Who is X?
Daniel Portman

Now answer (response with just the name):
Q: A parent of X is {parent}. Who is X?
""".strip()

(Note that the example is removed from the prompt, if it's for the celebrity being tested.)

Also running this on gpt3.5 currently.

Edit: also, used this for the system prompt:

You are a helpful assistant, being quizzed on celebrities. If you are not sure, you **must** guess a name.

@sradc
Copy link
Author

sradc commented Sep 24, 2023

gtp-3.5turbo seems to get around 45% accuracy with this prompt (included results in previous commit)

@sradc sradc changed the title higher accuracy with gpt-4 and this prompt (from ~0.3 to ~0.5) higher accuracy with for experiment two with this prompt (from ~0.3 to ~0.5 gpt4) Sep 24, 2023
@sradc sradc changed the title higher accuracy with for experiment two with this prompt (from ~0.3 to ~0.5 gpt4) higher accuracy with for experiment two with this prompt (from ~0.3 to ~0.5) Sep 24, 2023
@sradc
Copy link
Author

sradc commented Sep 25, 2023

Pushed updates. Best results so far:

gpt-4 - 0.561 accuracy (not in latest commit)
gpt-3.5-turbo-0613 - 0.484 accuracy

@sradc
Copy link
Author

sradc commented Sep 25, 2023

Best results are now in the latest commit:

gpt-4: 0.565 accuracy
gpt-3.5-turbo-0613: 0.516 accuracy

(Probably going to stop now because it's expensive.)

@sradc sradc changed the title higher accuracy with for experiment two with this prompt (from ~0.3 to ~0.5) higher accuracies for experiment two with modified prompt Sep 25, 2023
@lukasberglund
Copy link
Owner

Thanks for pointing these out! I'm not going to merge for now, since your change doesn't really integrate with the existing codebase, but it's cool to see that there are better prompts out there.

@sradc
Copy link
Author

sradc commented Sep 26, 2023

No prob, this PR was just to share and track the work. Let me know if you might want to integrate the prompt stuff.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants