cgpotts (Collaborator) commented Dec 27, 2023

Hi @okhat!

This is a simple example showing how to use a different (and presumably more powerful) model for bootstrapping examples than the one being used for the central predictions.

The example task is the ScoNe "one scoping negation" category, which is one of the hardest categories in ScoNe. Zero-shot, turbo is at chance, and using turbo to bootstrap full CoT examples did not seem to work well. Using GPT-4 for bootstrapping (a possibility you made me aware of!) took performance all the way north of 85% accuracy, and one of my runs was at 93%. This is extremely high, but everything seems to be set up correctly, so this seems like a nice illustration of the power of this strategy.
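For readers who have not seen the pattern before, here is a minimal, library-agnostic sketch of teacher/student bootstrapping. It is not the notebook's code and it does not use DSPy's API. The names `bootstrap_demos`, `accuracy`, `toy_teacher`, and `toy_student` are hypothetical, the two "models" are hard-coded stubs standing in for real LM calls, and the tiny datasets are invented for illustration. The idea it shows: run the stronger model on training questions, keep only the traces whose final answer matches the gold label, and hand those traces to the weaker model as few-shot demonstrations.

```python
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]   # (question, gold answer)
Demo = Dict[str, str]       # keys: question, rationale, answer
# A "model" maps (question, demos) to (rationale, answer).
Model = Callable[[str, List[Demo]], Tuple[str, str]]


def bootstrap_demos(teacher: Model, trainset: List[Example],
                    max_demos: int = 4) -> List[Demo]:
    """Collect teacher traces whose final answer matches the gold label."""
    demos: List[Demo] = []
    for question, gold in trainset:
        rationale, answer = teacher(question, [])
        if answer == gold:  # discard traces that end in a wrong answer
            demos.append({"question": question,
                          "rationale": rationale,
                          "answer": answer})
        if len(demos) >= max_demos:
            break
    return demos


def accuracy(student: Model, demos: List[Demo],
             devset: List[Example]) -> float:
    """Fraction of dev questions the student answers correctly."""
    correct = sum(student(q, demos)[1] == gold for q, gold in devset)
    return correct / len(devset)


# Hypothetical stand-ins for real LM calls (assumptions, not real models).
def toy_teacher(question: str, demos: List[Demo]) -> Tuple[str, str]:
    # Handles a single negation, but gets double negation wrong.
    answer = "No" if " not " in question else "Yes"
    return ("Check whether a negation scopes over the claim.", answer)


def toy_student(question: str, demos: List[Demo]) -> Tuple[str, str]:
    # Without demos it always says "Yes"; with demos it imitates the teacher.
    if not demos:
        return ("", "Yes")
    return toy_teacher(question, demos)


trainset = [
    ("The dog did not bark. Did the dog bark?", "No"),
    ("The dog barked. Did the dog bark?", "Yes"),
    # The toy teacher answers "No" here, so this trace is filtered out.
    ("It is not true that the dog did not bark. Did the dog bark?", "Yes"),
]
devset = [
    ("The cat did not sleep. Did the cat sleep?", "No"),
    ("The cat slept. Did the cat sleep?", "Yes"),
]

demos = bootstrap_demos(toy_teacher, trainset)
zero_shot_acc = accuracy(toy_student, [], devset)
few_shot_acc = accuracy(toy_student, demos, devset)
```

The filter on `answer == gold` is the important design choice: the teacher's reasoning is only reused when it led to a correct label, so the student never sees demonstrations that end in a wrong answer. In a real setup the two stubs would be replaced by calls to the weaker and stronger LMs.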

Do let me know if I should make any adjustments to the way the notebook is set up. The hope is that this is code people can copy-paste for their own work (almost none of the DSPy code is even specific to ScoNe).

---Chris

cgpotts requested a review from okhat December 27, 2023 03:54

okhat (Collaborator) commented Dec 27, 2023

Thank you Chris! Everything looks great to me, particularly after switching the flag to False just now.

Merging!

okhat merged commit 1a2ef18 into main Dec 27, 2023

cgpotts (Collaborator, Author) commented Dec 27, 2023

Yes, I very predictably forgot to toggle that flag!

cgpotts deleted the scone-example branch December 27, 2023 04:10