Bootstrapping ScoNe CoT demos with a separate LLM from the main one #260
Hi @okhat!
This is a simple example showing how to use a different (and presumably more powerful) model for bootstrapping examples than the one being used for the central predictions.
The example task is the ScoNe "one scoping negation" category, which is one of the hardest categories in ScoNe. Zero-shot, turbo is at chance. Using turbo to bootstrap full CoT examples did not seem to work well, but using GPT-4 for bootstrapping (a possibility you made me aware of!) took performance all the way north of 85% accuracy (one of my runs was at 93%). This is extremely high, but everything seems to be set up correctly, so this looks like a nice illustration of the power of this strategy.
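For readers skimming the PR, here is a minimal sketch of the pattern, assuming the DSPy OpenAI wrappers and `BootstrapFewShot`'s `teacher_settings` argument; the signature fields, metric, and tiny trainset below are placeholders, not the ones from the notebook:

```python
import dspy
from dspy.teleprompt import BootstrapFewShot

# Student LM: the cheaper model that serves the final predictions.
turbo = dspy.OpenAI(model="gpt-3.5-turbo", max_tokens=250)
# Teacher LM: a stronger model used only to bootstrap the CoT demonstrations.
gpt4 = dspy.OpenAI(model="gpt-4", max_tokens=350)

dspy.settings.configure(lm=turbo)


class ScoNeSignature(dspy.Signature):
    """Decide whether the hypothesis follows from the context."""
    context = dspy.InputField()
    question = dspy.InputField()
    answer = dspy.OutputField(desc="Yes or No")


class ScoNeCoT(dspy.Module):
    def __init__(self):
        super().__init__()
        self.generate_answer = dspy.ChainOfThought(ScoNeSignature)

    def forward(self, context, question):
        return self.generate_answer(context=context, question=question)


def exact_match(example, pred, trace=None):
    # Placeholder metric: exact string match on the gold answer.
    return example.answer.strip().lower() == pred.answer.strip().lower()


# Placeholder training examples; the notebook uses the real ScoNe data.
trainset = [
    dspy.Example(context="...", question="...", answer="Yes").with_inputs("context", "question"),
]

# teacher_settings routes demo generation through GPT-4, while the compiled
# program itself still runs on turbo at prediction time.
bootstrapper = BootstrapFewShot(metric=exact_match, teacher_settings=dict(lm=gpt4))
compiled_scone = bootstrapper.compile(ScoNeCoT(), trainset=trainset)
```

The only change from a standard bootstrapping run is the `teacher_settings=dict(lm=gpt4)` argument; everything else stays on the student LM.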
Do let me know if I should make any adjustments to the way the notebook is set up. The hope is that this is code people can copy-paste for their own work (almost none of the DSPy code is even specific to ScoNe).
---Chris