Paper available here
To run an experiment on an unseen complex reasoning dataset (one that has question-answer data), follow these steps (a sketch of the full pipeline is shown after the list):
- Produce base dataset using Fine-tune-CoT.
- Run run_auto_cot() to produce the Easy-CoT dataset.
- Run finetune_model() to tune gpt2-small on this generated dataset.
- Use test_all() to evaluate the two baselines + Easy-CoT.
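Below is a minimal sketch of this pipeline, assuming the notebook's functions are available in the current session; the task name, file paths, and directory layout are illustrative assumptions, not fixed by the repo.

```python
# Hypothetical end-to-end run for a single task; the task name, paths,
# and directory layout are assumptions for illustration only.
import json

task = "my_reasoning_task"                                   # name of the unseen QA benchmark
base_dataset_path = "fine_tune_cot/my_reasoning_task/data.json"  # hypothetical path to the base dataset
save_dir = "easy_cot/my_reasoning_task"                      # where the Easy-CoT data will be dumped
model_dir = "models/my_reasoning_task"                       # where the fine-tuned gpt2-small is saved

# 1. Load the Fine-tune-CoT base dataset (teacher-model outputs).
with open(base_dataset_path) as f:
    dataset = json.load(f)

# 2. Apply Auto-CoT to the base dataset to build the Easy-CoT data.
run_auto_cot(dataset, task, save_dir)

# 3. Fine-tune gpt2-small on the generated Easy-CoT data.
finetune_model(task, model_dir)

# 4. Evaluate Auto-CoT, Fine-tune-CoT, and Easy-CoT on the task's test set.
test_all(task, model_dir)
```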
The easy_cot.ipynb notebook exposes the following APIs:
- run_auto_cot(dataset, task, save_dir, *hyperparams): apply Auto-CoT to the base dataset to generate the Easy-CoT dataset for the given task and dump it in save_dir. Note that the base dataset is the output of the teacher model from Fine-tune-CoT.
- finetune_model(task, model_dir): fine-tune gpt2-small on the Easy-CoT data for the provided task and save the fine-tuned model in model_dir.
- test_all(task, model_dir): run the two baselines (Auto-CoT and Fine-tune-CoT) and the Easy-CoT experiment on the test set for the task, and output the average accuracy over three runs for each. The fine-tuned model should be in model_dir.
Contains our Easy-CoT dataset for five tasks. See details in the paper. Each subdirectory of easy_cot is named after the base dataset it was generated from and contains two files: data.json (the output of Fine-tune-CoT applied to the base dataset) and demos.json (the output of Auto-CoT applied to data.json).
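A quick way to inspect one task's subdirectory, assuming both files are plain JSON; the task name below is an example and no assumptions are made about the field names inside the records.

```python
# Hypothetical inspection of one Easy-CoT task directory.
import json
from pathlib import Path

task_dir = Path("easy_cot/my_reasoning_task")   # hypothetical subdirectory name

with open(task_dir / "data.json") as f:
    data = json.load(f)       # Fine-tune-CoT output on the base dataset
with open(task_dir / "demos.json") as f:
    demos = json.load(f)      # Auto-CoT demonstrations built from data.json

print(f"{len(data)} Fine-tune-CoT samples, {len(demos)} Auto-CoT demos")
print(json.dumps(data[0] if isinstance(data, list) else data, indent=2)[:500])
```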
All fine-tuned models from our experiment can be downloaded here. There are five models in total, one for each base dataset.
Contains the five base datasets on complex reasoning tasks used to generate our easy_cot dataset.
- fine_tune_cot/: completed data generated by the teacher model in Fine-tune-CoT, courtesy of Ho et al.
- zero_shot_cot/: raw data for the complex reasoning task benchmarks, courtesy of Kojima et al. Not directly used in our experiment, but used to produce the fine_tune_cot/ dataset.
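To regenerate the Easy-CoT data for all five base datasets at once, a loop over fine_tune_cot/ along these lines should work, assuming one subdirectory per task with a data.json file inside (the layout and filename here are assumptions).

```python
# Hypothetical batch regeneration; subdirectory names and the data.json
# filename are assumptions about the fine_tune_cot/ layout.
import json
from pathlib import Path

base_root = Path("fine_tune_cot")
for task_dir in sorted(p for p in base_root.iterdir() if p.is_dir()):
    task = task_dir.name
    with open(task_dir / "data.json") as f:
        dataset = json.load(f)
    run_auto_cot(dataset, task, f"easy_cot/{task}")
```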