Amortizing intractable inference in large language models

This repository contains code for GFlowNet fine-tuning of language models, as described in the paper:

Amortizing intractable inference in large language models
Edward J. Hu*, Moksh Jain*, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin
Paper: https://arxiv.org/abs/2310.04363

BibTeX

@article{hu2023amortizing,
  title={Amortizing intractable inference in large language models},
  author={Hu, Edward J. and Jain, Moksh and Elmoznino, Eric and Kaddar, Younesse and Lajoie, Guillaume and Bengio, Yoshua and Malkin, Nikolay},
  year={2023},
  journal={arXiv preprint 2310.04363}
}

Visit the subdirectories to find code and documentation for each experiment in the paper:

Random number generation (§2): rng
Sentence continuation (§4.1): next_sentence
Story infilling (§4.2): infill_subj_arithmetic
Subjectivity classification (§4.3): infill_subj_arithmetic
Arithmetic with tool use (§4.4): infill_subj_arithmetic

Please contact us or post an issue if you have any questions.

GFNOrg/gfn-lm-tuning
