Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Commit

Permalink
added director folder. (#4597)
Browse files Browse the repository at this point in the history
  • Loading branch information
kushalarora committed Jun 15, 2022
1 parent 7082658 commit a3d2d1d
Showing 1 changed file with 13 additions and 0 deletions.
13 changes: 13 additions & 0 deletions projects/director/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# DIRECTOR: Generator-Classifiers For Supervised Language Modeling


Kushal Arora, Kurt Shuster, Sainbayar Sukhbaatar, Jason Weston

<!-- Paper: https://arxiv.org/2206. -->


## Abstract

Current language models achieve low perplexity but their resulting generations still suffer from toxic responses, repetitiveness and contradictions. The standard language modeling setup fails to address these issues. In this paper, we introduce a new architecture, {\sc Director}, that consists of a unified generator-classifier with both a language modeling and a classification head for each output token. Training is conducted jointly using both standard language modeling data, and data labeled with desirable and undesirable sequences. Experiments in several settings show that the model has competitive training and decoding speed compared to standard language models while yielding superior results, alleviating known issues while maintaining generation quality. It also outperforms existing model guiding approaches in terms of both accuracy and efficiency.


0 comments on commit a3d2d1d

Please sign in to comment.