Skip to content

causal decoder based on convolutions only (no attention): can be applied to ubbounded sequence lengths; the prediction of the next token depends on *all* previous tokens; allows autoregressive sampling; highly gpu-parralellizable; trained with teacher forcing;

Notifications You must be signed in to change notification settings

sliorde/conv-decoder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

About

causal decoder based on convolutions only (no attention): can be applied to ubbounded sequence lengths; the prediction of the next token depends on *all* previous tokens; allows autoregressive sampling; highly gpu-parralellizable; trained with teacher forcing;

Topics

Stars

Watchers

Forks

Languages