Skip to content

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

Notifications You must be signed in to change notification settings

j-min/MoChA-pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

  • PyTorch 0.4

TODOs

  • Soft MoChA
  • Hard MoChA
  • Linear Time Decoding
  • Experiment with Real-world dataset

Model figure

Model figure 1

Linear Time Decoding

It's not clear if authors' TF implementation supports decoding in linear time. They calculate energies for whole encoder outputs instead of scanning from previously attended encoder output.

References

About

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages