Suggested reading:
- A Neural Probabilistic Language Model
- Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
- LONG SHORT-TERM MEMORY
- Bidirectional recurrent neural networks
- Distributed Representations of Words and Phrases and their Compositionality
- GloVe: Global Vectors for Word Representation
- Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
- Sequence to Sequence Learning with Neural Networks
- BLEU: a Method for Automatic Evaluation of Machine Translation
- Neural Machine Translation by Jointly Learning to Align and Translate
- Attention Is All You Need
- Show, Attend and Tell: Neural Image Caption Generation with Visual Attention