GitHub - tifoit/casual-digressions: Interesting readings

Goku Mohandas

One Shot Learning

[Matching Networks for One Shot Learning](notes/oneshot.md) [arXiv]

Recommendation Engines

[Deep Neural Networks for YouTube Recommendations](notes/youtube_recommendations.md) [[Google](https://research.google.com/pubs/pub45530.html)]

Representation Learning

[Doctor AI: Predicting Clinical Events via Recurrent Neural Networks](notes/docai.md) [[arXiv](http://arxiv.org/abs/1511.05942)]
[Distributed Representations of Words and Phrases and their Compositionality](notes/word2vec_mikolov.md) [[NIPS](https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf)]
Multi-layer Representation Learning for Medical Concepts [[arXiv](https://arxiv.org/abs/1602.05568)]

Text Classification

[Convolutional Neural Networks for Sentence Classification](notes/cnn_text.md) [[arXiv](http://arxiv.org/abs/1408.5882)]
Recurrent Neural Network Regularization [[arXiv](http://arxiv.org/abs/1409.2329)]
Grammar as a Foreign Language [[arXiv](http://arxiv.org/abs/1412.7449)]

Seq-to-Seq Models (translation)

[Sequence to Sequence Learning with Neural Networks](notes/seq_to_seq_rnn.md) [[arXiv](https://arxiv.org/abs/1409.3215)]
[Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation](notes/rnn_encode_decode.md) [[arXiv](http://arxiv.org/abs/1406.1078)]
[Neural Machine Translation by Jointly Learning to Align and Translate](notes/rnn_attention.md) [[arXiv](http://arxiv.org/abs/1409.0473)] - Attention in RNNs
[On Using Very Large Target Vocabulary for Neural Machine Translation](notes/rnn_softmax.md) [[arXiv](http://arxiv.org/abs/1412.2007)] - Sampled Softmax
[Pointer Sentinel Mixture Models](notes/pointer_sentinel.md) [arXiv]
[Context-Dependent Word Representation for Neural Machine Translation](notes/context.md) [arXiv]
[Learning to Translate in Real-time with Neural Machine Translation](notes/real_time_NMT.md) [arXiv]
[Fully Character-Level Neural Machine Translation without Explicit Segmentation](notes/fully_char.md) [arXiv]

Neural Conversation Models / QA

[A Neural Conversational Model](notes/conversation.md) [[arXiv](http://arxiv.org/abs/1506.05869)]
End-To-End Memory Networks [arXiv]
[Ask Me Anything: Dynamic Memory Networks for Natural Language Processing](notes/ama.md) [arXiv]
[Dynamic Memory Networks for Visual and Textual Question Answering](notes/visual_qa.md) [arXiv]
[Dynamic Coattention Networks For Question Answering](notes/coattention.md) [arXiv]
[Richard Socher on the Future of Deep Learning](notes/future_socher.md) [OReilly]
[A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks] [arXiv]
Bidirectional Attention Flow for Machine Comprehension [arXiv]
[Generating Long and Diverse Responses with Neural Conversation Models](notes/diverse.md) [arXiv]
[Gated-Attention Readers for Text Comprehension](notes/ga.md) [arXiv]
[FVQA: Fact based Visual Question Answering](notes/fvqa.md) [arXiv]
[Query-Reduction Networks for Question Answering](notes/qrn.md) [arXiv]

Logic/Reasoning

[Chains of Reasoning over Entities, Relations, and Text using Recurrent Neural Networks] [arXiv]
[Deep API Learning] [arXiv]

Reinforcement Learning

Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks [[arXiv](http://arxiv.org/abs/1609.02993)]
Third Person Imitation Learning [arXiv]

Google DeepMind

WaveNet: A Generative Model for Raw Audio [[arXiv](https://arxiv.org/abs/1609.03499)] [[Tutorial](https://deepmind.com/blog/wavenet-generative-model-raw-audio/)]
Decoupled Neural Interfaces using Synthetic Gradients [[arXiv](https://arxiv.org/abs/1608.05343)] [[Tutorial](https://deepmind.com/blog/decoupled-neural-networks-using-synthetic-gradients/)]

Neural Turing Machines

[Neural Turing Machines](notes/ntm.md) [[arXiv](http://arxiv.org/abs/1410.5401)]
[Hybrid Computing using a Neural Network with Dynamic External Memory] [Nature]

Optimization/Architecture

[Highway Networks](notes/highway.md) [[arXiv](https://arxiv.org/abs/1505.00387)]
[Maxout Networks] [arXiv]
[HyperNetworks](notes/hypernetworks.md) [[arXiv](https://arxiv.org/abs/1609.09106)]
[Using Fast Weights to Attend to the Recent Past](notes/fast_weights.md) [[arXiv](https://arxiv.org/abs/1610.06258)]
Quasi-Recurrent Neural Networks [arXiv]
Learning to learn by gradient descent by gradient descent [arXiv]
GRAM: Graph-based Attention Model for Healthcare Representation Learning [arXiv]
[Language Modeling with Gated Convolutional Networks] [arXiv]
[Value Iteration Networks] [arXiv]
[Adding Gradient Noise Improves Learning for Very Deep Networks] [arXiv]
[Outrageously Large Neural Networks: The Sparsely-gated Mixture-of-Experts Layer] [Open Review]
