Each heading is a date, displayed in YYMM format.
Each bullet point is a paper, with a link to its summary written in markdown. Under each bullet point is an indented point listing some "tags" for easy searching with Ctrl+F. All tags use hyphens instead of spaces. The order of tags is irrelevant.
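For searches that go beyond Ctrl+F, the layout described above can also be filtered with a short script. The following is a minimal sketch, not part of this repository: the function name `find_papers` and the sample text are hypothetical, and it assumes each tag line is an indented bullet directly beneath its paper bullet.

```python
def find_papers(markdown_text, tag):
    """Return the titles of papers whose tag line contains `tag` exactly."""
    matches = []
    current_title = None
    for line in markdown_text.splitlines():
        stripped = line.strip()
        if not stripped.startswith("-"):
            continue  # headings and blank lines carry no paper or tags
        content = stripped[1:].strip()
        if line[0] not in " \t":
            # Top-level bullet: a paper entry.
            current_title = content
        elif current_title is not None:
            # Indented bullet: comma-separated tags for the current paper.
            tags = {t.strip() for t in content.split(",")}
            if tag in tags:
                matches.append(current_title)
    return matches


# Hypothetical sample in the layout assumed above.
sample = """\
- World Models (2018) - Ha, Schmidhuber
  - rl, vae, evolutionary-algorithms, world-models
- The Measure of Intelligence (2019) - Chollet
  - rl, agi
"""
print(find_papers(sample, "vae"))
```

Because tags are compared as whole strings, searching for `world` does not match `world-models`, which is one reason hyphenated tags are convenient.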
- How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings (2019) - Ethayarajh
  - embeddings, bert, elmo, gpt-2
- Discounted Reinforcement Learning is Not an Optimization Problem (2019) - Naik, Shariff, Yasui, Sutton
  - rl, optimization
- Training Agents using Upside-Down Reinforcement Learning (2019) - Srivastava, Shyam, Mutz, Jaśkowski, Schmidhuber
  - rl, udrl, doom, a2c, dqn
- Contextual Word Representations: A Contextual Introduction (2019) - Smith
  - nlp, embeddings, fine-tuning, transfer-learning
-
  - transformer, nlp, nmt, translation, attention
- Universal Language Model Fine-tuning for Text Classification (2018) - Howard, Ruder
  - nlp, fine-tuning, lm, transfer-learning, embeddings
- The Measure of Intelligence (2019) - Chollet
  - rl, agi
-
  - lm, elmo, nlp, fine-tuning, transfer-learning, embeddings
-
  - rl, attention, atari, interpretable-models
- The Consciousness Prior (2017) - Bengio
  - thought, rl, vae, gan, unsupervised
-
  - rl, replay-memory, biological
- Value Iteration Networks (2016) - Tamar, Wu, Thomas, Levine, Abbeel
  - rl, trpo, planning, maze
- Trust Region Policy Optimization (2015) - Schulman, Levine, Moritz, Jordan, Abbeel
  - rl, atari, trpo, policy-gradient, optimization
- Prioritized Experience Replay (2016) - Schaul, Quan, Antonoglou, Silver
  - rl, dqn, atari, replay-memory
- Human-level Control Through Deep Reinforcement Learning (2015) - Mnih et al.
  - rl, dqn, atari, replay-memory
- Gated Path Planning Networks (2018) - Lee, Parisotto, Chaplot, Xing, Salakhutdinov
  - rl, maze, doom, planning
- Active Neural Localization (2018) - Chaplot, Parisotto, Salakhutdinov
  - rl, maze, doom, a3c, localization
-
  - rl, doom, rl-text, a3c
- A Brief Survey of Deep Reinforcement Learning (2017) - Arulkumaran, Deisenroth, Brundage, Bharath
  - rl, introduction, survey
-
  - ml, decision-tree, ml4code
- Reinforcement Learning with Attention that Works (2019) - Manchin, Abbasnejad, van den Hengel
  - rl, atari, attention, ppo
- World Models (2018) - Ha, Schmidhuber
  - rl, vae, evolutionary-algorithms, world-models
-
  - nlp, nli, sts, fine-tuning, transfer-learning, elmo, bert
- Playing Atari with Six Neurons (2019) - Cuccu, Togelius, Cudre-Mauroux
  - rl, atari, neuroevolution, evolutionary-algorithms
-
  - lm, ml4code