Tracking progress in AI research and my journey as a student.
Pre-Attention
- Distributed Representations of Words and Phrases and their Compositionality - Word2Vec (2013)
- Generative Adversarial Networks (2014)
- Conditional Generative Adversarial Nets (2014)
2017
2018
2019
2020
- Language Models are Few-Shot Learners - GPT3
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - ViT
2021
2023
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models - CoT
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
- Generative Agents: Interactive Simulacra of Human Behavior - Smallville
- Large Language Models Are Zero-Shot Time Series Forecasters - LLMTime
- Gemini: A Family of Highly Capable Multimodal Models - Gemini
- (VIDEO) Sebastien Bubeck -- Sparks of AGI: early experiments with GPT-4
- (VIDEO) Andrej Karpathy -- Intro to Large Language Models - LLM OS
2024
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
- ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
- Video generation models as world simulators - OpenAI's Sora
- (VIDEO) Cognition Labs -- Introducing Devin, the first AI software engineer
- Pika, Figure AI, DBRX, Anthropic's Claude 3 Opus
RAG (2023 / 2024)
- Question-Answering Based Summarization of Electronic Health Records using Retrieval Augmented Generation
- Speak Like a Native: Prompting Large Language Models in a Native Style
- Corrective Retrieval Augmented Generation
- Dense Passage Retrieval for Open-Domain Question Answering
- Large Language Models for Mathematical Reasoning: Progresses and Challenges
- Retrieval-Augmented Generation for Large Language Models: A Survey