Skip to content

louisfb01/start-ai-engineering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Start AI Engineering in 2026 - Build real AI systems, mostly for free!

A complete guide to start and improve in AI engineering in 2026 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

This guide is intended for anyone with zero or a small background in programming, AI, or machine learning who wants to become a strong AI engineer in 2026. It is organized by how you like to learn: videos, articles, books, docs, courses, and real projects.

There is no single correct order to follow, but a classic path is from top to bottom. If you dislike books, skip them. If you do not want to follow an online course, skip that too. With enough motivation, projects, and repetition, you can absolutely learn this field.

Most resources listed here are free. Paid resources are clearly labelled, and some paid course and book links are affiliate links that support this guide at no extra cost to you. Thank you, and have fun learning!

Don't be afraid to repeat videos, learn from multiple sources, and build messy projects. Repetition and debugging are where the real learning happens.

Maintainer: louisfb01, also active on YouTube, the What's AI Podcast, and my personal newsletter if you want to see and hear more about AI.

X: @Whats_AI LinkedIn: Louis-François Bouchard YouTube: What's AI

Tag Louis-François Bouchard on X or LinkedIn if you share this guide, and feel free to suggest additions through pull requests.

If this guide helps you, please star the repo and share it. That is the main way other builders find it.

Want to know what this guide is about? Start with this video:

Watch AI Engineering Foundations: What Developers Actually Need to Know Today first, then subscribe to What's AI for more AI engineering videos.

This guide is updated throughout 2026 as the stack moves.


Table of Contents


Prerequisites and learning path

Before you start collecting resources, keep the goal clear: this guide is for becoming a better AI engineer, not merely a better agentic coder.

Quick LLM and coding-agent warning

Coding agents like Codex, Claude Code, Cursor, and similar tools can write code, scaffold apps, and speed up almost every step. You should use them. But AI engineering is the judgment layer behind the work: deciding what to build, what architecture fits, how to evaluate it, where it will fail, and whether it is reliable enough to ship.

This guide is not about outsourcing your thinking to an agent. It is about using those tools while building the foundations, taste, and decision-making ability to become a true AI engineer.

What AI engineering means in 2026

In 2026, AI engineering goes well past prompting. You need context engineering, Retrieval-Augmented Generation (RAG), tools and the Model Context Protocol (MCP), workflow and agent design, evaluations, observability, harnesses, deployment, security, and a working understanding of reasoning models.

That is also why this guide, and our courses, prioritize learning by building. I learned AI engineering by building, and I now interview and hire AI engineers for consulting work at Towards AI, so this guide is biased toward the decision-making skills I actually look for. You can learn a lot alone with coding agents, but structure and expert feedback help you turn projects into true expertise instead of a pile of fragile demos.

Suggested learning path

There is no single correct order. If you want a default path, I would do this:

  1. Watch a few foundational videos to pick up vocabulary and intuition.
  2. Pick one free course and one framework whose docs you commit to reading end to end.
  3. Pick one or two books to build a solid foundation you can return to when the tools change.
  4. Optionally take one or two advanced applied courses with real projects, especially if you want a structured path before breaking things on your own.
  5. Build two or three small but real projects that break in interesting ways.
  6. Add evaluations, tracing, and deployment before you call anything production-ready.

After that, you should have the foundations of a solid AI engineer ready for many entry-level or transition roles. Most importantly, keep learning and keep an open mind. This field changes fast, and the best engineers stay curious instead of getting religious about one model, framework, or workflow.

Difficulty guide

Resources use compact markers from 1️⃣ to 🔟. 1️⃣ means absolute beginner, like an intro Python course; 3️⃣ is beginner-friendly AI vocabulary; 5️⃣ is practical builder material you can apply in a project; 7️⃣ is production engineering depth; 9️⃣ is advanced systems or research; and 🔟 is the kind of senior-level paper or technique you may want to revisit after you have shipped a few systems. Lower numbers first, scars later.

Personalize this roadmap with an AI agent

You can use this guide with your favorite AI agent. Paste the prompt below into Codex, Claude Code, ChatGPT, Cursor, or another assistant, then tell it how you like to learn:

Use this repo as my AI engineering roadmap: https://github.com/louisfb01/start-ai-engineering

Create a personalized learning plan for me. First ask about my background, coding level, available time, budget, preferred learning style, and goals. Then choose the most relevant resources from the repo, explain why you picked them, order them from easiest to hardest, and turn them into a weekly plan with projects, checkpoints, and what I should be able to build after each stage.

If you are brand new to code

  • 1️⃣ Learn Python - Free interactive tutorial to learn Python fundamentals if you have never touched the language.
  • 1️⃣ AI Python for Beginners - DeepLearning.AI. Free short course from Andrew Ng's team, lighter on-ramp than a full bootcamp.
  • 2️⃣ Python Fundamentals + CS Concepts — A One-Stop Starter Class - Louis-François Bouchard, What's AI. Free playlist covering Python fundamentals and core computer science concepts in one place. The right starting point if you want a single resource before jumping into LLM development.
  • 2️⃣ Beginner Python for AI Engineering - Towards AI. An LLM-native Python course for people who want to go straight to building with LLMs, not through six months of classical scripting first. (Paid, $149)

If you already know some Python, you can jump into the rest of this guide. You do not need a mathematics PhD or deep research background. You do need basic Python, comfort reading docs, willingness to debug messy systems, and enough curiosity to build things that break. The last point matters more than people expect.


Start with short YouTube and video introductions

Video is still the fastest way to pick up vocabulary and mental models.

Start here for AI engineering judgment

Foundational explainer videos

YouTube channels worth subscribing to

  • 2️⃣ StatQuest with Josh Starmer - Josh Starmer. The clearest visual explanations of ML and neural network concepts on YouTube. Ideal for building solid intuition about how transformers, attention, and training actually work before you start building.
  • 3️⃣ 3Blue1Brown - Grant Sanderson. Visual math and deep learning intuition. The neural networks and attention series are widely considered the best visual introductions to these concepts.
  • 3️⃣ DeepLearning.AI - Andrew Ng's official channel. Free recorded short courses on prompting, RAG, agents, evals, and more. Most of the DeepLearning.AI short courses land here first.
  • 3️⃣ IBM Technology - Clear concept explainers on LLMs, RAG, agents, and enterprise AI. Good for quickly getting up to speed on a new concept with no background noise.
  • 3️⃣ Tech With Tim - Tim Ruscica. 1.89M subscribers. Beginner-to-intermediate coding and AI projects in Python. Strong for learners who want to build working things (AI games, assistants, chatbots, small ML projects) alongside the theory.
  • 4️⃣ What's AI - Practical AI engineering explainers from Louis-François Bouchard. Useful for RAG, agents, MCP, evals, and learning how to reason about the stack instead of only chasing tools.
  • 4️⃣ Hugging Face - Official tutorials across the open-source AI ecosystem. Covers fine-tuning, inference, datasets, and new model releases.
  • 5️⃣ LangChain - Official channel for LangChain and LangGraph. Tutorial-first videos on agents, workflows, and graph-based orchestration.
  • 5️⃣ Jeremy Howard - fast.ai co-founder. Practical, builder-oriented, strong on software craft and AI-assisted coding.
  • 5️⃣ Two Minute Papers - Károly Zsolnai-Fehér. Short, enthusiastic summaries of AI research papers. Good for staying aware of what is being published without reading every paper.
  • 5️⃣ Bycloud - Weekly video essays on AI news and research, aimed at builders.
  • 6️⃣ Andrej Karpathy - Former Tesla AI and OpenAI. Best long-form explanations of how LLMs actually work — essential mental models for anyone building on top of them.
  • 7️⃣ Umar Jamil - Line-by-line implementations of transformers, vision-language models, and LoRA. Strong for understanding what is happening inside a model when you are debugging or fine-tuning.
  • 8️⃣ Yannic Kilcher - In-depth walkthroughs of new research papers. Essential for staying current with model releases and understanding what papers actually claim vs. what they prove.

Podcasts and longer listening are collected in the Newsletters, podcasts, and blogs section below.


Books and long-form reading

If you prefer reading to watching, this path goes very far, especially with these books focusing on actually coding and building.

Books worth your time

  • 5️⃣ Building LLMs for Production - Towards AI. 465 pages covering prompting, RAG, fine-tuning, reliability, and shipping. Used as an internal reference manual in many companies. The Academy e-book version is also available. (Paid, $29 e-book)
  • 5️⃣ Hands-On Large Language Models - Jay Alammar and Maarten Grootendorst. Visual, code-first companion that pairs well with Chip Huyen's book. (Paid)
  • 5️⃣ Prompt Engineering for LLMs - John Berryman and Albert Ziegler. Written by GitHub Copilot engineers, with useful field-tested patterns. (Paid)
  • 6️⃣ LLM Engineer's Handbook - Paul Iusztin and Maxime Labonne. Production-focused, built around a real end-to-end project. Pairs with the companion code repo. (Paid)
  • 7️⃣ AI Engineering - Chip Huyen. The most-read book on O'Reilly for this space. Strong on system design, evaluation, and when each technique earns its place. (Paid)
  • 8️⃣ Build a Large Language Model (From Scratch) - Sebastian Raschka. Foundations and intuition. Code a GPT-style LLM from scratch in PyTorch, no libraries that hide the internals. The right book for developers who want to move past calling APIs and actually understand transformers, tokenization, attention, and fine-tuning. Pairs with the companion LLMs-from-scratch repo. (Paid)

Free long-form explainers that still hold up

Essential 2025-2026 articles on AI engineering

A curated short list of valuable long-form articles from 2025-2026. All are substantial reads (10+ minutes) that reward a full sitting. Topic-specific articles are in their respective sections below.

Articles from Anthropic, OpenAI, and individual practitioners (Shreya Shankar, Paul Iusztin, and others) are also referenced in the topic-specific sections below. Start with the topic you care about most and work outward.

For ongoing reading, rotate between practitioner blogs, official engineering posts, the Towards AI publication on Medium, and the Towards AI Newsletter instead of relying on one source.

A reading loop that actually works

A common mistake is reading ten articles on the same topic and building nothing. A better loop is: read one conceptual article, read one official docs page, build one tiny version yourself, then reread the article once you have scars. The second pass hits very differently.


Online courses

If you want more structure, courses are the fastest route through this material.

Deep, end-to-end programs

  • 2️⃣ AI for Work - Towards AI. 15 modules for non-developers who want to actually use AI at work. No coding required. (Paid, $399)
  • 3️⃣ 10-Hour LLM Fundamentals - Towards AI. Compact video-first crash course covering when to use prompting, RAG, fine-tuning, or agents. Useful before going deep. (Paid, $199)
  • 5️⃣ Full Stack AI Engineering - Towards AI's flagship program. 90+ lessons across prompting, RAG, fine-tuning, tools, agents, and deployment, built around one production capstone. Designed for people who want a full developer path to AI engineering. (Paid, $349)
  • 7️⃣ Agentic AI Engineering - Towards AI. 34 lessons and two production agents (a research agent and a writing workflow), covering context engineering, evaluations, observability, containers, and deployment. For people who already ship LLM apps and want to specialize. (Paid, $499)

Free docs-heavy paths

Useful DeepLearning.AI short courses (free)

Several DeepLearning.AI courses are listed in the topic sections below instead of here: AI Agents in LangGraph (under Agents), Automated Testing for LLMOps (under Evaluations), Red Teaming LLM Applications (under AI Safety), Efficient Inference with SGLang (under Deployment), and Document AI: From OCR to Agentic Doc Extraction (under Multimodal).

Which course to pick from the Towards AI offerings


Practice and projects

Reading and watching will only take you so far. You become an AI engineer by building systems that fail in expensive and educational ways.

Watch What I Look For When Hiring AI Engineers before you start your first serious project. I share how I evaluate AI engineering candidates, why decision-making matters more than polished agent-generated output, and what kinds of practice projects actually teach useful skills.

Good first projects

  • 4️⃣ A document question-answering assistant with citations and a real eval set.
  • 4️⃣ A customer support workflow with tools and structured outputs.
  • 5️⃣ A research assistant that plans, searches, reads, and writes a short brief.
  • 5️⃣ A coding helper scoped to one narrow internal task.
  • 5️⃣ A multimodal invoice or receipt parser with validation.
  • 6️⃣ Designing Real-World AI Agents Workshop - Paul Iusztin's hands-on workshop for building a Deep Research Agent plus a LinkedIn Writing Workflow as MCP servers. It includes code, slides, video, evaluation patterns, and an implement_yourself/ path designed to be rebuilt with agentic coding tools instead of copied.
  • 6️⃣ A small agent that plans, acts, checks, and retries within a budget.

Reference repos and tutorials

Framework docs for agent-oriented libraries (LangGraph, CrewAI, AutoGen, Agno) live in the Agents section below.

Questions to force yourself to answer on every project

  • Why is this prompt, tool, or architecture chosen?
  • Where and how will it fail?
  • How will I evaluate it, offline and online?
  • What will I log and inspect when it misbehaves?
  • What is the cheapest design that still clears the bar?
  • Is an agent actually the right choice here, or is a workflow enough?

If you cannot answer those, keep building.


Prompting and structured outputs

Prompting still matters in 2026. The useful version is not clever tricks. It is writing reliable contracts for non-deterministic systems.

Subtopics to cover

Clear task framing, output contracts, structured outputs and JSON schemas, few-shot examples, grounding and citations, verification loops, tool-use instructions, completion criteria, and prompt versioning.

Best resources

Treat prompts as code you version, interfaces you test, and product decisions you revisit. That framing is more useful than any list of prompting tricks.


Reasoning models and test-time compute

Reasoning models (OpenAI o-series, Anthropic Claude with extended thinking, Google Gemini Pro with thinking, DeepSeek R-models, Qwen reasoning variants) behave differently from standard chat models. They reward different prompting and break in different ways.

Subtopics to cover

When reasoning models help, when they hurt, how to set thinking budgets, how to structure input for a thinking model, extended thinking and tool use together, and cost/latency tradeoffs.

Best resources

Rule of thumb for 2026: reach for a reasoning model when the task genuinely requires multi-step planning, verification, or tool use. For simple classification, extraction, or short answers, a cheaper standard model still wins on cost and latency.


Context engineering and long context

Context engineering is one of the most important 2026 skills. The model is only as good as what you put in its context and how you stage it.

Subtopics to cover

What belongs in context and what does not, context windows and context rot, message history management, memory versus retrieval, compaction and summaries, working files and scratchpads, repo-level instructions such as AGENTS.md or CLAUDE.md, and context handoffs between runs.

Best resources

Most people try to fix bad systems by stuffing more tokens into the prompt. That usually makes results worse. The better habit is to be intentional about which instructions are permanent, which data is retrieved on demand, which state gets externalized into files or tools, and when to reset the context entirely.


Retrieval-Augmented Generation (RAG)

RAG is still a core technique. The naive "stuff some chunks into the prompt" version is no longer enough.

Subtopics to cover

Chunking strategies, embeddings, vector search, hybrid search with BM25, reranking, citations and provenance, metadata filtering, query rewriting, corrective RAG, retrieval quality evaluation, agentic retrieval, and knowing when RAG is the wrong answer.

Best resources

Do not stop at "uploaded PDF, got answer." Build one serious RAG app with citations, retrieval debugging, considered chunking choices, metadata filters, an eval set, and a way to inspect misses. That is where the real learning happens.


Embeddings, rerankers, and vector databases

Good retrieval depends on the pieces around the model.

Embedding models and rerankers

Vector databases

  • 4️⃣ Qdrant docs - Fast, production-ready, open source, free managed tier.
  • 4️⃣ Weaviate docs - Open source with built-in hybrid search and RAG modules.
  • 4️⃣ LanceDB docs - Embedded, Python-first, no server needed. Great for local RAG prototypes.
  • 4️⃣ Pinecone - Managed serverless, the most common enterprise default.
  • 4️⃣ pgvector - Vector search inside Postgres. Best choice when you already have Postgres and want to avoid a second system.
  • 4️⃣ Chroma - Light, simple, good for prototypes and tutorials.

Good practitioner write-ups


Tools, MCP, and computer use

If prompting was the first phase of AI apps, and tools the second, then in 2026 MCP and structured tool ecosystems are part of the default stack.

Subtopics to cover

Function and tool calling, tool schemas, tool selection and retries, permissions and safety boundaries, tool result formatting, MCP clients and servers, web search and code execution tools, computer use, and authentication against external systems.

Best resources

Search APIs worth knowing

Agents that need to search the web rarely call raw Google or Bing. These are the APIs most production stacks use:

  • 4️⃣ Tavily - Purpose-built search API for LLM agents with content extraction and summarization.
  • 4️⃣ Exa - Semantic search API with neural retrieval over the web.
  • 4️⃣ Brave Search API - Privacy-focused web search, common choice for agent stacks that need independent indexing.

The model is not your system. The tool layer is where most real capability and most real risk both live.


Workflows, agents, and multi-agent systems

This is where hype gets loud and engineering judgment becomes valuable.

Subtopics to cover

Workflow versus agent, single agent versus multi-agent, ReAct and tool loops, routing and orchestration, planning and reflection, human-in-the-loop, state and memory, failure modes, and when to avoid autonomy altogether.

Best resources

  • 5️⃣ AI Agents in LangGraph - Harrison Chase and DeepLearning.AI. Free. The cleanest intro to graph-based agents.
  • 5️⃣ LangGraph docs - Official graph-based orchestration docs for long-running, stateful agents.
  • 5️⃣ LlamaIndex Workflows - LlamaIndex's event-driven workflow system.
  • 5️⃣ CrewAI, AutoGen, and Agno - Framework docs for three of the main alternatives.
  • 6️⃣ Building Effective AI Agents - Anthropic. The reference post on agent vs workflow design.
  • 6️⃣ Stop Building Agent Demos - Louis-François Bouchard on the demo-to-production gap.
  • 6️⃣ Agents and Workflows - Louis-François Bouchard on when multi-agent is overengineering.
  • 6️⃣ What Makes an AI Agent Actually Agentic? - What separates a real agent from a workflow wearing an LLM hat: autonomy, memory, and resilience. Walks through refactoring a hardcoded LangGraph assistant into a ReAct-based agent with SQLite checkpointing and layered, context-aware error handling.
  • 6️⃣ Agent Architecture Guide - Louis-François Bouchard's 13-question decision framework for agent design.
  • 7️⃣ LLM Powered Autonomous Agents - Lilian Weng. The reference post that defined the field.
  • 7️⃣ Agents - Chip Huyen's long-form primer on agent design, planning, and tool use. One of the most-shared agent posts of 2025.
  • 7️⃣ 12-Factor Agents - Dex Horthy's widely-cited production-agent checklist covering state, tools, context, and reliability. Heavily referenced across 2025-2026 agent engineering discussions.
  • 7️⃣ Creating an Advanced AI Agent From Scratch with Python in 2026 - Modular architecture over framework lock-in: a flexible tool system, provider-agnostic LLM wrapper, and a ReAct-based agent orchestrator with Pydantic for type-safe tool execution. Lets you swap models and tools without touching the core loop.
  • 7️⃣ The Two Things Every Reliable Agent Needs - A framework centered on memory-first design and an anti-Goodhart scoreboard: treat memory as a core system with defined forms, functions, and dynamics, and evaluate with adversarial metrics across full episodes so agents solve the actual problem instead of gaming a proxy.
  • 7️⃣ LangChain Middleware: The Missing Layer Between Your Agent and Production - LangChain's new middleware system pulls operational concerns (summarization, human approval, retries, token tracking, dynamic routing, tool monitoring, context injection) out of agent logic and into a dedicated layer. Covers decorator vs class-style hooks, ordering rules, custom state schemas, and five production patterns.
  • 7️⃣ Google's A2A Protocol using LangGraph: Build Agent Systems That Actually Communicate - Divy Yadav. Practical deep-dive into Agent2Agent: Agent Cards for discovery, structured task lifecycles, HTTP messaging, and how A2A complements (not competes with) MCP. Covers real production failure modes — timeout handling, context mismatch, authentication drift — with a LangGraph implementation walkthrough.
  • 7️⃣ Agentic AI Engineering - Towards AI's deep dive with two shipped agents as capstones. (Paid)
  • 8️⃣ How we built our multi-agent research system - Anthropic. Real architecture behind a shipped multi-agent product.
  • 8️⃣ Building Production Text-to-SQL for 70,000+ Tables: OpenAI's Data Agent Architecture - How OpenAI built an internal data agent for its own data warehouse. Goes beyond naive query generation: six layers of context (table usage patterns, human annotations, business logic extracted from code), plus a closed-loop validation step where the agent profiles results, catches its own errors, and repairs queries. The real lesson — agent effectiveness depends on the richness of context, not the model.

Most teams should start with a workflow. Add autonomy only where it clearly buys something. That saves token spend, latency, debugging pain, and a lot of regret.


Evaluations, observability, and harnesses

The layer most people skip and rediscover the hard way.

Subtopics to cover

Golden datasets, rule-based checks, LLM-as-a-judge, regression testing, traces and spans, prompt versioning, error analysis, offline evaluations and online monitoring, harness design, and testability of agent behavior.

Best resources

If you cannot tell whether your system is improving, you are not engineering yet, you are moving vibes around.


Fine-tuning and data curation

Fine-tuning still matters, and in 2026 it is no longer the first hammer most teams reach for. Reasoning models, prompt caching, long context, and cheap high-quality base models shifted the tradeoff.

Subtopics to cover

When prompting is enough, when RAG is enough, when supervised fine-tuning helps, synthetic data generation, dataset cleaning and formatting, preference optimization and reinforcement fine-tuning, Low-Rank Adaptation (LoRA) and Decomposed Low-Rank Adaptation (DoRA), domain adaptation, and cost/maintenance tradeoffs.

Best resources

Only fine-tune after you understand the baseline and have evals. Otherwise you are tuning toward a blurry target.


Multimodal and document understanding

Many real AI products need to read images, parse PDFs, work with screenshots, or combine text and visuals.

Subtopics to cover

Vision inputs, document layout understanding beyond Optical Character Recognition (OCR), multimodal prompting, image-grounded extraction, and table and chart extraction.

Best resources

Good first project ideas: invoice extraction with validation, a receipt parser with structured outputs, a screenshot-to-action assistant, or a research workflow that extracts and cites figures from PDFs.


Voice agents and realtime AI

Voice became table stakes for many products in 2025-2026. Low-latency turn-taking and realtime multimodal APIs now compete with traditional text chat.

Subtopics to cover

Speech-to-text and text-to-speech selection, turn-taking and barge-in, session management, latency budgeting, tool use inside a voice turn, and when voice beats text.

Best resources

  • 4️⃣ Anthropic voice guidance - Pairs Claude with an external speech pipeline (ElevenLabs, Deepgram, etc.).
  • 4️⃣ ElevenLabs docs - Production voice cloning and streaming text-to-speech.
  • 4️⃣ Deepgram - Low-latency speech-to-text.
  • 5️⃣ OpenAI Realtime API - The primary realtime reference for most teams. Native speech-to-speech with tool use.
  • 5️⃣ Gemini Live API - Google's realtime multimodal endpoint.
  • 5️⃣ Pipecat - Open-source voice agent framework. Free.
  • 5️⃣ LiveKit Agents - Realtime agent infrastructure with strong WebRTC support.

Deployment, inference, and open-weight models

This is where "my notebook works" becomes "my product survives real users and traffic."

Subtopics to cover

Application Programming Interface (API) deployment, containers, concurrency, OpenAI-compatible serving, prompt and KV cache use, vLLM and other inference servers, local models and privacy tradeoffs, cost and latency and throughput tradeoffs, self-hosted versus serverless, and reliability, scaling, and rollbacks.

Serving and inference

  • 4️⃣ Ollama and Ollama docs - The easiest way to run open models locally.
  • 4️⃣ LM Studio - Graphical User Interface (GUI) for local inference, good for non-developers.
  • 6️⃣ vLLM docs and vLLM Quickstart - UC Berkeley's high-throughput inference server. De facto standard for self-hosting.
  • 6️⃣ SGLang - Structured generation and batching, strong for constrained outputs.
  • 6️⃣ Text Generation Inference (TGI) - Hugging Face's production-ready serving stack.
  • 6️⃣ llama.cpp - Central Processing Unit (CPU) and edge inference with GGUF quantization. The main path to running models on laptops.
  • 6️⃣ Efficient Inference with SGLang - DeepLearning.AI short course, free.

Cloud GPU and managed inference

LLM gateways and routing layers

Most production stacks sit one layer above the provider to handle fallbacks, rate limits, cost tracking, and per-request model selection:

  • 5️⃣ LiteLLM - Open-source proxy and Python SDK that lets you call 100+ LLM providers through a unified OpenAI-compatible interface. De facto standard for multi-provider applications.
  • 5️⃣ OpenRouter - Hosted router with a single API across hundreds of models, including preview access to models before they hit official APIs.
  • 5️⃣ Portkey - AI gateway with caching, observability, and guardrails built on top of the routing layer.

Open-weight model families in 2026

Questions you should be able to answer cleanly

Why you chose an API model or an open-weight model. Why you chose that latency and cost tradeoff. Why the system is safe enough to expose to real users. How you would debug a bad output in production. How the system behaves when a dependency fails. If you can answer those, you are already ahead of many AI app builders.


AI coding agents and developer tools

How AI engineers actually work changed in 2025-2026. Coding agents and agent-native editors are now part of daily practice and part of what teams expect you to have used.

Tools worth learning

  • 3️⃣ Claude Code - Anthropic's Command Line Interface (CLI) agent with the Claude Agent Software Development Kit (SDK) behind it. Strong for long-running, tool-heavy tasks.
  • 3️⃣ Cursor - Integrated Development Environment (IDE) with agent-native editing. One of the most widely-used AI IDEs as of 2026.
  • 3️⃣ GitHub Copilot - Now includes agent mode and skills. The default for many enterprise teams.
  • 3️⃣ Codex CLI - OpenAI's long-horizon coding agent.
  • 3️⃣ Gemini CLI - Google's open-source command-line agent.
  • 3️⃣ Windsurf - Cognition's (formerly Codeium's) agent-native editor, focused on flow and context handling.

Articles worth reading

Rule of thumb: pick one coding agent, commit to it for a month, and learn its scaffolding well. Rotating between tools is usually slower than mastering one.


AI safety, security, and guardrails

This part is not optional. If your AI system can search the web, call tools, touch private data, or send actions into other software, you need to think about risk early.

Subtopics to cover

Prompt injection, sensitive data handling, system prompt leakage, tool permissions, excessive agency, overreliance, output validation, human review thresholds, red teaming, and governance.

Core frameworks and references

Practitioner reading

Guardrail libraries

Treat LLM output like work from a fast intern with occasional alien instincts. You do not blindly trust it. You design systems around it.


Communities, subreddits, and Discords

The social layer where most of the real-time knowledge actually moves.

Discord servers to join

  • 2️⃣ Towards AI Discord - 80,000+ builders, direct access to the Towards AI team, weekly events, channels for RAG, agents, fine-tuning, and job search.
  • 2️⃣ Learn AI Together - Louis-François Bouchard's nearly 100,000-member server for AI enthusiasts, study groups, and Kaggle teammates.
  • 3️⃣ Hugging Face Discord - Home for the open-source AI ecosystem. Channels for every major model family and library.
  • 3️⃣ LangChain Discord - Official community for LangChain and LangGraph users.
  • 3️⃣ LlamaIndex Discord - Active channels on RAG, agents, and workflows.
  • 4️⃣ MLOps Community - Active Slack community covering production ML and increasingly LLM operations. One of the best places to ask real production questions.
  • 5️⃣ Modular (MAX) Discord - Mojo and MAX users, good for inference and performance topics.

Subreddits worth following

  • 2️⃣ r/artificial - General AI news and discussion.
  • 2️⃣ r/ArtificialInteligence - Broader AI community with a mix of news, opinion, and tutorials.
  • 2️⃣ r/learnmachinelearning - Beginner-friendly, good for study questions and roadmap discussions.
  • 3️⃣ r/OpenAI - News, API discussion, and model behavior debugging.
  • 3️⃣ r/ClaudeAI - Claude Code, Claude.ai, and Anthropic product discussion.
  • 3️⃣ r/LangChain - LangChain and LangGraph community troubleshooting.
  • 3️⃣ r/Rag - Focused subreddit on retrieval-augmented generation patterns.
  • 3️⃣ r/AI_Agents - Agent-specific community with framework debates and build-in-public threads.
  • 4️⃣ r/LocalLLaMA - By far the most useful subreddit for open-weight models, inference benchmarks, and quantization tips.
  • 4️⃣ r/computervision - Extracting useful information from images and video.
  • 4️⃣ r/LatestInML - Curated stream of newer ML developments.
  • 5️⃣ r/MachineLearning - The biggest machine learning subreddit, research-heavy.

Cheat sheets and decision guides


Newsletters, podcasts, and blogs

Newsletters

  • 3️⃣ Towards AI Newsletter - Weekly "What happened this week in AI" coverage with technical depth, benchmarks, and opinion.
  • 3️⃣ Last Week in AI - Andrey Kurenkov and Jeremie Harris. Weekly news roundup.
  • 3️⃣ The Batch - Andrew Ng's weekly summary of research and industry.
  • 4️⃣ Louis-François Bouchard's Substack - Short essays on harness engineering, agents, and the practice of AI engineering.
  • 4️⃣ Latent Space - swyx and Alessio Fanelli. Industry-heavy AI engineering newsletter with interviews.
  • 5️⃣ Decoding AI - Paul Iusztin on production machine learning and AI engineering.
  • 5️⃣ The Neural Maze - Miguel Otero Pedrido. Practical production ML and AI systems newsletter for builders tired of hype, with end-to-end projects, agent systems, deployment tradeoffs, and lessons from real ML engineering work.
  • 5️⃣ Profitable AI Blog - Tobias Zwingmann. Applied AI frameworks and real-world examples focused on profitable business outcomes, use-case selection, and moving beyond prototypes.
  • 5️⃣ AI Tidbits - Sahar Mor's technical briefings on new techniques.
  • 6️⃣ Interconnects - Nathan Lambert. Post-training, reasoning models, and RLHF explained with research-grade clarity.
  • 6️⃣ Import AI - Jack Clark's research-heavy roundup with policy perspective.
  • 6️⃣ Ahead of AI - Sebastian Raschka's monthly deep dives.

Podcasts

  • 3️⃣ Last Week in AI - Weekly news podcast companion to the newsletter.
  • 3️⃣ Lex Fridman Podcast - Occasional AI episodes with researchers and founders.
  • 4️⃣ The What's AI Podcast - Louis-François Bouchard. Interviews with AI builders and researchers.
  • 4️⃣ Latent Space - swyx and Alessio Fanelli. Deep interviews with practitioners shipping real systems.
  • 4️⃣ ThursdAI - Alex Volkov's weekly live show, podcast, and newsletter breaking down major AI news with builders. Strong for model releases, open-source AI, tooling, and practical context on what changed this week.
  • 5️⃣ Machine Learning Street Talk - Tim Scarfe. Long-form research conversations.

Practitioner blogs worth bookmarking

  • 4️⃣ Simon Willison - Near-daily AI engineering posts. The single most useful blog in this space.
  • 4️⃣ Louis-François Bouchard - Essays on harness engineering, agents, and hiring.
  • 4️⃣ Hamel Husain - Practical evals and consulting notes.
  • 4️⃣ Eugene Yan - Patterns, evaluation, and applied ML writing.
  • 4️⃣ Chip Huyen - System design for AI products.
  • 6️⃣ Sebastian Raschka - Monthly deep dives on LLM research and implementation.
  • 6️⃣ Lilian Weng - Longer-form research-style posts on agents, reasoning, and safety.
  • 6️⃣ Shreya Shankar - Research-grade posts on evals and data flywheels.
  • 6️⃣ Jason Liu - Consulting notes from enterprise RAG and agents work.
  • 6️⃣ Philipp Schmid - Staff Engineer at Google DeepMind (formerly Hugging Face). Practical fine-tuning and Gemini-focused tutorials.

Curated reading lists

  • 5️⃣ AI Engineering Field Guide - Alexey Grigorev (DataTalks.Club). Free. Research into AI engineering interview assignments, take-home challenges, hiring practices, and required skills from Q4 2025 / Q1 2026. Grounded in analysis of 51 companies and 100+ GitHub take-home repos. Includes role definitions, skill breakdowns, learning paths by background, and a curated awesome.md of the most-referenced 2025-2026 articles, talks, and interview resources.
  • 5️⃣ Agents Towards Production - Nir Diamant. Free. 28+ end-to-end, code-first tutorials for production-grade GenAI agents. Created in 2025 and expanded through 2026, with company-contributed tutorials covering stateful workflows, vector memory, MCP, Docker deployment, FastAPI endpoints, security guardrails, GPU scaling, browser automation, multi-agent coordination, observability, and evaluation. One of the cleanest hands-on resources on shipping agents.
  • 6️⃣ Awesome-LLM - Hannibal046. Free. One of the largest and most actively maintained LLM resource indexes on GitHub. Covers milestone papers, frontier models (DeepSeek V3/R1, Qwen 3, Kimi K-2, GPT-5, Claude 4, Gemini 2.5), open LLMs, training frameworks, deployment tools, courses, and specialized sub-lists (RAG, inference, compression, MoE, healthcare, 3D, Japanese). Updated continuously, useful as a broad navigational index when you know a topic exists but do not know where to start.
  • 7️⃣ The 2025 AI Engineering Reading List - Latent Space (swyx and Alessio Fanelli). The definitive paper and resource list for AI engineers, organized by topic: agents, evals, RAG, fine-tuning, inference, and coding agents. Dense, opinionated, and updated annually. Required reading if you want to understand where the field came from and where it is heading.

Official docs and learning hubs


People to follow

On Twitter/X and LinkedIn, most of the useful real-time signal comes from a relatively small group of practitioners. A good starter list:

  • 4️⃣ Louis-François Bouchard - Co-founder and Chief Technology Officer, Towards AI. Harness engineering, agents, AI education.
  • 4️⃣ Andrew Ng - DeepLearning.AI founder, weekly Batch newsletter.
  • 4️⃣ swyx (Shawn Wang) - Latent Space, AI engineer community builder.
  • 4️⃣ Harrison Chase - LangChain founder.
  • 4️⃣ Omar Sanseviero - Hugging Face, open-source LLMs.
  • 4️⃣ Logan Kilpatrick - Google DeepMind, working on Google AI Studio, the Gemini API, and Kaggle; formerly led developer relations at OpenAI. Useful for Gemini developer ecosystem updates, AI Studio workflows, and fast AI app prototyping.
  • 4️⃣ Alex Volkov - Host and curator of ThursdAI, AI Evangelist at Weights & Biases, and a strong real-time source for weekly model releases, open-source AI, tooling, and builder commentary.
  • 5️⃣ Simon Willison - Near-daily practical AI engineering posts. Also active on Mastodon and his blog.
  • 5️⃣ Hamel Husain - Evals and AI consulting notes.
  • 5️⃣ Jason Liu - RAG, consulting, structured outputs.
  • 5️⃣ Chip Huyen - Systems thinking for AI products.
  • 5️⃣ Philipp Schmid - Google DeepMind DevRel, formerly Hugging Face. Fine-tuning, Gemini, and open-model how-tos.
  • 5️⃣ Jeremy Howard - fast.ai co-founder, deep learning and software craft.
  • 6️⃣ Andrej Karpathy - Former Tesla AI and OpenAI. Long-form teaching, new architectures, occasional demo releases.
  • 6️⃣ Sebastian Raschka - Research and implementation detail on LLMs.
  • 6️⃣ Shreya Shankar - Evaluation, data pipelines, and research-to-practice.
  • 6️⃣ Aran Komatsuzaki - Fast, curated paper summaries.
  • 6️⃣ Jerry Liu - LlamaIndex founder.
  • 6️⃣ Jack Clark - Anthropic co-founder, Import AI newsletter.
  • 6️⃣ Nathan Lambert - Interconnects newsletter. One of the clearest writers on post-training, RLHF, and reasoning models.
  • 6️⃣ Dex Horthy - HumanLayer founder, creator of the 12-Factor Agents reference. Production-agent engineering.
  • 6️⃣ Lilian Weng - Former OpenAI research lead. Long-form posts on agents, reasoning, and safety that are cited everywhere.
  • 8️⃣ Yann LeCun - Meta Chief AI Scientist, Turing Award laureate.

How to find an AI engineering job

The market is messy. The signal is clearer than people think.

What companies actually want

People who can take a vague problem, make reasonable assumptions, build a baseline, evaluate it, document tradeoffs, and ship something testable. That is closer to real work than trivia-style interviews.

Best resources

What to do concretely

Ship two to four public projects that are small but serious. Write short READMEs that explain architecture choices, cost and latency tradeoffs, and failure modes. Include tests and at least one evaluation dataset. Show traces, monitoring, or experiment logs when relevant. Learn to explain why you chose not to use an agent in some places. Be able to compare prompting, RAG, fine-tuning, workflow, and agent approaches for a given problem. Many candidates can now generate code. Far fewer can show judgment.


Learn more and build more with AI

Use the models themselves to help you learn. That does not mean outsourcing your thinking. It means using them intelligently: ask for alternative architectures, ask them to critique your evaluation plan, ask them to generate synthetic test cases, ask them to explain a docs page you half-understand, ask them to refactor your prompt into a clearer contract, ask them to produce failing tests before you implement a feature, or ask them to compare two designs under cost and latency constraints.

A self-learning loop that compounds

  1. Give the model your goal.
  2. Ask it for three plausible approaches.
  3. Pick one and implement it.
  4. Make it run end to end.
  5. Evaluate it on a small golden dataset.
  6. Ask the model to explain the failure cases you found.
  7. Repeat.

That loop works frighteningly well when you keep it tight.


Final note

AI engineering in 2026 is a systems craft. Learn enough theory to avoid magical thinking. Learn enough tooling to build quickly. Learn enough evaluation to trust what you ship. Learn enough product judgment to avoid building the wrong thing faster. And above all, keep shipping. That is still the shortcut.

If you found this guide useful, please star the repo and share it with one person who could use it. That is how it keeps reaching the right people.

Tag Louis-François Bouchard on X or LinkedIn if you share this guide.

If you'd like to support our work, joining any Towards AI Academy course directly funds more free content like this one.

This guide is updated throughout 2026 as the stack moves. Suggestions and pull requests are welcome.

About

A complete guide to start and improve in AI engineering in 2026 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors