Starred repositories
🦜🔗 Build context-aware reasoning applications
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
12 Weeks, 24 Lessons, AI for All!
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
10 Weeks, 20 Lessons, Data Science for All!
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
A simple screen parsing tool towards pure vision based GUI agent
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Examples and guides for using the Gemini API
Anthropic's educational courses
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
A series of large language models trained from scratch by developers @01-ai
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Sweep: AI coding assistant for JetBrains
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
Inpaint anything using Segment Anything and inpainting models.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper