Lists (1)
Sort Name ascending (A-Z)
Stars
Comprehensive guide to learn RAG from basics to advanced.
[CVPR 2025] VGGT: Visual Geometry Grounded Transformer
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
NVR with realtime local object detection for IP cameras
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Lumina-T2X is a unified framework for Text to Any Modality Generation
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Official repository of In-Context LoRA for Diffusion Transformers
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se…
Low code web framework for real world applications, in Python and Javascript
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
To simplify and streamline LLM operations, empowering developers and organizations to harness the full potential of large language models with ease.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[CVPR 2025] Video Depth without Video Models
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
RiverSnap - estimation of river hydraulic parameters using machine learning/AI models
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
Repository for the "Building LLMs for Production" book by Towards AI.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
This SDK is now deprecated, use the new unified GenAI SDK.