Stars
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Open source platform for the machine learning lifecycle
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Train transformer language models with reinforcement learning.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
The official Python SDK for Model Context Protocol servers and clients
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
On-device AI across mobile, embedded and edge for PyTorch
Composable building blocks to build Llama Apps
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Build resilient language agents as graphs.
🦜🔗 Build context-aware reasoning applications
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
the AI-native open-source embedding database
A library for efficient similarity search and clustering of dense vectors.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
State-of-the-Art Text Embeddings
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A high-throughput and memory-efficient inference and serving engine for LLMs
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks
OpenTofu lets you declaratively manage your cloud infrastructure.