Disclaimer: This repository is maintained by the founders of SID Tech Inc. and other volunteers of the Y Combinator Community. This repository and SID Tech Inc. are not affiliated with, sponsored or endorsed by Y Combinator.
This is a curated collection of AI developer tools built by YC companies.
We're aiming to serve as a reliable starting point for LLM/ML developers.
- 🔭 Analytics & Monitoring
- 💾 Vector DB & Embeddings
- 🎯 Data Integrations & Retrieval
- 🚧 Infrastructure
- 🔎 LLM Serving & Fine-Tuning
- 📝 Dataset Generation & Handling
- 🔒 Security
- 💬 Prompt Management & Testing
- 🥁 Orchestration
- 🔈 Audio
- 🤖 Making Development Easier
- Humanloop: Humanloop is like datadog for LLMs. They give you the tools you need to evaluate LLM apps and then take action to improve them.
- Helicone: The easiest way to capture data from your LLMs (Open Source).
- Langfuse: Open-source analytics for LLM applications. (Demo / Docs)
- UpTrain: Open-source toolkit to evaluate and monitor LLM applications on aspects like hallucinations, bias, tonality, correctness, etc. (Demo / Docs)
- Structured: LLM tool transforming complex system log data into easily understandable insights. (Demo)
- Traceloop: Deploy with confidence. Automatically evaluate and monitor changes to models, prompts, and LLM architectures.
- BerriAI: A simple & light package to call OpenAI, Azure, Cohere, and Anthropic API Endpoints.
- Parea: Improve and monitor the performance of your LLM apps through rigorous testing and version control.
- Axilla: Open-source AI framework for TypeScript that covers the whole lifecycle: document ingestion & retrieval, continuous evaluation, serving, and monitoring. (Docs)
- DAGWorks: The observability and monitoring solution for Hamilton. Get lineage, a catalog, and observability on top of Hamilton with a one-line code change.
- HegelAI's PromptTools: Open-source tools for evaluation and experimentation with prompts, models, and vector databases. (Demo / Docs)
- Supabase Vector: Open-source Vector Toolkit for Postgres. Use the Supabase client libraries to store, index, and query your vector embeddings at scale. (Demo / Docs)
- LanceDB: Open-source, developer-friendly vector database for multi-modal AI. Reduce unstructured storage costs by 80% and get 1000x faster performance than parquet for AI. (Demo / Docs)
- SID.ai: Fully-hosted retrieval pipeline that makes it easy to connect services like Google Mail, Notion, GDrive, or fully custom data. In one afternoon, you can connect to any data source you'd like and instantly scale to millions of users. (Demo / Docs).
- SID.ai: Connect customers' data from GSuite, Notion, Mail, etc. to your LLM app in one afternoon. Simply add a "Connect" button, then call our API to retrieve context. SID takes care of the embeddings, sync, and hosting. (Demo / Docs)
- Automorphic's Trex': Intelligently transform unstructured data to structured JSON, SQL, or other context-free grammar output. (Demo / Docs)
- Axilla: Open-source AI framework for TypeScript that covers the whole lifecycle: document ingestion & retrieval, continuous evaluation, serving, and monitoring. (Docs)
- Outerbase: The interface for your database. EZQL is our open-source natural language to SQL agent that allows anyone to ask their data questions. (Demo / Docs)
- Anarchy: LLM infrastructure for developers. Use Anarchy to run open-source models efficiently and augmented with capabilities.
- SID.ai: Fully-hosted retrieval pipeline that makes it easy to connect services like Google Mail, Notion, GDrive, or fully custom data. In one afternoon, you can connect to any data source you'd like and instantly scale to millions of users. (Demo / Docs).
- Ivy: Accelerate Your AI With One Line of Code. (Demo / Docs)
- Pump: The fastest way to save 60% on AWS for free. Pump uses AI & group buying to automate cost-saving with no engineering effort.
- Cedana: Intelligently migrate AI workloads across instances to improve resource utilization, enable job-level SLAs and increase reliability for cost-effective, scalable training and inference. (Demo / Docs)
- OpenAI: Needs no introduction. (Demo / Docs)
- BerriAI: A simple & light package to call OpenAI, Azure, Cohere, Anthropic API Endpoints.
- Anarchy: LLM infrastructure for developers. Use Anarchy to run open-source models efficiently and augmented with capabilities.
- Ivy: Accelerate Your AI With One Line of Code. (Demo / Docs)
- Cedana: Intelligently migrate AI workloads across instances to improve resource utilization, enable job-level SLAs and increase reliability for cost-effective, scalable training and inference. (Demo / Docs)
- pyq AI: Easy way for developers to train and deploy task-specific AI models in the cloud. Pyq does so by providing easy-to-use software that takes in your datasets and task as inputs, and outputs a custom AI model.
- Flower: Open-source framework for training AI on distributed data using federated learning. Companies use Flower to easily improve their AI models on sensitive data they could not leverage before. (Demo / Docs)
- FiddleCube: Generate high-quality datasets for fine-tuning LLMs in minutes.
- Scale: Scale has pioneered in the data labeling industry by combining AI-based techniques with human-in-the-loop, delivering labeled data at unprecedented quality, scalability, and efficiency. (Demo / Docs)
- FiddleCube: Generate high-quality datasets for fine-tuning LLMs in minutes.
- pyq AI: Easy way for developers to train and deploy task-specific AI models in the cloud. Pyq does so by providing easy-to-use software that takes in your datasets and task as inputs, and outputs a custom AI model.
- DAGWorks' Hamilton: Open-source micro-orchestration framework for describing data flows. Companies use it for modeling data and feature engineering pipelines, prompt engineering, and LLM application workflows. (Demo)
- Query Vary: Test suite for LLMs. (Demo / Docs)
- Automorphic's Aegis: Self-hardening firewall for large language models.
- Flower: Open-source framework for training AI on distributed data using federated learning. Companies use Flower to easily improve their AI models on sensitive data they could not leverage before. (Demo / Docs)
- Parea: Improve and monitor the performance of your LLM apps through rigorous testing and version control.
- HegelAI's PromptTools: Open-source tools for evaluation and experimentation with prompts, models, and vector databases. (Demo / Docs)
- Traceloop: Deploy with confidence. Automatically evaluate and monitor changes to models, prompts, and LLM architectures.
- Query Vary: Test suite for LLMs. (Demo / Docs)
- Humanloop: Humanloop is like datadog for LLMs. They give you the tools you need to evaluate LLM apps and then take action to improve them.
- UpTrain: Open-source toolkit to prompt-test LLM applications by evaluating them on aspects like hallucinations, bias, tonality, correctness, etc. (Demo / Docs)
- Sematic: The open-source orchestrator loved by ML teams. It enables end-to-end pipelines to reduce model turnaround time by 80%. (Demo / Docs)
- DAGWorks' Hamilton: Open-source micro-orchestration framework for describing data flows. Companies use it for modeling data and feature engineering pipelines, prompt engineering, and LLM application workflows. (Demo)
- Arakoo's EdgeChains: Open Source SDK that models generative AI applications as config management. Built on top of Jsonnet as the orchestration grammar.
- AssemblyAI: AI models for speech recognition, automatic transcription, speech summarization, and more through our secure and scalable API. (Demo / Docs)
- Sweep AI: AI-powered junior dev that turns bug reports & feature requests into code changes. Developers report bugs like "the payment link on my landing page is broken" and Sweep writes a code to fix it. (Demo / Docs)
- Continue: the open-source autopilot for software development — a VS Code extension that brings the power of ChatGPT to your IDE. (Demo / Docs)
- Tempo Labs: AI design & prototyping tool which generates and edits react code directly in your codebase.
- Theneo: Next-Gen API Documentation with AI Brilliance. Generate Stripe-like API docs in just a few seconds. (Demo / Docs)
Are you a YC founder and your AI dev tool is missing? Let us know or contribute here.