Skip to content

Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers

License

Notifications You must be signed in to change notification settings

sidhq/YC-alum-ai-tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Y Combinator Alum – AI Dev Tool

Header Image

Disclaimer: This repository is maintained by the founders of SID Tech Inc. and other volunteers of the Y Combinator Community. This repository and SID Tech Inc. are not affiliated with, sponsored or endorsed by Y Combinator.

This is a curated collection of AI developer tools built by YC companies.
We're aiming to serve as a reliable starting point for LLM/ML developers.

Overview

Analytics & Monitoring

  • Humanloop: Humanloop is like datadog for LLMs. They give you the tools you need to evaluate LLM apps and then take action to improve them.
  • Helicone: The easiest way to capture data from your LLMs (Open Source).
  • Langfuse: Open-source analytics for LLM applications. (Demo / Docs)
  • UpTrain: Open-source toolkit to evaluate and monitor LLM applications on aspects like hallucinations, bias, tonality, correctness, etc. (Demo / Docs)
  • Structured: LLM tool transforming complex system log data into easily understandable insights. (Demo)
  • Traceloop: Deploy with confidence. Automatically evaluate and monitor changes to models, prompts, and LLM architectures.
  • BerriAI: A simple & light package to call OpenAI, Azure, Cohere, and Anthropic API Endpoints.
  • Parea: Improve and monitor the performance of your LLM apps through rigorous testing and version control.
  • Axilla: Open-source AI framework for TypeScript that covers the whole lifecycle: document ingestion & retrieval, continuous evaluation, serving, and monitoring. (Docs)
  • DAGWorks: The observability and monitoring solution for Hamilton. Get lineage, a catalog, and observability on top of Hamilton with a one-line code change.
  • HegelAI's PromptTools: Open-source tools for evaluation and experimentation with prompts, models, and vector databases. (Demo / Docs)

Vector DB & Embeddings

  • Supabase Vector: Open-source Vector Toolkit for Postgres. Use the Supabase client libraries to store, index, and query your vector embeddings at scale. (Demo / Docs)
  • LanceDB: Open-source, developer-friendly vector database for multi-modal AI. Reduce unstructured storage costs by 80% and get 1000x faster performance than parquet for AI. (Demo / Docs)
  • SID.ai: Fully-hosted retrieval pipeline that makes it easy to connect services like Google Mail, Notion, GDrive, or fully custom data. In one afternoon, you can connect to any data source you'd like and instantly scale to millions of users. (Demo / Docs).

Data Integrations & Retrieval

  • SID.ai: Connect customers' data from GSuite, Notion, Mail, etc. to your LLM app in one afternoon. Simply add a "Connect" button, then call our API to retrieve context. SID takes care of the embeddings, sync, and hosting. (Demo / Docs)
  • Automorphic's Trex': Intelligently transform unstructured data to structured JSON, SQL, or other context-free grammar output. (Demo / Docs)
  • Axilla: Open-source AI framework for TypeScript that covers the whole lifecycle: document ingestion & retrieval, continuous evaluation, serving, and monitoring. (Docs)
  • Outerbase: The interface for your database. EZQL is our open-source natural language to SQL agent that allows anyone to ask their data questions. (Demo / Docs)

Infrastructure

  • Anarchy: LLM infrastructure for developers. Use Anarchy to run open-source models efficiently and augmented with capabilities.
  • SID.ai: Fully-hosted retrieval pipeline that makes it easy to connect services like Google Mail, Notion, GDrive, or fully custom data. In one afternoon, you can connect to any data source you'd like and instantly scale to millions of users. (Demo / Docs).
  • Ivy: Accelerate Your AI With One Line of Code. (Demo / Docs)
  • Pump: The fastest way to save 60% on AWS for free. Pump uses AI & group buying to automate cost-saving with no engineering effort.
  • Cedana: Intelligently migrate AI workloads across instances to improve resource utilization, enable job-level SLAs and increase reliability for cost-effective, scalable training and inference. (Demo / Docs)

LLM Serving & Fine-Tuning

  • OpenAI: Needs no introduction. (Demo / Docs)
  • BerriAI: A simple & light package to call OpenAI, Azure, Cohere, Anthropic API Endpoints.
  • Anarchy: LLM infrastructure for developers. Use Anarchy to run open-source models efficiently and augmented with capabilities.
  • Ivy: Accelerate Your AI With One Line of Code. (Demo / Docs)
  • Cedana: Intelligently migrate AI workloads across instances to improve resource utilization, enable job-level SLAs and increase reliability for cost-effective, scalable training and inference. (Demo / Docs)
  • pyq AI: Easy way for developers to train and deploy task-specific AI models in the cloud. Pyq does so by providing easy-to-use software that takes in your datasets and task as inputs, and outputs a custom AI model.
  • Flower: Open-source framework for training AI on distributed data using federated learning. Companies use Flower to easily improve their AI models on sensitive data they could not leverage before. (Demo / Docs)
  • FiddleCube: Generate high-quality datasets for fine-tuning LLMs in minutes.

Dataset Generation & Handling

  • Scale: Scale has pioneered in the data labeling industry by combining AI-based techniques with human-in-the-loop, delivering labeled data at unprecedented quality, scalability, and efficiency. (Demo / Docs)
  • FiddleCube: Generate high-quality datasets for fine-tuning LLMs in minutes.
  • pyq AI: Easy way for developers to train and deploy task-specific AI models in the cloud. Pyq does so by providing easy-to-use software that takes in your datasets and task as inputs, and outputs a custom AI model.
  • DAGWorks' Hamilton: Open-source micro-orchestration framework for describing data flows. Companies use it for modeling data and feature engineering pipelines, prompt engineering, and LLM application workflows. (Demo)
  • Query Vary: Test suite for LLMs. (Demo / Docs)

Security

  • Automorphic's Aegis: Self-hardening firewall for large language models.
  • Flower: Open-source framework for training AI on distributed data using federated learning. Companies use Flower to easily improve their AI models on sensitive data they could not leverage before. (Demo / Docs)

Prompt Management & Testing

  • Parea: Improve and monitor the performance of your LLM apps through rigorous testing and version control.
  • HegelAI's PromptTools: Open-source tools for evaluation and experimentation with prompts, models, and vector databases. (Demo / Docs)
  • Traceloop: Deploy with confidence. Automatically evaluate and monitor changes to models, prompts, and LLM architectures.
  • Query Vary: Test suite for LLMs. (Demo / Docs)
  • Humanloop: Humanloop is like datadog for LLMs. They give you the tools you need to evaluate LLM apps and then take action to improve them.
  • UpTrain: Open-source toolkit to prompt-test LLM applications by evaluating them on aspects like hallucinations, bias, tonality, correctness, etc. (Demo / Docs)

Orchestration

  • Sematic: The open-source orchestrator loved by ML teams. It enables end-to-end pipelines to reduce model turnaround time by 80%. (Demo / Docs)
  • DAGWorks' Hamilton: Open-source micro-orchestration framework for describing data flows. Companies use it for modeling data and feature engineering pipelines, prompt engineering, and LLM application workflows. (Demo)
  • Arakoo's EdgeChains: Open Source SDK that models generative AI applications as config management. Built on top of Jsonnet as the orchestration grammar.

Audio

  • AssemblyAI: AI models for speech recognition, automatic transcription, speech summarization, and more through our secure and scalable API. (Demo / Docs)

Making Development Easier

  • Sweep AI: AI-powered junior dev that turns bug reports & feature requests into code changes. Developers report bugs like "the payment link on my landing page is broken" and Sweep writes a code to fix it. (Demo / Docs)
  • Continue: the open-source autopilot for software development — a VS Code extension that brings the power of ChatGPT to your IDE. (Demo / Docs)
  • Tempo Labs: AI design & prototyping tool which generates and edits react code directly in your codebase.
  • Theneo: Next-Gen API Documentation with AI Brilliance. Generate Stripe-like API docs in just a few seconds. (Demo / Docs)

Are you a YC founder and your AI dev tool is missing? Let us know or contribute here.

About

Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers

Topics

Resources

License

Stars

Watchers

Forks