- FathomAI
- Vancouver, BC, Canada
- UTC -08:00
- bedirtapkan.com
- in/bedirtapkan
Highlights
- Pro
LARA
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Knowledge Agents and Management in the Cloud
LlamaIndex is the leading document agent and OCR platform
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
The programming language for agentic software. Build, run, and manage multi-agent systems at scale.
Fast and accurate automatic speech recognition (ASR) for edge devices
A natural language interface for computers
OCR, layout analysis, reading order, table recognition in 90+ languages
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
Robust Speech Recognition via Large-Scale Weak Supervision
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
✨ The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes ✨
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra





