ml
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Toolkit for linearizing PDFs for LLM datasets/training
Open-Sora: Democratizing Efficient Video Production for All
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, dif…
A TTS model capable of generating ultra-realistic dialogue in one pass.
CleverBee - The Open Source Deep Researcher Tool
Fast and local neural text-to-speech engine
A merged version of multiple open-source German speech datasets.
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
Set of tools to assess and improve LLM security.
A high-throughput and memory-efficient inference and serving engine for LLMs
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
my personal receipts collected all over the world
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Have a natural, spoken conversation with AI!
Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
A self-hosted RSS reader and personal knowledge management tool.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services

