Stars
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
Disaggregated serving system for Large Language Models (LLMs).
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
前端后端同时开源。 Ai-to-pptx是一个使用AI技术(DeepSeek)制作PPTX的助手,支持在线生成和导出PPTX。 主要功能: 1 使用DeepSeek等大语言模型来生成大纲 2 生成PPTX的时候可以选择不同的模板 3 支持导出PPTX
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Stable Diffusion and Flux in pure C/C++
Community maintained hardware plugin for vLLM on Ascend
Cost-efficient and pluggable Infrastructure components for GenAI inference
DeepEP: an efficient expert-parallel communication library
WeSQL is an innovative MySQL distribution that adopts a compute-storage separation architecture, with storage backed by S3 (and S3-compatible systems). It can run on any cloud, ensuring no vendor l…
Thor(雷神托尔) 是一款强大的人工智能模型管理工具,其主要目的是为了实现多种AI模型的统一管理和使用。通过Thor(雷神托尔),用户可以轻松地管理和使用众多AI模型,而且Thor(雷神托尔)兼容OpenAI的接口格式,使得使用更加方便。
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
A text extraction library supporting PDFs, images, office documents and more
AI-first Search & Answer Engine for work. Open-source alternative to Glean.
Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.
JupyterLab extension to create GitHub commits & pull requests
Tools for diffing and merging of Jupyter notebooks.
RooVetGit / Roo-Code
Forked from cline/clineRoo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.
A docker image based in ubuntu to run docker containers inside docker containers
PromptSite is a lightweight prompt version management package that helps you version control, track, experiment and debug with your LLM prompts with ease.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
A repo for Packs written and maintained by Nomad community members