Summarize YT videos in one go
Updated Jun 9, 2024 - Python
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
A high-performance inference system for large language models, designed for production environments.
LLM bootstrap loader for local CPU/GPU inference with fully customizable chat.
The open-source serverless GPU container runtime.
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
Efficient and general syntactical decoding for Large Language Models
Library to supercharge your use of large language models
One .NET library to consume OpenAI, Anthropic, Cohere, Azure, and self-hosed APIs.
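An open-source agent project. Supports 6 chat platforms, Onebotv11 one-to-many connections, streaming messages, agents, and conversation keyboard bubble generation. Supports 6 large-model APIs (with more being added) and can convert multiple large-model APIs into a common format that carries context.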
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Empower Your Productivity with Local AI Assistants
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A ChatBot written in C# using OpenAI's API
FlashInfer: Kernel Library for LLM Serving