ttthree

Organizations

@microsoft

Showing results

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 3,384 312 Updated Mar 12, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,883 709 Updated Mar 12, 2025

Expert Parallelism Load Balancer

Python 1,056 154 Updated Feb 27, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,271 790 Updated Mar 1, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,582 257 Updated Mar 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,923 488 Updated Mar 11, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 34,213 3,405 Updated Mar 12, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript 40,819 3,917 Updated Mar 12, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 10,074 953 Updated Mar 11, 2025

Structured outputs for LLMs

Python 9,758 752 Updated Mar 12, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,922 1,513 Updated Mar 11, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,938 6,092 Updated Mar 12, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,391 150 Updated Mar 7, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 8,441 864 Updated Mar 10, 2025

A collection of vector-search-related libraries, services, and research papers

1,465 99 Updated Aug 6, 2024

A fast, powerful, safe and lightweight scripting language and engine for .NET

C# 3,360 366 Updated Mar 6, 2025

A natural language interface for computers

Python 58,718 5,004 Updated Jan 24, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,404 3,370 Updated Mar 25, 2024

A tool that transforms OpenAI API requests into Azure OpenAI API requests, allowing OpenAI-compatible applications to seamlessly use Azure OpenAI. An OpenAI API proxy tool that can convert OpenAI API requests into Azure OpenA…

TypeScript 139 28 Updated Oct 5, 2024

React component to create graphical user interfaces with: - draggable nodes with ports and edges on a directed graph editor. - extensibility to customize the widgets or behaviors. - accessibility and t…

TypeScript 195 12 Updated Mar 11, 2025