View ttthree's full-sized avatar

Organizations

@microsoft

Showing results

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 3,222 298 Updated Mar 7, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,621 666 Updated Mar 7, 2025

Expert Parallelism Load Balancer

Python 1,027 150 Updated Feb 27, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,183 779 Updated Mar 1, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,522 243 Updated Mar 5, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,815 469 Updated Mar 5, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 33,391 3,286 Updated Mar 7, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript 40,321 3,870 Updated Mar 4, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 10,045 944 Updated Feb 24, 2025

Structured outputs for LLMs

Python 9,699 747 Updated Mar 6, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,866 1,504 Updated Mar 6, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,849 6,082 Updated Mar 7, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,382 150 Updated Mar 7, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 8,391 862 Updated Mar 4, 2025

A collection of vector-search-related libraries, services, and research papers

1,463 99 Updated Aug 6, 2024

A fast, powerful, safe and lightweight scripting language and engine for .NET

C# 3,352 366 Updated Mar 6, 2025

A natural language interface for computers

Python 58,592 5,004 Updated Jan 24, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,393 3,369 Updated Mar 25, 2024

A tool that transforms OpenAI API requests into Azure OpenAI API requests, allowing OpenAI-compatible applications to seamlessly use Azure OpenAI. A proxy tool for the OpenAI API that can convert OpenAI API requests into Azure OpenA…

TypeScript 139 28 Updated Oct 5, 2024

React component for creating a graphical user interface with: - draggable nodes with ports and edges in a directed graph editor. - extensibility to customize the widgets or behaviors. - accessibility and t…

TypeScript 195 12 Updated Jun 18, 2024