Skip to content
View ttthree's full-sized avatar

Organizations

@microsoft

Block or report ttthree

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,136 603 Updated Mar 4, 2025

Expert Parallelism Load Balancer

Python 999 140 Updated Feb 27, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,092 756 Updated Mar 1, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,447 224 Updated Mar 4, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,727 445 Updated Mar 4, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 32,835 3,204 Updated Mar 4, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript 39,876 3,825 Updated Mar 4, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 10,029 942 Updated Feb 24, 2025

structured outputs for llms

Python 9,636 743 Updated Mar 4, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,837 1,498 Updated Mar 4, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,792 6,060 Updated Mar 4, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,381 150 Updated Feb 20, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 8,364 859 Updated Mar 4, 2025

Collections of vector search related libraries, service and research papers

1,461 100 Updated Aug 6, 2024

A fast, powerful, safe and lightweight scripting language and engine for .NET

C# 3,344 362 Updated Dec 18, 2024

A natural language interface for computers

Python 58,537 4,996 Updated Jan 24, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,388 3,369 Updated Mar 25, 2024

A tool that transforms OpenAI API requests into Azure OpenAI API requests, allowing OpenAI-compatible applications to seamlessly use Azure OpenAI. 一个 OpenAI API 的代理工具,能将 OpenAI API 请求转为 Azure OpenA…

TypeScript 139 28 Updated Oct 5, 2024

React component to create graphic user interface with: - draggable nodes with ports and edges on a directed graph editor. - extensibility to customize the widgets or behaviors. - accessbility and t…

TypeScript 195 12 Updated Jun 18, 2024
Showing results