openai-triton

Here are 5 public repositories matching this topic...

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

nlp deep-learning llama gpt model-serving llm openai-triton

Updated Nov 18, 2024
Python

chengzeyi / stable-fast

Sponsor

Star

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

cuda torch pytorch inference-engines performance-optimizations stable-diffusion diffusers deeplearnng openai-triton stable-video-diffusion

Updated Jul 16, 2024
Python

BobMcDear / attorch

Star

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

machine-learning deep-learning cuda pytorch openai triton openai-triton

Updated Oct 25, 2024
Python

DeepAuto-AI / hip-attention

Star

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

triton attention attention-mechanism sub-quadratic-attention openai-triton hip-attention

Updated Nov 5, 2024
Python

neural-bits / ai-programming-hub

Star

Learn and experiment with new techniques and programming languages with a focus on ML

python rust cpp cython cuda openai-triton

Updated Sep 11, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the openai-triton topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the openai-triton topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

openai-triton

Here are 5 public repositories matching this topic...

ModelTC / lightllm

chengzeyi / stable-fast

BobMcDear / attorch

DeepAuto-AI / hip-attention

neural-bits / ai-programming-hub

Improve this page

Add this topic to your repo