Stars
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
A high-throughput and memory-efficient inference and serving engine for LLMs
An Open Source Machine Learning Framework for Everyone