A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,248 376 Updated Mar 8, 2025

TabbyML / tabby

Self-hosted AI coding assistant

Rust 30,337 1,396 Updated Mar 8, 2025

NVIDIA / nv-ingest

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,576 221 Updated Mar 7, 2025

wangzyon / NVIDIA_SGEMM_PRACTICE

Step-by-step optimization of CUDA SGEMM

Cuda 292 44 Updated Mar 30, 2022

yzhaiustc / Optimizing-SGEMM-on-NVIDIA-Turing-GPUs

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 326 49 Updated Jan 2, 2025

dvlab-research / MagicMirror

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

108 3 Updated Jan 13, 2025

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 18,079 2,501 Updated Mar 7, 2025

2DGD-F0TH / 2DGD_F0TH

[CC BY-NC-SA] A compendium of the community knowledge on game design and development

Lua 394 16 Updated Feb 26, 2025

uranusjr / simpleindex

Python 48 7 Updated Mar 27, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,150 649 Updated Mar 6, 2025

cyclotruc / gitingest

Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase

Python 7,209 570 Updated Mar 7, 2025

emirsahin1 / llm-axe

A simple, intuitive toolkit for quickly implementing LLM powered applications.

Python 213 32 Updated Jan 5, 2025

Ho John Lee hjl

Stars

deepseek-ai / DeepGEMM

OpenKinect / libfreenect2

vlm-run / vlmrun-hub

Goldziher / kreuzberg

microsoft / data-formulator

navidrome / navidrome

Zjh-819 / LLMDataHub

HazyResearch / lolcats

HazyResearch / ThunderKittens

simplescaling / s1

huggingface / open-r1

deepseek-ai / Janus

irgroup / datasets

tiagolr / mididash

deepseek-ai / DeepSeek-R1

run-llama / create-llama

Azure / MS-AMP

pytorch-labs / float8_experimental

NVIDIA / TransformerEngine