Skip to content
View hjl's full-sized avatar
  • Palo Alto, California

Highlights

  • Pro

Block or report hjl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,830 471 Updated Mar 8, 2025

Open source drivers for the Kinect for Windows v2 device

C++ 2,113 761 Updated Apr 9, 2024

A hub for various industry-specific schemas to be used with VLMs.

Python 473 19 Updated Mar 4, 2025

A text extraction library supporting PDFs, images, office documents and more

Python 1,555 50 Updated Mar 7, 2025

🪄 Create rich visualizations with AI

TypeScript 8,569 668 Updated Mar 8, 2025

🎧☁️ Your Personal Streaming Service

Go 13,448 987 Updated Mar 8, 2025

A quick guide (especially) for trending instruction finetuning datasets

2,918 191 Updated Nov 28, 2023

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 221 23 Updated Jan 31, 2025

Tile primitives for speedy kernels

Cuda 2,127 123 Updated Mar 7, 2025

s1: Simple test-time scaling

Python 5,882 673 Updated Mar 6, 2025

Fully open reproduction of DeepSeek-R1

Python 22,353 2,003 Updated Mar 8, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,615 2,177 Updated Feb 1, 2025

Datasets in the IR-Group

R 8 Updated Jun 15, 2021

Midi router with Lua scripting and a node based interface

Vue 20 1 Updated Feb 14, 2025

The easiest way to get started with LlamaIndex

TypeScript 1,212 161 Updated Mar 5, 2025

Microsoft Automatic Mixed Precision Library

Python 576 48 Updated Sep 29, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 221 19 Updated Aug 1, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,248 376 Updated Mar 8, 2025

Self-hosted AI coding assistant

Rust 30,337 1,396 Updated Mar 8, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,576 221 Updated Mar 7, 2025

Step-by-step optimization of CUDA SGEMM

Cuda 292 44 Updated Mar 30, 2022

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 326 49 Updated Jan 2, 2025

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

108 3 Updated Jan 13, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 18,079 2,501 Updated Mar 7, 2025

[CC BY-NC-SA] A compendium of the community knowledge on game design and development

Lua 394 16 Updated Feb 26, 2025
Python 48 7 Updated Mar 27, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,150 649 Updated Mar 6, 2025

Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase

Python 7,209 570 Updated Mar 7, 2025

A simple, intuitive toolkit for quickly implementing LLM powered applications.

Python 213 32 Updated Jan 5, 2025
Next
Showing results