fatfatcd

Starred repositories

NVIDIA Real-time Denoising (NRD) library

HLSL 549 49 Updated Mar 7, 2025

Command and Conquer: Red Alert

C++ 5,631 1,048 Updated Feb 27, 2025

HLSL 387 62 Updated Mar 4, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,801 694 Updated Mar 11, 2025

(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Python 157 9 Updated May 9, 2024

Expert Parallelism Load Balancer

Python 1,050 152 Updated Feb 27, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,561 252 Updated Mar 10, 2025

The Modern Vulkan Cookbook published by Packt

C++ 148 16 Updated Mar 2, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,882 479 Updated Mar 10, 2025

Rendering glTF scenes with ray tracer and raster (Vulkan)

C++ 192 14 Updated Feb 3, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,112 615 Updated Mar 11, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,239 785 Updated Mar 1, 2025

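This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and putting large-model applications into production).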

HTML 15,128 1,743 Updated Mar 2, 2025

AMD ROCm™ Software - GitHub Home

Shell 5,052 410 Updated Mar 10, 2025

Lightweight connection pooler for PostgreSQL

C 3,180 481 Updated Mar 9, 2025

Highly available elephant herd: HA PostgreSQL cluster using Docker

Python 1,631 415 Updated Mar 7, 2025

Path tracing renderer and utilities for three.js built on top of three-mesh-bvh.

JavaScript 1,443 135 Updated Aug 31, 2024

Real-time PathTracing with global illumination and progressive rendering, all on top of the Three.js WebGL framework. Click here for Live Demo: https://erichlof.github.io/THREE.js-PathTracing-Rende…

GLSL 2,003 183 Updated Mar 10, 2025

A Tiny WebGL helper Library

JavaScript 2,772 261 Updated Nov 18, 2024

Official mirror of Blender

C++ 14,427 2,165 Updated Mar 11, 2025

Babylon.js is a powerful, beautiful, simple, and open game and rendering engine packed into a friendly JavaScript framework.

TypeScript 23,720 3,489 Updated Mar 10, 2025

Code for the "Graphics Gems" book series

C 1,442 270 Updated Dec 21, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,453 1,047 Updated Mar 10, 2025

Python 1 Updated Feb 26, 2025

A template for PostgreSQL High Availability with Etcd, Consul, ZooKeeper, or Kubernetes

Python 7,139 889 Updated Mar 2, 2025

Ray tracing examples and tutorials using VK_KHR_ray_tracing

C++ 1,468 153 Updated Sep 17, 2024

Fully open reproduction of DeepSeek-R1

Python 22,519 2,023 Updated Mar 10, 2025

Machine Learning Engineering Open Book

Python 13,118 799 Updated Mar 9, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,071 128 Updated Mar 10, 2025

PlayStation 4 emulator for Windows, Linux and macOS written in C++

C++ 19,080 1,170 Updated Mar 11, 2025