
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

Python 1,347 300 Updated Mar 26, 2025

Sampling profiler for Python programs

Rust 13,443 450 Updated Feb 6, 2025

Machine Learning Engineering Open Book

Python 13,263 805 Updated Mar 29, 2025

The Art of Debugging

C 863 39 Updated Aug 3, 2024

Jupyter Notebook 971 155 Updated Mar 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 43,042 6,538 Updated Mar 29, 2025

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

C++ 314 55 Updated Mar 29, 2025

Making large AI models cheaper, faster and more accessible

Python 40,692 4,488 Updated Mar 28, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,289 569 Updated Oct 28, 2024

Awesome resources for GPUs

554 53 Updated Jul 1, 2023

Solve puzzles. Improve your PyTorch.

Jupyter Notebook 3,495 317 Updated Jul 15, 2024

Pyjion - A JIT for Python based upon CoreCLR

C++ 1,429 61 Updated Dec 25, 2024

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,552 529 Updated Mar 26, 2025

Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.

C++ 52 16 Updated Nov 20, 2023

An optimizing compiler for decision tree ensemble inference.

C++ 17 5 Updated Mar 22, 2025

Reinforcement learning environments for compiler and program optimization tasks

Python 935 128 Updated Oct 9, 2024

A speculative mechanism to accelerate long-latency off-chip load requests by removing on-chip cache access latency from their critical path, as described by MICRO 2022 paper by Bera et al. (https:/…

C++ 72 12 Updated Sep 8, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

C++ 26 7 Updated Apr 20, 2023

Ceras is yet another tiny deep learning engine, in pure C++ and header-only.

C++ 122 11 Updated Sep 11, 2024

Compile Time Regular Expression in C++

C++ 3,501 191 Updated Feb 25, 2025

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Rust 3,958 163 Updated Mar 24, 2025

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 43,457 5,466 Updated Jan 16, 2025

Papers on Graph Analytics, Mining, and Learning

124 19 Updated Aug 15, 2022

Fluid simulation engine for computer graphics applications

C++ 1,961 273 Updated Dec 24, 2023

Study Group of Deep Learning Compiler

157 16 Updated Jan 15, 2023

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,621 376 Updated Dec 4, 2024

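⚓ A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. Covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.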

9,428 1,723 Updated Oct 16, 2021

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,526 308 Updated Oct 19, 2024

A collection of compiler learning resources.

Python 2,331 344 Updated Mar 19, 2025