Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python, 567 stars, 27 forks, updated Mar 26, 2025

Official Implementation for Diffusion Models Without Classifier-free Guidance

Python, 102 stars, 6 forks, updated Feb 18, 2025

Python, 27 stars, updated Mar 8, 2025

Latest Weight Averaging (NeurIPS HITY 2022)

Python, 29 stars, 2 forks, updated Jun 20, 2023

Train VAE like a boss

Jupyter Notebook, 270 stars, 12 forks, updated Oct 21, 2024

Implementation of Autoregressive Diffusion in Pytorch

Python, 365 stars, 10 forks, updated Nov 3, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python, 790 stars, 39 forks, updated Mar 14, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python, 2,180 stars, 142 forks, updated Mar 28, 2025

A PyTorch native library for large model training

Python, 3,506 stars, 324 forks, updated Mar 28, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python, 22,222 stars, 3,221 forks, updated Mar 5, 2025

Fast Diffusion Models with Transformers

Python, 811 stars, 109 forks, updated Oct 25, 2024