
Starred repositories
Fully open reproduction of DeepSeek-R1
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Zero Bubble Pipeline Parallelism
Official Repo for Open-Reasoner-Zero
Sky-T1: Train your own O1 preview model within $450
x86 PC emulator and x86-to-wasm JIT, running in the browser
SGLang is a fast serving framework for large language models and vision language models.
verl: Volcano Engine Reinforcement Learning for LLMs
♞ lichess.org: the forever free, adless and open source chess server ♞
Must-read papers on KV Cache Compression (constantly updating).
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Fast OS-level support for GPU checkpoint and restore
Triton-based implementation of Sparse Mixture of Experts.
Trio β a friendly Python library for async concurrency and I/O
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
prime is a framework for efficient, globally distributed training of AI models over the internet.
nsync is a C library that exports various synchronization primitives, such as mutexes
Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Lightning fast C++/CUDA neural network framework
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
A PyTorch extension: tools for easy mixed precision and distributed training in PyTorch
Efficient Triton Kernels for LLM Training
A fast communication-overlapping library for tensor/expert parallelism on GPUs.