jankinf

🤪

Zhengwei Fang jankinf

🤪

13 followers · 17 following

Beijing, China

Achievements

Organizations

Stars

benfred / py-spy

Sampling profiler for Python programs

Rust 13,356 449 Updated Feb 6, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,072 134 Updated Mar 3, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,539 182 Updated Mar 10, 2025

dhcode-cpp / X-R1

minimal-cost for training 0.5B R1-Zero

Python 624 81 Updated Mar 10, 2025

vsubramaniam851 / multiagent-ft

Python 179 20 Updated Feb 24, 2025

tinyzqh / light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Python 601 90 Updated Feb 26, 2025

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,467 315 Updated Jul 18, 2024

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,612 347 Updated Mar 10, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,722 357 Updated Mar 9, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,466 1,658 Updated Feb 26, 2025

thu-ml / STAIR

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 24 1 Updated Feb 26, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,655 1,139 Updated Mar 7, 2025

NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment

Python 737 90 Updated Mar 10, 2025

actions / starter-workflows

Accelerating new GitHub Actions workflows

TypeScript 9,865 5,767 Updated Mar 5, 2025

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,997 233 Updated Mar 10, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,561 73 Updated Mar 5, 2025

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,296 168 Updated Jul 25, 2023

GAIR-NLP / LIMR

Python 150 5 Updated Feb 20, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,499 2,020 Updated Mar 10, 2025

wangshusen / DRL

Deep Reinforcement Learning

3,652 615 Updated Dec 10, 2022

simplescaling / s1

s1: Simple test-time scaling

Python 5,918 684 Updated Mar 6, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,073 1,411 Updated Mar 10, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,557 430 Updated Mar 10, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,649 2,183 Updated Feb 1, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,605 71 Updated Aug 15, 2024

Near32 / ReferentialGym

This framework provides out-of-the-box implementations of Referential Games variants in order to study the emergence of artificial languages using deep learning, relying on PyTorch (https://www.pyt…

Python 22 3 Updated Feb 24, 2025

facebookresearch / EGG

EGG: Emergence of lanGuage in Games

Jupyter Notebook 297 106 Updated Apr 4, 2024

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 149,171 27,859 Updated Mar 4, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 70,430 7,601 Updated Mar 10, 2025

google-deepmind / pysc2

StarCraft II Learning Environment

Python 8,089 1,158 Updated Jul 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhengwei Fang jankinf

Achievements

Achievements

Organizations

Block or report jankinf

Stars

benfred / py-spy

Unakar / Logic-RL

argilla-io / distilabel

dhcode-cpp / X-R1

vsubramaniam851 / multiagent-ft

tinyzqh / light_mappo

marlbenchmark / on-policy

X-PLUG / MobileAgent

PKU-Alignment / align-anything

deepseek-ai / DeepSeek-VL2

thu-ml / STAIR

NVIDIA / TensorRT-LLM

NVIDIA / NeMo-Aligner

actions / starter-workflows

ModelTC / lightllm

Open-Reasoner-Zero / Open-Reasoner-Zero

openai / lm-human-preferences

GAIR-NLP / LIMR

huggingface / open-r1

wangshusen / DRL

simplescaling / s1

Jiayi-Pan / TinyZero

volcengine / verl

deepseek-ai / Janus

FoundationVision / LlamaGen

Near32 / ReferentialGym

facebookresearch / EGG

AUTOMATIC1111 / stable-diffusion-webui

comfyanonymous / ComfyUI

google-deepmind / pysc2