Pinned

  1. understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1.1k 50

  2. zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python 414 26

  3. lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python 645 40

  4. EditAnything Public

    Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

    Python 3.4k 199

  5. oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 427 32

  6. stde Public

    Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024

    Python 115 7

Repositories

Showing 10 of 90 repositories
  • oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 427 Apache-2.0 32 1 2 Updated Jul 28, 2025
  • SkyLadder Public Forked from jzhang38/TinyLlama

    The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

    Python 33 Apache-2.0 567 0 0 Updated Jul 27, 2025
  • understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1,058 MIT 50 7 0 Updated Jul 24, 2025
  • jrystal Public

    A JAX-based Differentiable Density Functional Theory Framework for Materials

    Python 30 Apache-2.0 2 7 3 Updated Jul 24, 2025
  • AnytimeReasoner Public

    Optimizing Anytime Reasoning via Budget Relative Policy Optimization

    Python 43 Apache-2.0 2 0 0 Updated Jul 15, 2025
  • LongSpec Public

    LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

    Python 61 MIT 3 0 0 Updated Jul 14, 2025
  • Attention-Sink Public

    [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)

    Python 107 MIT 4 1 0 Updated Jul 8, 2025
  • Python 8 0 1 0 Updated Jul 5, 2025
  • VeriFree Public

    Reinforcing General Reasoning without Verifiers

    Python 78 6 6 0 Updated Jun 24, 2025
  • d4ft Public

    A JAX library for Density Functional Theory.

    Python 54 Apache-2.0 5 16 0 Updated Jun 18, 2025