Pinned

  1. understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1k 48

  2. zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python 400 26

  3. lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python 640 38

  4. EditAnything Public

    Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

    Python 3.4k 198

  5. oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 385 29

  6. stde Public

    Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024

    Python 109 8

