Skip to content
View simonucl's full-sized avatar

Highlights

  • Pro

Block or report simonucl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ChenmienTan/RL2 Public

    Python 195 17

  2. spiral-rl/spiral Public

    SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

    Python 99 9

  3. LeonGuertler/TextArena Public

    A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

    Python 209 44

  4. Cohere-Labs-Community/iterative-data-selection Public

    Python 28 5

  5. hanxuhu/SeqIns Public

    The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAVIS

    Jupyter Notebook 29 2

  6. HJCL Public

    Python 15 3