Skip to content

Popular repositories Loading

  1. OpenPipe OpenPipe Public

    Turn expensive prompts into cheap fine-tuned models

    TypeScript 2.6k 146

  2. ART ART Public

    Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!

    Python 977 61

  3. deductive-reasoning deductive-reasoning Public

    Train your own SOTA deductive reasoning model

    Python 96 6

  4. pii-redaction pii-redaction Public

    Detect and redact PII locally with SOTA performance

    Python 54 9

  5. rl-experiments rl-experiments Public

    OpenPipe Reinforcement Learning Experiments

    Jupyter Notebook 25 4

  6. Summary-RL Summary-RL Public

    Train an agent to generate high quality summaries

    Jupyter Notebook 17 2

Repositories

Showing 10 of 25 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…