Add code-rl recipe with DeepCoder #83

Xiuyu-Li · 2025-11-10T21:43:51Z

This PR adds an example recipe for reinforcement learning (RL) to solve competitive programming problems using Tinker and the DeepCoder dataset. The recipe is located at tinker_cookbook/recipes/code_rl. The environment uses sandboxing via Sandbox Fusion for security, without introducing any additional dependencies when running in Docker.

TieMoulton · 2025-11-10T21:56:32Z

this is pretty neat, i agree

joschu · 2025-11-17T07:50:01Z

Cool!

joschu · 2025-11-17T07:51:18Z

Thanks for adding this! We'll review shortly!

Tiiiger · 2025-11-18T18:16:24Z

Great! I ran the experiment in the README and seems to be working as expected. Merging now

Tiiiger

LGTM and tested

Add code-rl recipe with DeepCoder

2ba6b20

Fix pre-commit and pyright checks

64a3762

Tiiiger self-requested a review November 12, 2025 23:10

Tiiiger approved these changes Nov 18, 2025

View reviewed changes

Tiiiger merged commit 320e1c0 into thinking-machines-lab:main Nov 18, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add code-rl recipe with DeepCoder #83

Add code-rl recipe with DeepCoder #83

Uh oh!

Xiuyu-Li commented Nov 10, 2025

Uh oh!

TieMoulton commented Nov 10, 2025

Uh oh!

joschu commented Nov 17, 2025

Uh oh!

joschu commented Nov 17, 2025

Uh oh!

Tiiiger commented Nov 18, 2025

Uh oh!

Tiiiger left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add code-rl recipe with DeepCoder #83

Add code-rl recipe with DeepCoder #83

Uh oh!

Conversation

Xiuyu-Li commented Nov 10, 2025

Uh oh!

TieMoulton commented Nov 10, 2025

Uh oh!

joschu commented Nov 17, 2025

Uh oh!

joschu commented Nov 17, 2025

Uh oh!

Tiiiger commented Nov 18, 2025

Uh oh!

Tiiiger left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants