Tinker example by benjibc · Pull Request #340 · eval-protocol/python-sdk

benjibc · 2025-11-20T07:03:22Z

Note

Introduce Tinker-based rollout/evaluation with an eval_protocol adapter-to-RL bridge and a GSM8K RL training example with metrics plotting.

Integrations (tinker-cookbook):
- EvalProtocolRLDataset, EvalProtocolEvaluator, and create_eval_protocol_dataset_builder to bridge eval_protocol adapters to Tinker RL datasets/evaluators.
- TinkerRolloutProcessor implementing rollouts via Tinker SamplingClient, renderer/tokenizer setup, and async batching.
Examples (GSM8K RL):
- examples/tinker_math_rl/train.py: end-to-end RL training using GSM8K adapter, row conversion to ProblemGroupBuilder, and periodic evaluation.
- examples/tinker_math_rl/test_gsm8k_eval.py: evaluation test using TinkerRolloutProcessor and GSM8K rows.
- examples/tinker_math_rl/plot_metrics.py: plot training metrics.
- examples/tinker_math_rl/README.md: setup and run instructions.

^{Written by Cursor Bugbot for commit 8a1b8f0. This will update automatically on new commits. Configure here.}

eval_protocol/integrations/tinker_cookbook.py

examples/tinker_math_rl/train.py

examples/tinker_math_rl/plot_metrics.py

examples/tinker_math_rl/debug_dataset.py

eval_protocol/integrations/tinker_cookbook.py

eval_protocol/integrations/tinker_rollout_processor.py

examples/tinker_math_rl/train.py

examples/tinker_math_rl/plot_metrics.py

examples/tinker_math_rl/train.py

examples/tinker_math_rl/test_gsm8k_eval.py

cursor · 2025-11-20T08:55:44Z

examples/tinker_math_rl/train.py

+from eval_protocol.integrations.tinker_rollout_processor import TinkerRolloutProcessor
+
+# Import test components
+from examples.tinker_math_rl.test_gsm8k_eval import test_gsm8k_tinker, get_gsm8k_input_rows


Bug: Broken import path in training script

The script attempts to import test_gsm8k_eval using the full path examples.tinker_math_rl, which fails with ModuleNotFoundError when running python train.py from the directory as documented. The examples package is not in sys.path in that context; use a relative or direct import instead.

cursor bot reviewed Nov 20, 2025

View reviewed changes

Tinker example

36f0095

benjibc force-pushed the tinker_example branch from 19e7a98 to 36f0095 Compare November 20, 2025 08:29

cursor bot reviewed Nov 20, 2025

View reviewed changes

examples/tinker_math_rl/train.py Show resolved Hide resolved

examples/tinker_math_rl/test_gsm8k_eval.py Outdated Show resolved Hide resolved

benjibc added 2 commits November 20, 2025 00:39

fixes

c69f1f4

typo

8a1b8f0

cursor bot reviewed Nov 20, 2025

View reviewed changes

benjibc merged commit 936f4a5 into main Nov 20, 2025
9 checks passed

benjibc deleted the tinker_example branch November 20, 2025 09:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tinker example#340

Tinker example#340
benjibc merged 3 commits intomainfrom
tinker_example

benjibc commented Nov 20, 2025 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot Nov 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

benjibc commented Nov 20, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot Nov 20, 2025

Choose a reason for hiding this comment

Bug: Broken import path in training script

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

benjibc commented Nov 20, 2025 •

edited by cursor bot

Loading