moving from qwen to llama 8b for benchmark run by abaheti95 · Pull Request #132 · databricks/compose-rl

abaheti95 · 2025-08-04T20:47:07Z

Uses a Llama 8B math dataset instead of Qwen open_r1. Adjusts the generation length and sequence length so that rewards and hill-climbing show up within 5 steps.
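For readers unfamiliar with how the two length settings interact, here is a minimal sketch. It assumes the usual convention that the sequence length covers prompt plus generated tokens. The class, field names, and numbers are hypothetical placeholders, not the compose-rl config keys or the values used in this PR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LengthBudget:
    """Hypothetical container; field names are not taken from compose-rl."""

    max_seq_len: int  # total tokens per sample (prompt + generation)
    max_gen_len: int  # tokens the policy may generate per rollout

    def max_prompt_len(self) -> int:
        """Largest prompt that still leaves room for a full-length generation."""
        if self.max_gen_len >= self.max_seq_len:
            raise ValueError("max_gen_len must be smaller than max_seq_len")
        return self.max_seq_len - self.max_gen_len


# Placeholder values for illustration only.
budget = LengthBudget(max_seq_len=2048, max_gen_len=1024)
prompt_room = budget.max_prompt_len()
```

Raising the generation length without also raising the sequence length shrinks the room left for the prompt, which is presumably why the two are changed together.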

gupta-abhay · 2025-08-04T20:56:17Z

Please wait for the MLflow PR to go through before merging.

abaheti95 added 2 commits August 4, 2025 20:42

moving from qwen to llama 8b for benchmark run

8f58700

matching irene's mlflow commit

08c2fdf

abaheti95 marked this pull request as ready for review August 4, 2025 20:50

abaheti95 requested review from bowenyang008 and gupta-abhay as code owners August 4, 2025 20:50

gupta-abhay approved these changes Aug 4, 2025

abaheti95 added 2 commits August 4, 2025 23:04

Merge branch 'single-controller-hackathon' into ashu/benchmark_test

8dc2fd2

removed comment

1c93f00

abaheti95 merged commit 680a115 into single-controller-hackathon Aug 4, 2025

abaheti95 deleted the ashu/benchmark_test branch August 4, 2025 23:19
