Skip to content

moving from qwen to llama 8b for benchmark run#132

Merged
abaheti95 merged 4 commits intosingle-controller-hackathonfrom
ashu/benchmark_test
Aug 4, 2025
Merged

moving from qwen to llama 8b for benchmark run#132
abaheti95 merged 4 commits intosingle-controller-hackathonfrom
ashu/benchmark_test

Conversation

@abaheti95
Copy link
Collaborator

Uses a llama 8b math dataset instead of qwen open_r1. Modifies the generation length and sequence length to see rewards and hillclimbing in 5 steps.

mlflow green curve

@abaheti95 abaheti95 marked this pull request as ready for review August 4, 2025 20:50
@gupta-abhay
Copy link
Collaborator

please wait on the mlflow PR to go through before merging.

@abaheti95 abaheti95 merged commit 680a115 into single-controller-hackathon Aug 4, 2025
@abaheti95 abaheti95 deleted the ashu/benchmark_test branch August 4, 2025 23:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants