Skip to content

[Training] Understand new llama3.1 405B RCPs which changed after v5.0 #429

@ShriyaRishab

Description

@ShriyaRishab

In v5.0, llama3.1 405b benchmark was added and RCPs were generated for the same. For v5.1, the llama3.1 405B eval schedule changed by doing more frequent evaluations and skipping a few evaluations in the beginning: mlcommons/training_policies#563. This lead to new RCPs #424.

The aim of this issue is to understand how the new RCPs relate to the old ones and figure out if there is still a way to compare llama3.1 405b results from v5.0 that were scaled to the old RCPs to results from future rounds which would be scaled to the new RCPs.

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions