-
Notifications
You must be signed in to change notification settings - Fork 31
[WS3] End-to-end logprob cross-benchmark tool #131
Copy link
Copy link
Open
Labels
component: distributedTasks involving Ray actor management, cross-node scheduling, and communication synchronization.Tasks involving Ray actor management, cross-node scheduling, and communication synchronization.component: testingAdd test cases and benchmark-related tasksAdd test cases and benchmark-related tasksfeaturenext-phase
Metadata
Metadata
Labels
component: distributedTasks involving Ray actor management, cross-node scheduling, and communication synchronization.Tasks involving Ray actor management, cross-node scheduling, and communication synchronization.component: testingAdd test cases and benchmark-related tasksAdd test cases and benchmark-related tasksfeaturenext-phase
Type
Fields
Give feedbackNo fields configured for issues without a type.
Part of #83 · Deferred to next phase — not in this month's sprint. Tracks #106.
The headline deliverable: one command runs the same fixed model + prompt set + seed through real vLLM and real Megatron, dumps both logprob streams, and computes drift. Tolerance reuses the #108 threshold table.
Planned PRs: