Skip to content

fix: unify logger#66

Merged
LovelyBuggies merged 4 commits into
mainfrom
log
Jul 1, 2026
Merged

fix: unify logger#66
LovelyBuggies merged 4 commits into
mainfrom
log

Conversation

@LovelyBuggies

@LovelyBuggies LovelyBuggies commented Jun 30, 2026

Copy link
Copy Markdown
Member

I have put up with slight inconsistencies in the metric logging parameter names across different algorithms for a long time. Today, I finally decided I’d had enough, so I opened a PR.

@LovelyBuggies LovelyBuggies merged commit 10c5083 into main Jul 1, 2026
3 checks passed
@LovelyBuggies LovelyBuggies deleted the log branch July 1, 2026 16:18
@LovelyBuggies

LovelyBuggies commented Jul 2, 2026

Copy link
Copy Markdown
Member Author

This pull request aims to improve the consistency of logs across different algorithms. Currently, for all three algorithms, the results are consistent within each panel to within WandB.

TLDR and arXiv

image image

For both, MAGRPO reached 8,000 steps in about 4 hours; MAAC and IAC took about 12 hours to reach 7,000+ steps on B200. The total VRAM usage for all algorithms is under 80 GB. VRAM for MAGRPO and IAC shared is a bit over 40 GB.

CoopHE

image

MAGRPO reached 10,000 steps in about 5 hours; MAAC and IAC took about 9 hours to reach 8,000+ steps on B200. IAC (not shared-AC) uses the most memory, reaching approximately 130GB of VRAM; MAAC uses the second most, reaching over 110GB of VRAM; MAGRPO and IAC (shared-AC) use the least memory, at about 80GB.

HouseBuild

image

The performance of Pulse isn't that stable, so you might need to do a few more runs.

MAGRPO reached 4,000 steps in about 8 hours; MAAC and IAC took about 4 hours to reach 1,000+ steps on B200. I only tested IAC with shared actor-critic. The VRAM usage is about 140 GB, which is similar to MAAC and seems to be the limit for the H200. In contrast, MAGRPO uses slightly less VRAM, coming in at under 120 GB.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant