@danielvegamyhre danielvegamyhre commented Feb 1, 2025

This script produces a minimal repro (with an exported memory snapshot) of the unexpectedly high memory usage seen when combining selective per-op activation checkpointing (AC) with fp8 rowwise quantization (#808).
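For readers unfamiliar with exported memory snapshots, here is a minimal sketch of how one can be captured in PyTorch. It is not the script from this PR: the helper name `dump_memory_snapshot`, the default file name, and the `max_entries` value are illustrative assumptions. It relies on the underscore-prefixed (private, subject to change) `torch.cuda.memory._record_memory_history` and `torch.cuda.memory._dump_snapshot` functions, and it returns `False` without doing anything when PyTorch or a CUDA device is not available.

```python
import importlib.util


def dump_memory_snapshot(step_fn, path="memory_snapshot.pickle", max_entries=100_000):
    """Run step_fn while recording CUDA allocations, then write a snapshot to path.

    Hypothetical helper for illustration only.
    Returns True if a snapshot was written, False if torch or CUDA is unavailable.
    """
    # Skip gracefully when PyTorch is not installed.
    if importlib.util.find_spec("torch") is None:
        return False

    import torch

    # Memory history can only be recorded for CUDA allocations.
    if not torch.cuda.is_available():
        return False

    # Start recording allocation/free events (with stack traces) on the GPU.
    torch.cuda.memory._record_memory_history(max_entries=max_entries)
    try:
        step_fn()  # e.g. one or more forward/backward training steps
        # Write the recorded history to a pickle file.
        torch.cuda.memory._dump_snapshot(path)
    finally:
        # Stop recording so later code is not slowed down by tracing.
        torch.cuda.memory._record_memory_history(enabled=None)
    return True


if __name__ == "__main__":
    # Placeholder workload; replace with the training steps under investigation.
    written = dump_memory_snapshot(lambda: None)
    print("snapshot written" if written else "torch/CUDA unavailable, nothing written")
```

The resulting pickle file can be loaded into the PyTorch memory visualizer at https://pytorch.org/memory_viz to inspect which allocations are live at the peak.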

@danielvegamyhre danielvegamyhre marked this pull request as draft February 1, 2025 01:17
@facebook-github-bot facebook-github-bot added the CLA Signed label Feb 1, 2025
danielvegamyhre commented Feb 26, 2025

I added a reusable version of the repro script for debugging future memory issues: https://github.com/danielvegamyhre/ml-perf-tools/blob/main/torchtitan/memory_analysis.py
