save unnecessary matmul #30

skyw · 2025-09-30T22:39:53Z

Only diagonal of approx_eigenvalue_matrix is needed for _orthogonal_iteration, added logic to skip the last matrix multiply.

Frobenius norm is unitarily invariant, that is for any orthogonal matrix Q and square matrix A , $||Q^T A Q||_F = ||A||_F$. So we can calculate norm on kronecker_factor instead of approx_eigenvalue_matrix.

copy-pr-bot · 2025-09-30T22:39:56Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Hao Wu <skyw@nvidia.com>

skyw · 2025-09-30T22:46:28Z

/ok to test e3bf176

mkhona-nvidia · 2025-09-30T22:59:05Z

emerging_optimizers/soap/soap_utils.py

-            # (i.e. the approximated eigenvectors diagonalize the kronecker factor)
-            approx_eigenvalue_matrix = eigenbasis.T @ kronecker_factor @ eigenbasis
            # Update eigenbasis when necessary. Update is skipped only when adaptive update criteria is met.
            if _adaptive_criteria_met(


_adaptive_criteria_met also extracts the diagonal and computes diagonal norm, in addition to matrix frobenius norm. So I think this will need the full approx_eigenvalue_matrix diagonal for diagonal norm and the kronecker factor for frobenius norm, separately.

right. Added it back.

emerging_optimizers/soap/soap_utils.py

Signed-off-by: Hao Wu <skyw@nvidia.com>

emerging_optimizers/soap/soap_utils.py

skyw · 2025-09-30T23:44:38Z

/ok to test 7265ff4

mkhona-nvidia

New change

* save unnecessary matmul Signed-off-by: Hao Wu <skyw@nvidia.com> * simplify criteria logic Signed-off-by: Hao Wu <skyw@nvidia.com> * remove max precondition dim Signed-off-by: Hao Wu <skyw@nvidia.com> Signed-off-by: mikail <mkhona@nvidia.com>

skyw requested a review from mkhona-nvidia September 30, 2025 22:39

save unnecessary matmul

e3bf176

Signed-off-by: Hao Wu <skyw@nvidia.com>

skyw force-pushed the skyw/optimize_soap branch from 990a47a to e3bf176 Compare September 30, 2025 22:43

copy-pr-bot bot temporarily deployed to test September 30, 2025 22:46 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci September 30, 2025 22:46 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci September 30, 2025 22:48 Inactive

mkhona-nvidia reviewed Sep 30, 2025

View reviewed changes

skyw added 3 commits September 30, 2025 16:15

simplify criteria logic

b66bf2a

Signed-off-by: Hao Wu <skyw@nvidia.com>

remove max precondition dim

25fd2a4

Signed-off-by: Hao Wu <skyw@nvidia.com>

add pst back

7265ff4

Signed-off-by: Hao Wu <skyw@nvidia.com>

mkhona-nvidia reviewed Sep 30, 2025

View reviewed changes

emerging_optimizers/soap/soap_utils.py Show resolved Hide resolved

copy-pr-bot bot temporarily deployed to test September 30, 2025 23:44 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci September 30, 2025 23:44 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci September 30, 2025 23:46 Inactive

mkhona-nvidia approved these changes Sep 30, 2025

View reviewed changes

skyw enabled auto-merge (squash) October 1, 2025 00:00

skyw merged commit 2ebdb41 into main Oct 1, 2025
12 checks passed

skyw deleted the skyw/optimize_soap branch October 1, 2025 00:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

save unnecessary matmul #30

save unnecessary matmul #30

Uh oh!

skyw commented Sep 30, 2025

Uh oh!

copy-pr-bot bot commented Sep 30, 2025

Uh oh!

skyw commented Sep 30, 2025

Uh oh!

mkhona-nvidia Sep 30, 2025

Uh oh!

skyw Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

skyw commented Sep 30, 2025

Uh oh!

mkhona-nvidia left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

save unnecessary matmul #30

save unnecessary matmul #30

Uh oh!

Conversation

skyw commented Sep 30, 2025

Uh oh!

copy-pr-bot bot commented Sep 30, 2025

Uh oh!

skyw commented Sep 30, 2025

Uh oh!

mkhona-nvidia Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

skyw Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

skyw commented Sep 30, 2025

Uh oh!

mkhona-nvidia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants