Aurora

Aurora is an optimizer for non-square matrices that achieves more effective utilization of MLP neurons. Instead of polar(G), which inherits non-uniform left-singular row norms, Aurora iteratively approximates a projection onto the intersection of the row oblique and Steifel manifolds, giving more balanced updates without sacrificing polar factor precision. For square matrices Aurora reduces to the standard Muon update.

See the blog for more information: https://blog.tilderesearch.com/blog/aurora

And Twitter at: https://x.com/tilderesearch/status/2052798181558370419

Code structure

src/
├── main.py               # Entry point: training loop and CLI
├── polar.py              # Polar factor via simple-quintic Newton-Schulz
├── aurora.py             # Aurora update rule
└── riemannian_aurora.py  # Riemannian Aurora: Riemannian gradient ascent on the balanced Stiefel manifold

Usage

from aurora import aurora

# Inside the training loop, for each weight tensor W with gradient G
# and a caller-managed momentum buffer m (zeros at init):
aurora(W, G, m, eta=lr, weight_decay=0.025)

Hyperparameters

pp_iterations (default 2): number of update refinement iterations. Higher values refine the update toward the row-uniform fixed point at the cost of one extra polar call per parameter per iteration.
pp_beta (default 0.5): damping exponent for the row normalization step, in (0, 1]. Default 0.5 gives undamped square-root steps; lower values damp oscillation between odd/even D iterates.
mu (default 0.95), nesterov (default True), weight_decay (default 0.025): standard Muon / SGD-momentum hyperparameters.

Utilities

polar.py uses simple-quintic 12-iteration Newton-Schulz with coefficients. Aurora's full aurora() step follows: Nesterov momentum → leverage-uniform polar → spectral aspect-ratio scale → decoupled weight decay. Different Newton-Schultz iterations can be added as a drop-in replacement to our polar function.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aurora

Code structure

Usage

Hyperparameters

Utilities

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Aurora

Code structure

Usage

Hyperparameters

Utilities

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages