Improve the OptimizationManopt.jl interface #1009
Conversation
```julia
# TODO: WHY? they both still accept not passing it
function SciMLBase.requireshessian(opt::Union{
        AdaptiveRegularizationCubicOptimizer, TrustRegionsOptimizer})
    true
end
```
How is this function defined and what is it for?
The current definition here is not correct: both ARC and TR can compute their own (actually quite good) approximation of the Hessian – similar to what quasi-Newton does.
So they do not need a Hessian; the exact one of course performs a bit better than the approximate one.
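For illustration, a minimal sketch of that behavior, assuming current Manopt.jl defaults (the manifold, cost, and gradient below are made up for this example): both solvers can be called without an explicit Hessian and fall back to an internal approximation.

```julia
using Manopt, Manifolds

M = Sphere(2)
q = [0.0, 0.0, 1.0]
f(M, p) = distance(M, p, q)^2
grad_f(M, p) = -2 * log(M, p, q)  # Riemannian gradient of the squared distance

p0 = rand(M)
# No Hessian passed – the solver falls back to an approximate Hessian.
trust_regions(M, f, grad_f, p0)
```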
It's a trait for checking whether the Hessian function is required in order to use the solver. For example, if your solver uses prob.f.hess then this should be true, so that you can fail if a second-order AD method is not given.
If it's not required, then this should be false. What this does is, if true, turn on an error message that says "prob.f.hess is not defined and therefore you cannot use this method" (not exactly, but at a high level that's pretty much what it's for: higher-level error messages and reporting).
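A hedged sketch of that gate (illustrative only, not the actual SciMLBase implementation; `check_hessian_available` is a hypothetical helper):

```julia
using SciMLBase

# Hypothetical guard: error early when a solver needs a Hessian but none is given.
function check_hessian_available(opt, f)
    if SciMLBase.requireshessian(opt) && f.hess === nothing
        error("prob.f.hess is not defined and therefore you cannot use this method; " *
              "provide a Hessian or a second-order AD backend.")
    end
    return nothing
end
```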
Currently:
```julia
# TODO: I do not understand this. Why is the manifold not used?
# Either this is an Euclidean cost, then we should probably still call `embed`,
# or it is not, then we need M.
return function (::AbstractManifold, θ)
```
Here we should check what is best to do; the current one works in some cases, but not all.
This I don't know. Do you need to know the manifold to know how to calculate the loss? I guess to know the mapping for some parameter values in some representations?
The signature of cost/grad/hess always has the manifold as the first parameter, since this allows implementing one cost for arbitrary manifolds, e.g. the Karcher mean, which minimises the sum of squared distances.
My main problem is that I do not understand which cost this is:

- on a manifold, it would contradict what the gradient function next does
- in the embedding, we would have to call `embed(M, θ)` before passing it to the function f that is defined in the embedding.

As long as `embed` is the identity, as for SPDs and the sphere, the current code works. But for fixed rank, for example, it would not.
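To illustrate the signature convention (a hedged sketch; `data` and the manifold choice are made up for this example), one generic cost can then be evaluated on any manifold:

```julia
using Manifolds

data = [rand(Sphere(2)) for _ in 1:5]

# One cost for arbitrary manifolds: a Karcher-mean-style objective that
# sums the squared distances from θ to the data points.
cost(M::AbstractManifold, θ) = sum(distance(M, θ, p)^2 for p in data)

M = Sphere(2)
cost(M, rand(M))
```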
@ChrisRackauckas @Vaibhavdixit02 I think I did nearly all I could do here regarding the questions above.
From the Manopt.jl side all has been addressed – even the convergence should now perform much better, since we introduced a `has_converged` (JuliaManifolds/Manopt.jl#511).
Ha! Found the bug! Someone (before me) implemented one Riemannian Hessian in the wrapper using […]. So tests run fine now. I would still prefer to get answers to the questions above.
# Conflicts:
#   lib/OptimizationManopt/Project.toml
As a last step for today I tried generating the docs locally, but it fails on other solvers' examples like NOMAD and MOI – so I cannot check how they would look. Also, there are 5 broken links that error. 🤨
# Conflicts:
#   docs/Project.toml
Rebase onto latest master so you get the test fixes.
```julia
end

# cf. https://github.com/SciML/SciMLBase.jl/blob/master/src/problems/optimization_problems.jl
# {iip} is the parameter here – nowhere explained but very much probably “is in place”
```
It's defined in the SciMLBase interfaces.
It took me a very long time to find that. But it is your package, so I do not have to maintain the code.
Is what you are looking for instead a comment that says `# {true} denotes the dispatch is in-place`? That's fine. See the other comment thread; I think I may have misinterpreted what you meant with this stream of text.
Yes, because that is nowhere documented, so if there is at least a comment here with a source, that might help future people looking at this. As I said, I spent about an hour finding that out, mostly by looking at how it is used.
I cannot write all your missing docs, but I can help code that uses the undocumented things to be more readable. This change here indicated to me that code should not be readable?
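For future readers, a hedged illustration of what `{iip}` encodes (the Rosenbrock functions below are just an example, not code from this PR): the type parameter distinguishes whether user functions mutate their first argument.

```julia
rosenbrock(u, p) = (p[1] - u[1])^2 + p[2] * (u[2] - u[1]^2)^2

# Out-of-place (iip = false): the gradient is returned.
grad_oop(u, p) = [
    -2 * (p[1] - u[1]) - 4 * p[2] * u[1] * (u[2] - u[1]^2),
    2 * p[2] * (u[2] - u[1]^2),
]

# In-place (iip = true): the gradient is written into the preallocated G.
function grad_iip!(G, u, p)
    G[1] = -2 * (p[1] - u[1]) - 4 * p[2] * u[1] * (u[2] - u[1]^2)
    G[2] = 2 * p[2] * (u[2] - u[1]^2)
    return nothing
end
```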
Looks reasonable and is passing tests. I left comments for each of the questions in there.
Originally I wanted to check the examples and provide nicer ones, but this was rushed so fast that it took only about a third of my grandfather's 93rd birthday cake and dinner to be merged. Now I feel a bit demotivated to do a next PR, check all of that again, and look at the examples. The current ones are – mildly phrased – technical and too complicated. There are much easier ones for new users to see how Manopt could be used.

edit: For example – the starting point of this PR was to fix that all solver runs still always report FAILURE in https://docs.sciml.ai/Optimization/stable/optimization_packages/manopt/. I had hoped my code did that, but I am still unsure how and where that is decided – because again, it is only mildly documented, if at all, how that return code is determined. Probably it is somewhere in the 150+ packages, but a mere muggle like me has no chance of finding that. But ok. In this rushed development here, maybe the time to find out that things work is also not so wanted. That is my feeling, sadly.
Hey, first of all, calm down and take a deep breath. I don't see what the controversy in all of this is, but we can take it in steps. It is becoming very difficult to weave through the passive-aggressive comments here to find out what exactly is being discussed. For the purpose of changing the tone going forward, I'm just going to give the benefit of the doubt and respond faithfully to every line, and I hope you will give the benefit of the doubt similarly in the future as well.
I'm not entirely sure what was rushed here. If you are truly feeling rushed, I am sorry; that is not the intention. For this PR I received multiple pings and Slack messages about not being attentive to it. I assumed that this meant I needed to move faster because you wanted to get this done quicker. Sorry if I misinterpreted that.
Normally it helps the velocity of the repo and the simplicity of the process to "do one thing per PR". It wasn't clear to me after the last comments that the intention was also to put the example change in this PR. I don't think it would be necessary to do it in this PR, because that example should at least work if the package is working again. While I would agree that "the SPD example is too complicated and bogus" as a first example for solving optimizations on manifolds – the first problem of this type is usually something simple like "optimize on a sphere!" – writing a new tutorial that improves the learning experience is pretty disconnected from just getting the package up and running on the latest versions, and so normal development practice would split those into separate PRs.

It doesn't necessarily have to, but since that's the norm, I had assumed that was the direction this was moving, since the update process turned into a month-long slog instead of something quick. And with the pings from you and @oscardssmith about unblocking latest versions around ForwardDiff, and the autodiff channel discussions on the topic (or wherever that was), I had thought "oh, I am behind, and everyone wants this in; let me make sure I get tests fixed so this can rebase and get in, and then we continue in the next step". The goal was to reduce the maintenance burden by breaking this down into simple problems and taking it step by step, getting the critical bug fixes and version bumps out there ASAP once tests were passing, due to requests from the ecosystem, and then following up with next steps. I am sorry if that was not clearly communicated. So going forward, I would suggest we do this in separate steps.
It would be due to the return code handling. You can see the solution struct being built in https://github.com/SciML/SciMLBase.jl/blob/master/src/solutions/optimization_solutions.jl#L84-L135. The default should be ReturnCode.Default in the constructor, but right here, https://github.com/SciML/Optimization.jl/blob/v4.8.0/lib/OptimizationManopt/src/OptimizationManopt.jl#L460, it's overridden so that it's always Success or Failure. I am not sure why.
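A hedged sketch of the alternative being discussed (simplified; `manopt_retcode` and the `converged` flag are hypothetical, cf. the `has_converged` idea in JuliaManifolds/Manopt.jl#511):

```julia
using SciMLBase: ReturnCode

# Instead of hard-coding Success/Failure, derive the return code from the
# solver's own convergence information.
manopt_retcode(converged::Bool) =
    converged ? ReturnCode.Success : ReturnCode.Failure
```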
All of this is contained within only this repo (notice that OptimizationBase is a part of this GitHub repository) or SciMLBase, where SciMLBase defines the global interfaces for all of SciML. Indeed, everything asked in this PR is documented there:

- the in-place specification: https://docs.sciml.ai/SciMLBase/stable/interfaces/Problems/#In-place-Specification
- the algorithm trait system: https://docs.sciml.ai/SciMLBase/stable/interfaces/Algorithms/
- the return code specification: https://docs.sciml.ai/SciMLBase/stable/interfaces/Solutions/#retcodes

Notably, the optimizer algorithm traits are missing from the specification (and shouldn't be optimizer-specific...), so that is something I will have to clean up, as another contributor did not properly define the interfaces here. This is part of the greater Optimization.jl cleanup that is in progress.

Note that I have actually wanted to pull the repos together, but other contributors mentioned in SciML/DifferentialEquations.jl#1082 that they wanted to see SciMLBase kept separate, so the architecture we're going towards, by popular demand, is to have just SciMLBase as top-level interfaces with each tower then being contained: one repo for all differential equations, one repo for all of optimization, etc. So it should generally only be 2 repos required, where SciMLBase only comes into play when dealing with the general multi-repo pieces (i.e. interface disconnected from solver implementation).
We are happy to have you here. Sorry if you feel rushed. But instead of reverting, I think at this point it would be best to keep moving forward, and one of the major things to do here is to divvy this up into discrete chunks of work in order to invite more people to participate. I will help out by taking the initiative to turn the remaining points I see into documented issues, and will start to look at this after JuliaCon Paris (of course, today is JuliaCon Paris and I just had a 28-hour flight from Australia to get here, so sorry if I'm a bit less responsive). I'll create the issues; let me know if any topic is missing.
Other comments addressed in the additional threads.
Thanks for the explanations. I feel the way of working here is maybe too different from how I usually work. My PRs are slow, never rushed, and aim to provide thorough, well-documented code. Here I feel too rushed when things are merged that fast. This PR was meant to do one thing: fix the `FAILURE` return code.
I sat down for about an hour and looked more seriously at the interface.
The old form had a bit too much clutter in it.
This should close #906, #943, and #944. We could also check #814 on this branch a bit closer.
With `has_converged` (JuliaManifolds/Manopt.jl#511, to be merged soon), we can even resolve that currently Optimization.jl always claims solver runs with Manopt failed.
/cc @Vaibhavdixit02 @oscardssmith (maybe @ChrisRackauckas?)