
Improve optimisers for +R models #56

Open
roblanf opened this issue Feb 10, 2022 · 2 comments
Labels
modelfinder2 things to do before benchmarking modelfinder2

Comments

@roblanf
Collaborator

roblanf commented Feb 10, 2022

On many datasets I notice warnings like:

WARNING: Log-likelihood -7226.94 of K2P+R4 worse than K2P+R3 -7059.91

Obviously this shouldn't happen: since R3 is nested within R4, the lnL with R4 should always be at least as good as with R3. I'm guessing this is just a limitation of the current optimiser, and in many cases it seems like a fairly big one. E.g. in the example above the difference is >150 log-likelihood units.

So, I have a suggestion. When we optimise RN+1 (e.g. R4), we should do an initialisation step where we start with the ML rate parameters from RN (e.g. R3) and just add one extra rate while holding the initial N parameters constant. We can then optimise this constrained model, e.g. by sliding the new parameter from the minimum bound up to double the maximum rate from RN. My bet is that this will often give us an RN+1 model with a better likelihood. But even if it doesn't, we can then pass these RN+1 rates to the BFGS or EM optimiser to further optimise them all together.

Thoughts @bqminh and @thomaskf? This is really just a constrained EM step to start with. And maybe we already do something like this.

Either way, it seems like there's room for improvement here.

@roblanf roblanf added the modelfinder2 things to do before benchmarking modelfinder2 label Feb 10, 2022
@bqminh
Collaborator

bqminh commented May 20, 2024

This is done already, i.e. the R4 parameters are initialised from the R3 ones. In the code it's this function: RateFree::initFromCatMinusOne() in model/ratefree.cpp:

```cpp
void RateFree::initFromCatMinusOne() {
    // ...
}
```
However, this is only one way of initialisation. Happy to chat if you have 'better' suggestions. Anyway, a lot of testing will be needed...

@roblanf
Collaborator Author

roblanf commented May 20, 2024

My suggestion is not quite the same. It's that we initially hold the CatMinusOne parameters constant and optimise only the new parameter. Once that's done, we optimise all of them together.

E.g. if R2 gave: 0.1, 2.0

Then we initialise R3 with 0.1, 2.0, New

And we hold 0.1 and 2.0 constant while finding the optimum value of New (allowing it to take any value from the minimum to the maximum bound, i.e. smaller than 0.1, between 0.1 and 2.0, or larger than 2.0).

This might give e.g.

R3: New=0.08, 0.1, 2.0; 0.1, New=0.5, 2.0; 0.1, 2.0, New=3.2 etc.

It should be simple to find a better likelihood like this, because we are optimising a single parameter.

The final step is to optimise all parameters at once, using the values from the prior step as the initialisation.

Happy to test this if someone can implement it.
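To make the proposal concrete, here is a minimal sketch of the two-stage scheme on a toy objective. Everything here is hypothetical: `toyLogLik` is a synthetic quadratic surface standing in for the real tree likelihood, and `scanNewRate` / `refineAll` are illustrative names, not IQ-TREE functions. Stage 1 grid-scans the new category's rate with the RN rates held fixed; stage 2 does a crude coordinate-wise hill climb standing in for the BFGS/EM pass.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdio>
#include <vector>

// Toy stand-in for the log-likelihood surface. Its optimum is NOT at
// the R2 rates, so adding a third category can genuinely improve it.
double toyLogLik(const std::vector<double>& rates) {
    static const double target[3] = {0.08, 0.9, 3.0};  // hidden "true" rates
    double lnl = 0.0;
    for (double r : rates) {
        double best = -1e18;
        for (double t : target)  // each rate scores best near some target
            best = std::max(best, -(r - t) * (r - t));
        lnl += best;
    }
    return lnl;
}

// Stage 1: hold the RN rates fixed, grid-scan the new category's rate
// between `lo` and `hi`, and return the best-scoring candidate.
double scanNewRate(const std::vector<double>& fixedRates,
                   double lo, double hi, int steps, double* bestLnl) {
    double bestRate = lo, best = -1e18;
    for (int i = 0; i <= steps; ++i) {
        double r = lo + (hi - lo) * i / steps;
        std::vector<double> rates = fixedRates;
        rates.push_back(r);
        double lnl = toyLogLik(rates);
        if (lnl > best) { best = lnl; bestRate = r; }
    }
    if (bestLnl) *bestLnl = best;
    return bestRate;
}

// Stage 2: crude joint refinement of all rates (coordinate-wise hill
// climbing with a shrinking step), standing in for BFGS/EM.
double refineAll(std::vector<double>& rates, int iters) {
    double step = 0.1;
    double cur = toyLogLik(rates);
    for (int it = 0; it < iters; ++it) {
        for (size_t k = 0; k < rates.size(); ++k) {
            for (double d : {step, -step}) {
                std::vector<double> trial = rates;
                trial[k] += d;
                if (trial[k] <= 0.0) continue;  // rates must stay positive
                double lnl = toyLogLik(trial);
                if (lnl > cur) { cur = lnl; rates = trial; }
            }
        }
        step *= 0.7;  // anneal the step size
    }
    return cur;
}
```

Because stage 1 is a one-dimensional scan, it is cheap and hard to get wrong, and stage 2 only ever accepts improvements, so the final score can never be worse than the constrained initialisation.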
