More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS #394

jaimergp · 2023-11-21T10:24:12Z

Description

Inspired by #391. Refactoring a bit before we delve in the question of "how many attempts should we try before giving up?".

cc @mbargull

Checklist - did you ...

Add a file to the news directory (using the template) for the next release's release notes?
Add / update necessary tests?
Add / update outdated documentation?

jaimergp · 2023-11-21T12:19:50Z

conda_libmamba_solver/solver.py

+            return max_attempts_from_env
+        if in_state.update_modifier.FREEZE_INSTALLED:
+            # this the default, but can be overriden with --update-specs
+            # TODO: should we cap this at a reasonable number? some base envs have 100s of pkgs


This is the main point of this PR :) Thoughts?

I haven't messed with libmamba solver at all, so absolutely no idea/feeling about what ranges would be sensible.

If you don't want a hard limit, you could use some function that just increases slower for higher inputs.
If you want a hard limit you could either do min(n_installed, limit) or something that only approaches the limit gradually like math.ceil(n_installed / (1 + (n_installed / limit)**2)**.5) or something more/less sophisticated.

Each solve is usually sub-second, but some complex one that need some extra backtracking (like the one reported in your issue) take a 2-4s. If you end up with 100 installed packages then you need to wait a few minutes for the solver to give up.

By "giving up" I mean that we stop trying to unlock installed packages and we let the solver modify them as needed, just as mamba does. So maybe we can get by with 10 attempts. In other words, let the solver unfreeze up to 5-10 conflicting installed packages one by one until we just let everything float.

Alternatively, we could make the retry loop be based on time spent and not iterations, but that might be more complex than necessary.

I'm going to try with 10 attempts and see if that is enough to pass all tests. Then we adjust as necessary.

Makes sense.
Could you add that as a constant variable (DEFAULT_LIBMAMBA_SOLVER_MAX_ATTEMPTS or something ) to better track that?

Alternatively, we could make the retry loop be based on time spent and not iterations, but that might be more complex than necessary.

Yes, complexity, but also determinism ;).

jaimergp · 2023-11-25T16:17:37Z

Errors are due to conda/conda#13360

jezdez

Just a few suggestions, nothing major

conda_libmamba_solver/solver.py

jezdez · 2023-12-04T17:28:39Z

conda_libmamba_solver/solver.py

-            int(os.environ.get("CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS", len(in_state.installed))) + 1,
-        )
-        for attempt in range(1, max_attempts):
+        for attempt in range(1, self._max_attempts(in_state) + 1):


Would it be useful to prepopulate the max attempts at the start of the solve as a instance attribute, instead of inline, so it's easier to debug?

I'm not sure if this might break some tests. It shouldn't, but by saving it as instance attribute we are assuming we are going to have the same number of attempts for every call to solve_for_*() (for the lifetime of the same instance). In some instances, that might be tied to len(in_state.installed).

So I'd rather keep it like it is because it's technically more correct and does not assume that "one instantiation, one solve".

Sounds good, thank you!

jezdez · 2023-12-04T17:31:05Z

@mbargull To clarify, does this actually fix #391 for you or is this just a nice-to-have?

mbargull · 2023-12-05T09:57:03Z

@mbargull To clarify, does this actually fix #391 for you or is this just a nice-to-have?

No, gh-391 actually failed on the first attempt for me.
The example from gh-391 was just one of those for which @jaimergp noticed long run times for when it didn't segfaulted (since in that example it has to downgrade some packages).

gh-381 incidentally fixes the example from gh-391, but it is unclear if other cases might still fail depending on the ordering; see the discussing from #391 (comment) onward.

Co-authored-by: Jannis Leidel <jannis@leidel.info>

jezdez · 2023-12-05T13:50:36Z

@mbargull To clarify, does this actually fix #391 for you or is this just a nice-to-have?

No, gh-391 actually failed on the first attempt for me. The example from gh-391 was just one of those for which @jaimergp noticed long run times for when it didn't segfaulted (since in that example it has to downgrade some packages).

gh-381 incidentally fixes the example from gh-391, but it is unclear if other cases might still fail depending on the ordering; see the discussing from #391 (comment) onward.

Ah, thanks for the pointer, much appreciated!

more explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS

a6e6eeb

conda-bot added the cla-signed [bot] added once the contributor has signed the CLA label Nov 21, 2023

jaimergp commented Nov 21, 2023

View reviewed changes

add news

0520a1b

jaimergp marked this pull request as ready for review November 22, 2023 09:28

let's see if anything breaks with 10 attempts

6630bd5

define MAX_SOLVER_ATTEMPTS_CAP

d817f33

jaimergp closed this Nov 27, 2023

jaimergp reopened this Nov 27, 2023

pre-commit

dae4c3f

jezdez reviewed Dec 4, 2023

View reviewed changes

Update conda_libmamba_solver/solver.py

90db74e

Co-authored-by: Jannis Leidel <jannis@leidel.info>

jezdez approved these changes Dec 5, 2023

View reviewed changes

jezdez changed the title ~~more explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS~~ More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS Dec 5, 2023

jezdez merged commit 96c59f3 into main Dec 5, 2023
71 checks passed

jezdez deleted the limit-max-attempts branch December 5, 2023 13:54

jaimergp mentioned this pull request Dec 11, 2023

conda solver is still extremely slow, even with version 23.10.0 #399

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS #394

More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS #394

jaimergp commented Nov 21, 2023 •

edited

Loading

jaimergp Nov 21, 2023

mbargull Nov 21, 2023

jaimergp Nov 22, 2023

jaimergp Nov 23, 2023

mbargull Nov 23, 2023

jaimergp commented Nov 25, 2023

jezdez left a comment

jezdez Dec 4, 2023

jaimergp Dec 5, 2023

jezdez Dec 5, 2023

jezdez commented Dec 4, 2023

mbargull commented Dec 5, 2023

jezdez commented Dec 5, 2023

More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS #394

More explicit logic for CONDA_LIBMAMBA_SOLVER_MAX_ATTEMPTS #394

Conversation

jaimergp commented Nov 21, 2023 • edited Loading

Description

Checklist - did you ...

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jaimergp commented Nov 25, 2023

jezdez left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jezdez commented Dec 4, 2023

mbargull commented Dec 5, 2023

jezdez commented Dec 5, 2023

jaimergp commented Nov 21, 2023 •

edited

Loading