Heuristic Improvements: balance between generation and improvement heuristics #382

akifcorduk · 2025-09-09T10:12:38Z

Description

This PR changes the heuristic structure by creating a natural balance between generation and improvement.
The FP/FJ loop now adds solution to the population and only if we have enough diverse solutions we exit the loop and execute the population improvement. The diversity is increased to sqrt(n_integers). The recombiners are run between the current best and all other solutions in the current population, if stagnation is detected in FP/FJ loop and then the loop continues. The bounds prop rounding in the context of FP is also improved. When the dual simplex solution is set, the pdlp is warm started now with both primal and dual solutions.

The default tolerance is now 1e-6 absolute tolerance and 1e-12 relative tolerance.

This PR includes bug fixes on:

Apperance of inf/nan on z vector dual simplex phase2.
Invalid launch dimensions on FJ and hash kernels.
Timer diff and function time limit issues when the solver is run with unlimited time limit.

Benchmark results in 10 mins run on H100:

Main branch: 207 feasible solutions and average gap: '28.54', 3 unfinished/crashed
This PR: 213 feasible and average gap: '23.11', 1 unfinished/crushed. (The PR didn't have any crash before merge with main branch)

closes #142
closes #374
closes #218

cpp/include/cuopt/linear_programming/mip/solver_settings.hpp

hlinsen · 2025-09-16T17:45:10Z

cpp/src/mip/presolve/load_balanced_bounds_presolve.cu

    context(context_)
 {
-  setup(problem_);
+  // setup(problem_);


Should we remove setup completely?

There is a bug in load balanced setup, @kaatish is fixing that. This should normally be enabled, I will try if it is resolved with Aatish's recommendation.

chris-maes · 2025-09-16T20:26:48Z

cpp/src/dual_simplex/branch_and_bound.cpp

-    if (settings.heuristic_preemption_callback != nullptr) {
-      settings.heuristic_preemption_callback();
-    }
+    // FIXME: rarely dual simplex detects infeasible whereas it is feasible.


So if heuristics has a feasible solution, but branch and bound says it is infeasible, you take the feasible solution?

Yes, that's what i wanted to do for now.

chris-maes · 2025-09-16T20:32:51Z

cpp/src/dual_simplex/phase2.cpp

  for (i_t k = 0; k < delta_z_nz; ++k) {
    const i_t j = delta_z_indices[k];
    z[j] += step_length * delta_z[j];
+    if (std::isnan(z[j]) || std::isinf(z[j])) { return -1; }


This is a performance critical loop. Hopefully the compiler is smart about this, but would you mind doing something like:

f_t zj = z[j] + step_length * delta_z[j]; z[j] = zj; if (zj != zj || std::isinf(zj)) { return -1; }

Also, can you file an issue so that we can track that this should be removed?

I think isfinite is probably better suited as a single instruction, let me know if it is better.

Oh you are worried about double access, I am 90% sure compiler will optimize this but sure i can fix this to have single access.

We might not need this at all because the bound_flipping_ratio test has solved the numerical issue.

Indeed, we don't need this. Removed.

benchmarks/linear_programming/cuopt/run_mip.cpp

nguidotti · 2025-09-16T19:27:07Z

cpp/src/mip/presolve/probing_cache.cu

Is there a reason why we are limiting the number of threads to 8? It is not better to let be a user controller setting?

Actually yes, but since we don't have a thread pool now, it is separate for now. Once we have a thread pool we should enable more dynamic strategy.

nguidotti · 2025-09-16T20:48:02Z

cpp/include/cuopt/logger.hpp

 #endif
    logger_.set_level(default_level());
-    logger_.flush_on(rapids_logger::level_enum::info);
+    logger_.flush_on(rapids_logger::level_enum::debug);


Maybe we should keep the flush in the info option unless there is a performance penalty.

Debug is disabled in production, so there will be no performance penalty.

aliceb-nv

LGTM engine side, approving; thanks a lot for the great work Akif!

aliceb-nv · 2025-09-17T07:44:28Z

cpp/CMakeLists.txt



 option(BUILD_MIP_BENCHMARKS "Build MIP benchmarks" OFF)
+set(BUILD_MIP_BENCHMARKS ON)


Debugging leftover I assume :)
On my setup I add --cmake-args=\"-DBUILD_MIP_BENCHMARKS=ON\" to my build.sh command which does the trick

Makes sense, thanks will do that. I think we should add a build.sh command for this.

aliceb-nv · 2025-09-17T07:45:03Z

cpp/src/dual_simplex/bound_flipping_ratio_test.cpp

+  if (nonbasic_entering == -1) {
+    // -1,-2 and -3 are reserved for other things
+    return -4;
+  }


This issue is tracked, right?

I will create an issue thanks!

cpp/src/mip/diversity/population.cu

akifcorduk · 2025-09-18T08:42:03Z

/merge

…uristics (#382) This PR changes the heuristic structure by creating a natural balance between generation and improvement. The FP/FJ loop now adds solution to the population and only if we have enough diverse solutions we exit the loop and execute the population improvement. The diversity is increased to `sqrt(n_integers)`. The recombiners are run between the current best and all other solutions in the current population, if stagnation is detected in FP/FJ loop and then the loop continues. The bounds prop rounding in the context of FP is also improved. When the dual simplex solution is set, the pdlp is warm started now with both primal and dual solutions. The default tolerance is now 1e-6 absolute tolerance and 1e-12 relative tolerance. This PR includes bug fixes on: - Apperance of inf/nan on `z` vector dual simplex phase2. - Invalid launch dimensions on FJ and hash kernels. - Timer diff and function time limit issues when the solver is run with unlimited time limit. Benchmark results in 10 mins run on H100: - Main branch: 207 feasible solutions and average gap: '28.54', 3 unfinished/crashed - This PR: 213 feasible and average gap: '23.11', 1 unfinished/crushed. (The PR didn't have any crash before merge with main branch) closes #142 closes #374 closes #218 Authors: - Akif ÇÖRDÜK (https://github.com/akifcorduk) Approvers: - Ramakrishnap (https://github.com/rgsl888prabhu) - Alice Boucher (https://github.com/aliceb-nv) URL: #382

akifcorduk added 30 commits July 31, 2025 07:09

test fp only

d1ab4eb

generate and then run FP

47708f4

with fj

17561a2

with obj cut

6276efa

test FP ls

b2ed360

20s fp run

ebbb5e4

fix feasibility run. 20s LS

90c1696

fix feasibility run. 20s LS

7e859fb

add as part of the local search

5363e79

fp ls with mab

474d170

try with shorter run time

0690083

tidy up functions and reduce local min to 500

6c02547

revert cmake comments

0a176f1

fix typo

e2d0d8b

fix typo

a4c37d6

remove the warning in pdlp solve

49584ed

handle review comments

5ea08e8

handle review comments

85820e0

try without fp

4437481

fix warning

c0b0a21

enable lp and probing cache

ebb4958

fix issues including the objective cut not being on copy constructor

54afd53

without probing cache

df34a01

nearest rounding

5ebb86c

FJ alone run

57b1b9d

remove unnecessary function

ee861a2

Merge branch 'branch-25.10' of github.com:NVIDIA/cuopt into fp_ls

f960946

add assert

cc21c3d

handle weight issue on fj with changed size

9821f03

Merge branch 'branch-25.10' of github.com:NVIDIA/cuopt into fp_ls

9cd097a

akifcorduk added 7 commits September 16, 2025 03:22

don't preempt on mip infeasible

9186f2c

increased cpu threads and lowered probing cache

8e34d06

with presolve on miplib dataset

6556656

remove mps files

96d922b

Merge branch 'branch-25.10' of github.com:NVIDIA/cuopt into fp_tests

a743959

fix merge conflicts

88ae1cf

fix limiting memory resource

54941ee

akifcorduk marked this pull request as ready for review September 16, 2025 16:15

akifcorduk requested review from a team as code owners September 16, 2025 16:15

akifcorduk requested review from hlinsen, nguidotti and rgsl888prabhu September 16, 2025 16:15

try fixing load balancing

3f8b57b

rgsl888prabhu approved these changes Sep 16, 2025

View reviewed changes

hlinsen reviewed Sep 16, 2025

View reviewed changes

chris-maes reviewed Sep 16, 2025

View reviewed changes

nguidotti reviewed Sep 16, 2025

View reviewed changes

aliceb-nv approved these changes Sep 17, 2025

View reviewed changes

akifcorduk added 6 commits September 17, 2025 03:53

handle review comments

0d644fd

remove the -1 return

618d550

Merge branch 'branch-25.10' of github.com:NVIDIA/cuopt into fp_tests

7f529f5

fix fast solution not being returned

4748744

fix standardization test

153fbb2

fix elim var tests

aaeb08f

rapids-bot bot merged commit 3307a41 into NVIDIA:branch-25.10 Sep 18, 2025
201 of 202 checks passed



		option(BUILD_MIP_BENCHMARKS "Build MIP benchmarks" OFF)
		set(BUILD_MIP_BENCHMARKS ON)

Heuristic Improvements: balance between generation and improvement heuristics #382

Heuristic Improvements: balance between generation and improvement heuristics #382

Uh oh!

Conversation

akifcorduk commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aliceb-nv left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

akifcorduk commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

akifcorduk commented Sep 9, 2025 •

edited

Loading