Solve systems with multiple rhs #81

geoffroyleconte · 2023-03-09T17:59:39Z

Hello @mzy2240, does this fix your issue?

@dpo I copy-pasted some code that was in LDLFactorizations.jl for multiple rhs.

codecov · 2023-03-09T18:12:42Z

Codecov Report

All modified lines are covered by tests ✅

Files	Coverage Δ
src/LimitedLDLFactorizations.jl	`89.81% <100.00%> (+0.89%)`	⬆️

📢 Thoughts on this report? Let us know!.

mzy2240 · 2023-03-09T20:30:02Z

That's awesome! Thank you!

I tried lldl(A; memory=size(A,1)) to give an exact factorization, and then \ B does not give the same result as A \ B. Did I mis-understand anything here?

geoffroyleconte · 2023-03-09T21:57:03Z

Is the norm of the difference between the solution with lldl and \ high? If so can you send me a basic working example so that I can reproduce the issue?

mzy2240 · 2023-03-10T03:56:37Z

Ok the original PR gives accurate result after re-testing. However, I did a few tweaks to make it multi-threading, which surprisingly gives totally different result. See below:

function lldl_lsolve!(n, X::AbstractMatrix, Lp, Li, Lx)
  Threads.@threads for j = 1:n
    @inbounds for p = Lp[j]:(Lp[j + 1] - 1)
      @simd for k ∈ axes(X, 2)
        X[Li[p], k] -= Lx[p] * X[j, k]
      end
    end
  end
  return X
end

function lldl_dsolve!(n, X::AbstractMatrix, D)
  Threads.@threads for j = 1:n
    @simd for k ∈ axes(X, 2)
      X[j, k] /= D[j]
    end
  end
  return X
end

function lldl_ltsolve!(n, X::AbstractMatrix, Lp, Li, Lx)
  Threads.@threads for j = n:-1:1
    @inbounds for p = Lp[j]:(Lp[j + 1] - 1)
      @simd for k ∈ axes(X, 2)
        X[j, k] -= conj(Lx[p]) * X[Li[p], k]
      end
    end
  end
  return X
end

function lldl_solve!(n, B::AbstractMatrix, Lp, Li, Lx, D, P)
  @views Y = B[P, :]
  lldl_lsolve!(n, Y, Lp, Li, Lx)
  lldl_dsolve!(n, Y, D)
  lldl_ltsolve!(n, Y, Lp, Li, Lx)
  return B
end

Any idea why it gives wrong result?

mzy2240 · 2023-03-10T04:01:27Z

I think I found the reason. lldl_lsolve! and lldl_ltsolve! are not embarrassingly-parallelable. Any idea how to make it more thread-friendly?

dpo · 2023-03-10T08:22:39Z

Thanks @geoffroyleconte. There could also be methods for right-hand sides that are contiguous in memory, so that you can use x[j, :] instead of a for loop over the second axis.

geoffroyleconte · 2023-03-10T17:10:17Z

@mzy2240 maybe you are modifying some elements in X simultaneously, in which case you would need to use Threads.Atomic?

I'm not an expert in this subject, @amontoison do you know what could be wrong here?

amontoison · 2023-03-10T17:23:04Z

You should permute the for loops if you want to use Threads.@thread. It's normal that you have a wrong result, you can't perform backward and forward sweeps in any order.

But you do a complete backward and forward sweep of the columns of B on different threads.

amontoison · 2023-09-23T02:02:31Z

@geoffroyleconte Can we merge this PR?

amontoison · 2023-10-18T16:25:39Z

@geoffroyleconte Can you rebase your branch?

amontoison · 2023-10-18T18:20:42Z

Thanks @geoffroyleconte!

amontoison approved these changes Sep 23, 2023

View reviewed changes

geoffroyleconte force-pushed the multiple-rhs branch from 99829f3 to 48a7643 Compare September 23, 2023 16:02

geoffroyleconte added 2 commits October 18, 2023 13:42

fix JuliaSmoothOptimizers#80 (multiple rhs)

6112cee

fix type solve function

2df725d

geoffroyleconte force-pushed the multiple-rhs branch from 48a7643 to 2df725d Compare October 18, 2023 17:43

geoffroyleconte requested a review from amontoison October 18, 2023 17:47

amontoison approved these changes Oct 18, 2023

View reviewed changes

amontoison merged commit 58f1e47 into JuliaSmoothOptimizers:main Oct 18, 2023
16 checks passed

geoffroyleconte deleted the multiple-rhs branch October 18, 2023 19:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Solve systems with multiple rhs #81

Solve systems with multiple rhs #81

geoffroyleconte commented Mar 9, 2023 •

edited

Loading

codecov bot commented Mar 9, 2023 •

edited

Loading

mzy2240 commented Mar 9, 2023

geoffroyleconte commented Mar 9, 2023 •

edited

Loading

mzy2240 commented Mar 10, 2023

mzy2240 commented Mar 10, 2023 •

edited

Loading

dpo commented Mar 10, 2023

geoffroyleconte commented Mar 10, 2023

amontoison commented Mar 10, 2023

amontoison commented Sep 23, 2023

amontoison commented Oct 18, 2023

amontoison commented Oct 18, 2023

Solve systems with multiple rhs #81

Solve systems with multiple rhs #81

Conversation

geoffroyleconte commented Mar 9, 2023 • edited Loading

codecov bot commented Mar 9, 2023 • edited Loading

Codecov Report

mzy2240 commented Mar 9, 2023

geoffroyleconte commented Mar 9, 2023 • edited Loading

mzy2240 commented Mar 10, 2023

mzy2240 commented Mar 10, 2023 • edited Loading

dpo commented Mar 10, 2023

geoffroyleconte commented Mar 10, 2023

amontoison commented Mar 10, 2023

amontoison commented Sep 23, 2023

amontoison commented Oct 18, 2023

amontoison commented Oct 18, 2023

geoffroyleconte commented Mar 9, 2023 •

edited

Loading

codecov bot commented Mar 9, 2023 •

edited

Loading

geoffroyleconte commented Mar 9, 2023 •

edited

Loading

mzy2240 commented Mar 10, 2023 •

edited

Loading