vec/qf - initial valid/borrowed/owned split for data #853

jeremylt · 2021-12-07T21:01:06Z

As we talked about on the telecon. I still want to do a pass to simplify the logic, but this runs and passes the tests.

jeremylt · 2021-12-07T23:06:05Z

I think we should make sure this design lines ups with @nbeams's path ahead for CeedVector precision, but I think the owned vs borrowed changes in here are important to get into main as they represent a current [minor] flaw in our interface.

nbeams

Thanks @jeremylt for working this up so quickly. I think this will be helpful to have in place before adding multiprecision storage to CeedVector.

I assume we also want to use void * instead of CeedScalar * in the CeedQFunctionContext_[Cuda/Hip] backend data structs? And in the parameter for the data variable in CeedQFunctionContextTakeData_Cuda? (The type in the HIP version got changed.)

backends/cuda/ceed-cuda-qfunctioncontext.c

backends/cuda/ceed-cuda-vector.c

jeremylt · 2021-12-09T23:37:49Z

I assume we also want to use void * instead of CeedScalar * in the CeedQFunctionContext_[Cuda/Hip] backend data structs? And in the parameter for the data variable in CeedQFunctionContextTakeData_Cuda? (The type in the HIP version got changed.)

Thanks for catching that! Updated.

jeremylt · 2021-12-10T18:35:15Z

I rearranged and clarified the logic. I think this should be easier to understand/remember and modify in future work.

jeremylt · 2021-12-11T00:01:33Z

With this latest pass, I pulled the data validity checks up into the interface, simplifying the logic in the backends. I think this should be pretty close to what we need.

doc/sphinx/source/libCEEDdev.md

jeremylt · 2021-12-15T00:09:44Z

Everything passes the test suite. This PR got bigger than I wanted, but I think we should be good for review + squash + merge, modulo anything else that we want to fixup.

nbeams · 2021-12-15T00:40:27Z

A quick test tells me we've broken something in the MFEM integration for MFEM's algebraic CEED solver -- from MFEM's ex1 with -d ceed-cuda -a -pa options. That part seems to be fixed by swapping a GetArray for GetArrayWrite, but then I'm getting a segfault from CeedOperatorGetActiveElemRestriction, which is most likely not related to this PR, I'd assume... (I cherry-picked the commits from mfem/mfem#2569 since I know the MFEM integration was already broken.)

At any rate, this may be a breaking change for some people using libCEED, depending on how it's integrated (where they should use GetArrayWrite now instead of GetArray), in case we want to be more explicit about a warning.

jeremylt · 2021-12-15T00:46:23Z

Since this is a breaking change, it might be a fair point to cut a new release. We haven't done a fall release yet.

nbeams

Still have a few minor questions, but I'm going ahead and marking as "approve" for this version of the interface. Thanks for all the work @jeremylt

backends/blocked/ceed-blocked-operator.c

backends/cuda/ceed-cuda-operator.c

python/tests/test-1-vector.py

nbeams · 2021-12-15T18:21:26Z

Though I guess, do we need to investigate the GitLab failure on Noether?

nbeams · 2021-12-15T18:34:31Z

Oh wait, I have another question about GetArrayWrite, prompted by the issue with the MFEM integration. It seems I may have run into a situation where calling GetArrayRead after GetArrayWrite throws the "no valid data to read" error, though GetArrayWrite should leave the vector in a valid state. Hmmm... will need to investigate further to make sure the issue is on the MFEM side rather than libCEED.

jeremylt · 2021-12-15T18:56:22Z

ICC/IFort failure is Intel's flakey installer, unrelated to this PR

jeremylt · 2021-12-15T19:04:45Z

It seems I may have run into a situation where calling GetArrayRead after GetArrayWrite throws the "no valid data to read" error, though GetArrayWrite should leave the vector in a valid state. Hmmm... will need to investigate further to make sure the issue is on the MFEM side rather than libCEED.

I think it's on the MFEM side of the integration? t124 tests this specific use case and all backends pass.

nbeams · 2021-12-15T19:39:38Z

It seems I may have run into a situation where calling GetArrayRead after GetArrayWrite throws the "no valid data to read" error, though GetArrayWrite should leave the vector in a valid state. Hmmm... will need to investigate further to make sure the issue is on the MFEM side rather than libCEED.

I think it's on the MFEM side of the integration? t124 tests this specific use case and all backends pass.

Yes, after looking into it some more, I do think it's just that we broke the MFEM integration (but it was already broken).

* vec/qf - initial valid/borrowed/owned split for data * vec/qf - tidy logic for checking active/stale data * minor - add missing NULL * doc - explain VectorTakeArray update * minor - update error messages * test - update error message in junit/tap * gpu - fix stray CeedScalar vs void for QFunctionContext * vec/qf - clarify/simplify access logic * vec - calloc host arrays when no value set to make empty * style - minor * style - minor * minor - fix error messages * vec/qf - move data validity checking to backend interface * gpu - add missing sync error checking for qfcontext * gpu - homogonize use of impl for backend data to reduce confusion * vec - clarify access conditions * python - update test for stricter vector access * vec - minor fixes * minor - fix ipython change * vec - add missing declarations in ceed/backend.h * ctx - mirror vector borrowed data check in ctx interface * vec - add CeedVectorGetArrayWrite * vec - consistent use of CeedVectorGetArray vs CeedVectorGetArrayWrite * python - small vec fixes * doc - describe vector data semantics * magma - update restriction * gpu - fix restr bug I added, need to sum into target * magma - fix restriction bug * cpu - fix restriction bug here too * op - fix evec allocations * julia - fix ElemRestriction for new vector access rules * op - double check GetArray vs Read vs Write usage * doc - small fix * restr - clean up read/write logic for restr * python - add vec.array_write * magma - typo fix

jeremylt added bug enhancement backend GPU CPU labels Dec 7, 2021

jeremylt requested review from jedbrown and nbeams December 7, 2021 21:01

jeremylt self-assigned this Dec 7, 2021

jeremylt force-pushed the jeremy/vec-borrow-own branch from 99553ae to 67afcc6 Compare December 7, 2021 21:05

jeremylt added the 1-In Review label Dec 7, 2021

nbeams mentioned this pull request Dec 7, 2021

Mixed precision functionality #778

Open

nbeams reviewed Dec 9, 2021

View reviewed changes

jeremylt added 11 commits December 10, 2021 15:52

vec/qf - initial valid/borrowed/owned split for data

caee039

vec/qf - tidy logic for checking active/stale data

cacf5be

minor - add missing NULL

996b89c

doc - explain VectorTakeArray update

85509fa

minor - update error messages

16e7618

test - update error message in junit/tap

bbe7d78

gpu - fix stray CeedScalar vs void for QFunctionContext

6f133d9

vec/qf - clarify/simplify access logic

0d1a96d

vec - calloc host arrays when no value set to make empty

62422d1

style - minor

61b8241

style - minor

1501cb3

jeremylt force-pushed the jeremy/vec-borrow-own branch from dacbc88 to 1501cb3 Compare December 10, 2021 22:54

jeremylt added 2 commits December 10, 2021 16:02

minor - fix error messages

12138f0

vec/qf - move data validity checking to backend interface

b353686

jeremylt commented Dec 14, 2021

View reviewed changes

doc/sphinx/source/libCEEDdev.md Show resolved Hide resolved

jeremylt added 2 commits December 14, 2021 15:19

magma - update restriction

c7df0e5

gpu - fix restr bug I added, need to sum into target

7b1c546

jeremylt force-pushed the jeremy/vec-borrow-own branch 2 times, most recently from 8b51f80 to 7b1c546 Compare December 15, 2021 00:06

jeremylt requested a review from nbeams December 15, 2021 00:08

nbeams approved these changes Dec 15, 2021

View reviewed changes

backends/blocked/ceed-blocked-operator.c Outdated Show resolved Hide resolved

backends/cuda/ceed-cuda-operator.c Outdated Show resolved Hide resolved

python/tests/test-1-vector.py Show resolved Hide resolved

jeremylt added 2 commits December 15, 2021 11:25

magma - fix restriction bug

44211aa

cpu - fix restriction bug here too

cbc8319

op - fix evec allocations

137f2ef

julia - fix ElemRestriction for new vector access rules

dc2457c

jeremylt added 2 commits December 15, 2021 12:32

op - double check GetArray vs Read vs Write usage

97b3d79

doc - small fix

01e02ac

restr - clean up read/write logic for restr

6bb2089

jeremylt force-pushed the jeremy/vec-borrow-own branch 2 times, most recently from 3e8c176 to 9c7553f Compare December 15, 2021 21:18

jeremylt added 2 commits December 15, 2021 14:19

python - add vec.array_write

2abadb7

magma - typo fix

3b62776

jeremylt force-pushed the jeremy/vec-borrow-own branch from 9c7553f to 2abadb7 Compare December 15, 2021 21:21

jeremylt merged commit 9c774ed into main Dec 17, 2021

jeremylt deleted the jeremy/vec-borrow-own branch December 17, 2021 18:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vec/qf - initial valid/borrowed/owned split for data #853

vec/qf - initial valid/borrowed/owned split for data #853

jeremylt commented Dec 7, 2021

jeremylt commented Dec 7, 2021

nbeams left a comment

jeremylt commented Dec 9, 2021

jeremylt commented Dec 10, 2021

jeremylt commented Dec 11, 2021

jeremylt commented Dec 15, 2021

nbeams commented Dec 15, 2021 •

edited

Loading

jeremylt commented Dec 15, 2021

nbeams left a comment

nbeams commented Dec 15, 2021

nbeams commented Dec 15, 2021

jeremylt commented Dec 15, 2021

jeremylt commented Dec 15, 2021

nbeams commented Dec 15, 2021

vec/qf - initial valid/borrowed/owned split for data #853

vec/qf - initial valid/borrowed/owned split for data #853

Conversation

jeremylt commented Dec 7, 2021

jeremylt commented Dec 7, 2021

nbeams left a comment

Choose a reason for hiding this comment

jeremylt commented Dec 9, 2021

jeremylt commented Dec 10, 2021

jeremylt commented Dec 11, 2021

jeremylt commented Dec 15, 2021

nbeams commented Dec 15, 2021 • edited Loading

jeremylt commented Dec 15, 2021

nbeams left a comment

Choose a reason for hiding this comment

nbeams commented Dec 15, 2021

nbeams commented Dec 15, 2021

jeremylt commented Dec 15, 2021

jeremylt commented Dec 15, 2021

nbeams commented Dec 15, 2021

nbeams commented Dec 15, 2021 •

edited

Loading