Newton solver CUDA compatibility with Eigen #45

pramodk · 2019-03-09T13:45:03Z

- Mark Newton solver routines with EIGEN_DEVICE_FUNC so that the solver can be used from CPU or GPU kernels
- Usable with the latest master of Eigen
- Update the Eigen submodule to the latest master

Resolves #41

Change-Id: I49460a1a16428c3f1ad0b8cacd025f83b96a8389

lkeegan

LGTM!

By the way, did you find any documentation for EIGEN_DEVICE_FUNC? All I could find was this forum post:
https://eigen.tuxfamily.narkive.com/zeJ54QCx/when-to-mark-eigen-device-func
Based on it, it seems that since we use fixed-size matrices and therefore have no dynamic memory allocations, it should work fine on GPUs.
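
To illustrate the fixed-size point, the sketch below runs a Newton iteration on a 2x2 system using only stack storage. It uses plain `std::array` as a host-side stand-in rather than Eigen's fixed-size types, and `Vec2`, `Mat2`, `residual`, `jacobian` and `newton_2x2` are hypothetical names chosen for this example. It is not the solver from this PR.

```cpp
#include <array>
#include <cassert>
#include <cmath>

// Hypothetical fixed-size types: sizes are known at compile time,
// so all storage lives on the stack and nothing is heap-allocated.
using Vec2 = std::array<double, 2>;
using Mat2 = std::array<double, 4>;  // row-major: {j00, j01, j10, j11}

// Example system (assumed for illustration): x^2 + y^2 = 4 and x = y.
inline Vec2 residual(const Vec2& x) {
    return {x[0] * x[0] + x[1] * x[1] - 4.0, x[0] - x[1]};
}

inline Mat2 jacobian(const Vec2& x) {
    return {2.0 * x[0], 2.0 * x[1], 1.0, -1.0};
}

// Newton iteration; returns the number of iterations used,
// or -1 if the Jacobian is singular or the iteration limit is reached.
inline int newton_2x2(Vec2& x, int max_iter = 50, double tol = 1e-12) {
    for (int it = 0; it < max_iter; ++it) {
        const Vec2 f = residual(x);
        if (std::fabs(f[0]) < tol && std::fabs(f[1]) < tol) {
            return it;
        }
        const Mat2 j = jacobian(x);
        const double det = j[0] * j[3] - j[1] * j[2];
        if (det == 0.0) {
            return -1;
        }
        // Solve J * d = -f with Cramer's rule (fine for a 2x2 system).
        const double d0 = (-f[0] * j[3] + j[1] * f[1]) / det;
        const double d1 = (-j[0] * f[1] + j[2] * f[0]) / det;
        x[0] += d0;
        x[1] += d1;
    }
    return -1;
}
```

Starting from a guess such as `{1.0, 0.5}`, the iteration should settle on the root where both components equal the square root of 2.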

Also, would it be beneficial to define EIGEN_DEFAULT_DENSE_INDEX_TYPE as int, so that array indexing uses a 32-bit int instead of the default 64-bit type? As suggested here:
https://eigen.tuxfamily.org/dox/TopicCUDA.html

lkeegan · 2019-03-09T16:12:53Z

Also, by default Eigen will use OpenMP if it is enabled in the compiler.
If OpenMP is already used in CoreNEURON, we should probably set
EIGEN_DONT_PARALLELIZE
to prevent Eigen from creating more threads.
See https://eigen.tuxfamily.org/dox/TopicMultiThreading.html

pramodk · 2019-03-09T23:20:09Z

> By the way did you find some documentation for EIGEN_DEVICE_FUNC? All I could find was this forum

I was just grepping through the code :) ... it expands to the CUDA host/device annotation (__host__ __device__) if the code is being compiled with NVCC; otherwise it's empty (see here).
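
The expansion described here can be sketched with a stand-in macro. `SKETCH_DEVICE_FUNC` and `newton_sqrt` are hypothetical names made up for this illustration. This is not Eigen's actual definition of EIGEN_DEVICE_FUNC, which lives in Eigen's own headers and may check further conditions.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for EIGEN_DEVICE_FUNC (not Eigen's real definition):
// under NVCC it becomes the CUDA host/device annotation, elsewhere it is empty.
#if defined(__CUDACC__)
#define SKETCH_DEVICE_FUNC __host__ __device__
#else
#define SKETCH_DEVICE_FUNC
#endif

// Hypothetical example routine: Newton iteration for f(x) = x*x - a, a >= 0.
// With the annotation, the same source compiles as an ordinary function
// with a regular C++ compiler and as a host/device function with NVCC.
SKETCH_DEVICE_FUNC inline double newton_sqrt(double a, int max_iter = 50, double tol = 1e-12) {
    double x = a > 1.0 ? a : 1.0;  // starting guess at or above the root
    for (int i = 0; i < max_iter; ++i) {
        const double f = x * x - a;
        if (f < tol && f > -tol) {
            break;  // residual is small enough
        }
        x -= f / (2.0 * x);  // Newton update: x - f(x) / f'(x)
    }
    return x;
}
```

Called from host code, `newton_sqrt(2.0)` returns an approximation of the square root of 2. When compiled with NVCC, the same function could also be called from inside a kernel.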

> Also would it be beneficial to define EIGEN_DEFAULT_DENSE_INDEX_TYPE to int, to use a 32bit int for array indexing instead of the default 64bit? as suggested here:
> https://eigen.tuxfamily.org/dox/TopicCUDA.html

> Also by default Eigen will use openMP if enabled in the compiler. If openMP is already used in coreneuron we should probably set EIGEN_DONT_PARALLELIZE

Yeah, good points! We don't want threads from Eigen, as the simulator already uses OpenMP. If I am not mistaken, those macros should go somewhere in a global header. I created #47.
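
A minimal sketch of what such a global header could look like follows. The layout is an assumption for illustration and is not taken from the project. The helper `sketch_index_bytes` is hypothetical, and the Eigen include is guarded so that the fragment also compiles where Eigen is not installed.

```cpp
#include <cassert>

// Hypothetical global settings header (contents assumed, not the project's file).
// Both macros need to be defined before the first Eigen header is included.

// Ask Eigen not to parallelize internally; the simulator already uses OpenMP.
#ifndef EIGEN_DONT_PARALLELIZE
#define EIGEN_DONT_PARALLELIZE
#endif

// Use int for Eigen's dense index type instead of the default,
// as suggested in Eigen's CUDA notes.
#ifndef EIGEN_DEFAULT_DENSE_INDEX_TYPE
#define EIGEN_DEFAULT_DENSE_INDEX_TYPE int
#endif

// Eigen headers come only after the settings above.
#if defined(__has_include)
#if __has_include(<Eigen/Dense>)
#include <Eigen/Dense>
#endif
#endif

// Hypothetical helper: reports the size in bytes of the configured index type.
constexpr int sketch_index_bytes() {
    return static_cast<int>(sizeof(EIGEN_DEFAULT_DENSE_INDEX_TYPE));
}
```

Keeping the defines in one header that every translation unit includes first avoids the case where some files see Eigen with different settings than others.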


pramodk added the codegen Code generation backend label Mar 9, 2019

pramodk added this to the v0.2 milestone Mar 9, 2019

pramodk requested review from ohm314 and lkeegan March 9, 2019 13:45

pramodk mentioned this pull request Mar 9, 2019

Eigen and CUDA compatibility #41

Closed

lkeegan approved these changes Mar 9, 2019


pramodk mentioned this pull request Mar 9, 2019

Define EIGEN_DEFAULT_DENSE_INDEX_TYPE to int and EIGEN_DONT_PARALLELIZE in global header #47

Closed

Address review comments: separate header nmodl.hpp with common includes & settings

14863b1

Change-Id: Ibb92c931933ec657e0ccd3ef83f9ffcb368d934c

pramodk merged commit ad9c2ef into master Mar 10, 2019

pramodk deleted the pr/eigen-cuda branch March 10, 2019 12:19
