
Get the largest allocatable block size #304

Merged
nouiz merged 10 commits into Theano:master from abergeron:largest_block on Dec 1, 2016
Conversation

@abergeron (Member) commented Nov 29, 2016

This also includes a location change for blas loading.

@abergeron abergeron changed the title Largest block Get the largest allocatable block size Nov 29, 2016
@lamblin (Member) commented Nov 30, 2016

Can you temporarily change the way Theano is installed in travis.yml, so that it gets installed from your updated branch?


case GA_CTX_PROP_FREE_GMEM:
/* There is no way to query free memory so we just return the
largest block size */
Member

We can get an upper bound, since we know the size we allocated and the total size of the memory on the GPU.

I'm not suggesting that we do that here: using an upper bound could cause a crash, while using a lower bound as currently done is safer, though possibly less efficient. This is mostly just a comment.

Member Author

I don't know the size that was allocated here, since I don't keep track of it for OpenCL.

Also, this is neither a lower nor an upper bound; it's just the largest size that clMemAlloc will accept, not taking into account how much memory is actually free. It's really crappy, but until we handle memory allocation here in a way similar to cuda, we can't do better.

cuda_exit(ctx);
/* We guess that we can allocate at least a quarter of the free size
in a single block. This might be wrong though. */
sz /= 4;
Member

This should be documented in the docs about the user function that gets the properties; mostly, that this returns a quarter of the free memory on the GPU as the biggest allocatable block.

Member Author

This case handles memory that hasn't been preallocated. We can't query the largest block available for cuMalloc, so I am resorting to a guess here.

Member

Yes I understand that. My only point is to document that.

Member Author

I consider this to be an implementation detail (and a bad one at that) and I would prefer not to document it since I hope to change it to something better whenever possible.

Member

OK, the size / 4 is a detail. But there is no doc about which properties can be queried. I'll make an issue about that.

Member Author

There is documentation that is generated from the headers. This should list all the defined properties.

{
int e = load_libcublas(major, minor);
if (e != GA_NO_ERROR)
return e;
Member

Can you confirm that this causes cublas to be initialized before the preallocation? I think so, but I'm not 100% sure.

Member Author

No it doesn't in most situations.

Comment thread src/CMakeLists.txt
  MACOSX_RPATH OFF
  # This is the shared library version
- VERSION 0.0
+ VERSION 0.1
Member

I'll come back to the versioning. We now have a new interface, but this won't trigger a recompilation and won't give a good user warning. If people update Theano but not libgpuarray, they will get a compilation error related to the convolution, as GA_CTX_PROP_LARGEST_MEMBLOCK isn't defined. Check the jenkins buildbot:

http://darjeeling.iro.umontreal.ca:8080/job/Theano%20gpu/550/testReport/junit/theano.gpuarray.tests.test_dnn/TestDnnInferShapes/test_conv_time_on_shape_change_valid_conv/

If we keep it like that, we will frequently get useless user questions, and we will lose our own time and the users' time. If we bump the major version, I don't think it will give a good user error either. We need to fix that. It could be for 0.9rc2, but it should be before 0.9.

Member Author

I don't think it is a problem that changing the minor version here will not trigger a recompilation. Everything that worked with 0.0 will work with 0.1. That is the point of this version scheme.

If we were to change the major version, it would make the currently compiled modules unloadable, possibly triggering a recompilation (I'm not 100% sure how Theano deals with C code it can't reload).

If people update Theano and it uses newly introduced symbols, they will need to update libgpuarray also, yes. Usually this is handled with recommended versions for releases and a guideline of the form: use the latest master of libgpuarray for the latest master of Theano.

My problem with maintaining another version scheme is that it duplicates work, and we will forget to bump one or the other for some changes, which will lead to exactly the problems you describe, except that nobody will expect them.

@nouiz (Member) commented Nov 30, 2016

Comment thread src/gpuarray/buffer.h
*
* Type: `size_t`
*/
#define GA_CTX_PROP_LARGEST_MEMBLOCK 20
Member

It should be added to gpuarray.pxd.

Member Author

I can add it for sure, but I usually only add the values that I need, and I don't need this one currently.

@nouiz (Member) commented Dec 1, 2016 via email

@abergeron (Member Author)

You want me to add a new context property that exposes this value? In that case ok.
