
Clean up #4672

Closed · wants to merge 10 commits

Conversation

@Sentient07 (Contributor) commented Jun 27, 2016

The cleanup of PR #4570. Will rebase onto master once #4570 gets merged.
Tasks:

  • Correctly handle HostFromGpu nodes
  • Remove the special case for Alloc / AllocEmpty from the new optimizer, and register these lifters in an "out2in" topooptimizer
  • Replace op_lifter with out2in
  • Factor out caching of Op instances
  • Check if all the optimizers used by op_lifter are also used by op_lifter_topo
  • Implement backward pass.
  • Remove op_lifter from EquilibriumOptimizer

@lamblin Am I missing anything in the to-do list?

@@ -468,4 +469,4 @@ def use_gpu_cumsumop(op, ctx_name, inputs, outputs):
     if axis is None:
         axis = 0
     assert isinstance(x.type, GpuArrayType)
-    return GpuCumsum(axis)(x)
+    return GpuCumsum(axis, ctx_name)(x)
@Sentient07 (Contributor, Author)
I felt it is better to pass the context_name. Will change if necessary

(Member)

As documented here, the best practice is to infer the context during make_node, not to force the context when creating the Op.
Please remove the context_name attribute.
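To illustrate the practice the reviewer points to, here is a schematic sketch (not actual Theano code; `Variable`, `Apply`, and `GpuCumsumSketch` are simplified stand-ins) of an Op that keeps only `axis` in its props and infers the GPU context from its input inside `make_node`, instead of fixing a context at construction time:

```python
# Simplified stand-ins for Theano's graph classes (assumption: names
# are illustrative only, not the real API).
class Variable:
    def __init__(self, context_name):
        self.context_name = context_name  # context the data lives on

class Apply:
    def __init__(self, op, inputs, context_name):
        self.op = op
        self.inputs = inputs
        self.context_name = context_name

class GpuCumsumSketch:
    __props__ = ('axis',)  # context_name deliberately NOT part of the props

    def __init__(self, axis):
        self.axis = axis

    def make_node(self, x):
        # Infer the context from the input at graph-construction time,
        # rather than forcing it when creating the Op.
        return Apply(self, [x], context_name=x.context_name)

x = Variable(context_name='dev0')
node = GpuCumsumSketch(axis=0).make_node(x)
print(node.context_name)  # 'dev0': inferred from x, not stored on the Op
```

Keeping the context out of `__props__` also means two `GpuCumsumSketch(0)` instances compare equal regardless of which device their inputs end up on.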

@Sentient07 (Contributor, Author)

Yeah, I'll remove that commit.

@Sentient07 (Contributor, Author)

At line 467 of theano/gpuarray/type.py, if I call transfer instead of host_from_gpu, it gives:
RuntimeError: maximum recursion depth exceeded

@Sentient07 (Contributor, Author)

In this PR, I've removed ShapeOptimizer from fast_compile. Some optimizations that are part of fast_compile need ShapeOptimizer, so I'm removing those optimizations from fast_compile as well. I have yet to open the PR that removes ShapeOptimizer. @lamblin, I believe that PR should be merged before this one?

gpu_seqopt.register('gpuarray_local_optimiziations', gpu_optimizer, 1,
                    'fast_compile', 'fast_run', 'gpuarray')
gpu_seqopt.register('gpuarray_cut_transfers', gpu_cut_copies, 2,
                    'fast_compile', 'fast_run', 'gpuarray')

gpu_seqopt.register('op_lifter_topo',
                    TopoOptimizer(gpu_topo.query('+fast_compile'),
                                  order='out_to_in'),
                    10, 'fast_run', 'gpuarray', 'fast_compile')
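The registration calls above pair a position with a set of tags, and `query('+fast_compile')` selects registered optimizers by tag. As a rough illustration of those semantics, here is a hypothetical miniature registry (an assumption for illustration; `MiniSeqDB` is not Theano's actual optimizer-DB implementation, which supports richer query expressions):

```python
# Hypothetical miniature of Theano's tag-and-priority registration scheme.
class MiniSeqDB:
    def __init__(self):
        self._entries = []  # (position, name, tags, optimizer)

    def register(self, name, opt, position, *tags):
        self._entries.append((position, name, set(tags), opt))

    def query(self, tag_expr):
        # This sketch supports only the '+tag' (include) form.
        tag = tag_expr.lstrip('+')
        hits = [e for e in self._entries if tag in e[2]]
        # Lower position runs earlier, mirroring the registrations above.
        return [name for _, name, _, _ in sorted(hits, key=lambda e: e[0])]

db = MiniSeqDB()
db.register('gpuarray_local_optimiziations', object(), 1,
            'fast_compile', 'fast_run', 'gpuarray')
db.register('gpuarray_cut_transfers', object(), 2,
            'fast_compile', 'fast_run', 'gpuarray')
db.register('op_lifter_topo', object(), 10,
            'fast_run', 'gpuarray', 'fast_compile')

print(db.query('+fast_compile'))
# ['gpuarray_local_optimiziations', 'gpuarray_cut_transfers', 'op_lifter_topo']
```

Under this model, the reviewer's request below amounts to moving `op_lifter_topo` from position 10 to somewhere in [-0.5, 1], so it runs before (or alongside) the first pass rather than last.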
(Member)

Use something between -0.5 and 1 instead of 10 here.

@lamblin (Member) commented Jun 30, 2016

It seems to make sense for a first draft of the backward pass (using TopoOptimizer and LocalGroupDB).
When you change the priorities so it is executed before the EquilibriumOptimizer, we can see in the profiling if there is something to gain, and if we need to optimize the implementation further.
For instance, maybe we would need to implement tracks in LocalGroupDB, like in EquilibriumDB.

@Sentient07 (Contributor, Author)

> For instance, maybe we would need to implement tracks in LocalGroupDB, like in EquilibriumDB.

Could you please elaborate on this?

@lamblin (Member) commented Jun 30, 2016

EquilibriumDB has "tracks": the Ops to match when trying to apply its local optimizers. Some local optimizers will only be tried on nodes corresponding to certain Ops; the tracks come from the Ops passed to the local_optimizer decorator.
For instance, if a local opt is registered with @local_optimizer([GpuFromHost, GpuToGpu, HostFromGpu]), then the EquilibriumOptimizer will only try to apply it on nodes where node.owner.op is an instance of those types.
I'm not sure if LocalGroupDB uses this information. If it does not, then we may need to implement that logic, like in EquilibriumOptimizer.
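The dispatch idea behind tracks can be sketched in a few lines (a simplified illustration, not Theano's implementation; the op classes, `local_optimizer`, and `try_optimizers` here are stand-ins):

```python
# Stand-in op classes (assumption: illustrative only).
class GpuFromHost: pass
class HostFromGpu: pass
class AddOp: pass

def local_optimizer(tracks):
    """Decorator that records which Op classes an optimizer applies to."""
    def wrap(fn):
        fn.tracks = tuple(tracks)
        return fn
    return wrap

@local_optimizer([GpuFromHost, HostFromGpu])
def lift_transfer(node_op):
    return 'lifted'

def try_optimizers(node_op, optimizers):
    # The driver consults the tracks and skips optimizers whose
    # tracked Op classes don't match this node's op.
    applied = []
    for opt in optimizers:
        if isinstance(node_op, opt.tracks):
            applied.append(opt(node_op))
    return applied

print(try_optimizers(GpuFromHost(), [lift_transfer]))  # ['lifted']
print(try_optimizers(AddOp(), [lift_transfer]))        # []
```

The profiling win lamblin mentions comes from the second call: optimizers are never even invoked on nodes they cannot rewrite.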

@Sentient07 (Contributor, Author)

From what I see, LocalGroupDB does not use that info. LocalOptimizer uses tracks, but I don't think LocalGroupDB does. I'll confirm once more.

@Sentient07 (Contributor, Author)

@nouiz The new optimizer op_lifter_topo doesn't show all the local optimizers that gpu_optimizer does. Am I doing the query wrong?

@theano-bot (Contributor)

Can one of the admins verify this patch?

@Sentient07 (Contributor, Author)

We're registering theano.tensor.opt.local_remove_all_assert into gpu_optimizer separately. Don't we need to do the same for gpu_optimizer2?

@@ -22,8 +22,9 @@ class GpuCumsum(GpuKernelBase, Op):
     SUPPORTED_NDIMS = 3
     __props__ = ('axis',)

-    def __init__(self, axis):
(Member)

Why this change? Just remove this commit, or explain why it is needed.

@Sentient07 (Contributor, Author)

Oh! This should have been removed in the rebase. It existed before, but I changed this behaviour in #4570 itself.

@Sentient07 (Contributor, Author)

Yeah, this diff doesn't exist anymore; it has been removed. Earlier, I thought it was better to pass context_name.

@Sentient07 (Contributor, Author)

There are a few redundant commits: 128f257 and 3855671 (done in #4570 but under a different name), and 5bbbb31 was done in a different PR. Shall I squash them?

@nouiz nouiz added this to the 0.9 milestone Dec 1, 2016
@lamblin (Member) commented Feb 14, 2017

We will need to do the automatic cache (in Op's metaclass) first, and then go on with the cleanup.
We will not be able to do that for 0.9, so bumping to 0.10.
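One way such an automatic cache in Op's metaclass could look (a sketch under assumptions, not Theano's actual code; the class names are illustrative) is a metaclass that memoizes instances by their constructor arguments, so constructing the same Op twice yields one shared object:

```python
class CachedOpMeta(type):
    """Metaclass sketch: memoize Op instances by constructor args."""

    def __call__(cls, *args):
        # Use cls.__dict__ so each class gets its own cache rather
        # than inheriting a parent's.
        cache = cls.__dict__.get('_instance_cache')
        if cache is None:
            cache = {}
            cls._instance_cache = cache
        if args not in cache:
            cache[args] = super().__call__(*args)
        return cache[args]

class GpuCumsum(metaclass=CachedOpMeta):
    __props__ = ('axis',)

    def __init__(self, axis):
        self.axis = axis

print(GpuCumsum(0) is GpuCumsum(0))  # True: second call returns the cached instance
print(GpuCumsum(0) is GpuCumsum(1))  # False: different props, different instance
```

Caching like this makes Op identity checks cheap and keeps equal-props Ops merged by construction, which is what would let the cleanup drop hand-written Op-instance caches.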

@lamblin lamblin modified the milestones: 0.10, 0.9 Feb 14, 2017
@ReyhaneAskari ReyhaneAskari mentioned this pull request Feb 20, 2017
@Sentient07 (Contributor, Author)

Closing as continued in #5579

@Sentient07 Sentient07 closed this Feb 21, 2017