
[OP] broadcast plus, minus, mul, ... #2039

Closed
wants to merge 8 commits into from

Conversation

@mli (Member) commented May 5, 2016

Add a new op _broadcast_plus; both forward and backward are tested.

Before extending this to the other elementwise binary operators, I'd like to open the PR first, because the implementation turned out unexpectedly complicated and I may have gotten something wrong.

}

template<typename xpu>
void PlusBroadcastBackward_(const OutputGrad& out_grad,
Contributor:

Can we reuse the backward code for different operations?

mli (Member Author):

Only plus and minus can reuse the same code.
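
As a rough illustration of why only plus and minus can share a backward (a NumPy sketch of the idea, not this PR's mshadow code): both input gradients are just the output gradient sum-reduced over the broadcast axes, with a sign flip for the minus right-hand side, whereas mul and div also need the other operand.

import numpy as np

def reduce_to_shape(grad, shape):
    # Sum over the axes that were broadcast (size 1 in `shape`), assuming both
    # operands have the same number of dimensions, as in the (10, 20) + (10, 1)
    # examples in this thread.
    axes = tuple(i for i, s in enumerate(shape) if s == 1 and grad.shape[i] != 1)
    return grad.sum(axis=axes, keepdims=True)

def broadcast_plus_backward(out_grad, lhs_shape, rhs_shape):
    return reduce_to_shape(out_grad, lhs_shape), reduce_to_shape(out_grad, rhs_shape)

def broadcast_minus_backward(out_grad, lhs_shape, rhs_shape):
    return reduce_to_shape(out_grad, lhs_shape), -reduce_to_shape(out_grad, rhs_shape)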

@pluskid (Contributor) commented May 6, 2016

Very nice! I would suggest an addition to our shape inference mechanism here: we should allow the user to attach shape information to a Variable, and that information can then be used during shape inference. This is especially useful once broadcasting is supported. Sometimes I have a parameter that should be broadcast across the samples in a mini-batch; if I use broadcast plus to construct a symbol, shape inference will not be able to figure out the shape of that parameter purely from the data shape.

@mli (Member Author) commented May 6, 2016

@pluskid I didn't get your question. Currently the shape inference is done by BinaryBroadcastShape_, which determines the broadcast dimensions. One can also always use symbol.Reshape to make the broadcast behave correctly.
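
For instance, a sketch of that Reshape workaround (the exact Reshape keyword has changed across MXNet versions, so treat the signature below as an assumption):

import mxnet as mx

a = mx.sym.Variable('a')                              # data, e.g. shape (10, 20)
b = mx.sym.Variable('b')                              # parameter to be broadcast
b_col = mx.sym.Reshape(data=b, target_shape=(10, 1))  # fix the broadcast dimensions explicitly
c = mx.sym.BroadcastPlus(a, b_col)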

@mli changed the title from "[OP] let symbol plus support broadcast" to "[OP] broadcast plus, minus, mul, ..." on May 6, 2016
@pluskid (Contributor) commented May 6, 2016

@mli For example, this currently works:

import mxnet as mx

a = mx.sym.Variable('a')
b = mx.sym.Variable('b')
c = a+b
c.infer_shape(a=(10,20))

But if we replace + with broadcasting plus, will it still work? How would it infer the shape of b purely from a when broadcasting? If you need a real scenario, there is the full LSTM implementation for the acoustic model, where a broadcasting elementwise multiplication is needed.

@mli (Member Author) commented May 6, 2016

The only way that works is:

>>> d = mx.sym.BroadcastPlus(a,b)
>>> d.infer_shape(a=(10,20), b=(10,1))

So your use case is that the shape of b is not known in advance?

@pluskid (Contributor) commented May 6, 2016

@mli In my use case, b is a parameter, while a is the data (or some hidden layer computed from the data). Our model/module code is all built on the assumption that shape information comes only from provide_data and provide_label of a DataIter; everything else should be shape-inferable. This has worked very well so far, except for this case. I think allowing the user to attach shape information to a Variable sounds like a simple solution:

a = mx.sym.Variable('a')
b = mx.sym.Variable('b', shape=(1, 10))
c = mx.sym.BroadcastPlus(a,b)

c.simple_bind(a=(5, 10))

@mli (Member Author) commented May 6, 2016

Understood now. I agree that adding a shape attribute is cleaner than the alternative of passing additional shape info to the data iterator.

@piiswrong (Contributor):

I guess: don't broadcast unless you have to, and Chiyuan's example still works.

@pluskid (Contributor) commented May 6, 2016

@piiswrong My problem is that I know the expected behavior is for the parameter to be shared. For example, if I wanted to implement a fully connected layer with low-level arithmetic, I could write out = matmul(X, W) + b; here b should not be of shape N-by-d, instead it should be broadcast and have shape 1-by-d or just (d,).
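
In code the situation looks roughly like this (a sketch using the BroadcastPlus name from this PR, with mx.sym.dot standing in for matmul):

import mxnet as mx

X = mx.sym.Variable('X')    # data, shape (N, k)
W = mx.sym.Variable('W')    # weight, shape (k, d)
b = mx.sym.Variable('b')    # bias, intended to be (1, d) and broadcast over N
out = mx.sym.BroadcastPlus(mx.sym.dot(X, W), b)

# Without a shape hint on b, infer_shape cannot choose between (1, d), (N, d),
# (N, 1), or (1, 1) from the data shape alone.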

}


template<typename xpu, typename LHS_OP, typename RHS_OP>
Contributor:

If you feed both the left- and right-hand inputs to both LHS_OP and RHS_OP, the mul backward can be merged too, right? This shouldn't increase time or memory.
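
In NumPy terms (a sketch of the idea, not this PR's code), mul does fit the same template once backward sees both operands:

import numpy as np

def reduce_to_shape(grad, shape):
    # Same helper as in the earlier sketch: sum over the broadcast axes.
    axes = tuple(i for i, s in enumerate(shape) if s == 1 and grad.shape[i] != 1)
    return grad.sum(axis=axes, keepdims=True)

def broadcast_mul_backward(out_grad, lhs, rhs):
    grad_lhs = reduce_to_shape(out_grad * rhs, lhs.shape)  # d(l*r)/dl = r
    grad_rhs = reduce_to_shape(out_grad * lhs, rhs.shape)  # d(l*r)/dr = l
    return grad_lhs, grad_rhs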

@tqchen mentioned this pull request May 7, 2016
@tqchen (Member) commented May 7, 2016

Please check why the Python test fails. Note that if broadcast does not support inplace, the inplace declaration should be disabled.

.describe("lhs minus rhs with broadcast");

MXNET_REGISTER_SIMPLE_OP(_broadcast_mul, XPU)
.set_symbol_op_name("BroadcastMul")
Member:

For new operators, let us keep the names consistent, in lower case, so that mx.sym and mx.nd have exactly the same function.
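
For example, with a lowercase name the two frontends line up (the names below only illustrate the suggested convention; this PR currently registers the symbol as BroadcastMul):

import mxnet as mx
import numpy as np

a = mx.nd.array(np.ones((2, 3)))
b = mx.nd.array(np.arange(3).reshape((1, 3)))
c = mx.nd.broadcast_mul(a, b)     # NDArray frontend

x = mx.sym.Variable('x')
y = mx.sym.Variable('y')
z = mx.sym.broadcast_mul(x, y)    # Symbol frontend, same function name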

@piiswrong (Contributor):

@mli Any updates on this?

@pluskid mentioned this pull request May 10, 2016
@sxjscience (Member):

Really looking forward to some updates.

@piiswrong (Contributor):

@mli is probably busy lately. Can someone take over this?

@piiswrong (Contributor):

Closing this since we are moving to a new one.

@piiswrong closed this May 17, 2016