The expression is y = A' * B, where A and B are vectors. Computing the value of A' * B is unnecessary for obtaining dy/dA and dy/dB, yet it takes time.
This kind of overhead won't significantly affect speed in deep learning scenarios, but it adds a lot of time when the "network" is really shallow.
So is there any way to calculate the gradient without calculating the function output?
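
To make the question concrete, here is a minimal hand-written sketch in plain Python (not the library's API) of what skipping the forward value means for this expression. Since y = sum_i A_i * B_i, the gradients are dy/dA = B and dy/dB = A, so neither depends on y. The function name `dot_gradients` is hypothetical and used only for illustration.

```python
from typing import List, Tuple


def dot_gradients(a: List[float], b: List[float]) -> Tuple[List[float], List[float]]:
    """Return (dy/dA, dy/dB) for y = A' * B without evaluating y.

    Hypothetical helper for illustration only; it is not part of any library.
    """
    if len(a) != len(b):
        raise ValueError("A and B must have the same length")
    # y = sum_i A_i * B_i, so dy/dA_i = B_i and dy/dB_i = A_i.
    # The dot product itself is never computed here.
    return list(b), list(a)


grad_a, grad_b = dot_gradients([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])
```

This only works because the derivative of a dot product is known in closed form. Whether a general automatic differentiation tool can skip the forward evaluation depends on whether its backward pass needs intermediate values, which is the open question here.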