Linear3D #2508
Conversation
I added a couple of comments.
@lozhnikov Memory errors fixed! (I don't know how :D)
Looks good to me, but I have a question about the ordering.
If I remember right, we changed the ordering in
Right. I will update the description :)
force-pushed from e560028 to 196b14b
Looks good to me. I added a few style suggestions.
force-pushed from 5196160 to e511f5b
Second approval provided automatically after 24 hours. 👍
@mlpack-jenkins test this please
I checked the Linear3D tests and the Lookup tests with valgrind. It didn't show any memory errors. So, I assume these errors are in other tests/methods. If no one objects, I'll merge the PR tomorrow.
I merged the PR. Thanks for the contribution!
The current Linear layer works well for 2D input: if each data point has `n` features, the input has shape `(n, batchSize)` and the weight of the 2D Linear layer has shape `(inSize, outSize)`.
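For reference, the 2D case can be sketched in plain Python (this is only an illustration with assumed toy sizes, not mlpack code; the weight shape `(inSize, outSize)` follows the description above, so the layer computes `y = Wᵀx` columnwise):

```python
# Toy sizes (assumed, not from the PR).
inSize, outSize, batchSize = 4, 3, 2

W = [[0.1] * outSize for _ in range(inSize)]    # weight: (inSize, outSize)
x = [[1.0] * batchSize for _ in range(inSize)]  # input:  (inSize, batchSize)

# y = W^T x, applied to every column (data point) of the batch,
# giving an output of shape (outSize, batchSize).
y = [[sum(W[i][j] * x[i][b] for i in range(inSize))
      for b in range(batchSize)]
     for j in range(outSize)]

print(len(y), len(y[0]))  # 3 2
```

The point is that the parameter count, `inSize * outSize`, depends only on the number of features per data point, not on the batch size.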
But sometimes the input is 3D: each batch contains multiple data points, and each data point has `n` features. This is the case with word embeddings, where each batch contains multiple sequences/sentences and each sequence contains an embedding vector for each word. The shape of such input is `(sequenceLength, embeddingSize, batchSize)`, and the number of features is `embeddingSize`. To use the existing Linear layer, we would need to vectorize each slice so that the shape becomes `(sequenceLength * embeddingSize, batchSize)`, which treats the number of features as `sequenceLength * embeddingSize`; that is not true. Even if we did that, the number of parameters (the weight would have shape `(sequenceLength * embeddingSize, outSize)`) would be much higher than it should be. In this pull request, I have tried to solve this issue. Let me know your thoughts.
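The shape argument above can be made concrete with a small sketch. Assuming the fix works by sharing one `(embeddingSize, outSize)` weight across the sequence dimension (an assumption about the PR's approach, not a quote of its code), the per-slice transform and the parameter-count comparison look like this:

```python
# Illustration only, not mlpack's implementation. Toy sizes are assumed.
seqLen, embedSize, outSize, batchSize = 5, 8, 3, 2

# One shared weight of shape (embedSize, outSize):
# embedSize * outSize parameters, independent of seqLen.
W = [[0.1] * outSize for _ in range(embedSize)]

# Input: batchSize slices, each holding seqLen embedding vectors.
x = [[[1.0] * embedSize for _ in range(seqLen)] for _ in range(batchSize)]

# Apply the same weight to every embedding vector of every slice.
y = [[[sum(x[b][t][i] * W[i][j] for i in range(embedSize))
       for j in range(outSize)]
      for t in range(seqLen)]
     for b in range(batchSize)]

print(len(y), len(y[0]), len(y[0][0]))  # 2 5 3  (batchSize, seqLen, outSize)
print(embedSize * outSize)              # 24 shared parameters
print(seqLen * embedSize * outSize)     # 120 if the input were flattened
</antml24>

So the shared-weight view keeps the parameter count at `embeddingSize * outSize`, while the flattened 2D workaround would inflate it by a factor of `sequenceLength`.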
(I asked about this on IRC, but when I search for those messages now, I can't find them. Looks like my messages sometimes go to the Bermuda Triangle :D)