Vector Quantization Layer Prototype #52

Simsso · 2018-09-18T16:35:28Z

This PR contains a vector quantization (VQ) prototype that was developed and tested in four Jupyter notebooks.

The code is not production-ready and only there for the sake of experimenting (it is located in /experiments). The well-tested and functionally extended version can now be developed based on this prototype (#51). Merging of this PR resolves #25.

The work can be found here (please review the four files + README.md):

Layer function definition
Projection testing; whether the mapping of inputs to the closest vector in the embedding space works
Embedding space training testing; whether the gradient-based training of the embedding space vectors works
Embedding space training (2) testing; same as above except the loss is applied to all vectors, not just the closest one

Simsso · 2018-09-20T10:41:24Z

Great notebook, @FlorianPfisterer.
Here is my review on the vq_layer function definition.

Why do you need a transpose operations in this line of code?
```
scores = tf.reduce_sum(tf.square(tf.expand_dims(x, 2) - tf.transpose(embed_space)), axis=1)
```
Update: okay, got it. You can save the transpose when expanding on axis=1.

You are adding a stop_gradient over here; I forgot that.

vq_loss = tf.reduce_mean(tf.norm(tf.stop_gradient(x) - chosen_embeddings, ord='euclidean'))

This loss term is similar to the VQ-VAE paper:
```
commit_loss = BETA * tf.reduce_mean(tf.norm(x - tf.stop_gradient(chosen_embeddings), ord='euclidean'))
```
It's a very good idea to add it, even though I'm not sure whether we're going to train our conv layers. But the production VQ layer version should have it!

FlorianPfisterer · 2018-09-20T14:34:03Z

This loss term is similar to the VQ-VAE paper

Well, that's because my experiments where based on the paper 😆. We're not going to train the conv layers, but it still adds a gradient flow that might be relevant for another VQ-layer further upstream.

FlorianPfisterer

Minor things, mostly questions. Really great experiments, I especially love the plots where one can see how the embeddings move!

experiments/vq-layer/001-vq-layer-function.ipynb

FlorianPfisterer · 2018-09-20T14:39:10Z

experiments/vq-layer/001-vq-layer-function.ipynb

+    "        access_count = tf.reduce_sum(one_hot_access, axis=[0, 1], name='access_count')\n",
+    "\n",
+    "        # closest embedding update loss (alpha-loss)\n",
+    "        nearest_loss = tf.reduce_mean(alpha * tf.norm(y - x, lookup_ord, axis=2), axis=[0, 1])\n",


As you noted yourself, add the tf.stop_gradient() here for y.

Rather for x, I'd say. Can you triple-check, please?

If by alpha-loss you're referring to the second term in the following loss function (from the paper), then yes:

experiments/vq-layer/001-vq-layer-function.ipynb

experiments/vq-layer/002-projection-evaluation.ipynb

experiments/vq-layer/003-embedding-space-training.ipynb

experiments/vq-layer/004-embedding-space-training-beta.ipynb

Simsso added 17 commits September 12, 2018 15:16

Draft initial vq layer function

cc17e23

Fix distance computation error

212fb7e

Evaluate projection functionality

2a9badb

Document experiments

c4a5fba

Evaluate embedding space training

68bec93

Make norm order parameterizable

e2f217e

Evaluate beta-loss

5383c17

Convert losses to scalars

9f1f5f4

Enumerate file names

0505278

Count vector hits

e0199c8

Move layer into project

f3526f3

Test projection

d3695f0

Test input validation

3bf9fc6

Test quantization split

c3c86fb

Test different norm orders

9e28914

Remove sub-tests

621ba0e

Remove production-ready vq-layer code

6438188

Simsso added code Software is relevant or involved research Scientific items labels Sep 18, 2018

Simsso added this to the 12. Working Group Meeting milestone Sep 18, 2018

Simsso self-assigned this Sep 18, 2018

Simsso changed the title ~~VQ Layer Prototype~~ Vector Quantization Layer Prototype Sep 18, 2018

Simsso requested review from a user and FlorianPfisterer September 19, 2018 16:09

Simsso mentioned this pull request Sep 19, 2018

Vector Quantization Layer #51

Closed

7 tasks

FlorianPfisterer added 2 commits September 20, 2018 11:27

Add basic-vq-layer-test notebook (experiments)

1e3b2c0

Clean up logging output (limit to every 1000 epochs)

1fa0a0d

FlorianPfisterer reviewed Sep 20, 2018

View reviewed changes

FlorianPfisterer and others added 2 commits September 20, 2018 16:52

Update README.md for vq-experiments regarding new notebook

326c931

Incorporate change requests

eb392f4

Simsso added the work-item Tasks label Sep 21, 2018

FlorianPfisterer approved these changes Sep 21, 2018

View reviewed changes

Simsso merged commit 0d1bbb2 into master Sep 21, 2018

Simsso deleted the vq-prototype branch September 21, 2018 08:49

Simsso removed the work-item Tasks label Sep 22, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector Quantization Layer Prototype #52

Vector Quantization Layer Prototype #52

Simsso commented Sep 18, 2018

Simsso commented Sep 20, 2018

FlorianPfisterer commented Sep 20, 2018 •

edited

Loading

FlorianPfisterer left a comment

FlorianPfisterer Sep 20, 2018

Simsso Sep 20, 2018

FlorianPfisterer Sep 21, 2018

Vector Quantization Layer Prototype #52

Vector Quantization Layer Prototype #52

Conversation

Simsso commented Sep 18, 2018

Simsso commented Sep 20, 2018

FlorianPfisterer commented Sep 20, 2018 • edited Loading

FlorianPfisterer left a comment

Choose a reason for hiding this comment

FlorianPfisterer Sep 20, 2018

Choose a reason for hiding this comment

Simsso Sep 20, 2018

Choose a reason for hiding this comment

FlorianPfisterer Sep 21, 2018

Choose a reason for hiding this comment

FlorianPfisterer commented Sep 20, 2018 •

edited

Loading