RQ-VAE: How can I get a list of all learned codebook vectors (as indexed in the "indices")? #28

christophschuhmann · 2022-10-03T13:38:48Z

Hi Lucid,
i am working on quantizing CLIP image embeddings with your RQ-VAE. It works pretty well.

Next I want to take all learned codebook vectors and add them to the vocab of a GPT (as frozen token embeddings).

The idea is to train a GPT with CLIP image embeddings in between texts, e.g. IMAGE-CAPTION or TEXT-IMAGE-TEXT-IMAGE- ... Flamingo-style).

If this works, then GPT could maybe also learn to generate quantized CLIP IM embeddings token by token --> and then e.g. show images through a.) retrieval or b.) a DALLE 2 decoder :)

... So my question is: Once the RQ-VAE is trained and i can get the quantized reconstructions and indices - How can I get a list or tensor of the actual codebook? (all possible vectors from the rq-vocab) :)

kradonneoh · 2022-10-07T04:42:06Z

+1 I can reverse engineer the forward function, but it'd be nice if there was an easy function call I'm missing

Edit: ended up reverse engineering it anyways :-) You can do codes from indices like:
quantizer.layers[i]._codebook.embed[0, tokens_ids[:, i]] for each layer i in the residual vector quantizer. As a bonus, you can reconstruct the input (image / audio / etc.) by doing:

decoded_vector = 0.0
for i, layer in enumerate(quantizer.layers):
    vector = vector + layer._codebook.embed[0, tokens[:, i]]

lucidrains · 2022-10-26T17:14:48Z

@christophschuhmann @kradonneoh oh hey! nice to hear that the library is working well for your use case

I've added the feature to return all the codes across quantization layers here ec24746

lucidrains closed this as completed Oct 28, 2022

liangcl0928 mentioned this issue Jun 13, 2023

Crash on Mac M1/M2 chip when using MPS support #55

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RQ-VAE: How can I get a list of all learned codebook vectors (as indexed in the "indices")? #28

RQ-VAE: How can I get a list of all learned codebook vectors (as indexed in the "indices")? #28

christophschuhmann commented Oct 3, 2022

kradonneoh commented Oct 7, 2022 •

edited

lucidrains commented Oct 26, 2022

RQ-VAE: How can I get a list of all learned codebook vectors (as indexed in the "indices")? #28

RQ-VAE: How can I get a list of all learned codebook vectors (as indexed in the "indices")? #28

Comments

christophschuhmann commented Oct 3, 2022

kradonneoh commented Oct 7, 2022 • edited

lucidrains commented Oct 26, 2022

kradonneoh commented Oct 7, 2022 •

edited