Access/train to use the embeddings #24

jn2clark · 2023-01-28T02:55:20Z

Hi @BlinkDL ! Really interested in your work here. I am looking to test out some of the models for embedding based tasks. What is the best way to access the embeddings? I would be looking to use these for training as well (i.e. contrastive loss using siamese training setup). Any information on this would be greatly appreciated.

BlinkDL · 2023-01-28T10:34:09Z

Thanks :) from the README here:
Read the inference code in src/model.py and try using the final hidden state（.xx .aa .bb) as a faithful sentence embedding for other tasks. Probably you shall begin with .xx and .aa/.bb (.aa divided by .bb).

tiendung · 2023-03-29T18:38:48Z

Can you explaim future? still don't get what they mean and which value should be use. xx or aa / bb?

BlinkDL · 2023-04-05T11:36:09Z

@tiendung the hidden state has 5 tensors per block (att+ffn): xx aa bb pp xx

ricardopinto · 2023-04-11T13:46:49Z

@BlinkDL how would the implementation look like for a def embed(text: str) -> List[float]: method in RWKV class? Such a method would be very useful. I asked gpt-4 about it and this is what it wrote:

class RWKV(pl.LightningModule):
    # (...)
    def embed(self, text: str) -> List[float]:
        args = self.args
        input_ids = args.tokenizer.encode(text)
        input_ids = torch.tensor(input_ids).unsqueeze(0).cuda()

        with torch.no_grad():
            x = self.emb(input_ids)
            x_emb = x

            if args.tiny_att_dim > 0:
                for block in self.blocks:
                    x = block(x, x_emb)
            else:
                for block in self.blocks:
                    x = block(x)

            x = self.ln_out(x)
        
        x = x[:, -1, :].detach().cpu()
        return x.squeeze().tolist()

sgaseretto · 2023-05-09T20:57:58Z

@ricardopinto any progress with this? I'm also interested in working with embeddings generated with RWKV but still don't have a clear understanding of how we can make it work to get embeddings like any of the models from sentence-transformers

KnutJaegersberg · 2023-07-04T05:21:49Z

I'm now doing this in HF transformers. 430m seems faithful on writing style, not content.

KnutJaegersberg · 2023-07-04T05:25:17Z

There is a function in gptcache that does this, too. I'm using that code in my HF transformers code, it's just a few lines.

https://gptcache.readthedocs.io/en/latest/_modules/gptcache/embedding/rwkv.html?highlight=rwkv#

…t-exp-A-Half-Half add half half

BlinkDL closed this as completed Feb 17, 2023

BlinkDL mentioned this issue Apr 5, 2023

4-bit quantization to reduce VRam requirement #70

Open

harrisonvanderbyl pushed a commit to harrisonvanderbyl/RWKV-LM that referenced this issue Jul 20, 2023

Merge pull request BlinkDL#24 from harrisonvanderbyl/rwkv5x-tokenshif…

47475f8

…t-exp-A-Half-Half add half half

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Access/train to use the embeddings #24

Access/train to use the embeddings #24

jn2clark commented Jan 28, 2023

BlinkDL commented Jan 28, 2023

tiendung commented Mar 29, 2023

BlinkDL commented Apr 5, 2023

ricardopinto commented Apr 11, 2023

sgaseretto commented May 9, 2023

KnutJaegersberg commented Jul 4, 2023

KnutJaegersberg commented Jul 4, 2023

Access/train to use the embeddings #24

Access/train to use the embeddings #24

Comments

jn2clark commented Jan 28, 2023

BlinkDL commented Jan 28, 2023

tiendung commented Mar 29, 2023

BlinkDL commented Apr 5, 2023

ricardopinto commented Apr 11, 2023

sgaseretto commented May 9, 2023

KnutJaegersberg commented Jul 4, 2023

KnutJaegersberg commented Jul 4, 2023