
Question about using Transformer-XL to get text embeddings #6

Closed
eddy60506 opened this issue May 22, 2022 · 2 comments
@eddy60506

Dear Author:

Thank you very much for the sample code of RGTN. I am trying to use the Hugging Face Transformer-XL model to get semantic embeddings of the node text.
Here is the official example from the Hugging Face Transformer-XL docs:

```python
from transformers import TransfoXLTokenizer, TransfoXLModel
import torch

tokenizer = TransfoXLTokenizer.from_pretrained("transfo-xl-wt103")
model = TransfoXLModel.from_pretrained("transfo-xl-wt103")

inputs = tokenizer("Hello, my dog is cute", return_tensors="pt")
outputs = model(**inputs)

last_hidden_states = outputs.last_hidden_state
```

But `last_hidden_states` contains per-token (word) embeddings, not an embedding of the whole text. Could you please tell me how to get the text embedding, or release that part of the sample code?
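For reference, one common way to collapse per-token embeddings into a single text embedding is mean pooling over the sequence axis. A minimal sketch with a dummy array standing in for `last_hidden_states` (NumPy used for illustration; shapes are hypothetical):

```python
import numpy as np

def mean_pool(last_hidden_state: np.ndarray) -> np.ndarray:
    """Average over the sequence axis: (batch, seq, dim) -> (batch, dim)."""
    return last_hidden_state.mean(axis=1)

# Dummy tensor standing in for the model output above:
# 1 sentence, 7 tokens, 1024-dim hidden states.
hidden = np.random.rand(1, 7, 1024)
text_embedding = mean_pool(hidden)
print(text_embedding.shape)  # (1, 1024)
```

This is only one pooling choice; the maintainer's answer below uses an attention-weighted sum instead.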

@GRAPH-0
Owner

GRAPH-0 commented May 23, 2022

Here is some example code:

```python
import torch
from tqdm import tqdm
from transformers import XLNetTokenizer, XLNetModel

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

tokenizer = XLNetTokenizer.from_pretrained('xlnet-base-cased')
model = XLNetModel.from_pretrained('xlnet-base-cased',
                                   output_hidden_states=True,
                                   output_attentions=True).to(device)

lang_features = dict()
with open(args.input_file) as fin:
    # read your text file as fin; each line: node_id + '\t' + text
    lines = fin.readlines()
    for line in tqdm(lines[1:]):  # skip the header line
        l = line.split('\t')
        node_id = l[0]
        input_ids = torch.tensor([tokenizer.encode(l[1])]).to(device)
        input_ids = input_ids[:, :args.max_word]  # truncate long texts
        # with output_hidden_states / output_attentions enabled, the last
        # two outputs are all hidden states and all attention maps
        all_hidden_states, all_attentions = model(input_ids)[-2:]
        # weight each token's second-to-last-layer hidden state by its mean
        # received attention, then sum over tokens -> one vector per node
        rep = (all_hidden_states[-2][0] * all_attentions[-2][0].mean(dim=0).mean(dim=0).view(-1, 1)).sum(dim=0)
        lang_features[node_id] = rep.detach().cpu().numpy()
```
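The pooling step in that snippet can be isolated as follows: each token's hidden state is weighted by its average received attention (averaged over heads and query positions), then summed over tokens to produce one vector per text. A self-contained NumPy sketch with made-up shapes (the function name and dimensions are illustrative, not from the repository):

```python
import numpy as np

def attention_pool(hidden: np.ndarray, attn: np.ndarray) -> np.ndarray:
    """hidden: (seq, dim); attn: (heads, seq, seq) -> pooled vector (dim,)."""
    # Average attention over heads, then over query positions,
    # giving one scalar weight per token: shape (seq,).
    weights = attn.mean(axis=0).mean(axis=0)
    # Weighted sum of token hidden states over the sequence axis.
    return (hidden * weights[:, None]).sum(axis=0)

seq, dim, heads = 5, 8, 4
hidden = np.random.rand(seq, dim)        # stands in for all_hidden_states[-2][0]
attn = np.random.rand(heads, seq, seq)   # stands in for all_attentions[-2][0]
rep = attention_pool(hidden, attn)
print(rep.shape)  # (8,)
```

Note the weights are not re-normalized after averaging, matching the original snippet; with softmax attention each row sums to 1, so the averaged weights sum to roughly 1 as well.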

@eddy60506
Author

Thanks! I will try this code.

@GRAPH-0 GRAPH-0 closed this as completed Jun 23, 2022
@GRAPH-0 GRAPH-0 mentioned this issue Jun 4, 2023