
query node #11

Closed
mengdawn025 opened this issue Mar 16, 2023 · 20 comments

Comments

@mengdawn025

mengdawn025 commented Mar 16, 2023

Hello author, do you still have an archive of the query node code? I am particularly interested in this part and would like to study it.

@Jasaxion

Excuse me, @imelnyk, sorry to disturb you. I am also interested in the code for the query node section. Could you please add it to the repository or send it to me? I really wonder how it is implemented and am very interested in it. Thank you a lot.

@imelnyk
Contributor

imelnyk commented Mar 23, 2023

Hi, here is a rough sketch of how to implement this (a code sketch of the first two steps follows the list):

  1. Create a trainable query_embed = nn.Embedding(max_nodes, hidden_dim).

  2. When calling the T5 transformer (output = self.transformer(input_ids=text, ...)), remove the decoder-related inputs and instead pass decoder_inputs_embeds=query_embed.

  3. The output node features coming from the transformer should be processed by a GRUDecoder (similar to what is done for EdgesGen, class EdgesGen(nn.Module)).

  4. Finally, as is currently done, the node features should also be passed on for edge generation.
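
For reference, a minimal, hedged sketch of steps 1-2 (class and variable names are illustrative, not the repo's exact code; it assumes a HuggingFace T5Model, whereas with T5ForConditionalGeneration you would pass output_hidden_states=True and read decoder_hidden_states[-1] instead):

import torch
import torch.nn as nn
from transformers import T5Model

class QueryNodeEncoder(nn.Module):
    def __init__(self, model_name='t5-base', max_nodes=8, hidden_dim=768):
        super().__init__()
        self.transformer = T5Model.from_pretrained(model_name)
        # Step 1: one learnable query vector per node slot.
        self.query_embed = nn.Embedding(max_nodes, hidden_dim)

    def forward(self, text, text_mask):
        # Broadcast the queries to (batch_size, max_nodes, hidden_dim).
        queries = self.query_embed.weight.unsqueeze(0).expand(text.size(0), -1, -1)
        # Step 2: no decoder_input_ids / labels, only decoder_inputs_embeds.
        output = self.transformer(input_ids=text,
                                  attention_mask=text_mask,
                                  decoder_inputs_embeds=queries)
        # One feature vector per query node, to be fed to the node decoder
        # (step 3) and to edge generation (step 4).
        return output.last_hidden_state  # (batch_size, max_nodes, hidden_dim)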

Note: Optionally, you can also implement a Matcher class which estimates a permutation matrix to match logits_nodes with target_nodes (since in general their order may not match). The main idea is to create an all_pairwise_distance matrix whose [i,j] element is cross_entropy(logits_nodes[i], target_nodes[j]). Then use the linear_sum_assignment function to find the best match and construct a permutation matrix. Finally, apply this permutation matrix to logits_nodes.
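
A hedged sketch of that Matcher idea for a single example (illustrative only, not the repo's code; linear_sum_assignment comes from scipy.optimize):

import torch
import torch.nn.functional as F
from scipy.optimize import linear_sum_assignment

def match_nodes(logits_nodes, target_nodes):
    # logits_nodes: (num_nodes, seq_len, vocab_size) token logits per predicted node
    # target_nodes: (num_nodes, seq_len) token ids per target node
    num_nodes = logits_nodes.size(0)
    with torch.no_grad():
        # all_pairwise_distance[i, j] = cross_entropy(logits_nodes[i], target_nodes[j])
        cost = torch.zeros(num_nodes, num_nodes)
        for i in range(num_nodes):
            for j in range(num_nodes):
                cost[i, j] = F.cross_entropy(logits_nodes[i], target_nodes[j]).item()
        # Hungarian matching: prediction row_ind[k] is assigned to target col_ind[k].
        row_ind, col_ind = linear_sum_assignment(cost.numpy())
        # Permutation matrix with perm[target, prediction] = 1.
        perm = torch.zeros(num_nodes, num_nodes,
                           device=logits_nodes.device, dtype=logits_nodes.dtype)
        perm[torch.from_numpy(col_ind), torch.from_numpy(row_ind)] = 1.0
    # Reorder predictions so that row k lines up with target_nodes[k];
    # gradients still flow into logits_nodes.
    return torch.einsum('tp,psv->tsv', perm, logits_nodes)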

@Jasaxion

Thank you very much, I will give it a try.

@Jasaxion

@imelnyk I am so sorry to disturb you. I added query_embed in litgrapher's def __init__:

self.query_embed = nn.Embedding(self.max_nodes, self.model.hidden_dim)  # query_node

and then passed it to the grapher:

logits_nodes, logits_edges = self.model(text_input_ids,
                                                text_input_attn_mask,
                                                target_nodes,
                                                target_nodes_mask,
                                                self.query_embed.weight,
                                                target_edges)

but it seems to produce failed inference (e.g. failed_edge --> failed_node) after training, because the dimension of query_node is (max_nodes x hidden_dim), while T5's decoder_inputs_embeds expects (batch_size x sequence_length x hidden_dim).
I tried to unsqueeze query_node to (batch_size x max_nodes x hidden_dim), but it still failed after a few epochs of training.
So I wonder where in the code I should add query_node.
Thank you for your answer!

@imelnyk
Contributor

imelnyk commented Apr 10, 2023

Hi,
It is hard to say what went wrong, but here are some ideas: make sure that the transformer only gets input_ids, attention_mask, and decoder_inputs_embeds (this is your query_embed). Once you get the features out of the transformer, pass them through an MLP (similar to the edges), and then apply a CE loss.
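
If it helps, a hedged sketch of that recipe (the names, MLP shape, and label format are illustrative, not the repo's exact code):

import torch
import torch.nn as nn
import torch.nn.functional as F

class NodeHead(nn.Module):
    # Illustrative MLP head over the per-node features from the decoder.
    def __init__(self, hidden_dim, num_node_classes):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_node_classes),
        )

    def forward(self, node_features):
        # node_features: (batch, max_nodes, hidden_dim), e.g. the decoder's last hidden state
        return self.mlp(node_features)  # (batch, max_nodes, num_node_classes)

# Illustrative usage: CE loss against integer node labels of shape (batch, max_nodes).
# logits_nodes = node_head(node_features)
# loss_nodes = F.cross_entropy(logits_nodes.transpose(1, 2), target_nodes)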

@mengdawn025
Author

@imelnyk I am sorry to bother you again.
First, I wonder if the N (number of nodes) d-dimensional node features mentioned in the paper are the state of the last layer of the decoder.

joint_features = output.decoder_hidden_states[-1]

Second, are the node features decoded into node logits the same as the node features used to generate edges?

Third, should the sample function in grapher.py

 def sample(self, text, text_mask):

be modified after the query_embed is added?

@imelnyk
Contributor

imelnyk commented Apr 11, 2023

  1. Yes, you should get the last hidden state
    joint_features = transformer(input_ids=text, attention_mask=text_mask, decoder_inputs_embeds=query_embed).last_hidden_state

  2. No, node features and edge features are different. Otherwise, how can you convert the same features into different objects (nodes and edges)?

  3. Yes, it needs to be modified, since you need to pass the learned query_embed to your transformer during inference.

@mengdawn025
Author

@imelnyk
OK. Thank you for your answer. Two other questions I have are:

  1. Are the node features decoded into node logits the state of the last layer of the decoder?
     joint_features = transformer(input_ids=text, attention_mask=text_mask, decoder_inputs_embeds=query_embed).last_hidden_state
  2. How can I obtain the node features used to generate edges?

@imelnyk
Contributor

imelnyk commented Apr 12, 2023

  1. Yes, the last hidden state from the decoder is used to get the node features, which are then used to get the node logits.
  2. The edge pipeline can remain the same as is currently done in the code - each pair of node features is combined and passed through an MLP to get edge logits.
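
For what it's worth, a hedged sketch of that pairwise combination (illustrative names; the repo's EdgesGen may differ in detail):

import torch
import torch.nn as nn

class PairwiseEdgeHead(nn.Module):
    # Illustrative sketch: every ordered pair of node features is concatenated
    # and classified into an edge type (or "no edge").
    def __init__(self, hidden_dim, num_edge_classes):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_edge_classes),
        )

    def forward(self, node_features):
        # node_features: (batch, max_nodes, hidden_dim)
        batch, n, d = node_features.shape
        head = node_features.unsqueeze(2).expand(batch, n, n, d)  # source node i
        tail = node_features.unsqueeze(1).expand(batch, n, n, d)  # target node j
        pairs = torch.cat([head, tail], dim=-1)                   # (batch, n, n, 2*d)
        return self.mlp(pairs)                                    # (batch, n, n, num_edge_classes)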

@mengdawn025
Author

@imelnyk
OK. Thank you for your answer. One other question I have is:
I need to modify the sample function in grapher.py after the query_embed is added. Should the query_embed be added to the self.transformer.generate call in the sample function?

 def sample(self, text, text_mask):
        output = self.transformer.generate(input_ids=text,
                                           max_length=150,
                                           attention_mask=text_mask,
                                           output_hidden_states=True,
                                           output_scores=True,
                                           return_dict_in_generate=True)

If yes, which parameter should the query_embed be passed to? decoder_inputs_embeds? But the generate function does not seem to accept a decoder_inputs_embeds parameter.

@imelnyk
Contributor

imelnyk commented Apr 14, 2023

Yes, this part is a bit tricky. generate is not applicable here. One option would be to use the same setup as in training, with the learned query_embed.
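
A hedged sketch of what that could look like (illustrative only; it simply reuses the training-style forward pass and decodes greedily):

import torch

@torch.no_grad()
def sample_nodes(transformer, query_embed, node_head, text, text_mask):
    # No generate(): run the same forward pass as in training, with the
    # learned query embeddings as decoder inputs.
    queries = query_embed.weight.unsqueeze(0).expand(text.size(0), -1, -1)
    output = transformer(input_ids=text,
                         attention_mask=text_mask,
                         decoder_inputs_embeds=queries)
    node_features = output.last_hidden_state  # (batch, max_nodes, hidden_dim)
    logits_nodes = node_head(node_features)
    # Greedy decoding: most likely class per node slot.
    return logits_nodes.argmax(dim=-1), node_features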

@mengdawn025
Author

OK. Thank you very much, I will give it a try.

@Jasaxion

Hi @imelnyk, I apologize for reaching out again. In section 2.2 "Node Generation: Query Nodes" of the paper, it is mentioned that the node features are encoded as Fn ∈ Rd×N. May I kindly ask how, in detail, to pass this to the GRUDecoder to generate the node logits (seq_len x vocab_size x num_nodes)? I have attempted numerous ways to modify it, but the issue still persists. If you still have the code archive, I would be grateful if you could share it with me. I am very interested in your implementation of this part. Thank you for tirelessly teaching me how to make these modifications.

@mengdawn025
Author

@imelnyk I am sorry to bother you again. One thing I would like to figure out:
It is mentioned in the paper that the query node part does not perform well, but I want to know what kind of poor performance it refers to:

  1. Normal triples can be generated, such as
     Aarhus | leader | Jacob Bundsgaard
     but the accuracy is very low;

  2. No normal triples can be generated at all, such as
     <extra_id_0> | failed edge | failed node

  3. Something else.
     If it is the third one, I hope you can explain it.

@imelnyk
Contributor

imelnyk commented Apr 18, 2023

Yes, query node training is not easy; you have to train longer and play with learning rates, gradient clipping, etc. For us the performance was not great, but the model was still able to generate legible nodes and edges. It looks like in your case it might be a training problem or even an issue with the implementation.
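
For example (a hedged illustration only: it assumes the project's PyTorch Lightning setup, and the values are arbitrary starting points, not the authors' settings):

from pytorch_lightning import Trainer

# Longer training plus gradient clipping via the Lightning Trainer.
trainer = Trainer(max_epochs=200, gradient_clip_val=1.0)
# trainer.fit(lit_grapher, datamodule)  # lit_grapher / datamodule are the project's objects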

@mengdawn025
Author

Okay. Thank you very much.

@mengdawn025
Author

Hi, @imelnyk I am sorry to bother you again.
How can I view the evaluation metrics, such as precision, recall, and F1 scores, after the model is trained? Is it with the command tensorboard --logdir output? I didn't see the evaluation metrics after running that command.

@imelnyk
Contributor

imelnyk commented May 15, 2023

Yes, as the model trains, it evaluates itself and saves the results. You can see it here:

self.logger.experiment.add_scalar(f'{split}_score/{k}', v, global_step=iteration)

@mengdawn025
Author

Okay. Thank you very much.

@mengdawn025
Author

Hello @imelnyk, I am sorry to bother you again. So far we are still a little puzzled about the function of the query nodes. Could you please explain it?
