
questions about paper "A Structural Analysis of Pre-Trained Language Models for Source Code" #11

Open
skye95git opened this issue May 27, 2022 · 8 comments


skye95git commented May 27, 2022

> The high variability would suggest a content-dependent head, while low variability would indicate a content-independent head.
>
> Figure 7: Visualization of attention heads in CodeBERT, along with the value of attention analysis (p_α(f)) and attention variability, given a Python code snippet.

What do "high" and "low" mean here, and how is attention variability defined?

  1. What are the inputs and outputs of the models in Syntax Tree Induction?
  2. Why is this head content-dependent?

[attached screenshot]

@skye95git skye95git changed the title questions about paper "What Do They Capture?" questions about paper "A Structural Analysis of Pre-Trained Language Models for Source Code" May 27, 2022

timetub commented May 27, 2022

Thanks for raising these questions!
First, the variability is the attention variability; we consider a high value to indicate a content-dependent head and a low value a content-independent head.
Second, our input is the pruned AST and the code snippet, i.e., with symbols removed (for example: https://drive.google.com/file/d/1FMgABZMACAv8OjU7wcMliMqc9m3_APQ1/view?usp=sharing).
Third, that head has high variability, and we consider that its attention distribution does not depend on token position at this head.
Hope this helps!
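To make the content-dependent vs. content-independent distinction concrete: one common definition of attention variability (following Vig & Belinkov's attention-analysis work; whether this matches the paper's Formula 5 exactly should be checked against the paper) is the mean absolute deviation of a head's attention maps from their average map across inputs. A minimal sketch under that assumption:

```python
import numpy as np

def attention_variability(attn_maps):
    """Variability of one attention head across inputs.

    attn_maps: list of (L, L) attention matrices (rows sum to 1) produced
    by the same head on different inputs, padded/truncated to length L.
    Returns 0 when the head attends identically regardless of content
    (content-independent, e.g. always to the previous token); higher
    values mean the pattern changes with the input (content-dependent).
    """
    A = np.stack(attn_maps)            # (n_inputs, L, L)
    mean_map = A.mean(axis=0)          # average pattern over inputs
    # mean absolute deviation from the average pattern, normalized so the
    # score is comparable across heads and sequence lengths
    return np.abs(A - mean_map).sum() / (2.0 * A.sum())
```

For example, a head that always produces the same positional pattern scores 0, while a head whose map varies from input to input scores above 0.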


skye95git commented May 27, 2022

> Thanks to propose these questions; First, the variability is attention variability, and we think the high value is content-dependent and the low is content-independent. Second, Our input is the pruned AST and code snippet, i.e. no symbols(for example: https://drive.google.com/file/d/1FMgABZMACAv8OjU7wcMliMqc9m3_APQ1/view?usp=sharing) Third, it is high variability, and we consider that the attention distribution does not depend on the location at this head. Hope to help you!

Thanks for your reply!

  1. So, the attention variability is calculated according to Formula 5, right? If the calculated value is high, the head is considered content-dependent; otherwise, it is considered content-independent.
  2. Does this also reflect that different heads attend to different information?
    [attached screenshot]

@skye95git

If the input is the pruned AST and the code snippet, what are the outputs of the models in Syntax Tree Induction?
Is the output whether there is an edge between two nodes?


timetub commented May 27, 2022

Actually, the pruned AST structure is the gold standard; we use the method described in our paper to induce a binary tree, and we compute the similarity between the two trees.
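For readers wondering how "similarity between the two trees" can be computed: a common choice in tree-induction work is bracket (span) F1 between the induced tree and the gold tree. Whether this is exactly the paper's metric is an assumption; a sketch over trees represented as nested lists of token strings:

```python
def spans(tree):
    """Collect (start, end) spans of a nested-list tree over token indices."""
    result = set()
    def walk(node, start):
        if isinstance(node, str):       # leaf: a single token
            return start + 1
        end = start
        for child in node:
            end = walk(child, end)
        if end - start > 1:             # ignore trivial single-token spans
            result.add((start, end))
        return end
    walk(tree, 0)
    return result

def span_f1(pred_tree, gold_tree):
    """Bracket F1: harmonic mean of span precision and recall."""
    p, g = spans(pred_tree), spans(gold_tree)
    if not p or not g:
        return 0.0
    prec = len(p & g) / len(p)
    rec = len(p & g) / len(g)
    return 0.0 if prec + rec == 0 else 2 * prec * rec / (prec + rec)
```

An identical pair of trees scores 1.0; trees that share only some constituent spans score proportionally lower.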

@skye95git

> Actually, the purned AST strcuture is the gold standard, and we use the method(in our paper) to induce a binary tree,and compute the similarity between the two trees.

Thanks for your reply! I didn't understand what "induce" means. Does inducing a binary tree mean generating an AST from scratch, or only predicting edges?


timetub commented May 30, 2022

> Actually, the purned AST strcuture is the gold standard, and we use the method(in our paper) to induce a binary tree,and compute the similarity between the two trees.
>
> Thanks for your reply! I didn't understand what induce meant. Induce a binary tree mean generate an AST from zero, or predict edges only?

Yes, inducing a binary tree means generating a tree from scratch.
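To illustrate what "generating a tree from scratch" can look like, here is a minimal sketch of Shen-et-al-style top-down induction: recursively split the token sequence at the position with the largest syntactic distance (which, in this line of work, would be derived from the model's attention or representations; the `distances` values below are hypothetical). This is an illustration of the general technique, not necessarily the paper's exact procedure.

```python
def induce_tree(tokens, distances):
    """Greedy top-down binary-tree induction.

    tokens: list of token strings.
    distances: distances[i] is the syntactic distance between tokens[i]
    and tokens[i+1], so len(distances) == len(tokens) - 1.
    Returns a binary tree as nested lists with token strings at leaves.
    """
    if len(tokens) <= 1:
        return tokens[0]
    # split where adjacent tokens are syntactically farthest apart
    k = max(range(len(distances)), key=lambda i: distances[i])
    left = induce_tree(tokens[:k + 1], distances[:k])
    right = induce_tree(tokens[k + 1:], distances[k + 1:])
    return [left, right]
```

For example, with hypothetical distances that are largest after `def`, the function first splits off `def` and then recurses on the remainder, producing a fully binary tree over the tokens.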

@skye95git

Thanks. There are two final questions:

  1. So, the attention variability is calculated according to Formula 5, right? If the calculated value is high, the head is considered content-dependent; otherwise, it is considered content-independent.
  2. Does this also reflect that different heads attend to different information?
    [attached screenshot]


timetub commented Jun 15, 2022

Yes, your understanding is right.
