What is the difference between these three models? #110

wuyifan18 · 2021-09-17T03:55:49Z

code2seq
typed-code2seq
code2class

SpirinEgor · 2021-09-18T12:13:20Z

Hi!

Code2seq is a vanilla model that used LSTM to embed paths into vectors and then uses another LSTM to generate output sequence (e.g. method name)
Code2class uses the same encoding method, but as a decoder, it has MLP to the number of classes. It is useful for classification tasks or when you need to build embedding of the code.
Typed-code2seq is extended code2seq model. We describe it in paper about PSIMiner

wuyifan18 · 2021-09-18T12:24:47Z

Thank you!

Avv22 · 2021-11-29T05:58:04Z

@SpirinEgor. Thank you very much. Do you have documentation please of Code2class or published paper so that we can read more and cite it?

SpirinEgor · 2021-11-29T10:20:33Z

Currently, we don't have a paper that uses the code2class model. Hope to have one soon :)
To better understand how it works, I may suggest you study the difference between vanilla code2seq and code2vec models. Code2Class indeed uses code2seq encoder (path embedding algorithm) and code2vec decoder (path aggregation and processing output vector).

wuyifan18 closed this as completed Sep 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What is the difference between these three models? #110

What is the difference between these three models? #110

wuyifan18 commented Sep 17, 2021

SpirinEgor commented Sep 18, 2021

wuyifan18 commented Sep 18, 2021

Avv22 commented Nov 29, 2021

SpirinEgor commented Nov 29, 2021

What is the difference between these three models? #110

What is the difference between these three models? #110

Comments

wuyifan18 commented Sep 17, 2021

SpirinEgor commented Sep 18, 2021

wuyifan18 commented Sep 18, 2021

Avv22 commented Nov 29, 2021

SpirinEgor commented Nov 29, 2021