Thanks for your work.
I tried to use CodeBERT, GraphCodeBERT and UniXcoder to extract Java code embeddings.
However, for inputs to the models, I only used the Java source code, something like [CLS][JavaCode][SEP].
Should I also add comments to the inputs?
For GraphCodeBERT and UniXcoder, should I also add the dataflow and the flattened AST as input? I care about the execution time of the approach, so would adding that information (comments, dataflow and AST) make getting the embeddings much slower?
I would appreciate your kind suggestions,
Thanks.
You don't need to add dataflow or the flattened AST as input; the original code is enough. If you want to extract code embeddings, I suggest using UniXcoder, which performed better in my tests on most datasets.
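For reference, here is a minimal sketch of code-only embedding extraction with the Hugging Face `transformers` library. It is an illustration under assumptions, not the method used in the repository: the helper names `build_input_tokens` and `embed_code` are hypothetical, the default checkpoint `microsoft/codebert-base` is just one choice, and taking the first-token vector as the embedding is one common pooling option (mean pooling over tokens is another). Other models, UniXcoder included, may call for the loading code given in their own repositories.

```python
from typing import Callable, List


def build_input_tokens(
    code: str,
    tokenize: Callable[[str], List[str]],
    max_length: int = 512,
    cls_token: str = "<s>",
    sep_token: str = "</s>",
) -> List[str]:
    """Lay out the input as [CLS] code tokens [SEP], truncated to max_length.

    Hypothetical helper for illustration only. RoBERTa-style tokenizers
    spell [CLS]/[SEP] as <s>/</s> and add them automatically.
    """
    tokens = tokenize(code)[: max_length - 2]  # leave room for the two special tokens
    return [cls_token] + tokens + [sep_token]


def embed_code(code: str, model_name: str = "microsoft/codebert-base"):
    """Return one embedding vector for a code snippet (source code only).

    Assumptions: the checkpoint loads with AutoModel, and the first-token
    ([CLS]) hidden state is used as the embedding.
    """
    # Imported lazily so the pure-Python helper above works without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    # No comments, dataflow, or AST are added: the tokenizer sees only the code.
    inputs = tokenizer(code, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        outputs = model(**inputs)
    return outputs.last_hidden_state[:, 0, :]  # shape: (1, hidden_size)
```

Calling `embed_code("int add(int a, int b) { return a + b; }")` downloads the checkpoint on first use. Because the input contains only source tokens, there is no dataflow or AST extraction step to add to the preprocessing time.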