CC2Vec: Combining Typed Tokens with Contrastive Learning for Effective Code Clone Detection
This is the open-source code repository for under-review paper "CC2Vec: Combining Typed Tokens with Contrastive Learning for Effective Code Clone Detection"
pytorch
cudatoolkit
datasets
transformers
gensim
CC2Vec/
|--scrpts/ # scripts for CC2Vec
|--bash.py
|--dot2sent.py
|--word2csv.py
|-- ...
|--train_att.py # pretrain for CC2Vec
|--evalutate.py # evaluate models
python train_att.py
python \scripts\*.py