-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ask help for the codeclone dataset #27
Comments
@LeeSureman I haven't been following the issues on this project, but I would love to help. Are you still facing these issues when running the code? I'm looking into the dataset we uploaded as |
I apologize for the delay in reply! |
Thank you for your response. We needed to compare your model on the codeclonedetection task when we submit our paper to ACL 2022. Finally we cancel the comparison. But we will still consider your model and your released dataset in future work. |
Best of luck with your submission! Closing this issue, but please email or reopen the PR if needed. |
great work! I need some help of your codeclone dataset. If you do not mind spend a little time and help me figure out it , I will be very appreciated to you. I download it by the "scripts/download_data.py" in your repo (codeclone/full_data.json.gz) , but I do not know wether it is the dataset used in "4.1 Evaluating Functionality and Robustness: Zero-shot Code Clone Detection" in your paper. I see the "split" function in "representjs/clone_detection.py", so I'm confused... And for the 2065 pairs you mention in your paper, ( also in 4.1) , is it from the same dataset? and how to get it? If you do not mind spend a little time and help me figure out it , I will be very appreciated to you.
The text was updated successfully, but these errors were encountered: