Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question About Raw Dataset #18

Closed
HyunaShin opened this issue Mar 24, 2019 · 2 comments
Closed

Question About Raw Dataset #18

HyunaShin opened this issue Mar 24, 2019 · 2 comments

Comments

@HyunaShin
Copy link

HyunaShin commented Mar 24, 2019

HI, Thanks for your contributions.
I'm concerned with shadowing code2vec, and wishing to do it from the scratch.(including preprocess raw data to preprocessed dataset)

So, I wanna ask you that offers raw dataset or not!
Is the only way to get raw dataset is crawling myself? 😢

I look forward to hearing from you soon.
Sincerely,
Hyuna Shin

@urialon
Copy link
Collaborator

urialon commented Mar 24, 2019

Hi Hyuna,
Thank you for your interest in code2vec.
Three raw datasets are available on my website:
http://urialon.cswp.cs.technion.ac.il/publications/

Called Java-small, Java-med and Java-large. In my website they are in raw *.java files, not preprocessed.
Best,
Uri

@HyunaShin
Copy link
Author

Thanks for replying this frivolous question..!
Wish I didn't bothered you a lot.
Thank you again.
Regards,
Hyuna Shin

@urialon urialon closed this as completed Mar 24, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants