This repository has been archived by the owner on Jul 30, 2024. It is now read-only.

Missing all.tok file when running data/github/preprocessing/preprocess.py #53

Closed
kdwedage opened this issue Jan 17, 2024 · 0 comments

Comments

@kdwedage

I downloaded the Java and Python GitHub data as outlined in the README, then tried to run the preprocessing script with the following command:

    python -m preprocessing.preprocess \
        path_2_github_data \
        --lang1 java \
        --lang2 python \
        --test_size 10000;

Upon completion I get the following error:
FileNotFoundError: [Errno 2] No such file or directory: '../../../path_2_github_data/java/all.tok'

The directory '../../../path_2_github_data/java/' exists, but I can confirm that all.tok is not in it.
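
To narrow this down, a small check like the one below could show which files are actually present in each language directory. It is only a diagnostic sketch: the helper name `report_missing` is hypothetical and not part of the repository, and `path_2_github_data` is the placeholder path from the command above. It assumes nothing about how the preprocessing script is supposed to produce all.tok.

```python
from pathlib import Path


def report_missing(data_root, langs, filename="all.tok"):
    """Map each language whose directory lacks `filename` to the file names found there.

    Hypothetical helper for diagnosis only; it is not part of the repository.
    """
    missing = {}
    for lang in langs:
        lang_dir = Path(data_root) / lang
        if not (lang_dir / filename).is_file():
            # Record what is actually there (empty list if the directory is absent).
            present = sorted(p.name for p in lang_dir.iterdir()) if lang_dir.is_dir() else []
            missing[lang] = present
    return missing


if __name__ == "__main__":
    # "path_2_github_data" is the placeholder path used in the command above.
    for lang, present in report_missing("path_2_github_data", ["java", "python"]).items():
        print(f"{lang}: all.tok not found; directory contains {present}")
```

Running this from the same working directory as the preprocessing command would show whether any intermediate files were written before the error occurred.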
