Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Intermittent memory errors while running preprocess.py #31

Open
EvanCrabtree opened this issue Apr 5, 2024 · 0 comments
Open

Intermittent memory errors while running preprocess.py #31

EvanCrabtree opened this issue Apr 5, 2024 · 0 comments

Comments

@EvanCrabtree
Copy link

While running preprocess.py, I have received intermittent memory access errors.

I have been using the ramen dataset from the link provided and running python preprocess.py --dataset_path data/ramen.

Randomly, it will give me memory errors like double free or corruption (!prev) Aborted (core dumped) , corrupted size vs. prev_size Aborted and additionally a pytorch error that has been patched in pytorch 2.0.1 when trying to run on multithreaded CPU.

Setting the threads to 1 with torch.set_num_threads(1) fixes the pytorch issue but makes the preprocessing speed very slow.

I was wondering if there was any advice on how to fix these issues happening during the preprocess script execution.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant