Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

zarr temp store files filling up local drive #67

Open
eintzandt opened this issue Feb 20, 2024 · 6 comments
Open

zarr temp store files filling up local drive #67

eintzandt opened this issue Feb 20, 2024 · 6 comments

Comments

@eintzandt
Copy link

I have an external drive (D:) on my desktop PC that I am using to store my large datasets (typically a directory containing 15 1-hour long wavfiles) that I want to predict using a trained DAS model. However, I am often running into a storage error "No space left on device".
image
My D: drive has ~1TB of available space, but my local drive is filling up with temp files when DAS is running the predictions. Is there something I did during installation that is making the temp file path be on my local C: drive instead of my larger external D: drive? Is there a way to reroute these temp files to the D: drive so my local drive doesn't fill up? My temporary solution is to go into my temp directory (C:/.../AppData/Local/Temp/) and manually delete the directories created ('zarr...'), which are ~5GB per 1 hour file I am predicting on. I think the code that is creating these temp files is the TempStore class in das/lib/site-packages/zarr/storage.py, I just don't know if there is a way in DAS to redirect these files.
image
Thank you!

@postpop
Copy link
Contributor

postpop commented Feb 21, 2024

Hi, I can see that this is a bit of an issue. We create these temporary files to enable processing of large files that do not fit in memory - everything is processed in chunks taken from the temp file. However, the temp files should be deleted when you close the audio file in DAS or at the very least when you close DAS. But I guess you did that and the temp files persisted?

@eintzandt
Copy link
Author

I am using the GUI to predict on a whole folder, so DAS isn't closing between opening files. The zarr temp files do not delete after finishing predictions for one file in the folder and starting the next file.

@postpop postpop closed this as completed in f68ab67 Mar 8, 2024
@postpop postpop reopened this Mar 8, 2024
@postpop
Copy link
Contributor

postpop commented Mar 11, 2024

Hi, in the newest version 0.32.1, we now explicitly delete the temporary directory after each file was predicted. Can you give this a try and let me know whether it fixes your issue? Thanks.

@jynkjp
Copy link

jynkjp commented Apr 16, 2024

Hello,
I am experiencing the same issue of accumulating large zarr folders in Temp when predicting across a folder of wav files. I updated to 0.32.2 from 0.31.0 and the issue persists. The folders are deleted when I close DAS but I am hoping to process large amounts of data overnight if possible.
I am using Windows 11.

Thank you

@postpop
Copy link
Contributor

postpop commented Apr 16, 2024

We now manually delete the temporary folders after each file was predicted but maybe we've missed sth.
Are you processing a folder full of files? Are you using the command line or the graphical user interface to do that? Thanks!

@jynkjp
Copy link

jynkjp commented Apr 16, 2024

Thank you for the quick reply.
I am processing a folder full of files that is on a Network-Attached Storage server (not OS drive). I am using command line to process with increased batch size but I just tested it on the GUI and the issue is there also.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants