-
Notifications
You must be signed in to change notification settings - Fork 641
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ClearML-Data:Could not load dataset state #1123
Comments
Hi @alex-sage ! Before deleting a dataset, you need to delete/archive all dataset versions under it. from clearml.backend_api.session.client import APIClient
client = APIClient()
client.projects.delete(project="1cdc8407d0494adf822d282f7ad45739", force=True) I am not sure why the dataset you created didn't upload/write the state file properly. Could be a network/server error. If you have a consistent way to reproduce the issue, please let us know! |
Thank you for your help, now I was finally able to delete the invalid datasets. I found one way to reproduce the problem, but it seems to only happen when using our network storage as a target.
The second line is using an invalid wildcard on purpose, so that 0 files will be added. This seems to cause the state json file not to be written. Seems like this could be a bug, since I doubt our network keeps failing exactly at this moment 3 times in a row 😉 Edit: I just noticed that the same thing also happens if I add files and abort the hash calculation (By pressing CTRL-C once) half-way through. It prints the message "User aborted", but does not seem to write back the state file. |
This is still a bug that is affecting datasets stored locally. |
Hi @alex-sage ! We have acknowledged the issue. |
Describe the bug
I keep running into this issue, where I want to set up a dataset and it ends up no longer being able to read the datset state.
I created a new dataset using the basic CLI command, started adding data. Suddenly the CLI would only give me the following error:
This now happened 3 times already.
I can now no longer delete the datasets and start over, as the delete command gives me the same error. When trying to delete the dataset through the web UI, I get this error:
When trying to remove the dataset with the python API using
force=True
as suggested, I get this:Can anyone tell me how to get out of this state?
To reproduce
I could try again and give the exact commands I use, but since this now happened to me 3 times, I'm not sure if they matter all that much...
Perhaps this helps - This is the stack trace I get when trying to do a Dataset.get() with the ID of one of the affected datasets:
It really looks to me like the Json file was not written correctly (it seems to be empty).
Expected behaviour
I'd expect the state to be found and that the commands don't give me this error. At the very least it would be nice to be able to remove these datasets and try to start over.
Environment
The text was updated successfully, but these errors were encountered: