New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] pytorch dataloader index error #2089
Comments
Hey @lspinheiro Thank you for reporting this issue! Were you running this on one of our public datasets? If so, could you pls share the link. It will help us reproduce the issues. If you're running on a private dataset, don't worry about it. |
Hi @istranic. It is a private dataset, sadly I can't share any scripts about the ingestion. If it helps, I'm ingesting into deeplake by appending samples with the following format: dset_entry = {
"images": image,
"labels": classes.astype(np.int32),
"masks/label": seg_classes,
"masks/mask": masks.astype(np.bool8),
"metadata": metadata,
} This is my dataset specification: |
@farizrahman4u @istranic . I'm still debugging this. It looks like the data loader starts failing for all samples after some index. I'm guessing it is something to do with how the The |
Hey @lspinheiro This is out of my wheelhouse, but @farizrahman4u will get back to you tomorrow. Thank you for digging in further! |
Two updates.
Maybe you can try to reproduce it by generating a dataset with JSON tensors with attributes of varying types and lengths. |
@lspinheiro Thanks for the ticket and detailed break down of the issue. Would you be able to check if the problem persists with the branch |
Thanks @farizrahman4u , I'm travelling and with pretty crap internet atm, but I will give it a try as soon as possible. |
馃悰馃悰 Bug Report
I'm trying to understand an issue that is making the PyTorch data loader from deeplake throw an index error for some samples unexpectedly. When I try to fetch the data directly from the data set, the behaviour is not reproducible.
The error first appeared during model training. I was able to reproduce it with the following code:
The following error is thrown without much context.
But the following code produces no errors and exhausts the iterator.
I'm looking for help here since it may be related to the
chunk_engine
behaviour. It could help if the internal exception handler were more explicit about the error.鈿欙笍 Environment
Python
version(s): 3.8.10OS
: Ubuntu 18.04IDE
: VS-CodePackages
: [torch==1.13.1, deeplake==3.1.7]The text was updated successfully, but these errors were encountered: