Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cannot reproduce #38

Closed
robotNo201 opened this issue Feb 16, 2024 · 7 comments
Closed

cannot reproduce #38

robotNo201 opened this issue Feb 16, 2024 · 7 comments

Comments

@robotNo201
Copy link

I really dont think this can be reproduced,

RuntimeError: Error(s) in loading state_dict for CViViT:
Missing key(s) in state_dict: "discr.attn_blocks.3.null_kv", "discr.attn_blocks.3.q_scale", "discr.attn_blocks.3.k_scale", "discr.attn_blocks.3.norm.gamma", "discr.attn_blocks.3.norm.beta", "discr.attn_blocks.3.context_norm.gamma", "discr.attn_blocks.3.context_norm.beta", "discr.attn_blocks.3.to_q.weight", "discr.attn_blocks.3.to_kv.weight", "discr.attn_blocks.3.to_out.weight".
Unexpected key(s) in state_dict: "discr.blocks.6.conv_res.weight", "discr.blocks.6.conv_res.bias", "discr.blocks.6.net.0.weight", "discr.blocks.6.net.0.bias", "discr.blocks.6.net.2.weight", "discr.blocks.6.net.2.bias", "discr.blocks.5.downsample.1.weight", "discr.blocks.5.downsample.1.bias".
size mismatch for discr.to_logits.3.weight: copying a param with shape torch.Size([1, 8192]) from checkpoint, the shape in current model is torch.Size([1, 16384]).

@robotNo201
Copy link
Author

i use a simple video dataset, no problem with gpu and dataset, and i haven't changed anything on the source code.

@robotNo201
Copy link
Author

and this RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
have tried many gpu on google colab, all failed

@robotNo201
Copy link
Author

Alright, nevermine, its my problem, I should never waste time on a nonreproducible project, and a github less than 1k stars

@LuthandoMaqondo
Copy link

and this RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. have tried many gpu on google colab, all failed

I experienced the problem and posted it, as an issue.

@robotNo201
Copy link
Author

and this RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. have tried many gpu on google colab, all failed

I experienced the problem and posted it, as an issue.

I tried many all GPU on google colab but none work, what the error means is just that the training dataset and label are not correct, but i check the dataset and the label, you know it are the three lines:
texts = [
'a whale breaching from afar',
'young girl blowing out candles on her birthday cake',
'fireworks with blue and green sparkles'
]
also, the dataset is videos = torch.randn(3, 3, 17, 256, 128).cuda() # (batch, channels, frames, height, width)
mask = torch.ones((3, 17)).bool().cuda() # [optional] (batch, frames)
so i dont see any problem here because the sample size is 3

@robotNo201
Copy link
Author

and this RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. have tried many gpu on google colab, all failed

I experienced the problem and posted it, as an issue.

maybe we can work together to fix this error.

@LuthandoMaqondo
Copy link

LuthandoMaqondo commented Feb 17, 2024

Let's do it, can you share your fork or you or I can share mine (I forked and made my branch): luthando-contribution

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants