Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unstable numeric output for downstream task (moat 4 w/o pos) #151

Open
edwardyehuang opened this issue Nov 12, 2022 · 1 comment
Open

Unstable numeric output for downstream task (moat 4 w/o pos) #151

edwardyehuang opened this issue Nov 12, 2022 · 1 comment

Comments

@edwardyehuang
Copy link

ckpts : moat4 w/o pos

The output from moat4 can easily result in the following layers (e.g., 3x3 conv) having a NaN output.

The same issue, at least, does not show in moat0.

It is the first time I have met this issue in my career (I met NaN many times, but never like this), so I need to take some time to investigate this issue.

I will update this issue if I have a new finding. Please also check if the provided ckpts are working.

@Chenglin-Yang
Copy link

Thanks for your interest. For this issue, I have two suggestions:

  1. First of all, make sure the weights of all layers are correctly loaded, including the stem layers, etc.
  2. You can adjust the learning rate, drop path rate and batch size to solve the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants