Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crashes on 2xT4 GPUs #1433

Closed
9 tasks done
Philmod opened this issue Oct 18, 2022 · 3 comments
Closed
9 tasks done

Crashes on 2xT4 GPUs #1433

Philmod opened this issue Oct 18, 2022 · 3 comments
Assignees
Labels
bug Something isn't working help wanted Extra attention is needed wontfix This will not be worked on

Comments

@Philmod
Copy link

Philmod commented Oct 18, 2022

馃悰 Bug Report

Catalyst fails on 2xT4 GPUs.

We install Catalyst in the Kaggle base image. This week we wanted to release a new image with upgraded packages. It doesn't look like Catalyst was upgraded, but Accelerate was (from 0.12 to 0.13.1).

How To Reproduce

Steps to reproduce the behavior:
Run this unit test on a 2xT4 GPU.

Code sample

https://github.com/Kaggle/docker-python/blob/main/tests/test_catalyst.py

Screenshots

Screen Shot 2022-10-18 at 9 55 46 AM

Expected behavior

The test passes on a P100 GPU.

Environment

https://gist.github.com/Philmod/0349a2cf16d76e8d20e960d750962241

Checklist

  • bug description
  • steps to reproduce
  • expected behavior
  • environment
  • code sample / screenshots

FAQ

Please review the FAQ before submitting an issue:

@Philmod Philmod added bug Something isn't working help wanted Extra attention is needed labels Oct 18, 2022
@github-actions
Copy link

Hi! Thank you for your contribution! Please re-check all issue template checklists - unfilled issues would be closed automatically. And do not forget to join our slack for collaboration.

@Philmod
Copy link
Author

Philmod commented Oct 18, 2022

Update: pinning accelerate package to 0.12.0 fixes the problem.

@stale
Copy link

stale bot commented Dec 21, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix This will not be worked on label Dec 21, 2022
@stale stale bot closed this as completed Feb 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

3 participants