Skip to content

Conversation

@priyakasimbeg
Copy link
Contributor

@priyakasimbeg priyakasimbeg commented Nov 18, 2023

Should fix #582 and #572.
Without this fix the weight shape is bsz/num_devices because the targets get reshaped in the shard_and_pad function in data_utils.py.

@github-actions
Copy link

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@priyakasimbeg priyakasimbeg marked this pull request as ready for review November 18, 2023 08:05
@priyakasimbeg priyakasimbeg requested a review from a team as a code owner November 18, 2023 08:05
@priyakasimbeg priyakasimbeg changed the base branch from main to dev November 18, 2023 08:05
@priyakasimbeg priyakasimbeg merged commit 322014c into dev Nov 18, 2023
@github-actions github-actions bot locked and limited conversation to collaborators Nov 18, 2023
@pomonam pomonam deleted the mnist_pytorch_fix branch December 3, 2023 01:04
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Getting an error when running on a single host with 8xA100 GPU

2 participants