Fix grayscale channel dimension (1) lost in DataLoader #307
cregouby merged 6 commits into mlverse:main
Conversation
cregouby left a comment
praise: thanks for these improvements
todo: see inline
res <- torch::torch_tensor(img)$permute(c(3, 1, 2))
dims <- dim(img)
if (length(dims) != 3)
  stop("Expected a 2D or 3D array.")
todo: please rely on the functions from conditions.R for correct user-message translation.
if (length(index) != 1) {
  return(lapply(as.integer(index), function(i) self$.getitem(i)))
}
suggestion: this is supposed to be the `.getbatch()` function. I would rather implement `.getbatch()` than try to implement a get-batch-within-getitem.
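For illustration, a minimal sketch of what such a `.getbatch()` body could look like, written here as a plain function over a mock `self` (the name `mnist_getbatch` and the field layout are hypothetical; it assumes `self$data` is an `[N, 28, 28]` array of grayscale images and `self$targets` a vector of labels). The key points are `drop = FALSE` indexing, which keeps the batch dimension even for a single index, and an explicit insertion of the channel dimension:

```r
# Hypothetical .getbatch() body for an MNIST-style dataset (sketch, not the
# actual torchvision implementation). Assumes self$data is [N, 28, 28].
mnist_getbatch <- function(self, index) {
  # drop = FALSE keeps all dimensions even when length(index) == 1
  x <- self$data[index, , , drop = FALSE]
  y <- self$targets[index]
  # Insert the grayscale channel dimension: [B, 28, 28] -> [B, 1, 28, 28].
  # Inserting a length-1 dimension is a pure reshape, so values are unchanged.
  x <- array(x, dim = c(dim(x)[1], 1L, dim(x)[2], dim(x)[3]))
  list(x = x, y = y)
}
```

A real implementation would set this as the `.getbatch` method of a `torch::dataset()` and convert `x` to a tensor, but the indexing logic is the same.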
x <- self$data[index, , ]
y <- self$targets[index]
idx <- as.integer(index)
suggestion: `.getitem()` is mainly an internal function for the dataloader to run, so `index` will always be an integer. I'd prefer to avoid the performance impact and remove this line.
x <- if (grepl("\\.xz$", path, ignore.case = TRUE)) {
  xzfile(path, open = "rb")
} else {
  gzfile(path, open = "rb")
}
if (length(index) != 1) {
  return(lapply(as.integer(index), function(i) self$.getitem(i)))
}

idx <- as.integer(index)
todo: same as for L454-L458.
expect_named(raw_items[[1]], c("x", "y"))
expect_equal(dim(raw_items[[1]]$x), c(1, 28, 28))
expect_named(raw_items, c("x", "y"))
expect_equal(dim(raw_items$x), c(2, 28, 28))
question: so there is no "fix of the grayscale channel dimension lost in the dataloader", true?
if (!is.null(self$target_transform))
  y <- self$target_transform(y)
if (!is.null(self$target_transform)) {
  if (length(index) > 1) {
…s[1] to be ndim=4. Make emnist_collection inherit mnist to remove duplicated code. Fix tests accordingly.
This would fix mlverse/torch#1417 but would raise mlverse/luz#159.
Have we looked at #264 and how this is handled?
Yes, this is where mlverse/luz#160 comes into play. I'll try to better document it in the {luz} pull request.
I believe this is tricky terrain. In theory, torchvision transforms are not implemented to operate on batches of images, so while we can fix this for some luz code, it will inevitably be problematic for other transforms. Otherwise, we should error earlier if a transform takes a 4d tensor as input, which would clearly indicate it's a batch of images.
Good point. I'll revert (again) on that part, and make that transform limitation explicit. |
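One way to make that transform limitation explicit, sketched as a base-R guard (the name `check_single_image` is hypothetical, and a real torchvision implementation would inspect the tensor's ndim rather than an array's `dim()`):

```r
# Sketch: fail fast when a single-image transform receives a batched input.
# Assumes images are represented as arrays with ndim <= 3 ([C, H, W] or [H, W]).
check_single_image <- function(x) {
  if (length(dim(x)) >= 4)
    stop("Transforms operate on single images (ndim <= 3), ",
         "but got a batched input with ndim = ", length(dim(x)), ".")
  invisible(x)
}
```

Erroring here, rather than deep inside a transform, makes the "no batched transforms" contract clear to the user.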
Fix grayscale channel handling in MNIST-style datasets
Summary
This PR fixes an issue where grayscale images could lose their channel dimension during batching, resulting in shapes like [B, 28, 28] instead of [B, 1, 28, 28] (seen on Apple M2). This could cause errors in models expecting channel-first input.
Changes
- .getitem() always preserves the channel dimension
- batch_size = 128
- kmnist_dataset .xz files
Impact
Makes dataset behavior more consistent across environments and avoids shape-related errors during training.
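As background, the root cause of the lost channel dimension can be illustrated in base R (a stand-in for the tensor case): R subsetting drops length-one dimensions by default, which is exactly how a grayscale batch can collapse from [B, 1, 28, 28] to [B, 28, 28]:

```r
# R drops length-1 dimensions on subsetting unless told otherwise.
imgs <- array(0, dim = c(10, 1, 28, 28))       # [N, C, H, W], grayscale C = 1
dim(imgs[1, , , ])                  # 28 28       -- channel dim silently dropped
dim(imgs[1, , , , drop = FALSE])    # 1 1 28 28   -- all dims preserved
```

Passing `drop = FALSE` (or re-adding the channel dimension explicitly, as the fixed `.getitem()` does) keeps shapes consistent regardless of batch size.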