support int16 grayscale images #105

Closed
bodokaiser opened this issue Mar 14, 2017 · 12 comments

@bodokaiser
Contributor

This is often the case with medical (MRI) data.

The required changes would be in ToTensor, probably something like:

# PIL image mode: 1, L, P, I, F, RGB, YCbCr, RGBA, CMYK
if pic.mode == 'YCbCr':
  nchannel = 3
else:
  nchannel = len(pic.mode)
# handle PIL Image
buf = pic.tobytes()
if len(buf) > pic.width * pic.height * nchannel:
  img = torch.LongTensor(torch.LongStorage.from_buffer(buf))
else:
  img = torch.ByteTensor(torch.ByteStorage.from_buffer(pic.tobytes()))
img = img.view(pic.size[1], pic.size[0], nchannel)

as well as in ToPILImage (just remove normalization to [0, 255] here?).

However, I can't assess possible side effects. int16 support may not be very good in Pillow (e.g. plt.imshow(Image.fromarray(int16_np_array)) does not work), and there may be other transforms that depend on the [0, 255] byte range.
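
For context (this sketch is not part of the original thread), the buffer-length check in the snippet above works because a 16-bit grayscale PIL image ('I;16' mode) returns two bytes per pixel from tobytes(); note also that len(pic.mode) is 4 for 'I;16', so the channel-count heuristic would need a special case as well:

import numpy as np
from PIL import Image

# Build a synthetic 16-bit grayscale image in mode 'I;16' (little-endian uint16).
arr = (np.arange(16, dtype=np.uint16) * 4096).reshape(4, 4)
pic = Image.frombytes('I;16', (4, 4), arr.tobytes())

buf = pic.tobytes()
print(len(buf))                  # 32: two bytes per pixel
print(pic.width * pic.height)    # 16: one element per pixel
print(len(pic.mode))             # 4: 'I;16' would break the len(pic.mode) channel heuristic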

@fmassa
Member

fmassa commented Mar 14, 2017

I'd say as long as the returned tensor is properly converted to float and scaled to [0,1], things should be fine.
But we need to check whether standard image transforms (like rotating, cropping, etc.) work correctly in PIL for the int16 type.

Also, LongTensor is actually int64, you might be looking for a ShortTensor instead (which is signed).
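
To make the scaling suggestion concrete, here is a minimal sketch (not from the thread) of converting an unsigned 16-bit grayscale PIL image to a float tensor in [0, 1], going through NumPy rather than torch.*Storage.from_buffer; the 'I;16' mode and the 65535 divisor are assumptions for unsigned data:

import numpy as np
import torch

def to_float_tensor_16bit(pic):
    # Sketch only: assumes an unsigned 16-bit grayscale image ('I;16') whose
    # pixels Pillow exposes to NumPy as uint16 via np.asarray.
    arr = np.asarray(pic).astype(np.int32)        # widen so the uint16 values survive
    t = torch.from_numpy(arr).float() / 65535.0   # scale the 16-bit range to [0, 1]
    return t.unsqueeze(0)                         # 1 x H x W, matching ToTensor's C x H x W layout

For signed int16 data the divisor and offset would differ, which is part of the ambiguity discussed below.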

@bodokaiser
Contributor Author

That's the problem. I put together a minimal example where you can examine the problem.

from torchvision.transforms import Compose, ToPILImage, ToTensor
from matplotlib import pyplot as plt

import skimage.io
import numpy as np

img = skimage.io.imread('mr.tif')
print('img', img.shape, img.dtype)

plt.imshow(img)
plt.show()

transform = Compose([
    ToPILImage(),
    ToTensor(),
])

# add a channel dimension (H x W -> H x W x 1) so ToPILImage accepts the array
timg = transform(np.expand_dims(img, 2))

plt.imshow(timg[0].numpy())
plt.show()

[Screenshots: the original image and the transformed output]

Here is the corresponding TIFF file:
mr.tif.zip

@fmassa
Member

fmassa commented Mar 14, 2017

The example you posted shows that the current code is not adapted to int16 images. Or did you already try adding the modifications you mentioned?

@bodokaiser
Contributor Author

updated:

# PIL image mode: 1, L, P, I, F, RGB, YCbCr, RGBA, CMYK
if pic.mode == 'YCbCr':
  nchannel = 3
else:
  nchannel = len(pic.mode)
# handle PIL Image
buf = pic.tobytes()
if len(buf) > pic.width * pic.height * nchannel:
  img = torch.ShortTensor(torch.ShortStorage.from_buffer(buf, 'native'))
else:
  img = torch.ByteTensor(torch.ByteStorage.from_buffer(pic.tobytes()))
img = img.view(pic.size[1], pic.size[0], nchannel)

This fails with RuntimeError: size '[466 x 394 x 1]' is invalid for input of with 367208 elements at /private/var/folders/y0/d4npmpd50971gpgqxtsvc25m0000gn/T/pip-_fraocf5-build/torch/lib/TH/THStorage.c:59. However, changing it to:

if len(buf) > pic.width * pic.height * nchannel:
  img = torch.ShortTensor(np.fromstring(buf, dtype=np.int16)[0::2])
else:
  img = torch.ByteTensor(torch.ByteStorage.from_buffer(pic.tobytes()))
img = img.contiguous().view(pic.size[1], pic.size[0], nchannel)

does the job. Furthermore we need to change:

class ToPILImage(object):
    """Converts a torch.*Tensor of range [0, 1] and shape C x H x W
    or numpy ndarray of dtype=uint8, range [0, 255] and shape H x W x C
    to a PIL.Image of range [0, 255]
    """

    def __call__(self, pic):
        npimg = pic
        mode = None
        if not isinstance(npimg, np.ndarray):
            npimg = pic.mul(255).byte().numpy()
            npimg = np.transpose(npimg, (1, 2, 0))

        if npimg.shape[2] == 1:
            npimg = npimg[:, :, 0]
            if npimg.dtype != np.int16:
                mode = "L"

        return Image.fromarray(npimg, mode=mode)

This works but is of course just a quick hack.
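
As an aside (not part of the thread), an alternative to the byte-skipping hack above is to let Pillow and NumPy do the element-size and endianness bookkeeping; this is only a sketch and assumes np.asarray(pic) yields the expected 16-bit dtype for the image's mode:

import numpy as np
import torch

def pil_16bit_to_tensor(pic):
    # Alternative sketch: let Pillow expose its pixel buffer through the
    # NumPy array interface instead of reinterpreting raw bytes by hand.
    arr = np.asarray(pic)             # e.g. uint16 for 'I;16', int32 for 'I'
    if arr.ndim == 2:                 # grayscale: add a trailing channel dimension
        arr = arr[:, :, None]
    if arr.dtype == np.uint16:
        # torch (at the time of this thread) has no uint16 tensor type,
        # so widen to int32 before handing the array to torch.
        arr = arr.astype(np.int32)
    return torch.from_numpy(np.ascontiguousarray(arr))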

@fmassa
Member

fmassa commented Mar 14, 2017

OK, cool.
But I was wondering: does PIL natively support image operations on int16 images, such as rotate or crop? If it doesn't, then even if we adapt ToPILImage and ToTensor, we still won't be able to perform these operations. Also, since ToTensor converts the image to float, there would be no way of knowing whether the original image was int16 or uint8, meaning that applying ToTensor() followed by ToPILImage() would not return the identity.
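
One way to answer the PIL-support question empirically (again, just a sketch, not something from the thread) is to run the relevant operations on a synthetic 16-bit image and see which of them raise:

import numpy as np
from PIL import Image

# Build a synthetic 16-bit grayscale image and try the common transforms on it.
arr = (np.arange(64, dtype=np.uint16) * 1000).reshape(8, 8)
pic = Image.frombytes('I;16', (8, 8), arr.tobytes())

for name, op in [('crop',   lambda im: im.crop((0, 0, 4, 4))),
                 ('rotate', lambda im: im.rotate(30)),
                 ('resize', lambda im: im.resize((4, 4)))]:
    try:
        op(pic)
        print(name, 'ok')
    except Exception as exc:
        print(name, 'failed:', exc)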

@bodokaiser
Contributor Author

bodokaiser commented Mar 14, 2017

According to this issue and this PR, it does, but only for grayscale images.

Regarding the behavior of ToTensor(), one way to solve this would be for ToTensor() to keep the data type from the PIL.Image but accept the target data type as an argument.
Alternatively, we could ignore the fact that ToPILImage(ToTensor()) does not return the identity; then we would have no API breaks, and I don't think we lose anything through this, do we?
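
For illustration, a rough sketch of the "target data type as argument" idea might look like the following; the class name ToTensorWithDtype is hypothetical, this is not the API that was eventually merged, and dtypes without a torch counterpart (such as uint16) would still need widening:

import numpy as np
import torch

class ToTensorWithDtype(object):
    # Hypothetical sketch of the "target data type as argument" idea;
    # the class name and keyword are made up for illustration.

    def __init__(self, dtype=None):
        self.dtype = dtype        # e.g. np.float32 or np.int16; None keeps the source dtype

    def __call__(self, pic):
        arr = np.asarray(pic)
        if self.dtype is not None:
            arr = arr.astype(self.dtype)
        if arr.ndim == 2:
            arr = arr[:, :, None]
        # H x W x C -> C x H x W, matching torchvision's tensor layout.
        arr = np.ascontiguousarray(arr.transpose(2, 0, 1))
        return torch.from_numpy(arr)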

@soumith
Member

soumith commented Mar 23, 2017

Bodo, it looks like you've been making a lot of progress already.

If you want to fire a few PRs to make torchvision work with int16 out of the box, I would love to have them. If not, I will eventually get to this for sure.

@bodokaiser
Contributor Author

bodokaiser commented Mar 23, 2017 via email

@soumith
Copy link
Member

soumith commented Mar 23, 2017

A range of 0 to 65535 sounds fine for int16/uint16. You can remove the image scaling if you want to; I don't have experience with this domain, so I'll let you make the call.

In the case of identity preservation, ToPILImage needs to take a kwarg of Int16=True or something for the identity round trip to happen. I don't see a better way. Same for ToTensor: taking the target data type as a kwarg seems good.
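
To make the kwarg suggestion concrete, here is a hypothetical sketch (the class name, the int16 keyword, and the 'I;16' output mode are all assumptions, not the merged API) of what an int16-aware ToPILImage could look like:

import numpy as np
from PIL import Image

class ToPILImageInt16(object):
    # Hypothetical sketch of the int16 kwarg idea; the class name and the
    # int16 keyword are made up for illustration, not the merged API.

    def __init__(self, int16=False):
        self.int16 = int16

    def __call__(self, pic):
        npimg = pic
        if not isinstance(npimg, np.ndarray):
            if self.int16:
                npimg = pic.numpy()                  # keep the raw 16-bit values
            else:
                npimg = pic.mul(255).byte().numpy()  # usual [0, 1] -> [0, 255] path
            npimg = np.transpose(npimg, (1, 2, 0))
        if npimg.shape[2] == 1:
            npimg = npimg[:, :, 0]
        if self.int16:
            # 'I;16' stores unsigned little-endian 16-bit pixels; negative
            # int16 values would wrap here, which is part of the ambiguity
            # discussed in this thread.
            h, w = npimg.shape
            return Image.frombytes('I;16', (w, h), npimg.astype('<u2').tobytes())
        return Image.fromarray(npimg)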

@bodokaiser
Contributor Author

@soumith @fmassa PR #122 is up for discussion!

@alykhantejani
Contributor

alykhantejani commented Sep 6, 2017

@fmassa I think this can now be closed as #122 was merged.

@fmassa
Member

fmassa commented Sep 6, 2017

Thanks @alykhantejani !
