Non-RGB SPAN models #25

RunDevelopment · 2024-01-29T01:19:03Z

I just read through the SPAN code again, and wondered whether span even supports anything other than RGB images as input. If I understand PyTorch tensors correctly, then this line:

neosr/neosr/archs/span_arch.py

Line 239 in 6973906

x = (x - self.mean) * self.img_range

will fail for non-RGB inputs because self.mean is defined like this:

neosr/neosr/archs/span_arch.py

Line 222 in 6973906

self.mean = torch.Tensor(rgb_mean).view(1, 3, 1, 1)

and will always have 3 channels.

So the torch.Tensor(rgb_mean).view(1, 3, 1, 1) should probably be changed to torch.Tensor(rgb_mean).view(1, in_channels, 1, 1) . Alternatively, we could also use the same approach as SwinIR:

https://github.com/muslll/neosr/blob/master/neosr/archs/swinir_arch.py#L775-L779

        if in_chans == 3:
            rgb_mean = (0.4488, 0.4371, 0.4040)
            self.mean = torch.Tensor(rgb_mean).view(1, 3, 1, 1)
        else:
            self.mean = torch.zeros(1, 1, 1, 1)

Correction: The above suggested are not backwards compatible because of single-channel images are broadcasted. So we need to keep the current behavior for in_chans in (1, 3).

What do you think?

The text was updated successfully, but these errors were encountered:

umzi2 · 2024-01-29T04:19:36Z

It at least runs the training and gives the expected result, today I can run the training of a 1 channel model and show the result.

RunDevelopment · 2024-01-29T12:11:02Z

You're right. It will work for 1-channel models:

>>> m = torch.zeros(1, 3, 1, 1)
>>> i = torch.zeros(1, 1, 200, 100)
>>> (i - m).shape
torch.Size([1, 3, 200, 100])

I mean, the first thing a 1-channel model does is to convert the input image to RGB, but the rest of the model seemingly has no issue with that. ¯\(ツ)/¯

It doesn't work for RGBA models though:

>>> i = torch.zeros(1, 4, 200, 100)
>>> (i - m).shape
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
RuntimeError: The size of tensor a (4) must match the size of tensor b (3) at non-singleton dimension 1

muslll · 2024-01-29T15:28:09Z

Hi @RunDevelopment, this has been discussed a lot these few days in the EE discord. Consensus was that the x = (x - self.mean) * self.img_range forward affects stability. I thought about creating a bool for disabling it, which would also solve the grayscale training issue you mentioned. What do you think?

    def __init__(self,
                 num_in_ch=3,
                 num_out_ch=3,
                 feature_channels=48,
                 upscale=upscale,
                 bias=True,
                 norm=False, # new bool
                 img_range=1.0,
                 rgb_mean=(0.4488, 0.4371, 0.4040)
                 ):
        super(span, self).__init__()

        in_channels = num_in_ch
        out_channels = num_out_ch
        self.img_range = img_range
        self.mean = torch.Tensor(rgb_mean).view(1, 3, 1, 1)
        self.norm = norm

        self.conv_x = [...]
        
    def forward(self, x):
        if self.norm:
            self.mean = self.mean.type_as(x)
            x = (x - self.mean) * self.img_range

        [...]

RunDevelopment · 2024-01-29T17:27:41Z

Sounds good. I don't know much about AI, so I trust you and the others on EE when you say that removing this won't cause problems.

As for spandrel parameter detection: since this parameter is a boolean, this makes detection (in a backwards compatible way) easy. We can use the same trick I used Real-CUGAN pro models. Basically, we optionally register a small tensor as a buffer and the presence of the tensor determines the parameter value.
In this case, we register the tensor only if norm=False. So the tensor will only be present on SPAN models without normalization, so everything is backwards compatible.

What do you think?

muslll · 2024-01-29T18:44:45Z

Looks good. Commited 👍

RunDevelopment · 2024-01-29T18:55:50Z

Sorry for not making this clear enough @muslll, but the Real-CUGAN trick I mentioned has to be implemented by neosr as well. If neosr doesn't register the tensor to signify the value of norm, there is no way for us to detect this.

RunDevelopment · 2024-01-29T18:59:59Z

On that note: your CUGAN implementation for pro models has the same issue. Since the pro parameter isn't stored in the .pth, there is no way to detect them as pro models. They would even be incompatible with the official Real-CUGAN code.

Should I make a separate issue for this?

muslll · 2024-01-29T19:10:19Z

I see, my bad. Commited here.

muslll · 2024-01-29T19:15:50Z

Should I make a separate issue for this?

No need, I just commited here. I also changed the tensor to isnorm instead, to avoid conflict with the main function param.

edit: fix

RunDevelopment · 2024-01-29T19:31:43Z

edit: fix

Still wrong. As I said here:

we register the tensor only if norm=False.

Right now, you are registering the tensor for the old behavior. But we need to register it for the new behavior and for it only.

muslll · 2024-01-29T19:47:06Z

I see. Is this right now?.

RunDevelopment · 2024-01-29T19:50:15Z

Yes, this works! Thank you @muslll!

Also, you don't have to make a property and stuff like I did with Real-CUGAN. I just did it there because both the hyper parameter and buffer had the same name...

muslll · 2024-01-29T19:51:13Z

Great. Thanks 👍

muslll added the enhancement New feature or request label Jan 29, 2024

muslll mentioned this issue Jan 29, 2024

SPAN implemented incorrectly chaiNNer-org/spandrel#145

Closed

muslll added the solved issue has been solved label Jan 29, 2024

RunDevelopment closed this as completed Jan 29, 2024

This was referenced Jan 29, 2024

Fix SPAN parameter detection tensor #27

Merged

Add SPAN norm parameter chaiNNer-org/spandrel#148

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-RGB SPAN models #25

Non-RGB SPAN models #25

RunDevelopment commented Jan 29, 2024 •

edited

Loading

umzi2 commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024 •

edited

Loading

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024 •

edited

Loading

muslll commented Jan 29, 2024

muslll commented Jan 29, 2024 •

edited

Loading

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

Non-RGB SPAN models #25

Non-RGB SPAN models #25

Comments

RunDevelopment commented Jan 29, 2024 • edited Loading

umzi2 commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024 • edited Loading

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024 • edited Loading

muslll commented Jan 29, 2024

muslll commented Jan 29, 2024 • edited Loading

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024

muslll commented Jan 29, 2024

RunDevelopment commented Jan 29, 2024 •

edited

Loading

muslll commented Jan 29, 2024 •

edited

Loading

RunDevelopment commented Jan 29, 2024 •

edited

Loading

muslll commented Jan 29, 2024 •

edited

Loading