
Add Apple's MobileOne encoder #693

Merged
merged 10 commits into from Dec 15, 2022

Conversation

kevinpl07 (Contributor)

Hello,

I added support for Apple's MobileOne encoder.

Paper: Link

There were very few changes I had to make relative to their official GitHub repo: Link

It works with all decoders and shows impressive inference times for 256×256 images:

Encoder-decoder combination, inference time in vanilla torch (seconds):
mobileone_s1_pspnet_256 0.0313718318939209
mobileone_s0_pan_256 0.03421592712402344
mobileone_s2_pspnet_256 0.036206960678100586
mobileone_s3_pspnet_256 0.04711484909057617
mobileone_s1_pan_256 0.05329489707946777
mobileone_s0_linknet_256 0.05789995193481445
mobileone_s0_deeplabv3plus_256 0.058853864669799805
mobileone_s0_fpn_256 0.07664108276367188
mobileone_s4_pspnet_256 0.0768282413482666
mobileone_s1_deeplabv3plus_256 0.07886672019958496
mobileone_s2_pan_256 0.07946181297302246
mobileone_s3_pan_256 0.09101414680480957
mobileone_s1_fpn_256 0.09615683555603027
mobileone_s1_linknet_256 0.09956574440002441
mobileone_s2_fpn_256 0.11291790008544922
mobileone_s0_unet_256 0.11676502227783203
mobileone_s2_linknet_256 0.12518310546875
mobileone_s3_deeplabv3plus_256 0.12642478942871094
mobileone_s2_deeplabv3plus_256 0.1289658546447754
mobileone_s3_fpn_256 0.1370537281036377
mobileone_s4_pan_256 0.14015984535217285
mobileone_s1_unet_256 0.15249204635620117
mobileone_s3_linknet_256 0.15824413299560547
mobileone_s4_deeplabv3plus_256 0.16476082801818848
mobileone_s0_manet_256 0.17203474044799805
mobileone_s2_unet_256 0.17334604263305664
mobileone_s4_fpn_256 0.182358980178833
mobileone_s3_unet_256 0.20330286026000977
mobileone_s4_linknet_256 0.21462082862854004
mobileone_s0_deeplabv3_256 0.22992897033691406
mobileone_s4_unet_256 0.24337363243103027
mobileone_s0_unetplusplus_256 0.29451799392700195
mobileone_s1_deeplabv3_256 0.31217503547668457
mobileone_s1_manet_256 0.3140380382537842
mobileone_s1_unetplusplus_256 0.5090749263763428
mobileone_s2_deeplabv3_256 0.5372707843780518
mobileone_s3_deeplabv3_256 0.5489542484283447
mobileone_s2_unetplusplus_256 0.5728631019592285
mobileone_s4_deeplabv3_256 0.638185977935791
mobileone_s2_manet_256 0.6446411609649658
mobileone_s3_manet_256 0.6838269233703613
mobileone_s3_unetplusplus_256 0.6991360187530518
mobileone_s4_manet_256 0.748121976852417
mobileone_s4_unetplusplus_256 0.9898359775543213
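For reference, numbers like these can be produced with a simple wall-clock harness along the following lines (a minimal sketch; the actual benchmark script is not included in this PR, and `time_inference` is a hypothetical helper):

```python
import time

def time_inference(run_once, warmup=2, runs=10):
    """Average wall-clock seconds per call, after a few warmup calls."""
    for _ in range(warmup):
        run_once()
    start = time.perf_counter()
    for _ in range(runs):
        run_once()
    return (time.perf_counter() - start) / runs
```

With torch, `run_once` could be something like `lambda: model(torch.rand(1, 3, 256, 256))` wrapped in `torch.no_grad()`; on GPU you would also need to synchronize the device before reading the clock.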

@qubvel (Owner)

qubvel commented Dec 8, 2022

Hi, thanks for your work and contribution!
Could you please correct the code formatting and add information about the encoder to the docs?

@kevinpl07 (Contributor, Author)

> Hi, thanks for your work and contribution! Could you please correct the code formatting and add information about the encoder to the docs?

Done :)

@JulienMaille (Contributor)

Thanks for your contribution! I tried it on my side and could not make it work when the input images have only one channel (greyscale images). Is that a known limitation?

@kevinpl07 (Contributor, Author)

I honestly didn't check that. Let me investigate.

@kevinpl07 (Contributor, Author)

> Thanks for your contribution! I tried it on my side and could not make it work when the input images have only one channel (greyscale images). Is that a known limitation?

I looked into the grayscale limitation and could not make it work without more drastic changes to Apple's code (beyond passing "in_channel" through all the init functions).
I added the limitation to the README file.
I or someone else can revisit this; I don't think the architecture itself limits the channels, it just needs more thought put into it.
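For context, the usual trick for adapting pretrained 3-channel first-conv weights to a single channel is to sum the kernel over the input-channel axis, which is roughly what smp's `patch_first_conv` does for 1-channel inputs. A dependency-free sketch of that reduction (`collapse_rgb_kernel` is a hypothetical name, and it operates on nested lists rather than tensors):

```python
def collapse_rgb_kernel(kernel):
    """Reduce a [out][3][k][k] kernel to [out][1][k][k] by summing
    the pretrained RGB weights into a single input channel."""
    out = []
    for filt in kernel:  # one [3][k][k] filter per output channel
        r, g, b = filt
        summed = [
            [r[i][j] + g[i][j] + b[i][j] for j in range(len(r[i]))]
            for i in range(len(r))
        ]
        out.append([summed])  # keep the (now size-1) input-channel axis
    return out
```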

@JulienMaille (Contributor)

I can try to give it a look; if you already have advice to share, it might help.

@JulienMaille (Contributor)

JulienMaille commented Dec 9, 2022

The model must be initialized with 3 channels so that the pretrained weights can be loaded.
Then utils.patch_first_conv is in charge of updating the first convolution to the desired number of channels (1 in my case).
It loops through blocks and patches the first Conv2d it finds; however, in our case it seems we at least need to patch the stage0 rbr_conv.

@kevinpl07 (Contributor, Author)

> The model must be initialized with 3 channels so that the pretrained weights can be loaded. Then utils.patch_first_conv is in charge of updating the first convolution to the desired number of channels (1 in my case). It loops through blocks and patches the first Conv2d it finds; however, in our case it seems we at least need to patch the stage0 rbr_conv.

Feel free to send me a snippet or create a PR if it works! 👍

@JulienMaille (Contributor)

@kevinpl07 what's the purpose of reparameterize?

@JulienMaille (Contributor)

JulienMaille commented Dec 9, 2022

Monkey patching like this seems to do the trick:

    from . import _utils as utils

    def set_in_channels(self, in_channels, pretrained=True):
        """Change first convolution channels."""
        if in_channels == 3:
            return  # weights were trained on 3 channels; nothing to patch

        self._in_channels = in_channels
        self._out_channels = tuple([in_channels] + list(self._out_channels)[1:])
        # Patch both stage0 branches, not just the first Conv2d found
        utils.patch_first_conv(model=self.stage0.rbr_conv, new_in_channels=in_channels, pretrained=pretrained)
        utils.patch_first_conv(model=self.stage0.rbr_scale, new_in_channels=in_channels, pretrained=pretrained)
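For completeness, the "monkey patching" part means assigning the replacement method onto the encoder class at runtime. A self-contained illustration with a dummy class (`DummyEncoder` is a stand-in for the real MobileOne encoder class in segmentation_models.pytorch):

```python
class DummyEncoder:
    """Stand-in for the real encoder class."""
    def set_in_channels(self, in_channels, pretrained=True):
        raise NotImplementedError("stock version patches the wrong conv")

def patched_set_in_channels(self, in_channels, pretrained=True):
    # The real replacement would also call patch_first_conv on both
    # stage0 branches, as in the snippet above.
    self._in_channels = in_channels

# Assign onto the class: existing and future instances use the new method.
DummyEncoder.set_in_channels = patched_set_in_channels
```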

@kevinpl07 (Contributor, Author)

> @kevinpl07 what's the purpose of reparameterize?

Essentially, the multi-branch structure is beneficial for training but has drawbacks during inference. The reparameterize function takes the model after training and converts it to a plain, CNN-like structure for inference. It can be called on the complete segmentation model because it checks whether individual components have a reparameterize function.
See Apple's official repo for more info.
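The dispatch described above can be sketched as follows (a simplified, torch-free stand-in; the actual branch-fusion math lives in Apple's ml-mobileone code and operates on torch modules):

```python
class MultiBranchBlock:
    """Stand-in for a MobileOne block: parallel branches at train time."""
    def __init__(self):
        self.inference_mode = False

    def reparameterize(self):
        # In the real model this folds the parallel conv/BN branches
        # into a single conv; here we just record the switch.
        self.inference_mode = True

class DecoderBlock:
    """Stand-in for a plain decoder component with no branches to fold."""

def reparameterize_model(modules):
    # Walk every component and reparameterize only those that support it,
    # so the pass is safe to call on the full segmentation model.
    for m in modules:
        if hasattr(m, "reparameterize"):
            m.reparameterize()
    return modules
```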

@JulienMaille (Contributor)

I'm surprised by the size of the model. I'm used to working with unet-resnet18 (depth 4), and unet-mobileone_s2 (depth 4) is still bigger (23 MB vs 14 MB).

@kevinpl07 (Contributor, Author)

kevinpl07 commented Dec 12, 2022

> I'm surprised by the size of the model. I'm used to working with unet-resnet18 (depth 4), and unet-mobileone_s2 (depth 4) is still bigger (23 MB vs 14 MB).

I know what you mean. The thing is that, at the end of the day, they only optimized for classification inference time on an iPhone 12.
From their paper:
[screenshot of a results table from the paper]
Further they state:

> [...] For example, MobileOne-S1 has 4.8M parameters and incurs a latency of 0.89ms, while MobileNet-V2 [2] has 3.4M (29.2% less than MobileOne-S1) parameters and incurs a latency of 0.98ms. At this operating point, MobileOne attains 3.9% better top-1 accuracy than MobileNet-V2.

So essentially:

  • It's optimized for inference speed over parameter count.
  • It might not even be a good backbone for segmentation (this PR lets people try it; no other repo offers MobileOne for segmentation).
  • We need experiments :)

Hope I could help a bit.

@JulienMaille (Contributor)

I get you, but on paper resnet18 (11M) has more parameters than mobileone_s0/1/2/3.
It seems to perform well on segmentation, but I've only scratched the surface.

@kevinpl07 (Contributor, Author)

@JulienMaille Can you check whether my last commit matches your suggestion?

@qubvel can you trigger the workflow again, once Julien approves?

@JulienMaille (Contributor)

Looks good to me

    mod_list.add_module("bn", nn.BatchNorm2d(num_features=self.out_channels))
    return mod_list

    def set_in_channels(self, in_channels, pretrained=True):
@qubvel (Owner)

probably we should move it to the MobileOne class?

@kevinpl07 (Contributor, Author)

Correct, my mistake -> Done.

@qubvel (Owner)

qubvel commented Dec 14, 2022

@kevinpl07 could you please also add information about the new encoders to the docs here:
https://github.com/qubvel/segmentation_models.pytorch/blob/master/docs/encoders.rst

@kevinpl07 (Contributor, Author)

> @kevinpl07 could you please also add information about the new encoders to the docs here: https://github.com/qubvel/segmentation_models.pytorch/blob/master/docs/encoders.rst

Done as well :)

@qubvel qubvel merged commit c2fce7b into qubvel:master Dec 15, 2022
@qubvel (Owner)

qubvel commented Dec 15, 2022

Thanks a lot, merged!

@JulienMaille (Contributor)

@kevinpl07 I gave it a try; IoU is great, but inference time on CUDA is not optimized (tried with OpenCV with the CUDA backend).
