Conversation

Member

@d4l3k commented Feb 8, 2022

This adds a tutorial on how to use PyTorch for real-time inference on a Raspberry Pi 4. It requires a number of specialized steps to get everything set up correctly and running permanently, so it seems worthwhile to add to the tutorials.

This requires a 64-bit operating system, and Raspberry Pi OS 64-bit just came out of beta, so it seems like a good time to document this. https://www.raspberrypi.com/news/raspberry-pi-os-64-bit/

There are a number of existing tutorials, but they're far more complicated than they need to be, since pip install torch works out of the box on 64-bit images. It's much better to just pip install than to install from Google Drive, which the two top Google search results recommend.

This also seems to be the fastest known way of running MobileNetV2 on a Raspberry Pi 4 (without accelerators) that I've seen, by an order of magnitude. At ~30-32 fps that's ~33 ms per inference, including camera capture and processing. A quick Google search shows:

The images are currently hosted on githubusercontent; let me know if I should place them in the repo instead.

Test plan:

I've tested all of these steps from scratch using the official Raspberry Pi OS (64-bit) images.

Screenshots:

Screenshot: 2022-02-07-184540_1156x448_scrot

Screenshot: Real Time Inference on Raspberry Pi 4 (30 fps), PyTorch Tutorials 1.10.2+cu102 docs

@netlify

netlify bot commented Feb 8, 2022

✔️ Deploy Preview for pytorch-tutorials-preview ready!

🔨 Explore the source changes: 8a57a77

🔍 Inspect the deploy log: https://app.netlify.com/sites/pytorch-tutorials-preview/deploys/6201ddf34b45690008843867

😎 Browse the preview: https://deploy-preview-1821--pytorch-tutorials-preview.netlify.app

@msaroufim self-requested a review February 8, 2022 03:20
Member

@msaroufim left a comment

Overall I really enjoyed it; it's clear. I think the ideal audience is smart kids, so I'd explain some of the trickier code snippets in a bit more detail. Otherwise it all looks good.

.. code:: shell

$ python3 -c "import torch; print(torch.__version__)"
1.10.0+cpu
Member

I'd remove this line so you don't have to keep updating the tutorial with each new version.

Member Author

done

# convert opencv output from BGR to RGB
image = image[:, :, [2, 1, 0]]

NOTE: You can get even more performance by training the model directly with OpenCV's BGR data format to remove the conversion step.
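
For context, the channel flip in the snippet above is just reversing the last axis of the array; a minimal sketch with a dummy frame (numpy stands in for the BGR image OpenCV's VideoCapture would return):

```python
import numpy as np

# Dummy 2x2 "frame" in BGR channel order, as OpenCV returns it.
image = np.array([[[255, 0, 0], [0, 255, 0]],
                  [[0, 0, 255], [10, 20, 30]]], dtype=np.uint8)

# Fancy-indexing the channel axis with [2, 1, 0] swaps B and R.
rgb = image[:, :, [2, 1, 0]]

# Equivalent to reversing the last axis with a slice:
assert (rgb == image[:, :, ::-1]).all()

print(rgb[0, 0])  # the blue BGR pixel, now in RGB order
```
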
Member

you can link to a doc here

Member Author

I don't really have any docs to link here; this would be done on the training dataset.


preprocess = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
Member

Maybe explain briefly what this is. This tutorial would be great for someone even if they don't know what PyTorch is; think smart high school students.

Member Author

done

    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
input_tensor = preprocess(image)
input_batch = input_tensor.unsqueeze(0)  # create a mini-batch as expected by the model
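
To make the shapes concrete, here's a small numpy sketch of what Normalize and unsqueeze(0) do (illustrative only; the real code uses torchvision.transforms):

```python
import numpy as np

# A dummy 224x224 RGB image already scaled to [0, 1] and laid out
# channels-first (C, H, W), as transforms.ToTensor() produces.
img = np.random.rand(3, 224, 224).astype(np.float32)

mean = np.array([0.485, 0.456, 0.406], dtype=np.float32).reshape(3, 1, 1)
std = np.array([0.229, 0.224, 0.225], dtype=np.float32).reshape(3, 1, 1)

# Normalize applies a per-channel (x - mean) / std, like transforms.Normalize.
input_tensor = (img - mean) / std       # shape (3, 224, 224)

# unsqueeze(0) adds a leading batch dimension of size 1.
input_batch = input_tensor[np.newaxis]  # shape (1, 3, 224, 224)

print(input_tensor.shape, input_batch.shape)
```
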
Member

You can also briefly elaborate here; you could add shape annotations on top of input_tensor and input_batch.

Member Author

done

MobileNetV2: Quantization and JIT
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

For optimal performance we want a model that's quantized and fused. Quantized
Member

It's kinda cool that the Raspberry Pi supports int8; definitely something I was surprised to learn.

Member Author

Thankfully it doesn't require specialized hardware support. Most CPUs support 8-bit operations since they're cheap to add hardware-wise.
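
To make the int8 point concrete, here's a minimal sketch of affine quantization with a hypothetical quantize/dequantize pair (illustrative only; this is not PyTorch's internal scheme, which also tracks per-channel parameters):

```python
import numpy as np

def quantize(x, num_bits=8):
    """Affine-quantize a float array to uint8; return values and parameters."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = (x.max() - x.min()) / (qmax - qmin)
    zero_point = round(qmin - x.min() / scale)
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Map uint8 values back to approximate floats."""
    return (q.astype(np.float32) - zero_point) * scale

x = np.array([-1.0, -0.5, 0.0, 0.5, 1.0], dtype=np.float32)
q, scale, zp = quantize(x)
x_hat = dequantize(q, scale, zp)

# Reconstruction error is at most about one quantization step (scale).
print(float(np.abs(x - x_hat).max()))
```

The arithmetic inside the model then happens on the uint8 values, which is why ordinary CPUs get a speedup without any special accelerator.
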

Member Author

QNNPACK is designed for mobile ARM devices, so it works surprisingly well on an arm64 Raspberry Pi.

from torchvision import models
net = models.quantization.mobilenet_v2(pretrained=True, quantize=True)

We then want to JIT the model to reduce Python overhead and fuse any ops. JIT gives us ~30 fps instead of ~20 fps without it.
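
The jit step itself is a one-liner with torch.jit.script; a minimal sketch using a hypothetical tiny module standing in for the quantized MobileNetV2:

```python
import torch
from torch import nn

# Toy stand-in module; the tutorial jits the quantized MobileNetV2 from
# torchvision instead.
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def forward(self, x):
        return self.fc(x)

net = TinyNet().eval()

# torch.jit.script compiles the model to TorchScript, removing per-call
# Python interpreter overhead and enabling op fusion.
scripted = torch.jit.script(net)
out = scripted(torch.randn(1, 4))
print(out.shape)  # torch.Size([1, 2])
```
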
Member

Kinda interesting that vanilla PyTorch worked fine. Curious if you tried using PyTorch Live?

Member Author

I think PyTorch Live is only for iOS/Android https://pytorch.org/live/docs/tutorials/get-started/ -- RPi is just aarch64 Linux so standard PyTorch works out of the box

most of the work to fuse and quantize has already been done for you so you can
directly deploy with good performance on a Raspberry Pi.

See more:
Member

I'd like to see a gif of the model working; you could even showcase what it's seeing. It would also be nice to share in tweet threads.

Member Author

Yup, definitely planning on it. No more mysterious gdrive Python packages.
