Support multi-arch image for base-notebook #1202

romainx · 2020-12-21T12:08:54Z

Hello,

I'm drafting a long PR to support several CPU architectures (multi-arch) for the base-notebook image: linux/amd64 (default), linux/arm64, linux/ppc64le. It seems worthwhile to offer an official base Jupyter image for alternative CPU architectures such as the rising ARM. Tools are now available that make this easy:

- Docker buildx to build images for different platforms,
- QEMU and qemu-user-static to emulate architectures other than the host's,
- Miniforge, which provides installers for the different architectures,
- OS and packages / libraries ready to use on different CPU architectures.

And it works 🎉
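For readers who have not used these tools together, here is a minimal sketch of how the three target platforms could drive a single buildx invocation and the choice of Miniforge installer. It is not taken from the PR: the helper names and the `:multiarch` tag are hypothetical, and only the `docker buildx build` flags and the Miniforge installer file names are real.

```python
# Sketch only: helper names and the image tag are hypothetical, not from the PR.
PLATFORMS = ["linux/amd64", "linux/arm64", "linux/ppc64le"]

# Docker architecture name -> architecture suffix used in Miniforge installer names.
MINIFORGE_ARCH = {"amd64": "x86_64", "arm64": "aarch64", "ppc64le": "ppc64le"}


def buildx_command(tag, context=".", platforms=PLATFORMS, push=False):
    """Return the argument list for one multi-platform `docker buildx build`."""
    cmd = ["docker", "buildx", "build", "--platform", ",".join(platforms), "--tag", tag]
    if push:
        # Multi-platform results are pushed to a registry as a manifest list.
        cmd.append("--push")
    cmd.append(context)
    return cmd


def miniforge_installer(platform):
    """Map a Docker platform string such as 'linux/arm64' to an installer file name."""
    arch = platform.split("/")[1]
    return f"Miniforge3-Linux-{MINIFORGE_ARCH[arch]}.sh"


if __name__ == "__main__":
    print(" ".join(buildx_command("jupyter/base-notebook:multiarch", context="base-notebook")))
    for platform in PLATFORMS:
        print(platform, "->", miniforge_installer(platform))
```

The command is returned as a list so it can be passed to `subprocess.run` without shell quoting; building for a non-native platform still needs QEMU registered on the host.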

Todo list

- Build images for: amd64, arm64 and ppc64le
- Test them by using the same code
- Documentation update
  - Maintainer / collaborator
  - User documentation
    - supported architectures
    - limitations (for example pandoc is available only for amd64)
- Manage the impact on the manifest / hook
- Makefile finalisation → needs some polish, moving things around, comments, etc.
- Remove legacy Dockerfile.ppc64le
- Push target finalisation and test
- The merge (if any) will have to be a squash, since I have made a lot of trial-and-error commits to benefit from GitHub CI.
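Running the same tests on every architecture means a few of them have to be skipped where a package is unavailable. The following is only a sketch of one way to decide that, not the code from this PR: the `UNSUPPORTED` table and both function names are hypothetical, and the pandoc entry simply mirrors the "Skip pandoc test for ppc64le" commit.

```python
import platform

# Hypothetical table: feature -> architectures on which its tests are skipped.
UNSUPPORTED = {"pandoc": {"ppc64le"}}

# Names reported by platform.machine() -> Docker-style architecture names.
ALIASES = {"x86_64": "amd64", "aarch64": "arm64"}


def normalize_arch(machine):
    """Translate a kernel machine name into a Docker-style architecture name."""
    machine = machine.lower()
    return ALIASES.get(machine, machine)


def should_skip(feature, machine=None):
    """Return True when tests for `feature` should be skipped on this architecture."""
    arch = normalize_arch(machine or platform.machine())
    return arch in UNSUPPORTED.get(feature, set())


if __name__ == "__main__":
    print("skip pandoc tests here:", should_skip("pandoc"))
```

With pytest, the result could feed a marker such as `pytest.mark.skipif(should_skip("pandoc"), reason="pandoc not available")`. Note that `platform.machine()` describes the machine running the test process, so this only helps if the check runs inside the emulated container.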

Pros

- Offers Jupyter images for other CPU architectures
- Transparent for the user, since the appropriate architecture will be pulled automatically from DockerHub (no increase of image size)
- One Dockerfile to rule them all: the same Dockerfile is used, so there is no duplication
- The same tests are run against all images
- Gets rid of the legacy ppc64le support

Cons

- Increased build time
- Increased complexity (the build is more complex)
- Limited (at this time) to the base-notebook image, but could be extended
- Makes the maintenance of the stack harder (more potential issues, more complex updates, etc.)

Your feedback is welcome as much on the usefulness of such a feature as on how to implement it.

Best


mathbunnyru · 2020-12-25T12:21:49Z

@romainx this is so cool 👍

I do like the idea of being able to build many architectures, I think that's a big improvement.

But I have one concern: I don't like having a lot of code in the Makefile.
I mean, it's not the right language to put logic in.
Maybe you have something in mind about how to change the build process to make our scripts easier?

bollwyvl · 2020-12-25T14:52:50Z

> Maybe you have something in mind about how to change the build process to make our scripts easier?

I'm a big fan of doit. While it has some quirks, it can yield quite robust automation, e.g. only do _this_ if _these_ files (or the output of _this_ command) change, and, being Python, it is generally easier to follow than Makefile syntax. SCons and Pants are also useful, but have additional opinions and syntax that are somewhat less flexible... and have more dependencies than doit.
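To make the "only do _this_ if _these_ files change" idea concrete, here is a minimal `dodo.py` sketch. The file paths and the image tag are assumptions for illustration, not the project's actual layout; the `file_dep`, `actions` and `verbosity` keys are standard doit task metadata.

```python
# dodo.py -- minimal sketch; the paths and the image tag are hypothetical.


def task_build_base():
    """Rebuild the image only when one of its input files has changed."""
    return {
        # doit stores a signature of each file_dep and reruns the task
        # only if one of them differs from the last successful run.
        "file_dep": ["base-notebook/Dockerfile", "base-notebook/start.sh"],
        "actions": ["docker build --tag base-notebook:dev base-notebook"],
        # Show the command's output while it runs.
        "verbosity": 2,
    }
```

Running `doit build_base` twice in a row should execute the build the first time and report the task as up to date the second time, as long as the listed files are untouched.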

romainx · 2020-12-26T09:16:12Z

@mathbunnyru thanks for your feedback. It's only a first draft, my goal was to prototype it while keeping the existing build process. And yes the Makefile has become less maintainable with this addition.

@bollwyvl I did not know about doit, thanks for the idea. I will definitely have a look at it 👀.

You're right, if we put that in place the build process will need some improvement. One of the main reasons is that it's a bit monolithic and will not scale well with this kind of evolution. That becomes a drawback when it comes to adding new images or features.

The advantage of having a Makefile vs. full GitHub Actions is that we keep the ability to run things locally without GitHub. I think we need to keep this capability. I will have a look at doit.

bollwyvl · 2020-12-26T14:12:28Z

Cool!

doit works pretty well on GitHub Actions, especially if the problems are trivially parallelizable, or if there is a "do some up-front linting" task, e.g. doit lint, which then fans out to "build XX things", e.g. doit build:$SOME_THING, and then fans in to "do a report", e.g. doit report, where each is just performing a small part of the total. But from a local perspective, having doit all be a target that does what the entire workflow would is really important, especially for reviewing PRs, etc. The doit CLI is kinda picky, so it can be easiest to handle complex behavior with matrix variables, e.g. BUILDING_IN_CI, so that you don't run the linter again or try to rebuild things when reporting/uploading.
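As a rough illustration of that lint / fan-out / fan-in shape, a `dodo.py` could look like the sketch below. The `echo` actions are placeholders and the architecture list is an assumption; `BUILDING_IN_CI` is read as an environment variable here, which is one possible reading of the suggestion above.

```python
# dodo.py -- sketch of the lint -> build:<arch> -> report shape described above.
import os

ARCHS = ["amd64", "arm64", "ppc64le"]  # assumed list of build targets

# When set, each CI job runs its own step and must not re-trigger earlier ones.
IN_CI = bool(os.environ.get("BUILDING_IN_CI"))


def task_lint():
    """Up-front linting (placeholder action)."""
    return {"actions": ["echo linting"]}


def task_build():
    """Fan out: one sub-task per architecture, runnable as `doit build:arm64`."""
    for arch in ARCHS:
        yield {
            "name": arch,
            "actions": [f"echo building for {arch}"],
            "task_dep": [] if IN_CI else ["lint"],
        }


def task_report():
    """Fan in: summarise once every build sub-task has run."""
    return {"actions": ["echo reporting"], "task_dep": [] if IN_CI else ["build"]}


def task_all():
    """Group task so that `doit all` runs the whole workflow locally."""
    return {"actions": None, "task_dep": ["lint", "build", "report"]}
```

Locally, `doit all` runs the three stages in order; in CI, a job would export `BUILDING_IN_CI=1` and call only its own step, e.g. `doit build:arm64`.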

One thing about GHA: it's rather hard to share data between multiple workflows, but between tasks is pretty decent.

Getting all this to play nicely with the GHA cache can be a little annoying, but is usually worth it... docker's a bear, and rate limits are no fun, so if important base images/layers can be cached, without caching the built product, things should work better than naively pulling every time.

Something that helps a lot for this: a task's need/readiness to be run can be based on the partial contents of a parsed file. What I've ended up doing is parsing a matrix from a workflow YAML as the source of truth for what needs building. This leads to slightly more complex Python (handling includes and excludes, for example), but the resulting machine hums pretty well: adding a new excursion that is a combination of existing features can be a one-liner, while changing some unrelated part of the file won't trigger a rebuild of everything.
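A simplified sketch of that matrix handling might look like this. It assumes the workflow YAML has already been loaded into a dict (for example with PyYAML's `yaml.safe_load`), and it treats every `include` entry as an extra combination, which is looser than the full GitHub Actions semantics. The function name and the example matrix are hypothetical.

```python
from itertools import product


def expand_matrix(matrix):
    """Expand a GHA-style matrix dict into a list of build combinations.

    Simplification: `include` entries are appended as extra combinations
    rather than merged into existing ones as GitHub Actions does.
    """
    axes = {k: v for k, v in matrix.items() if k not in ("include", "exclude")}
    combos = [dict(zip(axes, values)) for values in product(*axes.values())]

    def matches(combo, pattern):
        # A pattern matches when every key it names has the same value in combo.
        return all(combo.get(key) == value for key, value in pattern.items())

    excludes = matrix.get("exclude", [])
    combos = [c for c in combos if not any(matches(c, ex) for ex in excludes)]

    for extra in matrix.get("include", []):
        if extra not in combos:
            combos.append(dict(extra))
    return combos


if __name__ == "__main__":
    # Hypothetical matrix, as it would look after loading the workflow YAML.
    example = {
        "image": ["base-notebook"],
        "arch": ["amd64", "arm64", "ppc64le"],
        "exclude": [{"arch": "ppc64le"}],
        "include": [{"image": "minimal-notebook", "arch": "amd64"}],
    }
    for combo in expand_matrix(example):
        print(combo)
```

Each resulting dict could then become one doit sub-task, so adding a new combination to the workflow file is the only change needed to get a new build.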

Once outside of the task planning, it can be useful to just have dodo.py handle caching and looping over lists, and have each step be a small, independently runnable Python/bash script, so it's easy to test Just One Thing.

Happy to help in any way!

romainx · 2020-12-27T11:04:56Z

@bollwyvl thank you, very interesting!
As we said, improving the build is a prerequisite for implementing this kind of feature. I've created a dedicated issue for that: #1203. Any help is welcome!

romainx · 2021-01-07T06:41:18Z

Hello, I'm closing this PR for now, since I think it's not the priority and since some prerequisites are not ready. We will see in the future if it's worth supporting multi-arch images.

Best

romainx added 12 commits December 20, 2020 21:57

First try: multi-arch base-image

911c7af

setup qemu ext, setup buildx

cab0715

Debug to check if the error comes from QEMU

f5d397d

Setup QEMU

1d78b73

Enabling tests on multi-arch

0349c2a

Fix pandoc on ARM

a9d243a

Fixing sudo tests

115aefa

ability to skip some tests according to the platform arch

60ae918

Done for pandoc on arm64

ppc64le

c28d5fd

Oops remove debug comments

34c5d89

Skip pandoc test for ppc64le

49f555e

Uncomment clone wiki

258ceb5

romainx marked this pull request as draft December 21, 2020 12:11

romainx added the type:Enhancement label (a proposed enhancement to the docker images) Dec 21, 2020

romainx mentioned this pull request Dec 27, 2020

Improve monolithic build #1203

Closed

romainx closed this Jan 7, 2021

romainx mentioned this pull request Feb 13, 2021

M1 Apple's MacBooks support #1238

Closed

romainx mentioned this pull request Apr 17, 2021

Support ARM architecture (multi-arch images) #1019

Closed

8 tasks

mathbunnyru mentioned this pull request Jun 16, 2021

Make the base & minimal notebook containers not amd specific (e.g. support building for arm64) #1368

Merged
