Add multi-stage build support #3383

Closed
emilevauge opened this issue Aug 29, 2017 · 14 comments

@emilevauge
Contributor

Hi folks,
I'm opening this issue to track multi-stage build support in official-images.
Any ETA on this?

@StefanScherer

StefanScherer commented Sep 8, 2017

There is the same demand for building an optimal Node.js image in nodejs/docker-node#362, as Windows isn't as good at cleaning up temp files, packages, and cached MSIs.

  • Windows builds still use Docker 1.12.2-cs2-ws-beta (17.06.1-ee-2 is available)
  • ARM builds already use Docker 17.06.1-ce
  • AMD builds use Docker 17.03.2-ce

Is there anything the community can help with?

@yosifkit
Member

Here is the list we have come up with so far; feel free to comment with more.

Positive points:

  • can generate nanoserver images where the installer exe would not normally run
  • upcoming nanoserver (version 1709) based images are mostly useless without multi-stage since they have no PowerShell
  • can download or generate a full rootfs (no more huge tarballs in git with force pushed branches)
  • large build container with small end result (hello-world, traefik)

Counter points:

  • can mostly be accomplished via long RUN lines (see the sketch after this list for the contrast)
  • increases space required on build servers
  • can greatly increase build time (busybox building binaries: 37 min for just amd64, vs a Dockerfile with ADD busybox.tar.xz /)
  • updates to either base image will require a rebuild and push
  • is a single binary image useful?
    • hard to debug, no shell or other utilities
    • saves about 4 MB to be from scratch vs alpine
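
For context, the pattern being weighed here looks roughly like this (a hypothetical sketch, not an actual official-images Dockerfile); the single-stage equivalent has to install, build, and clean up inside one long RUN so the toolchain never lands in a layer, and it cannot start FROM scratch:

```Dockerfile
# Hypothetical sketch of a multi-stage build: large build stage, tiny result.
FROM alpine:3.6 AS build
RUN apk add --no-cache build-base \
    && echo 'int main(void){ return 0; }' > hello.c \
    && gcc -static -o /hello hello.c

FROM scratch
# Only the static binary is carried over; the compiler stays behind.
COPY --from=build /hello /hello
CMD ["/hello"]
```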

This is more complex than just the Docker version on the systems building the images.

  • how do we clean dangling images without invalidating cache of every multi-stage image?
  • hardcoded assumptions of a single FROM in many places
    • bashbrew itself
      • one example of many: in order to provide more robust build caching, bashbrew does a docker tag on an image with a hash of unique bits that includes the parent image ID
    • many jenkins jobs, shell scripts, tooling, etc used in building, tagging, pushing
      • finding instances of this assumption is difficult, but it is still only half the battle; for example, we use the Architectures of the parent image of a given Dockerfile to determine which Architectures that image supports -- in a multi-stage world, that instead needs to be the set intersection of all the parent images' Architectures (see the sketch after this list)
  • opinion: multi-stage build promotes messy Dockerfile since the first image isn't pushed
    • can be slightly mitigated by requiring same good practices throughout Dockerfile
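
To illustrate the Architectures point above: the supported architectures of a multi-stage image would have to become the set intersection of every parent's list, roughly like this (hypothetical values, not actual bashbrew output):

```console
$ comm -12 <(printf '%s\n' amd64 arm32v7 arm64v8 | sort) \
           <(printf '%s\n' amd64 arm64v8 windows-amd64 | sort)
amd64
arm64v8
```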

@StefanScherer

Thanks @yosifkit for the comprehensive list of pros/cons to get a better understanding of what has to be considered.
For Windows images we still need an updated Docker engine, as I doubt that the currently installed Docker version is able to run the upcoming 1709 images properly. Once that is in place we could use multi-stage builds for reasonable nanoserver-based images.
I'm used to disposable build agents like Travis/Circle/AppVeyor, where caching first stages or cleaning up dangling images happens within one build agent's life cycle, so everything normally gets cleaned up after a build.
I've seen some approaches with Jenkins, but it seems that this is not a common practice.

@friism

friism commented Sep 20, 2017

can greatly increase build time (busybox building binaries: 37 min for just amd64, vs a Dockerfile with ADD busybox.tar.xz /)

If you have an expedient way to build busybox, can't you just still do that? You don't have to use multi-stage builds where it doesn't make sense.

updates to either base image will require a rebuild and push

Only to the second base image, no? If the first one is the same, you could presumably re-use the first artifact. If your build boxes are ephemeral, you can cache intermediate build artifacts in a registry: moby/moby#26839
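
For ephemeral build boxes, that would look roughly like this (a hypothetical sketch; the image names and stage name are made up, and cache reuse across stages has its own caveats in the classic builder):

```bash
# Build and push just the first stage so later builds can reuse it:
docker build --target build -t registry.example.com/foo:build-stage .
docker push registry.example.com/foo:build-stage

# On a fresh build box, seed the cache from the registry:
docker pull registry.example.com/foo:build-stage
docker build --cache-from registry.example.com/foo:build-stage -t foo:latest .
```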

is a single binary image useful?

I think deciding whether to further slim down the slim images is a separate discussion. Even if we don't use this as an opportunity to make more images single-binary, multi-stage builds are useful.

@mickare

mickare commented Feb 27, 2018

tl;dr: multi-stage builds help to clear out removed artifacts and reduce the image size

@yosifkit
increases space required on build servers

I think it is better to use more space on the build servers than to waste pull traffic on removed artifacts left behind in layers.

A prime example is the Ubuntu image, where apt artifacts are deleted but still remain in the first ADD layer. And I fully agree that "repacking the tarballs is out of the question". Unfortunately the --squash option is still experimental. Multi-stage builds do offer a nice solution and more.

So a simple multi-stage build in the Ubuntu example can decrease the image size by 26 MB, and the compressed image on Docker Hub would shrink by 5 MB. That is 5 MB less traffic for each pull; you can imagine what that means for a base image that already has 10M+ pulls. 😄
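
The proposal is roughly the following (a sketch of the idea, not the actual PR; the cleanup step is abbreviated):

```Dockerfile
# Unpack the rootfs and clean up in a throwaway stage...
FROM scratch AS base
ADD ubuntu-artful-core-cloudimg-amd64-root.tar.gz /
RUN apt-get clean && rm -rf /var/lib/apt/lists/*

# ...then flatten the cleaned filesystem into a single layer.
FROM scratch
COPY --from=base / /
CMD ["/bin/bash"]
```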

@mickare

mickare commented Mar 20, 2018

A new point was made by @jouve on my multi-stage copy proposal in the Ubuntu image repo, so I'll drop it here as an update.

😞 It has serious issues... and is therefore not usable.

@jouve tianon/docker-brew-ubuntu-core#119 (comment)
Hi,

this will not work for binaries with the setuid bit, because COPY does not preserve it:

docker build --no-cache . -t jouve/test
Sending build context to Docker daemon  39.31MB
Step 1/6 : FROM scratch as base
 ---> 
Step 2/6 : ADD ubuntu-artful-core-cloudimg-amd64-root.tar.gz /
 ---> 3cbb692107ee
Step 3/6 : RUN ls -l /usr/bin/passwd
 ---> Running in c309e026a953
-rwsr-xr-x 1 root root 54224 Aug 20  2017 /usr/bin/passwd
Removing intermediate container c309e026a953
 ---> 6975740163a1
Step 4/6 : FROM scratch
 ---> 
Step 5/6 : COPY --from=base / /
 ---> fb9f22eb3121
Step 6/6 : RUN ls -l /usr/bin/passwd
 ---> Running in 9ce611e2a914
-rwxr-xr-x 1 root root 54224 Aug 20  2017 /usr/bin/passwd
Removing intermediate container 9ce611e2a914
 ---> 60931d907a8e
Successfully built 60931d907a8e
Successfully tagged jouve/test:latest

@LaurentGoderre
Member

Here is my opinion on multi-stage builds based on my experience with the node image. At the moment, node needs to be compiled for Alpine Linux and this process is LOOOONNG, around 40 minutes per version. In order to reduce this build time we are implementing ccache: basically our Travis build remembers parts of the build and can reuse them, which brings the build time down from 40-50 minutes to less than 5-10.

The catch, however, is twofold: first we need to copy the cache files in, and then the build adds new cache files. For caching to work we need to extract the new cache from the image, which is not possible at build time. This is where multi-stage comes to the rescue.

The first stage builds node; the second one copies the build result into the final image. We can then create a container from the first-stage image (which is entirely independent of the final image), extract the cache files (and any other build-related files for debugging), and then delete that image.
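
In shell terms, the flow is roughly this (a sketch; the stage name and cache path are assumptions, not the actual docker-node tooling):

```bash
# Build only the compile stage, then the full image (the final stage copies the result):
docker build --target builder -t node-build-stage .
docker build -t node:alpine-local .

# Extract the updated ccache from the independent build-stage image:
docker create --name ccache-extract node-build-stage
docker cp ccache-extract:/root/.ccache ./ccache
docker rm ccache-extract
docker rmi node-build-stage
```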

@LaurentGoderre
Member

@tianon @yosifkit is there anything I can do to move this along?

@arthurdm
Contributor

arthurdm commented May 7, 2018

Hi all. The use of multi-stage builds allows for a clean WebSphere Liberty image that has a different Linux OS while still using the official Ubuntu-based WebSphere Liberty images to build the IBM JRE and WebSphere Liberty - this way I am not duplicating the code that builds the IBM JRE / Liberty content.

The Dockerfile is here: https://github.com/WASdev/ci.docker/blob/master/ga/developer/centos/Dockerfile

The PR to get it integrated is #4326.
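
In essence it boils down to something like this (an illustrative sketch, not the linked Dockerfile; the paths are simplified):

```Dockerfile
FROM centos:7
# Reuse the IBM JRE / Liberty content from the official Ubuntu-based image
# instead of rebuilding it here.
COPY --from=websphere-liberty:kernel /opt/ibm /opt/ibm
ENV PATH=/opt/ibm/wlp/bin:$PATH
CMD ["server", "run", "defaultServer"]
```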

Are there issues with this approach?

@tianon
Member

tianon commented May 7, 2018

Are there issues with this approach?

Yes, similar to what @yosifkit has outlined above regarding multi-stage builds in general -- this creates an implicit dependency between two otherwise disparate images that's difficult for our tooling to extract.

@arthurdm
Contributor

arthurdm commented May 8, 2018

hi @tianon - In this case the --from references an official image instead of a named build stage, so shouldn't it be easier to determine the dependency?

Would this be a better fit for the Docker Store instead of Docker Hub?

tianon mentioned this issue Jul 26, 2018
tianon mentioned this issue Jan 4, 2019
@LaurentGoderre
Member

Here is a scenario where multi-stage build support would be very helpful. I'm building a Docker image for the Pachyderm CLI client, which hopefully can become an official image. The client is written in Go. Using multi-stage builds, I was able to create really small images that don't need the entire golang image. However, if I couldn't use multi-stage builds I would have to either create an image that copies the golang image and deletes the content at the end, or use the golang image itself and end up with a huge image.
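
A hypothetical sketch of the kind of Dockerfile this enables (image versions and paths are illustrative, not the actual Pachyderm build):

```Dockerfile
# Build stage: full Go toolchain.
FROM golang:1.12 AS build
WORKDIR /src
COPY . .
RUN CGO_ENABLED=0 go build -o /pachctl ./cmd/pachctl

# Final stage: just the static binary on a small base.
FROM alpine:3.9
COPY --from=build /pachctl /usr/local/bin/pachctl
ENTRYPOINT ["pachctl"]
```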

@tianon
Member

tianon commented May 17, 2019

First step towards being able to actually support this is up at #5929.

@tianon
Member

tianon commented Aug 14, 2019

#5929 and https://github.com/docker-library/faq#multi-stage-builds implement the basics of this -- there are still a few dangling scripts that have edge cases (as noted above) but we're finding/fixing them as we go
