Proposal: add support for pull/create/run by immutable identifier #10740

ncdc · 2015-02-12T18:04:51Z

Summary

We'd like to add support for using immutable image identifiers when pulling images from a v2 registry, creating containers, and running containers.

Background

Use case

When I create a container, I may specify an image such as mysql:latest. When the image is pulled, latest is resolved to a particular image at that point in time. If I later want to add more containers (e.g. possible read slaves in the MySQL case), ideally all the new containers would use the exact same image as my first container. Using a tag isn't sufficient as the tag is mutable.

V2 registry support

As part of distribution/distribution#46, the v2 registry will be adding support for retrieving an image manifest for a particular digest. This feature gives us what we need, as long as the Docker CLI and Engine support it too.

Proposed CLI/Engine changes

We'll need to provide a means to reference an image by its digest. One possible example might be

namespace/repository@digest

We'll need to make sure the following commands continue to work as they currently do, as well as with an optional digest:

docker pull
docker create
docker run

When listing images via docker images, we could default to displaying only the "current" values for each image and tag. An optional flag could enable displaying all values for each image and tag; namely, this would show 1 entry for each image/tag/digest combination.

Questions

What about v1 registry support?
It's not likely we'll be able to support this

If I create an image locally via docker tag or docker commit, can I refer to it by tag + digest?
As proposed in distribution/distribution#46, the registry is responsible for determining an image's digest and assigning it to the image. For an image that has not yet been pushed to a v2 registry, it may not be possible to refer to it by tag + digest. This is unlikely to be a significant issue, as the use case for tag + digest is consistent deployments using images pulled from registries. Or, if the community thinks this should be supported, we can revisit what component(s) are responsible for calculating digests.

The text was updated successfully, but these errors were encountered:

miminar · 2015-02-16T14:15:31Z

I'd suggest adding docker build (the FROM statement).

As for the image specification, if we already request a particular digest, is there a need to specify tag as well? IMHO tag part could be optional the same way as a digest part (namespace/repository[:tag][@digest]). If a particular image ID has multiple tags assigned, they would be pulled all. Supplying both tag and digest would pull just one tag and would imply additional existence check (particular tag exists for given digest).

ncdc · 2015-02-16T20:48:11Z

@miminar while this isn't coded yet to my knowledge in the v2 registry, the proposal re pulling by digest requires specifying repository, tag, and digest, which is why I wrote this proposal the way I did. I'd be fine with repository & digest without tag, but I'll defer to @stevvooe on this.

stevvooe · 2015-02-16T21:27:33Z

TL; DR Let's support <name>:<tag>@<digest> but make it easy to start supporting <name>@<digest>.

@miminar @ncdc The original compromise of distribution/distribution#46 required that immutable references includes a "tag" and "digest", hence why these proposals have this requirement. This was due to the fact that the new manifests have a "tag" field. I doubt we'll ever drop the requirement for specifying the namespace.

With distribution/distribution#62 and distribution/distribution#173, we intend to remove the requirement for a "tag" in the manifests. It would no longer be required when pulling manifests by digest.

For upcoming proposals in immutable manifest references, we should consider the following:

All proposals should support the following syntax:
```
<name>:<tag>@<digest>
```
This refers to a specific manifest, with a specified tag and revision.
We should optionally consider the tag-less syntax for referring to manifests:
```
<name>@<digest>
```
This is dependent on at least doc/spec: generic distribution content manifests distribution/distribution#62 and doc/spec: tags as a first class object distribution/distribution#173. This can be supported without API changes on the server-side with the blob API.

This can be implemented by defining a "image object reference" (working on the nomenclature) to always have three components:

name: Identifies the collection of image objects (repository) under which the object exists
tag (optional): Tag optionally specifies a named reference to a specific object.
digest (optional): Identifies the specific object by digest to be referenced.

The goal of the parser would be to identify whatever is required for the level of support specified at implementation time. If initial implementation (proposed above) requires all three, then it will error out when a tag is missing. If we decide we want to support item 2, then it not longer errors out when tag is missing and we proceed.

We may also want to define the minimum level of specification for a reference. Under certain cases, only name is required but for other cases, a digest-qualified reference is necessary.

ncdc · 2015-02-17T14:32:03Z

@stevvooe I'm definitely in favor of <name>@<digest>, as I assume that makes housekeeping in the registry's storage easier (no need to preserve tag history). We do not need name+tag+digest for our use case.

ncdc · 2015-02-18T15:39:47Z

@stevvooe what do you think we should be targeting for 1.6? Only name + tag + digest?

stevvooe · 2015-02-18T17:52:10Z

@ncdc I think we should target name + digest. We may need to adjust the proposed routes in distribution/distribution#46 to overload the tag routes to support manifest digest, but that is more than reasonable.

ncdc · 2015-02-18T17:59:10Z

@stevvooe that makes sense to me. It should be easier to implement than name + tag + digest, I would assume.

miminar · 2015-02-20T08:44:32Z

@ncdc @stevvooe Will we support shortened digests (7 characters and more) - similar to git? If we can get all available manifest digests from registry, it should be possible. Currently I don't see a way to obtain it with recent API specification though. IMHO this would greatly benefit to usability. 64 mandatory characters on command line is way too much:

docker pull registry.access.redhat.com/rhel7@e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855

If, by a chance, two or more digests matched given short version, user could be asked to specify full digest.

ncdc · 2015-02-20T11:58:14Z

This feature is most likely to be used as part of an automated system I would think, like Kubernetes, where a user says "I want to deploy foo/bar:latest" and the system resolves that to the digest for that tag at that point in time. Subsequent deployments would use the resolved digest.

Having said that, I don't have any problem supporting shortened digests.

Also, just to note, the id you have in your example below is a v1 image id; v2 digests include the algorithm as a prefix.

Sent from my iPhone

On Feb 20, 2015, at 3:45 AM, Michal Minar notifications@github.com wrote:

@ncdc @stevvooe Will we support shortened digests (7 characters and more) - similar to git? If we can get all available manifest digests from registry, it should be possible. Currently I don't see a way to obtain it with recent API specification though. IMHO this would greatly benefit to usability. 64 mandatory characters on command line is way too much:

docker pull registry.access.redhat.com/rhel7@e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
If, by a chance, two or more digests matched given short version, user could be asked to specify full digest.

—
Reply to this email directly or view it on GitHub.

miminar · 2015-02-20T12:12:17Z

Also, just to note, the id you have in your example below is a v1 image id; v2 digests include the algorithm as a prefix.

Oh, I see, thanks for the correction. So the command would actually look like this:

docker pull registry.access.redhat.com/rhel7@sha256:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855

even more scary...

ncdc · 2015-02-20T12:40:51Z

Right, but again, this probably isn't something a human would regularly invoke.

stevvooe · 2015-02-20T18:36:27Z

@miminar The goal of this proposal is to provide a secure, reproducible way of fetching an image by its manifest digest. Syntactic sugar is outside the scope of this proposal.

That said, I agree long id strings are indeed unwieldy. I've filed distribution/distribution#194 in response so we can find secure, consistent and simple way of accomplishing this.

aluzzardi · 2015-02-24T02:11:09Z

+1

This would be awesome for Swarm.

/cc @vieux

stevvooe · 2015-02-25T21:44:43Z

@icecrime @jfrazelle @crosbymichael Could you take a peak at this?

icecrime · 2015-02-26T00:41:04Z

SGTM overall, with a few questions.

I agree that supporting <name>@<digest> makes perfect sense, I'm just worried about the way things will show up in docker images output. The same goes for that sentence:

When listing images via docker images, we could default to displaying only the "current" values for each image and tag. An optional flag could enable displaying all values for each image and tag; namely, this would show 1 entry for each image/tag/digest combination.

Does it mean that I can have multiple entries for REPOSITORY=ubuntu, TAG=latest, and different IMAGE ID? I'd rather not display the tag at all when we did a "digest pull".

ncdc · 2015-02-26T00:47:13Z

@icecrime I'm not sure what the best UX for this will be. However, since I originally wrote this proposal, after talking about it with @stevvooe, we think it doesn't make sense to try to support name:tag@digest. I have updated the issue text above to say just name@digest.

icecrime · 2015-02-26T03:47:26Z

So I guess this adds even more weight to my "don't even try to show a tag in docker images output" remark ;-)

ncdc · 2015-02-26T13:56:35Z

@icecrime here's what I was thinking with docker images, tags, and digests... Let's say you do this:

docker pull foo/bar:latest

And that gives you back image id i1 and digest d1. Time passes... foo/bar:latest gets updated. You do another pull, returning image id i2 and digest d2. So you've done 2 pulls of foo/bar:latest, neither pull was by digest - what should we show in docker images?

And I guess more generally: if we have images that aren't currently assigned to a tag, either from the above case, or just from pulling by digest, how/where should we display them?

sghosh151 · 2015-02-26T22:01:50Z

With v2 seeming to have 256 char limit - will that apply to "name" or "name@digest"? The SHA sum would take up a number of chars.

stevvooe · 2015-02-26T22:04:52Z

@sghosh151 The limit applies only to the name.

ncdc · 2015-03-02T14:47:25Z

@stevvooe wrote in #10740 (comment) about having a parser that can return an "image object reference". Right now, parsers.ParseRepositoryTag takes a string, parses it, and returns a repository string and a tag string. In my prototype for this feature, I modified this method to return either the digest or the tag in the 2nd returned string. Doing it this way means that the remaining changes to support referring to images either by tag or by digest relatively minimal; however, it does muddy the waters a bit, since ParseRepositoryTag is now returning something that is either a digest or a tag. I've thought about a few possibilities for making this cleaner:

Rename ParseRepositoryTag to ParseRepositoryReference, so it's clearer that it's not always a tag that comes back
Do the rename from above, but also modify the signature to return (repository, tag, digest), where only 1 of tag and digest is ever set at a time
Return a type, perhaps called ImageReference, that looks like this:

type ImageReference struct {
  repository string // or possibly registry.RepositoryInfo instead of string
  tag string
  digest string
}

This ImageReference option would be a more invasive change, as anywhere ParseRepositoryTag is called will have to be modified to work with a struct instead of 2 strings.

@icecrime @jfrazelle @crosbymichael what are your thoughts?

miminar · 2015-03-02T15:15:24Z

I'm in favor of a new type - parsing will be done just once for every request. Loads of checks for presence of ':' in a tag don't look nice. Passing along more than two values with different semantics encoded a single string is getting cumbersome.

stevvooe · 2015-03-03T01:37:45Z

@ncdc Option 3 above is the best approach, even if ImageReference is simply type ImageReference string with a few access methods. Stringly typed data is a no-no. ;)

ncdc · 2015-03-03T01:46:46Z

@stevvooe I'll do whatever you guys think makes the most sense. Do you want a type ImageReference string with

Repository() string
Tag() string
Digest() string

or does making it an actual struct make more sense? Should Repository() return a string or a RepositoryInfo?

ncdc · 2015-03-03T02:27:00Z

Moving discussion of image reference to the PR here #11109 (comment)

aluzzardi · 2015-04-10T22:20:02Z

Was this solved by #11109?

jessfraz · 2015-04-10T22:29:05Z

yep should be @ncdc let me know if you disagree

ncdc · 2015-04-10T22:42:26Z

All good, thanks!

Sent from my iPhone

On Apr 10, 2015, at 6:30 PM, Jessie Frazelle notifications@github.com wrote:

yep should be @ncdc let me know if you disagree

—
Reply to this email directly or view it on GitHub.

stevvooe mentioned this issue Feb 17, 2015

doc/spec: tags as a first class object distribution/distribution#173

Closed

stevvooe mentioned this issue Feb 20, 2015

propose approach and syntax for "sugary" content digests distribution/distribution#194

Closed

stevvooe mentioned this issue Feb 23, 2015

Add support for pull on all hosts (no change in docker run) docker-archive/classicswarm#349

Merged

estesp added the Proposal label Feb 25, 2015

stevvooe mentioned this issue Feb 25, 2015

Immutable image manifest references distribution/distribution#46

Closed

4 tasks

jessfraz added kind/feature Functionality or other elements that the project doesn't currently have. Features are new and shiny and removed kind/feature Functionality or other elements that the project doesn't currently have. Features are new and shiny labels Feb 26, 2015

stevvooe mentioned this issue Feb 26, 2015

doc/spec, registry: immutable manifest reference support distribution/distribution#211

Merged

ncdc mentioned this issue Mar 2, 2015

Add support for referring to images by digest #11109

Merged

2 tasks

jlhawn mentioned this issue Mar 12, 2015

Add support for referring to images by digest #11341

Closed

2 tasks

thaJeztah mentioned this issue Apr 8, 2015

Is there some way to build the unique image? #12173

Closed

jessfraz closed this as completed Apr 10, 2015

thaJeztah mentioned this issue Apr 15, 2015

Docker Daemon need to check if the image is update-to-date version #12408

Closed

thaJeztah mentioned this issue May 22, 2015

docker run doesn't pull down latest image if the image exists locally #13331

Closed

thaJeztah mentioned this issue Mar 14, 2016

Fix Docker pull examples #20947

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: add support for pull/create/run by immutable identifier #10740

Proposal: add support for pull/create/run by immutable identifier #10740

ncdc commented Feb 12, 2015

miminar commented Feb 16, 2015

ncdc commented Feb 16, 2015

stevvooe commented Feb 16, 2015

ncdc commented Feb 17, 2015

ncdc commented Feb 18, 2015

stevvooe commented Feb 18, 2015

ncdc commented Feb 18, 2015

miminar commented Feb 20, 2015

ncdc commented Feb 20, 2015

miminar commented Feb 20, 2015

ncdc commented Feb 20, 2015

stevvooe commented Feb 20, 2015

aluzzardi commented Feb 24, 2015

stevvooe commented Feb 25, 2015

icecrime commented Feb 26, 2015

ncdc commented Feb 26, 2015

icecrime commented Feb 26, 2015

ncdc commented Feb 26, 2015

sghosh151 commented Feb 26, 2015

stevvooe commented Feb 26, 2015

ncdc commented Mar 2, 2015

miminar commented Mar 2, 2015

stevvooe commented Mar 3, 2015

ncdc commented Mar 3, 2015

ncdc commented Mar 3, 2015

aluzzardi commented Apr 10, 2015

jessfraz commented Apr 10, 2015

ncdc commented Apr 10, 2015

Proposal: add support for pull/create/run by immutable identifier #10740

Proposal: add support for pull/create/run by immutable identifier #10740

Comments

ncdc commented Feb 12, 2015

Summary

Background

Use case

V2 registry support

Proposed CLI/Engine changes

Questions

miminar commented Feb 16, 2015

ncdc commented Feb 16, 2015

stevvooe commented Feb 16, 2015

ncdc commented Feb 17, 2015

ncdc commented Feb 18, 2015

stevvooe commented Feb 18, 2015

ncdc commented Feb 18, 2015

miminar commented Feb 20, 2015

ncdc commented Feb 20, 2015

miminar commented Feb 20, 2015

ncdc commented Feb 20, 2015

stevvooe commented Feb 20, 2015

aluzzardi commented Feb 24, 2015

stevvooe commented Feb 25, 2015

icecrime commented Feb 26, 2015

ncdc commented Feb 26, 2015

icecrime commented Feb 26, 2015

ncdc commented Feb 26, 2015

sghosh151 commented Feb 26, 2015

stevvooe commented Feb 26, 2015

ncdc commented Mar 2, 2015

miminar commented Mar 2, 2015

stevvooe commented Mar 3, 2015

ncdc commented Mar 3, 2015

ncdc commented Mar 3, 2015

aluzzardi commented Apr 10, 2015

jessfraz commented Apr 10, 2015

ncdc commented Apr 10, 2015