Add platform.Feature #761

ChristianKniep · 2019-02-19T20:38:30Z

As discussed on the mailing list, this is a PR to bring platform.features back into the image spec.
Motivation stated in moby/moby#38715
Blog post providing context:

Not sure about the version bump tho - please advise how to handle that one.

stevvooe · 2019-02-19T20:40:32Z

You need to update the spec and the schema. Also, I think we need to be explicit about how this field is matched.

ChristianKniep · 2019-02-19T20:43:22Z

Ah - ok missed that. Will update. Thx

jonjohnsonjr · 2019-03-27T17:57:44Z

image-index.md

+         - CPU type (*haswell*, *broadwell*, *skylake*, *ryzen*)
+         - GPU [NVIDIA Compute Capability](https://developer.nvidia.com/cuda-gpus) (3.7, 6.1)
+         - Host name, to separate different types in a data center without flushing out all the specifics (*ComputeNode-20190213*, *storage-2019*)
+        By using this feature flag, a manifest list is able to provide specific images for certain hosts.


nit: s/a manifest list/an image index/

ChristianKniep · 2019-03-27

@jonjohnsonjr changed it - thx for the correction

ChristianKniep · 2019-03-27T21:05:40Z

@stevvooe sorry for the delay - a lot going on. re: your comment...
Do you think the matching should be discussed within the spec? Might be something TBD within the runtime? Even though I am not sure - just a gut feeling.

cyphar

I understand the need for something like this (and the previous method we used for CPU features had ... issues), but there are some pretty big things that should be resolved first (see my comments).

I would like to point out that some of these examples definitely feel like something you'd want to use internally within a particular company, but in their current form are quite questionable to be baked into a spec which requires interoperability between implementations and users. I would like to mention that you are allowed to add your own fields to the spec and use those if you really need them -- especially if you have a feature that is uber-specific to your usecase.

cyphar · 2019-03-29T04:13:26Z

image-index.md

+         - CPU type (*haswell*, *broadwell*, *skylake*, *ryzen*)
+         - GPU [NVIDIA Compute Capability](https://developer.nvidia.com/cuda-gpus) (3.7, 6.1)
+         - Host name, to separate different types in a data center without flushing out all the specifics (*ComputeNode-20190213*, *storage-2019*)
+        By using this feature flag, an image index is able to provide specific images for certain hosts.


For interoperability, we need to have either a standard set of well-understood names, or some way of parsing well-structured feature names (cpu-feature:avx512 and cpu-type:haswell for instance) in order to be able to handle unknown names. I would opt for well-structured feature names and an explanation of how to check them -- and I believe this is the concern @stevvooe had.

I am very unnerved by the hostname example, and would like to know how we handle an image-spec implementation which wants to extract an image but doesn't have that hostname. Should it ignore the field? If so, how is the field mandatory if you can ignore it?
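A minimal sketch of what "well-structured feature names" could look like in practice, assuming a `namespace:value` form as in the `cpu-feature:avx512` and `cpu-type:haswell` examples above. The helper names (`parse_feature`, `classify`) and the `KNOWN_NAMESPACES` set are hypothetical and not part of the image spec or of this PR. The sketch only classifies a name; what an implementation should do with an unknown namespace (ignore it or refuse the image) is exactly the open question raised here and is left undecided.

```python
from typing import NamedTuple, Optional


class Feature(NamedTuple):
    namespace: str  # e.g. "cpu-feature"
    value: str      # e.g. "avx512"


def parse_feature(raw: str) -> Optional[Feature]:
    """Split "namespace:value"; return None for names that are not well-formed."""
    namespace, sep, value = raw.partition(":")
    if not sep or not namespace or not value:
        return None
    return Feature(namespace, value)


# Hypothetical set of namespaces this implementation knows how to check.
KNOWN_NAMESPACES = {"cpu-feature", "cpu-type"}


def classify(raw: str) -> str:
    """Return "known", "unknown-namespace", or "malformed" for a feature string."""
    feature = parse_feature(raw)
    if feature is None:
        return "malformed"
    if feature.namespace in KNOWN_NAMESPACES:
        return "known"
    return "unknown-namespace"
```

With this scheme a free-form flag such as `ComputeNode-20190213` is reported as malformed, while a name in a namespace the implementation has never seen can at least be recognised as such.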

cyphar · 2019-03-29T04:14:43Z

image-index.md

@@ -87,7 +87,11 @@ For the media type(s) that this document is compatible with, see the [matrix][ma

    - **`features`** *array of strings*

-        This property is RESERVED for future versions of the specification.
+        This OPTIONAL property specifies an array of strings, each specifying a mandatory hardware/host feature. Examples are:


"mandatory" doesn't really have meaning in this spec; you need to explain what the behaviour needs to be and how a runtime should check the features.
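One possible reading of "how a runtime should check the features" is a plain subset test: an entry is eligible only if every feature it lists is also declared by the host. This is an assumption made for illustration; the PR text does not define a matching rule, and `features_satisfied` is a hypothetical helper name.

```python
from typing import Iterable


def features_satisfied(required: Iterable[str], host: Iterable[str]) -> bool:
    """True when every feature the image entry lists is also declared by the host."""
    return set(required) <= set(host)


# An entry without features is eligible on every host under this rule.
print(features_satisfied([], ["cpu-type:skylake"]))
# An entry asking for something the host does not declare is not eligible.
print(features_satisfied(["cpu-type:haswell"], ["cpu-type:skylake"]))
```

Under this rule the strings are compared verbatim, so the check works for site-defined flags as well, but it gives no answer on how names are agreed between sites.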

ChristianKniep · 2019-03-29T07:28:11Z

@cyphar Thank you for your feedback. The previous use of the feature flag tried (as I see it) to come up with an agreed on and objective description of hardware features; hence, the focus was on (discoverable) CPU flags.
My proposal is different in that it allows a site to define such feature flags as it fits their environment. That is what is meant by using flags like compute-20190213 - it provides a reference to a certain node and freezes its host installation in time, so that one can create site-specific or even only node type specific containers.
As discussed with @AkihiroSuda in Japan last month and as brought up by your comment; for public image repositories a naming scheme should be agreed on; similar to label best-practises or what you provided as an example.
Where I see this feature flag missing is when you want to create containers that rely on something that the host provides but that is not controlled by the kernel. The prime example is GPUs, where the runtime (nvidia-docker) breaks the immutability of the container FS to allow a container workflow to take place.
Allowing runtimes to use the OCI image spec to the fullest would inform a discussion about what feature flags are used (/useful) for public repositories.
This feature provides a solution for communities with a need for either host-specific dependencies or hardware optimisation by allowing them to define their own flags and hand over the distribution decision of optimised images to the runtime.
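To make "hand over the distribution decision to the runtime" concrete, here is a rough sketch of how a runtime might pick an entry from an image index given a set of site-defined host flags. It assumes the `manifests[].platform.features` layout from the diff, a "most matching features wins, feature-less entry as fallback" policy, and placeholder digests; none of this is specified by the PR, `select_manifest` is a hypothetical name, and `os`/`architecture` matching is left out for brevity.

```python
from typing import Optional


def select_manifest(index: dict, host_features: set) -> Optional[dict]:
    """Pick the eligible entry that lists the most features; None if nothing fits."""
    best = None
    best_count = -1
    for entry in index.get("manifests", []):
        required = set(entry.get("platform", {}).get("features", []))
        if not required <= host_features:
            continue  # entry needs something this host does not declare
        if len(required) > best_count:  # on a tie the earlier entry is kept
            best, best_count = entry, len(required)
    return best


# Example index; the digests are placeholders, not real image digests.
index = {
    "manifests": [
        {"digest": "sha256:generic", "platform": {}},
        {"digest": "sha256:skylake", "platform": {"features": ["cpu-type:skylake"]}},
        {"digest": "sha256:haswell", "platform": {"features": ["cpu-type:haswell"]}},
    ]
}

chosen = select_manifest(index, {"cpu-type:skylake", "host-type:HPE-Model2000"})
print(chosen["digest"])
```

A host that declares no flags falls back to the generic entry, which keeps images usable on runtimes that know nothing about a site's naming scheme.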

ChristianKniep · 2019-04-03T16:47:48Z

@cyphar / @stevvooe Any additional input on that?
As said, IMHO predefining all possible feature strings is hard to impossible. And in an environment where mixed workloads exist, I reckon that using a hardware identifier like host-type:HPE-Model2000 is a good start, as it does not define which components of that host are used and optimised for. One workload might want to optimise for CPU+GPU and another one only for the CPU.

ChristianKniep · 2019-06-13T07:21:43Z

I am addressing the issue in a different way - so the PR is obsolete.
http://www.qnib.org/2019/06/12/metahub/
https://github.com/qnib/metahub

add platform.Feature

49181e2

ChristianKniep added 2 commits March 27, 2019 11:08

adjust schema and markdown description

7b1f90d

Merge branch 'platform-features' into b1.1.0-feature

3e39e3e

jonjohnsonjr reviewed Mar 27, 2019

bugfix wording

f8eaf36

cyphar reviewed Mar 29, 2019

ChristianKniep added 2 commits March 29, 2019 08:16

refine description

cdeaa26

drop mandatory

3080b75

ChristianKniep closed this Jun 13, 2019

ChristianKniep deleted the b1.1.0-feature branch June 13, 2019 07:21
