As a user I want to differentiate between different nature of images #1437

ipanova · 2023-12-01T16:13:17Z

Is your feature request related to a problem? Please describe.
As a user when I inspect a manifest through pulp API I can tell whether the image is :

bootable
application based
flatpak
etc

Describe the solution you'd like
Expose Image Annotations/Labels based on which a user would be able to say/search what kid of image it is.
For example flatpak image contains org.flatpak.ref label. This label identifies the image as Flatpak.

Additional context
We might need to inspect both Label and Annotations depending on the Docker/Oci image format.
In OCI spec labels are superseded by annotations but docker format still uses labels.

Note: Even if the image is derived, the labels/annotations are passed down to the child image.

The text was updated successfully, but these errors were encountered:

ipanova · 2024-01-24T17:05:24Z

Docker images can use annotations format but they will be present in the config.Labels directive, e.g.
They will not have `annotations on the manifest because it is not part of their spec.

$ skopeo inspect docker://docker.io/bitnami/redis:latest --tls-verify=false --raw  --config |jq .config.Labels
{
  "com.vmware.cp.artifact.flavor": "sha256:1e1b4657a77f0d47e9220f0c37b9bf7802581b93214fff7d1bd2364c8bf22e8e",
  "org.opencontainers.image.base.name": "docker.io/bitnami/minideb:bullseye",
  "org.opencontainers.image.created": "2024-01-22T10:33:28Z",
  "org.opencontainers.image.description": "Application packaged by VMware, Inc",
  "org.opencontainers.image.licenses": "Apache-2.0",
  "org.opencontainers.image.ref.name": "7.2.4-debian-11-r3",
  "org.opencontainers.image.title": "redis",
  "org.opencontainers.image.vendor": "VMware, Inc.",
  "org.opencontainers.image.version": "7.2.4"
}

OCI images can use annotations but not config.Labels

$ skopeo inspect docker://cgr.dev/chainguard/wolfi-base --tls-verify=false --raw  |jq .annotations
{
  "org.opencontainers.image.authors": "Chainguard Team https://www.chainguard.dev/",
  "org.opencontainers.image.source": "https://github.com/chainguard-images/images/tree/main/images/wolfi-base",
  "org.opencontainers.image.url": "https://edu.chainguard.dev/chainguard/chainguard-images/reference/wolfi-base/"
}

But some(most?) OCI images still can use config.Labels but not annotations skopeo inspect docker://registry.fedoraproject.org/0ad:latest --tls-verify=false --raw --config |jq

And probably some OCI images can have both populated :)

Annotations are a fairly new directive , not many images leverage them and we should be prepared to look in both places.
There is clear benefit to leverage annotations because one can just download/parse the metadata and not the image itself.

ipanova · 2024-01-24T17:07:44Z

Note: Both Labels/annotations are indeed passed down to the child image(if unspecified). If specified, they will be overridden.

lubosmj · 2024-02-16T18:40:25Z

Furthermore, we are required to identify whether a manifest list is bootable or not. This can be done by parsing the first available listed manifest or a manifest with the amd64 architecture.

The ultimate goal is to enable users to filter manifests by their nature from the REST API. With @ipanova, we have considered two design options:

Adding two boolean fields to the model/serializer: is_bootable and is_flatpak. Pros: easy to implement and maintain, good performance while filtering content. Cons: we may end up having a bunch of fields per every manifest's nature (e.g., is_future_type).
Adding a labels json field to the manifest/serializer model. On the viewset, we define a custom search filter that will enable users to filter content by specific values stored inside the object's json field. Pros: robust and resilient to newly emerged natures, everything accessible from one place, visible in the API. Cons: performance implications of exposing a complex search filter (https://stackoverflow.com/questions/58807182/how-to-apply-filters-on-a-posgres-jsonfield-in-django-rest-framework, https://www.django-rest-framework.org/api-guide/filtering/#searchfilter, https://github.com/pulp/pulp_ansible/blob/6335fe32e699ed7686483cffc58e988c8b9c040f/pulp_ansible/app/galaxy/v3/filters.py#L39).

Unfortunately, we will need to extract data from config blobs' labels in both cases and place it to the manifest model. We cannot ensure that manifests' annotations are the only source of truth.

It might be worth implementing both of the suggested options (they are complementary, synergistic), so that the users will be able to filter manifests by is_bootable while at the same time they can preview what other labels were used.

http :5001/pulp/api/v3/content/container/manifests?is_bootable=True

{
  "count": 123,
  "next": "http://api.example.org/accounts/?offset=400&limit=100",
  "previous": "http://api.example.org/accounts/?offset=200&limit=100",
  "results": [
    {
      "pulp_href": "http://example.com",
      "pulp_created": "2019-08-24T14:15:22Z",
      "artifact": "http://example.com",
      "digest": "string",
      "schema_version": 0,
      "media_type": "string",
      "listed_manifests": [
        "http://example.com"
      ],
      "is_bootable": True,     # NEW
      "config_blob": "http://example.com",
      "labels": {     # NEW
        "key": "value"
      },
      "blobs": [
        "http://example.com"
      ]
    }
  ]
}

ipanova · 2024-02-19T13:38:28Z

I think, I like the idea exposing both the boolean field and the labels and here is why:

labels is a directive of image manifest, extracting them from it and exposing onto a manifest list feels a bit more than hacky, especially when there can be different set of labels on each child manifest.

Besides, labels, we should also consider exposing annotations.

ianballou · 2024-02-26T16:18:32Z

From a Katello use case this sounds perfect. We can immediately use this to display pertinent information about the manifests in the UI.

I think from our standpoint we're most likely to use is_flatpak and is_bootable , so +1 to having the booleans.

closes pulp#1437

closes #1437

ianballou · 2024-03-12T19:47:32Z

I'm going to push for having Katello integrate with this as soon as possible. I'll need to give time for users to pre-migrate in Katello 4.13 and 4.14, but after that we can unleash this setting (plus we want to rework our UI at the same time).

I have a request: can we get information about how long the pre-migration will take on a well-loaded Pulp environment? Perhaps a good start would be an environment that has all of the OpenStack-related container repositories synced.
We'll also need to provide information about how the pre-migration might affect Pulp (and Katello) performance while the migration is running. Will sync tasks be blocked for long, for example?

ipanova · 2024-03-13T11:31:51Z

@ianballou
sync and any other api calls won't be blocked because this django-admin command works directly with the DB.
I think you already have an instance of calling such command in the post-install phase, don't you? It's this one https://github.com/pulp/pulp_container/blob/main/pulp_container/app/management/commands/container-repair-media-type.py

ianballou · 2024-03-13T11:54:59Z

I think you already have an instance of calling such command in the post-install phase, don't you?

We do, but we block the upgrade on it in that case, so it's a little different.

ianballou · 2024-03-13T11:57:37Z

Thanks for mentioning that though, I forgot that foreman-maintain is a good place to include the pre-migration. That's where we run the media repair script.

ipanova · 2024-03-13T12:06:03Z

so you would be calling this one in foreman-maintain as well and it won't be blocking anything, correct?

ipanova · 2024-03-13T14:43:56Z

Would you still want to setup a testing env to see how long the command will take? It will not block the upgrade and api calls/tasks. The only place where you will see the overhead is DB and IO

ipanova · 2024-03-14T13:52:34Z

As agreed per call would be good to provide some numbers so they can be shared in the user docs

ipanova added Feature Triage-Needed labels Dec 1, 2023

ipanova removed the Triage-Needed label Feb 14, 2024

lubosmj self-assigned this Feb 24, 2024

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Feb 29, 2024

Distinguish between the nature of images

e904b8e

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Feb 29, 2024

Distinguish between the nature of images

f06cc93

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Feb 29, 2024

Distinguish between the nature of images

aa36825

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

831fc0b

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

b1f4494

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

3ee24a6

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

9567859

closes pulp#1437

lubosmj mentioned this issue Mar 3, 2024

Distinguish between the nature of images #1532

Merged

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

8fbcc78

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

37b5879

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

0054be3

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

26bcd61

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

a73c806

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

60aac7c

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

10a336e

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

5ed32ec

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

3d92683

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 3, 2024

Distinguish between the nature of images

0a1d1ee

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 4, 2024

Distinguish between the nature of images

cad750c

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

944d88b

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

4655ab4

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

fc6142b

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

52fbc92

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

40bab33

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

8367699

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

5961b9d

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

850b12a

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 6, 2024

Distinguish between the nature of images

06cd5d2

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 7, 2024

Distinguish between the nature of images

781e62a

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 7, 2024

Distinguish between the nature of images

9c1820c

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 7, 2024

Distinguish between the nature of images

82a61b6

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 7, 2024

Distinguish between the nature of images

3cc2588

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 7, 2024

Distinguish between the nature of images

7fe828c

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 8, 2024

Distinguish between the nature of images

5b3c4f2

closes pulp#1437

lubosmj added a commit to lubosmj/pulp_container that referenced this issue Mar 12, 2024

Distinguish between the nature of images

fa1c4c4

closes pulp#1437

lubosmj closed this as completed in #1532 Mar 12, 2024

lubosmj added a commit that referenced this issue Mar 12, 2024

Distinguish between the nature of images

f27f3af

closes #1437

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

As a user I want to differentiate between different nature of images #1437

As a user I want to differentiate between different nature of images #1437

ipanova commented Dec 1, 2023 •

edited

ipanova commented Jan 24, 2024 •

edited

ipanova commented Jan 24, 2024

lubosmj commented Feb 16, 2024 •

edited

ipanova commented Feb 19, 2024

ianballou commented Feb 26, 2024

ianballou commented Mar 12, 2024

ipanova commented Mar 13, 2024 •

edited

ianballou commented Mar 13, 2024

ianballou commented Mar 13, 2024

ipanova commented Mar 13, 2024

ipanova commented Mar 13, 2024

ipanova commented Mar 14, 2024

As a user I want to differentiate between different nature of images #1437

As a user I want to differentiate between different nature of images #1437

Comments

ipanova commented Dec 1, 2023 • edited

ipanova commented Jan 24, 2024 • edited

ipanova commented Jan 24, 2024

lubosmj commented Feb 16, 2024 • edited

ipanova commented Feb 19, 2024

ianballou commented Feb 26, 2024

ianballou commented Mar 12, 2024

ipanova commented Mar 13, 2024 • edited

ianballou commented Mar 13, 2024

ianballou commented Mar 13, 2024

ipanova commented Mar 13, 2024

ipanova commented Mar 13, 2024

ipanova commented Mar 14, 2024

ipanova commented Dec 1, 2023 •

edited

ipanova commented Jan 24, 2024 •

edited

lubosmj commented Feb 16, 2024 •

edited

ipanova commented Mar 13, 2024 •

edited