HIP for Identifying Images for Helm Chart #215

runyontr · 2021-10-08T00:20:10Z

This HIP identifies how a chart can describe the images needed by a chart for deployment for use in airgap deployments, SBOMs or general metadata.

Signed-off-by: Thomas Runyon <[email protected]>

scottrigby · 2021-10-08T01:51:24Z

hips/hip-0015.md

+sources:
+ - https://github.com/stefanprodan/podinfo
+kubeVersion: ">=1.19.0-0"
+annotations:


Would this be better a top level images key in Chart.yaml, since this is a proposal for a new (optional) standard?

This would have the added benefit of allowing the items to be a map, instead of cast as a string.
Example:

Suggested change

annotations:

images:

- name: podinfo

image: ghcr.io/stefanprodan/podinfo:6.0.0

- name: redis

image: dockerhub.io/_/redis:6.0.8

annotations:

I think providing it as the native spec, rather than a supported annotation would provide a few benefits:

Better documentation/discovery

Easier integration as we add the properties to the Chart Struct.

👍 for native/typed spec since one of the key desires is for downstream tools to consume and leverage this information, this would prevent each tool from implementing their own parser.

Also want to float the idea of using digests instead of tags for referencing images. On the one hand, digests allow for more repeatability and have a better immutability story considering this HIP is about pinpointing dependencies, but that might be undesirable considering the chart itself is versioned using a mutable tag. Thoughts?

I would lean to support both and allow the chart owner to decide if they want to use a SHA or a tag for their application. Would you suggest breaking that into its own field so tools know how to concat the fields together?

images: - name: podinfo image: ghcr.io/stefanprodan/podinfo tag: 6.0.0 - name: redis image: dockerhub.io/_/redis digest: sha256:3235326357dfb65f1781dbc4df3b834546d8bf914e82cce58e6e6b676e23ce8f

images as a new top level element in the Chart.yaml file cannot happen until the next major version of Helm is introduced. This is due to the use of strict handling when the Chart.yaml file is loaded. This is done for security purposes and was a finding in a security audit. You can refer to the audits for more details on the issue.

An annotation is backwards compatible and lets us do it now.

What's the right way to capture this top level image field as part of the 4.X release so that can be tracked?

Two possibilities: Open a new HIP describing the change or mark this HIP as Helm 4, and not implement it in Helm 3. In theory, Helm 4's feature list is supposed to be built entirely from HIPs so that we have a documented forward path. This is after receiving backlash from people who felt that Helm 3 did not have an adequate set of planning documents.

scottrigby · 2021-10-08T02:00:34Z

hips/hip-0015.md

+
+## Rationale
+
+The `helm.sh` prefix identifies the standard belonging to the Helm community, and the `/images` path clearly identifies the type of objects being identified. The annotation will define a string that can be parsed into a list that the chart maintainer will manage as they knowingly adjust the images needed for the chart's deployment. Each element of this image list will contain at least a `name` for the image as well as a location for the image in the `image` field. and an optional `dependency` field described in the specification. The elements _may_ have additional properties to meet additional use cases, such as the `whitelisted` field used by ArtifactHub, or providing a public key for signed images. The additional fields are not part of this standard, but how they get attached to each image _is_.


Is this map structure for easy portability from existing artifacthub.io/images annotations? Asking because I don't yet understand the value of giving the image a name that's different from the image repo/name:tag path itself, which seems like it should be the primary key?

I could see this switching to

images: redis: image: docker.io/redis:6.0.8

The non-URL use cases I'm seeing for adding additional fields would be to identify locations for signatures, sboms, etc. e.g.:

images: kafka: image: docker.io/bitnami/kafka:2.8.1-debian-10-r0 bom: docker.io/bitnami/kafka:sha2562342342342343422342.bom signature: docker.io/bitnami/kafka:sha2562342342342343422342.sig public_key: MFkwEwYHKoZIzj0CAQYIKoZIzj0DAQcDQgAEqyIFy6N9IlhsJgXwBhJ+DXemtzFCo3aXYifBX+VJfe9Q3fTkdARn4cdNjTDFocjJSObRD8PxN8HEj1Vy2ziySA==

Both ArtifactHub and Openshift use the name field, but as you point out, I haven't seen any names that differ from the last part of the image URL, so could be duplicate information.

How would the image reference need to change (if at all?) to accommodate non-default values for images. Or is the assumption that the images map (or whatever the ultimately agreed upon format is) is only valid with the chart authors default values.yaml?

Probably don't want to assume it is only valid with the defaults since that would eliminate the accuracy of a static analysis on a chart to extract the images. We need a way to identify images with all of the overlays in place. The community has asked about overriding Chart values in the past. Is there an alternative to providing overlays for Chart values if the use case is to identify image attributes without deployment?

Noticed in this HIP a request to override dependencies. Maybe a helm option to provide a Chart.yaml overlay is the best solution to cover both scenarios and appversion.

I would suggest focusing on images first instead of the other elements. The references to bom and signature point in a direction where security is in the loop. The references follow what's happening with sigstore. This should happen as part of a broader conversation (which I would like to have).

scottrigby · 2021-10-08T02:05:11Z

hips/hip-0015.md

+
+### Helm Templating Discovery
+
+The methodology of dynamically discovering images used by a chart _can_ work in certain situations but also has limitations with images that are only used for certain use cases, as well as operator charts that don't expose the images needed as part of the rendered manifests.


Note that images listed in CRs based on CRDs can't easily be discovered because we don't know how to navigate that schema. Determining the images is a bit of a guessing game. I am glad that Artifact Hub, at least, is trying to figure out some of those things.

Agreeing with the above, also any images referenced in binaries, like the velero operator, cannot be scraped so having a schema that the publisher can expose what images their chart uses seems like the best way.

If the publisher does expose a list, but those images can be overridden through values, do we need a way of merging the values into the published list?

@joejulian are you suggesting putting a field for each image that specifies how an end user of the chart would override the image?

So for a chart that has

image: registry: docker.io/bitnami/kafka tag: 2.8.1-debian-10-r0 ...

The chart metadata would specify how to change the image and tag via:

annotations: images: | - name: kafka image: docker.io/bitnami/kafka:2.8.1-debian-10-r0 registry: .image.registry tag: .image.tag

We do have a similar scenario but also to be able to pull images with digest, we could include digest as well.

image: registry: docker.io/bitnami/kafka digest: sha256:cba

...

annotations: images: | - name: kafka image: docker.io/bitnami/kafka:2.8.1-debian-10-r0 registry: .image.registry digest: .image.digest

Even here I think we also need a way to figure out registry and digest/tag used by the chart beforehand without rendering the template

technosophos

I am good with merging this as-is.

I think it would be good (but beyond the scope of this present HIP) to continue work on:

A separate (short) HIP describing a top-level image field for Helm 4
Continued discussion about security/verifiability and for supporting alternate registries

mattfarina · 2022-02-17T18:02:31Z

This has the 2 required approvals. Merging.

helm-bot added the size/L label Oct 8, 2021

first draft of images spec for comments

85efe44

Signed-off-by: Thomas Runyon <[email protected]>

runyontr force-pushed the hip-images-annotation branch from c6c9b58 to 85efe44 Compare October 8, 2021 00:20

scottrigby reviewed Oct 8, 2021

View reviewed changes

Boojapho mentioned this pull request Oct 29, 2021

HIP for dependency overrides #176

Closed

technosophos approved these changes Feb 17, 2022

View reviewed changes

mattfarina approved these changes Feb 17, 2022

View reviewed changes

mattfarina merged commit 50d2bd1 into helm:main Feb 17, 2022

runyontr mentioned this pull request Feb 17, 2022

Support Image Discovery in Helm Charts zarf-dev/zarf#337

Closed

runyontr deleted the hip-images-annotation branch February 17, 2022 19:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HIP for Identifying Images for Helm Chart #215

HIP for Identifying Images for Helm Chart #215

runyontr commented Oct 8, 2021

scottrigby Oct 8, 2021

runyontr Oct 8, 2021

joshrwolf Oct 19, 2021

runyontr Oct 20, 2021

mattfarina Jan 6, 2022

runyontr Feb 3, 2022

technosophos Feb 17, 2022

scottrigby Oct 8, 2021

runyontr Oct 8, 2021 •

edited

Loading

joshrwolf Oct 19, 2021

Boojapho Oct 28, 2021

Boojapho Oct 29, 2021

mattfarina Jan 6, 2022

scottrigby Oct 8, 2021

mattfarina Jan 6, 2022

joejulian Feb 3, 2022

runyontr Feb 3, 2022

jhmadhav Feb 8, 2022 •

edited

Loading

technosophos left a comment

mattfarina commented Feb 17, 2022

-annotations:
+images:
+ - name: podinfo
+ image: ghcr.io/stefanprodan/podinfo:6.0.0
+ - name: redis
+ image: dockerhub.io/_/redis:6.0.8
+annotations:


		## Rationale

		The `helm.sh` prefix identifies the standard belonging to the Helm community, and the `/images` path clearly identifies the type of objects being identified. The annotation will define a string that can be parsed into a list that the chart maintainer will manage as they knowingly adjust the images needed for the chart's deployment. Each element of this image list will contain at least a `name` for the image as well as a location for the image in the `image` field. and an optional `dependency` field described in the specification. The elements _may_ have additional properties to meet additional use cases, such as the `whitelisted` field used by ArtifactHub, or providing a public key for signed images. The additional fields are not part of this standard, but how they get attached to each image _is_.


		### Helm Templating Discovery

		The methodology of dynamically discovering images used by a chart _can_ work in certain situations but also has limitations with images that are only used for certain use cases, as well as operator charts that don't expose the images needed as part of the rendered manifests.

HIP for Identifying Images for Helm Chart #215

HIP for Identifying Images for Helm Chart #215

Conversation

runyontr commented Oct 8, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

runyontr Oct 8, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jhmadhav Feb 8, 2022 • edited Loading

Choose a reason for hiding this comment

technosophos left a comment

Choose a reason for hiding this comment

mattfarina commented Feb 17, 2022

runyontr Oct 8, 2021 •

edited

Loading

jhmadhav Feb 8, 2022 •

edited

Loading