[BUG] resource can not be passes as parameter #703

dmonakhov · 2018-01-26T17:06:07Z

I try to pass container resources (cpu in my case) as workflow parameter( see workflow below)
But argo cli validation failed like follows:

$ argo  submit resource-param.yaml 
2018/01/26 19:00:54 resource-param.yaml failed to parse: error unmarshaling JSON: quantities must match the regular expression '^([+-]?[0-9.]+)([eEinumkKMGTP]*[-+]?[0-9]*)$'

AFAIU this is happens because expression was not actually substituted.

WORKFLOW

# Try to pass cpu resource as an argument                                                                                                   
#                                                                                                                                           
apiVersion: argoproj.io/v1alpha1
kind: Workflow
metadata:
  generateName: resource-param-
spec:
  arguments:
    parameters:
    - name: nrcpu
      value: 4
  entrypoint: stress-ng
  templates:
  - name: stress-ng
    inputs:
      parameters:
      - name: nrcpu
    container:
      image: lorel/docker-stress-ng
      args: [ "--cpu", "{{workflow.parameters.nrcpu}}", "--timeout", "15s", "--metrics-brief" ]
      resources:
        requests:
          memory: 1Gi
          #cpu: 4                                                                                                                           
          cpu: "{{workflow.parameters.nrcpu}}"

The text was updated successfully, but these errors were encountered:

jessesuen · 2018-01-26T23:52:59Z

K8s has custom unmarshal/marshal functions for the container.resources field. Need to find a way to workaround this.

jessesuen · 2018-01-27T01:57:45Z

This is going to be a tricky one to fix. Since we reuse the Container data structure from the k8s API types (among others), we inherit all of logic that comes with marshaling and unmarshaling of the various types. This includes the custom unmarshalling functions deeply nested any of their sub fields (in this case resource/quantity.go Quantity). We definitely do not want to move away from reusing Container data type since we need to keep in-line with k8s. Our only option it appears is to implement custom unmarshalling at the template level. I'm also very weary of doing that since that is always tricky to get right.

dmonakhov · 2018-01-28T10:18:15Z

@jessesuen Thank you for explanation.
May be there is workaround for this I'm not an exert in k8s API.
My scenario:
a CI job which want to compile. So I want complete it ASAP, for that reason I use
'make -j $NRCPU', where NRCPU is $(numcpu-1) on kube-nodes. It is obvious that hardcoding NRCPU is bad for portability because it NRCPU varies from cluster to cluster

Is it possible to define resource via generic kubernetes API? Via configMap or other config?

dmonakhov · 2018-01-28T20:14:27Z

Yeah. It seems that 'kind: LimitRange' resource does exactly what I need. https://kubernetes.io/docs/tasks/administer-cluster/cpu-constraint-namespace .
Since workaround exists this issue can be moved to enhancement section.

jessesuen · 2018-01-28T20:54:04Z

Thanks for the legwork on the workaround. Yes, container.resources obtained from parameters is a use case we should support, so eventually this is something I'd like fixed. The right way to go about doing it, escapes me at the moment.

discordianfish · 2018-10-25T11:02:55Z

Is there a workaround for this? This is almost a show-stopper for the usecase here. We have several tasks that have vastely different resource requirements, which forces us to create many templates.

epa095 · 2019-05-20T12:18:07Z

A bit of brainstorming on this.
Would it maybe make sense to be able to define a "patch" to a workflow step, which could override parts of the yaml definition of the step. I am thinking something similar to overlays in kustomize.
This could take in exactly the same parameters as the step itself, and be applied to the workflow step right before execution. This would allow you to still use the k8s API types for the workflow step, but people would be "on their own" in regard to what they write in the patch/overlay step.

igor47 · 2019-05-20T23:21:10Z

also got bitten by this issue. as @discordianfish alludes to, the only known workaround so far is to have multiple copies of the same template but with different resource requests. for example:

spec:
  entrypoint: test
  templates:
    - name: test
      dag:
        tasks:
          - name: task1
             template: pod-lowmem
          - name: task2
             template: pod-highmem
      - name: pod-lowmem
         container:
           image: myimage:latest
           resources:
             requests:
               memory: 500Mi
       - name: pod-highmem
          container:
            image: myimage:latest
            resources:
               requests:
                 memory: 2Gi

some way to DRY this manifest would be greatly appreciated. i guess it's possible to use YAML anchors but the syntax there is quite confusing

elsonrodriguez · 2019-08-14T00:15:21Z

As a workaround for this I ended up using a resource to create a Job. Messy but workable.

Food for thought: A killer use case for this would be in combination with GKE's node auto-provisioner, especially for compute-heavy jobs. This way memory/cpu/gpu could be defined as a parameter, and eventually it will just run without needing to define/provision nodes.

Snapple49 · 2019-09-04T09:19:08Z

@elsonrodriguez Hi, would you mind explaining your workaround please? Not sure if I follow, but I'm interested in solutions to this problem as well

elsonrodriguez · 2019-09-05T18:57:17Z

@Snapple49 Basically I just use this pattern:

https://github.com/argoproj/argo/blob/baf37052976458401a6c0e44d06f30dc8d819680/examples/k8s-jobs.yaml

The manifest portion is fully template compatible, so you can swap out anything for a variable, including resource allocation.

For example:

                resources:
                  limits:
                    memory: "{{inputs.parameters.memory}}"
                    cpu: "{{inputs.parameters.cpu}}"

The downside is that logs won't show up directly associated with the workflow, it's less readable, and more complex.

Snapple49 · 2019-09-06T07:18:20Z

I see, thanks a lot! That's neat, I'll have a play with that, messy or not it's a workaround 😁

oskoss · 2019-10-02T17:55:01Z

+1 Just running into this issue for us as well.

jessesuen · 2019-10-04T20:25:22Z

Would it maybe make sense to be able to define a "patch" to a workflow step, which could override parts of the yaml definition of the step. I am thinking something similar to overlays in kustomize. This could take in exactly the same parameters as the step itself, and be applied to the workflow step right before execution. This would allow you to still use the k8s API types for the workflow step, but people would be "on their own" in regard to what they write in the patch/overlay step.

So far, this is the the most promising proposal I've seen to address this problem. I think a first class patch spec, which supports parameter substitution, and then applied on top of the container before submission, might be able to handle this in a generic way (not just for resources, but for other non-string fields like bools and ints).

jgbaum · 2019-10-25T21:41:08Z

Hi @sarabala1979.

Is this in release 2.4.2? If so, can you please provide an example on how to pass the cpu or memory resource requirements?

Thanks!

-J

sarabala1979 · 2019-10-25T21:52:14Z

https://github.com/argoproj/argo/blob/master/examples/pod-spec-patch-wf-tmpl.yaml
https://github.com/argoproj/argo/blob/master/examples/pod-spec-patch.yaml
https://github.com/argoproj/argo/blob/master/examples/pod-spec-yaml-patch.yaml

jgbaum · 2019-10-26T00:15:38Z

Perfect!

…

-J Sent from mobile device

On Oct 25, 2019, 2:52 PM -0700, Saravanan Balasubramanian ***@***.***>, wrote: https://github.com/argoproj/argo/blob/master/examples/pod-spec-patch-wf-tmpl.yaml https://github.com/argoproj/argo/blob/master/examples/pod-spec-patch.yaml https://github.com/argoproj/argo/blob/master/examples/pod-spec-yaml-patch.yaml — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.

jessesuen added the type/bug label Jan 26, 2018

jessesuen added this to the v2.0.0-beta2 milestone Jan 27, 2018

jessesuen removed this from the v2.0.0-beta2 milestone Jan 27, 2018

jessesuen mentioned this issue Sep 24, 2018

Cannot use input parameters in kube yaml resources section #1015

Closed

kevinbache mentioned this issue Jul 2, 2019

Cannot set the (set_gpu_limit) gpu num from user params kubeflow/pipelines#1252

Closed

elikatsis mentioned this issue Sep 12, 2019

Cannot use resource request based on PipelineParam kubeflow/pipelines#1956

Closed

sarabala1979 self-assigned this Oct 15, 2019

sarabala1979 mentioned this issue Oct 17, 2019

PodSpecPatch functionality #1687

Merged

sarabala1979 closed this as completed in #1687 Oct 21, 2019

danxmoran mentioned this issue Jan 10, 2020

Can't set size of volumeClaimTemplate using a workflow variable #1932

Open

4 tasks

jananzhu mentioned this issue Mar 30, 2020

Error when using PodSpecPatch to parameterize ActiveDeadlineSeconds #2545

Closed

4 tasks

oliverwm1 mentioned this issue Aug 12, 2020

Parametrize cpu and memory requests for nudge-to-obs workflow ai2cm/fv3net#560

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] resource can not be passes as parameter #703

[BUG] resource can not be passes as parameter #703

dmonakhov commented Jan 26, 2018

jessesuen commented Jan 26, 2018

jessesuen commented Jan 27, 2018

dmonakhov commented Jan 28, 2018 •

edited

Loading

dmonakhov commented Jan 28, 2018

jessesuen commented Jan 28, 2018 •

edited

Loading

discordianfish commented Oct 25, 2018

epa095 commented May 20, 2019

igor47 commented May 20, 2019 •

edited

Loading

elsonrodriguez commented Aug 14, 2019 •

edited

Loading

Snapple49 commented Sep 4, 2019

elsonrodriguez commented Sep 5, 2019 •

edited

Loading

Snapple49 commented Sep 6, 2019

oskoss commented Oct 2, 2019

jessesuen commented Oct 4, 2019

jgbaum commented Oct 25, 2019

sarabala1979 commented Oct 25, 2019

jgbaum commented Oct 26, 2019 via email

[BUG] resource can not be passes as parameter #703

[BUG] resource can not be passes as parameter #703

Comments

dmonakhov commented Jan 26, 2018

jessesuen commented Jan 26, 2018

jessesuen commented Jan 27, 2018

dmonakhov commented Jan 28, 2018 • edited Loading

dmonakhov commented Jan 28, 2018

jessesuen commented Jan 28, 2018 • edited Loading

discordianfish commented Oct 25, 2018

epa095 commented May 20, 2019

igor47 commented May 20, 2019 • edited Loading

elsonrodriguez commented Aug 14, 2019 • edited Loading

Snapple49 commented Sep 4, 2019

elsonrodriguez commented Sep 5, 2019 • edited Loading

Snapple49 commented Sep 6, 2019

oskoss commented Oct 2, 2019

jessesuen commented Oct 4, 2019

jgbaum commented Oct 25, 2019

sarabala1979 commented Oct 25, 2019

jgbaum commented Oct 26, 2019 via email

dmonakhov commented Jan 28, 2018 •

edited

Loading

jessesuen commented Jan 28, 2018 •

edited

Loading

igor47 commented May 20, 2019 •

edited

Loading

elsonrodriguez commented Aug 14, 2019 •

edited

Loading

elsonrodriguez commented Sep 5, 2019 •

edited

Loading