-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v2.11: Steps (not DAG) always fails (with conflict error) and no helpful diagnostics (AWS only?) #4107
Comments
Workflow does not have appeared to have finished. Pod node is still pending. Maybe attached logs from failed pod. |
@alexec Attached pod status from k8s dashboard
Logs from k8s pod logs-from-wait-in-output-parameters-dummy-k28ls-4217895726 - wait.txt logs from argo workflow
|
@alexec |
It is a bit unclear what happened here. Because of the |
@alexec Sure. Please let me know the zoom link |
Note to self - we've had instances of errored pods going back into pending on certain providers. |
@saranyaeu2987 ping me on Slack when you're ready. |
@alexec |
`
` |
Maybe related: #4058 |
Notes:
|
Created a dev image |
Note: while this dev image fixed this specific issue, I still don't know why this only affected this steps workflow. Why not others? Why at all? |
Based on [
{"apiVersion":"argoproj.io/v1alpha1","kind":"Workflow","metadata":{"creationTimestamp":"2020-09-24T00:40:28Z","generateName":"steps-","generation":4,"labels":{"workflows.argoproj.io/phase":"Running"},"name":"steps-9khx2","namespace":"argo","resourceVersion":"46785889","selfLink":"/apis/argoproj.io/v1alpha1/namespaces/argo/workflows/steps-9khx2","uid":"b687e772-f844-4268-9fcf-04ae43c400cb"},"spec":{"arguments":{},"entrypoint":"hello-hello-hello","templates":[{"arguments":{},"inputs":{},"metadata":{},"name":"hello-hello-hello","outputs":{},"steps":[[{"arguments":{"parameters":[{"name":"message","value":"hello1"}]},"name":"hello1","template":"whalesay"}],[{"arguments":{"parameters":[{"name":"message","value":"hello2a"}]},"name":"hello2a","template":"whalesay"},{"arguments":{"parameters":[{"name":"message","value":"hello2b"}]},"name":"hello2b","template":"whalesay"}]]},{"arguments":{},"container":{"args":["{{inputs.parameters.message}}"],"command":["cowsay"],"image":"docker/whalesay","name":"","resources":{}},"inputs":{"parameters":[{"name":"message"}]},"metadata":{},"name":"whalesay","outputs":{}}]},"status":{"finishedAt":null,"nodes":{"steps-9khx2":{"children":["steps-9khx2-2475991523"],"displayName":"steps-9khx2","finishedAt":null,"id":"steps-9khx2","name":"steps-9khx2","phase":"Running","startedAt":"2020-09-24T00:40:28Z","templateName":"hello-hello-hello","type":"Steps"},"steps-9khx2-2475991523":{"boundaryID":"steps-9khx2","children":["steps-9khx2-3689613236"],"displayName":"[0]","finishedAt":null,"id":"steps-9khx2-2475991523","name":"steps-9khx2[0]","phase":"Running","startedAt":"2020-09-24T00:40:28Z","templateName":"hello-hello-hello","type":"StepGroup"},"steps-9khx2-3689613236":{"boundaryID":"steps-9khx2","displayName":"hello1","finishedAt":null,"hostNodeName":"ip-10-207-196-131.ec2.internal","id":"steps-9khx2-3689613236","inputs":{"parameters":[{"name":"message","value":"hello1"}]},"message":"ContainerCreating","name":"steps-9khx2[0].hello1","phase":"Pending","startedAt":"2020-09-24T00:40:28Z","templateName":"whalesay","type":"Pod"}},"phase":"Running","startedAt":"2020-09-24T00:40:28Z"}},
{"metadata":{"generation":5,"resourceVersion":"46785918"},"status":{"nodes":{"steps-9khx2-3689613236":{"hostNodeName":null,"message":null,"phase":"Running"}}}},
{"metadata":{"generation":6,"resourceVersion":"46785925"},"status":{"nodes":{"steps-9khx2-3689613236":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":7,"resourceVersion":"46785941"},"status":{"nodes":{"steps-9khx2-2475991523":{"finishedAt":"2020-09-24T00:40:34Z","phase":"Succeeded"},"steps-9khx2-2543249094":{"boundaryID":"steps-9khx2","children":["steps-9khx2-3046931061","steps-9khx2-2996598204"],"displayName":"[1]","finishedAt":null,"id":"steps-9khx2-2543249094","name":"steps-9khx2[1]","phase":"Running","startedAt":"2020-09-24T00:40:34Z","templateName":"hello-hello-hello","type":"StepGroup"},"steps-9khx2-2996598204":{"boundaryID":"steps-9khx2","displayName":"hello2b","finishedAt":null,"id":"steps-9khx2-2996598204","inputs":{"parameters":[{"name":"message","value":"hello2b"}]},"name":"steps-9khx2[1].hello2b","phase":"Pending","startedAt":"2020-09-24T00:40:34Z","templateName":"whalesay","type":"Pod"},"steps-9khx2-3046931061":{"boundaryID":"steps-9khx2","displayName":"hello2a","finishedAt":null,"id":"steps-9khx2-3046931061","inputs":{"parameters":[{"name":"message","value":"hello2a"}]},"name":"steps-9khx2[1].hello2a","phase":"Pending","startedAt":"2020-09-24T00:40:34Z","templateName":"whalesay","type":"Pod"},"steps-9khx2-3689613236":{"children":["steps-9khx2-2543249094"],"finishedAt":"2020-09-24T00:40:32Z","hostNodeName":null,"phase":"Succeeded"}}}},
{"metadata":{"generation":8,"resourceVersion":"46785950"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":"ip-10-207-196-131.ec2.internal"},"steps-9khx2-3046931061":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":9,"resourceVersion":"46785954"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":null,"message":"ContainerCreating"},"steps-9khx2-3046931061":{"hostNodeName":null,"message":"ContainerCreating"}}}},
{"metadata":{"generation":10,"resourceVersion":"46785964"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":"ip-10-207-196-131.ec2.internal"},"steps-9khx2-3046931061":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":11,"resourceVersion":"46785965"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":null},"steps-9khx2-3046931061":{"hostNodeName":null}}}},
{"metadata":{"generation":12,"resourceVersion":"46786007"},"status":{"nodes":{"steps-9khx2-3046931061":{"message":null,"phase":"Running"}}}},
{"metadata":{"generation":13,"resourceVersion":"46786013"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":"ip-10-207-196-131.ec2.internal"},"steps-9khx2-3046931061":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":14,"resourceVersion":"46786014"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":null},"steps-9khx2-3046931061":{"hostNodeName":null}}}},
{"metadata":{"generation":15,"resourceVersion":"46786026"},"status":{"nodes":{"steps-9khx2-2996598204":{"message":null,"phase":"Running"}}}},
{"metadata":{"generation":16,"resourceVersion":"46786029"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":"ip-10-207-196-131.ec2.internal"},"steps-9khx2-3046931061":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":17,"resourceVersion":"46786034"},"status":{"nodes":{"steps-9khx2-2996598204":{"hostNodeName":null},"steps-9khx2-3046931061":{"hostNodeName":null}}}},
{"metadata":{"generation":18,"labels":{"workflows.argoproj.io/completed":"true","workflows.argoproj.io/phase":"Succeeded"},"resourceVersion":"46786047"},"status":{"conditions":[{"status":"True","type":"Completed"}],"finishedAt":"2020-09-24T00:40:44Z","nodes":{"steps-9khx2":{"finishedAt":"2020-09-24T00:40:44Z","outboundNodes":["steps-9khx2-3046931061","steps-9khx2-2996598204"],"phase":"Succeeded"},"steps-9khx2-2543249094":{"finishedAt":"2020-09-24T00:40:44Z","phase":"Succeeded"},"steps-9khx2-2996598204":{"finishedAt":"2020-09-24T00:40:41Z","phase":"Succeeded"},"steps-9khx2-3046931061":{"finishedAt":"2020-09-24T00:40:40Z","phase":"Succeeded"}},"phase":"Succeeded"}},
{"metadata":{"generation":19,"resourceVersion":"46786052"},"status":{"nodes":{"steps-9khx2-2996598204":{"resourcesDuration":{"cpu":4,"memory":4}},"steps-9khx2-3046931061":{"resourcesDuration":{"cpu":4,"memory":4}}},"resourcesDuration":{"cpu":8,"memory":8}}},
] |
From busted version, why does generation 9 erase everything? [
{"apiVersion":"argoproj.io/v1alpha1","kind":"Workflow","metadata":{"creationTimestamp":"2020-09-24T21:10:27Z","generateName":"steps-","generation":3,"labels":{"workflows.argoproj.io/phase":"Running"},"name":"steps-vqqhb","namespace":"argo","resourceVersion":"47362772","selfLink":"/apis/argoproj.io/v1alpha1/namespaces/argo/workflows/steps-vqqhb","uid":"686f3ff3-6374-4f2f-96d1-5aaabe86f4a1"},"spec":{"arguments":{},"entrypoint":"hello-hello-hello","templates":[{"arguments":{},"inputs":{},"metadata":{},"name":"hello-hello-hello","outputs":{},"steps":[[{"arguments":{"parameters":[{"name":"message","value":"hello1"}]},"name":"hello1","template":"whalesay"}],[{"arguments":{"parameters":[{"name":"message","value":"hello2a"}]},"name":"hello2a","template":"whalesay"},{"arguments":{"parameters":[{"name":"message","value":"hello2b"}]},"name":"hello2b","template":"whalesay"}]]},{"arguments":{},"container":{"args":["{{inputs.parameters.message}}"],"command":["cowsay"],"image":"docker/whalesay","name":"","resources":{}},"inputs":{"parameters":[{"name":"message"}]},"metadata":{},"name":"whalesay","outputs":{}}]},"status":{"finishedAt":null,"nodes":{"steps-vqqhb":{"children":["steps-vqqhb-1336869503"],"displayName":"steps-vqqhb","finishedAt":null,"id":"steps-vqqhb","name":"steps-vqqhb","phase":"Running","startedAt":"2020-09-24T21:10:27Z","templateName":"hello-hello-hello","templateScope":"local/steps-vqqhb","type":"Steps"},"steps-vqqhb-1278686064":{"boundaryID":"steps-vqqhb","displayName":"hello1","finishedAt":null,"hostNodeName":"ip-10-207-196-131.ec2.internal","id":"steps-vqqhb-1278686064","inputs":{"parameters":[{"name":"message","value":"hello1"}]},"message":"ContainerCreating","name":"steps-vqqhb[0].hello1","phase":"Pending","startedAt":"2020-09-24T21:10:27Z","templateName":"whalesay","templateScope":"local/steps-vqqhb","type":"Pod"},"steps-vqqhb-1336869503":{"boundaryID":"steps-vqqhb","children":["steps-vqqhb-1278686064"],"displayName":"[0]","finishedAt":null,"id":"steps-vqqhb-1336869503","name":"steps-vqqhb[0]","phase":"Running","startedAt":"2020-09-24T21:10:27Z","templateName":"hello-hello-hello","templateScope":"local/steps-vqqhb","type":"StepGroup"}},"phase":"Running","startedAt":"2020-09-24T21:10:27Z"}},
{"metadata":{"generation":4,"resourceVersion":"47362809"},"status":{"nodes":{"steps-vqqhb-1278686064":{"hostNodeName":null,"message":null,"phase":"Running"}}}},
{"metadata":{"generation":5,"resourceVersion":"47362810"},"status":{"nodes":{"steps-vqqhb-1278686064":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":6,"resourceVersion":"47362818"},"status":{"nodes":{"steps-vqqhb-1278686064":{"hostNodeName":null,"outputs":{"artifacts":[{"archiveLogs":true,"name":"main-logs","s3":{"accessKeySecret":{"key":"accesskey","name":"my-minio-cred"},"bucket":"my-bucket","endpoint":"minio:9000","insecure":true,"key":"steps-vqqhb/steps-vqqhb-1278686064/main.log","secretKeySecret":{"key":"secretkey","name":"my-minio-cred"}}}]}}}}},
{"metadata":{"generation":7,"resourceVersion":"47362819"},"status":{"nodes":{"steps-vqqhb-1278686064":{"hostNodeName":"ip-10-207-196-131.ec2.internal"}}}},
{"metadata":{"generation":8,"labels":{"workflows.argoproj.io/completed":"true","workflows.argoproj.io/phase":"Failed"},"resourceVersion":"47362836"},"status":{"conditions":[{"status":"True","type":"Completed"}],"finishedAt":"2020-09-24T21:10:34Z","message":"child 'steps-vqqhb[1].hello2a' errored","nodes":{"steps-vqqhb":{"finishedAt":"2020-09-24T21:10:34Z","message":"child 'steps-vqqhb[1].hello2a' errored","phase":"Failed"},"steps-vqqhb-1278686064":{"children":["steps-vqqhb-330359458"],"finishedAt":"2020-09-24T21:10:32Z","hostNodeName":null,"phase":"Succeeded"},"steps-vqqhb-1336869503":{"finishedAt":"2020-09-24T21:10:34Z","phase":"Succeeded"},"steps-vqqhb-330359458":{"boundaryID":"steps-vqqhb","children":["steps-vqqhb-3815304425"],"displayName":"[1]","finishedAt":"2020-09-24T21:10:34Z","id":"steps-vqqhb-330359458","message":"child 'steps-vqqhb[1].hello2a' errored","name":"steps-vqqhb[1]","phase":"Error","startedAt":"2020-09-24T21:10:34Z","templateName":"hello-hello-hello","type":"StepGroup"}},"phase":"Failed"}},
{"metadata":{"generation":9,"labels":{"workflows.argoproj.io/completed":null,"workflows.argoproj.io/phase":"Running"},"resourceVersion":"47363007"},"status":{"conditions":[{"status":"False","type":"Completed"}],"finishedAt":null,"message":null,"nodes":{"steps-vqqhb":{"finishedAt":null,"message":null,"phase":"Running"},"steps-vqqhb-330359458":{"finishedAt":null,"message":null,"phase":"Running"}},"phase":"Running"}},
{"metadata":{"generation":10,"labels":{"workflows.argoproj.io/completed":"true","workflows.argoproj.io/phase":"Failed"},"resourceVersion":"47363010"},"status":{"conditions":[{"status":"True","type":"Completed"}],"finishedAt":"2020-09-24T21:10:57Z","message":"child 'steps-vqqhb[1].hello2a' errored","nodes":{"steps-vqqhb":{"finishedAt":"2020-09-24T21:10:57Z","message":"child 'steps-vqqhb[1].hello2a' errored","phase":"Failed"},"steps-vqqhb-330359458":{"finishedAt":"2020-09-24T21:10:57Z","message":"child 'steps-vqqhb[1].hello2a' errored","phase":"Error"}},"phase":"Failed"}},
{"metadata":{"generation":11,"resourceVersion":"47363017"},"status":{"nodes":{"steps-vqqhb-330359458":{"children":["steps-vqqhb-3815304425","steps-vqqhb-3764971568"]},"steps-vqqhb-3764971568":{"boundaryID":"steps-vqqhb","displayName":"hello2b","finishedAt":null,"id":"steps-vqqhb-3764971568","inputs":{"parameters":[{"name":"message","value":"hello2b"}]},"name":"steps-vqqhb[1].hello2b","phase":"Pending","startedAt":"2020-09-24T21:10:57Z","templateName":"whalesay","templateScope":"local/steps-vqqhb","type":"Pod"},"steps-vqqhb-3815304425":{"boundaryID":"steps-vqqhb","displayName":"hello2a","finishedAt":null,"id":"steps-vqqhb-3815304425","inputs":{"parameters":[{"name":"message","value":"hello2a"}]},"name":"steps-vqqhb[1].hello2a","phase":"Pending","startedAt":"2020-09-24T21:10:57Z","templateName":"whalesay","templateScope":"local/steps-vqqhb","type":"Pod"}}}},
] |
…ion configurable. Fixes argoproj#4107 (argoproj#4224) Signed-off-by: Alex Capras <[email protected]>
Summary
Trying Example https://argoproj.github.io/argo/examples/#output-parameters
Diagnostics
Argo version 2.11.0
2nd step(consume-parameter) failed because it was unable to get logs, while step 1(generate-parameter) was successful in pulling logs
Below are 2 API to made to get logs
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.
The text was updated successfully, but these errors were encountered: