Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

All nomad containers receive SIGTERM after ~10 minutes #346

Open
ColtonHerrod opened this issue May 30, 2024 · 0 comments
Open

All nomad containers receive SIGTERM after ~10 minutes #346

ColtonHerrod opened this issue May 30, 2024 · 0 comments

Comments

@ColtonHerrod
Copy link

ColtonHerrod commented May 30, 2024

I've been troubleshooting a strange issue off and on for a while. All nomad containers I run receive a SIGTERM after about 10 minutes of runtime and reboot as appropriate for their configuration. This appears to be the case regardless of the image being run or if health checks are enabled. None of the containers approach their memory or CPU limits and the single node I'm running is nowhere near its resource limits. This pattern seems to persist as long as the job runs.

There is nothing seemingly obvious in the logs that points to a specific issue. Is this something that could be an issue with the podman driver or with my local configuration? Happy to provide additional information as needed.

System Information

  • OS: RHEL 9.4
  • Nomad Version: 1.8.0
  • Podman Driver: 4.9.4-rhel
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant