-
Notifications
You must be signed in to change notification settings - Fork 539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Callback not getting invoked for exec call #2133
Comments
I don't think It sounds like you are trying to create a Kubernetes https://kubernetes.io/docs/concepts/workloads/controllers/job/ When you create a Job you can monitor it for completion and get its logs/status code. If you really do want to run a command within an existing container, I would suggest that implementing some sort of REST or RPC call within your existing container to create the command is a better approach.
|
Thank you @brendandburns. I'm using a third party software from GitHub (GitHub actions) and unfortunately that's what they designed it for so I am looking for potential easy workaround if possible. I guess I will try the Job approach instead. Thanks for the suggestion! |
fwiw, I would expect an error in the case of a true network disconnect (e.g. socket closed), if the underlying TCP connection stays open, I would expect this to continue to work correctly. If this is using the 1.0.0 branch, this may be related to #2127 |
Describe the bug
I am looking for a way to run a long running job in a Kubernetes container that can withstand temporary network disconnection (the disconnection normally lasts for less than 1 second). I attempt to do this using exec. However, the callback function for exec does not get called if there is a network disconnect.
To Reproduce
exec
with a long running method like sleep.konnectivity-agent
pods on the server).Expected behavior
An error or some indication of the disconnect would be helpful. Even better is if there is a way to establish a reconnection.
If not, is there a suggested way to run these long running jobs and not getting them disrupted when a network disconnect event happens? For context, I am using GKE and these disruptions happen when there is a maintenance event.
** Example Code**
The code that I am using (this is from https://github.com/actions/runner-container-hooks/blob/main/packages/k8s/src/k8s/index.ts#L223) where neither the
resolve
orreject
function is called in the callback:The text was updated successfully, but these errors were encountered: