-
Notifications
You must be signed in to change notification settings - Fork 759
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
no kind "Deployment" is registered for version "apps\x10v1" in scheme #4543
Comments
I should note that
|
Ok, so I managed to isolate the corrupted deployment configuration. Somehow there is a corrupted protocol buffer in the Isolate the corrupted serviceOn any of the nodes, run
Then in
I then copied the deployment names, dropped them in to Sublime text, and created a script with a bunch of lines that look like this:
This will error on the specific deployment that is causing the problem, and print View the configurationBack in The bad record's data starts with The good record's data starts with There doesn't appear to be any other corruption in here, but even if there is, it's this first part of this protocol buffer that I need to fix. Then I can just delete and recreate the deployment through the API as expected. Basically, I either need to patch that |
Explicitly deleting that record in the However, someone from the microk8s team should look into this, since it feels very wrong to me that a protocol buffer that has been corrupted should ever find its way into the dqlite database. Especially if this corruption results in completely knocking out basic reliability/recovery functionality. |
Basically, the root cause here seems to be the Feels like the solution here is two-fold
I don't know if microk8s has its own |
I am suddenly having a very strange issue with my 4 node microk8s cluster. I am unable to list all
Deployments
in only thedefault
namespace. Other namespaces are fine, as is getting the details of a specific deployment.This works
As does this
However, as soon as I try to list all deployments in
default
, I get this error fromkubectl
The
kubelite
logs show this failure internallyObviously
apps\x10v1
is wrong, as it should beapps/v1
(or maybeapps\v1
is acceptable as well). But I cannot figure out where this corrupted value is coming from. How can I figure out whichDeployment
in my configuration has this corrupted value and repair it?Worse, because the system can no longer list all deployments, all resources in the
default
namespace now seem to be frozen. Things likekubectl rollout restart deployment
don't finish their restart work, and even explicitly deleting a deployment doesn't remove its pods from the cluster.The text was updated successfully, but these errors were encountered: