Readiness probe failed

Siddharth_Oak · July 16, 2020, 3:57pm

I have created a block storage on IBM Cloud.
I have attached the storage to a worker node IP.
Installed Portworx from IBM CLoud.
2 worker nodes do not come in running state because of following error.

Events:
Type Reason Age From Message

Normal Scheduled default-scheduler Successfully assigned kube-system/portworx-vc6np to 10.221.167.174
Normal Pulling 46s kubelet, 10.221.167.174 Pulling image “portworx/oci-monitor:2.5.2”
Normal Pulled 45s kubelet, 10.221.167.174 Successfully pulled image “portworx/oci-monitor:2.5.2”
Normal Created 45s kubelet, 10.221.167.174 Created container portworx
Normal Started 45s kubelet, 10.221.167.174 Started container portworx
Warning Unhealthy 5s (x4 over 35s) kubelet, 10.221.167.174 Readiness probe failed: HTTP probe failed with statuscode: 503

koch · July 28, 2020, 10:41pm

I am facing the same issue with OpenShift 4.4 in AWS.
Were you able to get around this problem?

Vinayak_Shinde · July 29, 2020, 6:32am

Can you paste here portworx pod logs from kube-system namespace ?

Nitesh_Sharma · July 29, 2020, 8:03am

Vinayak,

Here are the logs, I guess from portworx log you mean portwox api

autopilot-d494f7f4f-tztct 1/1 Running 0 6h13m
portworx-api-m5r2q 1/1 Running 0 43s
portworx-api-pg9sw 1/1 Running 0 43s
portworx-api-rgm47 1/1 Running 0 34s
px-cluster-e3b0849c-d25d-4b26-9e54-013cd3ab0811-867gf 1/2 Running 0 8m3s
px-cluster-e3b0849c-d25d-4b26-9e54-013cd3ab0811-l722q 2/2 Running 0 8m3s
px-cluster-e3b0849c-d25d-4b26-9e54-013cd3ab0811-nc7mc 1/2 Running 0 8m3s
px-csi-ext-8467cd4bb6-2sg6w 3/3 Running 0 6h13m
px-csi-ext-8467cd4bb6-4xhmn 3/3 Running 0 6h13m
px-csi-ext-8467cd4bb6-r67nz 3/3 Running 3 6h13m
stork-9f8c45d44-fvwfc 1/1 Running 0 6h13m
stork-9f8c45d44-hg6r4 1/1 Running 0 6h13m
stork-9f8c45d44-ss5sj 1/1 Running 0 6h13m
stork-scheduler-8689987c6f-57qfw 1/1 Running 0 6h13m
stork-scheduler-8689987c6f-6chdw 1/1 Running 0 6h13m
stork-scheduler-8689987c6f-rvhnr 1/1 Running 0 6h13m

[root@upstreamcontroller portwork]# oc logs portworx-api-pg9sw
[root@upstreamcontroller portwork]# oc logs portworx-api-rgm47
[root@upstreamcontroller portwork]# oc logs portworx-api-m5r2q
time=“2020-07-29T07:56:53Z” level=warning msg=“Timed out while waiting for StartTransientUnit(crio-416d3b9eb933cd6086af74017d39fa24474fa09f502c75acd170277518d90c02.scope) completion signal from dbus. Continuing…”

I am working along koch as this is imp poc for one of our client.

thanks

sanjay.naikwadi · July 29, 2020, 8:13am

Logs from px-cluster-e3b0849c-d25d-4b26-9e54-013cd3ab0811-867gf not from api pods.

sanjay.naikwadi · July 29, 2020, 11:39am

Issue was related to the node labels for kvdb, only one node was labeled, we added the px/meteadata-node=true on other nodes and it formed the KVDB cluster. Also there we did the complete wipe as initially the ports were not opened on firewall.

You don’t need to label if your cluster size is 3, that is helpful if you have larger cluster size and want to dedicate the nodes for KVDB.

Nitesh_Sharma · July 29, 2020, 11:40am

Just to have summary.

17001-17020 port were opened on AWS
node.openshift.io/os_id=rhcos,px label was added to only one of the worker node due to which cluster was not creating a quorum.

Issue was solved by Portworx team quickly. Thanks Sanjay for quick help here.

Topic		Replies	Views
Portworx pod keeps showing Readiness probe failed Portworx on Kubernetes install , cloud	7	2699	February 10, 2021
Readiness probe error: Get "http://127.0.0.1:17001/status": dial tcp 127.0.0.1:17001: connect: connection refused Portworx Install install , operator	4	310	April 30, 2025
Installing Portworx on IBM Cloud OpenShift Bare Metal Portworx Install	3	710	February 27, 2021
PX cluster unable to ready - Readiness probe failed: HTTP probe failed with statuscode: 503 Portworx on Kubernetes install	4	2056	September 10, 2020
Readiness probe failed: HTTP probe failed with statuscode: 503 on GKE Portworx on Kubernetes kvdb	4	4964	December 6, 2019

Readiness probe failed

Related topics