Hi All,
I have set up a Portworx cluster (backed by Ceph, on an OpenStack private cloud) on my Kubernetes cluster, and I'm seeing a strange phenomenon: Portworx consistently goes down whenever I attach a Kafka pod to it. The error log says the KVDB disconnected and the node is then waiting to join quorum on port 9002. The whole cluster goes down and I have to wait about 30 minutes for Portworx to reinitialize… which is not a great experience.
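For reference, this is roughly how I check node and KVDB state while it's out of quorum (a sketch; I'm assuming the default kube-system namespace and the standard name=portworx pod label from an Operator install):

```sh
# Grab one Portworx pod (namespace and label assumed from a default install)
PX_POD=$(kubectl get pods -n kube-system -l name=portworx \
  -o jsonpath='{.items[0].metadata.name}')

# Overall node status; this is where "waiting to join quorum" shows up
kubectl exec -n kube-system "$PX_POD" -- /opt/pwx/bin/pxctl status

# Internal KVDB membership and health
kubectl exec -n kube-system "$PX_POD" -- /opt/pwx/bin/pxctl service kvdb members
```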
Portworx was set up with the Portworx Operator, with a 32 GB KVDB device (/dev/vdb) and a 150 GB storage device (/dev/vdc).
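The relevant part of my StorageCluster spec looks roughly like this (a sketch; the cluster name and image tag are placeholders, the device paths are my real ones):

```sh
# Sketch of the StorageCluster applied via the Operator
# (metadata.name and spec.image are placeholders)
kubectl apply -f - <<EOF
apiVersion: core.libopenstorage.org/v1
kind: StorageCluster
metadata:
  name: px-cluster                      # placeholder name
  namespace: kube-system
spec:
  image: portworx/oci-monitor:2.13.0    # placeholder tag
  kvdb:
    internal: true
  storage:
    kvdbDevice: /dev/vdb                # 32 GB internal-KVDB device
    devices:
      - /dev/vdc                        # 150 GB storage device
EOF
```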
I tried wiping the cluster following the guide here (Uninstall Portworx from a Kubernetes cluster using the DaemonSet) and rebuilding, but I still hit the same issue.
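If I followed the guide correctly, the wipe step boils down to the px-wipe helper script (this is my reading of the documented procedure; run from a machine with kubectl access to the cluster):

```sh
# Cluster-wipe helper script referenced by the uninstall guide
curl -fsL https://install.portworx.com/px-wipe | bash
```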
Error log:
PX is not running on host: Could not reach 'HealthMonitor'
Error while calling home: KVDB connection failed, either node has networking issues or KVDB members are down or KVDB cluster is unhealthy. All operations (get/update/delete) are unavailable.
Failed to get node status warnings: couldn't get: /nodestatuswarnings with error: Get "http://localhost:9001/nodestatuswarnings": dial tcp 127.0.0.1:9001: connect: connection refused
kvdb error: context deadline exceeded, retry count 3
Hope someone can give me a clue to solving this issue…
Thanks.