Internal KVDB considerations for cloud deployment

pb573 · March 18, 2021, 11:10pm

Hi,

We are considering using the internal KVDB for Portworx on an OpenShift cluster (cloud deployment).
I went through the Internal KVDB doc, which has pretty good info.
I would like to discuss a few things in detail.

Storage for KVDB
We currently attach block volumes to nodes for PX storage. Can the KVDB use these attached block volumes for persistent storage? The doc says it is not recommended to have the KVDB consume storage from the PX pool. Does this mean a worker node shouldn't be both a metadata node and a storage node?
HA for KVDB

If, through labelling, we manage to spread the workers that have KVDB disks across 3 different Availability Zones (AZs), can we expect the KVDB to be resilient to zonal worker failures (volumes have a replication factor greater than 1)?

Worker upgrades/ unexpected failures

The docs indicate that backups are stored on the worker node. In our environment, workers are replaced with new worker nodes during upgrades. Assuming the worst-case scenario, where all the workers the KVDB uses are down, is there a path to recovery?

Worst case scenario: Where we are unable to recover PX cluster/KVDB
In this case, if we manage to bring up a new installation of PX, would there be data loss? Assuming we have scheduled snapshots to cloud object storage, can we recover the volumes from cloudsnaps?

sensre · March 26, 2021, 4:22am

Storage for KVDB
You can use a single drive for both metadata and storage in a non-production environment, but separate drives for storage and the KVDB are recommended.

HA for KVDB
Even if you don't label the nodes, Portworx places the internal KVDB members in different AZs automatically when the cluster spans more than one AZ.

Worker upgrades/unexpected failures
Yes, there is a path to recovery, as long as at least one KVDB backup can be retrieved.

Worst case scenario: Where we are unable to recover PX cluster/KVDB
If you have cloud snapshots, the PX volumes can be retrieved from them.

Note: it is also recommended to use a separate drive (minimum 64 GB) for the internal KVDB in a production environment.
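
To reason about the zonal-failure question, here is a minimal sketch in Python. It assumes the internal KVDB behaves like a majority-quorum store (it needs more than half of its members up to stay available), and the node and zone names are hypothetical. It only checks whether a given member placement would keep quorum if any one zone went away; it does not query Portworx itself.

```python
from collections import Counter


def quorum_size(member_count):
    """Members that must stay up under a simple majority rule (assumption)."""
    return member_count // 2 + 1


def survives_zone_loss(member_zones):
    """Return True if quorum holds after losing any single zone.

    member_zones maps a KVDB member's node name to its availability zone.
    Both the names and the zones are illustrative inputs you supply yourself.
    """
    total = len(member_zones)
    if total == 0:
        return False  # no members, nothing to keep available
    needed = quorum_size(total)
    per_zone = Counter(member_zones.values())
    # Losing a zone removes every member placed in it.
    return all(total - lost >= needed for lost in per_zone.values())


if __name__ == "__main__":
    spread = {"worker-1": "zone-a", "worker-2": "zone-b", "worker-3": "zone-c"}
    stacked = {"worker-1": "zone-a", "worker-2": "zone-a", "worker-3": "zone-b"}
    print(survives_zone_loss(spread))
    print(survives_zone_loss(stacked))
```

With three members in three different zones, losing one zone still leaves two of three members. With two members in the same zone, losing that zone leaves only one, which is below the majority.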
