Replication

Container Storage Modules (CSM) for Replication is part of the open-source suite of Kubernetes storage enablers for Dell products.

The CSM for Replication project aims to bring the replication and disaster recovery capabilities of Dell storage arrays to Kubernetes clusters. It helps you replicate groups of volumes using the native replication technology available on the storage array, and it provides a way to restart applications after both planned and unplanned migrations.

CSM for Replication Capabilities

CSM for Replication provides the following capabilities:

| Capability | PowerMax | PowerStore | PowerScale | PowerFlex | Unity |
|---|---|---|---|---|---|
| Replicate data using native storage array based replication | yes | yes | yes | yes | no |
| Asynchronous file volume replication | no | no | yes | no | no |
| Asynchronous block volume replication | yes | yes | n/a | yes | no |
| Synchronous file volume replication | no | no | no | no | no |
| Synchronous block volume replication | yes | yes | n/a | no | no |
| Active-Active (Metro) block volume replication | yes | yes | n/a | no | no |
| Active-Active (Metro) file volume replication | no | no | no | no | no |
| Create PersistentVolume objects in the cluster representing the replicated volume | yes | yes | yes | yes | no |
| Create DellCSIReplicationGroup objects in the cluster | yes | yes | yes | yes | no |
| Failover & Reprotect applications using the replicated volumes | yes | yes | yes | yes | no |
| Online Volume Expansion for replicated volumes | yes | no | no | yes | no |
| Provides a command line utility (repctl) for configuring & managing replication related resources across multiple clusters | yes | yes | yes | yes | no |

NOTE: To add or delete PVs on an existing SYNC Replication Group in PowerStore, the user needs to pause the replication group, perform the operation, and then resume the replication group. For more details, please refer to the troubleshooting section.

NOTE: To delete the last PV from a SYNC Replication Group in PowerStore, the user needs to first unassign the protection policy from the corresponding volume group on the PowerStore Manager UI. For more details, please refer to the troubleshooting section.

NOTE: Snapshot for SyncIQ replicated volumes is not supported by the CSI-PowerScale driver.

Details

As on the storage arrays, all replication related Kubernetes entities are required or created in pairs:

  1. Pair of Kubernetes Clusters
  2. Pair of replication enabled Storage classes
  3. Pair of PersistentVolumes representing the replicated pair on the storage array
  4. Pair of DellCSIReplicationGroup objects representing the replicated protection groups on the storage array

You can also use a single stretched Kubernetes cluster to protect your applications. Even in this topology, the rest of the objects still exist in pairs.

What it does not do

  • Replicate application manifests within/across clusters.
  • Stop applications before the planned/unplanned migration.
  • Start applications after the migration.
  • Replicate PersistentVolumeClaim objects within/across clusters.
  • Require the replicator sidecar or the common replication controller for replication in METRO mode.
  • Allow different namespaces to share the same RDF group when creating volumes in ASYNC mode for PowerMax.
  • Allow the same RDF group to be shared across different replication modes for PowerMax.
  • Support replication for multiple drivers installed on the same Kubernetes cluster.

QuickStart

  1. Install all required components:
     • Enable replication during CSI driver installation
     • Install CSM Replication Controller & repctl
  2. Create replication enabled storage classes
  3. Create PersistentVolumeClaim using the replication enabled storage class
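
As an illustration of the last step, the following Python sketch builds a PersistentVolumeClaim manifest that references a replication enabled storage class and prints it as JSON, which kubectl accepts alongside YAML. The PVC name, namespace, size, and the storage class name `replication-sc` are hypothetical placeholders; the storage class itself must already exist and is not shown here.

```python
import json


def build_replicated_pvc(name, namespace, storage_class, size="8Gi"):
    """Return a PersistentVolumeClaim manifest as a plain dict."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "accessModes": ["ReadWriteOnce"],
            "resources": {"requests": {"storage": size}},
            # Assumption: this must name a replication enabled storage
            # class that was created in the previous step.
            "storageClassName": storage_class,
        },
    }


# Hypothetical names, used only for illustration.
pvc = build_replicated_pvc("demo-pvc", "demo-ns", "replication-sc")
manifest_json = json.dumps(pvc, indent=2)
print(manifest_json)
```

The printed manifest could be saved to a file and applied with `kubectl apply -f`.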

How it works

At a high level, the following happens when you create a PersistentVolumeClaim object using a replication enabled storage class:

  1. The CSI driver creates a protection group on the storage array (if required)
  2. The CSI driver creates the volume and adds it to the protection group. A corresponding group and volume pair exist on the remote storage array
  3. A DellCSIReplicationGroup object is created in the cluster, representing the protection group on the storage array
  4. A replica of the PersistentVolume & DellCSIReplicationGroup objects is created
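
The four steps can be pictured with a small in-memory model. This is only a sketch of the sequence, assuming two separate clusters; the `Array` and `Cluster` classes, the `provision` function, and all object names are hypothetical and do not correspond to the CSI driver or the replication controller code.

```python
from dataclasses import dataclass, field


@dataclass
class Array:
    """Toy storage array: protection group name -> list of volume names."""
    name: str
    groups: dict = field(default_factory=dict)


@dataclass
class Cluster:
    """Toy Kubernetes cluster holding PV and replication group object names."""
    name: str
    pvs: set = field(default_factory=set)
    rgs: set = field(default_factory=set)


def provision(volume, group, src_array, dst_array, src_cluster, dst_cluster):
    # Step 1: create the protection group only if it does not exist yet.
    src_array.groups.setdefault(group, [])
    dst_array.groups.setdefault(group, [])
    # Step 2: create the volume and add it to the group; the remote array
    # holds the corresponding group and volume pair.
    src_array.groups[group].append(volume)
    dst_array.groups[group].append(volume)
    # Step 3: a replication group object represents the protection group
    # in the source cluster, next to the PersistentVolume.
    src_cluster.pvs.add(volume)
    src_cluster.rgs.add(group)
    # Step 4: replicas of the PV and replication group objects are created.
    dst_cluster.pvs.add(volume)
    dst_cluster.rgs.add(group)


src_array, dst_array = Array("array-a"), Array("array-b")
src, dst = Cluster("cluster-a"), Cluster("cluster-b")
provision("vol-1", "rg-1", src_array, dst_array, src, dst)
provision("vol-2", "rg-1", src_array, dst_array, src, dst)
print(src_array.groups)
print(sorted(dst.pvs), sorted(dst.rgs))
```

The second call reuses the existing group `rg-1`, which mirrors the "if required" qualifier in step 1.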

For more details, refer to the Architecture page.

Once the DellCSIReplicationGroup & PersistentVolume objects have been replicated across clusters (or within the same cluster), you can exercise the general Disaster Recovery workflows:

  1. Planned Migration to the target cluster/array
  2. Unplanned Migration to the target cluster/array
  3. Reprotect volumes at the target cluster/array
  4. Maintenance activities such as suspending, resuming, and establishing replication
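
These workflows can be thought of as transitions on a replication group. The Python sketch below is a toy state model, not the CSM implementation: the class name, the state labels, and the assumption that reprotect reverses the replication direction are illustrative and are not taken from the CSM API. The establish action is not modelled.

```python
class ReplicationGroupModel:
    """Toy model of a replicated protection group between two sites."""

    def __init__(self, source, target):
        self.source = source          # site currently being replicated from
        self.target = target          # site currently being replicated to
        self.state = "REPLICATING"    # hypothetical state label

    def failover(self):
        """Planned or unplanned migration: workloads move to the target."""
        if self.state == "FAILED_OVER":
            raise ValueError("group is already failed over")
        self.state = "FAILED_OVER"

    def reprotect(self):
        """Assumed behaviour: resume replication in the reverse direction."""
        if self.state != "FAILED_OVER":
            raise ValueError("reprotect is only modelled after a failover")
        self.source, self.target = self.target, self.source
        self.state = "REPLICATING"

    def suspend(self):
        if self.state != "REPLICATING":
            raise ValueError("only a replicating group can be suspended")
        self.state = "SUSPENDED"

    def resume(self):
        if self.state != "SUSPENDED":
            raise ValueError("only a suspended group can be resumed")
        self.state = "REPLICATING"


rg = ReplicationGroupModel("site-a", "site-b")
rg.failover()      # migrate to site-b
rg.reprotect()     # site-b is now modelled as the replication source
rg.suspend()       # maintenance window
rg.resume()
print(rg.source, rg.target, rg.state)
```

Rejecting out-of-order transitions in the model is a design choice of this sketch; the actual preconditions of each action are described on the Replication Actions page.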

Architecture

High level architecture for CSM for Replication

Cluster Topologies

Supported Cluster Topologies with CSM Replication

Disaster Recovery

Disaster Recovery Workflows

High Availability

Support for High Availability of Volumes

Migration

Support for Array Migration of Volumes

Monitoring

DellCSIReplicationGroup Monitoring

Replication Actions

DellCSIReplicationGroup Actions

Volume Expansion

Online expansion of replicated volumes

Tools

repctl tool for Replication feature in detail

Troubleshooting

Troubleshooting guide

Release notes

Dell Container Storage Modules (CSM) release notes for replication