Manual Failover Procedure (Replication)

To manually failover from the master to a slave cluster, follow these steps:

  1. Stop all CDAP programs on the master cluster.

  2. Stop CDAP on the master cluster.

  3. Copy any HDFS files that have not yet been copied using either your distro's solution or distcp.

  4. Run the CDAP replication status tool to retrieve the cluster state:

    [master] $ cdap run io.cdap.cdap.data.tools.ReplicationStatusTool -m -o /tmp/master_state
  5. Copy the master state onto your slave cluster:

    [master] $ scp /tmp/master_state <slave>:/tmp/master_state
  6. Verify that replication has copied the required data onto the slave:

    [slave] $ cdap run io.cdap.cdap.data.tools.ReplicationStatusTool -i /tmp/master_state ... Master and Slave Checksums match. HDFS Replication is complete. HBase Replication is complete.
  7. Start CDAP on the slave:

Created in 2020 by Google Inc.