Deployment Architectures

This section illustrates both a minimal (single host) deployment and a high availability (multi-host) deployment that is highly scalable.

Our recommended installation is to use a multi-host deployment of two boxes for the CDAP components; the hardware requirements are relatively modest, as most of the work is done by the Spark or Hadoop cluster. These two boxes provide high availability; at any one time, one of them is the leader providing services while the other is a follower providing failover support.

CDAP Minimal Deployment

Note: Minimal deployment runs all services on a single host.

CDAP High Availability and Highly Scalable Deployment

Note: Each component in CDAP is horizontally scalable. The number of nodes for each component can be changed based on your particular requirements.

 

Created in 2020 by Google Inc.