...
As CDAP depends on HDFS, YARN, HBase, ZooKeeper, and (optionally) Hive and Spark, it must be installed on cluster host(s) with full client configurations for these dependent services.
The CDAP Master Service must be co-located on a cluster host with an HDFS client, a YARN client, an HBase client, and—optionally—Hive and, optionally, Hive or Spark clients.
Note that these clients are redundant if you are co-locating the CDAP Master on a cluster host (or hosts, in the case of a deployment with high availability) with actual services, such as the HDFS NameNode, the YARN resource manager, or the HBase Master.
You can download the Hadoop client and HBase client libraries, and then install them on the hosts running CDAP services. No Hadoop or HBase services need be running.
All services run as the
cdap
user installed by the package manager. See “Create the cdap User” below.If you are installing CDAP with the intention of using replication, see these instructions on CDAP Replication before installing or starting CDAP.
...
Sync the configuration files (such as
cdap-site.xml
andcdap-security.xml
) on all the nodes.While the default bind.address settings (
0.0.0.0
, used forapp.bind.address
,data.tx.bind.address
,router.bind.address
, and so on) can be synced across hosts, if you customize them to a particular IP address, they will—as a result—be as a result, they will be different on different hosts.Starting services is described in Starting CDAP Services.
...