Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • As CDAP depends on HDFS, YARN, HBase, ZooKeeper, and (optionally) Hive and Spark, it must be installed on cluster host(s) with full client configurations for these dependent services.

  • The CDAP Master Service must be co-located on a cluster host with an HDFS client, a YARN client, an HBase client, and—optionally—Hive and, optionally, Hive or Spark clients.

  • Note that these clients are redundant if you are co-locating the CDAP Master on a cluster host (or hosts, in the case of a deployment with high availability) with actual services, such as the HDFS NameNode, the YARN resource manager, or the HBase Master.

  • You can download the Hadoop client and HBase client libraries, and then install them on the hosts running CDAP services. No Hadoop or HBase services need be running.

  • All services run as the cdap user installed by the package manager. See “Create the cdap User” below.

  • If you are installing CDAP with the intention of using replication, see these instructions on CDAP Replication before installing or starting CDAP.

...

  • Sync the configuration files (such as cdap-site.xml and cdap-security.xml) on all the nodes.

  • While the default bind.address settings (0.0.0.0, used for app.bind.addressdata.tx.bind.addressrouter.bind.address, and so on) can be synced across hosts, if you customize them to a particular IP address, they will—as a result—be as a result, they will be different on different hosts.

  • Starting services is described in Starting CDAP Services.

...