Cloudera

Overview

Cloudera Certified Technology

CDAP 6.2.0 is certified on Cloudera 5

Today, Hadoop is frequently used as an offline analytics tool – generating insights that are then deployed operationally in other systems. Greater business value is generated when organizations turn their data analytics directly into action. Cask and Cloudera refer to this approach as operational analytics; applications that incorporate data analytics are referred to as data applications. The goal of the Cloudera-Cask partnership is to help customers overcome the challenges in building data applications and accelerate the value creation from operational analytics.

Utilizing CDAP on Cloudera Enterprise Data Hub (EDH) is a seamless experience. CDAP is integrated with Cloudera Manager, enabling customers to install, update, and monitor CDAP directly within the Cloudera Manager user interface. CDAP provides automation for ingestion and exploration of data in Cloudera Impala. Rather than writing a MapReduce program to transform data into the Impala file format and scheduling periodic transformation jobs in a separate system such as Oozie, developers can just issue a few simple commands and either batch or streaming data will automatically be ingested into Impala and available for high-performance SQL queries.

The Cask™ Data Application Platform (CDAP) integrates with the Cloudera Manager. Configurations that include Cloudera Manager can be easily configured to ingest data into a cluster, specify schema, or run interactive queries using Impala with CDAP for faster results.

CDAP 6.2.0 is certified on Cloudera 5.

CDAP and Cloudera Architecture Schematic

 

Created in 2020 by Google Inc.