Implement EMR Provisioner for Cloud Runtime

Description

Similar to how we have a Dataproc provisioner for Cloud Runtime, implement an Amazon EMR provisioner to allow running Data pipelines and other applications on an EMR cluster.
Note that this will support only EMR 5.0.0 and later, because EMR 4.9.x and earlier run on Java 7, which CDAP does not support.
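
The version constraint above could be enforced with a check along these lines. This is only a minimal sketch, not the provisioner's actual code: it assumes the release is given as an EMR release label of the form `emr-x.y.z`, and the names `MIN_SUPPORTED`, `parse_release_label`, and `is_supported_release` are hypothetical, introduced here for illustration.

```python
import re

# Hypothetical constant: the lowest EMR release this ticket says is supported.
MIN_SUPPORTED = (5, 0, 0)


def parse_release_label(label):
    """Parse an EMR release label such as 'emr-5.0.0' into (major, minor, patch)."""
    match = re.fullmatch(r"emr-(\d+)\.(\d+)\.(\d+)", label.strip())
    if match is None:
        raise ValueError("Unrecognized EMR release label: %r" % label)
    return tuple(int(part) for part in match.groups())


def is_supported_release(label):
    """Return True if the release is 5.0.0 or later (4.9.x and earlier run on Java 7)."""
    # Tuples compare element by element, so (4, 9, 2) < (5, 0, 0) < (5, 16, 0).
    return parse_release_label(label) >= MIN_SUPPORTED
```

Comparing parsed integer tuples rather than raw strings avoids ordering mistakes such as treating "5.10.0" as lower than "5.9.0".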

Release Notes

Added an Amazon Elastic MapReduce provisioner that can run pipelines on AWS EMR.

Activity

Ali Anwar
July 20, 2018, 5:43 AM
Edited

Implemented here: https://github.com/caskdata/cdap/pull/10370

Follow-up fix to support both MapReduce (MR) and Spark (MR support regressed when I implemented Spark support in the previous PR): https://github.com/caskdata/cdap/pull/10421

Fixed

Assignee

Ali Anwar

Reporter

Ali Anwar

Labels

Docs Impact

None

UX Impact

None

Fix versions

Priority

Major