Intelligent workflow scheduling

Description

Intelligent workflow scheduling:
Implement limits on the number of active workflows, yarn containers, etc, to avoid resource starvation. For near term, a limit on number of active workflows will suffice.

The use case here is that a developer wants to ensure that workflows don't pile up and destroy a cluster.

Release Notes

Added a way to limit the number of concurrent schedule runs.

Activity

Show:

Albert Shau October 28, 2015 at 9:16 PM

For that scenario, we would not want to start the program. I was a bit careless with the terminology, what I meant is that the run record has RUNNING status, which occurs once the program is in the STARTING status. It's confusing that run record and program status are different, but that's another issue.

NitinM October 25, 2015 at 4:48 PM

Assume a scenario where a program in the workflow is started and it's not running yet, but waiting for containers to available to start running. Would this scenario break the set skip rule limit and start a new run of a program if getting containers for previous run takes longer than the schedule interval.

Albert Shau October 23, 2015 at 11:15 PM

Albert Shau October 23, 2015 at 8:48 PM

yep

Jeff Dix October 23, 2015 at 8:42 PM

If I did not set skipIfConcurrentProgramRunsExceed would it not limit the number of concurrent workflows like the current implementation?

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Assignee

Reporter

Labels

Sprint

Fix versions

Priority

Created October 7, 2015 at 6:47 PM
Updated January 14, 2016 at 2:34 AM
Resolved November 9, 2015 at 6:43 PM