Set ssh timeout in SSHSessionManager

Description

When we use ssh is used for monitoring, ssh connections can hang if the dataproc cluster's external IP address changes (this can happen because the IP address is ephemeral) resulting in pipelines getting stuck.
https://github.com/cdapio/cdap/blob/release/6.1/cdap-app-fabric/src/main/java/io/cdap/cdap/internal/app/runtime/distributed/remote/SSHSessionManager.java#L145

We should set a timeout on the ssh connection so that the pipeline runs fails when the cluster becomes unreachable.

Release Notes

None

Activity

Show:
Terence Yim
January 28, 2021, 1:23 AM

6.4 Fix

Prashant Jaikumar
January 28, 2021, 12:14 AM

Thanks, I’ll update the description.

Terence Yim
January 28, 2021, 12:12 AM

It is not only affecting 6.1. All later versions that uses SSH has the same issue.

Fixed

Assignee

Terence Yim

Reporter

Prashant Jaikumar

Labels

None

Docs Impact

None

UX Impact

None

Fix versions

Affects versions

Priority

Major