Runtime monitor thread hangs when setting up SSH proxy information on the remote cluster

Description

When programs are run in a remote cluster, the runtime monitor sets up SSH port forwarding information on the remote box by scp-ing a file. While doing the scp, the runtime monitor thread can hang waiting to read scp response. A single hanging runtime monitor thread will block all other runtime monitors due to a shared lock, leading to no new programs getting executed.

I have attached a stack trace showing this deadlock.

 

Release Notes

None

Attachments

1

Activity

Show:

Terence YimApril 5, 2019 at 9:37 PM

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Assignee

Reporter

Affects versions

Components

Fix versions

Priority

Created April 5, 2019 at 6:29 PM
Updated April 30, 2019 at 10:04 PM
Resolved April 5, 2019 at 9:37 PM