Created a HDP cluster and created a DBSource -> TPFS batch (MR) pipeline and it fails with the following issue:
Fixed a problem with Hydrator pipelines using a DBSource not working in an HDP cluster.
Can you take a look to see if you can repro?
i can repro. Happening on my test cluster with mysql jar.
Looking at https://github.com/hortonworks/hadoop-release/blob/HDP-126.96.36.199-tag/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/db/DBInputFormat.java, the code calls createConnection(), which is throwing an exception. Our plugin relies on the fact that DataDrivenETLDBInputFormat overrides the getConnection() method to intercept the call and correctly create the connection. Since the underlying DBInputFormat no longer calls it, everything breaks.
One short term hack would be to override the setConf() method, though there should be a more robust way to do this. Perhaps by copying the inputformat and related classes into the plugin jar?