Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The Hive Bulk Export action takes a SELECT query as input and runs that query on a Hive table. It stores the results under the provided HDFS directory. When the SELECT query is provided to the plugin, it converts that SELECT query to INSERT OVERWRITE DIRECTORY Hive statement. When this query is executed, Hive starts a MapReduce job which stores the results to provided directory location. So there can be multiple files in a given directory location.

Note

Important: Hive Export works with Hive 2.3.3.

If any query other than a valid SELECT query is provided, Hive Bulk Export will fail to publish the pipeline. This is because CDAP uses Apache Calcite to parse the SELECT query to verify that it's not any other SQL query.

...