Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Imports data from an HDFS directory/file into a Hive table. The Hive Import Action imports data from HDFS by executing provided Hive Load Statement. Local file storage is not allowed because a pipeline can run on any machine. If LOCAL file storage option is provided, pipeline deployment fails at publish time. The Hive Bulk Import action only accepts Hive LOAD statements. If any other Hive query is provided, pipeline publish will fail. If the Load command is executed successfully, all the files in the directory will be moved, not copied, to a Hive/warehouse directory.

Note

Important: The Hive Bulk Import action works with Hive 2.3.3.

Configuration

Property

Macro Enabled?

Description

Hive Metastore Username

Yes

User identity for connecting to the specified hive database. Required for databases that need authentication. Optional for databases that do not require authentication.

Hive Metastore Password

Yes

Password to use to connect to the specified database. Required for databases that need authentication. Optional for databases that do not require authentication.

JDBC Connection String

Yes

Required. JDBC connection string including database name. Use auth=delegationToken. The CDAP platform will provide appropriate delegation token while running the pipeline.

Statement to Load data into Hive

Yes

Required. Load command to load files data into a Hive table. LOCAL option in LOAD command is not available.

...