Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Property

Macro Enabled?

Description

Reference Name

No

RequiredOptional. Name used to uniquely identify this source for lineage, annotating metadata, etc.

Project ID

Yes

Optional. Google Cloud Project ID, which uniquely identifies a project. It can be found on the Dashboard in the Google Cloud Platform Console. This is the project that the Dataplex task will run in. If a temporary bucket needs to be created, the service account must have permission in this project to create buckets.

Default is auto-detect.

Service Account Type

Yes

Optional. Select one of the following options:

  • File Path. File path where the service account is located.

    • JSON. JSON content of the service account.

Service Account File Path

Yes

Optional. Path on the local file system of the service account key used for authorization. Can be set to 'auto-detect' when running on a Dataproc cluster. When running on other clusters, the file must be present on every node in the cluster.

Default is auto-detect.

Service Account JSON

Yes

Optional. Content of the service account.

Location ID

Yes

Required. ID of the location in which the Dataplex lake has been created, which can be found on the details page of the lake.

Lake ID

Yes

Required. ID of the Dataplex lake, which can be found on the details page of the lake.

Zone ID

Yes

Required. ID of the Dataplex zone, which can be found on the details page of the zone.

Entity ID

Yes

ID of the Dataplex entity, which can be found on the Discovery tab.

Partition Start Date

Yes

Optional. Inclusive partition start date, specified as ‘yyyy-MM-dd’. For example, ‘2019-01-01’. If no value is given, all partitions up to the partition end date will be read.

Note: Partition Start Date and Partition End Date are only applicable for BigQuery entities with time partitioning.

Partition End Date

Yes

Optional. Exclusive partition end date, specified as ‘yyyy-MM-dd’. For example, ‘2019-01-01’. If no value is given, all partitions up from the partition start date will be read.

Note: Partition Start Date and Partition End Date are only applicable for BigQuery entities with time partitioning.

Filter

Yes

Optional. Filters out rows that do not match the given condition. For example, if the filter is ‘age > 50 and name is not null’, all output rows will have an ‘age’ over 50 and a value for the ‘name’ field. This is the same as the WHERE clause in BigQuery. More information can be found at https://cloud.google.com/bigquery/docs/reference/standard-sql/query-syntax#where_clause

...