Plugin version: 0.22.0
This source reads from a Google Cloud Pub/Sub subscription in real time. Cloud Pub/Sub brings the scalability, flexibility, and reliability of enterprise message-oriented middleware to the cloud. By providing many-to-many, asynchronous messaging that decouples senders and receivers, it allows for secure and highly available communication between independently written applications.
Credentials
If the plugin is run on a Google Cloud Dataproc cluster, the service account key doesn't need to be provided and can be set to 'auto-detect'. Credentials are automatically read from the cluster environment.
If the plugin isn't run on a Dataproc cluster, the path to a service account key must be provided. To find the service account key, go to the Google Cloud console dashboard. The account key must have permission to access Pub/Sub. The service account key file must be available on every node in your cluster and must be readable by all users running the job.
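
The credential rules above can be checked before a pipeline is deployed. The following is a minimal sketch, not part of the plugin: the function name `resolve_service_account` and the `on_dataproc` flag are hypothetical, and the check on the `type` field assumes a standard Google service account key file in JSON format.

```python
import json
import os

AUTO_DETECT = "auto-detect"


def resolve_service_account(path_setting, on_dataproc):
    """Return None to use cluster credentials, or a validated key file path.

    Hypothetical helper: the plugin performs its own validation.
    """
    if path_setting == AUTO_DETECT:
        if not on_dataproc:
            # Outside Dataproc there is no cluster environment to read from.
            raise ValueError("a service account key path is required outside Dataproc")
        return None

    if not os.path.isfile(path_setting):
        raise FileNotFoundError(f"service account key not found: {path_setting}")

    # os.access only checks the current user; every user that runs the job
    # on every node needs the same read access.
    if not os.access(path_setting, os.R_OK):
        raise PermissionError(f"service account key is not readable: {path_setting}")

    with open(path_setting, encoding="utf-8") as handle:
        key = json.load(handle)

    # Assumption: standard key files carry "type": "service_account".
    if key.get("type") != "service_account":
        raise ValueError("file does not look like a service account key")

    return path_setting
```

Running a check like this on each node catches a missing or unreadable key file before the streaming job starts. It does not verify that the account has permission to access Pub/Sub.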
...
Property | Macro Enabled? | Version Introduced | Description |
---|---|---|---|
Reference Name | No | | Required. Name used to uniquely identify this source for lineage, annotating metadata, etc. |
Project ID | Yes | | Optional. Google Cloud project ID, which uniquely identifies a project. It can be found on the Dashboard in the Google Cloud Platform Console. Default is auto-detect. |
Subscription | Yes | | Required. Name of the Google Cloud Pub/Sub subscription to subscribe to. If the subscription needs to be created, the topic to which the subscription will belong must be provided. Naming Convention for Subscription: |
Topic | Yes | | Optional. Name of the Google Cloud Pub/Sub topic to subscribe to. If a topic is provided and the given subscription doesn't exist, the subscription gets created. Only the messages that arrive after the subscription is created are received. |
Format | Yes | 6.4.0 / 0.17.0 | Optional. Format of the data to read. Supported formats are avro, blob, tsv, csv, delimited, json, parquet, and text. Default is text. |
Service Account Type | Yes | 6.3.0 / 0.16.0 | Optional. Select one of the following options: |
Service Account File Path | Yes | | Optional. File path on the local file system of the service account key used for authorization. Can be set to 'auto-detect' when running on a Dataproc cluster. When running on other clusters, the file must be present on every node in the cluster. Default is auto-detect. |
Service Account JSON | Yes | 6.3.0 / 0.16.0 | Optional. Content of the service account key in JSON format. |
Number of Readers | Yes | 6.4.0 / 0.17.0 | Optional. Number of Pub/Sub reader workers to run in parallel for this source. Each reader requires a worker in the cluster. The default number of readers per Pub/Sub Streaming Source is 1. |
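
As an illustration of how the required properties and defaults in the table fit together, here is a minimal sketch that assembles and validates a set of property values. The key names (`reference_name`, `subscription`, `number_of_readers`, and so on) and the function `build_source_config` are hypothetical and are not the plugin's actual property identifiers. Only the required/optional status, the supported formats, and the default values come from the table.

```python
# Formats and defaults as listed in the property table.
SUPPORTED_FORMATS = {
    "avro", "blob", "tsv", "csv", "delimited", "json", "parquet", "text",
}

DEFAULTS = {
    "project": "auto-detect",
    "format": "text",
    "service_file_path": "auto-detect",
    "number_of_readers": 1,
}


def build_source_config(**props):
    """Merge user properties over the defaults and validate them.

    Hypothetical helper with illustrative key names.
    """
    config = {**DEFAULTS, **props}

    # Reference Name and Subscription are the two required properties.
    for required in ("reference_name", "subscription"):
        if not config.get(required):
            raise ValueError(f"missing required property: {required}")

    if config["format"] not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {config['format']}")

    # Each reader needs its own worker, so at least one is required.
    readers = config["number_of_readers"]
    if not isinstance(readers, int) or readers < 1:
        raise ValueError("number_of_readers must be a positive integer")

    return config
```

For example, supplying only the required properties picks up the documented defaults:

```python
config = build_source_config(reference_name="orders_source", subscription="orders-sub")
print(config["format"], config["number_of_readers"])  # prints: text 1
```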
...