GitHub Batch Source

The GitHub Batch source plugin is available for Preview in the Hub.

This plugin queries the GitHub API.

You can use it to select the datasets associated with a specified repository and collect raw-level data.

Configuration

| Property | Macro Enabled? | Description |
| --- | --- | --- |
| Reference Name | No | Required. Name used to uniquely identify this source for lineage, annotating metadata, etc. |
| Repository owner name | Yes | Required. Username of the GitHub account that owns the repository from which the data is retrieved. |
| Repository name | Yes | Required. Name of the repository from which the data is retrieved. |
| Dataset name | Yes | Required. Name of the dataset to retrieve. |
| GitHub API hostname | Yes | Optional. Hostname of the GitHub API from which the data is retrieved. Set this only for GitHub Enterprise. Defaults to api.github.com. |
| Authorization token | Yes | Required. Authorization token used to authenticate to the GitHub API. |
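
To make the role of each property more concrete, the following is a minimal Python sketch of how these values might combine into a single GitHub API request. It is an illustration only, not the plugin's implementation. The function name `build_github_request` is hypothetical, and the sketch assumes that the dataset name maps directly to a repository sub-resource (for example `commits`) under the GitHub REST path `/repos/{owner}/{repo}/...`. The request is built but never sent.

```python
from urllib.parse import quote
from urllib.request import Request


def build_github_request(owner, repo, dataset, token, hostname="api.github.com"):
    """Build (but do not send) a GitHub API request from the source properties.

    Hypothetical helper: the mapping of "Dataset name" to a URL path
    segment is an assumption, not documented plugin behavior.
    """
    # Percent-encode each path segment so unusual characters stay valid.
    path = "/repos/{}/{}/{}".format(
        quote(owner, safe=""),
        quote(repo, safe=""),
        quote(dataset, safe="/"),
    )
    url = "https://" + hostname + path
    headers = {
        # GitHub accepts tokens in the Authorization header.
        "Authorization": "token " + token,
        "Accept": "application/vnd.github+json",
    }
    return Request(url, headers=headers)


if __name__ == "__main__":
    # Placeholder values; substitute a real owner, repository, and token.
    request = build_github_request("octocat", "hello-world", "commits", "YOUR_TOKEN")
    print(request.full_url)
```

Leaving `hostname` at its default mirrors the table above, where the GitHub API hostname is optional and only needs to be set for GitHub Enterprise.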



Created in 2020 by Google Inc.