GitHub provides hosting for software development version control using Git.This plugin would allow users to select the data sets associated with the specified repository and collect raw level data.
User Expectations
Users would like to collect raw data sets associated with a specific repository so that they can perform monitoring and reporting on it
User would like to perform aggregations on GitHub datasets so that they can get better understanding of the repository usage
Valid values include all the objects listed in the above link.
* Dataset name can be one of the following: Branches, Collaborators, Comments, Commits, Contents, Deploy Keys, Deployments, Forks, Invitations, Pages, Releases, Traffic:Referrers, Webhooks)
** Retrieving GitHub data would always call list API for the associated object. For instance, if Collaborators dataset was selected, the plugin would get the list of all the collaborators on the specified repository (along with other associated fields returned by List Collaborators API)
Design / Implementation Tips
Plugin will be implemented using third-part The GitHub Java API library, It is part of the GitHub Eclipse Connector and aims to support the entire GitHub v3 API.
Authentication will be performed using access token.
Output schema must be automatically generated from selected data.