...

  1. Create a connection to the data source.

  2. Double-click the dataset to start transforming it.

  3. For CSV file connections, parse the dataset.
    Note: You don’t need to parse datasets from Database and BigQuery connections.

  4. Use the Insights tab to profile and discover data quality issues.

  5. Add transformations to the recipe.

  6. When the recipe is complete, click Create a Pipeline.
    CDAP creates a pipeline with the source plugin and the Wrangler transformation.

  7. In the Pipeline Studio, review the Source plugin and Wrangler transformation and edit as required.

  8. In the Pipeline Studio, continue adding plugins to the pipeline.

  9. When satisfied with the pipeline and transformations, deploy the pipeline and run it.
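
To make steps 3 and 4 more concrete, the following is a minimal Python sketch of what parsing a CSV dataset and then profiling it for data quality issues amounts to. It is not CDAP or Wrangler code. The function names `parse_csv` and `profile` and the sample data are hypothetical, chosen only for illustration.

```python
import csv
import io

# Hypothetical sample standing in for the raw body of a CSV file.
SAMPLE = "id,name,price\n1,apple,0.50\n2,,0.75\n3,pear,\n"


def parse_csv(body: str) -> list[dict[str, str]]:
    """Split raw CSV text into rows keyed by the header line (step 3)."""
    return list(csv.DictReader(io.StringIO(body)))


def profile(rows: list[dict[str, str]]) -> dict[str, dict[str, int]]:
    """Count empty and distinct non-empty values per column (step 4)."""
    report = {}
    columns = rows[0].keys() if rows else []
    for column in columns:
        values = [row[column] for row in rows]
        empty = sum(1 for v in values if v is None or v.strip() == "")
        distinct = len({v for v in values if v and v.strip()})
        report[column] = {"empty": empty, "distinct": distinct}
    return report


rows = parse_csv(SAMPLE)
report = profile(rows)
for column, stats in report.items():
    print(column, stats)
```

Columns with a non-zero `empty` count are the kind of issue that the Insights tab is meant to surface before you add transformations to the recipe.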

...