RDD Repartitioner Analytics
The RDD Repartitioner analytics plugin is available in the Hub.
This plugin repartitions a Spark RDD.
Configuration
Property | Macro Enabled? | Description |
---|---|---|
Partitions | Yes | Required. Number of partitions to use when repartitioning the data. If not specified, the execution framework decides how many to use. Default is 1. |
Shuffle Data | Yes | Required. Specifies whether records are shuffled across partitions during repartitioning. Default is false. |
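The two properties interact, and a small model may make that easier to see. The sketch below is a pure-Python toy, not the plugin's implementation: it assumes the plugin behaves like Spark's `RDD.coalesce(numPartitions, shuffle)`, where a partition count can only be increased when shuffling is enabled. The function name `repartition` and the sample data are hypothetical and used for illustration only.

```python
from typing import List, TypeVar

T = TypeVar("T")


def repartition(partitions: List[List[T]], num_partitions: int, shuffle: bool) -> List[List[T]]:
    """Toy model of repartitioning (assumed coalesce-like semantics)."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")

    if shuffle:
        # With a shuffle, every record may move, so the partition count
        # can grow or shrink. Records are dealt round-robin for an even spread.
        shuffled: List[List[T]] = [[] for _ in range(num_partitions)]
        records = [record for part in partitions for record in part]
        for i, record in enumerate(records):
            shuffled[i % num_partitions].append(record)
        return shuffled

    # Without a shuffle, whole input partitions are merged together,
    # so the count can only stay the same or shrink.
    target = min(num_partitions, len(partitions))
    merged: List[List[T]] = [[] for _ in range(target)]
    for i, part in enumerate(partitions):
        # Contiguous input partitions map onto the same output partition.
        merged[i * target // len(partitions)].extend(part)
    return merged


data = [[1, 2, 3], [4], [5, 6], [7, 8, 9, 10]]

print(repartition(data, 2, shuffle=False))      # [[1, 2, 3, 4], [5, 6, 7, 8, 9, 10]]
print(len(repartition(data, 8, shuffle=False))) # 4: cannot grow without a shuffle
print(repartition(data, 3, shuffle=True))       # [[1, 4, 7, 10], [2, 5, 8], [3, 6, 9]]
```

Under these assumptions, leaving Shuffle Data at false is the cheaper choice when reducing the partition count, while enabling it is what allows the count to grow and evens out skewed partitions at the cost of moving records between them.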
Created in 2020 by Google Inc.