Disabling Spark Auto-caching
In some cases, you might want to disable Spark auto-caching at the Engine config level to improve pipeline performance.
For example, it is highly recommended that you disable Spark auto-caching at the Engine config level before using the Data Cacher plugin in pipelines.
To disable Spark auto-caching:
In Pipeline Studio, click Configure.
Click Engine config > Show Custom Config.
In the Name field, enter spark.cdap.pipeline.autocache.enable and enter false as the Value.
Click Save.
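The name/value pair entered in the steps above can be pictured as a simple key-value mapping. The following is a minimal sketch only: the `disable_autocache` helper and the dict representation of the Custom Config entries are hypothetical illustrations, not part of any CDAP API, and the `spark.executor.memory` entry is just an example of a pre-existing setting. Only the property name and the value `false` come from this page.

```python
# Hypothetical sketch: models the Custom Config name/value pairs as a plain dict.
# This is NOT a CDAP API; it only illustrates the pair entered in Pipeline Studio.

AUTOCACHE_KEY = "spark.cdap.pipeline.autocache.enable"


def disable_autocache(custom_config: dict[str, str]) -> dict[str, str]:
    """Return a copy of the custom config with Spark auto-caching turned off."""
    updated = dict(custom_config)      # copy so the caller's dict is not mutated
    updated[AUTOCACHE_KEY] = "false"   # the Value field takes the text "false"
    return updated


if __name__ == "__main__":
    existing = {"spark.executor.memory": "2g"}  # illustrative existing entry
    print(disable_autocache(existing))
```

The value is kept as the string "false" because the Custom Config fields in the steps above take text for both Name and Value.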
Created in 2020 by Google Inc.