/
Disabling Spark Auto-caching
Disabling Spark Auto-caching
In some cases, you might want to disable Spark auto-caching at the Engine config level to improve pipeline performance.
For example, it is highly recommended that you disable Spark auto-caching at the Engine config level before using the Data Cacher plugin in pipelines.
To disable Spark auto-caching:
In Pipeline Studio, click Configure.
Click Engine config > Show Custom Config.
In the Name field, enter
spark.cdap.pipeline.autocache.enable
and enterfalse
as the Value.Click Save.
, multiple selections available,
Related content
Data Cacher Plugin
Data Cacher Plugin
More like this
Creating a reusable pipeline with the GCS Argument Setter
Creating a reusable pipeline with the GCS Argument Setter
Read with this
Resources
Resources
More like this
Reusing Dataproc clusters
Reusing Dataproc clusters
Read with this
Parallel Processing
Parallel Processing
More like this
Optimizing Joiner Performance
Optimizing Joiner Performance
More like this
Created in 2020 by Google Inc.