Applications parameters (cdap-site.xml and cdap-default.xml)

Parameter Name

Default Value

Version Introduced

Description

app.artifact.compute.hash.time.bucket.days

15

6.8.0

If greater than 0, ensures that hash values are time-bucketed every x days. As a result, hashing an identical value x days apart produces different hash values.

app.artifact.dir

/opt/cdap/master/artifacts

 

Semicolon-separated list of local directories scanned for system artifacts to add to the artifact repository.

app.artifact.parallelism.max

50

6.6.0

Maximum parallelism of system artifact loading. Loading can be resource-intensive (for example, with the repository on GCS, an upload can use a 64 MB buffer), so this limit keeps App Fabric from being overwhelmed during the process.

app.bind.port

0

 

App Fabric service bind port; if 0, binds to a random port.

app.output.dir

/programs

 

Directory where all archives are stored.

app.program.extra.classpath

 

 

Additional Java classpath for CDAP programs. These extra classpaths must be present on all nodes in the cluster. Supports wildcard suffix "*" to include all JAR files under a directory.
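As a minimal sketch, an override in cdap-site.xml using the wildcard suffix might look like the following. The directory /opt/extra/lib is a hypothetical path chosen for illustration, and the Hadoop-style property layout is assumed.

```xml
<configuration>
  <!-- Hypothetical directory; it must be present on all nodes in the cluster. -->
  <!-- The trailing "*" includes all JAR files under that directory. -->
  <property>
    <name>app.program.extra.classpath</name>
    <value>/opt/extra/lib/*</value>
  </property>
</configuration>
```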

app.program.jvm.opts

${twill.jvm.gc.opts}

 

Java options for all Apache Twill containers.

app.max.concurrent.runs

-1

 

Maximum number of concurrent program runs allowed; set to -1 for unlimited. Program runs are those with an active run record.

app.max.concurrent.launching

-1

6.6.0

Maximum number of concurrent launching runs allowed; set to -1 for unlimited. Launching runs have an active run record with PENDING or STARTING status.
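The two limits above can be combined. The following sketch caps total active runs and, separately, the runs that are still launching. The values 200 and 20 are illustrative assumptions, not recommendations.

```xml
<configuration>
  <!-- Illustrative value: at most 200 program runs with an active run record. -->
  <property>
    <name>app.max.concurrent.runs</name>
    <value>200</value>
  </property>
  <!-- Illustrative value: at most 20 runs in PENDING or STARTING status. -->
  <property>
    <name>app.max.concurrent.launching</name>
    <value>20</value>
  </property>
</configuration>
```

Leaving either property at -1 keeps that limit disabled.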

run.record.monitor.record.age.threshold.seconds

3600

 

Maximum amount of time (in seconds) for which a run record is retained in the run record monitor. This safeguards flow control of launch requests: if a request is somehow stuck in the PENDING or STARTING state, it is dropped from the monitor after the threshold. Note that run.record.monitor.cleanup.interval.seconds might also need to be changed if this setting changes.

run.record.monitor.cleanup.interval.seconds

300

 

Cleanup interval (in seconds) at which the run record monitor service runs its cleanup logic to delete old entries. Note that run.record.monitor.record.age.threshold.seconds might also need to be changed if this setting changes.
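Because the age threshold and the cleanup interval interact, it may help to adjust them together. The sketch below doubles both defaults. The specific values are assumptions for illustration only.

```xml
<configuration>
  <!-- Illustrative value: retain run records in the monitor for up to 2 hours. -->
  <property>
    <name>run.record.monitor.record.age.threshold.seconds</name>
    <value>7200</value>
  </property>
  <!-- Illustrative value: run the cleanup logic every 10 minutes. -->
  <property>
    <name>run.record.monitor.cleanup.interval.seconds</name>
    <value>600</value>
  </property>
</configuration>
```

Keeping the cleanup interval well below the age threshold means stale entries are removed reasonably soon after they exceed the threshold.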

app.program.launch.threads

20

 

Size of the thread pool for launching programs.

app.program.max.start.seconds

300

 

Maximum number of seconds to wait for a program to start before killing it.

app.program.max.stop.seconds

300

 

Maximum number of seconds to wait for a program to stop before killing it.

app.program.runid.corrector.interval

180

 

Interval, in seconds, at which the run ID corrector thread runs; this value should be greater than 0.

app.program.runid.corrector.tx.batch.size

1000

 

Number of run records fetched per transaction when checking whether correction is needed. This value is directly proportional to the ${data.tx.timeout} setting.

app.program.local.dataset.deleter.initial.delay

300

 

Initial delay, in seconds, before the local dataset deletion thread first runs; this value should be greater than 0.

app.program.local.dataset.deleter.interval

3600

 

Interval, in seconds, at which the local dataset deletion thread runs; this value should be greater than 0.

app.program.runtime.extensions.dir

/opt/cdap/master/ext/runtimes

 

Semicolon-separated list of local directories that are scanned for program runtime extensions.

app.program.spark.event.logs.enabled

false

 

Enables Spark event logs collection.

app.program.spark.event.logs.dir

spark-histories

 

When ${app.program.spark.event.logs.enabled} is true, the directory under the ${root.namespace} for storing the logs.
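A sketch of enabling Spark event log collection together with a custom directory follows. The directory name spark-event-logs is a hypothetical value; the default shown in the table is spark-histories.

```xml
<configuration>
  <!-- Turn on Spark event logs collection (default is false). -->
  <property>
    <name>app.program.spark.event.logs.enabled</name>
    <value>true</value>
  </property>
  <!-- Hypothetical directory name, resolved under ${root.namespace}. -->
  <property>
    <name>app.program.spark.event.logs.dir</name>
    <value>spark-event-logs</value>
  </property>
</configuration>
```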

app.program.spark.yarn.client.rewrite.enabled

true

 

Specify whether to rewrite the YARN 'Client.scala' class in Spark to work around issue SPARK-13441 in CDH clusters.

app.program.status.event.fetch.size

100

 

Maximum number of events to fetch from the messaging system in each processing cycle for program status update events.

app.program.status.event.poll.delay.millis

2000

 

The delay, in milliseconds, before checking again for new program status events after detecting that there was no event.

app.program.yarn.attempt.failures.validity.interval

60000

 

The interval, in milliseconds, of the time window used by the YARN Resource Manager to check an application's maximum failure attempts. By default, this is used only for long-running programs, but it can be overridden through the runtime argument system.yarn.attempt.failures.validity.interval.

app.program.transaction.control

implicit

 

Defines how transactions are controlled for program method invocations; "implicit" means that the platform encloses method execution in a transaction, whereas "explicit" means that the method itself is in control of executing transactions.
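For instance, switching from the default to explicit transaction control could be sketched as below, assuming the override is placed in cdap-site.xml.

```xml
<configuration>
  <!-- "explicit": program methods are responsible for executing their own transactions. -->
  <property>
    <name>app.program.transaction.control</name>
    <value>explicit</value>
  </property>
</configuration>
```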

app.temp.dir

/tmp

 

Temp directory.

apps.scheduler.queue

 

 

Scheduler queue for CDAP programs and CDAP Explore queries.

app.deploy.update.schedules

true

 

If true, redeploying an application will modify any schedules that currently exist for the application; if false, redeploying an application does not create any new schedules and existing schedules are neither deleted nor updated. This property only affects the redeployment of an application; all related actions or endpoints are unaffected.
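To keep existing schedules untouched on redeployment, the property can be set to false. This is a minimal sketch that assumes the usual Hadoop-style property layout of cdap-site.xml.

```xml
<configuration>
  <!-- With false, redeploying an application neither creates, updates, nor deletes schedules. -->
  <property>
    <name>app.deploy.update.schedules</name>
    <value>false</value>
  </property>
</configuration>
```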

master.services.bind.address

0.0.0.0

 

Bind address for app fabric service and dataset service.

program.container.dist.jars

 

 

Additional JARs to be localized to every program container and added to the classpaths of CDAP programs. They can be local file paths on the CDAP Master or URIs of remote files. Multiple JAR files are comma-separated.
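The sketch below lists one local path and one remote URI, separated by a comma. Both locations are hypothetical, and which URI schemes are supported depends on the deployment.

```xml
<configuration>
  <!-- Hypothetical entries: a local file on the CDAP Master and a remote file URI. -->
  <property>
    <name>program.container.dist.jars</name>
    <value>/opt/cdap/extra/metrics-helper.jar,hdfs://namenode:8020/libs/shared-udfs.jar</value>
  </property>
</configuration>
```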

scheduler.job.queue.num.partitions

16

 

Number of partitions in the scheduler's job queue. This is the same as the number of constraint checker threads started by the scheduler.

scheduler.max.thread.pool.size

100

 

Size of the scheduler thread pool.

scheduler.misfire.threshold.ms

60000

 

The number of milliseconds by which a schedule execution can miss its next-fire-time and still run.

scheduler.event.poll.delay.millis

2000

 

The delay, in milliseconds, before the scheduler checks again for new events after it detects that there was no event.

scheduler.time.event.fetch.size

100

 

Maximum number of events to fetch from the messaging system in each processing cycle for time schedule events.

scheduler.stream.size.event.fetch.size

100

 

Maximum number of events to fetch from the messaging system in each processing cycle for stream size schedule events.

scheduler.data.event.fetch.size

100

 

Maximum number of events to fetch from the messaging system in each processing cycle for data schedule events.

scheduler.program.status.event.fetch.size

100

 

Maximum number of events to fetch from the messaging system in each processing cycle for program status schedule events.

time.event.topic

timeevent

 

Topic name for publishing time events from time scheduler to the messaging system.

program.status.event.topic

programstatusevent

 

Topic name for publishing status transitioning events of program runs to the messaging system.

program.status.record.event.topic

programstatusrecordevent

 

Topic name for publishing program status recording events to the messaging system.

workflow.token.max.size.mb

30

 

Maximum allowed size in megabytes of a workflow token; if the workflow token exceeds this size, no further updates are allowed.

requirements.datasetTypes.exclude.list

 

 

Comma-separated list of plugin dataset type requirements that cannot be met in the instance. Plugins that require any of these are treated as if they don't exist. For example, if 'table' is given, any plugin that requires 'table' is treated as if it doesn't exist.
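The 'table' example from the description could be expressed as the following sketch, assuming the override is placed in cdap-site.xml.

```xml
<configuration>
  <!-- Plugins that require the 'table' dataset type are treated as if they don't exist. -->
  <property>
    <name>requirements.datasetTypes.exclude.list</name>
    <value>table</value>
  </property>
</configuration>
```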

ui.theme.file

server/config/themes/default.json

 

File containing the theme to be used in UI.

Created in 2020 by Google Inc.