Note: Datasets are deprecated and will be removed in CDAP 7.0.0.
Datasets store and retrieve data. Datasets are your means of reading from and writing to the CDAP’s storage capabilities. Instead of forcing you to manipulate data with low-level APIs, datasets provide higher-level abstractions and generic, reusable implementations of common data patterns.
Impersonation
Impersonation allows users to run programs and access datasets, streams, and other resources as pre-configured users (a principal). CDAP supports configuring impersonation at a namespace and at an application level, with application level configuration having a higher precedence than namespace level.
If impersonation is enabled, and you don't specify a principal for an application or dataset, then the namespace owner's principal is used. If there is no namespace owner or you are using the default namespace, then the default principal is used (as set by the properties cdap.master.kerberos.keytab
and cdap.master.kerberos.principal
in the cdap-site.xml
).