Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »

The NGramTransform Spark Compute analytics plugin is available in the Hub.

Transforms the input features into n-grams, where n-gram is a sequence of n tokens (typically words) for some integer ā€˜nā€™.

For example, a bio data scientist wants to study the sequence of the nucleotides using the input stream of DNA sequencing to identify the bonds. The input Stream contains the DNA sequence eg AGCTTCGA. The output contains the bigram sequence AG, GC, CT, TT, TC, CG, GA.

  • No labels