Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The STEMMING directive applies the Porter stemmer algorithm for English words. The Porter stemmer has excellent trade-off between speed, readability, and accuracy. It stems using a set of rules (transformations) applied in a succession of steps. Generally, it applies approximately 60 rules in 6 steps.

Syntax

Code Block
stemming <column>:column

The <column>column contains a bag of words of type string array or type string list.

...

Applying this directive:

Code Block
stemming :word

The result would be this record:

...