The STEMMING directive applies the Porter stemmer algorithm for English words. The Porter stemmer has excellent trade-off between speed, readability, and accuracy. It stems using a set of rules (transformations) applied in a succession of steps. Generally, it applies approximately 60 rules in 6 steps.
Syntax
Code Block |
---|
stemming <column>:column |
The <column>
column
contains a bag of words of type string array or type string list.
...
Applying this directive:
Code Block |
---|
stemming :word |
The result would be this record:
...