Does the Rosette Base Linguistics SDK do term expansion? Ed Kenschaft August 24, 2022 15:22 Updated Follow Term expansion is the opposite of lemmatization, where you start with a lemma such as "get" and produce all the possible forms that it could derive from, such as "got", "getting", etc. We do not do this. The number of possible forms that any given lemma could expand into is large, especially when you consider obscure forms like, "The mail remained ungotten." Any attempt to produce an exhaustive list of possibilities leads to huge bloat, lots of red herrings, and inevitably some cases that you miss even so. We generally find that problems for which you hope to use term expansion can be readily adapted to use lemmatization instead, which is far more reliable and efficient. Write to firstname.lastname@example.org if you would like to work through the problem you are trying to address with an engineer at BasisTech. Related articles Getting Started Tokenizers I'm translating names in order to then fuzzy match them against a list - is there any way I can improve this matching probability with RNT? Integrating RBL into Solr How does tokenization of European languages work (RBL-JE)? Comments 0 comments Please sign in to leave a comment.