Does the Rosette Base Linguistics SDK do term expansion? Ed Kenschaft June 28, 2019 13:55 Updated Follow Term expansion is the opposite of lemmatization, where you start with a lemma such as "get" and produce all the possible forms that it could derive from, such as "got", "getting", etc. We do not do this. The number of possible forms that any given lemma could expand into is large, especially when you consider obscure forms like, "The mail remained ungotten." Any attempt to produce an exhaustive list of possibilities leads to huge bloat, lots of red herrings, and inevitably some cases that you miss even so. We generally find that problems for which you hope to use term expansion can be readily adapted to use lemmatization instead, which is far more reliable and efficient. Write to email@example.com if you would like to work through the problem you are trying to address with an engineer at Basis. Related articles Rosette Cloud and Enterprise Release Notes How does tokenization of European languages work (RBL-JE)? What are alternative lemmas and what do we do with them? What are the Rosette SDKs? Where do I find the Java client binding? Comments 0 comments Please sign in to leave a comment.