The Rosette Morphology endpoint provides language-specific morphological tools for returning parts-of-speech, lemmas, and compound components, and han-readings for each token in the input.
Lemmas are the canonical dictionary form for each token.
For languages with compound words, Rosette divides the words into components, which improves recall for search engines.
For Chinese tokens in Han script, Rosette returns pinyin transcriptions as the Han reading. For Japanese tokens in Han script (kanji), Rosette returns hiragana transcriptions as the Han reading.
You can append a morphoFeature or a comma-delimited list of features to the /morphology/
endpoint to specify which feature or features you want Rosette to return. /morphology/complete
returns all results for all features available for the language of the input text.
If you'd like to try out Rosette's morphological analysis capabilities, sign up for a free trial of Rosette here. See the documentation for explanations of the different morphological features available in Rosette.
The morphology REST endpoint is:
https://api.rosette.com/rest/v1/morphology/{morphoFeature}
More Info: