How does Wordbee deal with morphology?
Question
How does Wordbee deal with morphology?
Answer
Terminology and translation memory lookups use a morphological engine. It is implemented through language-specific stemming, currently available for 32 languages. For all other languages, a default suffix-only stemming is applied in conjunction with the regular non-stemmed analysis of the text. Decomposition is implemented to a limited extent at a higher layer, where the hit similarity percentage is calculated. Note that CJK languages are tokenized with CJK-specific mechanisms that rely on decomposition rather than stemming.
Language-specific stemming is supported for:
arabic
armenian
basque
bulgarian
catalan
cjk
czech
danish
dutch
english
finnish
french
galician
german
greek
hindi
hungarian
indonesian
irish
italian
latvian
lithuanian
norwegian
persian
portuguese
romanian
russian
sorani
spanish
swedish
thai
turkish
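To illustrate the lookup behavior described in the answer, here is a minimal sketch in Python. It is not Wordbee's implementation: the stemmers are toy stand-ins, and all names (`STEMMERS`, `default_suffix_stem`, `english_stem`, `analyze`, `term_matches`) are hypothetical. It only shows the general idea of picking a language-specific stemmer when one exists, falling back to suffix-only stemming otherwise, and keeping the non-stemmed form alongside the stem. Keeping both forms for every language is an assumption made here for simplicity.

```python
from typing import Callable, Dict, Set

# Hypothetical suffix list for the fallback stemmer (illustrative only).
DEFAULT_SUFFIXES = ("ing", "ed", "es", "s")


def default_suffix_stem(token: str) -> str:
    """Fallback: strip the first matching suffix, keeping at least 3 characters."""
    for suffix in DEFAULT_SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def english_stem(token: str) -> str:
    """Toy stand-in for a language-specific stemmer (not a real algorithm)."""
    if token.endswith("ies") and len(token) > 4:
        return token[:-3] + "y"
    return default_suffix_stem(token)


# Hypothetical registry: language name -> stemmer function.
STEMMERS: Dict[str, Callable[[str], str]] = {"english": english_stem}


def pick_stemmer(language: str) -> Callable[[str], str]:
    """Use the language-specific stemmer if registered, else the default."""
    return STEMMERS.get(language, default_suffix_stem)


def analyze(text: str, language: str) -> Set[str]:
    """Return both the non-stemmed and the stemmed form of every token."""
    stem = pick_stemmer(language)
    forms: Set[str] = set()
    for token in text.lower().split():
        forms.add(token)        # regular non-stemmed analysis
        forms.add(stem(token))  # stemmed analysis
    return forms


def term_matches(term: str, segment: str, language: str) -> bool:
    """True if every token of the term occurs in the segment, raw or stemmed.

    Word order is ignored in this sketch.
    """
    stem = pick_stemmer(language)
    segment_forms = analyze(segment, language)
    return all(
        token in segment_forms or stem(token) in segment_forms
        for token in term.lower().split()
    )
```

With this sketch, `term_matches("memories", "Translation memory lookup", "english")` is true because both sides reduce to `memory`, while a language without a registered stemmer still gets suffix-only matching (for example, `linked` matches `link`).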