Rosette Base Linguistics for European Languages
Comprehensive morphological analysis of European language text
Text mining and information retrieval on European language text produce more accurate results when documents have gone through a thorough linguistic analysis. Our Rosette Base Linguistics for European Languages helps applications overcome many of the challenges that can lead to inaccurate processing, such as compound nouns (common in German) and contractions such as “l’” in French (as in “l’eau”).
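To make these two challenges concrete, here is a minimal Python sketch of splitting a French contraction and a German compound noun. It is only an illustration and does not use or represent the Rosette API. The names `TOY_LEXICON`, `split_elision`, and `decompound` are hypothetical, and the tiny lexicon and two-part split are simplifying assumptions.

```python
import re

# Hypothetical toy lexicon (an assumption for this sketch);
# a real analyzer would rely on a full dictionary and morphology rules.
TOY_LEXICON = {"kontroll", "systeme", "jugend", "arbeit"}

# Common French elided forms (l', d', j', qu', ...) followed by a word.
# Accepts both the straight and the typographic apostrophe.
ELISION = re.compile(r"^((?:qu|[cdjlmnst])['’])(\w+)$", re.IGNORECASE)


def split_elision(token):
    """Split an elided token such as "l'eau" into its two parts."""
    match = ELISION.match(token)
    if not match:
        return [token]
    return [match.group(1), match.group(2)]


def decompound(word, lexicon=TOY_LEXICON):
    """Split a compound into two lexicon entries, or return it unchanged."""
    lower = word.lower()
    for i in range(1, len(lower)):
        # Accept the first split point where both halves are known words.
        if lower[:i] in lexicon and lower[i:] in lexicon:
            return [word[:i], word[i:]]
    return [word]


print(split_elision("l'eau"))
print(decompound("Kontrollsysteme"))
```

Without such splitting, a search index would treat “l’eau” and “eau”, or “Kontrollsysteme” and “Systeme”, as unrelated terms, which is the kind of inaccuracy the analysis is meant to avoid.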
Features:
- Lemmatization
| Input | Output |
|---|---|
| lu | lire |
| gezogen | ziehen |
- Part of Speech Analysis
| Input | Output |
|---|---|
| éditeurs | Plural Noun |
| Heiße | Adjective |
- Noun Decompounding
| Input | Output |
|---|---|
| Kontrollsysteme | [Kontroll] [systeme] |
| Jugendarbeit | [Jugend] [arbeit] |
- Context-Based Analysis