Learn about the world’s most widely used component library for multilingual text retrieval and analysis. Rosette provides automatic language identification, text normalization, entity extraction, name matching, and name translation from unstructured text, all in a single, unified framework.
The first step in analyzing or searching multilingual text is identifying the language and encoding of each document, or detecting all languages present in a multilingual document.
A widely deployed solution for implementing Unicode compliance in applications, or for converting files from legacy encodings into Unicode on a wide range of operating systems.
Provides essential linguistic analysis to enable full text search of multilingual documents—including tokenization, lemmatization, and decompounding—in languages covering Europe, Asia, and the Middle East.
Enable classification, management, and analysis of large volumes of unstructured text using advanced linguistics to locate such entities as names, places, organizations, dates, and other significant words or phrases.
Searching for names in a foreign language document or watch list? Rosette Name Indexer matches names of people, places, and organizations against a single, universal index. Match names in English, Arabic, Chinese, French, Korean, Persian, and Spanish regardless of misspellings, missing name components, and language variations.
Names of people, places, or other entities are of crucial importance in almost every field from finance to law enforcement, yet conventional translation systems are ill-suited to deal with the intricacies of name translation. Rosette Name Translator supports this essential requirement by translating names from foreign languages into accurate, standardized English translations. Supported languages include English, Arabic, Chinese, Korean, and Persian.
Political revolutions in the Middle East are being fueled by text messages, chat rooms, and other forms of social media. Much of this content is in Arabic which is spelled phonetically using Latin characters with wide variations in vocabulary, grammar, and spelling. The Rosette Chat Translator for Arabic accurately translates Arabic chat alphabet into standard Arabic script regardless of regional or dialectical variations.