-
Basis Technology
Update: Rosette 7 Basis Technology develops and supports
software products which address the foreign language needs of the defense,
intelligence, and law enforcement communities. Our software has been applied
to such missions as document and media exploitation; document triage; watch
list management; and geospatial fusion.
Presentation by Steve Kearns at Basis Technology’s
Government Users Conference in Chantilly, VA on June 8-9, 2010
-
Guided Tour
of the Rosette Linguistics Platform A presentation
outlining the full suite of the Rosette linguistics platform and its
components. Learn its capabilities as well as how it’s being used, how it
interacts, and how it can be tuned and customized to fit your
needs.
Presentation by Ken Glidden at Basis Technology’s
Government Users Conference on June 8, 2009.
-
Building
Applications with the Rosette Linguistics Platform The
presentation reviews the capabilities of Rosette, the applications for which it
can be used, and the techniques it employs. It also focuses on how Rosette can
be integrated and used in existing systems, and how it can be tuned for each
system’s requirements.
Presentation by Steve Cohen at Basis Technology’s
Government Users Conference in Washington, D.C. on June 14, 2006.
-
Multilingual Deep Web Search This presentation introduces
BrightPlanet’s product, technology, and unique placement in the ‘deep web’
space, and presents the close relationship with Basis Technology and its
Rosette linguistics platform.
Presentation by Duncan Witte and Dirk Koechner at Basis
Technology’s Government Users Conference in Washington, D.C. on June 7,
2007.
-
Language Identification: The First Step in Processing
Intelligence A prerequisite to processing multilingual
documents for search, intelligence analysis, e-discovery or digital
forensics is knowing the language and encoding system of each file.
Rosette Language Identifier (RLI) is one of the most widely used language
identifiers used in the commercial or government space today. Based on a
statistical model, RLI has performance advantages over dictionary-based
language identifiers and does not require updating as new vocabulary enters
a language. Unlike language identification that relies on character code
range, RLI easily distinguishes languages which share the same writing
system, such as Russian vs. Ukrainian, or Arabic vs. Persian.
This talk presents an overview of RLI and discuss the
techniques it uses to automatically identify document language and encoding
of documents. We will also discuss how to use all the data points from RLI
to measure confidence levels of an identification result or to achieve more
fine-grained results.
Presentation by Nobuo Otsuka at Basis Technology’s
Government Users Conference in Chantilly, VA on June 8-9, 2010
-
What
Language is That? Using Rosette Language Identifier
This presentation gives an overview of Rosette Language Identifier
(RLI) and the techniques RLI uses to automatically identify the language and
encoding of a block of text. It also explains how language and encoding
identification is an essential stage in the process of working with
unstructured multilingual text.
Presentation by Nobuo Otsuka at Basis Technology’s
Government Users Conference in Washington, D.C. on June 14, 2006.