Basis Technology Blog
You are currently viewing all posts tagged with Text Analytics.
Customer Hackathons: In the Trenches with Rosette Engineers
Basis Technology and Kyper Data Technologies engineers collaborate on code. Image by Alyssa Watson of Kyper. It’s 9am, the coffee is freshly brewed, and fingers are hovering over keyboards, poised to start. As with most hackathons, there’s a palpable buzz in the room, muted discussions of engineers eager to put their skills and expertise to […]Read more
What’s New in Highlight 7.2
This latest version of Highlight has significant enhancements for government linguists and translators that use this Microsoft Office plug-in to translate and standardize names between English and non-Latin languages—Arabic, Dari, Farsi, Korean, Mandarin Chinese, Pashto, and Russian.Read more
Elasticsearch and Fuzzy Name Matching Meetup, World Tour
Normalization is crucial to high quality search results — who wants irrelevant variations between queries and documents leading to missed hits (e.g., “celebrity” v. “celebrities”)? Normalizing dictionary words works, but what if your application focuses on names? Whether you’re tackling log analysis, e-commerce, watch list screening or other applications, names are often the key. Can […]Read more
Multilingual Search With Solr? No Problem!
Whitepaper – Optimizing Multilingual Search With Solr: Recall, Precision and Accuracy INTRODUCTION Today’s search application users expect search engines to just work seamlessly across multiple languages. They should be able to issue queries in any language against a document corpus that spans multiple languages, and get quality results back. This whitepaper offers the search application engineer […]Read more
Predictive Analytics Case Study
EMBERS Successfully Forecasts Future Events How Virginia Tech’s EMBERS project “beat the news” by predicting civil unrest in Latin America Is a fascinating case study in the power of Big Data, advanced text analytics, and human/computer collaboration. Case Study Summary Since November 2012, the EMBERS project has been accurately forecasting civil unrest events in Latin […]Read more
Accurate Language Detection for Queries & Tweets
Doubles the Accuracy of Existing Language Identification Software Basis Technology’s Rosette Language Identifier (RLI) has been improved to solve the problem of language detection for short texts. Existing language detectors require many words to confidently identify the language of a string of text, and are therefore unreliable when trying to detect the language of queries, […]Read more