Solex: Lexum’s Latest Search Engine

In the movie The Man with the Golden Gun, the Solex is a revolutionary device that is meant to solve the 1973 energy crisis. After killing its British inventor, an elite assassin steals the Solex to sell it to foreign powers. James Bond is dispatched to find the assassin and recover the precious device. Because this is a James Bond movie, there’s also a laser.

The Solex Agitator

Solex also stands for SolrCloud Lexum plugins, the latest iteration of the search engine Lexum deploys in all of its products.

Lexum has used a wide variety of search engines throughout its history. It all started at the dawn of the Web, in 1994, with the Wide Area Information Server (WAIS). Then came the NQL search engine from a local Montreal firm, followed, for a year or so, by AustLII’s SINO search engine. In 2003, we elected to build a search engine of our own: Eliisa, a faster, more capable search engine library based on Apache Lucene. Finally, in 2009, we integrated Apache Solr elements into Eliisa and turned it into a standalone server application.

Today, we are happy to announce the release of our third-generation search engine: Solex.

Over the years, we’ve added a number of features to the stock Apache Lucene/Solr search platforms: faster result list “snippet” generation, phrase query performance improvements, whole-document highlighting, phrase and sentence proximity operators, a smart auto-complete mechanism for document identifiers, a lenient query parser with a custom syntax, HTML-aware indexing, citation indexing and highlighting, noteup counting and more.

So, what does Solex bring? The answer is scalability, better performance and flexibility.

For the last fifteen years, Lexum has proudly provided CanLII with the fastest legal search engine in Canada. However, growth in content and traffic has made our exacting performance standards harder to maintain. Nowadays, CanLII indexes several billion words of content and handles fifteen queries per second on average, with frequent spikes of 50 or more queries per second. Quite simply, our previous search engine had reached the limit of what could be done in a single server process. Solex, our new search engine, is based on Apache SolrCloud, a technology Netflix, Instagram, Reddit, and other Internet giants rely on for their own search platforms. Solex scales horizontally by distributing content and queries to as many servers as necessary. As a result, response times are better and more consistent, with speedups of up to 500% for certain queries, ensuring that users of Lexum’s products enjoy the fastest response times available for many more years.
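To give a feel for what "distributing content and queries" means, here is a toy sketch in Python. It is not Solex or SolrCloud code: the `Shard` and `Cluster` classes, the `shard_for` function, the MD5-based routing and the term-frequency scoring are all assumptions made up for this illustration. Each document is routed to one shard by a hash of its identifier, every query is sent to all shards, and the partial result lists are merged into a single ranking.

```python
import hashlib
import heapq
from collections import Counter

NUM_SHARDS = 3  # hypothetical cluster size, chosen for the example


def shard_for(doc_id: str, num_shards: int = NUM_SHARDS) -> int:
    """Route a document to a shard using a stable hash of its identifier."""
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_shards


class Shard:
    """One server's slice of the index (illustrative only)."""

    def __init__(self):
        self.docs = {}  # doc_id -> Counter of term frequencies

    def add(self, doc_id, text):
        self.docs[doc_id] = Counter(text.lower().split())

    def search(self, term, k):
        # Score = raw term frequency, a stand-in for a real ranking function.
        term = term.lower()
        hits = [(tf[term], doc_id) for doc_id, tf in self.docs.items() if tf[term] > 0]
        return heapq.nlargest(k, hits)


class Cluster:
    """Spreads documents over shards and merges per-shard results."""

    def __init__(self, num_shards=NUM_SHARDS):
        self.shards = [Shard() for _ in range(num_shards)]

    def add(self, doc_id, text):
        self.shards[shard_for(doc_id, len(self.shards))].add(doc_id, text)

    def search(self, term, k=10):
        partial = []
        for shard in self.shards:  # a real system would query shards in parallel
            partial.extend(shard.search(term, k))
        return [doc_id for _score, doc_id in heapq.nlargest(k, partial)]


cluster = Cluster()
cluster.add("a", "contract contract contract law")
cluster.add("b", "contract law")
cluster.add("c", "tort law")
print(cluster.search("contract", k=2))
```

Because each shard only holds and searches its own slice, adding servers shrinks the work done per server, which is the intuition behind horizontal scaling.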

Although Solex modestly improves the ranking precision of top results over our previous-generation engine, the best is yet to come. A distributed architecture will give us the flexibility to experiment with new, more processor-hungry relevance algorithms based on machine learning, which we hope will further improve relevance across all our products and provide an even better experience to every user.

Solex was deployed on CanLII on Monday, Feb. 20th, 2018, and will be deployed on our other products in the coming weeks.

Our Solex might not solve the next energy crisis, but we hope it will provide a solution to your next legal research question.