Monday, February 13, 2012

8 Best Open Source Searchengines built on top of Lucene

Lucene is most powerful and widely used Search engine. Here is the list of 7 search engines which is built on top of Lucene. You could imagine how powerful they are.

Apache Solr
Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.http://lucene.apache.org/solr/

Elastic Search
ElasticSearch could be opted if, We want our search solution to be fast, we want a painless setup and a completely free search schema, we want to be able to index data simply using JSON over HTTP, we want our search server to be always available, we want to be able to start with one machine and scale to hundreds, we want real-time search, we want simple multi-tenancy, and we want a solution that is built for the cloud.http://www.elasticsearch.com

Index Tank
IndexTank search engine powers search in Reddit, Social bookmarking site. IndexTank is acquired by LinkedIn and released the project as open source. It includes features like Variables boosts, Facets, Faceted search, Snippeting, Custom scoring functions, Suggest, and Autocomplete.https://github.com/linkedin/indextank-engine

Katta
Katta is a scalable, failure tolerant, distributed, data storage for real time access. Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles. http://katta.sourceforge.net/

Bobo Search Bobo Browse is an information retrieval technology that provides navigational browsing into a semi-structured dataset. Beyond the result set from queries and selections, Bobo Browse also provides the facets from this point of browsing. It provides support to sort documents on fields that have multiple values. It is stable and used by LinkedIn. https://github.com/javasoze/bobo

Compass
Compass provides real time searchengine built on top of Lucene. It is distributed, transcational, supports Spring MVC, integrates with ORM. Compass provides google-style search, index updates as well as more advanced concepts such as caching and index sharding (sub indexes). http://www.compass-project.org/

Summa
Summa is a fast modular and scalable search engine written in Java. It can simultaneously access a number of different data and data sources and expose it in a unified interface. It supports distributed architecture and fault tolerant. It can be scaled up or down to handle any amount of data.http://wiki.statsbiblioteket.dk/summa/

Constellio
Constellio Open Source Enterprise Search is based on Apache Solr and using Google Search Appliances connectors architecture, it allows, with a single click, to find all relevant content in your organization (Web, email, ECM, CRM etc.).http://constellio.com/

No comments:

Post a Comment

ShareThis

Bookmark and Share