Apache Lucene Ultra Search Utility

 Apache lucene sets the standard for Search and Indexing performance

It is high performance and full-feature text search engine fully written using java. It is used to any application that requires full-text search, especially cross platform.

It is open source project we can download it freely. Using this link to download the apache lucene from server download .

Features:

1. Scalable and High performance  Indexing
  • Small Ram  require i.e 1 MB Heap
  • Incremental indexing is faster than Batch indexing 
2. Powerful, Accurate and Efficient Searching algorithm
  •  It is support Ranked Searching i.e best result return first
  •  It is allow Fielded searching
  •  It is allow Sorting by any order
  •  It support Multiple-index searching 
  •  Allow concurrency access i.e at the same time indexing and updating it
  •  It is easy to fetching, join, and grouping data
  •  It is fast retrieve 

3. Stop-word processing
        
      Common words, Such as "a", "and" and "the" add little value to a search index. But since these words are so common, cataloging them will contribute considerably to the indexing time and index size. Most search engines not indexing certain words, called as stop words. 
       Lucene  handles stop words with the more general ANALYZER mechanism. the StopAnalyzer call which eliminates stop words from the input stream.

4. Stemming
       
        User desires a query for one word to match other similar words. For example , a query for "jump" should probably also match the words "jumped", "jumper" or  "jumps". Reducing a word to its root form is called STEMMING.
        Lucene does not yet implemented stemming. But it can be achived by using ANALYZER class.

5. Non-English Support

        Most search engines implicitly support english words. Use that words as query words. But lucene preprocess the input stream via ANALYZER class. It is provide the language specific filtering.

6. Cross-Platform Solution
  • It is purely Java
  • It is available in open source apache         
For more details visit Apache Lucene5.1

Comments

Popular posts from this blog

Pyhton auto post to blogger using Google blogger API

Connect VPN via Python

Website crawl or scraping with selenium and python