Apache Lucene Ultra Search Utility
Apache lucene sets the standard for Search and Indexing performance
It is high performance and full-feature text search engine fully written using java. It is used to any application that requires full-text search, especially cross platform.
It is open source project we can download it freely. Using this link to download the apache lucene from server download .
It is high performance and full-feature text search engine fully written using java. It is used to any application that requires full-text search, especially cross platform.
It is open source project we can download it freely. Using this link to download the apache lucene from server download .
Features:
1. Scalable and High performance Indexing
- Small Ram require i.e 1 MB Heap
- Incremental indexing is faster than Batch indexing
2. Powerful, Accurate and Efficient Searching algorithm
- It is support Ranked Searching i.e best result return first
- It is allow Fielded searching
- It is allow Sorting by any order
- It support Multiple-index searching
- Allow concurrency access i.e at the same time indexing and updating it
- It is easy to fetching, join, and grouping data
- It is fast retrieve
3. Stop-word processing
Common words, Such as "a", "and" and "the" add little value to a search index. But since these words are so common, cataloging them will contribute considerably to the indexing time and index size. Most search engines not indexing certain words, called as stop words.
Lucene handles stop words with the more general ANALYZER mechanism. the StopAnalyzer call which eliminates stop words from the input stream.
4. Stemming
User desires a query for one word to match other similar words. For example , a query for "jump" should probably also match the words "jumped", "jumper" or "jumps". Reducing a word to its root form is called STEMMING.
Lucene does not yet implemented stemming. But it can be achived by using ANALYZER class.
5. Non-English Support
Most search engines implicitly support english words. Use that words as query words. But lucene preprocess the input stream via ANALYZER class. It is provide the language specific filtering.
6. Cross-Platform Solution
- It is purely Java
- It is available in open source apache
For more details visit Apache Lucene5.1
Comments
Post a Comment