Published 2017-03-28 00:25:59 | Source: reader submission
Apache Lucene: a full-text search engine toolkit
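Lucene started as a subproject of the Apache Software Foundation's Jakarta project and is now a top-level Apache project. It is an open-source full-text search toolkit: not a complete search engine in itself, but a framework for building one. It provides a complete query engine and indexing engine, plus a partial text analysis engine (for two Western languages, English and German). Lucene aims to give software developers a simple, easy-to-use toolkit for adding full-text search to a target system, or for building a complete full-text search engine on top of it.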
Apache Lucene 6.5.0 has been released.
This release includes numerous bug fixes, optimizations, and improvements. Some of the notable ones are:
It is now possible to filter out duplicates in the NRT suggester
SimpleQueryString now supports default fuzziness
IndexWriter can return the list of visible field names
DisjunctionScorer now supports returning the matching children clauses
A new FunctionScoreQuery that modifies the internal query's score using the per-document values
A new FunctionMatchQuery that returns any documents with a value that matches a predicate
A new WordDelimiterGraphFilter that outputs a correct graph structure for multi-token expansion at query time
A new SimplePatternTokenizer that uses Lucene's RegExp implementation
RangeFieldQuery now supports the CROSSES relation
A new IndexOrDocValuesQuery that uses either an index (points or terms) or doc values to run a range, geo box, or distance query, depending on which one is more efficient
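
The idea behind the last item, choosing between an index and doc values based on cost, can be illustrated with a small toy model. This is only a sketch in Python and not Lucene's code: `ToyRangeIndex`, `filtered_range`, and the cost rule (compare the number of documents in the range with the number of candidate documents from another clause) are all hypothetical names and assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


class ToyRangeIndex:
    """Toy stand-in for one numeric field stored as both an index and doc values."""

    def __init__(self, values):
        # "Doc values": value of the field for each doc id (position = doc id).
        self.doc_values = list(values)
        # "Index": (value, doc id) pairs sorted by value, good for range lookups.
        self.sorted_pairs = sorted((v, d) for d, v in enumerate(values))
        self.sorted_values = [v for v, _ in self.sorted_pairs]

    def range_cost(self, lo, hi):
        """Number of docs an index-based range lookup would have to visit."""
        return bisect_right(self.sorted_values, hi) - bisect_left(self.sorted_values, lo)

    def range_via_index(self, lo, hi):
        """All doc ids whose value lies in [lo, hi], found through the sorted index."""
        start = bisect_left(self.sorted_values, lo)
        end = bisect_right(self.sorted_values, hi)
        return {d for _, d in self.sorted_pairs[start:end]}

    def matches_via_doc_values(self, doc, lo, hi):
        """Check a single doc by reading its stored value."""
        return lo <= self.doc_values[doc] <= hi


def filtered_range(index, candidates, lo, hi):
    """Return (strategy, docs) for the candidates whose value lies in [lo, hi].

    Assumed cost rule: if the range matches no more docs than there are
    candidates, walk the index; otherwise verify each candidate via doc values.
    """
    if index.range_cost(lo, hi) <= len(candidates):
        return "index", index.range_via_index(lo, hi) & set(candidates)
    matching = {d for d in candidates if index.matches_via_doc_values(d, lo, hi)}
    return "doc_values", matching


index = ToyRangeIndex([5, 50, 7, 99, 42, 8, 61, 3])
print(filtered_range(index, [1, 4], 0, 100))              # wide range, few candidates
print(filtered_range(index, [0, 1, 2, 3, 4, 5], 40, 45))  # narrow range, many candidates
```

In the first call the range covers every document, so checking the two candidates one by one is cheaper. In the second call the range matches a single document, so the index lookup is cheaper. Both paths return the same set of documents; only the amount of work differs.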