lucene+paoding亲密接触

发布于 2015-05-23 07:55:43 | 144 次阅读 | 评论: 0 | 来源: 网友投递

Apache Lucene全文检索引擎工具包

Lucene是apache软件基金会4 jakarta项目组的一个子项目，是一个开放源代码的全文检索引擎工具包，即它不是一个完整的全文检索引擎，而是一个全文检索引擎的架构，提供了完整的查询引擎和索引引擎，部分文本分析引擎（英文与德文两种西方语言）。Lucene的目的是为软件开发人员提供一个简单易用的工具包，以方便的在目标系统中实现全文检索的功能，或者是以此为基础建立起完整的全文检索引擎。

lucene是什么？lucene是一个开源的，广泛应用的，对数据进行索引、查询的一个框架，更详细的介绍请查看www.lucene.com.

下面简单的描述一下索引和查询过程。

1. 做索引简单过程：

//获取索引存储路径

String strindexDir = “”;

File indexDir = new File(strindexDir);

// 分词器
Analyzer analyzer = new PaodingAnalyzer(); // 中文分词庖丁解牛

//获取写索引对象，准备开始做索引，索引对象相当于数据库中的表
indexWriter = new IndexWriter(indexDir, analyzer, true);

//构造Document对象，Document相当于数据库中的一条记录
Document doc = new Document();

// 给记录中的字段赋值
doc.add(new Field(“name”, "value", Field.Store.NO, Field.Index.ANALYZED));

doc.add(new Field(“name”, "value", Field.Store.NO, Field.Index.ANALYZED));

//表记录加入表中
indexWriter.addDocument(doc);

// 做完索引之后，对索引进行优化
indexWriter.optimize();

//关闭资源
indexWriter.close();

2.查询过程

//存放查询结果
List<HashMap<String, String>> rs = new ArrayList<HashMap<String, String>>();

//获取分析器，做索引的时候需要分析，查询的时候也需要分析
Analyzer analyzer = new PaodingAnalyzer();

//构造查询对象，queryField是查询那个Field，相当于数据记录的哪个字段，q是查询关键字
query = new QueryParser(queryField, analyzer).parse(q);

//下面这句是可以看到对查询关键字的分析
System.out.println("query key analyze result: " + query.toString());

//获取索引对象，进行查询
   IndexReader reader = IndexReader.open(indexDir);
   Searcher searcher = new IndexSearcher(reader);
   Hits hits = searcher.search(query);
   searcher.close();

//从查询结果中获取信息
Document doc = null;
HashMap<String, String> item = new HashMap<String, String>();

item.put("name", "value");

item.put("name", "value");
rs.add(item);

//释放资源
reader.close();

做索引，查询就被这么短短几行搞定，如此看来，lucene好简单，甚强大。其实不然，要游刃有余，需深入磨练。

最新网友评论 共有(0)条评论发布评论返回顶部

Apache Lucene全文检索引擎工具包

后端技术

前端技术

数据库

热门框架

常用IDE

其他