Published 2016-09-28 23:59:41 | Source: reader submission
Apache UIMA: Unstructured Information Management Applications
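Unstructured Information Management (UIM) applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. A typical UIM application extracts useful information from text documents, for example entities such as persons, addresses, and organizations.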
Apache UIMA Ruta 2.5.0 has been released. Apache UIMA Ruta is a rule-based scripting language.
Changes:
UIMA Ruta Language and Analysis Engine:
Depends on UIMA 2.9.0 and uimaFIT 2.2.0
New block FOREACH: iterate over a pattern and reuse the matched annotation in contained rules
Label expressions across inlined rules are supported and now behave like local variables
Feature expressions on label expressions are supported
Setting values of variables by configuration parameter
Macros support variables
New type function: typeFromString
Wordlist initialization with string expressions: simplified specification of language-dependent dictionary usage
New maven archetype for typical UIMA Ruta projects
Improved and faster maven plugins
Parameters additionalScripts and additionalEngines are now optional
Removed engine loader concept and parameter completely
Removed redundant Document type
Improved datapath support
Improved resource manager support
Many bug fixes and improvements
UIMA Ruta Workbench:
Improved support of classpath-lookup in maven-based projects
Launch configuration supports various serialization formats (with preference)
Bug fixes
For more details, click here.
Download: