发布于 2016-02-16 00:20:12 | 246 次阅读 | 评论: 0 | 来源: 网友投递
Apache UIMA 非结构化信息管理应用
UIMA (Unstructured Information Management applications) 是一个软件系统,用来分析大量的非结构化信息从而发掘中对最终用户有用的知识点,一个最典型的 UIM 应用就是从文本文件中提取有用信息,例如人员、地址和组织等相关信息。
Apache UIMA Ruta 2.4.0 发布,Apache UIMA Ruta 是一个基于角色的脚本语言。
该版本主要改进内容包括:
UIMA Ruta Language and Analysis Engine: - - Explicit referencing of annotations with variables, labels and addresses - - Helper methods for applying rules directly in Java code - - New action for splitting annotations - - New block for resetting match context - - Import of uimaFIT analysis engines with manditory parameters - - Macros for conditions and actions (prototypical) - - Limited support of UIMA arrays (prototypical) - - Many, many bug fixes and improvements UIMA Ruta Workbench: - - More support of external files - - Bug fixes