Published 2015-03-29 01:02:24 | Source: reader submission
Apache PDFBox Java library
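PDFBox is an open-source, pure-Java library for developers who need to read and create PDF documents. It is distributed under the Apache License 2.0 (it was BSD-licensed before joining the Apache Software Foundation).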
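Apache PDFBox 1.8.9 has been released and is available for download: http://pdfbox.apache.org/downloads.cgi
Release Notes -- Apache PDFBox -- Version 1.8.9
This is primarily a bug-fix release. It builds on the previous version, 1.8.8, and includes a large number of bug fixes and improvements.
Issue tracker: https://issues.apache.org/jira/browse/PDFBOX
Bug fixes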
[PDFBOX-159] - Field renaming character set problem
[PDFBOX-785] - Spliting a PDF creates unnecessarily large files
[PDFBOX-1002] - Form field not rendered after being processed by pdfbox-1.1.0,
wrong position of same field in pdfbox-1.5.0
[PDFBOX-1618] - Split PDF file to single page files, some files are inflated in size
[PDFBOX-1651] - PDFBox doesn't read the permission bits correct.
PDDocument.getCurrentAccessPermission().canPrint() is allways returning true
irrespective of the document print permissions
[PDFBOX-1750] - PDTextbox and PDAnnotationWidget are not correct initialized
from it's own constructor .
[PDFBOX-1859] - ClassCastException for unknown destination type
[PDFBOX-1874] - PDFTextStripper.isParagraphSeparation(...)
[PDFBOX-1965] - NPE in NonSequentialPDFParser when parseMinimal property is set
to true
[PDFBOX-2525] - Overlay: data black & white after import
[PDFBOX-2549] - TIFF-Predictor with 16 bits per component not supported
[PDFBOX-2550] - ClassCastException in PDAnnotation.getColour
[PDFBOX-2552] - Blank rendering when negative page rotation
[PDFBOX-2559] - TTF font cannot be loaded
[PDFBOX-2579] - Exception in thread "main" java.io.IOException: Error: Expected
a long type at offset 1029, instead got '12688(Deleted'
[PDFBOX-2582] - Form fields missing entirely or incorrect in PDField list
[PDFBOX-2583] - Error when rendering a PDF with annotations
[PDFBOX-2595] - Pdfbox always sets the second part of documentID to the same value
[PDFBOX-2615] - IllegalArgumentException in PDPageTree constructor: root cannot
be null
[PDFBOX-2617] - Group of Button fields treated as a Radio Button group
[PDFBOX-2620] - Support named actions
[PDFBOX-2622] - PDAnnotationLink::getBorderStyle() don't understand external
border style
[PDFBOX-2644] - Load FDF document creates Temp file when called with file parameter
[PDFBOX-2653] - Image extraction fails with attached PDF
[PDFBOX-2654] - NullPointerException when reading a GIF file with a transparent
color
[PDFBOX-2676] - PDPageLabelRange.setLabelItem() should not allow negative startPage
[PDFBOX-2678] - possible NPE in ExtractText tool of trunk
[PDFBOX-2687] - ClassCastException when trying to get OutputIntents or add to it
[PDFBOX-2698] - PDFToImage IndexOutOfBoundsException
[PDFBOX-2702] - Merging PDFs created using "Nuance PDF Create" not possible
[PDFBOX-2710] - Form fields missing in 1.8.9 version
[PDFBOX-2726] - org.apache.pdfbox.cos.COSArray cannot be cast to
org.apache.pdfbox.cos.COSDictionary
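For context on PDFBOX-1651 in the list above: the PDF specification stores a document's user access permissions as an integer of bit flags, and the print permission is bit 3 (counting from 1 at the low-order end). The following is a minimal sketch of how such a flag can be tested. It is not PDFBox's implementation; the class and method names are hypothetical and chosen only for illustration.

```java
public class PermissionBitsSketch {
    // 1-based bit positions, following the numbering used in the
    // PDF specification's user access permissions table.
    public static final int PRINT_BIT = 3;
    public static final int MODIFY_BIT = 4;
    public static final int EXTRACT_BIT = 5;

    /** Returns true if the given 1-based bit is set in the permission flags. */
    public static boolean isBitSet(int permissions, int bit) {
        return (permissions & (1 << (bit - 1))) != 0;
    }

    /** Hypothetical helper: true if the flags allow printing. */
    public static boolean canPrint(int permissions) {
        return isBitSet(permissions, PRINT_BIT);
    }

    public static void main(String[] args) {
        int allAllowed = -1;            // every bit set
        int noPrint = -1 & ~(1 << 2);   // every bit set except bit 3

        System.out.println("all allowed, can print: " + canPrint(allAllowed));
        System.out.println("print cleared, can print: " + canPrint(noPrint));
    }
}
```

In application code the check would normally go through PDDocument.getCurrentAccessPermission().canPrint(), as named in the issue title, rather than through hand-written bit tests.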
Improvements
[PDFBOX-1523] - Manifest should support Specification entries
[PDFBOX-2594] - Set default params in JBIG2Filter
[PDFBOX-2619] - XMP dates contain time zone, while document info dates do not,
and this isn't detected by preflight
[PDFBOX-2673] - Add output path prefix param in PDFSplit/Splitter
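For more details, see the release notes.
PDFBox is a Java library for working with PDF documents. It supports creating and manipulating PDF documents and extracting their content, and it also ships with several command-line utilities.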
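Main features include:
Extracting text from PDFs
Merging PDF documents
Encrypting and decrypting PDF documents
Integration with the Lucene search engine
Filling in PDF/XFDF form data
Creating PDF documents from text files
Creating images from PDF pages
Printing PDF documents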