Published 2015-07-25 00:52:56 | Source: reader submission
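Apache PDFBox Java library
PDFBox is an open source, pure-Java library for reading and creating PDF documents. It is an Apache project released under the Apache License 2.0.
Apache PDFBox 1.8.10 has been released and is now available for download: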
http://pdfbox.apache.org/download.cgi
This is a bug-fix release on top of the previous version, consisting mainly of bug fixes and minor improvements:
Bug fixes
[PDFBOX-1453] - NPE saving modified encrypted document
[PDFBOX-1642] - NPE when parsing XMP schema definition with "closed Choice" value type
[PDFBOX-2501] - Page render without barcode
[PDFBOX-2730] - PDFSplit slow and keeps unused pages
[PDFBOX-2733] - Nullpointer exception in PDFXrefStreamParser.parse
[PDFBOX-2734] - Can't create PDF with DeviceN colorspace
[PDFBOX-2737] - PDF not converted to Image
[PDFBOX-2741] - IndexOutOfBoundsException when calling PDSeparation.setAlternateColorSpace
[PDFBOX-2742] - PDFSplit ignores global resources
[PDFBOX-2743] - Can't create PDF with PDIndexed colorspace
[PDFBOX-2745] - PDPageXYZDestination zoom property can't be set lower than 100%
[PDFBOX-2767] - ClassCastException in PDDocument.addSignature
[PDFBOX-2769] - NPE when saving encrypted file
[PDFBOX-2772] - EI token lost for rewrite
[PDFBOX-2773] - ClassCastException in PDDocumentCatalog.java:339
[PDFBOX-2778] - PDF to Image conversion fails with "Invalid code word encountered"
[PDFBOX-2781] - Opening pdf document after encrypting it with PDFBox throws IllegalBlockSizeException
[PDFBOX-2786] - PDPageDestination page index off by one
[PDFBOX-2788] - Seemingly good document gets semi-corrupted
[PDFBOX-2792] - Text extraction ignores bookmarks
[PDFBOX-2793] - /Dests dictionary isn't supported
[PDFBOX-2797] - PDJavascriptNameTreeNode does not support dictionaries
[PDFBOX-2799] - PDOptionalContentProperties.setGroupEnabled not working
[PDFBOX-2801] - SecurityHandler does not tolerate plain-text COSString
[PDFBOX-2811] - Infinite loop within RandomAccessBuffer
[PDFBOX-2812] - NPE in PDColorSpaceFactory.createColorSpace with PDICCBased
[PDFBOX-2819] - invalid ICC Profile when reading from a byte array
[PDFBOX-2823] - StringIndexOutOfBoundsException when doing DateConverter.parseDate()
[PDFBOX-2837] - PDFBox creates files with EBCDIC code on z/OS
[PDFBOX-2847] - mergeDocumentsNonSeq does not utilize scratchFile
[PDFBOX-2853] - CCITT: Background is rendered as transparent color
[PDFBOX-2867] - Correct use of Float.NaN
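
The title of PDFBOX-2867 does not say what the incorrect use of Float.NaN was. One well-known Java pitfall that fits the title is testing for NaN with `==`, which never matches because NaN compares unequal to every value, including itself. The sketch below shows that pitfall with plain JDK code; it is an assumption about the kind of defect involved, not a copy of the PDFBox change, and the class and method names are made up for illustration.

```java
// Illustration only: NanCheck and its methods are hypothetical names,
// not part of PDFBox.
final class NanCheck {

    // Broken check: "value == Float.NaN" is false for every input,
    // because NaN is never equal to anything, not even to itself.
    static boolean isNanByEquality(float value) {
        return value == Float.NaN;
    }

    // Correct check: Float.isNaN is the standard JDK test for NaN.
    static boolean isNanByMethod(float value) {
        return Float.isNaN(value);
    }

    public static void main(String[] args) {
        float undefined = Float.NaN;
        System.out.println(isNanByEquality(undefined)); // prints "false"
        System.out.println(isNanByMethod(undefined));   // prints "true"
    }
}
```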
Improvements
[PDFBOX-2744] - Add validation check for setNonStrokingColor and setStrokingColor
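
The issue title only says that a validation check was added; it does not describe the check. As a rough idea of what argument validation for a colour setter can look like, here is a minimal sketch that assumes 8-bit RGB components in the range 0..255. `ColorArgs` and `requireRgb` are hypothetical names, and this is not the code PDFBox ships.

```java
// Illustration only: a hypothetical helper, not PDFBox's implementation.
final class ColorArgs {

    // Assumption: each component is an 8-bit value, so 0..255 is valid.
    private static boolean outOfRange(int component) {
        return component < 0 || component > 255;
    }

    // Fails fast with a descriptive message instead of silently writing
    // an invalid colour into the output.
    static void requireRgb(int r, int g, int b) {
        if (outOfRange(r) || outOfRange(g) || outOfRange(b)) {
            throw new IllegalArgumentException(
                "RGB components must be within 0..255, got ("
                    + r + "," + g + "," + b + ")");
        }
    }

    public static void main(String[] args) {
        requireRgb(0, 128, 255); // accepted, no exception
        try {
            requireRgb(0, 128, 256);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Rejecting bad values at the call site makes the mistake visible where it happens, rather than later when a viewer opens the file.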
For the full list of changes, see the release notes.
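PDFBox is a Java library for working with PDF documents. It supports creating and manipulating PDF documents and extracting their content, and it also includes several command-line utilities.
Key features include:
Extracting text from PDFs
Merging PDF documents
Encrypting and decrypting PDF documents
Integration with the Lucene search engine
Filling in PDF/XFDF form data
Creating PDF documents from text files
Creating images from PDF pages
Printing PDF documents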