发布于 2014-06-23 07:57:26 | 394 次阅读 | 评论: 0 | 来源: 网友投递
Apache PDFBox Java类库
PDFBox(一个BSD许可下的源码开放项目)是一个为开发人员读取和创建PDF文档而准备的纯Java类库。
Apache PDFBox 1.8.6 发布,此版本现已提供下载。此版本主要是 bug 修复和系统改进,bug 修复列表请看这里。
改进记录: Bug 修复 [PDFBOX-54] - please correct the SetField example [PDFBOX-62] - Incorrect (zero) character widths returned in some docs [PDFBOX-239] - PDFToImage prints every word at the start of the line [PDFBOX-934] - ImageToPDF.createPDFFromImage causes problems for certain TIFF inputs [PDFBOX-1474] - PDDocument.decrypt does not throws InvalidPasswordException [PDFBOX-1689] - Partial failure to render PDF [PDFBOX-1713] - [PATCH] Bullet character not rendered [PDFBOX-1756] - ClassCastException CosString cannot be cast to COSName [PDFBOX-1845] - PDDocument.load() give Error: Expected a long type at offset 1633 [PDFBOX-1895] - Type0 settings /Registry and /Ordering are not decrypted when writing document [PDFBOX-1922] - NonSequentialParser not reading version in header and trailer [PDFBOX-2047] - read operations alter PDLab object [PDFBOX-2050] - Add predictor to LZW filter [PDFBOX-2054] - Remove System.out.println() [PDFBOX-2056] - incomplete build tests [PDFBOX-2057] - Importing BufferedImage into PDPixelMap is broken in 1.8.5 [PDFBOX-2058] - The text of pdfs using Type1C can't be extracted correct [PDFBOX-2063] - Incomplete EOF detection in ASCIIHexFilter [PDFBOX-2064] - ArrayIndexOutOfBoundsException in CompositeImage.createMaskedImage [PDFBOX-2072] - Wrong calculation of space char width in PDFStreamEngine [PDFBOX-2073] - PDF files with unusual Japanese font can not be rewrite correctly [PDFBOX-2074] - 4-bytes CMap entry causes exception [PDFBOX-2079] - Extra new line characters extracted in 1.8.5 for embedded files leading to ZipFile exception in Java 1.6 [PDFBOX-2080] - Barcode getting color inverted in pdf to image conversion [PDFBOX-2082] - signing corrupts PDF when signature exactly fits allocated space [PDFBOX-2095] - Useless memory allocation in GlyfDescript [PDFBOX-2096] - ICC profile ignored if number of components is 1 [PDFBOX-2100] - Gouraud shading doesn't work with function [PDFBOX-2101] - Surprising memory consumption when extracting images [PDFBOX-2102] - Characters swallowed on COSString.getString() [PDFBOX-2109] - CFFParser uses String constructor without encoding [PDFBOX-2110] - Font not found: CourierNew [PDFBOX-2111] - Cast error in Gouraud shadings [PDFBOX-2114] - ObjStm is being processed to late [PDFBOX-2115] - Use unfiltered stream in gouraud shadings [PDFBOX-2120] - Regression: Type 1 font corrupted [PDFBOX-2122] - FontBox's TTFDataStream doesn't set timezone in readInternationalDate 改进: [PDFBOX-712] - SecurityHandlersManager May stop the application Server when running PDFParser in a Servlet. [PDFBOX-1596] - OverlayPDF logic should be moved into a library class [PDFBOX-1739] - Load document error for two RegisSTAR documents [PDFBOX-2034] - TestFilters is non-deterministic [PDFBOX-2052] - PDFCloneUtility does not handle COSStreamArray [PDFBOX-2066] - RubberStampWithImage should support more image types [PDFBOX-2084] - Make TestImageIOUtils optional in 1.8 for Fedora packaging [PDFBOX-2105] - Support for multipage TIFFs in CCITTFactory, makes PDFBox capable of doing tiff2pdf [PDFBOX-2129] - Add PDFBox version to the title [PDFBOX-1600] - COSDocument and PDDocument declare throws IOException when they don't [PDFBOX-1584] - Add unit test for RandomAccessFileOutputStream
PDFBox(一个BSD许可下的源码开放项目)是一个为开发人员读取和创建PDF文档而准备的纯Java类库。它提供如下特性:
主要特性包括:
从PDF提取文本
合并PDF文档
PDF 文档加密与解密
与Lucene搜索引擎的集成
填充PDF/XFDF表单数据
从文本文件创建PDF文档
从PDF页面创 建图片
打印PDF文档