发布于 2016-10-24 02:17:33 | 156 次阅读 | 评论: 0 | 来源: 网友投递

这里有新鲜出炉的精品教程,程序狗速度看过来!

jsoup HTML解析器

jsoup 是一款 Java 的HTML 解析器,可直接解析某个URL地址、HTML文本内容。它提供了一套非常省力的API,可通过DOM,CSS以及类似于JQuery的操作方法来取出和操作数据。


Jsoup 1.10.1 发布了,更新内容如下:

改进

  • Improved support for extended HTML entities, including supplemental characters and multiple character references. Also reduced memory consumption of the entity tables.

  • Added support for *|E wildcard namespace selectors.

  • Added support for setting multiple connection headers in Jsoup.connect at once with Connection.headers(Map)

  • Added support for setting/overriding the response character set in Connection.Response, for cases where the charset is not defined by the server, or is defined incorrectly.

  • Improved the performance of class selectors by reducing memory allocation and garbage collection.

  • Improved performance of HTML output by reducing the creation of temporary attribute list iterators.

修复

  • Fixed an issue when converting to the W3CDom XML, where valid (but ugly) HTML attribute names containing characters like " could not be converted into valid XML attribute names. These attribute names are now normalized if possible, or not added to the XML DOM.

  • Fixed an OOB exception when loading an empty-body URL and parsing with the XML parser.

  • Fixed an issue where attribute names starting with a slash would be parsed incorrectly.

  • Don't reuse charset encoders from OutputSettings, to make threadsafe.

  • Fixed an issue in connections with a requestBody where a custom content-type header could be ignored.

点此查看完整更新内容发行说明

下载地址:



历史版本 :
jsoup 1.11.2 发布,Java 的 HTML 解析器
jsoup 1.10.3 发布,Java 的 HTML 解析器
jsoup 1.10.2 发布,Java 的 HTML 解析器
Jsoup 1.10.1 发布,Java 的 HTML 解析器
jsoup 1.9.2 发布,Java 的 HTML 解析器
jsoup 1.9.1 发布,HTML 解析器
最新网友评论  共有(0)条评论 发布评论 返回顶部

Copyright © 2007-2017 PHPERZ.COM All Rights Reserved   冀ICP备14009818号  版权声明  广告服务