-
Notifications
You must be signed in to change notification settings - Fork 0
/
CHANGES
173 lines (113 loc) · 6.91 KB
/
CHANGES
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
jsoup changelog
*** Release 1.3.3 [2010-Sep-19]
* Implement Elements.empty() and Elements.remove(). This allows easy element removal, like:
doc.select("iframe").remove();
* Fixed issue in Entities when unescaping $ ("$")
<http://github.com/jhy/jsoup/issues/issue/34>
* Added restricted XHTML output entity option
<http://github.com/jhy/jsoup/issues/issue/35>
*** Release 1.3.2 [2010-Aug-30]
* Treat HTTP headers as case insensitive in Jsoup.Connection. Improves compatibility for HTTP responses.
* Improved malformed table parsing by implementing ignorable end tags.
*** Release 1.3.1 [2010-Aug-23]
* Removed dependency on Apache Commons-lang. Jsoup now has no external dependencies.
* Added new Connection implementation, to enable easier and richer HTTP requests that parse to Documents. This includes
support for gzip responses, cookies, headers, data parameters, user-agent, referrer, etc.
* Added Element.ownText() method, to get only the direct text of an element, not including the text of its children.
* Added support for selectors :containsOwn(text) and :matchesOwn(regex), to supplement Element.ownText().
* Added support for non-pretty-printed HTML output, to more closely mirror the input HTML.
* Further speed optimisations for parsing and output generation.
* Fixed support for case-sensitive HTML escape entities.
<http://github.com/jhy/jsoup/issues/issue/31>
* Fixed issue when parsing tags with keyless attributes.
<http://github.com/jhy/jsoup/issues/issue/32>
*** Release 1.2.3 [2010-Aug-04]
* Added support for automatic input character set detection and decoding. Jsoup now automatically detects the encoding
character set when parsing HTML from a File or URL. The parser checks the content-type header, then the
<meta http-equiv> or <meta charset> tag, and finally falls back to UTF-8.
* Added ability to configure the document's output charset, to control which characters are HTML escaped, and which
are kept intact. The output charset defaults to the document's input charset. This simplifies non-ascii output.
* Added full support for all new HTML5 tags.
* Added support for HTML5 dataset custom data attributes, with the Element.dataset() map.
* Added support for the [^attributePrefix] selector query, to find elements with attributes starting with a prefix.
Useful for finding elements with datasets: [^data-] matches <p data-name="jsoup">
* Added support for namespaced elements (<fb:name>) and selectors to find them (fb|name)
* Implemented Node.ownerDocument DOM method
* Improved implicit table element handling (particularly around thead, tbody, and tfoot).
* Improved HTML output format for empty elements and auto-detected self closing tags
* Changed DT & DD tags to block-mode tags, to follow practice over spec
* Added support for tag names with - and _ (<abc_foo>, <abc-foo>)
* Handle tags with internal trailing space (<foo >)
* Fixed support for character class regular expressions in [attr=~regex] selector
*** Release 1.2.2 [2010-Jul-11]
* Performance optimisation:
- core HTML parser engine now 3.5 times faster
- HTML generator now 2.5 times faster
- much lower memory use and garbage collection time
* Added support for :matches(regex) selector, to find elements containing text matching regular expression
* Added support for [key~=regex] attribute selector, to find elements with attribute values matching regular expression
* Upgraded the selector query parser to allow nested selectors like 'div:has(p:matches(regex))'
*** Release 1.2.1 [2010-Jun-21]
* Added .before(html) and .after(html) methods to Element and Elements, to insert sibling HTML
* Added :contains(text) selector, to search for elements containing the specified text
* Added :has(selector) pseudo-selector
<http://github.com/jhy/jsoup/issues/issue/20>
* Added Element#parents and Elements#parents to retrieve an element's ancestor chain
<http://github.com/jhy/jsoup/issues/issue/20>
* Fixes an issue where appending / prepending rows to a table (or to similar implicit
element structures) would create a redundant wrapping elements
<http://github.com/jhy/jsoup/issues/issue/21>
* Improved implicit close tag heuristic detection when parsing malformed HTML
* Fixes an issue where text content after a script (or other data-node) was
incorrectly added to the data node.
<http://github.com/jhy/jsoup/issues/issue/22>
* Fixes an issue where text order was incorrect when parsing pre-document
HTML.
<http://github.com/jhy/jsoup/issues/issue/23>
*** Release 1.1.1 [2010-Jun-08]
* Added selector support for :eq, :lt, and :gt
<http://github.com/jhy/jsoup/issues/issue/16>
* Added TextNode#text and TextNode#text(String)
<http://github.com/jhy/jsoup/issues/issue/18>
* Throw exception if trying to parse non-text content
<http://github.com/jhy/jsoup/issues/issue/17>
* Added Node#remove and Node#replaceWith
<http://github.com/jhy/jsoup/issues/issue/19>
* Allow _ and - in CSS ID selectors (per CSS spec).
<http://github.com/jhy/jsoup/issues/issue/10>
* Relative links are resolved to absolute when cleaning, to normalize
output and to verify safe protocol. (Were previously discarded.)
<http://github.com/jhy/jsoup/issues/issue/12>
* Allow combinators at start of selector query, for query refinements
<http://github.com/jhy/jsoup/issues/issue/13>
* Added Element#val() and #val(String) methods, for form values
<http://github.com/jhy/jsoup/issues/issue/14>
* Changed textarea contents to parse as TextNodes, not DataNodes,
so contents visible to text() (and val(), as treated as form input)
* Fixed support for Java 1.5
*** Release 0.3.1 (2010-Feb-20)
* New features: supports Elements#html(), html(String),
prepend(String), append(String); bulk methods for corresponding
methods in Element.
* New feature: Jsoup.isValid(html, whitelist) method for user input
form validation.
* Improved Elements.attr(String) to find first matching element
with attribute.
* Fixed assertion error when cleaning HTML with empty attribute
<http://github.com/jhy/jsoup/issues/issue/7>
*** Release 0.2.2 (2010-Feb-07)
* jsoup packages are now available in the Maven central repository.
* New feature: supports Element#addClass, removeClass, toggleClass;
also collection class methods on Elements.
* New feature: supports Element#wrap(html) and Elements#wrap(html).
* New selector syntax: supports E + F adjacent sibling selector
* New selector systax: supports E ~ F preceding sibling selector
* New: supports Element#elementSiblingIndex()
* Improved document normalisation.
* Improved HTML string output format (pretty-print)
* Fixed absolute URL resolution issue when a base tag has no href.
*** Release 0.1.2 (2010-Feb-02)
* Fixed unrecognised tag handler to be more permissive
<http://github.com/jhy/jsoup/issues/issue/1>
*** Release 0.1.1 (2010-Jan-31)
* Initial beta release of jsoup