On Thu, Nov 20, 2003 at 09:40:23PM +0000, James Bursa wrote:
> Below is a minor patch for HTMLparser.c:
>
> 1. Handle hex character entities like &#X123;, i.e. with a capital X.
Hmm, I didn't know that form was possible, but indeed:
http://www.w3.org/TR/REC-html40/charset.html#h-5.3.1
I learnt something :-)
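
For the archives, a minimal sketch of what accepting both markers looks like.
This is not the code from HTMLparser.c: parse_char_ref() is a hypothetical
standalone helper, and it assumes a NUL-terminated input and a mandatory
trailing ';' (a lenient HTML parser would likely tolerate a missing one).

```c
#include <stddef.h>

/* Hypothetical helper, not libxml2's actual code: parse a numeric character
 * reference such as "&#291;", "&#x123;" or "&#X123;".
 * Returns the code point, or -1 if the input is malformed.
 * If consumed is non-NULL, it receives the number of bytes read. */
long parse_char_ref(const char *s, size_t *consumed)
{
    const char *p = s;
    long val = 0;
    int base = 10;
    int digits = 0;

    if (p[0] != '&' || p[1] != '#')
        return -1;
    p += 2;

    /* HTML 4 section 5.3.1: the hexadecimal marker is case-insensitive. */
    if (*p == 'x' || *p == 'X') {
        base = 16;
        p++;
    }

    for (;; p++) {
        int d;

        if (*p >= '0' && *p <= '9')
            d = *p - '0';
        else if (base == 16 && *p >= 'a' && *p <= 'f')
            d = *p - 'a' + 10;
        else if (base == 16 && *p >= 'A' && *p <= 'F')
            d = *p - 'A' + 10;
        else
            break;

        val = val * base + d;
        if (val > 0x10FFFF)     /* beyond the Unicode range */
            return -1;
        digits++;
    }

    /* Assumption of this sketch: at least one digit and a closing ';'. */
    if (digits == 0 || *p != ';')
        return -1;
    if (consumed != NULL)
        *consumed = (size_t)(p - s) + 1;
    return val;
}
```

With this, "&#x123;" and "&#X123;" both yield 0x123, which is the same
character as the decimal form "&#291;".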
> 2. Skip to the end of misplaced <body> start tags. Currently any attributes
> of a misplaced <body> are parsed as text and included as a <p> element in
> the tree.
eek :-), sounds good too.
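
Roughly, the idea is to discard everything up to the closing '>' instead of
letting it fall through as character data. A sketch only, not the applied
patch: skip_to_tag_end() is a hypothetical helper working on a NUL-terminated
buffer, and the quote handling is an extra assumption so that a '>' inside an
attribute value does not end the tag early.

```c
/* Hypothetical helper, not libxml2's actual code: given a pointer just after
 * the name of a misplaced start tag, skip its attributes.
 * Returns a pointer just past the closing '>', or to the terminating NUL
 * if the tag is never closed. */
const char *skip_to_tag_end(const char *p)
{
    char quote = 0;   /* active quote character, or 0 when outside a value */

    for (; *p != '\0'; p++) {
        if (quote != 0) {
            if (*p == quote)
                quote = 0;            /* end of quoted attribute value */
        } else if (*p == '"' || *p == '\'') {
            quote = *p;               /* start of quoted attribute value */
        } else if (*p == '>') {
            return p + 1;             /* resume parsing after the tag */
        }
    }
    return p;
}
```

Called on the text following "<body", parsing would resume after the tag and
the attributes would no longer end up in a <p> element.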
Applied and committed, thanks a lot!
Daniel
--
Daniel Veillard | Red Hat Network https://rhn.redhat.com/
veillard@xxxxxxxxxx | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/