
Re: Parsing HTML: msg#00080

java.enhydra.xmlc

Subject: Re: Parsing HTML


You might be able to try something like the code at this link:
http://www-106.ibm.com/developerworks/xml/library/x-tipboot2.html

Set the DOMImplementation class to the HTML DOM implementation class. Loading plain HTML should then work for you.
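For anyone reading this later, here is a rough sketch of what bootstrapping a DOMImplementation can look like. It is not the code from the linked tip. It assumes the DOM Level 3 DOMImplementationRegistry that ships with current JDKs (a later addition than this thread), and the class and method names (DomBootstrapSketch, lookup, newHtmlRootedDocument) are made up for illustration. It requests the generic "XML 1.0" feature, because the thread does not name an HTML-specific implementation class. An HTML-capable implementation would have to come from whatever parser is on your classpath.

```java
import org.w3c.dom.DOMImplementation;
import org.w3c.dom.Document;
import org.w3c.dom.bootstrap.DOMImplementationRegistry;

// Hypothetical helper class, not part of XMLC or of the linked article.
class DomBootstrapSketch {

    // Ask the bootstrap registry for an implementation that supports the
    // requested features, e.g. "XML 1.0". Returns null if none is registered.
    static DOMImplementation lookup(String features) throws Exception {
        DOMImplementationRegistry registry = DOMImplementationRegistry.newInstance();
        return registry.getDOMImplementation(features);
    }

    // Create an empty document whose root element is <html>, using whatever
    // implementation the registry handed back for "XML 1.0".
    static Document newHtmlRootedDocument() throws Exception {
        DOMImplementation impl = lookup("XML 1.0");
        if (impl == null) {
            throw new IllegalStateException("no DOMImplementation for XML 1.0");
        }
        return impl.createDocument(null, "html", null);
    }

    public static void main(String[] args) throws Exception {
        Document doc = newHtmlRootedDocument();
        System.out.println(doc.getDocumentElement().getNodeName());
        // Whether an HTML DOM is available depends on the parser installed,
        // so the result of this check is not assumed here.
        System.out.println(lookup("XML 1.0").hasFeature("HTML", "2.0"));
    }
}
```

Which implementations the registry searches can be changed through the system property named by DOMImplementationRegistry.PROPERTY, which is probably the closest modern equivalent of "setting the DOMImplementation class".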

Jake

At 02:02 PM 3/24/2003 -0600, you wrote:
I often work with XML, so I know how to load an XML file. Unfortunately, other people will be creating these headers and footers using normal HTML editors, and I don't want to force them to use XHTML if they don't want to or know how.

Maybe I could convert the HTML to XHTML on the fly when I need to.

At 01:43 PM 3/24/2003, you wrote:

Here is an example of creating a Document from a file. Of course, this example assumes the file is valid XML.

DocumentBuilderFactory dbfactory = DocumentBuilderFactory.newInstance(); ...

I'm not sure how to modify this example to load a valid HTML4 file. If you use the example as is, your file will have to be valid XHTML, meaning that every tag that is opened must also be closed. If you come up with a way to do this with a non-XML-compliant HTML file, I'd appreciate it if you posted the example back to the list.
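To make the limitation concrete, here is a minimal sketch of the strict loading approach. The original example is elided above, so this is not a reconstruction of it. It uses only the standard JAXP classes, parses from a string instead of a file to stay self-contained, and the names StrictLoadSketch, parse and isWellFormed are hypothetical.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper class illustrating the XML-only loading path.
class StrictLoadSketch {

    // Parse markup with the standard JAXP DOM parser.
    // Throws an exception if the markup is not well-formed XML.
    static Document parse(String markup) throws Exception {
        DocumentBuilderFactory dbfactory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = dbfactory.newDocumentBuilder();
        return builder.parse(new InputSource(new StringReader(markup)));
    }

    // Report whether the markup can be loaded by the XML parser at all.
    static boolean isWellFormed(String markup) {
        try {
            parse(markup);
            return true;
        } catch (Exception e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // XHTML style: every element is closed.
        String xhtml = "<html><body><p>footer</p><br/></body></html>";
        // HTML4 style: <p> and <br> are left open, which is legal HTML but not XML.
        String html4 = "<html><body><p>footer<br></body></html>";
        System.out.println(isWellFormed(xhtml)); // prints "true"
        System.out.println(isWellFormed(html4)); // prints "false"
    }
}
```

The second input is the kind of markup a normal HTML editor produces, which is why the XML parser alone is not enough for the headers and footers discussed here.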

--
Ben Sinclair
ben@xxxxxxxxxxxxxxx
_______________________________________________
XMLC mailing list
XMLC@xxxxxxxxxxx
http://www.enhydra.org/mailman/listinfo.cgi/xmlc