If there is a doctype in your xml and it references an external dtd or
xsd, the parse will attempt to access this. If it fails, you will get
an error message. This is probably NOT due to the file size, but I
could be wrong! ;)
-----Original Message-----
From: peter_billen@xxxxxxxxxxxxxxxxx
[mailto:peter_billen@xxxxxxxxxxxxxxxxx]
Sent: Thursday, December 16, 2004 9:42 AM
To: xerces-j-user@xxxxxxxxxxxxxx
Subject: Socket timeout when parsing a big XML file (DOM/SAX)
Hello,
I wanted to parse a XML file of 30 megs, but I get the following error:
Exception in thread "main" java.net.ConnectException: Connection timed
out:
connect
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(Unknown Source)
...
at sun.net.www.http.HttpClient.openServer(Unknown Source)
...
at
org.apache.xerces.impl.XMLEntityManager.setupCurrentEntity(Unknown
Source)
at org.apache.xerces.impl.XMLEntityManager.startEntity(Unknown
Source)
...
It both happens with SAX and DOM:
SAXParser parser = new SAXParser();
parser.setContentHandler(this);
parser.parse(file);
documentBuilder =
DocumentBuilderFactory.newInstance().newDocumentBuilder();
Document document = documentBuilder.parse(file);
I also tried to increase the allowed memory usage of the virtual
machine, but
that didn't help. It has something to do with the filesize of the input
XML
file, since everything works perfect with small files.
Someone who has an idea what is wrong and how it can be fixed?
And why are sockets and the HTTP protocol used anyway, to load a XML
file on my
hard disk?
thanks a lot for your time,
--
Peter
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-user-unsubscribe@xxxxxxxxxxxxxx
For additional commands, e-mail: xerces-j-user-help@xxxxxxxxxxxxxx
|