logo       

RE: Socket timeout when parsing a big XML file (DOM/SAX): msg#00059

Subject: RE: Socket timeout when parsing a big XML file (DOM/SAX)
If there is a doctype in your xml and it references an external dtd or
xsd, the parse will attempt to access this.  If it fails, you will get
an error message.  This is probably NOT due to the file size, but I
could be wrong! ;)

-----Original Message-----
From: peter_billen@xxxxxxxxxxxxxxxxx
[mailto:peter_billen@xxxxxxxxxxxxxxxxx] 
Sent: Thursday, December 16, 2004 9:42 AM
To: xerces-j-user@xxxxxxxxxxxxxx
Subject: Socket timeout when parsing a big XML file (DOM/SAX)

Hello,

I wanted to parse a XML file of 30 megs, but I get the following error:

Exception in thread "main" java.net.ConnectException: Connection timed
out:
connect
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.PlainSocketImpl.doConnect(Unknown Source)
  ...
  at sun.net.www.http.HttpClient.openServer(Unknown Source)
        ...
        at
org.apache.xerces.impl.XMLEntityManager.setupCurrentEntity(Unknown
Source)
        at org.apache.xerces.impl.XMLEntityManager.startEntity(Unknown
Source)
        ...

It both happens with SAX and DOM:

SAXParser parser = new SAXParser();
parser.setContentHandler(this);
parser.parse(file);

documentBuilder =
DocumentBuilderFactory.newInstance().newDocumentBuilder();
Document document = documentBuilder.parse(file);

I also tried to increase the allowed memory usage of the virtual
machine, but
that didn't help. It has something to do with the filesize of the input
XML
file, since everything works perfect with small files.

Someone who has an idea what is wrong and how it can be fixed?

And why are sockets and the HTTP protocol used anyway, to load a XML
file on my
hard disk?

thanks a lot for your time,

-- 
Peter

---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-user-unsubscribe@xxxxxxxxxxxxxx
For additional commands, e-mail: xerces-j-user-help@xxxxxxxxxxxxxx


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Recently Viewed:
boot-loaders.gr...    php.pear.genera...    debugging.valgr...    kde.redhat.user...    text.xml.xsl.ge...    culture.languag...    hardware.microc...    java.servicemix...    redhat.release....    web.zope.plone....    user-groups.lin...    opendarwin.webk...    video.mjpeg.use...    sysutils.bcfg2....    encryption.gpg....    lx-office.devel...    xfree86.forum/2...    mail.mutt.devel...    acpi.devel/2003...    qnx.openqnx.dev...    network.irc.irs...    freebsd.devel.m...   
Home | blog view | USPTO Patent Archive | advertise | OSDir is an inevitable website. super tiny logo

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe