logo       

Re: Xerces vs Crimson performance: msg#00049

Subject: Re: Xerces vs Crimson performance
This is late reply, but fwiw, have you tried turning on or off the feature
       
    http://apache.org/xml/features/dom/defer-node-expansion

The intention of the feature is to make parsing faster, but with certain
situations / document sizes it actually slows it down considerably. Or
at least it used to--I haven't played with it for a while so my data may
be old, but it's easy enough to try. Note that the default value of this
seems to depend on what implemention you use, so I would make sure to
try it explicitly set both ways.

Eric

Ritu Raj Tiwari wrote:

>Folks,
>I am migrating an application that makes use of
>validating XML parsing. It made use of the crimson
>parser and we are moving to xerces.
>
>The XML documents I encounter have a huge DTD they
>need to be validated against. On many occasions the
>documnents are actually much smaller than the DTD!
>Crimson had no way to cache grammars so the xerces
>grammar pool looked really exciting for performance
>gains.
>
>However, after enabling grammar caching, and running
>comparison with my codebase on JDK 1.4.2 + Crimson vs
>JDK 5 + Xerces,  I see negligible, if any, performance
>gain. I am looking at the total CPU time of the Java
>process as it runs through a suite of about 400 XML
>files. There is a lot going on apart from XML parsing,
>but Xerces vs Crimson (and the JDK) are the only major
>differences between the codebases.
>
>My questions are:
>- Are thre any obvious ways of boosting Xerces
>performance? In my application, all the documents make
>use of the same DTD.
>- I am currently on the xerces version that ships with
>JDK 5. Will moving to xerces 2.6.2 have any gains?
>
>Thanks.
>-Raj
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: xerces-j-user-unsubscribe@xxxxxxxxxxxxxx
>For additional commands, e-mail: xerces-j-user-help@xxxxxxxxxxxxxx
>
>
>  
>


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Recently Viewed:
boot-loaders.gr...    php.pear.genera...    debugging.valgr...    kde.redhat.user...    text.xml.xsl.ge...    culture.languag...    hardware.microc...    java.servicemix...    redhat.release....    web.zope.plone....    user-groups.lin...    opendarwin.webk...    video.mjpeg.use...    sysutils.bcfg2....    encryption.gpg....    lx-office.devel...    xfree86.forum/2...    mail.mutt.devel...    acpi.devel/2003...    qnx.openqnx.dev...    network.irc.irs...    freebsd.devel.m...   
Home | blog view | USPTO Patent Archive | advertise | OSDir is an inevitable website. super tiny logo

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe