logo       

Re: [Patch] Whitespace Handling in Batik vs ASV: msg#00027

Subject: Re: [Patch] Whitespace Handling in Batik vs ASV
Hi Tonny,

Tonny Kohar wrote:

I would like to submit patch for the initial parsing white space
handling. The patch is attached to this email.

    Thanks for the patch but I think there are two issues with this.
First I don't think you can do whitespace handling in the parser.
It's a little hard to know exactly where Batik fits in the XML scheme
of things (it's not 100% clear to me if it is an XML processor or an
Application) but my reading of the XML spec indicates that Batik can't
strip spaces they must be made available through the DOM.

This patch is normalize the document on the parsing stage and it is also
handle XML space attribute based on return value of
XMLSupport.getXMLSpace(Element)

   I also don't think this is done quite right in some cases you
will introduce new spaces and in other cases you will remove
spaces that should be kept.

eg:
    <text>foo&amp;bar</text>
        will likely become: "foo & bar" instead of "foo&bar".
    <text>foo <baz:blah/>bar</text>
        Will become: "foo<baz:blah>bar" instead of staying the same.

Do I need to put this into bugzilla?

   Feel free to but I think there are fundamental issues (like
violation of the XML spec) with trying to handle this in this manner.
It may be fine for your application because you know those spaces are
never important but as a general processing tool Batik can't do it.

   Finally, did the new space handling I checked in a few weeks ago
fix text for you?  Or are you worried about something else?


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Recently Viewed:
boot-loaders.gr...    php.pear.genera...    debugging.valgr...    kde.redhat.user...    text.xml.xsl.ge...    culture.languag...    hardware.microc...    java.servicemix...    redhat.release....    web.zope.plone....    user-groups.lin...    opendarwin.webk...    video.mjpeg.use...    sysutils.bcfg2....    encryption.gpg....    lx-office.devel...    xfree86.forum/2...    mail.mutt.devel...    acpi.devel/2003...    qnx.openqnx.dev...    network.irc.irs...    freebsd.devel.m...   
Home | blog view | USPTO Patent Archive | advertise | OSDir is an inevitable website. super tiny logo

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe