
truncating last character (or bytes?) when using 2 byte UTF characters in f: msg#00023

Subject: truncating last character (or bytes?) when using 2 byte UTF characters in fields
Hello All,

I am running into a strange possible bug, and I could really use someone else to test this and see whether it is specific to my current setup. I have tested in 0.9.7, 0.9.9, and 1.0.0 and seen it in each environment.

To reproduce: using the attached version of the hello.xhtml form (modify the xml:base attribute in the form to reflect your chiba URL), input "test1" in the name field and "TEST1: aéromexico" in the content field, and try either the debug or the instance submission.

In all my tests, it seems that because of the non-ASCII character "é" (e with an acute accent, Unicode code point 233, U+00E9, if it doesn't show in the email) in aéromexico, the serialization of the XML is getting corrupted somewhere and the final character of the XML document is not output on submission. This is visible by viewing the resulting XML and seeing that the final ">" is missing from the last closing tag. I presume it is because this character takes two bytes in UTF-8, because if I enter plain ASCII characters there is no problem.

So, two questions: Is anyone else seeing this, and if so, are there any ideas where to start digging around to fix it?

Thanks,

Zach Roberts
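
For anyone trying to reproduce this, one possible explanation for the symptom (an assumption, not something confirmed in the Chiba source) is that the serialized XML is encoded as UTF-8 but its length is taken from the character count instead of the byte count. Since "é" is one character but two UTF-8 bytes, the output would then come up exactly one byte short, which drops the final ">". The sketch below is standalone Java and does not use any Chiba API; the class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration only: none of these names come from Chiba.
final class Utf8LengthMismatch {

    // Simulates the suspected bug: the payload is UTF-8 encoded, but only
    // xml.length() bytes are written (a char count, not a byte count).
    static String sendWithCharCount(String xml) {
        byte[] bytes = xml.getBytes(StandardCharsets.UTF_8);
        int declared = Math.min(xml.length(), bytes.length);
        byte[] sent = Arrays.copyOf(bytes, declared);
        return new String(sent, StandardCharsets.UTF_8);
    }

    // Correct variant: the length is taken from the encoded byte array.
    static String sendWithByteCount(String xml) {
        byte[] bytes = xml.getBytes(StandardCharsets.UTF_8);
        byte[] sent = Arrays.copyOf(bytes, bytes.length);
        return new String(sent, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // \u00e9 is "é"; the escape avoids source-file encoding issues.
        String xml = "<data><name>test1</name>"
                + "<content>TEST1: a\u00e9romexico</content></data>";

        System.out.println("chars: " + xml.length());
        System.out.println("bytes: "
                + xml.getBytes(StandardCharsets.UTF_8).length);
        System.out.println("char-count send: " + sendWithCharCount(xml));
        System.out.println("byte-count send: " + sendWithByteCount(xml));
    }
}
```

If this is what is happening, each additional two-byte character in the input should cut one more byte off the end of the submitted document, which would be a quick way to check the theory. Places worth inspecting would be anywhere a Content-Length or buffer size is computed from a String rather than from the encoded bytes.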

