logo       

[ tidy-Patches-1252651 ] Add support for for simplified Chinese encoding (G: msg#00053

web.html-tidy.tracker

Subject: [ tidy-Patches-1252651 ] Add support for for simplified Chinese encoding (GB2312)

Patches item #1252651, was opened at 2005-08-05 17:12
Message generated for change (Comment added) made by hoehrmann
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=390965&aid=1252651&group_id=27659

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: All
Group: Current - all platforms
Status: Open
Resolution: None
Priority: 5
Submitted By: Benfeng CHEN (bfchen)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add support for for simplified Chinese encoding (GB2312)

Initial Comment:
I add the support for simplified Chinese encoding
(GB2312) to tidy source code.

My work is based on the lastest version of source code,
which is released on 3 August, 2005. So it can be
directly applied to current release. The compiling
environment is RH linux + gcc 3.4.3. But since I didn't
use any os dependent code, it should be cross-
platform.

The test case is under the "test" folder in the tar.gz file.
This page is copied from
http://www.gnu.org/home.cn.html. The updated tidy
works well by using command " tidy -gb2312 -o
output_gnucn.html test_gnucn.html". In fact, I've also
test many cases, and no problem is found.

I also fixed a bug of processing traditional Chinese
(BIG5) in current tidy: the text wrapping is not correct
when using "punctuation-wrap" option.

Thank you very much.

----------------------------------------------------------------------

>Comment By: Björn Höhrmann (hoehrmann)
Date: 2005-08-17 20:35

Message:
Logged In: YES
user_id=188003

Indeed, I don't think we should add more of these
transcoders and rely on iconv, etc. instead.

----------------------------------------------------------------------

Comment By: Arnaud Desitter (arnaud02)
Date: 2005-08-17 11:53

Message:
Logged In: YES
user_id=566665

The encoding to Unicode should be done properly in the first
place.
See http://tidy.sf.net/bug/910450.


----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=390965&aid=1252651&group_id=27659


-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | FAQ | advertise