logo       

Re: Shift-JIS/Unicode mapping in JAVA: msg#00742

text.unicode.general

Subject: Re: Shift-JIS/Unicode mapping in JAVA

From: Jane Liu <xjliu_ca@xxxxxxxxx>
Subject: Shift-JIS/Unicode mapping in JAVA
Date: Wed, 28 May 2003 12:36:39 -0700 (PDT)
Message-ID: <20030528193639.92471.qmail@xxxxxxxxxxxxxxxxxxxxxxx>
> I am running a JAVA program on Japanese Windows 2000 system, looking
> at the Unicode conversion of the following four characters from
> Shift-JIS encoding (MS-CP932) in both JRE 1.3.1 and JRE 1.4.1, and
> noticed some interesting changes:

I guess that you used the charset name "Shift_JIS". Would you try to
use "Windows-31J"?

Two Shift-JIS variations are registed in the IANA registry:
"Shift_JIS" and "Windows-31J". The former is for JIS X 0208 and the
latter is for Microsoft's CP932. "Windows-31J" was proposed by one of
Microsoft's Japanese engeneers.

"Shift_JIS" is aliased to JIS X 0208 in JDK 1.1-1.1.7. But it is
re-aliased to CP932 in JDK 1.1.8-J2SE 1.4 ("Windows-31J" is also
aliased to CP932) and we found problems that we can't select the right
character encoding in J2EE platforms or there is a mapping
mis-matching between JDK and Xerces (Xerces has an original alias
table to alias "Shift_JIS" to JIS X 0208).

So we requested the following alias change and it was accepted in J2SE
1.4.1:

Shift_JIS -> JIS X 0208's shift-jis encoding.
Windows-31J -> Microsoft's CP932

See changes of J2SE 1.4.1.

http://java.sun.com/j2se/1.4.1/changes.html#Shift-JIS

Kazuhiro Kazama (kazama@xxxxxxxxxx) NTT Network Innovation Laboratories


To Unsubscribe, send a blank message to: unicode-unsubscribe@xxxxxxxxxxxxxxx

This mailing list is just an archive. The instructions to join the true Unicode
List are on http://www.unicode.org/unicode/consortium/distlist.html


Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/





<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | FAQ | advertise