|
Korean input using Xkb (was Re: switching to UTF-8): msg#00063internationalization.linux
On Thu, 2 May 2002, Pablo Saratxaga wrote: > On Thu, May 02, 2002 at 02:14:08AM -0400, Jungshik Shin wrote: > > > BTW, Xkb may work for Korean Hangul, too and we don't need > > XIM if we use 'three-set keyboard' instead of 'two-set keyboard' > If it is indeed doable, it wouldn't be very practical due to the high > amount of possible combinations (the Xkb based solution consist to As you correctly noticed, I was talking about using U+1100 Jamos. Some Koreans believe that it's the greatest mistake of Korean nat'l standard body to insist that 11,172 syllables are encoded in Unicode/ISO 10646 and to prevail in ISO/IEC JTC1/SC2/WG2 because 11,172 precomposed syllables are not sufficient for _even modern_ Korean, we need U+1100 Jamo support anyway for Middle (and future) Korean and the inclusion of 11,172 syllables only delayed support of U+1100 Jamos by giving (or rather strengthening) a **false** impression to developers that Hangul doesn't need to be treated as a complex script (as Indic and Thai scripts do). For instance, Sun's complex text/script support plan _did_ not mention Korean Hangul while listing various South and Southeast Asian scripts and Hebrew and Arabic as the targets of complex text processing. It's frustrating to have to debunk the myth (that Korean Hangul can be treated just like Japanese and Chinese writing systems and that it doesn't have anything in common with South and Southeast Asian scripts) time and again. At the moment, it's only Microsoft that fully understands the issue and offers the full range of Hangul support. Fortunately, Pango is moving forward in the right direction and hopefully ST and ATSUI(of MacOS) would help, too. > But that's true that hangul-only typing doesn't require any user > interactivity at all, an on-the-spot method that just analyzes the input > and convert to preformed hangul syllabes on the fly is enoguh. Yup. It's basically a not-so-complex automata. For three-set keyboard, it's very simple while for two-set keyboard, it's a bit more complicated. An automata for two-set Middle Korean KBD would be much more complicated than three-set Middle Korean KBD. Of course, Hanja input does require dictionary look-up and user interaction. > Now, if by "Xkb is enough to type Korean" you meant typing directly > the single jamos without composing, yes, that's perfectly possbile; You have to note that I had two conditions under which that might be possible. One of them is that 'three-set' keyboard is used. 'Three-set' keyboard distinguishes between leading consonants and trailing consonants while 'two-set' keyboard doesn't. The other is that we use U+1100 Jamos to represent Hangul. > but the produced output won't be in the standardized precomposed form > for the common korena syllabes, that could be a compatibility problem > if you exchange files written that way. Well, I have to quote 'standard' in standard precomposed form :-). It's certainly true that precomposed form is widely used for Korean. However, many people including me want to go all the way to using exclusively U+1100 Hangul Jamos for both modern and Middle Korean when a large enough number of programs and fonts support that. To achieve backward compatibility, post and pre-processing (to convert modern syllables into and out of NFC -precomposed forms) can be done if necessary. Jungshik Shin P.S. Attached is an example of Xkb definition for a 3-set Korean keyboard. It's made by PARK Won Kyu <wkpark@xxxxxxxxxxxxxxx>.
|
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: Switching to UTF-8: 00063, Jungshik Shin |
|---|---|
| Next by Date: | Re: Switching to UTF-8: 00063, Tomohiro KUBOTA |
| Previous by Thread: | Re: Switching to UTF-8i: 00063, Pablo Saratxaga |
| Next by Thread: | Re: readline (was: Switching to UTF-8): 00063, Bruno Haible |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |