Re: Private Use Area: msg#00335 text.unicode.devel
But it misses English, such as the name "Zoë"

Märk Dāvĭs
________
mark.davis@xxxxxxxxx
IBM, MS 50-2/B11, 5600 Cottle Rd, SJ CA 95193
(408) 256-3148
fax: (408) 256-0799

----- Original Message -----
From: "Eric Muller" <emuller@xxxxxxxxx>
To: "Kevin Brown" <graphity@xxxxxxxxxxxxxxx>
Cc: "Public email list" <unicode@xxxxxxxxxxx>
Sent: Saturday, April 26, 2003 10:46
Subject: Re: Private Use Area

> Kevin Brown wrote:
>
> >Is there a database anywhere that cross-references Latin-based glyphs (eg
> >E WITH DIAERESIS) to all the Latin-based writing systems they're used in
> >(eg Dutch, French, Albanian, Cyrillic and Welsh) and vice versa? It would
> >be really useful.
>
> www.eki.ee, which has already been mentioned a few times, reports the
> following for U+00CB Ë LATIN CAPITAL LETTER E WITH DIAERESIS:
>
> name: LATIN CAPITAL LETTER E WITH DIAERESIS
> old name: LATIN CAPITAL LETTER E DIAERESIS
> Adobe glyph name: Edieresis
> mnemonic name(s): <E:>
> HTML 4 mnemonic name: &Euml;
> category: Lu (Letter, Uppercase)
> combining: 0
> decomposition info: 0045 0308
> comment:
> found in charsets: 8859-1 (CB); 8859-10 (CB); 8859-14 (CB); 8859-15
> (CB); 8859-2 (CB); 8859-3 (CB); 8859-4 (CB); 8859-9 (CB); CP1250 (CB);
> CP1252 (CB); CP1254 (CB); CP1258 (CB); CP850 (D3); CP852 (D3); CP857
> (D3); CP863 (94); ROMAN (E8); CP1116 (D3); CP1122 (73); SAMI_WIN (CB);
> SAMI_MAC (E8); 8859-16 (CB);
> found in languages: af [Afrikaans]; cy [Welsh]; fr [French]; fy
> [Frisian]; lb [Luxembourgian]; nl [Dutch]; sq [Albanian]; sv [Swedish];
> wo [Wolof]; yap [Yapese]; dink [Dinka]; sla [Kashubian];
> used in romanization of: be_r [Belarusian (cyrillic)]; ru_r [Russian
> (cyrillic)];
> lowercase: 00EB
>
> This database also covers Cyrillic and the languages written in it.
>
> The underlying data can be accessed a number of ways: e.g. “what is
> needed beyond basic Cyrillic to write Altai?” reports U+0458 ј CYRILLIC
> SMALL LETTER JE, U+04A5 ҥ CYRILLIC SMALL LIGATURE EN GHE, U+04E7 ӧ
> CYRILLIC SMALL LETTER O WITH DIAERESIS and U+04F1 ӱ CYRILLIC SMALL
> LETTER U WITH DIAERESIS.
>
> As far as I can tell, the underlying data is of good quality; of course,
> there the devil is in the details, and there are certainly decisions
> that could be argued. A wonderful resource.
>
> Eric.
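
For readers who want to cross-check the character properties quoted above, here is a minimal sketch using Python's standard `unicodedata` module and built-in codecs. It does not query the www.eki.ee database, and it cannot reproduce the language or romanization lists, which are specific to that database. The helper names `describe` and `charset_bytes` are hypothetical, chosen only for this illustration, and the three codecs shown are just a sample of the charsets listed in the report.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Return a few Unicode Character Database properties for one character.

    Hypothetical helper: it mirrors some fields of the quoted report
    (name, category, combining class, decomposition, lowercase mapping).
    """
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),
        "combining": unicodedata.combining(ch),
        "decomposition": unicodedata.decomposition(ch),
        "lowercase": f"{ord(ch.lower()):04X}",
    }


def charset_bytes(ch: str, codecs=("latin-1", "cp1252", "cp850")) -> dict:
    """Encode the character in a few legacy charsets and return the hex bytes.

    The codec names are Python's own; they stand in for the report's
    8859-1, CP1252 and CP850 entries.
    """
    return {codec: ch.encode(codec).hex().upper() for codec in codecs}


if __name__ == "__main__":
    # U+00CB, the character discussed in the quoted message.
    for field, value in describe("\u00CB").items():
        print(f"{field}: {value}")
    print(charset_bytes("\u00CB"))

    # The four letters reported as needed beyond basic Cyrillic for Altai.
    for ch in "\u0458\u04A5\u04E7\u04F1":
        print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
```

Run as a script, this prints the properties for U+00CB, its byte values in the sampled charsets, and the names of the four Cyrillic letters, which can then be compared by eye with the quoted report.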