Title Case (Was: [OT] multilingual support in MS products: msg#00362

text.unicode.devel

Subject: Title Case (Was: [OT] multilingual support in MS products)

Doug,


> But SpecialCasing.txt, along with the rest of the UCD,
> defines how those algorithms should function and what entries the tables
> should have.
>
> Whatever technique Turkish-aware programs currently use to recognize (I
> and ı) and (İ and i) as case pairs, something similar should be done to
> Dutch-aware programs so they recognize IJ as an inseparable digraph with
> special casing behavior. This is *much* better than dragging the IJ and
> ij compatibility characters out of the closet.

Special casing deals with upper/lower case transformations, not specifically with a
title case routine. The Turkish and Dutch cases work because all letters are
converted.
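
To illustrate the distinction, here is a minimal Python sketch. The function
names (upper_tr, lower_tr, titlecase_word_nl) are hypothetical, and the
mappings are hand-written assumptions for this example, not taken from
SpecialCasing.txt or from any library. Python's built-in str.upper() and
str.lower() are not locale-aware, so the Turkish pairs are remapped by hand
before the generic conversion runs.

```python
def upper_tr(text):
    # Assumed Turkish rule: dotted i -> İ, dotless ı -> I, then the generic mapping.
    return text.replace("i", "İ").replace("ı", "I").upper()


def lower_tr(text):
    # Reverse mapping, applied before the generic lower() touches I or İ.
    return text.replace("İ", "i").replace("I", "ı").lower()


def titlecase_word_nl(word):
    # Assumed Dutch rule: a leading "ij" is capitalized as a unit.
    lowered = word.lower()
    if lowered.startswith("ij"):
        return "IJ" + lowered[2:]
    return lowered[:1].upper() + lowered[1:]


print(upper_tr("istanbul"))             # İSTANBUL
print(lower_tr("ILIK"))                 # ılık
print("ijsselmeer".upper())             # IJSSELMEER (every letter converted)
print("ijsselmeer".capitalize())        # Ijsselmeer (first letter only)
print(titlecase_word_nl("ijsselmeer"))  # IJsselmeer
```

Whole-string conversion of the Dutch word needs no special rule, because every
letter changes anyway. The digraph only matters once a routine has to pick
which letters to capitalize, and that is the title case routine.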

I found title case to be a much more challenging problem. There are issues
like Macintosh vs. MacIntosh, but the hardest one I found was the handling of
French articles. That takes syntactic analysis.
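
A rough Python sketch of why a table-driven title case routine falls short.
SMALL_WORDS and naive_title are hypothetical names, and the word list is an
assumption made up for this example, not a statement of the actual French
rules.

```python
# Hypothetical list of words to leave in lower case when they are not first.
SMALL_WORDS = {"le", "la", "les", "un", "une", "des", "du", "de", "et"}


def naive_title(text):
    # Lower-case everything, then capitalize each word unless it is a
    # listed small word that is not the first word of the title.
    words = text.lower().split()
    result = []
    for index, word in enumerate(words):
        if index > 0 and word in SMALL_WORDS:
            result.append(word)
        else:
            result.append(word[:1].upper() + word[1:])
    return " ".join(result)


print(naive_title("MacIntosh"))            # Macintosh (inner capital is lost)
print(naive_title("le rouge et le noir"))  # Le Rouge et le Noir
```

The routine cannot tell Macintosh from MacIntosh without an exception list,
and it treats every listed article the same way regardless of its role in the
phrase. Deciding that from the grammar is the part that needs real analysis.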

Carl





