|
Re: Last Call comments on IRI - 3.1 Mapping of IRIs to URIs: msg#00036org.w3c.tag
There is also the following Note: Note: The difference between Variants B and C in Step 1 (Variant B using normalization with NFC while Variant C not using any normalization) is to account for the fact that in many non-Unicode character encodings, some text cannot be represented directly. For example, Vietnam is natively written "Việt Nam" (containing a LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOW in NFC, but a direct transcoding from the windows-1258 character encoding leads to "Việt Nam" (containing a LATIN SMALL LETTER E WITH CIRCUMFLEX followed by a COMBINING DOT BELOW), whereas direct transcoding of other 8-bit encodings of Vietnamese may lead to other representations. Would moving this closer to the A/B/C variants, and maybe adding some text, be a solution to your last call comment? Regards, Martin. At 14:50 04/08/18 +0900, Martin Duerst wrote: Hello Chris, I believe that I understand why this step says 'do not normalize' |
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: XML and Versioning: 00036, Rick Jelliffe |
|---|---|
| Next by Date: | TAG announces Last Call review of Architecture of the World Wide Web: 00036, Paul Cotton |
| Previous by Thread: | Re: Last Call comments on IRI - 3.1 Mapping of IRIs to URIsi: 00036, Martin Duerst |
| Next by Thread: | Athens Olympics site policy regarding links to the site: 00036, Ian B. Jacobs |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |