From: Peter Kirk (peterkirk@qaya.org)
Date: Wed Dec 17 2003 - 14:51:23 EST
On 17/12/2003 11:29, Philippe Verdy wrote:
>Peter Kirk wrote:
>
>
>>Conclusion: the right thing even for Turkish is to drop the dot on i
>>before a circumflex.
>>
>>
>
>I agree. The letter is rare enough to not create an exception here for
>the removal of dot on the soft-dotted i followed by circumflex (which
>is needed much more often in other languages that use 'î' and Î'.
>
>
>
I'm not sure that rarity is a good argument, but we agree on the conclusion.
>>But by the same argument we would also want to drop
>>the dot on dotless I.
>>
>>
>
>I think you meant "But by the same argument we would also want to drop
>the dot on DOTTED I". I would not recommand it, this would make things
>even worse and more complicated.
>
>
>
Indeed. Thank you for correcting my error.
>If Turkish wants to remove the dot on "pseudo-dotted" I if followed by
>a circumflex, the correct thing to do is then to use the ASCII dotless
>I and add a circumflex or use its canonical equivalent
><LATIN CAPITAL LETTER I WITH CIRCUMFLEX>.
>
>With the current specification, both of
> <LATIN CAPITAL LETTER I, COMBINING CIRCUMFLEX>, and
> <LATIN CAPITAL LETTER I WITH CIRCUMFLEX>
>are canonical equivalents and must render the same, without the dot.
>
>To display a dot, one can use one of the four canonical eqquivalents:
> <LATIN CAPITAL LETTER I WITH DOT ABOVE, COMBINING CIRCUMFLEX>
> <LATIN CAPITAL LETTER I WITH CIRCUMFLEX, COMBINING DOT ABOVE>
> <LATIN CAPITAL LETTER I, COMBINING DOT ABOVE, COMBINING CIRCUMFLEX>
> <LATIN CAPITAL LETTER I, COMBINING CIRCUMFLEX, COMBINING DOT ABOVE>
>(one is the NFC form, another is the NFD form, two others are also
>possible)
>
>
>
The problem which might arise is when someone applies Turkic casing
operations to a Turkish text including i with circumflex (at least in
NFD). The small version becomes <dotted I, circumflex>, which is wrong
and looks wrong. The capital version becomes <dotless i, circumflex>,
which is wrong but looks correct.
So the rules should be adjusted so that the normal casing rule, not the
special Turkic one, applies when there is a circumflex. But perhaps not
when there are other accents e.g. I would expect that an acute is
sometimes used as a stress marker but it would then need to appear in
addition to the dot on i and dotted I, cf. Lithuanian.
-- Peter Kirk peter@qaya.org (personal) peterkirk@qaya.org (work) http://www.qaya.org/
This archive was generated by hypermail 2.1.5 : Wed Dec 17 2003 - 15:47:19 EST