From: Peter Kirk (firstname.lastname@example.org)
Date: Thu Jul 10 2003 - 12:42:36 EDT
On 10/07/2003 09:34, Stefan Persson wrote:
> Peter Kirk wrote:
> > Maybe, but it is hardly realistic to expect all existing Turkish and
> Azeri text to be recoded to insert a character in the middle of each f
> - i sequence.
> Aren't most Turkish and Azeri text coded as ISO-8859-9 and similar
> code pages? I that case, it would be enough to add the proper
> disjoiners to the proper Unicode conversion tables.
There is no existing code page covering Azeri Latin, so everything is in
Unicode or in one of a huge variety of custom solutions. See
and the article "The Land of Azeri Fonts: It's a Jungle Out There" in
the same magazine issue, unfortunately not online, which summarises 20
or so custom encodings all in current use.
Anyway, I understood from the recent discussion of Hebrew that it is
Unicode policy not to do anything which could theoretically invalidate
existing text even if it could be proved that no such text existed.
-- Peter Kirk email@example.com http://web.onetel.net.uk/~peterkirk/
This archive was generated by hypermail 2.1.5 : Thu Jul 10 2003 - 13:22:05 EDT