Re: [long] Use of Unicode in AbiWord

From: A. Vine (
Date: Fri Mar 19 1999 - 14:46:43 EST wrote:
> > We also think that we should switch our representation to UTF-8. On
> > every platform we current plan to support, this would eliminate the
> > encoding conversion step (as well as a lot of memory usage) for any
> > run of text which includes only ASCII characters. For obvious
> > reasons, and with no offense intended to the majority of the world who
> > primarily use double-byte encoded characters, we believe this to be a
> > common case worth optimizing.
> this optimizes for one language while making it a lot less efficient to use
> just about every other language including all european ones, as well as the
> line and paragraph separator, the euro, the formatting characters,
> dingbats, etc.

Not true. Not only does ASCII represent significantly more languages than just
English (Hawaiian, Indonesian, Latin, some Native American languages, etc.), but
many Latin script languages are mostly ASCII (e.g. German, French, Spanish,
Italian, Portuguese, Dutch, Danish, Swedish, Norwegian, etc.).


Andrea Vine
Sun Internet Mail Server i18n architect
Remember: stressed is desserts spelled backwards.

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:44 EDT