Re: "A Programmer's Introduction to Unicode"

From: Richard Wordingham <richard.wordingham_at_ntlworld.com>
Date: Mon, 13 Mar 2017 21:47:04 +0000

On Mon, 13 Mar 2017 23:10:11 +0200
Khaled Hosny <khaledhosny_at_eglug.org> wrote:
 
> But there are many text operations that require access to Unicode code
> points. Take for example text layout, as mapping characters to glyphs
> and back has to operate on code points. The idea that you never need
> to work with code points is too simplistic.

There are advantages to interpreting and operating on text as though it
were in form NFD. However, there are still cases where one needs
fractions of a character, such as word boundaries in Sanskrit, though I
think the locations are liable to be specified in a language-specific
form. U+093E DEVANAGARI VOWEL SIGN AA can have a word boundary in it
in at least 4 ways.

Richard.
Received on Mon Mar 13 2017 - 16:48:07 CDT

This archive was generated by hypermail 2.2.0 : Mon Mar 13 2017 - 16:48:07 CDT