Re: Unicode plain-text file

From: Kenneth Whistler (kenw@sybase.com)
Date: Thu May 22 1997 - 18:59:06 EDT


Tim Partridge wrote:

> Someone else suggested that New Line should just be white space not a block
> separator. I don't agree - surely a paragraph is (usually) a new line with
> some extra white space added - this implies the semantics should be similar.

Please be extra careful here. The suggestion was specifically that
U+2028 LINE SEPARATOR (not NL or LF functioning as newline) should be
considered WS rather than BS. Both are technical categories of the bidi
algorithm. WS here does not mean white space as processed, for example,
by a C preprocessor, nor white space in the sense of unprinted area on a
text page. BS is the category used to determine the boundaries of
directional blocks.

Cf. pages 3-15 and 3-17 of the Unicode Standard.
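
For anyone who wants to check the categories directly, here is a minimal
sketch, assuming Python's standard unicodedata module. Its tables follow
whatever Unicode version ships with the interpreter, not the 1997 edition
cited above, and current data abbreviates the block (paragraph) separator
category as B rather than BS. The helper names bidi_class and report are
made up for illustration.

```python
import unicodedata

# Code points discussed in this thread; the labels are informal descriptions.
SAMPLES = {
    "\u2028": "LINE SEPARATOR",
    "\u2029": "PARAGRAPH SEPARATOR",
    "\n": "LF",
    " ": "SPACE",
}


def bidi_class(ch: str) -> str:
    """Return the bidi category abbreviation the bundled Unicode data assigns to ch."""
    return unicodedata.bidirectional(ch)


def report() -> dict:
    """Map each sample code point (hypothetical selection) to its bidi category."""
    return {f"U+{ord(ch):04X} {label}": bidi_class(ch) for ch, label in SAMPLES.items()}


if __name__ == "__main__":
    for name, cls in report().items():
        print(f"{name}: {cls}")
```

With current Unicode data, U+2028 reports WS while U+2029 and LF report B,
which is exactly the distinction at issue: a WS character does not end a
directional block, a B character does.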

--Ken Whistler
 


