UTR #13 comments (was: Plain text: Amendment 1)

From: John Cowan (cowan@locke.ccil.org)
Date: Tue Jul 06 1999 - 13:39:47 EDT


Mark Davis wrote:
 
> A lot of the discussion of line termination relates to technical report #13.
> Any suggestions for additional information for that report would be welcome.

My suggestions:

1) The NEL character in the C1 set (0x85) is the ISO equivalent of
EBCDIC NL (0x15) and this mapping is duly given in the EBCDIC code page
mappings on the Unicode FTP site. The text should therefore advise
applications to treat U+0085 (NL/NEL) as a newline, not U+0015 (NAK).

2) There should be a warning that some old documents use bare
CR (0x0D) to do underlining or other overstriking; an application
that converts such text should do a more complex conversion, though
treating bare CR as a NLF is marginally acceptable even for these
documents (which may then wind up containing occasional lines
with only spaces and underscores).
 

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
   Schlingt dreifach einen Kreis um dies! / Schliesst euer Aug vor heiliger Schau,
   Denn er genoss vom Honig-Tau / Und trank die Milch vom Paradies.
			-- Coleridge / Politzer



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:48 EDT