RE: Surrogate support in *ML?

From: Brendan Murray/DUB/Lotus (Brendan_Murray@Lotus.com)
Date: Thu Sep 07 2000 - 10:47:33 EDT

Next message: Gary P. Grosso: "Unicode on a non-Unicode web page"
Previous message: Michael (michka) Kaplan: "Re: Tamil glyphs"
Maybe in reply to: Brendan Murray/DUB/Lotus: "Surrogate support in *ML?"
Next in thread: Brendan Murray/DUB/Lotus: "Re: Surrogate support in *ML?"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Karlsson Kent - keka <keka@im.se> wrote:
> At the level of XML the number of bits is irrelevant.
> The "high and low surrogate" code points are excluded
> from being used as NCRs. A character (not UTF-16 code
> units) can be referenced by NCRs. See (XML) production 66
> (CharRef) and its well-formedness constraint (and
> production 2 (Char), though it fails to exclude a number
> of other non-character code points).
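
A rough sketch of the quoted point, assuming Python 3: a numeric character
reference names the code point itself, so a non-BMP character is written as
a single NCR, while its two UTF-16 surrogate code units may not be written
as NCRs. The helper names to_ncr and utf16_units are hypothetical, chosen
only for this illustration, and U+20000 is used simply as an example of a
code point outside the BMP.

```python
def to_ncr(ch):
    """Return a hexadecimal NCR for one character (one code point)."""
    cp = ord(ch)
    # Surrogate code points are not characters, so XML's CharRef
    # well-formedness constraint does not allow referencing them.
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("surrogate code points cannot be used in an NCR")
    return f"&#x{cp:X};"


def utf16_units(ch):
    """Return the UTF-16 code units that encode one character."""
    data = ch.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


ch = "\U00020000"  # example code point outside the BMP (assumption for the demo)
print(to_ncr(ch))                          # &#x20000;
print([hex(u) for u in utf16_units(ch)])   # ['0xd840', '0xdc00']
```

The same character therefore has one NCR (&#x20000;) but two UTF-16 code
units (D840 DC00); writing &#xD840;&#xDC00; instead would not be well-formed.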

I know that XML explicitly excludes surrogates. My question is really about
how to encode the non-BMP characters in the new Han unification data that
will become part of 10646 and Unicode in the not too distant future: is this
huge block of characters regarded as irrelevant, or has anyone proposed an
encoding that can be used?

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:13 EDT