RE: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

From: Doug Ewell <>
Date: Wed, 04 Jun 2014 11:26:01 -0700

Sorry, I left out an important detail.

I wrote:
> 3. U+FEFF at the beginning of a stream (note: not "packet" or
> arbitrary cutoff point)

I meant U+FEFF as a zero-width no-break space. Obviously it is very
common to see U+FEFF as a signature or BOM.

My underlying question here is, how common is it that the producer of a
stream actually intends this character *at the start of a stream* to be
a ZWNBSP, not to be stripped lest the actual text content be altered?

Doug Ewell | Thornton, CO, USA | @DougEwell
Unicode mailing list
Received on Wed Jun 04 2014 - 13:27:00 CDT

This archive was generated by hypermail 2.2.0 : Wed Jun 04 2014 - 13:27:00 CDT