Re: BOM ambiguity?

From: John W Kennedy <jwkenne_at_attglobal.net>
Date: Fri, 13 Jul 2012 19:57:59 -0400

On Jul 13, 2012, at 4:54 PM, Stephan Stiller wrote:
> As an aside to the BOM discussion - something I've always been meaning to ask.
>
> So there is a BOM ambiguity when a file starts with
> FF FE
> and then a couple of U+0000 characters, yes? Under little-endian byte order this could be either UTF-16 or UTF-32. Has this been pointed out and discussed before?
>
> I ask because the BOMs of the different encodings don't form a prefix-free code: the UTF-16LE BOM (FF FE) is a prefix of the UTF-32LE BOM (FF FE 00 00).

Isn't this why UTF-32 is forbidden in HTML5?
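
A minimal Python sketch of the ambiguity described in the quoted message may help make it concrete. It uses only the BOM constants and codecs from the standard library; the helper name bom_candidates and the BOMS table are hypothetical, introduced here for illustration, and this is not meant as how any particular browser or detector works.

```python
import codecs

# Hypothetical lookup table built from the standard library's BOM constants.
BOMS = [
    ("UTF-8", codecs.BOM_UTF8),
    ("UTF-16LE", codecs.BOM_UTF16_LE),
    ("UTF-16BE", codecs.BOM_UTF16_BE),
    ("UTF-32LE", codecs.BOM_UTF32_LE),
    ("UTF-32BE", codecs.BOM_UTF32_BE),
]


def bom_candidates(data):
    """Return the names of every encoding whose BOM is a prefix of data."""
    return [name for name, bom in BOMS if data.startswith(bom)]


# The byte sequence from the question: FF FE followed by 00 00.
data = b"\xff\xfe\x00\x00"

# The UTF-16LE BOM is a proper prefix of the UTF-32LE BOM,
# so the set of BOMs is not prefix-free.
assert codecs.BOM_UTF32_LE.startswith(codecs.BOM_UTF16_LE)

# Both decoders accept the same four bytes, with different results.
as_utf16 = data.decode("utf-16")  # BOM, then a single U+0000
as_utf32 = data.decode("utf-32")  # BOM only, no characters follow

print(bom_candidates(data))
print(repr(as_utf16), repr(as_utf32))
```

A detector that checks for the longer BOM first will read such a file as UTF-32; one that checks for FF FE first will read it as UTF-16 text beginning with U+0000. Neither choice can be justified from the bytes alone.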

-- 
John W Kennedy
Having switched to a Mac in disgust at Microsoft's combination of incompetence and criminality.
Received on Fri Jul 13 2012 - 19:01:28 CDT