RE: MS/Unix BOM FAQ again (small fix)

From: jarkko.hietaniemi@nokia.com
Date: Wed Apr 10 2002 - 15:53:03 EDT


> If you look for any Unicode signature, then you look for FF
> FE 00 00 (UTF-32LE) before you check for FF FE (UTF-16LE).

FF FE 00 00 could be the UTF-32LE BOM, but it could also be UTF-16LE BOM
followed by a UTF-16 U+0000. Yes, the NULL is usually not thought of as "text",
but there's no knowing what data people might be storing in UTF-16.
So it's back again to either out-of-band information or heuristics.



This archive was generated by hypermail 2.1.2 : Wed Apr 10 2002 - 14:09:20 EDT