Re: UTF-8 ill-formed question

From: Otto Stolz <>
Date: Wed, 12 Dec 2012 20:55:42 +0100


On 2012-12-11 20:16, James Lin wrote:
> If I have a code point: U+4E8C or "二"
> In UTF-8, it's "E4 BA 8C" while in UTF-16, it's "4E8C".
> Where does this "BA" come from?

Cf. <>.
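
In short, the "BA" is the middle byte of the three-byte UTF-8 form:
the 16 bits of U+4E8C are split 4 + 6 + 6 and each group gets a
fixed prefix (1110, 10, 10). Here is a minimal sketch of that bit
shuffling in Python; the function name utf8_encode_3byte is made up
for illustration, and it only covers the three-byte range
U+0800..U+FFFF, not the other sequence lengths.

```python
def utf8_encode_3byte(cp: int) -> bytes:
    """Encode one code point in U+0800..U+FFFF as three UTF-8 bytes.

    Hypothetical helper for illustration only; a real encoder also
    handles the 1-, 2- and 4-byte forms.
    """
    if not 0x0800 <= cp <= 0xFFFF:
        raise ValueError("code point outside the three-byte UTF-8 range")
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("surrogate code points are ill-formed in UTF-8")

    lead = 0xE0 | (cp >> 12)            # 1110xxxx: top 4 bits
    mid = 0x80 | ((cp >> 6) & 0x3F)     # 10xxxxxx: middle 6 bits
    last = 0x80 | (cp & 0x3F)           # 10xxxxxx: low 6 bits
    return bytes([lead, mid, last])


# U+4E8C = 0100 111010 001100 in binary (split 4 + 6 + 6)
encoded = utf8_encode_3byte(0x4E8C)
print(encoded.hex(" ").upper())
```

For U+4E8C the middle six bits are 111010; with the 10 prefix that
gives 10111010, which is BA.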

Enclosed are the (almost original) version of “Cima’s Magic
UTF-8 Pocket encoder” (2004), and its two followers for
more UTFs. Display or print with a fixed-pitch font,
such as Lucida Console or Courier New. Enjoy!

    Otto Stolz

