Re: Semi-annual reminder of CSets collection

From: Philippe Verdy <>
Date: Sat, 17 Nov 2012 01:48:01 +0100

A few additional ones (not in the ~leisher collection) can also be found in
an annex directory of the Unicode database.

IBM has long maintained and published a *much* larger collection of
charsets (including many variants). In my opinion the IBM collection is the
most complete one: it is **much** larger than what is actually registered
in the IANA database, where many **distinct** IBM charsets are treated as
"compatible aliases" (almost incorrectly, if they are used improperly), and
then only for the purpose of **plain-text** encoding and for their use in
IETF protocols like MIME.

The MIME charsets are also frequently referenced in many technical
specifications from various international standards bodies like the ITU or
the IEEE, from CEN in Europe, and from national standards bodies (like the
former ANSI in the USA).

2012/11/16 Mark Leisher <>

> The CSets collection is a set of mapping tables between various character
> sets and Unicode, and is intended to provide mappings not included in most
> character set conversion tools available today.
> The collection hasn't changed since 2008, but if you know of any encoding
> or table out there that might be a candidate, please let me know.
> --
> Mark Leisher
Received on Fri Nov 16 2012 - 18:49:07 CST
