RE: PC code pages and national character sets

From: Mike Brown (mbrown@webb.net)
Date: Mon Apr 30 2001 - 20:31:35 EDT


> Is there a list of all PC code pages, national character sets,
> and/or mail encodings, that area in real use today somewhere
> in the web?

A list of names of character sets that may be used on the Internet:
http://www.isi.edu/in-notes/iana/assignments/character-sets

> If the list can also tell which one(s) for a given language is used
> most popluarly, if more than one code pages, character sets, or
> encodings exist.

Character "sets" map abstract characters to specific bit patterns (bytes or
byte sequences, usually). They do this for certain character repertoires
(subsets of the entire Unicode repertoire, one might say). These repertoires
are usually oriented toward the support of one or more writing systems
("scripts"). Different languages share certain scripts. Scripts sometimes
overlap. There are other complicating factors. The relationships between
character sets and languages, therefore, is quite intricate. It is hard to
find a centralized list.

That said, you will probably be able to glean the info you need from
http://www.eki.ee/itstandard/docs/draft-alvestrand-lang-char-03.txt
and
http://www.eki.ee/letter/

Have fun.



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:16 EDT