FYI : Korean standard naming scheme change

From: Jungshik Shin (jshin@pantheon.yale.edu)
Date: Sat Jul 11 1998 - 15:57:39 EDT


  Hello,

  Korean standard body recently changed the naming scheme of Korean
standards involving information exchange. What used to be refered to as
KS C 5[6-8]xx are assigned new names in X section of KS(Korean
Standard). This change reportedly became unavoidable because KS C
5[6-8]xx ran out of space(among 300 positions in the range, only 3 were
left untaken as of 1998).

 Old New
KS C 5601 KS X 1001 : 94x94 graphic coded character set for Korean
KS C 5657 KS X 1002 : additional characters(94x94) for information
                           exchange
                           (mentioned as a source of UniHan in Unicode 2.0)
KS C 5636 KS X 1003 : Korean version of ISO 646
KS C 5620 KS X 1004 : ISO/IEC 2022
KS C 5700 KS X 1005-1 : Unicode 2.0/ISO-10646-1
KS C 5697 KS X 1023 : ISO 2375
KS C 5861 KS X 2901 : Korean Unix environment

  In light of this change, I think the name of the directory for Korean
standard in ftp://ftp.unicode.org/Public/MAPPINGS/EASTASIA should be
changed to KSX(or just KS) from KSC. In order not to break old links
around the net, KSC can be made a link to KSX(or KS).

  Incidentally, I'd like KSC5601.TXT at

   ftp://ftp.unicode.org/Public/MAPPINGS/EASTASIA/KSC/KSC5601.TXT

to be replaced by
 
   KSX1101.TXT at

   http://pantheon.yale.edu/~jshin/faq/KSX1101.TXT.gz
   (after ungzipping)

   KSC5601.TXT at Unicode archive is NOT the mapping between KS X
1101(KS C 5601-1992) and Unicode 2.0(in UCS-2) BUT the mapping between
double byte part of UHC(Unified Hangul Code/MS code page 949) and
Unicode 2.0(in UCS-2). UHC/CP949 has NOTHING to do with KS X 1101(KS C
5601-1992) per se(it's just a proprieatary/vendor specific encoding used
only in Korean MS-Windows) while it's upward compatible(it uses C1 range
reserved for control characters in ISO-2022 to make rooms for additional
Hangul syllables not covered in EUC-KR) with EUC-KR(Korean EUC) encoding
of KS X 1101(KS C 5601-1992/1987) and KS X 1103(KS C
5636)/US-ASCII/ISO-646. Please, note that the mapping table between
CP949/UHC and Unicode 2.0 (in UCS-2) is available at

  ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP949.TXT

where it really belongs as a vendor specific encoding. Hence,
replacing KSC5601.TXT(which is actually CP949.TXT) with genuine
KSX1101.TXT results in no loss of information while helping
reducing confusion about Korean standard.

 In addition, I'd like to request that JOHAB.TXT at

   http://pantheon.yale.edu/~jshin/faq/JOHAB.TXT.gz

be made available at

   ftp://ftp.unicode.org/Public/MAPPINGS/EASTASIA/KSC/JOHAB.TXT

JOHAB is an encoding specified in KS X 1101(KS C 5601-1992) annex 3
as a *supplementary* encoding. It encodes all of
11,172 Hangul syllables in UAC00-UD7A3 while KS X 1101(as defined
the main body of the document) based encoding like EUC-KR and
ISO-2022-KR only encode 2350 Hangul syllables.

   It'd be great if those in charge of Unicode archive could
make these changes soon.
    
    Thank you,

     Jungshik Shin



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:40 EDT