Re: JIS X 0208 mappings in Unihan.txt

From: Erik van der Poel (erik@vanderpoel.org)
Date: Sat Jun 04 2005 - 09:56:57 CDT

  • Next message: Michael Everson: "Woleai (Caroline Islands script)"

    Theo Veenker wrote:
    > If there are so many subtle changes between different
    > versions of the same coded character set wouldn't make sense to have
    > tags for all of these versions available in Unihan.txt?

    It is trivial to come up with the Unicode mappings for the pairs of
    simplified and traditional characters involved in the changes between 78
    and 83. It is not as trivial to determine the Unicode code points for
    the characters that changed in subtle ways between 78, 83 and 90. Even
    if you decided that a particular change in JIS X 0208 was not reflected
    in Unicode by assigning distinct codepoints to the "before" and "after"
    versions, who can assure you that that distinction will never be made in
    a future version of Unicode, given that e.g. U+2090E appeared in Unicode
    3.1 after U+5315 had been encoded in Unicode 1.1?

    http://www.unicode.org/Public/UNIDATA/DerivedAge.txt

    Erik



    This archive was generated by hypermail 2.1.5 : Sat Jun 04 2005 - 09:57:41 CDT