RE: Roundtripping in Unicode

From: Mike Ayers (
Date: Tue Dec 14 2004 - 18:22:08 CST

  • Next message: Mike Ayers: "RE: Roundtripping in Unicode"

    > From: Peter Kirk []
    > Sent: Tuesday, December 14, 2004 3:37 PM

    > Thanks for the clarification. Perhaps the bifurcation could
    > be better expressed as into "strings of characters as defined
    > by the locale" and "strings of non-null octets". Then I could
    > re-express this as "the only safe way out of this mess is
    > never to process filenames as strings of characters as
    > defined by the locale".

            That would not be correct for ISO 8859 locales, though (amongst
    others). That's why I specified UTF-8. Although other locales may have the
    problem of invalid sequences, we're only interested in UTF-8 here.

    > Well, I was assuming that when John Cowan implied that 0x08
    > was permitted, and Jill wrote "Unix filenames consist of an
    > arbitrary sequence of octets, excluding 0x00 and 0x2F", they
    > were speaking from the appropriate orifices.

            Correct, and my bad. I got thrown off by John's:

    >>(A private correspondent has come up with an ingenious trick which
    >>depends on being able to create files named 0x08 and 0x7F, but it truly
    >>is a trick, and in any case depends only on an ASCII interpretation.)

            which I misinterpreted to mean that 0x08 was a forbidden character.
    It isn't - just real hard to type!


    "Tumbleweed E-mail Firewall <>" made the following
     annotations on 12/14/04 16:24:51
    This e-mail, including attachments, may include confidential and/or proprietary information, and may be used only by the person or entity to which it is addressed. If the reader of this e-mail is not the intended recipient or his or her authorized agent, the reader is hereby notified that any dissemination, distribution or copying of this e-mail is prohibited. If you have received this e-mail in error, please notify the sender by replying to this message and delete this e-mail immediately.

    This archive was generated by hypermail 2.1.5 : Tue Dec 14 2004 - 18:27:43 CST