L2/05-364 From: Christopher Fynn Date: 2005-11-03 Subject: Re: New Tibetan Proposals Andrew West wrote: > Chris, > Thanks for providing your comments so swiftly. I appreciate what you > say about the head marks, and think that you have made some valid > observations. > Just to clarify a couple of points: > I understand that there are many ornate forms of head mark in > existence, but this proposal explicitly restricts itself to characters > that are required by scholars of early Tibetan manuscripts. It > deliberately does not cover other forms of head mark, as this beyond > the scope of the proposal. This in no way implies that other proposals > for other forms of head marks may not be needed. I am interested in > early Tibetan texts so I think it is appropriate for me to make a > proposal relating to characters needed to represent these texts; I am > not so interested in the sort of texts which use other forms of head > marks, so I am not planning to write a proposal for them. In the > absence of a consensus amongst the members of TIBEX as to what > additional Tibetan characters are required for encoding, I think it is > probably best for people to make individual proposals relating to > characters that are of particular concern to them. > The term archaic is perhaps a little contentious as the single-line > form of head mark can be seen in more recent texts; but I think that > it is appropriate because in the early manuscripts and monumental > inscriptions the single line form is used exclusively. I went through > hundreds and hundreds of items in the International Dunhuang Project > (IDP) on-line database looking for examples of double tsheg usage, and > although I did not originally intend to propose any new head mark > characters, the ubiquity of this form of the head mark in early > manuscripts convinced me that you could not ignore it or treat it as a > simple glyph variant of 0F04. > BTW, I have notified the Chinese representative on WG2 of these > proposals, and have asked him to seek feedback on them from the > appropriate Tibetan experts. > Best Regards, > > Andrew Andrew It is this very ubiquity or fact that *all* examples of archaic Tibetan texts use single line head marks; that many forms of cursive Tibetan (which have undergone less modifications in form than modern dbu-can ) still use only single line head marks; and that modern dbu-can [almost] always uses double line head marks, only seems to add weight to the argument that the double line form is just the modern glyph variant of the original single line form. At some point there must either have been an orthographic "reform" or maybe some scribe simply started writing the more embellished double line form and (Tibetans loving elaborate embellished things) it became the preferred form - the simpler older form pretty well dropping out of use (at least in dbu-can). Now if the double line form is simply a glyph variant of a simpler older form and conveys no different meaning - do we need to make this distinction in *plain text*? We can already include as many variants as we want in an OpenType font and have these available for selection using a discretionary lookup - or we could create an "old style" Tibetan font which has single line glyphs for U+0F04 & U+0F05. There are many examples of texts where both single and double tsheg characters occur in the same line of text - so there is strong evidence for having both single and double tsheg characters encoded. OTOH we have yet to find an example of a text where both single and double line forms of U+0F04 & U+0F05 occur. IMO the strongest evidence for encoding thee two additional characters would be such an example - a text where *both* the single and the double line forms of head marks occur. - Chris --------------------------------------------------------- From: Christopher Fynn Date: 2005-11-03 Subject: Re: New Tibetan Proposals Andrew West wrote: > On 27/10/05, Robert R. Chilton wrote: >>With regard to the second proposal, I fully agree about the need for >>NYIS TSHEG and I don't see any problem with using this for both the >>archaic and modern (editorial) usages. Yes I don't think there should be any argument about the need for encoding this character as it it is very well attested. Incidentally Csoma de Koršs' Grammar illustrates another type of TSHEG - larger that the usual one. I don't have the book here but I think he calls it TSHEG CHEN or something like that. Have any of you seen other evidence for such a character? >> As for the archaic head marks, I >>do have some question as to whether the "traditional" marks (encoded at >>U+0F04 and U+0F05) are equivalent but somewhat embellished versions of >>the proposed archaic forms. (You may remember that I have raised >>similar questions in past discussions regarding proposals to encode >>variant forms of the terminal SHAD character.) > I agree that these are the most problematic of the proposed > characters, and I thought long and hard about whether to include them > in the proposal or not; but it seemed to me that if you want to encode > the double tsheg then you have to encode them as well. Why? The double tsheg often occurs together with single tsheg in the same document - frequently both are used at different places in the same line of text - so different characters are needed. OTOH, as you point out, in these archaic documents we find *only* the single line form of the head marks - so if someone wants these they could probably just use a font with the single line glyph form for the existing head mark characters. > As you say, the > traditional forms of head mark are just ornate versions of the > proposed "archaic" forms, but scholars dealing with early Tibetan > manuscripts like to be able to represent the exact forms of letters > and marks, as seen by the typeset edition of the Tibetan mss at the > BNP (this was the killer argument in favour of separate encoding for > me). If they want to represent exact forms of letters & marks then they should probably use an archaic *style* Tibetan font. There are after all differences in a number of the letter forms between early dbu-can and contemporary forms of the script. >>On the whole, I do not see much problem with encoding these archaic head >>marks -- aside from the problem of the existing precomposed character >>U+0F07, which you address in your proposal. Something that perhaps needs to be addressed is that in many cursive and semi-cursive forms of Tibetan script - including formal styles used for writing pecha - the head marks are pretty well always written with a single not a double stroke. When we make fonts for these script styles are we to create artificial double stroke head marks to map to the existing U+0F04 & U+0F05 and map the usual (for these styles) single stroke forms to the new characters? - or should we double map the existing head mark characters *and* the proposed new characters to single glyph forms? Or do we have to create new, & artificial for these forms of Tibetan script, double line head marks? > I think that there are probably less problems with encoding these than > with the myriad shad variants that have been discussed previously. I have a copies of a few Tibetan texts back in the UK where variant (less & more elaborate) forms of RGYA GRAM SHAD & SBRUL SHAD occur - and are used in a clearly hierarchical manner to indicate section, sub-section, and sub-sub-section & topic sub-topic and sub-sub-topic. In much the same way as different levels of outline heading are used in modern word-processed documents. Here these different forms convey meaning as they clearly indicate different levels (and nested levels) in the structure of the text. IMO these texts provide a strong case for encoding additional RGYA GRAM SHAD & SBRUL SHAD characters. If we could find a text where both single and double line forms of YIG MGO are similarly used then I believe there would also be a strong argument for encoding the additional YIG MGO characters you propose. > They will be useful to scholars of early Tibetan texts, and can be > safely ignored by most other people. If a distinction between single and double stroke head marks is "useful to scholars of early Tibetan texts" and the characters "can safely be ignored by most other people" surely being able to distinguish between different forms of terminal shad marks would be just as useful (and just as ignorable)? One way of encoding a distinction, and at the same time making it easily ignorable, might be to use specified variation selector characters. Scholars and applications which need to make such distinctions in plain text can use the variation selector characters to do this while the rest can treat the variation selector characters as ignorables. - Chris