From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Mon Aug 25 2003 - 04:13:24 EDT
Peter Kirk wrote:
> Well, the situation with Hebrew sof pasuq is almost identical to that
> for Greek and Arabic question marks, except that it is functionally a
> full stop not a question mark, so I can't see any reason other than
> prejudice for omitting it from the list.
Well, I had a much better reason than prejudice: ignorance. :-)
That's why I told that my two lists were tentative and asked for comments.
> [...] - maqaf is very much the cultural equivalent of hyphen, and I
> have seen recent discussion about whether the hyphen key on a
> Hebrew keyboard ought actually to generate a maqaf. [...]
> I'm not talking about biblical Hebrew here, I'm talking about
> a living modern language. [...]
And these are exactly the kinds of reasons that I was looking for: any
punctuation used in modern text and/or available on keyboards are what good
candidates for inclusion in Pattern_Syntax (i.e., exclusion from usage in
identifiers).
> [...] extend the list to include all punctuation and to allow as
> yet undefined characters to be added to it.
Well, the requirement for an invariable set seems to be part of the "rules
of the game" with this UTR, so I'll stick to it.
I guess that this requirement is due to backward compatibility issues. If
version X of a certain programming language accepts identifier "foo:bar"
(where ":" is a certain mark), it is not acceptable that version X+1 of the
same language treats the same sequence as a syntax error: that would make
existing source code in that language potentially unusable.
Obviously, it must be possible to customize the default sets: most
*existing* computer languages already allow in identifiers characters that
would be excluded by UTR#31 (e.g.: <_> <-> <$> <'>).
_ Marco
This archive was generated by hypermail 2.1.5 : Mon Aug 25 2003 - 05:11:16 EDT