L2/05-294 Subject: Proposed revision to UAX 29 Date: Wed, 12 Oct 2005 Source: Deborah Goldsmith It looks like when the Khmer and Myanmar scripts were added, the word boundary rules in UAX 29 were not updated accordingly. I propose the following revision: Current ALetter definition ALetter Alphabetic = true, or U+00A0 ( ) NO-BREAK SPACE (NBSP) or U+05F3 (׳) HEBREW PUNCTUATION GERESH and Ideographic = false and Katakana = false and Script ≠ Thai and Script ≠ Lao and Script ≠ Hiragana and Grapheme_Extend = false Proposed new ALetter definition: ALetter Alphabetic = true, or U+00A0 ( ) NO-BREAK SPACE (NBSP) or U+05F3 (׳) HEBREW PUNCTUATION GERESH and Ideographic = false and Katakana = false and Line_Break ≠ SA and Script ≠ Hiragana and Grapheme_Extend = false By tying the definition of ALetter to the Line_Break property, we can avoid having to update multiple places when we add new characters with the SA Line_Break property. I think this is a better approach than adding Myanmar and Khmer to the explicit enumeration. Deborah Goldsmith Internationalization, Unicode liaison Apple Computer, Inc. goldsmit@apple.com