L2/07-266 Source: Mark Davis Date: August 8, 2007 Subject: Extended Grapheme Cluster Rules The default grapheme cluster rules do not include all combining character sequences, and also break Thai clusters. I propose we add a new extended grapheme cluster segment (XGraphemeClusterBreak) that is extended to handle that. It is the same as GraphemeClusterBreak except adding: Variables: Extend = Grapheme_Extend = true OR gc=Mc OR 0E30 ( ะ ) THAI CHARACTER SARA A 0E32 ( า ) THAI CHARACTER SARA AA 0E33 ( ำ ) THAI CHARACTER SARA AM 0E45 ( ๅ ) THAI CHARACTER LAKKHANGYAO Prepend = [\u0E40-\u0E44] PrependBase = [\u0E01-\\u0E2E] Rules GB9a. Prepend × PrependBase [In the future we can add Lao characters ] -- Mark