Bill Shui wrote:
>
> Hi,
>
> I am writing an application in C/C++, and I am trying to support XML range of
> unicode in my lexer.
>
> the unicode support in my application is at a database level and the database
> is a quick and dirty implementation that I came up with myself.
>
> I need to test that my lexer and my programs ability to handle unicode,
> for example,
>
> if (a == b)
>
> and both a and b are strings in unicode.
>
> how should I go about testing it?
You have to make sure both strings are in the same normalization form.
Then you can test for equality. There is canonical and compatibility
equivalence depending on which normalization form you use. Please read:
http://www.unicode.org/unicode/reports/tr15/ for more info.
Regards,
Theo
This archive was generated by hypermail 2.1.2 : Mon Jul 29 2002 - 01:17:26 EDT