Re: Transforming BidiTest.txt to the format of BidiCharacterTest.txt

From: Markus Scherer <markus.icu_at_gmail.com>
Date: Wed, 12 Feb 2014 11:46:03 -0800

On Wed, Feb 12, 2014 at 11:09 AM, Whistler, Ken <ken.whistler_at_sap.com> wrote:

> Eric,
>
> The C version of the bidiref code does that, in part.
>
> See the function br_ParseFileFormatB in brinput.c.
>
> http://www.unicode.org/Public/PROGRAMS/BidiReferenceC/6.3.0/
>
> It doesn't actually *transform* the BidiTest.txt file to output the
> other format, but it parses the input and then constructs calls into
> the bidi testing API in the same format used when it parses
> BidiCharacterTest.txt. So you could adapt that code, if you want, to
> writing out lines in the format of BidiCharacterTest.txt. The main
> addition you would have to make would be to add a table of characters
> exemplifying each of the bidi classes, so you could map the bidi class
> values from BidiTest.txt back to actual code points to store in
> BidiCharacterTest.txt format.
>

ICU also has test code that parses both files, but it does not transform
either one into the format of the other. The test code exists in both C++
and Java, and I can send you URLs if you are interested. It also includes
sample characters for each Bidi_Class value.
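
To make the idea concrete, here is a minimal sketch in Python of the
transformation Ken describes. It is not taken from bidiref or ICU. The
SAMPLE table holds my own picks of one code point per bidi class, and the
names convert, auto_paragraph_level and PARA_BITS are made up for this
illustration. It assumes the BidiTest.txt bitset uses 1 = auto-LTR,
2 = LTR, 4 = RTL, and that BidiCharacterTest.txt lines have the fields
code points; paragraph direction (0 = LTR, 1 = RTL, 2 = auto-LTR);
resolved paragraph level; resolved levels; visual order. It also treats
each test line as a single paragraph, so inputs containing class B may
need more care.

```python
# One sample code point per Bidi_Class (illustrative picks, not from bidiref/ICU).
SAMPLE = {
    "L": 0x0061, "R": 0x05D0, "AL": 0x0627, "EN": 0x0030, "ES": 0x002B,
    "ET": 0x0024, "AN": 0x0660, "CS": 0x002C, "NSM": 0x0300, "BN": 0x00AD,
    "B": 0x2029, "S": 0x0009, "WS": 0x0020, "ON": 0x0021,
    "LRE": 0x202A, "RLE": 0x202B, "PDF": 0x202C, "LRO": 0x202D, "RLO": 0x202E,
    "LRI": 0x2066, "RLI": 0x2067, "FSI": 0x2068, "PDI": 0x2069,
}

# (BidiTest.txt bitset bit, BidiCharacterTest.txt paragraph direction field)
PARA_BITS = [(1, 2), (2, 0), (4, 1)]


def auto_paragraph_level(classes):
    """Rules P2/P3: the first strong class outside any isolate decides."""
    depth = 0
    for c in classes:
        if c in ("LRI", "RLI", "FSI"):
            depth += 1
        elif c == "PDI":
            if depth > 0:
                depth -= 1
        elif depth == 0 and c in ("L", "R", "AL"):
            return 0 if c == "L" else 1
    return 0


def convert(lines):
    """Yield BidiCharacterTest.txt-style lines for BidiTest.txt-style input."""
    levels = reorder = ""
    for raw in lines:
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        if line.startswith("@Levels:"):
            levels = " ".join(line[len("@Levels:"):].split())
        elif line.startswith("@Reorder:"):
            reorder = " ".join(line[len("@Reorder:"):].split())
        elif line.startswith("@"):
            continue  # ignore any other @ directive
        else:
            data, _, bitset = line.partition(";")
            classes = data.split()
            bits = int(bitset)
            cps = " ".join("%04X" % SAMPLE[c] for c in classes)
            for bit, direction in PARA_BITS:
                if bits & bit:
                    if direction == 2:
                        level = auto_paragraph_level(classes)
                    else:
                        level = direction
                    yield "%s;%d;%d;%s;%s" % (cps, direction, level, levels, reorder)
```

Calling convert(open("BidiTest.txt", encoding="utf-8")) and writing out
the yielded strings would give one output line per paragraph direction
set in each input line's bitset.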

markus

Received on Wed Feb 12 2014 - 13:47:06 CST