Re: Transforming BidiTest.txt to the format of BidiCharacterTest.txt

From: Markus Scherer <>
Date: Wed, 12 Feb 2014 11:46:03 -0800

On Wed, Feb 12, 2014 at 11:09 AM, Whistler, Ken <> wrote:

> Eric,
> The C version of the bidiref code does that, in part.
> See the function br_ParseFileFormatB in brinput.c.
> It doesn't actually *transform* the BidiTest.txt file to output the other
> format, but it parses the input and then constructs calls into the bidi
> testing API in the same format used when it parses BidiCharacterTest.txt.
> So you could adapt that code, if you want, to write out lines in the
> format of BidiCharacterTest.txt. The main addition you would have to make
> would be to add a table of characters exemplifying each of the bidi
> classes, so you could map the bidi class values from BidiTest.txt back to
> actual code points to store in BidiCharacterTest.txt format.

ICU also has test code that parses both files, but it does not transform
either one into the format of the other. We have both C++ and Java, and I
can send you URLs if you are interested. There are also sample characters
per Bidi_Class.
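
For illustration only, here is a rough Python sketch of the kind of
transformation discussed above. It is not taken from bidiref or ICU. The
SAMPLE table, the function names, and the choice of one code point per
Bidi_Class are all assumptions made for this example. It also assumes the
documented layouts of the two files: BidiTest.txt data lines of the form
"classes; bitset" (bit 1 = auto-LTR, 2 = LTR, 4 = RTL) governed by the
preceding @Levels and @Reorder lines, and BidiCharacterTest.txt lines of the
form "code points;direction;paragraph level;levels;reorder". Since
BidiTest.txt does not record the resolved paragraph level, the sketch derives
it for the auto case from rules P2 and P3, looking only at the first paragraph.

```python
# Hypothetical table: one sample code point per Bidi_Class value.
SAMPLE = {
    "L": 0x0041, "R": 0x05D0, "AL": 0x0627, "EN": 0x0030, "ES": 0x002B,
    "ET": 0x0023, "AN": 0x0660, "CS": 0x002C, "NSM": 0x0300, "BN": 0x00AD,
    "B": 0x2029, "S": 0x0009, "WS": 0x0020, "ON": 0x0021,
    "LRE": 0x202A, "RLE": 0x202B, "PDF": 0x202C, "LRO": 0x202D,
    "RLO": 0x202E, "LRI": 0x2066, "RLI": 0x2067, "FSI": 0x2068,
    "PDI": 0x2069,
}

ISOLATE_INITIATORS = {"LRI", "RLI", "FSI"}


def auto_paragraph_level(classes):
    """Rules P2/P3: the first strong class outside isolates decides."""
    depth = 0
    for c in classes:
        if c == "B":                 # end of the first paragraph
            break
        if c in ISOLATE_INITIATORS:
            depth += 1
        elif c == "PDI":
            if depth > 0:            # an unmatched PDI is ignored
                depth -= 1
        elif depth == 0:
            if c == "L":
                return 0
            if c in ("R", "AL"):
                return 1
    return 0                         # no strong class found: default to LTR


def convert(lines):
    """Yield BidiCharacterTest.txt-style lines from BidiTest.txt lines."""
    levels = reorder = ""
    for raw in lines:
        line = raw.split("#", 1)[0].strip()   # drop comments
        if not line:
            continue
        if line.startswith("@Levels:"):
            levels = " ".join(line[len("@Levels:"):].split())
        elif line.startswith("@Reorder:"):
            reorder = " ".join(line[len("@Reorder:"):].split())
        elif line.startswith("@"):
            continue                          # other directives are skipped
        else:
            class_field, bitset_field = line.split(";")
            classes = class_field.split()
            bitset = int(bitset_field, 16)
            cps = " ".join(f"{SAMPLE[c]:04X}" for c in classes)
            # (bit in BidiTest.txt, direction code in BidiCharacterTest.txt)
            for bit, direction in ((1, 2), (2, 0), (4, 1)):
                if bitset & bit:
                    para = (auto_paragraph_level(classes)
                            if direction == 2 else direction)
                    yield f"{cps};{direction};{para};{levels};{reorder}"


if __name__ == "__main__":
    demo = ["@Levels: 0 1", "@Reorder: 0 1", "L R; 3"]
    for out in convert(demo):
        print(out)
```

Each data line expands into one output line per bit set in the bitset, so the
output is larger than the input. Tests that depend on specific characters,
such as paired brackets, cannot be produced this way because BidiTest.txt
only carries class values.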


Unicode mailing list
Received on Wed Feb 12 2014 - 13:47:06 CST
