    Unihan is designed, first and foremost, to be a _data_ file for
    consumption by software. It doesn't matter at all how many spaces a
    tab is rendered as. The use of tabs makes it trivial to write scripts
    to parse the file with grep, awk, Perl, or Python.
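
    As a rough illustration of how little code the tab-separated layout
    needs, here is a minimal Python sketch. It assumes each data line
    holds three tab-separated fields (code point, field tag, value) and
    that comment lines start with "#". The function name parse_unihan
    and the sample lines are made up for the example.

```python
def parse_unihan(lines):
    """Yield (codepoint, field, value) tuples from Unihan-style lines."""
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and comment lines (assumed to start with '#').
        if not line or line.startswith("#"):
            continue
        # Split on the first two tabs only, so the value keeps any
        # further whitespace it may contain.
        codepoint, field, value = line.split("\t", 2)
        yield codepoint, field, value


# Illustrative sample input, not copied from any particular release.
sample = [
    "# illustrative sample",
    "U+4E00\tkTotalStrokes\t1",
    "U+4E00\tkDefinition\tone; a, an; alone",
]

records = list(parse_unihan(sample))
for codepoint, field, value in records:
    print(codepoint, field, value, sep=" | ")
```

    Because the split is on the tab character itself, the result is the
    same however wide an editor happens to draw the tabs.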

    With regard to the Pinyin orthography: tone numbers make it easier to
    process the readings into initial, final and tone. Replacing the
    numbers with diacritics makes this more difficult, because the tone
    mark then sits on a vowel inside the final and has to be separated
    out first.
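
    To make the point concrete, here is a small Python sketch of that
    split for a reading written with a trailing tone number. It is a
    simplification under stated assumptions: the tone is a single digit
    1-5 (5 standing for the neutral tone), "y" and "w" are treated as
    initials, and the names INITIALS and split_reading are invented for
    the example.

```python
import re

# Two-letter initials come first so "zh" is tried before "z".
# Treating "y" and "w" as initials is a simplifying assumption.
INITIALS = ("zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w")


def split_reading(reading):
    """Split a reading such as 'zhong1' into (initial, final, tone)."""
    match = re.fullmatch(r"([a-zü]+)([1-5])", reading.lower())
    if match is None:
        raise ValueError(f"not a tone-numbered reading: {reading!r}")
    letters, tone = match.group(1), int(match.group(2))
    for initial in INITIALS:
        if letters.startswith(initial):
            return initial, letters[len(initial):], tone
    # No initial consonant, e.g. "an4" or "er2".
    return "", letters, tone


print(split_reading("zhong1"))
print(split_reading("an4"))
```

    With a diacritic form, the same function would first have to map
    each accented vowel back to a base vowel plus a tone before it could
    do anything else.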

