Displaying languages of the Indian subcontinent upon the DVB-MHP platform.

From: William Overington (WOverington@ngo.globalnet.co.uk)
Date: Wed Apr 02 2003 - 03:54:31 EST

  • Next message: Michael Everson: "Re: ogonek vs. retroflex hook"

    I have now completed and published my document on the topic of displaying
    languages of the Indian subcontinent upon the DVB-MHP platform. DVB-MHP
    stands for Digital Video Broadcasting - Multimedia Home Platform. Details
    of the DVB-MHP system are available from the http://www,mhp.org webspace.
    There is also the http://forum.mhp.org webspace which may be joined online.
    DVB-MHP is likely to become the common interactive television standard
    throughout much of the world. However, the standard provides many options,
    defining a minimum system and leaving open many options for implementation
    decisions.

    The document which I have recently completed is published in the DigitalTV
    forum in the http://www.cenelec.org webspace. This forum is a specialist
    forum regarding the implementation of interactive television, using the
    DVB-MHP system, within the European Union, involving such issues as
    interoperability. Readers interested in joining this forum may like to know
    that an email address for making application is helpdesk@cenelec.org and
    that there is also at present a notification about the purpose of the forum
    available using a link in the http://www.cenelec.org webspace. The fact of
    membership is visible to other participants. Although not mandatory, it is
    quite likely that what is decided for use within the European Union will be
    used in many countries which are not within the European Union.

    The file has the following name, in accordance with the file naming
    conventions of the forum.

    DigitalTV_WJGO0005_Languages_of_the_Indian_subcontinent.txt

    A transcript of the text of the document is below.

    William Overington

    2 April 2003

    ----
    Displaying languages of the Indian subcontinent upon the DVB-MHP platform.
    I wonder if I may please draw your attention to a potential problem with the
    displaying of the languages of the Indian subcontinent upon the screens of
    DVB-MHP interactive televisions.
    The DVB-MHP system uses Unicode.  The DVB-MHP system also uses a Portable
    Font Resource PFR0 font.
    I am not a linguist so I am simply mentioning the following document.
    http://www.unicode.org/book/ch09.pdf
    It is Chapter 9 of The Online Edition of The Unicode Standard, Version 3.0,
    the chapter being entitled "South and Southeast Asian Scripts".
    It appears, as far as I can tell, that a PFR0 font cannot display the
    languages of the Indian subcontinent directly from a sequence of Unicode
    characters.
    The Online Edition of The Unicode Standard can be downloaded from the
    following web page, chapter by chapter.
    http://www.unicode.org/book/u2.html
    The main index page of the Unicode web site is as follows.
    http://www.unicode.org
    I have thought out what I consider to be a way to solve the problem of
    displaying the languages of the Indian subcontinent using software within a
    Java program running upon the DVB-MHP platform.
    The method is described in the following document.
    http://www.users.globalnet.co.uk/~ngo/ast03300.htm
    The method uses what I have called a eutocode typography file.
    However, in order for the method to be highly effective for the DVB-MHP
    platform at an interoperability level, what is really needed is a
    standardized list of glyphs for displaying the languages of the Indian
    subcontinent so that those glyphs may be mapped to U+EC00 onwards of the
    Private Use Area of Unicode.  This would not be essential, yet is, I feel,
    highly desirable, because if such a list can be produced and the same list
    used by all content authors who produce content for broadcasting upon the
    DVB-MHP platform using languages of the Indian subcontinent, then lots of
    repeated work can be avoided in the future and there will be advantages for
    interoperability of font generation.  For the avoidance of doubt, please
    know that I am not suggesting that those Private Use Area code points be
    used for broadcasting text.  Text would be broadcast using regular Unicode
    code points.  The reason for assigning the glyphs to code points is so that
    the incoming text stream can be converted into a local, within the
    television set, text stream which can be used to access the PFR0 font so as
    to enable the correct glyphs to be displayed upon the screen.  Study of the
    document above will show that that particular choice of Private Use Area
    code points could also protect against any broadcasting of the languages of
    the Indian subcontinent using those Private Use Area codes to access the
    glyphs directly as those code points when broadcast could be regarded as
    data for a vector graphics system which could be used for drawing
    illustrations within a document.  The vector graphics data does not need to
    access the font, so the locations in the font can be used for this purpose
    on a local, within the television set basis.
    If such a list can be produced within the context of the setting of content
    authoring guidelines which are to be produced for the European Union, with
    appropriate liaison with the government of India, then the task can be
    carried out once within a standardization context, then the list can be used
    permanently by any content author who wishes to use it.
    I do not have the linguistic knowledge to prepare such a list myself, yet I
    do feel that such a list is important for the future for the application of
    the DVB-MHP platform and ask that this matter be added to the agenda of
    formal meetings please.
    William Overington
    29 March 2003
    


    This archive was generated by hypermail 2.1.5 : Wed Apr 02 2003 - 05:10:16 EST