L2/00-412

DATE: 2000-11-08

 

DOC TYPE:

Expert contribution

TITLE:

Special mathematical operators used in plain-text mathematical expressions

SOURCE:

Murray Sargent III

PROJECT:

 

STATUS:

Proposal

ACTION ID:

FYI

DUE DATE:

--

DISTRIBUTION:

Worldwide

MEDIUM:

Paper and html

NO. OF PAGES:

4


A. Administrative

1. Title

Special mathematical operators used in computational algebra

2. Requester's name

Murray Sargent III

3. Requester type

Expert request.

4. Submission date

07-11-2000

5. Requester’s reference

 

6a. Completion

Complete proposal

6b. More information to be provided?

If requested

 

B. Technical -- General

1a. New script? Name?

No.

1b. Addition of characters to existing block? Name?

Miscellaneous technical

2. Number of characters

5

3. Proposed category

 

4. Proposed level of implementation and rationale

Level 1

5a. Character names included in proposal?

Yes.

5b. Character names in accordance with guidelines?

Yes.

5c. Character shapes reviewable?

 

6a. Who will provide computerized font?

Microsoft Symbol font

6b. Font currently available?

Microsoft Symbol font

6c. Font format?

TrueType

7a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided?

Yes.

7b. Are published examples (such as samples from newspapers, magazines, or other sources) of use of proposed characters attached?

No

8. Does the proposal address other aspects of character data processing?

No

 

C. Technical -- Justification

1. Contact with the user community?

Yes. TeX and PS Technical Word Processor

2. Information on the user community?

Mathematical software

3a. The context of use for the proposed characters?

Used in plain-text encoding of mathematics

3b. Reference

See references at end of this proposal

4a. Proposed characters in current use?

Yes.

4b. Where?

PS Technical Word Processor.

5a. Characters should be encoded entirely in BMP?

Yes.

5b. Rationale

Small number of characters

6. Should characters be kept in a continuous range?

Preferably (easier to parse)

7a. Can the characters be considered a presentation form of an existing character or character sequence?

No, although existing characters like ^ and _ are overloaded to have the same semantics

7b. Where?

TeX

7c. Reference

 

8a. Can any of the characters be considered to be similar (in appearance or function) to an existing character?

Yes

8b. Where?

 

8c. Reference

 

9a. Combining characters or use of composite sequences included?

No

9b. List of composite sequences and their corresponding glyph images provided?

na

10. Characters with any special properties such as control function, etc. included?

No

 

D. SC2/WG2 Administrative

To be completed by SC2/WG2

1. Relevant SC 2/WG 2 document numbers:

 

2. Status (list of meeting number and corresponding action or disposition)

 

3. Additional contact to user communities, liaison organizations etc.

 

4. Assigned category and assigned priority/time frame

 

Other Comments

 

 

Three additional operators allow most mathematical expressions to be encoded in a simple plain-text format.  The first two of these characters are similar in spirit to the Unicode FRACTION SLASH (2044), in that they are used to request built-up display.  Hence I recommend that they follow 2044.

These characters are:

 

                ­              SUPERSCRIPT OPERATOR

                        × (up arrow ® 2191)

 

                ¯              SUBSCRIPT OPERATOR

                        × (down arrow ® 2193)

 

            L          LITERAL OPERATOR

                        × (circled Latin capital letter L ® 24C1)

 

            {          MATHEMATICS ON (symbol in dashed box)

 

            }          MATHEMATICS OFF (symbol in dashed box)

 

The simplest example of subscript/superscript operator usage is to replace TEX’s math use of ^ and _, respectively, which currently have to be “escaped” to be displayed.  The glyphs are created by a superscripted bold up arrow and a subscripted bold down arrow, respectively.  As in  TEX, the subscript and superscript operators allow nested subscripts and superscripts.  In combination with other Unicode characters, these “operator” characters allow most common mathematical expressions to be represented without overloading the use of display characters like ^.  An appropriate rendering algorithm can use them to produce built-up mathematical text.

 

The literal operator is used to reduce an operator character like the FRACTION SLASH and the subscript/superscript operators to be displayed as an ordinary character, instead of in built-up form.

 

The math-on/math-off symbols are analogous to the left-to-right U+200E and right-to-left U+200F bidirectional marks. 

 

For further discussion, please see my articles “Unicode Plain-Text Encoding of Mathematics” in the Proceedings of the Unicode Implementers’ Workshop 6, Santa Clara, CA (Sept. 8-9, 1994) and “Unicode, Rich Text, and Mathematics”, in the Proceedings of the Seventh International Unicode Conference, San Jose, CA (Sept. 14-15, 1995)