Re: Proposal for German capital letter "ß"

From: Hans Meiser <brille1_at_hotmail.com>
Date: Wed, 9 Dec 2015 19:55:24 +0000

I see.

Yet, the u+1E9E doesn't quite look like two capital "S". So any program implementing a conversion conforming to Unicode will currently display/print in a wrong result: "MAßE" instead of the correctly converted result "MASSE". Both would be correctly encoded as u+004D u+0041 u+1E9E u+0045. Yet, AFAIK, the current glyph would currently be considered an error.

Proposal: Shouldn't the glyph be amended to match the natural language?

Cheers,
Axel

________________________________
From: Dreiheller, Albrecht <albrecht.dreiheller_at_siemens.com>
Sent: Wednesday, December 9, 2015 4:59 PM
To: Hans Meiser; unicode_at_unicode.org
Subject: AW: Proposal for German capital letter "ß"

Just have a look at

U+1E9E LATIN CAPITAL LETTER SHARP S

in the block Latin Extended Additional

http://www.unicode.org/charts/PDF/U1E00.pdf

Latin Extended Additional
Latin Extended Additional Range: 1E00 1EFF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 8.0
Read more...<http://www.unicode.org/charts/PDF/U1E00.pdf>

Kind regards

Von: Unicode [mailto:unicode-bounces_at_unicode.org] Im Auftrag von Hans Meiser
Gesendet: Mittwoch, 9. Dezember 2015 13:26
An: unicode_at_unicode.org
Betreff: Proposal for German capital letter "ß"

Currently there is a vast problem trying to determine the lower case equivalent of a capitalized German word like "MASSE".

This is due to the fact that an orthographic rule exists to convert lower case letter "ß" to upper case letters "SS". So after converting a word from lower case to upper case one cannot unequivocally determine the original lower case word because the conversion is only surjective.

This issue exists because the letter "ß" originally was but a ligature of the small letter "sz" (using a legacy German font) which over time became a ligature of "ss".

After the German spelling reform in 1996, "ß" then became a letter of its own, and words containing the letter "ß" are no longer equivalent to words containing an "ss" combination instead of the "ß". So, for instance, "Maße" and "Masse" are not equal. In fact, "Maße" translates to "measurements" while "Masse" translates to "weight".

This is a particular problem in electronic data processing - like, for instance, SQL data queries. Given above rule, "Maße" will become "MASSE", just like "Masse" becomes "MASSE" when converting a word to uppercase. But there is no way back to distinguish one from the other.

I read that the UNICODE group is already striving for a solution to this problem and that they are searching for a capital letter equivalent of "ß".

My proposal is to introduce a capital letter equivalent of "ß" that's resembling two capital "S" letters: "SS".

So the capital letter equivalent of "ß" would look like "SS" but was in fact a separate code point. Converting words from lower case to upper case and back will then become bijective, auto correction will become easier and the (false) ANSI SQL stopgap of declaring "ß" and "ss" to be equal can be dropped.

Your feedback is appreciated.

Axel Dahmen - Germany
Received on Wed Dec 09 2015 - 13:56:23 CST

This archive was generated by hypermail 2.2.0 : Wed Dec 09 2015 - 13:56:23 CST