From rick@unicode.org Mon Oct 1 14:01:04 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Mon, 01 Oct 2007 14:05:43 -0600 (CST) Received: from izanami (c-71-202-247-55.hsd1.ca.comcast.net [71.202.247.55]) by unicode.org (8.13.4/8.12.11) with SMTP id l91K0pKs021753; Mon, 1 Oct 2007 14:00:51 -0600 Message-Id: <200710012000.l91K0pKs021753@unicode.org> To: unicode@unicode.org Subject: Unicode server upgrade this week Date: Mon, 1 Oct 2007 13:00:53 -0700 From: Rick McGowan received: by Apple.Mailer (2.95.2) X-archive-position: 261 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: rick@unicode.org Precedence: bulk X-list: cldr-users This is to notify you that the Unicode.org server will be upgraded this week. The upgrade involves moving all of our web data and mail lists, and updating various pieces of software. The web server will probably go off-line on Thursday morning (California time) and probably come back up sometime on Friday. The Unicode mail lists, and other lists on this server, will be shut down probably on Wednesday morning, and probably brought back up again on Friday, if there are not too many problems. I will send a final warning note to the Unicode mail list shortly before the mail lists are actually taken out of service. Once you see that note, please don't expect any mail to get through until you receive the notification that the lists are once again functional. Regards, Rick McGowan Unicode, Inc. 
From rick@unicode.org Tue Oct 2 15:04:20 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Tue, 02 Oct 2007 15:05:11 -0600 (CST) Received: from izanami (c-71-202-247-55.hsd1.ca.comcast.net [71.202.247.55]) by unicode.org (8.13.4/8.12.11) with SMTP id l92L4FCm015069; Tue, 2 Oct 2007 15:04:19 -0600 Message-Id: <200710022104.l92L4FCm015069@unicode.org> To: rick@unicode.org Subject: Unicode.org - mail list shutdown for system upgrade Date: Tue, 2 Oct 2007 14:04:17 -0700 From: Rick McGowan received: by Apple.Mailer (2.95.2) X-archive-position: 262 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: rick@unicode.org Precedence: bulk X-list: cldr-users The mail lists on Unicode.org will be shut down soon today for system upgrade. They will probably be functioning again on Thursday this week. Regards, Rick McGowan Unicode, Inc. From rick@unicode.org Tue Oct 9 19:14:21 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Tue, 09 Oct 2007 19:18:26 -0500 (CDT) Received: from izanami (c-71-202-247-55.hsd1.ca.comcast.net [71.202.247.55]) by unicode.org (8.12.11/8.12.11) with SMTP id l9A0EGfH027905; Tue, 9 Oct 2007 19:14:17 -0500 Message-Id: <200710100014.l9A0EGfH027905@unicode.org> To: unicode@unicode.org Subject: Public Review Issue Update - UTS #10: Unicode Collation Algorithm Date: Tue, 9 Oct 2007 17:14:17 -0700 From: Rick McGowan received: by Apple.Mailer (2.95.2) X-archive-position: 263 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: rick@unicode.org Precedence: bulk X-list: cldr-users The document for Public Review Issue #113, UTS #10: Unicode Collation Algorithm has been updated to second draft. 
The report is available here: http://www.unicode.org/reports/tr10/tr10-17.html The second draft of the proposed update for 5.1 includes the following changes: * Clarified use of contractions in Section 3.2 Default Unicode Collation Element Table and Section 3.1.1.2 Contractions * Added information about the use of parameterization (Section 5.1 Parametric Tailoring), * Added Section 8.1 Collation Folding * In Section 8 Searching and Matching, added new introduction and explained special cases. Due date for comments to the current draft is: 2007/10/16. Regards, Rick McGowan Unicode, Inc. From srl@icu-project.org Tue Oct 9 21:43:17 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Tue, 09 Oct 2007 21:43:17 -0500 (CDT) Received: from v.icu-project.org (v.icu-project.org [161.58.210.87]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9A2hHZn019283 for ; Tue, 9 Oct 2007 21:43:17 -0500 Received: from monkey.sbay.org ([216.27.178.44] helo=[10.0.0.119]) by v.icu-project.org with esmtpa (Exim 4.63 (FreeBSD)) (envelope-from ) id 1IfRXZ-0008bk-56 for cldr-users@unicode.org; Wed, 10 Oct 2007 02:43:17 +0000 Mime-Version: 1.0 (Apple Message framework v752.3) Content-Transfer-Encoding: 7bit Message-Id: Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed To: cldr-users@unicode.org From: "Steven R. 
Loomis" Subject: FYI: survey tool (closed) and cldr utilities are back up Date: Tue, 9 Oct 2007 19:43:12 -0700 X-Mailer: Apple Mail (2.752.3) X-archive-position: 264 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: srl@icu-project.org Precedence: bulk X-list: cldr-users CLDR survey tool ( still closed ) is back up and running, as are the cldr java utilities http://unicode.org/cldr/utility/ From rick@unicode.org Wed Oct 10 13:44:42 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Wed, 10 Oct 2007 13:48:06 -0500 (CDT) Received: from izanami (c-71-202-247-55.hsd1.ca.comcast.net [71.202.247.55]) by unicode.org (8.12.11/8.12.11) with SMTP id l9AIiTMs009073; Wed, 10 Oct 2007 13:44:33 -0500 Message-Id: <200710101844.l9AIiTMs009073@unicode.org> To: unicode@unicode.org Subject: New FAQ page "Display of Unsupported Characters" Date: Wed, 10 Oct 2007 11:44:29 -0700 From: Rick McGowan received: by Apple.Mailer (2.95.2) X-archive-position: 265 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: rick@unicode.org Precedence: bulk X-list: cldr-users There is a new FAQ page on "Display of Unsupported Characters" available on the Unicode web site. 
Please see: http://www.unicode.org/faq/unsup_char.html Regards, Rick From dzo@bisharat.net Sat Oct 13 22:08:50 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sat, 13 Oct 2007 22:08:50 -0500 (CDT) Received: from kabissa.org (113166.kabissa.org [72.32.199.201]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9E38ook010044 for ; Sat, 13 Oct 2007 22:08:50 -0500 Received: (qmail 20276 invoked from network); 13 Oct 2007 22:07:09 -0500 Received: from pool-71-252-88-25.washdc.east.verizon.net (HELO IBM92AA25595C4) (71.252.88.25) by 72.32.229.137 with SMTP; 13 Oct 2007 22:07:08 -0500 From: "Don Osborn" To: "'CLDR list'" , Subject: Locales with language only Date: Sat, 13 Oct 2007 23:07:04 -0400 Message-ID: <006301c80e0f$4ce84d50$e6b8e7f0$@net> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_0064_01C80DED.C5D6AD50" X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcgOD0u5BOdZXcJuSh+6EGlpaS4urw== Content-Language: en-us X-archive-position: 266 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: dzo@bisharat.net Precedence: bulk X-list: cldr-users This is a multipart message in MIME format. ------=_NextPart_000_0064_01C80DED.C5D6AD50 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Under what circumstances can a locale be filed with language only (no country)? I noticed in an article by Bill Hall ("New Internationalization Features of Microsoft Vista" Internationalization: Getting Started (insert), MultiLingual #87 April/May 2007) a mention of a language-only locale for Persian in Vista. Thanks for any info. Don
------=_NextPart_000_0064_01C80DED.C5D6AD50-- From srl@icu-project.org Sat Oct 13 22:39:14 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sat, 13 Oct 2007 22:39:14 -0500 (CDT) Received: from v.icu-project.org (v.icu-project.org [161.58.210.87]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9E3dDIf032418 for ; Sat, 13 Oct 2007 22:39:14 -0500 Received: from monkey.sbay.org ([216.27.178.44] helo=[10.0.0.119]) by v.icu-project.org with esmtpa (Exim 4.63 (FreeBSD)) (envelope-from ) id 1IguJi-0002BM-DS; Sun, 14 Oct 2007 03:39:02 +0000 In-Reply-To: <006301c80e0f$4ce84d50$e6b8e7f0$@net> References: <006301c80e0f$4ce84d50$e6b8e7f0$@net> Mime-Version: 1.0 (Apple Message framework v752.3) Content-Type: text/plain; charset=UTF-8; delsp=yes; format=flowed Message-Id: Cc: cldr-users@unicode.org From: "Steven R. Loomis" Subject: Re: Locales with language only Date: Sat, 13 Oct 2007 20:38:59 -0700 To: "Don Osborn" X-Mailer: Apple Mail (2.752.3) Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by unicode.org id l9E3dDIf032418 X-archive-position: 267 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: srl@icu-project.org Precedence: bulk X-list: cldr-users The LDML format used by CLDR uses a hierarchical data model: http://www.unicode.org/reports/tr35/#Locale_Inheritance This is distinguished from POSIX and other systems where the language and territory are always required. In this model, 'root' is the parent of 'fa' (Persian), which is the parent of 'fa_IR' (Persian, Iran) and 'fa_AF' (Persian, Afghanistan). In practice, the language locale 'fa' is chosen to hold the default content of a certain sublocale (more at http://www.unicode.org/reports/tr35/#Appendix_Supplemental_Metadata under P.3 default content); in this case fa_IR is the default content for 'fa'. So fa_IR itself is empty; all of the contents are in 'fa'. 
fa_AF has the Afghanistan-specific overrides (if you will) relative to fa. This is mainly for ease of maintenance. Also, if you were to request fa_US, say, it would start out already having valid data for at least one real sublocale (fa_IR in this case), and you would only have to add the US-specific overrides. If you scroll down to Number Patterns in http://demo.icu-project.org/icu-bin/locexp?_=fa_US (based on CLDR data) you can see it's really trying: "−۱٬۲۳۴٫۵۷‬ US$" Farsi digits, and the US default currency (USD) was picked up. If there were a specific format for USD in Persian, that would have been used: http://unicode.org/cldr/apps/survey?_=fa&forum=fa&xpath=84202 But to answer your question more directly: if an application only sets the language part, you will get some behavior, but it will not have all the necessary information. Using the currency example again, with just "fa" you don't really know what currency to use. Is it the Iranian Rial? Also, you won't be able to determine time zone information as accurately without knowing the territory. Also, some languages like eo (Esperanto) don't have a territory assigned to them currently. So the answer is yes, you can have a locale with the language only and without the territory. But your application is severely limited in providing correct locale data if it does not know the user's territory as well. I hope I have answered your question. -s On 13 Oct 2007, at 20:07, Don Osborn wrote: > Under what circumstances can a locale be filed with language only > (no country)? > > I noticed in an article by Bill Hall ("New Internationalization > Features of Microsoft Vista" Internationalization: Getting Started > (insert), MultiLingual #87 April/May 2007) mention of a > language-only locale for Persian in Vista. > > Thanks for any info. 
> > Don From cfynn@gmx.net Sun Oct 14 06:37:26 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sun, 14 Oct 2007 06:37:26 -0500 (CDT) Received: from mail.gmx.net (mail.gmx.net [213.165.64.20]) by unicode.org (8.12.11/8.12.11) with SMTP id l9EBbPeV007104 for ; Sun, 14 Oct 2007 06:37:25 -0500 Received: (qmail invoked by alias); 14 Oct 2007 11:37:18 -0000 Received: from cust71.fastlink.bt (EHLO [127.0.0.1]) [202.89.26.71] by mail.gmx.net (mp036) with SMTP; 14 Oct 2007 13:37:18 +0200 X-Authenticated: #9568751 X-Provags-ID: V01U2FsdGVkX18ICxY5pYgI1190e4F+bT4pbKv7WxOFNTwny4lNpL Jiuqw28EBVEP8E Message-ID: <4711FF62.40206@gmx.net> Date: Sun, 14 Oct 2007 17:37:06 +0600 From: Christopher Fynn User-Agent: Thunderbird 2.0.0.6 (Windows/20070728) MIME-Version: 1.0 To: "Steven R. Loomis" , cldr-users@unicode.org Subject: Re: Locales with language only References: <006301c80e0f$4ce84d50$e6b8e7f0$@net> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: avast! (VPS 000781-1, 14/10/2007), Outbound message X-Antivirus-Status: Clean X-Y-GMX-Trusted: 0 X-archive-position: 268 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: cfynn@gmx.net Precedence: bulk X-list: cldr-users Steven R. Loomis wrote: > The LDML format used by CLDR uses a hierarchical data model - > http://www.unicode.org/reports/tr35/#Locale_Inheritance > This is distinguished from POSIX and other systems where the language > and territory are always required. > Therefore, 'root' is the parent of 'fa' (Persian) which is the parent > of 'fa_IR' (Persian, Iran) and 'fa_AF' (Persian, Afghanistan). > > In practice, the language locale 'fa' is decided to be the default > content for a certain sublocale, > ( more at > http://www.unicode.org/reports/tr35/#Appendix_Supplemental_Metadata > under P.3 default content ) ... 
Steven, this leads me to wonder: how does one make locales for languages with multiple scripts? For example, Sanskrit, although most commonly written in Devanagari, is sometimes written in almost all the scripts of India (and several other Indic scripts as well). Similarly, Pali is written in almost all the scripts of countries where Theravada Buddhism predominates, including Sri Lanka, Thailand, Burma, Cambodia and Laos, as well as in Devanagari. In fact, it would be very hard to decide which should be the default script for that language. Balti may be written in Tibetan or Arabic script, Mongolian in Mongolian or Cyrillic, and there must be quite a number of other examples. - Chris From eik@iki.fi Sun Oct 14 07:23:04 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sun, 14 Oct 2007 07:23:04 -0500 (CDT) Received: from smtp5.pp.htv.fi (smtp5.pp.htv.fi [213.243.153.39]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9ECN40A029407 for ; Sun, 14 Oct 2007 07:23:04 -0500 Received: from Raahattava (cs181253188.pp.htv.fi [82.181.253.188]) by smtp5.pp.htv.fi (Postfix) with ESMTP id B154A5BC175; Sun, 14 Oct 2007 15:23:02 +0300 (EEST) From: "Erkki I. Kolehmainen" To: "'Christopher Fynn'" , "'Steven R. 
Loomis'" , Subject: RE: Locales with language only Date: Sun, 14 Oct 2007 15:22:59 +0300 Message-ID: <000001c80e5c$f58fc800$0300a8c0@Raahattava> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.6626 Importance: Normal In-Reply-To: <4711FF62.40206@gmx.net> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198 Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by unicode.org id l9ECN40A029407 X-archive-position: 269 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: eik@iki.fi Precedence: bulk X-list: cldr-users Chris, There are several languages in CLDR 1.5 with locales in multiple scripts; the current maximum is three, for Uzbek, although there is no set limit. Technically, the choice of the default does not affect the use of the locales. Erkki I. Kolehmainen Tilkankatu 12 A 3, FI-00300 Helsinki, Finland Puh. (09) 4368 2643, 0400 825 943; Tel. +358 9 4368 2643, +358 400 825 943 -----Original Message----- From: cldr-users-bounce@unicode.org [mailto:cldr-users-bounce@unicode.org] On Behalf Of Christopher Fynn Sent: 14 October 2007 14:37 To: Steven R. Loomis; cldr-users@unicode.org Subject: Re: Locales with language only Steven R. Loomis wrote: > The LDML format used by CLDR uses a hierarchical data model - > http://www.unicode.org/reports/tr35/#Locale_Inheritance > This is distinguished from POSIX and other systems where the language > and territory are always required. > Therefore, 'root' is the parent of 'fa' (Persian) which is the parent > of 'fa_IR' (Persian, Iran) and 'fa_AF' (Persian, Afghanistan). > > In practice, the language locale 'fa' is decided to be the default > content for a certain sublocale, > ( more at > http://www.unicode.org/reports/tr35/#Appendix_Supplemental_Metadata > under P.3 default content ) ... 
Steven, this leads me to wonder: how does one make locales for languages with multiple scripts? For example, Sanskrit, although most commonly written in Devanagari, is sometimes written in almost all the scripts of India (and several other Indic scripts as well). Similarly, Pali is written in almost all the scripts of countries where Theravada Buddhism predominates, including Sri Lanka, Thailand, Burma, Cambodia and Laos, as well as in Devanagari. In fact, it would be very hard to decide which should be the default script for that language. Balti may be written in Tibetan or Arabic script, Mongolian in Mongolian or Cyrillic, and there must be quite a number of other examples. - Chris From verdy_p@wanadoo.fr Sun Oct 14 13:06:24 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sun, 14 Oct 2007 13:06:24 -0500 (CDT) Received: from smtp25.orange.fr (smtp25.orange.fr [193.252.22.23]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9EI6NgA016610 for ; Sun, 14 Oct 2007 13:06:24 -0500 Received: from me-wanadoo.net (localhost [127.0.0.1]) by mwinf2508.orange.fr (SMTP Server) with ESMTP id 1B41A1C00094 for ; Sun, 14 Oct 2007 20:06:18 +0200 (CEST) Received: from HARNON (unknown [90.50.233.208]) by mwinf2508.orange.fr (SMTP Server) with ESMTP id AE31E1C00092; Sun, 14 Oct 2007 20:06:17 +0200 (CEST) X-ME-UUID: 20071014180617713.AE31E1C00092@mwinf2508.orange.fr Reply-To: From: "Philippe Verdy" To: "'Erkki I. Kolehmainen'" , "'Christopher Fynn'" , "'Steven R. 
Loomis'" , References: <4711FF62.40206@gmx.net> <000001c80e5c$f58fc800$0300a8c0@Raahattava> Subject: RE: Locales with language only Date: Sun, 14 Oct 2007 20:06:06 +0200 Organization: Ordinateur Personnel Message-ID: <00fb01c80e8c$e4959e00$0a01a8c0@rodage.dyndns.org> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 11 In-Reply-To: <000001c80e5c$f58fc800$0300a8c0@Raahattava> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198 Thread-Index: AcgOXrqTE3nboZGPQxq458xS/VfRLgALdlvQ X-archive-position: 270 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: verdy_p@wanadoo.fr Precedence: bulk X-list: cldr-users Erkki I. Kolehmainen wrote: > There are several language locales in CLDR 1.5 with multiple scripts, > although the current maximum is three (no set limit) for Uzbek. Isn't Aramaic written natively with five scripts (at least)? Also Kashmiri is also found with three scripts. From eik@iki.fi Sun Oct 14 16:05:35 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Sun, 14 Oct 2007 16:05:35 -0500 (CDT) Received: from smtp5.pp.htv.fi (smtp5.pp.htv.fi [213.243.153.39]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9EL5YsC023773 for ; Sun, 14 Oct 2007 16:05:35 -0500 Received: from Raahattava (cs181253188.pp.htv.fi [82.181.253.188]) by smtp5.pp.htv.fi (Postfix) with ESMTP id 9B4875BC08D; Mon, 15 Oct 2007 00:05:33 +0300 (EEST) From: "Erkki I. Kolehmainen" To: , "'Christopher Fynn'" , "'Steven R. 
Loomis'" , Subject: Re: Locales with language only Date: Mon, 15 Oct 2007 00:05:33 +0300 Message-ID: <001a01c80ea5$f5c9b990$0300a8c0@Raahattava> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.6626 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198 In-Reply-To: <00fb01c80e8c$e4959e00$0a01a8c0@rodage.dyndns.org> Importance: Normal X-archive-position: 271 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: eik@iki.fi Precedence: bulk X-list: cldr-users New data for additional locales can and should be added in the forthcoming CLDR 1.6 and later releases. This new data, however, needs to be submitted and vetted, which requires considerable expertise and work. Erkki I. Kolehmainen -----Original Message----- From: Philippe Verdy [mailto:verdy_p@wanadoo.fr] Sent: 14 October 2007 21:06 To: 'Erkki I. Kolehmainen'; 'Christopher Fynn'; 'Steven R. Loomis'; cldr-users@unicode.org Subject: RE: Locales with language only Erkki I. Kolehmainen wrote: > There are several language locales in CLDR 1.5 with multiple scripts, > although the current maximum is three (no set limit) for Uzbek. Isn't Aramaic written natively with five scripts (at least)? Also Kashmiri is also found with three scripts. 
From srl@icu-project.org Mon Oct 15 12:08:14 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Mon, 15 Oct 2007 12:08:14 -0500 (CDT) Received: from v.icu-project.org (v.icu-project.org [161.58.210.87]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9FH8D6i023603 for ; Mon, 15 Oct 2007 12:08:14 -0500 Received: from monkey.sbay.org ([216.27.178.44] helo=[10.0.0.119]) by v.icu-project.org with esmtpa (Exim 4.63 (FreeBSD)) (envelope-from ) id 1IhTQ2-000AL7-2h; Mon, 15 Oct 2007 17:07:54 +0000 In-Reply-To: <00fb01c80e8c$e4959e00$0a01a8c0@rodage.dyndns.org> References: <4711FF62.40206@gmx.net> <000001c80e5c$f58fc800$0300a8c0@Raahattava> <00fb01c80e8c$e4959e00$0a01a8c0@rodage.dyndns.org> Mime-Version: 1.0 (Apple Message framework v752.3) Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: <68FCA443-F504-464F-A942-DEB37A9EDA69@icu-project.org> Cc: "Erkki I. Kolehmainen" , Christopher Fynn , cldr-users@unicode.org, Don Osborn Content-Transfer-Encoding: 7bit From: "Steven R. Loomis" Subject: Re: Locales with language only Date: Mon, 15 Oct 2007 10:07:54 -0700 To: X-Mailer: Apple Mail (2.752.3) X-archive-position: 272 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: srl@icu-project.org Precedence: bulk X-list: cldr-users Just to clarify: Erkki is saying that CLDR places no limit on the number of scripts for a language. He is simply saying that the current maximum observed in the data is three. If Aramaic needs five scripts then it can have five scripts. -s On 14 Oct 2007, at 11:06, Philippe Verdy wrote: > Erkki I. Kolehmainen wrote: >> There are several language locales in CLDR 1.5 with multiple scripts, >> although the current maximum is three (no set limit) for Uzbek. > > Isn't Aramaic written natively with five scripts (at least)? > Also Kashmiri is also found with three scripts. 
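The inheritance model described in this thread ('root' is the parent of 'fa', which is the parent of 'fa_IR' and 'fa_AF') can be illustrated with a short Python sketch. It is only a rough illustration under simplifying assumptions: the parent of a locale is found by dropping the last underscore-separated subtag, the function names fallback_chain and lookup are hypothetical, and TOY_DATA holds invented placeholder values rather than real CLDR data. The actual rules are those in the LDML specification (tr35) and may differ in detail.

```python
def fallback_chain(locale_id):
    """Return the locale and its ancestors, most specific first, ending in 'root'.

    Assumption: the parent is obtained by truncating the last subtag.
    """
    parts = locale_id.split("_")
    chain = ["_".join(parts[:n]) for n in range(len(parts), 0, -1)]
    if chain[-1] != "root":
        chain.append("root")
    return chain


# Invented placeholder data, NOT real CLDR content.
# 'fa_IR' is deliberately absent: as described in the thread, it is empty
# and everything it needs lives in 'fa'.
TOY_DATA = {
    "root": {"number_pattern": "pattern from root", "calendar": "calendar from root"},
    "fa": {"number_pattern": "pattern from fa"},
    "fa_AF": {"number_pattern": "override from fa_AF"},
}


def lookup(locale_id, key, data=TOY_DATA):
    """Return (value, locale_that_supplied_it) for the first locale in the chain that has key."""
    for loc in fallback_chain(locale_id):
        if key in data.get(loc, {}):
            return data[loc][key], loc
    raise KeyError(key)


print(fallback_chain("fa_IR"))            # ['fa_IR', 'fa', 'root']
print(fallback_chain("uz_Cyrl_UZ"))       # ['uz_Cyrl_UZ', 'uz_Cyrl', 'uz', 'root']
print(lookup("fa_US", "number_pattern"))  # ('pattern from fa', 'fa')
print(lookup("fa_AF", "number_pattern"))  # ('override from fa_AF', 'fa_AF')
print(lookup("fa_IR", "calendar"))        # ('calendar from root', 'root')
```

In this sketch a request for fa_US, which has no data of its own, still resolves through 'fa', which matches the behaviour described for the fa_US demo page above. A locale with a script subtag falls back the same way, one subtag at a time.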
From rick@unicode.org Tue Oct 30 13:45:25 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Tue, 30 Oct 2007 13:50:09 -0600 (CST) Received: from izanami (c-71-202-247-55.hsd1.ca.comcast.net [71.202.247.55]) by unicode.org (8.12.11/8.12.11) with SMTP id l9UJjKZe006864; Tue, 30 Oct 2007 13:45:20 -0600 Message-Id: <200710301945.l9UJjKZe006864@unicode.org> To: unicode@unicode.org Subject: New Public Review Issue: Proposed Update UTR #36 Unicode Security Considerations Date: Tue, 30 Oct 2007 11:45:23 -0800 From: Rick McGowan received: by Apple.Mailer (2.95.2) X-archive-position: 273 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: rick@unicode.org Precedence: bulk X-list: cldr-users The Unicode Technical Committee has posted a new issue for public review and comment. Details are on the following web page: http://www.unicode.org/review/ The review period for the new item closes on January 28, 2008. Please see the page for links to discussion and relevant documents. Briefly, the new issue is: 115 Proposed Update to UTR #36 Unicode Security Considerations http://www.unicode.org/reports/tr36/tr36-6.html Changes in this proposed update include: * Added explanation of UTF-8 over-consumption attack in 3.1 UTF-8 Exploits * Added subsection of 2.8.2 Mapping and Prohibition describing the Unicode 5.1 changes in identifiers * Added section 3.4 Property and Character Stability If you have comments for official UTC consideration, please submit them through our feedback & reporting page: http://www.unicode.org/reporting.html If you wish to discuss issues on the Unicode mail list, then please use the following link to subscribe (if necessary). Please be aware that discussion comments on the Unicode mail list are not automatically recorded as input to the UTC. You must use the reporting link above to generate comments for UTC consideration. 
http://www.unicode.org/consortium/distlist.html Regards, Rick McGowan Unicode, Inc. From v-magdad@microsoft.com Tue Oct 30 15:15:42 2007 Received: with ECARTIS (v1.0.0; list cldr-users); Tue, 30 Oct 2007 15:15:43 -0600 (CST) Received: from smtp.microsoft.com (maila.microsoft.com [131.107.115.212]) by unicode.org (8.12.11/8.12.11) with ESMTP id l9ULFgLB024837; Tue, 30 Oct 2007 15:15:42 -0600 Received: from TK5-EXHUB-C102.redmond.corp.microsoft.com (157.54.70.72) by TK5-EXGWY-E801.partners.extranet.microsoft.com (10.251.56.50) with Microsoft SMTP Server (TLS) id 8.1.222.3; Tue, 30 Oct 2007 14:15:31 -0700 Received: from NA-EXMSG-C125.redmond.corp.microsoft.com ([157.54.61.83]) by TK5-EXHUB-C102.redmond.corp.microsoft.com ([157.54.70.72]) with mapi; Tue, 30 Oct 2007 14:15:36 -0700 From: "Magda Danish (Unicode)" To: "unicode@unicode.org" Date: Tue, 30 Oct 2007 14:15:35 -0700 Subject: New FAQ page posted Thread-Topic: New FAQ page posted Thread-Index: AcgbOgLTzv43skRzTluYIo3TC6llIw== Message-ID: <871A62EA91884849A3BE952CA63832D01550D0AE3B@NA-EXMSG-C125.redmond.corp.microsoft.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: multipart/alternative; boundary="_000_871A62EA91884849A3BE952CA63832D01550D0AE3BNAEXMSGC125re_" MIME-Version: 1.0 X-archive-position: 274 X-ecartis-version: Ecartis v1.0.0 Sender: cldr-users-bounce@unicode.org Errors-to: cldr-users-bounce@unicode.org X-original-sender: v-magdad@microsoft.com Precedence: bulk X-list: cldr-users --_000_871A62EA91884849A3BE952CA63832D01550D0AE3BNAEXMSGC125re_ Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit A new FAQ page has been posted at http://unicode.org/cldr/data/docs/web/locale_faq.html. It is also accessible through the main FAQ page at http://www.unicode.org/faq/. This page answers questions about Unicode Locales, CLDR, and LDML. --------------------------- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 magda@unicode.org --_000_871A62EA91884849A3BE952CA63832D01550D0AE3BNAEXMSGC125re_--