Jump to content

Language names not in Unicode CLDR

I'd say not to bother about those non-standard codes. Sure, we can use sitenotice once those language names are added to the English source for CLDR, but first we could send direct messages to Language support team members for languages with existing CLDR locales. In the meanwhile I'll email Amir, Santhosh and the CLDR survey tool admin to figure out account creation for our translators.

Nemo (talk)20:28, 27 April 2014

Language names were added to English source yesterday! http://unicode.org/cldr/trac/changeset/10166 In May we'll translate them. :)

Nemo (talk)18:11, 29 April 2014

I see some of the needed translations into Japanese (and a couple of other languages) were added at https://git.wikimedia.org/tree/mediawiki%2Fextensions%2Fcldr/HEAD/LocalNames . Can someone merge them?

whym16:17, 2 May 2014

whym, yes, you can. :) Send me an email and I'll add you to the CLDR survey tool as soon as possible. Let me know if you want to add translations in all those languages or only Japanese.

Nemo (talk)16:33, 2 May 2014

I have added a news item; the banner can wait till 8 May, if I remember.

The survey tool on CLDR will be open for contributions from 8 May to 19 June, for those keen to contribute as soon as possible. If you already have an account at CLDR you can log in here.

Lloffiwr (talk)14:19, 3 May 2014
 

I have reviewed the list of aliases at CLDR. Apart from the macrolanguages als, bcc, bcl, bxr, diq, mhr, pnb and rmy, there are 3 codes on this list, which are used at translatewiki.net:

  • mo - Moldovan, deprecated in CLDR - CLDR use ro_MD
  • sh - Serbo-Croatian in translatewiki.net, Serbian (Latin) in CLDR. CLDR use sr_Latn for Serbian(Latin)
  • tl - Tagalog in twn, Filipino in CLDR. CLDR use fil for Filipino.

These 3 codes are already in CLDR so I assume there must be a way of mapping the CLDR code to the twn code.

Lloffiwr (talk)14:21, 3 May 2014

Thanks, that's useful. The other day I was stupidly wondering how could CLDR not have Tagalog as locale... I'm not sure about aliasing but surely one bug should be filed for each of those languages to be renamed to its proper language code, can you do that? At least tl sounds uncontroversial.

Nemo (talk)14:30, 3 May 2014

I think that Siebrand is already aware of these, and will know better than I whether they should be changed.

Lloffiwr (talk)19:56, 3 May 2014
 

What we commonly call "Tagalog" in Wikimedia is the "Filipino" (or Pilipino) language in standards. But the language code "tl" is ambiguous, it can be considered as a macrolanguage encompassing the traditional Tagalog and the modern Filipino. Filipino has its CLDR data under its standard code as an individual language. Note that the traditional Tagalog was not written with the Latin script, and was not so much creolized with lots of borrows and important simplifications of the phonology. "tl" is not recommended, but as a macrolanguage, can be considered like "zh" for Chinese (even if most of the time it just means modern Mandarin, and most of the time in the simplified version of the Han script). "tl-Tglg" on the opposite only qualifies the traditional language (the modern Filipino is almost never written in the traditional script, and that's probably why "tl" is not standardized as including Filipino). Wikimedia makes an exception to that view on its localized sites (but not in Wiktionary which preferably uses more precise language codes).

Verdy p (talk)20:55, 3 May 2014