Jump to content

Talk:Plural

From translatewiki.net

Contents

Thread titleRepliesLast modified
Difference between CLDR plural rules and MediaWiki plural rules1115:38, 3 April 2023
Plural not supported ... WikimediaMobile012:21, 22 February 2018
Plural rule changes for many languages317:53, 9 January 2015
Changes in plural for Scots Gaelic (gd) in MediaWiki013:22, 17 December 2013
Request clarification of rules for writing plural120:12, 8 December 2013
PLURAL keyword for languages without grammatical plural forms412:00, 16 September 2012
On different plural systems701:21, 12 July 2012
Documenting plural on language portals117:53, 28 April 2012
SVN or Git?1020:30, 15 April 2012
Table of Plural/Mediawiki_plural_rules ready013:06, 9 April 2012
Russian plural314:56, 1 April 2012
Only for your information000:15, 13 March 2012

Difference between CLDR plural rules and MediaWiki plural rules

Edited by another user.
Last edit: 15:38, 3 April 2023

CLDR plural rules define different sets of decimal numbers according to the differences in grammar which occur in a sentence which includes those numbers - see Language Plural Rules on CLDR. These rules do not appear to cover the situation where a decimal number does not appear in a sentence.

In MediaWiki there are a handful of sentences where the PLURAL magic word is used but no number appears in the sentence. In these sentences there is a subject or object which can be either singular or plural and PLURAL is used to enable correct grammatical sentences where verbs, pronouns, etc differ according to the (unstated) number of items discussed. Some examples are:

  • Hidden-categories (“{{PLURAL:$1|Hidden category|Hidden categories}}”)
  • Cascadeprotected (“This page has been protected from editing because it is transcluded in the following {{PLURAL:$1|page, which is|pages, which are}} protected with the "cascading" option turned on: $2”)
  • Cascadeprotectedwarning (“<strong>Warning:</strong> This page has been protected so that only users with [[Special:ListGroupRights|specific rights]] can edit it because it is transcluded in the following cascade-protected {{PLURAL:$1|page|pages}}:”)
  • Pagecategories (“{{PLURAL:$1|Category|Categories}}”)
  • Badaccess-groups (“The action you have requested is limited to users in {{PLURAL:$2|the group|one of the groups}}: $1.”)
  • Checkuser-block-success (“'''The {{PLURAL:$2|user $1 is|users $1 are}} now blocked.'''”)
  • Achievements-badge-hover-desc-edit (“Awarded for making $1 {{PLURAL:$1|edit|edits}}<br /> on {{PLURAL:$1|an article|article}}!”)

Potentially there are grammar changes triggered here which are different from the grammar rules triggered by changes in decimal numbers. For example:

  • the form of a collective noun (referring to 'more than one') may be different to the form used with a specific number, for example in "the 11 following things" and "the following things" the form for "things" may differ.
  • verb or pronoun forms used may be different for singular and collective plurals, but not different when the number is included in the sentence.

I know these cases should be rare, and it may be that in practice there are no languages where the CLDR rules do not cover all the uses of PLURAL in MediaWiki as well, but potentially there could be languages affected by this difference in how PLURAL is used on MediaWiki and how it is defined on CLDR. Does anyone know of any languages affected by this?

I think that this is not really an issue that necessarily needs to be raised at CLDR, although I am sure they would be interested to learn about how MediaWiki uses PLURAL in sentences that do not include a decimal number. However, what it may mean is that there may be rare cases where translatewiki.net will need to define more number sets for a language than there are in CLDR.

Lloffiwr16:24, 1 August 2010
Edited by another user.
Last edit: 18:43, 29 January 2012

In other words we want to know if there is a language where any given number-included plural form is not a subset of number-less plural form. If this is the case, then we cannot achieve number-less forms by just combining existing (number-included) plural forms (in practice, giving them the same translation) in suitable combination.

Nike17:25, 1 August 2010

And in far fewer words than I used:)

Lloffiwr17:46, 1 August 2010
 

I'm not aware of a language not having the wanted subset relation. Since there may be ones, I think, CLDR should be made aware, with the suggestion to make a note on their explanatory page. Unless someone reports that (s)he did it already, I shall do that.

Purodha Blissenbach08:40, 18 September 2010
 

Well, I think, it would be a superset, but anyways. When going through the above examples again, I found that in Colognian, a sentence followed or preceded by a list (the items of which you can count) is to be treated and built exactly as if the number was included, even if the sentence itself is numberless.

Purodha Blissenbach18:18, 14 May 2011

The page on plural rule syntax at CLDR says: "There are two extra values that can be used with count attributes: 0 and 1. These are used for the explicit values, and may or may not be the same as the forms for "zero" and "one"." It seems that CLDR have got around the problem of defining additional categories for use in particular circumstances by introducing these 2 additional values. Would it be possible to write code for Mediawiki plural which does the same, enabling the use of '0' and '1' only when needed?

Lloffiwr (talk)11:05, 10 March 2012

How would you specify those parameters?

Nike (talk)09:26, 6 April 2012

I wrote the previous comment before I had understood how Mediawiki uses more than one defined plural ruleset to handle numberless sentences (and potentially sentences with zero?). Mediawiki's solution appears to be elegant, with simpler syntax for translators for numberless sentences.

However, you also say in another thread that it is 'hard to unify Mediawiki rules with other systems'. Would it be easier to unify with other systems if instead of making the second ruleset shorter than the normal ruleset, instead we made it longer, typically by adding an additional rule for 1 (or for 1 and 2 for Scottish Gaelic for example) and an additional rule for 0 where needed (Swahili would benefit from an additional rule for 0, for example). Making the second ruleset longer is not as elegant as the current system. But does it help with compatibility with other systems?

Lloffiwr (talk)10:52, 7 April 2012
 
 
 
 
 

Plural not supported ... WikimediaMobile

See this message. Maybe needs to update table, if PLURAL is supported now?

Kaganer (talk)12:21, 22 February 2018

Plural rule changes for many languages

Edited by author.
Last edit: 20:50, 27 October 2014

MediaWiki is upgrading its plural rules to match CLDR version 26. The updates include incompatible changes for plural forms in Russian, Prussian, Tagalog, Manx and several languages that fall back to Russian. In addition there are minor changes for other languages.

In January 2014, CLDR 24 had introduced several changes in the plural forms for some of these languages, including Russian, and we had updated MediaWiki's plural rules to comply with the CLDR standard. Some of these changes are now being reverted. Below is a detailed explanation of the changes.

For the migration period, from Monday, 27th October 2014 to Thursday 6th November 2014, LocalisationUpdate is disabled at Wikimedia wikis to reduce the chance of ungrammatical translations being displayed in the interface. Translators are requested to update translations.

Russian and language using Russian as fallback

[edit source]

Languages affected: Russian (ru), Abkhaz (ab), Avaric (av), Bashkir (ba), Buryat (bxr), Chechen (ce), Crimean Tatar (crh-cyrl), Chuvash (cv), Inguish (inh), Komi-Permyak (koi), Karachay-Balkar (krc), Komi (kv), Lak (lbe), Lezghian (lez), Eastern Mari (mhr), Western Mari (mrj), Yakut (sah), Tatar (tt), Tatar-Cyrillic (tt-cyrl), Tuvinian (tyv), Udmurt (udm), Kalmyk (xal).

CLDR 24 plural forms for Russian were:

  • Form 1: @integer 1, 21, 31, 41, 51, 61, 71, 81, 101, 1001, …
  • Form 2: @integer 0, 5~19, 100, 1000, 10000, 100000, 1000000, …
  • Form 3: @integer 2~4, 22~24, 32~34, 42~44, 52~54, 62, 102, 1002, … @decimal 0.0~1.5, 10.0, 100.0, 1000.0, 10000.0, 100000.0, 1000000.0, …

This has been changed to :

  • Form 1: @integer 1, 21, 31, 41, 51, 61, 71, 81, 101, 1001, …
  • Form 2: @integer 2~4, 22~24, 32~34, 42~44, 52~54, 62, 102, 1002, …
  • Form 3: @integer 0, 5~19, 100, 1000, 10000, 100000, 1000000, …
  • Form 4: @decimal 0.0~1.5, 10.0, 100.0, 1000.0, 10000.0, 100000.0, 1000000.0, …

Plurals in translations for affected languages have been updated automatically where possible. Translators are requested to check all messages containing plurals, starting from those which have been marked as outdated.

Prussian

[edit source]

Prussian (prg) now follows the same rules as Latvian (lv):

  • Form 1: @integer 0, 10~20, 30, 40, 50, 60, 100, 1000, 10000, 100000, 1000000, … @decimal 0.0, 10.0, 11.0, 12.0, 13.0, 14.0, 15.0, 16.0, 100.0, 1000.0, 10000.0, 100000.0, 1000000.0, …
  • Form 2: @integer 1, 21, 31, 41, 51, 61, 71, 81, 101, 1001, … @decimal 0.1, 1.0, 1.1, 2.1, 3.1, 4.1, 5.1, 6.1, 7.1, 10.1, 100.1, 1000.1, …
  • Form 3: @integer 2~9, 22~29, 102, 1002, … @decimal 0.2~0.9, 1.2~1.9, 10.2, 100.2, 1000.2, …

Translators are requested to update all translations containing plural rules. Those translations have been marked as outdated.

Tagalog

[edit source]

Tagalog (tl) has new rules as follows:

  • Form 1: @integer 0~3, 5, 7, 8, 10~13, 15, 17, 18, 20, 21, 100, 1000, 10000, 100000, 1000000, … @decimal 0.0~0.3, 0.5, 0.7, 0.8, 1.0~1.3, 1.5, 1.7, 1.8, 2.0, 2.1, 10.0, 100.0, 1000.0, 10000.0, 100000.0, 1000000.0, …
  • Form 2: @integer 4, 6, 9, 14, 16, 19, 24, 26, 104, 1004, … @decimal 0.4, 0.6, 0.9, 1.4, 1.6, 1.9, 2.4, 2.6, 10.4, 100.4, 1000.4, …

Translators are requested to update all translations containing plural rules. Those translations have been marked as outdated.

Manx (gv) has a new (fourth) form for decimals. New rules are as follows:

  • Form 1: @integer 1, 11, 21, 31, 41, 51, 61, 71, 101, 1001, …
  • Form 2: @integer 2, 12, 22, 32, 42, 52, 62, 72, 102, 1002, …
  • Form 3: @integer 0, 20, 40, 60, 80, 100, 120, 140, 1000, 10000, 100000, 1000000, …
  • Form 4: @decimal 0.0~1.5, 10.0, 100.0, 1000.0, 10000.0, 100000.0, 1000000.0, …
  • Form 5: @integer 3~10, 13~19, 23, 103, 1003, …

Translators are requested to update all translations containing plural rules. Those translations have been marked as outdated.

Other languages

[edit source]
  • In Mirandese (mwl), Portuguese (pt) and Brazilian Portuguese (pt-br), the first form now also includes zero.
  • In Uyghur (ug), Lower Sorbian (dsb) and Upper Sorbian (hsb), support for decimals was added.
  • In Asturian (ast) and Western Frisian (fy), the first form is no longer used for decimals.

Translators are encouraged to review translations with plural forms and update them where necessary. Because the changes have been minor, we have not marked those translations as outdated.

Nike (talk)18:35, 27 October 2014

Here is the output from the migration script.

Statistics (gv)

[edit source]
Total translations: 1528
-- with {{        : 76 (5%)
-- with {{plural}}: 44 (2.9%)
---- instances: 1 : 41
---- instances: 2 : 3
---- updated      : 0 (0% of 44)
---- fuzzied      : 44 (100% of 44)

Statistics (tl)

[edit source]
Total translations: 14750
-- with {{        : 1322 (9%)
-- with {{plural}}: 520 (3.5%)
---- instances: 1 : 440
---- instances: 2 : 66
---- instances: 3 : 13
---- instances: 4 : 1
---- updated      : 0 (0% of 520)
---- fuzzied      : 520 (100% of 520)

Statistics (prg)

[edit source]
Total translations: 2013
-- with {{        : 150 (7.5%)
-- with {{plural}}: 100 (5%)
---- instances: 1 : 91
---- instances: 2 : 9
---- updated      : 0 (0% of 100)
---- fuzzied      : 100 (100% of 100)

Statistics (ru)

[edit source]
Total translations: 20607
-- with {{        : 2291 (11.1%)
-- with {{plural}}: 847 (4.1%)
---- instances: 1 : 701
---- instances: 2 : 123
---- instances: 3 : 20
---- instances: 4 : 3
---- forms:   1+0 : 19
---- forms:   1+1 : 270
---- forms:   2+0 : 155
---- forms:   2+1 : 26
---- forms:   2+2 : 1
---- forms:   3+0 : 397
---- forms:   3+1 : 49
---- forms:   3+2 : 3
---- updated      : 376 (44.4% of 847)
---- fuzzied      : 0 (0% of 847)

Statistics (ab)

[edit source]
Total translations: 165
-- with {{        : 4 (2.4%)
-- with {{plural}}: 1 (0.6%)
---- instances: 1 : 1
---- forms:   1+1 : 1
---- updated      : 0 (0% of 1)
---- fuzzied      : 0 (0% of 1)

Statistics (av)

[edit source]
Total translations: 402
-- with {{        : 12 (3%)
-- with {{plural}}: 7 (1.7%)
---- instances: 1 : 7
---- forms:   1+1 : 5
---- forms:   3+0 : 2
---- updated      : 2 (28.6% of 7)
---- fuzzied      : 0 (0% of 7)

Statistics (ba)

[edit source]
Total translations: 7889
-- with {{        : 784 (9.9%)
-- with {{plural}}: 274 (3.5%)
---- instances: 1 : 248
---- instances: 2 : 19
---- instances: 3 : 6
---- instances: 4 : 1
---- forms:   0+0 : 1
---- forms:   0+1 : 6
---- forms:   1+0 : 121
---- forms:   1+1 : 138
---- forms:   2+0 : 5
---- forms:   3+0 : 3
---- updated      : 3 (1.1% of 274)
---- fuzzied      : 0 (0% of 274)

Statistics (bxr)

[edit source]
Total translations: 318
-- with {{        : 28 (8.8%)
-- with {{plural}}: 24 (7.5%)
---- instances: 1 : 24
---- forms:   1+0 : 22
---- forms:   2+0 : 1
---- forms:   2+1 : 1
---- updated      : 0 (0% of 24)
---- fuzzied      : 0 (0% of 24)

Statistics (ce)

[edit source]
Total translations: 9369
-- with {{        : 1156 (12.3%)
-- with {{plural}}: 473 (5%)
---- instances: 1 : 395
---- instances: 2 : 65
---- instances: 3 : 12
---- instances: 4 : 1
---- forms:   1+0 : 158
---- forms:   1+1 : 145
---- forms:   2+0 : 158
---- forms:   2+1 : 6
---- forms:   3+0 : 8
---- updated      : 6 (1.3% of 473)
---- fuzzied      : 0 (0% of 473)

Statistics (crh-cyrl)

[edit source]
Total translations: 1468
-- with {{        : 147 (10%)
-- with {{plural}}: 83 (5.7%)
---- instances: 1 : 80
---- instances: 2 : 3
---- forms:   1+0 : 3
---- forms:   1+1 : 80
---- updated      : 0 (0% of 83)
---- fuzzied      : 0 (0% of 83)

Statistics (cv)

[edit source]
Total translations: 987
-- with {{        : 36 (3.6%)
-- with {{plural}}: 20 (2%)
---- instances: 1 : 19
---- instances: 2 : 1
---- forms:   1+0 : 2
---- forms:   1+1 : 16
---- forms:   2+0 : 1
---- forms:   3+0 : 2
---- updated      : 1 (5% of 20)
---- fuzzied      : 0 (0% of 20)

Statistics (inh)

[edit source]
MediaWiki:Prevn/inh has too many forms
!!FUZZY!!!!FUZZY!!{{PLURAL:$1|хьалхйоаг|ар $1|хьалхйоаг|араш $1|хьалхйоаг|араш $1}}
Total translations: 924
-- with {{        : 55 (6%)
-- with {{plural}}: 37 (4%)
---- instances: 1 : 33
---- instances: 2 : 3
---- instances: 3 : 1
---- forms:   1+1 : 33
---- forms:   3+0 : 3
---- forms:   6+0 : 1
---- updated      : 3 (8.1% of 37)
---- fuzzied      : 1 (2.7% of 37)

Statistics (koi)

[edit source]
Total translations: 584
-- with {{        : 34 (5.8%)
-- with {{plural}}: 25 (4.3%)
---- instances: 1 : 23
---- instances: 2 : 2
---- forms:   1+0 : 4
---- forms:   1+1 : 13
---- forms:   3+0 : 8
---- updated      : 8 (32% of 25)
---- fuzzied      : 0 (0% of 25)

Statistics (krc)

[edit source]
Total translations: 4042
-- with {{        : 407 (10.1%)
-- with {{plural}}: 217 (5.4%)
---- instances: 1 : 197
---- instances: 2 : 15
---- instances: 3 : 4
---- instances: 4 : 1
---- forms:   1+0 : 85
---- forms:   1+1 : 64
---- forms:   2+0 : 66
---- forms:   2+1 : 2
---- updated      : 0 (0% of 217)
---- fuzzied      : 0 (0% of 217)

Statistics (kv)

[edit source]
Total translations: 119
-- with {{        : 3 (2.5%)
-- with {{plural}}: 2 (1.7%)
---- instances: 1 : 2
---- forms:   1+0 : 1
---- forms:   1+1 : 1
---- updated      : 0 (0% of 2)
---- fuzzied      : 0 (0% of 2)

Statistics (lbe)

[edit source]
Total translations: 41
-- with {{        : 0 (0%)
-- with {{plural}}: 0 (0%)
---- updated      : 0 (0% of 0)
---- fuzzied      : 0 (0% of 0)

Statistics (lez)

[edit source]
Total translations: 1335
-- with {{        : 55 (4.1%)
-- with {{plural}}: 38 (2.8%)
---- instances: 1 : 34
---- instances: 2 : 3
---- instances: 3 : 1
---- forms:   1+0 : 2
---- forms:   1+1 : 34
---- forms:   2+0 : 2
---- updated      : 0 (0% of 38)
---- fuzzied      : 0 (0% of 38)

Statistics (mhr)

[edit source]
Total translations: 892
-- with {{        : 49 (5.5%)
-- with {{plural}}: 33 (3.7%)
---- instances: 1 : 28
---- instances: 2 : 4
---- instances: 3 : 1
---- forms:   1+0 : 7
---- forms:   1+1 : 25
---- forms:   3+0 : 3
---- updated      : 1 (3% of 33)
---- fuzzied      : 0 (0% of 33)

Statistics (mrj)

[edit source]
Total translations: 508
-- with {{        : 31 (6.1%)
-- with {{plural}}: 22 (4.3%)
---- instances: 1 : 20
---- instances: 2 : 2
---- forms:   1+1 : 7
---- forms:   2+0 : 2
---- forms:   3+0 : 15
---- updated      : 13 (59.1% of 22)
---- fuzzied      : 0 (0% of 22)

Statistics (sah)

[edit source]
Total translations: 6932
-- with {{        : 659 (9.5%)
-- with {{plural}}: 217 (3.1%)
---- instances: 1 : 188
---- instances: 2 : 23
---- instances: 3 : 6
---- forms:   1+0 : 6
---- forms:   1+1 : 6
---- forms:   2+0 : 204
---- forms:   2+1 : 1
---- updated      : 0 (0% of 217)
---- fuzzied      : 0 (0% of 217)

Statistics (tt)

[edit source]
Total translations: 30
-- with {{        : 2 (6.7%)
-- with {{plural}}: 2 (6.7%)
---- instances: 1 : 2
---- forms:   1+0 : 1
---- forms:   2+0 : 1
---- updated      : 0 (0% of 2)
---- fuzzied      : 0 (0% of 2)

Statistics (tt-cyrl)

[edit source]
Total translations: 3085
-- with {{        : 271 (8.8%)
-- with {{plural}}: 98 (3.2%)
---- instances: 1 : 83
---- instances: 2 : 10
---- instances: 3 : 5
---- forms:   0+1 : 2
---- forms:   1+0 : 50
---- forms:   1+1 : 39
---- forms:   2+0 : 3
---- forms:   3+0 : 6
---- updated      : 4 (4.1% of 98)
---- fuzzied      : 0 (0% of 98)

Statistics (tyv)

[edit source]
Total translations: 1188
-- with {{        : 87 (7.3%)
-- with {{plural}}: 56 (4.7%)
---- instances: 1 : 55
---- instances: 3 : 1
---- forms:   1+0 : 24
---- forms:   1+1 : 25
---- forms:   2+0 : 7
---- updated      : 0 (0% of 56)
---- fuzzied      : 0 (0% of 56)

Statistics (udm)

[edit source]
Total translations: 210
-- with {{        : 9 (4.3%)
-- with {{plural}}: 6 (2.9%)
---- instances: 1 : 6
---- forms:   1+1 : 6
---- updated      : 0 (0% of 6)
---- fuzzied      : 0 (0% of 6)

Statistics (xal)

[edit source]
Total translations: 947
-- with {{        : 60 (6.3%)
-- with {{plural}}: 36 (3.8%)
---- instances: 1 : 33
---- instances: 2 : 3
---- forms:   1+0 : 2
---- forms:   1+1 : 20
---- forms:   2+0 : 2
---- forms:   3+0 : 14
---- updated      : 12 (33.3% of 36)
---- fuzzied      : 0 (0% of 36)
Nike (talk)20:48, 27 October 2014

520 fuzzied for Tagalog? :o Better copy this message there (done). Yay for 0 fuzzy in Russian. :)

Nemo (talk)07:02, 28 October 2014
 

I have fixed this:

MediaWiki:Prevn/inh has too many forms
Shirayuki (talk)21:30, 28 October 2014
 
 

Changes in plural for Scots Gaelic (gd) in MediaWiki

A thread, Thread:Talk:Plural/Changes in plural for Scots Gaelic (gd) in MediaWiki, was moved from here to Portal talk:Gd/LiquidThreads. This move was made by Siebrand (talk | contribs) on 17 December 2013 at 13:22.

Request clarification of rules for writing plural

  • Can we use 0= and 1= when we want to (as long as the version is 1.22 and above) or only when the English version use it?
  • Can we add the syntax 0= or 1= anywhere in the phrase or do they have to be written at the beginning or end?
Lloffiwr (talk)19:18, 9 November 2013

Anywhere, and whenever you want to.

Siebrand19:59, 11 November 2013
 

PLURAL keyword for languages without grammatical plural forms

At some recent point, there seemingly was a change in how the PLURAL keyword of MediaWiki works. Could anyone shed me the light what the change really is, and how we should now translate messages with PLURAL in the source language?

Here is a bit of what I understand (correct me if I'm wrong): It seems that when the site/user language is specified as Japanese, the MediaWiki function CLDRPluralRuleEvaluator.evaluateCompiled always returns 0. This means that a PLURAL keyword will always be replaced by its first argument. Gramattically, it's correct that nouns have the same form regardless of number in languages like Japanese. However, abandoning PLURAL at all causes a problem in messages such as MediaWiki:Category-article-count/ja.

PLURAL keyword seems to be used beyond just grammatical numbers. Here, the English message MediaWiki:Category-article-count/en uses PLURAL not only for switching between 'page' and 'pages', but also for deciding whether to add the part saying 'out of xx total' or not. Japanese translators here tried to literally translate it, and ended in having MediaWiki to always show the first part, corresponding to the singular case, with no information of the total number. Since most categories have multiple pages, most of the case the rendering is wrong. For instance, see how this message is rendered, on the English Wiktionary, correctly with the English interface [1] , but incorrectly with the Japanese interface [2].

What is the recommended way to translate them now? Do we want to remove all PLURAL when translating into Japanese, regardless of how it is used? If we are to throw out PLURAL for languages without grammatical plural forms, I think we'd have to review all the instances in the messages ...

whym06:20, 13 September 2012

I see the problem. It is pretty obvious from Plural/Comparison_of_plural_rules_in_various_databases where ja and ko have implicit default n != 1 rule in MW while explicit no forms rules in other sets.

Short term fix is to avoid plural in messages, in longer term I think this would be appropritately solved by supporting explicit number forms like {{PLURAL:$1|0=something special|usual text}}.

Nike (talk)16:02, 14 September 2012

Correct me if I'm wrong but this is only an issue for messages with special n=0 cases (for which we usually use different messages altogether).

Nike (talk)16:11, 14 September 2012

I can see the short term fix will work most of the case. Sometimes we'd have to choose translation a bit inconsistent to original, but the difference would be minor anyway.

I'm not sure how to check whether it's a 'n=0' case or not. If MediaWiki:badaccess-groups/ja is a non 'n=0' case, I confirm it's also affected.

whym19:53, 14 September 2012
 
 
 

On different plural systems

Edited by another user.
Last edit: 16:39, 10 March 2012

I made some research on the various plural systems used by projects in translatewiki.net. It compares plural rules between three main systems: MediaWiki, Gettext and CLDR (used by Ruby projects). The results are at page Plural/Comparison_of_plural_rules_in_various_databases. The most troubling part are the detailed reports after the overview. In the detailed part the plural rules are compared between all applicable systems. If there are differences, those are printed out. Can we reach out speakers of those languages to verify which ruleset (if any) is correct.

If we can sort those issues out, we can go further and try to get CLDR data as complete as possible, and have all other projects use CLDR definitions in way or another. We could for example provide extended list of plural rules (based on CLDR data with missing rules added) for all the supported projects.

Nike16:04, 10 September 2010

Looking for someone who wants to help getting this sorted out.

Siebrand09:25, 11 September 2010
 

I send out two notices that I hope will get some attention.

Siebrand09:50, 11 September 2010
 

If someone wonders about the Gettext, there isn't any agreed standard on those rules, and different projects may use different rules for the same language. The list of Gettext rules would be those used by translatewiki.net and is built on multiple sources. The raw data for Gettext is at [1] and preprocessed CLDR data is at [2]. There is one more peculiarity with CLDR data, that it also tries to take into account decimal numbers.

Nike10:19, 11 September 2010

I am glad to say that CLDR have now changed their plural rules for Welsh to agree with the rules on translatewiki.net. See the ticket and the rules.

I haven't attempted to do anything with Gettext yet, and won't be in a position to tackle this for a while. Having had some discussions with the Bedwyr Language Institute at Bangor University about the plural rules (they requested the changes at CLDR), it appears that most commercial websites and computer programs have in the past just not put any plural rules for Welsh onto their systems. Since the mutation rules are complex, it appears that some linguists have opted to not mutate at all, just using the singular form of a word in an unmutated state, analogous to websites in English which use 'him' to refer to 'him/her'. Given that that is the case, it might prove impossible to get Gettext to follow the CLDR and translatewiki.net pattern. How big a problem is it likely to be in future if Gettext continues to be out of step with CLDR for Welsh?

Lloffiwr18:05, 4 December 2010

As far as I know we can put any rule we want into Gettext headers.

Nike19:12, 4 December 2010

OK. If that is so, am I right in thinking that for the projects here which use Gettext, we can use the 6 plural rules for Welsh, same as currently used on MediaWiki and CLDR?

Lloffiwr19:18, 4 December 2010

Yes, that's how I think it works. The current set of Gettext rules in the comparison is just collection of rules from different sources.

Nike19:29, 4 December 2010
 
 
 
 
 

Documenting plural on language portals

I have noticed that only a few language portals have notes or links to sub-pages of notes on plural for their language. I think we need to recommend that each language which doesn't use the default plural rules make notes on plural on their portal, so that any new translators have easy access to this information. Ideally, it would be good to have the page on Plural/Mediawiki plural rules checked before requesting documentation on the language portals, to avoid confusion if there are mistakes on the page (which is more than likely since I am not familiar with software code). I am willing to do the work of putting a request for documentation, to be written in the language of the portal, on the talk pages of those languages which differ from English plural in either Mediawiki or CLDR. Any comments before I start?

Lloffiwr (talk)14:05, 9 April 2012

Requests to check Mediawiki plural rules and for documentation left on language portal talk pages.

Lloffiwr (talk)17:53, 28 April 2012
 

SVN or Git?

This page writes about Mediawiki with SVN in mind. As far as I know, Mediawiki just shifted to Git. And this page should reflect that.

The transitions is underway, but doesn't actually affect this topic in any way.

Nike (talk)12:58, 5 March 2012

A lot of links are going to SVN from the plural page and from plural/Mediawiki plural rules. When the migration is done, if the links need to be redone, can someone provide the new address of this to set me on the right path?

Lloffiwr (talk)17:56, 31 March 2012
 
 

The table of Plural/Mediawiki_plural_rules has been drafted. Could a staff member or someone who understands software code please check that I have interpreted the software rules correctly?

Lloffiwr (talk)13:06, 9 April 2012

Russian plural

I notice that Russian appears to have two different sets of rules defined - see the SVN file. One set is used in sentences with numbers, one in sentences without numbers. Can someone confirm that the software recognises which set of plural to use by whether the sentence contains the number or not? If this works for Russian, then presumably it could work for all languages, and would be of interest to some of the languages where the number 1 is in a group with other numbers. Please can a developer comment before I try to write this up on the plural page.

Lloffiwr (talk)17:36, 31 March 2012

Languages lt, sh, sr and uk also appear to have 2 sets of rules, as for Russian.

Lloffiwr (talk)18:34, 31 March 2012
 

Software doesn't recognize it. It just acts differently when less than usual number of forms are given (which is something that makes it hard to unify mediawiki rules with other systems).

Nike (talk)20:49, 31 March 2012

I have tried to write this up on Plural#Alternative_ruleset. As usual, I would be grateful if this could be checked for accuracy and usefulness. I have also added an extra column to the table of Mediawiki plural rules to carry a note on which languages have the additional ruleset.

Lloffiwr (talk)14:56, 1 April 2012
 
 

Only for your information

Hello,

for your information I would like to point out to you that there is still another plural system that is used by Mozilla programs (Firefox, SeaMonkey, Thunderbird ...), see here. It's for your information only because I think that is not relevant for current Translatewiki projects.

Kind regards

Michawiki (talk)00:15, 13 March 2012