Skip to content

rikkimongoose/pluralia4j

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pluralia4j is a lightweight library for Java 8+ that makes generating plural version of words easy.

In most langauges nouns have plural forms. When you display a message, you want it to look natural. A format string isn't enough for it, i.e.

String.format("You have %s new messages", newMessagesSize);
// if newMessageSize == 1, text became "You have 1 new messages".
// It's correct Java, but poor English.

If values are double, it brings even more trouble (0.5 inch, 1.5 inches, 1 inch, 0 inches, -1 inch, etc). Choosing the right variant is hard even to native speakers.

The English rules for plural nouns aren't easy. The languages with more complicated grammar has more complicated rules as well. A lot of languages (like Irish or Arabic) has not only Singular and Plural, but also Dual grammatical number. Some languages has Paukal number as well (Russian is most popular of them).

That makes i18n of messages with numbers a true pain. This rules are mostly aren't explained in schools.

But pluralia4j comes to the rescue. Based on standart CLDR Unicode Plural Rules, pluralia4j provides a small but powerful DSL to make your messages grammatically consistent.

Install

The pluralia4j package is currently avaible at Maven Central repository.

  1. Add to yours pom.xml:
<dependency>
    <groupId>io.github.rikkimongoose</groupId>
    <artifactId>pluralia4j</artifactId>
    <version>1.2</version>
</dependency>
  1. Run via command line:
$ mvn install

Usage

Translate using external data:

//stored data
final Map<String, Number> source = new HashMap<>();
source.put("catsCount", 21);
source.put("dogsCount", 10);

//Create plural
final Plural plural = new Plural(new PluralisationRussian());
//fill dictionary
        plural.dict("кот","котов","кота");

//Create a template
final MessageTemplate messageTemplate=MessageTemplate.builder()
        .text("У вас ")
        .data("catsCount")
        .text(" ")
        .plural("catsCount","кот")
        .text(", ")
        .data("dogsCount")
        .text(" ")
        //templates has their own dictionary
        .plural("dogsCount","собака","собак","собаки")
        .build();

        plural.plural(messageTemplate,source); //returns "У вас 21 кот, 10 собак"

Pluralia DSL

  • text(@NonNull String format, Object... args) — just put text. The format string-like attributes are allowed/
  • data(@NonNull String key) — extract data from provided map by key
  • data(@NonNull String key, @NotNull String format) — extract data from provided map by key and display it formatted
  • plural(@NonNull String mapKey, @NonNull String word, @NonNull String... wordforms) — Add plural by number key item. The word will be pluralised according to provided numberadd plural by map key item. The word will be pluralised according to number extracted from Map by key item
  • plural(@NonNull Number num, @NonNull String word, @NonNull String... wordforms) — Add plural by number key item. The word will be pluralised according to provided number

Wordforms order:

   .plural(ONE,OTHER,FEW,MANY,TWO,ZERO)

If language doesn't have ZERO, MANY, FEW, etc wordforms, just avoid this params.

Adding new languages

To add a new language, just implement plural rules for it deriving from com.github.pluralia4j.lang.Pluralisation class. Then just use it for Plural object:

final Plural plural=new Plural(new MyPluralisation());

You can add your own WordformsDictionary with predefined forms as well.

final Plural plural=new Plural(new MyPluralisation(),myWordfromsDictionary);

You can even add your predefined wordforms dictionary from a YAML file, using WordformsDictionaryLoader class:

final Plural plural = new Plural(new MyPluralisation(), WordformsDictionaryLoader.loadFromStream(myFileStream));

An example of YAML wordforms dictionary:

exceptions:
  formula: formulae
  mouse: mice
  goose: geese
  deer:
  sheep:

or, for a language with multiple forms:

exceptions:
  секунда: [секунды, секунд]
  минута: [минуты, минут]
  час: [часа, часов]
  день: [ дня, дней ]
  неделя: [ недели, недель ]
  месяц: [ месяца, месяцев ]
  квартал: [ квартала, кварталов ]
  год: [ года,лет ]
  десятилетие: [ десятилетия, десятилетий ]
  столетие: [ столетия, столетий ]
  век: [ века, веков ]
  тысячелетие: [ тысячелетия, тысячелетий ]

Languages

The following languages are supported. They are implemented according to CLDR v37.0 Language Plural Rules .

  • Afrikaans
  • Akan
  • Albanian
  • Amharic
  • Arabic
  • Aragonese
  • Armenian
  • Assamese
  • Asturian
  • Asu
  • Azerbaijani
  • Bambara
  • Bangla
  • Basque
  • Belorussian
  • Bemba
  • Bena
  • Bhojpuri
  • Bodo
  • Bosnian
  • Breton
  • Bulgarian
  • Burmese
  • Cantonese
  • Cebuano
  • CentralKurdish
  • Chechen
  • Cherokee
  • Chiga
  • Chinese
  • Colognian
  • Cornish
  • Croatian
  • Czech
  • Danish
  • Divehi
  • Dutch
  • Dzongkha
  • English
  • Esperanto
  • Estonian
  • EuropeanPortuguese
  • Ewe
  • Faroese
  • Filipino
  • Finnish
  • Friulian
  • Galician
  • Ganda
  • Georgian
  • German
  • Gun
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Ido
  • Igbo
  • InariSami
  • Indonesian
  • Interlingua
  • Inuktitut
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Jju
  • Kabuverdianu
  • Kako
  • Kalaallisut
  • Kannada
  • Kashmiri
  • Kazakh
  • Khmer
  • Korean
  • KoyraboroSenni
  • Kurdish
  • Kyrgyz
  • Lakota
  • Langi
  • Lao
  • Latvian
  • Lingala
  • Lithuanian
  • Lojban
  • LowerSorbian
  • LuleSami
  • Luxembourgish
  • Macedonian
  • Machame
  • Makonde
  • Malagasy
  • Malayalam
  • Malay
  • Maltese
  • Manx
  • Marathi
  • Masai
  • Meta
  • Mongolian
  • Nahuatl
  • NajdiArabic
  • Nama
  • Nepali
  • Ngiemboon
  • Ngomba
  • NKo
  • NorthernSami
  • NorthernSotho
  • NorthNdebele
  • Norwegian
  • Nyanja
  • Nyankole
  • Odia
  • Oromo
  • Osage
  • Ossetic
  • Papiamento
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Prussian
  • Punjabi
  • Romanian
  • Romansh
  • Rombo
  • Russian
  • Rwa
  • Saho
  • Sakha
  • Samburu
  • Sami
  • Sango
  • Santali
  • Sardinian
  • ScottishGaelic
  • Sena
  • Serbian
  • Shambala
  • Shona
  • SichuanYi
  • Sicilian
  • Sindhi
  • Sinhala
  • SkoltSami
  • Slovak
  • Slovenian
  • Soga
  • Somali
  • SouthernKurdish
  • SouthernSami
  • SouthernSotho
  • SouthNdebele
  • Spanish
  • Sundanese
  • Swahili
  • Swati
  • Swedish
  • SwissGerman
  • Syriac
  • Tachelhit
  • Tamazight
  • Tamil
  • Telugu
  • Teso
  • Thai
  • Tibetan
  • Tigre
  • Tigrinya
  • Tongan
  • Tsonga
  • Tswana
  • Turkish
  • Turkmen
  • Tyap
  • Ukrainian
  • UpperSorbian
  • Urdu
  • Uyghur
  • Uzbek
  • Venda
  • Vietnamese
  • Volapuk
  • Vunjo
  • Walloon
  • Walser
  • Welsh
  • WesternFrisian
  • Wolof
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Requirements

Java 1.8+

License

This project available under Apache 2.0 license