Skip to content

v0.1.4

Choose a tag to compare

@raeq raeq released this 25 Mar 19:02
· 284 commits to main since this release

What's New

9 new script transliterations, bringing built-in language support from 51 to 60:

Caucasian alphabets (pure data — no engine changes)

  • Georgian (lang="ka") — BGN/PCGN romanization, 114 mappings
  • Armenian (lang="hy") — BGN/PCGN romanization, 86 mappings + 5 ligatures

Brahmic abugidas (extended Indic engine)

  • Sinhala (lang="si") — 90 mappings, dedicated sinhala_char_role()
  • Tibetan (lang="bo") — 147 mappings, halanta + subjoined consonants
  • Myanmar (lang="my") — 89 mappings, virama + asat + medials
  • Khmer (lang="km") — 110 mappings, coeng virama, series-normalized

Tai scripts (new ScriptClass::Tai with tone stripping)

  • Thai (lang="th") — RTGS romanization, 87 mappings
  • Lao (lang="lo") — BGN/PCGN romanization, 67 mappings

Ethiopic alphasyllabary (pure data — no engine changes)

  • Ethiopic/Amharic (lang="am") — 307 mappings for Ge'ez syllabary

Engine Changes

  • is_indic() extended to cover Tibetan, Myanmar, Khmer ranges
  • New tibetan_char_role(), myanmar_char_role(), khmer_char_role() functions
  • Unicode range constants: TIBETAN, MYANMAR, KHMER
  • Comprehensive example + property-based tests for all 9 scripts

Full Changelog: v0.1.3...v0.1.4