Lao #71

scossu · 2023-11-08T13:49:47Z

Add support for Lao: https://www.loc.gov/catdir/cpso/romanization/lao.pdf

andjc · 2023-12-07T00:03:50Z

Discrepancies between interpretations of the 1997 and the 2012 Lao romanisations tables may need to be resolved. There is divergent practice that would affect mappings. See a draft note.

The Lao -> Latin mapping is a sieve, lots of data is lost, making a Latin -> Lao mapping problematic at the least. Lao, Thai and various other mappings would be better served by developing machine learning models for them. In the absence of an ML model, next best approach would be to base the assigned Latin -> Lao mapping on character frequencies based on analysis of a Lao corpus. It will also require Lao syllable boundary identification to distinguish syllable initial and final consonants.

Alternatively, it may be easier to map complete syllables form Lao -> Latin. Romanised syllables to Lao are a one-to-many mapping.

An example of many-to-one syllable mappings. But the situation becomes more complex with syllable final consonants.

scossu · 2023-12-07T13:52:52Z

We are exploring the use of Aksharamukha embedded in Scriptshifter for some East Asian and Southeast Asian languages. Currently there is experimental support for Bengali, Burmese, Devanagari, Gurmukhi, Japanese (Katakana + Hiragana - but slated for removal because not accurate enough), Tamil (+ Brahmi + extended), Thai, Tibetan via Aksharamukha.

Lao (in two versions) seems to be supported in Aksharamukha but we haven't tested it yet. If you were able to confirm the accuracy of Lao transliteration I could very easily add support for that in Scriptshifter.

Roman to Script transliteration support has a lower priority than Script to Roman at the moment. So if S2R transliteration is not reliable we can disable it on a script-by-script basis.

scossu added help wanted Extra attention is needed script labels Nov 8, 2023

scossu self-assigned this Nov 8, 2023

scossu added this to the Phase 3 milestone Feb 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lao #71

Lao #71

scossu commented Nov 8, 2023

andjc commented Dec 7, 2023 •

edited

scossu commented Dec 7, 2023

Lao #71

Lao #71

Comments

scossu commented Nov 8, 2023

andjc commented Dec 7, 2023 • edited

scossu commented Dec 7, 2023

andjc commented Dec 7, 2023 •

edited