Skip to content

[indic] DIsallow matras after explicit half-form #556

@behdad

Description

@behdad

It was reported to me that we accept matras after a explicit half form (C,H,ZWJ) whereas Uniscribe doesn't. Indeed the Indic specs do not allow that in the grammar. I tracked this down to f0b8ed1:

commit f0b8ed1b6dd9f1d2b9084c101a6fc5dee0cc22a8
Author: Behdad Esfahbod <behdad@behdad.org>
Date:   Wed Sep 5 17:32:57 2012 -0400

    [Indic] Allow "H,ZWJ,M"
    
    Uniscribe accepts a Halant,ZWJ before matras.  Allow that.
    
    BENGALI down from 295 to 291
    DEVANAGARI down from 69 to 57
    GUJARATI down from 19 to 17
    KANNADA down from 871 to 867
    MALAYALAM down from 340 to 337
    TELUGU down from 20 to 16

Given how little the gains were (~4 per script), that does look quite suspicious to me now. We should re-evaluate.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions