Orthography profile: "nd" as a single segment, vs "n d" in transnewguineaorg #13

XachaB · 2021-02-24T16:42:15Z

In this dataset, the sequence "nd" is segmented as a single segment:

https://github.com/lexibank/joophonosemantic/blob/master/etc/orthography.tsv#L192

Example:

Line 2897 in 602f561

    
           Enga-33_one-1,,Enga,33_one,m.e.nd.ɑ.i,m.e.nd.ɑ.i,m e nd ɑ i,,,,,^ m . e . nd . ɑ . i $,default

However, the same sequence is segmented as "n d" in transnewguineaorg:

enga-wapi-one-1,164073,enga-wapi,one,mendai,mendai,m e n d a i,,davies_and_comrie1985,,,^ m e n d a i $,default

Is it possible to normalize to one or the other ?

The text was updated successfully, but these errors were encountered:

XachaB · 2021-03-02T11:41:28Z

pinging @LinguList

LinguList · 2021-03-02T11:57:37Z

Look, @XachaB, this is not my idea, but the source, right? The source is already segmented. So you should bring this up in transnewguineaorg, where on eshould then discuss to merge all nd instances to prenasalized n + d, and the same for mb, ng, etc.! But here again, it should also be discussed with @SimonGreenhill.

XachaB · 2021-03-02T12:05:45Z

Noted, thanks

MuffinLinwist · 2024-08-02T15:15:42Z

I'm closing this since in transnewguineaorg the issue is already resolved.

XachaB mentioned this issue Mar 2, 2021

Should it be "nd" or "n d" ? Inconsistency across datasets lexibank/transnewguineaorg#18

Closed

MuffinLinwist closed this as completed Aug 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Orthography profile: "nd" as a single segment, vs "n d" in transnewguineaorg #13

Orthography profile: "nd" as a single segment, vs "n d" in transnewguineaorg #13

XachaB commented Feb 24, 2021

XachaB commented Mar 2, 2021

LinguList commented Mar 2, 2021

XachaB commented Mar 2, 2021

MuffinLinwist commented Aug 2, 2024

Orthography profile: "nd" as a single segment, vs "n d" in transnewguineaorg #13

Orthography profile: "nd" as a single segment, vs "n d" in transnewguineaorg #13

Comments

XachaB commented Feb 24, 2021

XachaB commented Mar 2, 2021

LinguList commented Mar 2, 2021

XachaB commented Mar 2, 2021

MuffinLinwist commented Aug 2, 2024