LOCALIZATION REQUEST: #3266
Comments
Hey @alvynabranches, thanks so much for the request. Could you explain the multiple scripts used for the language, to help me understand your request? |
I will add this to the sentence collector once it is added to Pontoon, as the ISO code is not yet clear to me. |
As far as I understand, Devanagari is the official script, but in Goa, Latin can be used too. Afaik the ISO code is kok. |
Most Bibles and traditional books are written in Latin script. Our
requirement is for elderly people, who do not understand Devanagari
script, so we need Latin script for now.
…On Tue, 21 Sep 2021, 1:13 am Francis Tyers, ***@***.***> wrote:
As far as I understand, Devanagari is the official script, but in Goa,
Latin can be used too. Afaik ISO code is kok.
|
If people can contribute in Devanagari script, that will also be fine. But
we need Latin script for now.
On Tue, 21 Sep 2021, 9:54 am Alvyn Abranches, ***@***.***>
wrote:
… Most Bibles and traditional books are written in Latin script. Our
requirement is for elderly people, who do not understand Devanagari
script, so we need Latin script for now.
|
The Konkani Wikipedia seems to be in both scripts (with the language code gom). In any case, it does not matter exactly which script is chosen first for a multi-script language, but it should be possible to find at least 5000 public domain sentences in that script. For Serbian, for example, we went with Cyrillic over Latin because it was easy to find public domain texts and that was what the requester asked for. But the final choice is with the community and contributors. |
So which script will be posted first?
…On Tue, 21 Sep 2021, 10:10 am Francis Tyers, ***@***.***> wrote:
The Konkani Wikipedia
<https://gom.wikipedia.org/wiki/%E0%A4%AE%E0%A5%81%E0%A4%96%E0%A5%87%E0%A4%B2_%E0%A4%AA%E0%A4%BE%E0%A4%A8>
seems to be in both scripts (with the language code gom).
In any case it does not matter exactly which script is chosen first with a
multi-script language, but it should be possible to find at least 5000
public domain sentences in that script. For Serbian for example we went
with Cyrillic over Latin because it was easy to find public domain texts
and that was what the requester asked for. But the final choice is with the
community and contributors.
|
Whichever one is more convenient. I know some people working on NLP for Konkani and they have ~10k sentences in Konkani in Devanagari script that could be potentially used to bootstrap the sentence collector. Is it possible to convert sentences to and from Latin and Devanagari automatically? |
We have no converter for that. It would be great if you could set up the
sentence collector for both. Whoever is interested in either script can
post their sentences.
As far as I understand, voice collection will start once we reach 5000+
sentences. So for the sentence collector we can go for both scripts.
…On Tue, 21 Sep 2021, 10:33 am Francis Tyers, ***@***.***> wrote:
Whichever one is more convenient. I know some people working on NLP for
Konkani and they have ~10k sentences
<https://raw.githubusercontent.com/apertium/apertium-kok/master/corpus/kok.txt>
in Konkani in Devanagari script that could be potentially used to bootstrap
the sentence collector.
Is it possible to convert sentences to and from Latin and Devanagari
automatically?
|
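Since the thread proposes collecting sentences in either script, a contributor might want to sort a mixed sentence list by script before submitting it. The following is a minimal sketch, assuming Unicode character names are a good enough signal; the function name `detect_script` and the sample strings are illustrative and not part of Common Voice or the Sentence Collector. It only detects the script and does not convert between Latin and Devanagari.

```python
import unicodedata


def detect_script(sentence: str) -> str:
    """Classify a sentence as 'Deva', 'Latn', 'mixed', or 'unknown'.

    Characters are counted by the prefix of their Unicode name, so
    Devanagari vowel signs and digits count as Devanagari, while ASCII
    digits, spaces, and punctuation are ignored.
    """
    deva = 0
    latn = 0
    for ch in sentence:
        name = unicodedata.name(ch, "")
        if name.startswith("DEVANAGARI"):
            deva += 1
        elif name.startswith("LATIN"):
            latn += 1
    if deva == 0 and latn == 0:
        return "unknown"
    if deva > 0 and latn > 0:
        return "mixed"
    return "Deva" if deva > 0 else "Latn"


# Illustrative inputs only (the language name written in each script).
for text in ["कोंकणी", "Konknni", "Konknni कोंकणी", "123 !?"]:
    print(repr(text), detect_script(text))
```

Sentences reported as "mixed" or "unknown" would need manual review before going into either collection.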
Hi sir,
Is the sentence collector available for Konkani Latin Script? I am not able
to figure it out yet.
On Tue, 21 Sep 2021, 10:36 am Alvyn Abranches, ***@***.***>
wrote:
… We have no converter for that. It would be great if you could set up the
sentence collector for both. Whoever is interested in either script can
post their sentences.
As far as I understand, voice collection will start once we reach 5000+
sentences. So for the sentence collector we can go for both scripts.
|
The sentence collector is not available for Konkani yet. First we need to work out Pontoon and then enable it in the sentence collector. So a decision on script for localisation needs to be made first. We are currently working on this process, you can check out this link for an opportunity to participate in the New Language Workflow development. @anniedhempe, do you have any thoughts on this? In terms of language code and script. |
For script, we can proceed with Latin script first, because it is
understood by most of the people living in India as well as abroad.
Also, since most of my work and my team's work is in Latin script
only, it would be easy for me to help in the development of this process.
…On Tue, 21 Sep 2021, 7:08 pm Francis Tyers, ***@***.***> wrote:
The sentence collector is not available for Konkani yet. First we need to
work out Pontoon and then enable it in the sentence collector. So a
decision on script for localisation needs to be made first. We are
currently working on this process, you can check out this link
<https://discourse.mozilla.org/t/weekly-update-save-the-date-contribute-athon-global/86137>
for an opportunity to participate in the New Language Workflow development.
@anniedhempe <https://github.com/anniedhempe>, do you have any thoughts
on this? In terms of language code and script.
|
Hey @alvynabranches, thanks so much for the clarification. I'm currently making Konkani available on Pontoon, but I am struggling to find the CLDR plurals for Konkani. By any chance, do you know them? In addition, can you confirm the plural rule is n=2? Many thanks, Hillary |
Yes, that's right.
We express the plural either with the exact number or with khup (many)
placed before the plural word.
…On Tue, 21 Sep 2021, 9:22 pm Hillary, ***@***.***> wrote:
Hey @alvynabranches <https://github.com/alvynabranches> ,
Thanks so much for the clarification. I'm currently making Konkani
available on Pontoon but I am struggling to find the CLDR Plurals for
Konkani, by any chance do you know them ? In addition can you confirm the
plural rule is n=2 ?
Many thanks,
Hillary
|
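For readers unfamiliar with plural rules: if "n=2" above is read as "the language has two plural forms" (an assumption, since the thread does not spell it out), the selection logic resembles the common gettext two-form rule `nplurals=2; plural=(n != 1)`. The sketch below illustrates that rule only. It is not taken from CLDR or Pontoon data for Konkani, and the template strings are English placeholders.

```python
def plural_index(n: int) -> int:
    """Two-form rule (assumed): index 0 when n == 1, index 1 otherwise."""
    return 0 if n == 1 else 1


def choose_form(n: int, forms: tuple[str, str]) -> str:
    """Pick the singular or plural template and fill in the count."""
    return forms[plural_index(n)].format(n=n)


# Placeholder English templates, not real Konkani localisation strings.
FORMS = ("{n} sentence", "{n} sentences")

print(choose_form(1, FORMS))
print(choose_form(5, FORMS))
```

Whether zero takes the singular or the plural form differs between languages, so the actual rule should come from the CLDR data once it is confirmed.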
I can contribute sentences in Devanagari script for Konkani. |
Great. But our institute needs it in Latin Script. And it is easy to read
for the elderly people.
…On Fri, 24 Sep 2021, 7:36 pm Annie Rajan, ***@***.***> wrote:
I can contribute sentences in Devanagari script for Konkani.
|
Sir, I do not know Latin script.
Annie
On Fri, Sep 24, 2021 at 8:10 PM, Alvyn ***@***.***> wrote:
Great. But our institute needs it in Latin Script. And it is easy to read
for the elderly people.
|
Hi Common-Voice/Common-Voice,
Any response on adding Konkani Latin Script?
|
Hey @alvynabranches and @anniedhempe, thanks for your patience. I would like to ask if you could share your thoughts on the following.
Currently, the Common Voice platform only supports one script for a language. However, we have already identified that this is a problem for many communities and are excited to let you know that it is in the roadmap to address around Q2 2022!
Currently, localisations are tied to the dataset, so if we make different localisations as per the request, we would be splitting the Konaki language into two distinct datasets. Given that in the next 6 months we already intend to support multiple scripts for a language, you may wish to wait and make use of that option.
To recap, there are three paths:
A. We enable both scripts but as completely different languages that will not be part of the same dataset, e.g. you would have Konaki (Latin) as one dataset and Konaki (Devanagari) as another. This would be available now, but would fragment the datasets and make them more difficult to use together.
B. Wait to progress Konaki altogether until the feature for multiple scripts is launched in the new year (circa May 2022).
C. Continue to localise Konaki and launch the dataset in one script, and then next year we will add the other. You would need to agree on which script is better to start with as a community and let us know.
Please let me know what you think. |
Sounds good to me. Even if the two scripts cannot be merged, that is fine
with me, because I will mostly be working with Latin script. Anyone else
who is interested in Devanagari script can work on that. Thank you for your support.
…On Mon, Oct 4, 2021 at 5:41 PM Hillary ***@***.***> wrote:
Hey @alvynabranches <https://github.com/alvynabranches> and @anniedhempe
<https://github.com/anniedhempe>
Thanks for your patience, I would like to ask if you could share your
thoughts on the following..
Currently, the common Voice platform only supports one script for a
language. However, we have already identified that this is a problem for
many communities and are excited to let you know that it is in the roadmap
to address around Q2 2022!
Currently, localisations are tied to the dataset, so if we make different
localisations as per the request, we would be splitting the Konaki language
into two distinct datasets. Given that in the next 6 months we already
intend to support multiple scripts for a language, you may wish to wait and
make use of that option.
*To recap, there are three paths;*
A. We enable both scripts but as completely different languages that will
not be part of the same dataset e.g you would have Konaki (Latin) as one
dataset and Konaki (Devanagari) as another. This would be available now,
but would fragment the datasets and make them more difficult to use
together.
B. Wait to progress Konaki altogether until the feature for multiple
scripts is launched in the new year (circa May 2022)
C. Continue to localise Konaki and launch the dataset in one script, and
then next year we will add the other. You would need to agree on which
script is better to start with as a community and let us know.
Please let me know what you think ?
|
Ideally there would be agreement from users of both scripts as to the best way forward. Also, "this is fine for me" does not really work here because there are issues of who gets which language code etc. It will make a lot of work for the team if the language codes have to be switched later, so it's better to come up with a consensus now. If you'd like to make a suggestion on how to deal with both scripts with respect to language code and dataset, we'd love to hear it. |
When I said "me", I meant my organization.
…On Mon, 4 Oct 2021, 7:35 pm Francis Tyers, ***@***.***> wrote:
Ideally there would be agreement from users of both scripts as to the best
way forward. Also, "this is fine for me" does not really work here because
there are issues of who gets which language code etc. It will make a lot of
work for the team if the language codes have to be switched later, so it's
better to come up with a consensus now. If you'd like to make a suggestion
on how to deal with both scripts with respect to language code and dataset,
we'd love to hear it.
|
Our organization works with Goan migrants in foreign countries, elderly
people, and health-sector interviews, and these people mostly deal with
Latin script. Those in the education sector use Devanagari script. Within
our organization's scope, we have no use for Devanagari script, and our
developers would prefer Latin script over Devanagari. If anyone else is
interested in Devanagari, they can go ahead with that script; I know Goa
University is working with Devanagari script. So if both scripts are
available, it will be good for both communities.
On Mon, 4 Oct 2021, 8:32 pm Alvyn Abranches, ***@***.***>
wrote:
… When I said "me" it meant my organization.
|
I completely agree that if both scripts are available it will be good for the communities. The question remains about how to assign language codes. One possible option would be to use knn for Konkani (in Devanagari) and gom for Goan Konkani (in Latin). These could then be merged into kok at a possible future date. However, this is unsatisfactory because Goan Konkani may also be written in Devanagari. Another option would be to do what Wikipedia does, and to include the sentences in both scripts. When making the decision we need to take into account the whole language community, not just one organisation. If you would like an organisation-specific Common Voice, then you can set up your own instance. We are looking for a holistic solution, not a solution that only takes one subset of the speaker population into account. |
Thank you for the clarification. The codes look good for now.
…On Mon, 4 Oct 2021, 8:52 pm Francis Tyers, ***@***.***> wrote:
I completely agree that if both scripts are available it will be good for
the communities. The question remains about how to assign language codes.
One possible option would be to use
- knn <https://iso639-3.sil.org/code/knn> for Konkani (in Devanagari)
- gom <https://iso639-3.sil.org/code/gom> for Goan Konkani (in Latin)
These could then be merged into kok at a possible future date. However
this is unsatisfactory because Goan Konkani may also be written in Devanagari.
Another option would be to do what Wikipedia
<https://gom.wikipedia.org/wiki/%E0%A4%AE%E0%A5%81%E0%A4%96%E0%A5%87%E0%A4%B2_%E0%A4%AA%E0%A4%BE%E0%A4%A8>
does, and to include the sentences in both scripts:
[image: imatge]
<https://user-images.githubusercontent.com/449545/135878033-4e557007-4f9e-49eb-8f3f-1ff881eab148.png>
When making the decision we need to take into account the whole language
community, not just one organisation. If you would like an
organisation-specific Common Voice, then you can set up your own instance.
We are looking for a holistic solution, not a solution that only takes one
subset of the speaker population into account.
|
Ok, let's get some input from some other people and make sure that we are getting an inclusive consensus. Do you know anyone at Goa University you could contact? |
Yes, I know one person there. She is a professor; her name is Jyoti
Pawar.
…On Mon, 4 Oct 2021, 10:14 pm Francis Tyers, ***@***.***> wrote:
Ok, let's get some input from some other people and make sure that we are
getting an inclusive consensus. Do you know anyone at Goa University you
could contact?
|
I had a discussion with my dev folks. They said it will also be OK if
both scripts accompany the dataset, as long as Latin script is also
included in the final dataset.
On Mon, 4 Oct 2021, 10:52 pm Alvyn Abranches, ***@***.***>
wrote:
… Ya, I know 1 person there. She is a professor over there. Her name is
Jyoti Pawar.
|
Hey everyone, thanks for sharing your thoughts regarding the localisation of Konaki. To confirm, I will be enabling Pontoon with the following ISO codes (once there is community consensus):
Alvyn, could you share the contact details for Jyoti? One way we can communicate to people how the script decision was made is in the language's Pontoon description and in the L10N style guide, which lists styles for languages on Pontoon. @MichaelKohler, on the sentence collector, is there any way we could signify which script is being used? As a suggestion, could it be under the name of the language? |
Probably for sentence collector the easiest way is in the native name, e.g.
Is the eventual plan to merge these two? |
Yes, what @ftyers said is spot on. Do you think this would be clear enough for contributors? |
Are there any updates on the Konkani language sentence collector?
…On Fri, 8 Oct 2021, 7:53 pm Michael Kohler, ***@***.***> wrote:
Yes, what @ftyers <https://github.com/ftyers> said is spot on. Do you
think this would be clear enough for contributors?
|
We will add it to the Sentence Collector once it's added to Pontoon. |
Hey @anniedhempe, are you happy with the proposed solution(s)? Please let us know your thoughts so we can proceed. |
Any update on this?
…On Fri, Oct 22, 2021 at 3:45 PM Hillary ***@***.***> wrote:
Hey @anniedhempe <https://github.com/anniedhempe> ,
Are you happy with the proposed solution(s)?
<#3266 (comment)>
Please let us know your thoughts. So we can proceed.
|
Hey @alvynabranches, as per my previous message, I want to ensure there is some community consensus on this before starting to localise. I have reached out to @anniedhempe, who was involved in the discussion via GitHub, and I am waiting for her to also weigh in. You mentioned that you had a contact, Jyoti, who might also be interested in discussing the script choice. By any chance, have you reached out to Jyoti? |
Sir, I have been using the Devanagari script of Konkani for my work. Madam Jyoti is also working with the Devanagari script of the language. I would like to know what I am supposed to do.
Annie
On Mon, Nov 29, 2021 at 5:19 PM, ***@***.***> wrote:
Hey @alvynabranches ,
As per my previous message, I want to ensure there is some community consensus on this before starting to localise. I have reached out to @anniedhempe, who was involved in the discussion via github and I am waiting for her to also way in. You mentioned that you had a contact Jyoti, who might also be interested in discussing the script choice.
By any chance have you reached out to Jyoti ?
|
Hey Annie, thanks for your response. On October 4th, I shared three options we could take with handling varying scripts. Could you please share your thoughts on this ? Reference: #3266 (comment) |
Sir, in Goa the officially accepted script is Devanagari, so I think we should begin with the officially accepted script first. Other scripts are Roman (Romi Konkani), Kannada, Malayalam and Perso-Arabic. I will send the link to my paper:
A survey of Konkani NLP resources
The first comprehensive survey is presented of natural language processing (NLP) research in Konkani, a low-resource regional Indian language with a s…
It is all about this language.
Annie
On Mon, Nov 29, 2021 at 6:02 PM, ***@***.***> wrote:
Hey Annie, thanks for your response. On October 4th, I shared three options we could take with handling varying scripts. Could you please share your thoughts on this ?
Reference: #3266 (comment)
|
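Hey Annie, thanks for clarifying your position. Overall, I understand your preference for the script, but not which of the three proposed options you prefer. The three options are:
A. We enable both scripts but as completely different languages that will not be part of the same dataset, e.g. you would have Konaki (Latin) as one dataset and Konaki (Devanagari) as another. This would be available now, but would fragment the datasets and make them more difficult to use together.
B. Wait to progress Konaki altogether until the feature for multiple scripts is launched in the new year (circa May 2022).
C. Continue to localise Konaki and launch the dataset in one script, and then next year we will add the other. You would need to agree on which script is better to start with as a community and let us know.
Also, could you please refer to me as "Ms", as my pronouns are she/her. Thank you |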
Madam, I think Devanagari script is better to start with because those from Karnataka and Kerala can read Devanagari. And the language is Konkani, not Jonaki.
Annie
On Mon, Nov 29, 2021 at 6:23 PM, ***@***.***> wrote:
Hey Annie,
Thanks for clarifying your position. Overall I understand your preference for the script but not out of three options proposed.
The three options are:
A. We enable both scripts but as completely different languages that will not be part of the same dataset e.g you would have Konaki (Latin) as one dataset and Konaki (Devanagari) as another. This would be available now, but would fragment the datasets and make them more difficult to use together.
B. Wait to progress Konaki altogether until the feature for multiple scripts is launched in the new year (circa May 2022)
C. Continue to localise Konaki and launch the dataset in one script, and then next year we will add the other. You would need to agree on which script is better to start with as a community and let us know.
Also, could you please refer to me as "Ms" as my pronouns are she/her.
Thank you
|
Hey @alvynabranches and @anniedhempe, sorry for the delay in responding. Thanks for your contributions to this discussion. Your responses imply that you prefer the first option (option A), which I can implement.
The codes will be knn for Konkani (in Devanagari) and gom for Goan Konkani (in Latin). Earlier in this conversation, we highlighted the inclusion of variants for languages and the use of BCP-47, which provides flexibility for orthography. I would encourage you to check out our blog, which explains the variants in more detail: https://foundation.mozilla.org/en/blog/how-we-are-making-common-voice-even-more-linguistically-inclusive/ Could we host a virtual meeting to discuss the possible variants with you? @anniedhempe, could you please provide a Pontoon username, so I can set you up for Konkani (in Devanagari)? |
Thanks for the mail. I am comfortable with whatever time you decide, provided it is daytime in India. It can be annraj. Annie
On Tue, Jan 11, 2022 at 5:09 PM, ***@***.***> wrote:
Hey @alvynabranches and @anniedhempe,
Sorry for the delay in responding.
Thanks for your contributions to this discussion. Your responses imply that you prefer the first option (option A), which I can implement.
A. We enable both scripts but as completely different languages that will not be part of the same dataset, e.g. you would have Konkani (Latin) as one dataset and Konkani (Devanagari) as another. This would be available now, but would fragment the datasets and make them more difficult to use together.
The codes will be knn for Konkani (in Devanagari) and gom for Goan Konkani (in Latin).
Earlier in this conversation, we highlighted the inclusion of variants for languages and the use of BCP-47, which provides flexibility for orthography. I would encourage you to check out our blog, which explains the variants in more detail: https://foundation.mozilla.org/en/blog/how-we-are-making-common-voice-even-more-linguistically-inclusive/
Could we host a virtual meeting to discuss the possible variants with you?
@anniedhempe, could you please provide a Pontoon username, so I can set you up for Goan Konkani?
|
Thanks @anniedhempe, I will send you a few dates once @alvynabranches responds. You can also create a profile on Pontoon using this link: https://pontoon.mozilla.org/accounts/fxa/login/ |
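Hey @alvynabranches,
Thanks for your patience.
Goan Konkani is now syncing with Common Voice <https://pontoon.mozilla.org/gom/> and will be available for localisation shortly.
I have also added you as a team manager. This means all your work will go to staging (and later to production) without peer review. I hope that, through your work, others will join you to collaborate on the different phases of this project.
Here are a few docs that will help you with using Pontoon, working with other contributors, and localizing the file written in .ftl:
- Pontoon user guide <https://mozilla-l10n.github.io/localizer-documentation/tools/pontoon/index.html>
- Roles and responsibilities <https://mozilla-l10n.github.io/localizer-documentation/community/index.html>
- Localizing in Fluent <https://mozilla-l10n.github.io/localizer-documentation/tools/fluent/index.html>
- Common Voice web part project info <https://pontoon.mozilla.org/projects/common-voice/info/>
Thank you and welcome to the Mozilla l10n and Common Voice community!
If you have any questions, join us on the community chat <https://chat.mozilla.org/#/room/%23common-voice:mozilla.org>. If you would like any community support, I'm happy to share ideas and support you. |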
Thank you for the update. I am available at IST day timings.
…On Tue, Jan 11, 2022, 5:46 PM Hillary ***@***.***> wrote:
Hey @alvynabranches <https://github.com/alvynabranches> ,
Thanks for your patience
Goan Konkani is now syncing with Common Voice
<https://pontoon.mozilla.org/gom/> and will be available for localisation
shortly.
I have also added you as a team manager. This means all your work will go
to staging (and later to production) without peer review. I hope through
your work, there would be others joining you to collaborate on the
different phases of this project.
Here are a few docs that will help you with using Pontoon, working with
other contributors, and localizing the file written in .ftl:
- Pontoon user guide
<https://mozilla-l10n.github.io/localizer-documentation/tools/pontoon/index.html>
- Roles and responsibilities
<https://mozilla-l10n.github.io/localizer-documentation/community/index.html>
- Localizing in Fluent
<https://mozilla-l10n.github.io/localizer-documentation/tools/fluent/index.html>
- Common Voice web part project info
<https://pontoon.mozilla.org/projects/common-voice/info/>
Thank you and welcome to the Mozilla l10n and Common Voice community!
If you have any questions join us on the community chat
<https://chat.mozilla.org/#/room/%23common-voice:mozilla.org>, also if
you would like any community support I'm happy to share ideas and support
you.
|
I have now checked the Common Voice Sentence Collector. It shows only "gom",
and some people might not know that this means Konkani in Latin script. It
would be better if the name were spelled out in full. For the Devanagari
script, too, it just shows "knn".
Thanks.
|
I did this, but I do not know what to do next. Annie
On Tuesday, 11 January, 2022, 05:14:29 pm IST, Hillary ***@***.***> wrote:
Thanks @anniedhempe I will send you a few dates once, @alvynabranches responds - also you can create a profile on Pontoon using this link: https://pontoon.mozilla.org/accounts/fxa/login/
|
All you need to do next is share your profile link with me. For example, mine is https://pontoon.mozilla.org/contributors/UJeECCxearEuO2zZfY6iyzbuPn8/ You can find it in the menu that opens when you click on your username. |
That will be updated on the next deployment after the native language name is added in the Pontoon strings. |
***@***.***
I hope this is right
On Tuesday, 11 January, 2022, 08:47:03 pm IST, Hillary ***@***.***> wrote:
All you need to do next is share with me your profile user link for example mine is https://pontoon.mozilla.org/contributors/UJeECCxearEuO2zZfY6iyzbuPn8/
You can access this via the menu, when clicking on your username.
|
Hey Annie, because of our privacy settings, I'm unable to see the email
address that you shared with me. It comes up like this: **@**
If you could please provide your profile link, I can then add you.
Alternatively, you can email the Common Voice team by filling out the contact
form on our website. <https://commonvoice.mozilla.org/en>
…On Tue, 11 Jan 2022 at 15:23, Annie Rajan ***@***.***> wrote:
***@***.***
I hope this is right
|
***@***.***
https://pontoon.mozilla.org/contributors/Daa6FAmK9J0eZ9O9ekevzG9b0aY/
Is this OK?
|
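Thanks Annie, I have now made you the manager for Konkani (Devanagari). This means all your work will go to staging (and later to production) without peer review. I hope that, through your work, others will join you to collaborate on the different phases of this project.
Here are a few docs that will help you with using Pontoon, working with other contributors, and localizing the file written in .ftl:
- Pontoon user guide <https://mozilla-l10n.github.io/localizer-documentation/tools/pontoon/index.html>
- Roles and responsibilities <https://mozilla-l10n.github.io/localizer-documentation/community/index.html>
- Localizing in Fluent <https://mozilla-l10n.github.io/localizer-documentation/tools/fluent/index.html>
- Common Voice web part project info <https://pontoon.mozilla.org/projects/common-voice/info/>
Thank you and welcome to the Mozilla l10n and Common Voice community!
If you have any questions, join us on the community chat. If you would like any community support, I'm happy to share ideas and support you. |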
Use this template to request a new localizable language that is currently not available on Pontoon.
Note: This issue only applies to the web interface language - in order to activate language contributions on Common Voice you will also need to ensure 5,000+ sentences are available to be read in that language. Please refer to the full Language documentation for more details.
Language name
What language would you like to add?
Konkani
Language code
ISO 639 2/B
Language size
Number of active speakers of this language in the world - 3 million to 3.5 million
Plural forms
How would you translate the following in this language?
0 rocks - fatôr na
1 rock - ek fatôr
2 rocks - don fatôr
3 rocks - tin fatôr
4 rocks - char fatôr
5 rocks - panch fatôr
10 rocks - dah fatôr
20 rocks - vis fatôr
100 rocks - sambar fatôr
1000 rocks - hôzar fatôr
I see 0 rocks on the ground - Maka zomnir fatôr diso na
I see 1 rock on the ground - Maka zomnir ek fatôr dista
I see 10 rocks on the ground - Maka zomnir dah fatôr dista
I see rocks on the ground - Maka zomnir khup fatôr dista
Pontoon manager
https://pontoon.mozilla.org/contributors/eTnOEuxbFg7f5DhwUCTUBLtcqKg/
Language Script
What is the name of the language scripts used to write your language?
Latin, Roman, Devanagari, Kannada, Malayalam, Arabic. (Latin, Roman preferred, also Devanagari)