New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nvda can read emojies #6523
Comments
P3, This is becoming more important as more and more sites / apps are using emojois rather than emoticons. There is some work to implement this, and some thought will be required on how to deal with translations. This may also be a good candidate for an addon, could the existing one be extended? |
will be nicer also if we have emogi, symbols and emoticons selecters On 11/1/16, Reef Turner notifications@github.com wrote:
|
Hi, about the question on Emoticons add-on, whose main author is Chris Leo, it's being extended to speak Emojis too. Work is been done to insert also emojis as well as emoticons. |
The translation part will probably be a bit difficult. I wonder whether Apple, for example, does all the emoji translations internally. I think we can't expect from our translation teams to translate all the emojis |
To what extent should the speaking of emoji characters be left to synthesizers? It might be worth looking at testing OneCore, it's language packs in different languages, and the Windows 10 emoji panel when the emoji panel functionality becomes available in other languages. People using synthesizers in languages other than English can also test by using this emoji catalog. For development purposes this Github iamcal/emoji-data repository looks useful. |
I looked into this a bit. The Unicode CLDR (Common Locale Data Repository) includes TTS descriptions for emoji and a whole bunch of other languages. You can find them in the common/annotations and common/annotationsDerived directories.
Some notes/thoughts:
|
@jcsteh commented on 28 mrt. 2018 04:05 CEST:
This makes sense.
This sounds suitable for a version 2 of the implementation.
It's not entirely clear to me what the differences are. Could you give an example of what you observed? |
@leonardder commented on Aug 1, 2018, 12:26 AM GMT+10:
Note that if the database doesn't cover some of the languages we need, our translators may want to translate them. At that point, we won't want to go for a "version 2" because we'll break all of their work.
As an example, the 😂 only appears in en (which is US). en_001 (which is the base for English) doesn't include it, nor does en_GB (which inherits directly from en_001). That means that en_GB doesn't include 😂 at all. See English Inheritance for details about inheritance for specific English locales. |
@jcsteh commented on 1 aug. 2018 00:58 CEST:
Fair point. Furthermore, it seems that the derived annotations mainly just stick the main annotations together, separated with a column which we'd like to ignore anyway. So if I'm correct, sticking to the base annotations gives us results that are close to what we want, unless:
I've seen these skin tone modifiers as part of the main annotations, but may be this is not the case for country flags? |
Looks like this repository might be a good way to go, it is perfectly kept up to date and contains exactly what we want: https://github.com/fujiwarat/cldr-emoji-annotation |
@jcsteh commented on 1 aug. 2018 00:58 CEST:
Based on this, it looks like en is currently the dictionary that is the most complete one, and as we don't specify an English dialect in NVDA, using en here might make sense. |
I have a prototype implementation in de @LeonarddeR i6523 branch (note that I'm using my private fork).
|
I think you might need to include en_001 and en, as en_001 is supposed to
be the base for all English locales according to that inheritance document.
In practice, I'm not sure whether there's anything in en_001 that isn't in
en.
|
@jcsteh commented on 2 Aug 2018, 23:21 CEST:
You're right. I've updated the code to support multiple sources per locale, which is also required for pt_br, which must include both pt and pt_BR sources. A different problem though, is whether we want the user to enable or disable processing of emojis. When we either disable or enable it in the gui, we can simply invalidate all data in the data map, so emojis either will or won't be loaded. However, config profile switches are more complex, as we don't want to invalidate all data for every config profile switch. Also, how should we treat emojis that are added as part of the user dictionary? It is easy to exclude the build in emoji dictionaries as they are in different files. The only way I can think of in case of user added emojis is adding an emojis category to the symbols files, becides symbols and complexSymbols. But then, data invalidation is still required, unless we use a separate regex for emojis within the symbol processor. |
I noticed that the annotations also contain symbols like ®, © and ™ which are strictly spoken not emojis I believe. |
cc @feerrenrut @michaelDCurran @josephsl, Do you have any thoughts about this issue? The emoji dictionary creation code is there and works quite well, the question is how to integrate these annotations into NVDA. Is it really necessary to have the ability to enable/disable them? |
I mentioned Chris, @Christianlm, the main author of the emoticons addon, but not correctly. |
Sorry for emotions, but it is very very bad innovation. |
Found the checkbox "Include Unicode Consortium data (including emoji) when processing characters and symbols". Thanks so much for the ability to disable this garbage. |
@kvark128 : what language are you using? Could you elaborate on what's so bad about the emoji translations for your language? |
Hello, I have found what I believe to be a bug. How to reproduce? 1- Open up notepad and past the below text: doesn't include 😂 at all. See 2- press home to go to the beginning of the line. 3- Press nvda + down arrow to read all the text. 4- You should hear "doesn't include face with tears of joy at all. See 5- Right until now everything is OK. 6- Now press home again and press twice ctrl + right arrow. You will hear doesn't include. 7- Now, press the right arrow. You will hear space and if you press right arrow again ... you will hear symbol d 8 3 d This demonstrates that when reading character by character the symbol is being somehow not processed or processed incorrectly. If you keep pressing control + right arrow though the symbol is read correctly. I had the same results using espeak ng in English or Portuguese with NVDA itself also either in English or Portuguese. One core voices (the two brazilian ones) with NVDA either in Portuguese or in English reported kind of the same behavior but in that case the character was not even read, a silence occurred when reading with arrows. The symbol was read correctly by read all or using ctrl arrows. |
This should have been fixed in Alpha builds of NVDA.
|
I am using nvda 218.4. This behavior occurred in Firefox and notepad as well.
Obrigado,
Marlon
Em 18 de dez de 2018, à(s) 04:00, Leonard de Ruijter <notifications@github.com> escreveu:
… This should have been fixed in Alpha builds of NVDA.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.
|
hi and sorry for asking this, i know that if you install a specific add on, nvda can read emoticons, but that one is the letter type, for example, this one, :) :( :@ etc. but this one is the a symbol that represents emojies. sorry if my information is not enough. thanks
The text was updated successfully, but these errors were encountered: