-
-
Notifications
You must be signed in to change notification settings - Fork 631
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support to detect languages in Word #2047
Comments
Comment 1 by jteh on 2012-01-17 21:08 |
Comment 2 by jhomme (in reply to comment description) on 2012-01-18 12:35 DetectLanguage There are other items that have to do with language. It also says that a Range is a piece of text of any size. Hope that helps.
|
Attachment bug-2047-auto-language-detection-Word.patch added by manish on 2013-02-22 11:33 |
Comment 3 by manish on 2013-02-22 12:26 |
Comment 4 by mdcurran on 2013-02-22 14:47 However, there are a few issues:
Thanks for your work. |
Comment 5 by manish (in reply to comment 4) on 2013-02-28 12:24
Replying to mdcurran:
|
Comment 6 by manish (in reply to comment 5) on 2013-02-28 13:45
|
Comment 7 by mdcurran (in reply to comment 5) on 2013-03-05 00:10
Not sure of the exact line number, but its in winword.py, in getTextWithFields. Otherthan this, things look good. I will await further fixes from you and I can also test with Chinese. |
Comment 8 by manish (in reply to comment 7) on 2013-03-10 13:45 The new patch attached after this comment takes care of the other 2 items: blank spaces and far east lang support. Please let me know if you face any problems with chinese. Regards, Replying to [comment:7 mdcurran]('documentFormatting'].copy() The):
|
Comment 9 by mdcurran on 2013-03-12 07:38 |
Comment 10 by manish (in reply to comment 9) on 2013-03-12 15:26
|
Attachment bug-2047-fixes-with-far-east-support.patch added by manish on 2013-03-12 15:28 |
Comment 11 by mdcurran on 2013-03-13 23:38 |
Comment 12 by nishimotz on 2013-03-22 10:34 They should be translated using the entries of \f and \n in symbols.dic. It occurs with main-5974 and 2013.1-5975 snapshots. |
Comment 13 by jteh on 2013-03-23 06:47 |
Attachment test-ja-en.docx added by nishimotz on 2013-03-23 14:09 |
Comment 14 by nishimotz on 2013-03-23 14:28 The attached file test-ja-en.docx contains four lines. According to the status bar of Japanese version of Microsoft Word 2013 (32bit), the line 1 and 2 are written in Japanese language. If automatic language switching option is enabled, the line feeds are always annunced as "line feed", although I expect "kaigyo" (line feed in Japanese) for line 1 and 2. Tested with nvda_snapshot_main-5974.exe and SAPI5 voice (Microsoft Haruka Desktop) which comes with Windows 8 Japanese (64bit). |
Comment 15 by nishimotz on 2013-03-24 04:35 As far as I know, the language attribute of "line feed" character is "en-us" even preceding and trailing characters are non-Latin Japanese characters. If my understanding is correct, for translating "line feed", it would be better to consider language setting of NVDA, rather than content property. |
Comment 16 by mdcurran on 2013-03-25 05:11 There is what I would consider a bug in MS Word where its impossible to select any non-eastern text (alpha numeric characters, line feed, page break etc) and tell MS Word its Japanese. This even includes when typing with a Japanese layout: MS Word marks japanese characters as Japanese, but not the typed line feeds. This is not a problem in all european languages. The line feeds seem to stay in the chosen language. The most obvious work around is for east-asian users to disable auto language switching. though I don't like this option. Remember though this is a problem with MS Word. Web documents, PDFs etc when marked up correctly would not have this problem. I'm itnerested in any opinions on what we should do here. We could:
|
Comment 17 by jteh (in reply to comment 16) on 2013-03-25 05:14
I'd probably choose this option. However, imo, it should only be done for Microsoft Word, since Word is the problem here, not these characters in general. Also, we should use current speech language, not interface language, just like we do for symbols normally. |
Comment 18 by jteh on 2013-03-25 05:15 |
Comment 19 by mdcurran on 2013-03-25 05:18 |
Comment 20 by jteh on 2013-03-25 05:55 Perhaps just ignoring the language when reading whitespace (or perhaps even line/page/paragraph marks) by character is sufficient. I just can't decide whether it's appropriate. To give us a bit of distance, if I'm arrowing through German text when English is my primary language, should I be hearing "space" and "line feed" in English or German? I would have thought German, but comment:14 disagrees. |
Comment 21 by mdcurran on 2013-03-25 23:38 |
Comment 22 by mdcurran on 2013-03-26 01:47 |
Comment 23 by nishimotz on 2013-03-26 03:57 Dear Manish, |
Reported by jfauchon on 2012-01-17 12:32
Hello,
Please, can you make support to detect languages full Microsoft Word.
Thanks for you Work.
The text was updated successfully, but these errors were encountered: