-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Portuguese - pre+ post+ reform #96
Comments
Should Brazilian Portuguese use the existing PortugueseCompoundRule (i.e. pre-reform)? |
I am not sure, Daniel. Could you ask to the Brazilian person who is translating in Transifex? Thanks! Kind regards, On 21/04/2014 10:47, Daniel Naber wrote:
|
Work has started on this. |
Marco, could you provide a short sentence for testing, i.e. a sentence that has an error when pre-reform is set but not with post-reform (or vice versa)? |
Daniel, I am not sure if the messages I send to GitHub arrive there since I Here is an example: post-reform: Does this help? Kind regards, On 21/04/2014 13:11, Daniel Naber wrote:
|
Thanks. Yes, all your messages arrive. |
There's a problem: in LibreOffice 4.1 and 4.1, I can only see Portuguese for Angola, Portugal, and Brazil. We'll get quite a confusion if LT supports different variants (pre/post) but you cannot set them in LibreOffice. Are there any plans for LibreOffice to support both variants? |
Well, the trick is to change in the LT settings in AOO/LO manually. AOO/LO only accepts one pt_PT dictionary at a time... in other words, it AOO/LO doesn't know if we are using pre or post, it just uses the pt_PT Years ago I wrote to AOO's mailing list suggesting a M$ Office approach I hope this helps! Kind regards, On 21/04/2014 13:48, Daniel Naber wrote:
|
"AOO/LO doesn't know if we are using pre or post" - and thus LT won't know which compound rule to use. You might want to contact the LO people, maybe they are more responsive. |
Daniel, Why can't we have in Tools > Language Tool > configuration: :( On 21/04/2014 14:40, Daniel Naber wrote:
|
Because that's the wrong place... that field is only used for the false friend rule and we'd make everything very complicated if we also use it for something else. |
:(((((( I will try to e-mail AOO and LO after AOO 4.1 is released. How do I subscribe to the LO mailing list? PS-> Having two compound files is good. I will start working on the post Thanks! On 21/04/2014 15:09, Daniel Naber wrote:
|
Hey, Brazilian Portuguese translator here! Daniel, pt_BR faced some changes with this reform, just like pt-PT. So, I believe the same pre-/post-reform situation applies to pt_BR. (e.g. MS Office 2010 has same Portuguese mode for pt_BR) |
Closing this report, as it seems there's nothing we can do until the situation in LO/OO changes. Feel free to re-open when that changes. |
LO ship now with post AO dictionaries by default. Some issues reported by users come from the disagreement between spellchecking and compund word suggestions made by LanguageTools. Changing defaults solves this issue. |
A couple of years ago or so, I mentioned the idea of having a "Language Specific Settings". For pt_PT people would be able to change between the pre and post reform compounds since it is no big deal as one just needs to toggle between two compounds files. I also mentioned the need of allowing to suggest to replace words to Italic for foreign and Latin words. Easy too, since one could have a .txt with the words. And the last thing would be an option to convert from pre to post and vice versa. Also a .txt file with pre + TAB + post. I don't believe all this is hard to do and maybe someone who knows how to code in Java could implement it. |
I have read this post. That idea is great but it is difficult to implement.
Somebody would have already done it, if it was that easy. I do not know how to do it (maybe I will add learning Java to November TODO list /sarc/) but what I have done in my local build as I have mentioned before is just change the default pointers like this:
and this:
Even a empty compound.txt is more useful than false positives all over the place. But we will not lose the benefits of this feature since I pushed today a populated hyphenator. See: If you agree with it, I will push tomorrow the file changes needed to change the preset and later you and me will try to implement your toggle idea. Is this a reasonable compromise? |
Yes, it is a good idea :) |
Great! Them tomorrow I will run the compile tests again and push the changes. |
@danielnaber On the future solution, I re-read the discussion and I understand that a interface toggle just for Portuguese is off-limits. Portuguese - Angola (Pre-AO) So far free dictionaries to those variants are also the ones from Universidade do Minho. This would appease all portuguese still wanting the PreReform spell-checking from LanguageTools. |
I'm fine with this change. Did you run all tests with |
Thank you for the prompt reply, Daniel. |
* compound rules moved to relevant locales (AO, MZ, PT) * PT has all compound rules active by default * closes issue #96
Implemented in: @marcoagpinto |
Tiago, I will check it tonight, after the nightly is released. |
There is no hurry. |
@TiagoSantos81 Great work you have done! I was wondering if you could also add the name of the country in the other "Portuguese" variants? This is more noticed in the "mother tongue" where there are no flags and one just sees "Portuguese" several times. Also, is it possible to add a Portuguese pre and post agreement (two identical PT flags with the agreement type between brackets, just like it appears in Thunderbird)? Thanks! |
You can already test these changes compiling from source.
The dictionaries are the same for all pre-AO countries. |
* not as elegant as the suggestion on c55232c but still a major legebility improvement - may be reworked later. * TODO reused to identify scientific names as general exceptions do spellchecking * TODO new rule to advice italicizing scientific species names
I verified that the translated messages are updated and correct on Transifex. In the next sync, they are fixed. @danielnaber |
@TiagoSantos81 the updates work like this:
The proper solution would be to switch to a better i18n software like https://weblate.org, but that would also take time. Anyway, I've triggered the manual update now. |
@danielnaber |
There are |
After looking at that too many times, I was unable to spot it. Many thanks! I pushed the fix moments ago. |
* compound rules moved to relevant locales (AO, MZ, PT) * PT has all compound rules active by default * closes issue #96
With the new feature added by Daniel that allows to select subtypes of languages under a major language:
"English " > "US" "AU" "NZ" "GB" "CA" "ZA".
This made me think about adding support for Portuguese pre-reform and post-reform.
I guess I can start working on the Portuguese, pre-agreement and post-agreement.
It should appear in the combo box as:
Portuguese -> PT-PRE
Portuguese -> PT-POS
I was wondering if in the next version we could do something about this.
For example, these are the needed files:
As you all know, 2) is pre-reform, so far with around 3500 words. The problem is that with post-reform the compound rules have changed, which means I need to create another file with compound words. I will try to purchase a Portuguese Dictionary post-reform soon but the supermarket will probably only have it on stock when school time begins (within months).
4: http://natura.di.uminho.pt/download/sources/Dictionaries/openoffice/Pre-AO/
5: http://natura.di.uminho.pt/download/sources/Dictionaries/openoffice/Pos-AO/
I am sharing the dictionary files taken from Minho University, on my Dropbox:
PT-PRE: https://dl.dropboxusercontent.com/u/30674540/oo4x-pt-PT-preao-14.4.1.1.oxt.zip
PT-POS: https://dl.dropboxusercontent.com/u/30674540/oo4x-pt-PT-posao-14.1.1.1.oxt.zip
They are both dated from yesterday.
Could someone also create the .txt for the compound words post-agreement?
I looked in the supermarket and they do have a post-agreement dictionary, but it is "2013" and I can wait a couple of months or so for the "2014" to be released... meanwhile I can use the Priberam site to get some compound words post-agreement. Microsoft Office 2010 uses Priberam.
The grammar.xml works for both, so no need to create another file.
The text was updated successfully, but these errors were encountered: