TextLevelRule conflicts with Java rules usage in LibreOffice plugin #669

TiagoSantos81 · 2017-02-17T14:23:16Z

After the implementation of the new caching mechanism, several issues appeared in the LibreOffice plugin.
The first and most obvious and pressing is the issue with UppercasecaseSentenceStart, that does not trigger properly inside LibreOffice, as seen in the sentence below.
"An error example, of this regression. it is possible to find the error in the beginning of this sentence."

Other related issues are the conflicts with prioritization in Java rules, seen in the Portuguese rules based in AbstractReplace class, and that affect other languages as well.

Last but not least, is the conflict with bitext rules, that demanded it to be disabled in the HTTP server, which may make it unavailable also in off-line usage of some plug-ins.

I would like to work on it, but I do not know where to find the relevant performance data that shows the benefits of the new caching mechanism, so that I can avoid further regressions.
In LanguageTool Regression test pagethe performance graphs do not show any statistically significant difference on any of the performance metrics, so I believe that the metrics that guide this change are not available on that portal.

Where can I find the relevant data? Am I allowed to work on it, or more work on that function is yet to be pushed?

PS - Not that not of the reported issues affect the website, that is working perfectly with all the previous functions.

danielnaber · 2017-02-17T19:20:50Z

After the implementation of the new caching mechanism, several issues appeared in the LibreOffice plugin.

The cache itself isn't used in LibreOffice. It's only used when the JLanguageTool object gets created with a cache, which is only the case for the HTTP server so far. So this problem is only related to the cache because I moved some rules to be TextLevelRules. Help with fixing the issues is very welcome.

In LanguageTool Regression test page the performance graphs do not show any statistically significant difference

That's because that check only tests a tiny one-word (or so) "sentence" that's so short it doesn't benefit from caching. You can use org.languagetool.rules.patterns.PerformanceTest to see that the cache actually work, if you modify it to run more often on the same text.

TiagoSantos81 · 2017-02-17T19:46:19Z

Excellent. Many thanks.
I will take this as an incentive to learn Java properly.

Many Java rules seem to already work properly, so maybe it is only the ones that are yet to be fully converted that have misbehave, although the UppercaseSentenceRule was actually converted.

Can the issue in it be related with the column adjustement workaround?
237c11a

The rule triggers in the beginning of a paragraph but not inside it, so:

 -              if (match.getLine() == 0) {
 -                newMatch.setColumn(match.getColumn() + 1);
 -              } else {
 -                newMatch.setColumn(match.getColumn());
 -              }
 +                newMatch.setColumn(match.getColumn() + 1);

Can the removal of the conditional in this way work without affecting the HTTP behaviour?

danielnaber · 2017-02-17T20:16:50Z

I'm not sure, I guess LO/LT integration works on character positions instead of lines and columns.

TiagoSantos81 · 2017-02-18T15:46:10Z

You are right. It doesn't. I tried it now and it does not change, but I am having odd results. If I change the language, e.g. to italian, I get the orthographical error triggers and the sentence trigger. For the untrained eye (like mine) it seemed a much easier solution. I will report any advances, but all tips are welcome.

milekpl · 2017-04-24T18:59:20Z

Ignore my commit in this thread, this was for issue #699. My bad.

FredKruse · 2017-08-24T18:38:40Z

I set the tokenizeText in the check for paragraph to 'true'.
This solved the Problem with the upperCaseRule and other textLevelRules like SentencesWhitespaceRule. I hope the value 'false' was just a bug and there are no other side effects. I tested it for German texts. It works well. Please test it for other languages.
But there is a general problem with the office extension. The textLevelRules works only for paragraphs. This is OK for rules like Uppercase or whitespace but isn't much satisfactory for rules like wordCoherencyRule.
This is because the office interface which is used works only on the level of paragraphs.
Is there anyone in the community who is so familiar with the office interface to advance the extension for the whole text?

TiagoSantos81 · 2017-08-24T19:56:13Z

Brilliant work, Fred.
I made several attempts and spent a lot of time on that file without results. I wouldn't have remembered to fiddle with that variable.
After testing it summarily, all seems to be working as expected. Since this should be tested in more languages, I would suggest closing in one week, if there are no reasonable complains.
The usefulness of LibreOffice and LanguageTool would be greatly improved, if there were more cross-development.

FredKruse · 2017-08-27T13:32:07Z

Thank you Tiago,
There are two points left so far I see:

A way should be found to analyze the whole text.
Rules that replaces a dot at the end of a sentences for example by a question mark don't work in the dialog box (F7) (out of the text using the right mouse button they work fine). The question mark is inserted before the dot (this produces the next error). The behavior is strange. I don't know if LT handles the dialog box and the suggestions in the text different or if there is a bug inside of libre office.
I would like to work on both points but it will take time because I am not familiar with the interface and also not with the LT-code. So I have to do a lot of research.

TiagoSantos81 · 2017-08-28T08:23:28Z

Rules that replaces a dot at the end of a sentences for example by a question mark don't work in the dialog box (F7)

I can confirm this. I tested it with the 'greeting and farewells' rules that I localized from German to Portuguese, since they have the exact same behaviour in both languages. For example, it replaces the dot in best regards. with best regards,. in an infinite cycle.
The rule works fine outside of the 'spelling and grammar' dialog (F7).

I do not know if this behaviour is new. In the coming days, I will test with an older release and see if it also happens, to see if it is somehow related with TextLevelRules.

I would like to work on both points but it will take time because I am not familiar with the interface and also not with the LT-code. So I have to do a lot of research.

If someone is working on it it is great. This part of the code links to too many parts, that have to be understood deeply. It would be great if you picked it up, but I know how disheartening spending time with this can be.

Regarding these problems, should a new issue be opened?

FredKruse · 2017-08-29T15:28:09Z

The problem with the dialog is not related with the TextLevelRules. I testet it with rules on sentences level.
Yes, I think, we should open two issues (they are not related, I think):

TextLevelRules are only working on the level of paragraphs
Sentences ending dot can't be over written in dialog box

TiagoSantos81 · 2017-08-31T09:07:14Z

This issue seems solved.
@FredKruse
I opened two new issue with the problems we were debating (#781 and #782). TextLevelRules are only working on the level of paragraphs this issue is self evident, but may benefit from more information.

danielnaber added the office-integration the LibreOffice/OpenOffice add-on label Feb 24, 2017

milekpl added a commit that referenced this issue Apr 24, 2017

[en] fix bug #669

b7a9b67

FredKruse added a commit that referenced this issue Aug 24, 2017

fix bug #669

2a5dcfe

TiagoSantos81 closed this as completed Aug 31, 2017

Mbodin pushed a commit to Mbodin/languagetool that referenced this issue Oct 4, 2017

fix bug languagetool-org#669

6be0183

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TextLevelRule conflicts with Java rules usage in LibreOffice plugin #669

TextLevelRule conflicts with Java rules usage in LibreOffice plugin #669

TiagoSantos81 commented Feb 17, 2017 •

edited

danielnaber commented Feb 17, 2017

TiagoSantos81 commented Feb 17, 2017 •

edited

danielnaber commented Feb 17, 2017

TiagoSantos81 commented Feb 18, 2017

milekpl commented Apr 24, 2017

FredKruse commented Aug 24, 2017

TiagoSantos81 commented Aug 24, 2017

FredKruse commented Aug 27, 2017

TiagoSantos81 commented Aug 28, 2017

FredKruse commented Aug 29, 2017

TiagoSantos81 commented Aug 31, 2017

TextLevelRule conflicts with Java rules usage in LibreOffice plugin #669

TextLevelRule conflicts with Java rules usage in LibreOffice plugin #669

Comments

TiagoSantos81 commented Feb 17, 2017 • edited

danielnaber commented Feb 17, 2017

TiagoSantos81 commented Feb 17, 2017 • edited

danielnaber commented Feb 17, 2017

TiagoSantos81 commented Feb 18, 2017

milekpl commented Apr 24, 2017

FredKruse commented Aug 24, 2017

TiagoSantos81 commented Aug 24, 2017

FredKruse commented Aug 27, 2017

TiagoSantos81 commented Aug 28, 2017

FredKruse commented Aug 29, 2017

TiagoSantos81 commented Aug 31, 2017

TiagoSantos81 commented Feb 17, 2017 •

edited

TiagoSantos81 commented Feb 17, 2017 •

edited