Allow Complex Parsing of Functions #1200

manticore-projects · 2021-05-20T05:20:00Z

This PR will determine the nested brackets "(" ")" and configure the Parser in order to parse Complex Function Parameters like to_char( a = b) and Named Function Parameters like trim(BOTH ' ' from 'foo bar ') ONLY WHEN the level of nesting functions is below a threshold = 7. Although this is a configurable Parser Feature.

This massively speeds up the parser. The whole NestedBracketsPerformanceTest Suite takes less than 3 seconds now, with 50! iterations for the concat test.

At the same time we can parse Complex Expression Parameters for all Functions (but only when there is no deep recursion of more than 9 levels although this detail is mood because it would not work in anyway because of the slowdown).

Fixes issues #1190 #1103

This is a CLEAN PR handcrafted against the latest Upstream MASTER, containing only the minimal changes.

Fixes issues JSQLParser#1190 JSQLParser#1103

Fixes issue JSQLParser#1194

manticore-projects · 2021-05-20T06:26:33Z

LOL! The CI server is too slow for the performance test timeouts.

coveralls · 2021-05-20T06:32:48Z

Coverage increased (+0.001%) to 88.531% when pulling cb1cb16 on manticore-projects:AllowComplexParsingClean into a5204f6 on JSQLParser:master.

wumpz · 2021-05-25T21:37:32Z

Building an upper boundary is quite a good idea. Could you please resolve the conflicts. I already merged your case when pr. My first changes for adaption the ValuesStatement are included as well. Maybe we could skip those MultiExpressionLists, because they are handles now by SimpleExpressonLists as well.

…plexParsingClean Conflicts: src/main/jjtree/net/sf/jsqlparser/parser/JSqlParserCC.jjt src/test/java/net/sf/jsqlparser/statement/select/NestedBracketsPerformanceTest.java src/test/java/net/sf/jsqlparser/statement/select/SelectTest.java

manticore-projects · 2021-05-26T07:19:44Z

Merged the latest Master Branch.

wumpz · 2021-05-26T20:46:22Z

So could you somehow better document, how the nesting depth is calculated or modified?

wumpz · 2021-05-26T21:10:11Z

BTW I check your modifications. Since complex parsing is now allowed for functions I removed FunctionWithCondParams. But now your depth of seven is not enough for multiple tests. Unfortunately I have problems for accepting one test.

manticore-projects · 2021-05-26T22:52:23Z

So could you somehow better document, how the nesting depth is calculated or modified?

It is a simple count of the maximum Open Bracket (. If there are more than 7 Open Brackets we assume deep nesting and do not allow complex parsing.

BTW I check your modifications. Since complex parsing is now allowed for functions I removed FunctionWithCondParams. But now your depth of seven is not enough for multiple tests. Unfortunately I have problems for accepting one test.

Out of my head, I foresee a situation where you have a deep nesting on a function (without complex expressions) and one complex expression somewhere else (without deep nesting). That would fail because Complex Parsing and Deep Nesting are now mutual exclusive.

However, I kept experimenting when addressing the CASE WHEN expressions.
Instead of counting the Open Brackets, we can define a Nested Expression Counter in the Grammar File and increment it when ever we enter a Primary Expression and decrement it whenever we leave a Primary Expression. If this worked we would have much better granularity on when to give up Complex Parsing.

Please let me know your particular failing test case and I will have a look at it promptly.

manticore-projects · 2021-05-27T23:43:01Z

@wumpz: You have modified the Brackets Threshold for Complex Parsing from 7 to 10. Unfortunately, the 7 has been selected for a purpose:

old duration 1565 new duration time 6873 for SELECT concat(concat(concat(concat(concat(concat(concat(concat(concat('A','B'),'B'),'B'),'B'),'B'),'B'),'B'),'B'),'B') FROM mytbl

It is the point when the performance impact LOOKAHEAD amplifies too much. This was also the reason why I have had defined timeout for the unit tests and I would like to recommend to leave it at 2000 and not to accept any change that breaks these constraints again. (2 seconds for parsing 1 single statement is too much already).

So, can we set the Threshold back to 7 again please? I am still working on a more fine granular solution, where the nesting is determined as per expression.

wumpz · 2021-06-02T12:05:34Z

Understand. The problem is, you are calculating the "complexity" of a statement via running through the parenthesis. So you are taking parenthesises into account that have nothing to do with complex expressions. Somehow we should count the deepness within the productions. I don't know, if that is even possible and limit there to a level of 7 or 10.

manticore-projects · 2021-06-02T12:13:50Z

It is not perfect, but it solves maybe 90% of the "real life" problems. When my PRs are merged then I would like to count the real nesting in the Primary Expression, same as I did for the CASE statement. Problem is really that I have so many loose ends right now because the acceptance is a bit slow.

…

On Wed, 2021-06-02 at 05:05 -0700, Tobias wrote: Understand. The problem is, you are calculating the "complexity" of a statement via running through the parenthesis. So you are taking parenthesises into account that have nothing to do with complex expressions. Somehow we should count the deepness within the productions. I don't know, if that is even possible and limit there to a level of 7 or 10. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

wumpz · 2021-06-02T12:21:44Z

Critics acknowledged. I try to improve.

wumpz · 2021-06-02T12:22:40Z

Does this case counting work for lookaheads as well? If I remember right, this case deepness was never checked, or am I wrong?

manticore-projects · 2021-06-02T12:34:31Z

On Wed, 2021-06-02 at 05:22 -0700, Tobias wrote: Does this case counting work for lookaheads as well?

Yes.

If I remember right, this case deepness was never checked, or am I wrong?

It was not needed for the cases but you see a similar preparation: 1) we have a global atomic LEVEL COUNT variable (this works well, because the parser works strictly LL) 2) when entering a Primary Expression we count up 3) when leaving the Primary Expression we count down This will give us the true nesting level. We will stop for looking for Complex Expressions below a certain threshold. I am keen to implement and test and benchmark this better logic as soon as everything else has been tied up. After that we can also look again if JavaCC21 can help anything. or not. Cheers!

Allow Complex Parsing of Functions

c9bfdc3

Fixes issues JSQLParser#1190 JSQLParser#1103

manticore-projects mentioned this pull request May 20, 2021

support named parameters #702

Merged

Apply Complex Parsing to PrimaryExpression()

a51a204

Fixes issue JSQLParser#1194

manticore-projects mentioned this pull request May 20, 2021

Parsing fails for conditions with brackets in SELECT clause #1194

Closed

Increase Test Timeout to 2 seconds for slow CI Servers.

42abed4

manticore-projects mentioned this pull request May 25, 2021

Suggestion: Enhance the SQL Formatter dbeaver/dbeaver#12240

Open

manticore-projects added 2 commits May 26, 2021 13:27

Appease Codazy

cb1cb16

wumpz merged commit 3a5da44 into JSQLParser:master May 26, 2021

manticore-projects deleted the AllowComplexParsingClean branch November 28, 2021 07:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow Complex Parsing of Functions #1200

Allow Complex Parsing of Functions #1200

manticore-projects commented May 20, 2021 •

edited

manticore-projects commented May 20, 2021

coveralls commented May 20, 2021 •

edited

wumpz commented May 25, 2021

manticore-projects commented May 26, 2021

wumpz commented May 26, 2021

wumpz commented May 26, 2021 •

edited

manticore-projects commented May 26, 2021

manticore-projects commented May 27, 2021

wumpz commented Jun 2, 2021

manticore-projects commented Jun 2, 2021 via email

wumpz commented Jun 2, 2021

wumpz commented Jun 2, 2021

manticore-projects commented Jun 2, 2021 via email

Allow Complex Parsing of Functions #1200

Allow Complex Parsing of Functions #1200

Conversation

manticore-projects commented May 20, 2021 • edited

manticore-projects commented May 20, 2021

coveralls commented May 20, 2021 • edited

wumpz commented May 25, 2021

manticore-projects commented May 26, 2021

wumpz commented May 26, 2021

wumpz commented May 26, 2021 • edited

manticore-projects commented May 26, 2021

manticore-projects commented May 27, 2021

wumpz commented Jun 2, 2021

manticore-projects commented Jun 2, 2021 via email

wumpz commented Jun 2, 2021

wumpz commented Jun 2, 2021

manticore-projects commented Jun 2, 2021 via email

manticore-projects commented May 20, 2021 •

edited

coveralls commented May 20, 2021 •

edited

wumpz commented May 26, 2021 •

edited