[CAMEL-16770] Added JDBC Idempotent Repository caching #5767

mgenereu · 2021-06-30T00:57:33Z

For slow per-query SQL implementations like Snowflake, this new class caches the JDBC Idempotent Repository in one query. Just like the SQL Component's batch mode, this makes interactions with the server performant.

mgenereu · 2021-07-02T21:14:58Z

@davsclaus there is absolutely no rush on this. I just wanted to check that I did everything you needed from me on this PR and associated Jira ticket. I think it's right but if it's sitting because I forgot something, I'd like to address right away. Thank you!

davsclaus

Yeah looks good, just one question about the CAST function.

Also it would be good to add a section in the documentation (see src/main/docs) about this new cached repo so users better can know about it

davsclaus · 2021-07-03T08:12:26Z

.../src/main/java/org/apache/camel/processor/idempotent/jdbc/JdbcCachedMessageIdRepository.java

+    private int hitCount;
+    private int missCount;
+    private String queryAllString
+            = "SELECT messageId, CAST(COUNT(*) AS INTEGER) AS messageCount FROM CAMEL_MESSAGEPROCESSED WHERE processorName = ? GROUP BY messageId";


Is CAST a standard SQL function?

Actually, you brought up a good question for me around jdbcTemplate. The SQL statement clause for COUNT(*) returns a long on some SQL servers and an integer on others. While I was able to solve the problem on the SQL side for this low counts by casting (ANSI SQL-92 feature) to an integer, I feel like this should be done like the existing code in JdbcMessageIdRepository.java but I don't know how to cast two columns using jdbcTemplate. Are you or another Camel developer familiar with this so I can make this match the existing code style?

In the code of the jdbc template it checks for what type you cast to Long vs Integer and does at rs.getInteger vs rs.getLong and then its up to the JDBC driver how to handle that.

If the number can be really really big, then we should maybe change that code to use Long instead of Integer.

Ideally the number for an idempotent repository is 1 per processorName/messageId combination. To be honest, I'm not even sure why the count is even involved in the code. I just respected it for either past or future implementations. The documentation for the camel-sql component even says:

When working with concurrent consumers it is crucial to create a unique constraint on the columns processorName and messageId. Because the syntax for this constraint differs from database to database, we do not show it here.

This would result in the COUNT(*) always being 1.

The Long or Integer expectation comes from COUNT(*) and its relationship to the maximum number of rows for the underlying SQL/JDBC implementation. For Derby, it's an Integer and for Snowflake, it's a Long. For this use case of idempotency, there's no reasonable way it could be an actual long. I would keep the existing integer code.

I'll look into how to properly cast at the JDBC level and try to keep the SQL code a little more generic.

mgenereu · 2021-07-04T20:45:55Z

Also it would be good to add a section in the documentation (see src/main/docs) about this new cached repo so users better can know about it

Brief documentation added. Not really sure it needs more than "use this if you want the cache behavior". Would you expect more?

…ng-jdbc-idempotent

mgenereu · 2021-07-05T15:48:02Z

Okay. This seems closer to what we want. The jdbcTemplate now handles the casting in getInt and the code looks simpler and cleaner.

mgenereu added 2 commits June 30, 2021 00:26

Added JDBC Idempotent Repository caching

a269e25

Temp commit to see if CI build passes

902e594

davsclaus reviewed Jul 3, 2021

View reviewed changes

Added documentation and warning about caching

cee571f

Merge branch 'main' of https://github.com/apache/camel into add-cachi…

a6cc397

…ng-jdbc-idempotent

davsclaus approved these changes Jul 5, 2021

View reviewed changes

mgenereu marked this pull request as draft July 5, 2021 14:33

mgenereu added 2 commits July 5, 2021 15:26

Leave casting of count to JDBC implementation

a9f2387

Cleaned up lambda formatting

7887d68

mgenereu marked this pull request as ready for review July 5, 2021 15:48

mgenereu requested a review from davsclaus July 5, 2021 15:48

davsclaus approved these changes Jul 5, 2021

View reviewed changes

davsclaus merged commit bf457b1 into apache:main Jul 5, 2021

mgenereu deleted the add-caching-jdbc-idempotent branch July 6, 2021 17:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CAMEL-16770] Added JDBC Idempotent Repository caching #5767

[CAMEL-16770] Added JDBC Idempotent Repository caching #5767

mgenereu commented Jun 30, 2021

mgenereu commented Jul 2, 2021

davsclaus left a comment

davsclaus Jul 3, 2021

mgenereu Jul 4, 2021

davsclaus Jul 5, 2021

mgenereu Jul 5, 2021

mgenereu commented Jul 4, 2021

mgenereu commented Jul 5, 2021

[CAMEL-16770] Added JDBC Idempotent Repository caching #5767

[CAMEL-16770] Added JDBC Idempotent Repository caching #5767

Conversation

mgenereu commented Jun 30, 2021

mgenereu commented Jul 2, 2021

davsclaus left a comment

Choose a reason for hiding this comment

davsclaus Jul 3, 2021

Choose a reason for hiding this comment

mgenereu Jul 4, 2021

Choose a reason for hiding this comment

davsclaus Jul 5, 2021

Choose a reason for hiding this comment

mgenereu Jul 5, 2021

Choose a reason for hiding this comment

mgenereu commented Jul 4, 2021

mgenereu commented Jul 5, 2021