
PHOENIX-6883 : Phoenix Metadata Caching Redesign #1883

Open
wants to merge 10 commits into base: master
Conversation

palashc
Contributor

@palashc palashc commented Apr 24, 2024

JIRA - https://issues.apache.org/jira/browse/PHOENIX-6883

  1. Introduce a region server endpoint for Phoenix and server-side metadata caching on region servers.
  2. The server-side cache is kept consistent by invalidating cache entries synchronously during DDL operations.
  3. The client sets UPDATE_CACHE_FREQUENCY=never and uses LAST_DDL_TIMESTAMP to validate metadata before a query/upsert.
  4. The client refreshes its cache via the getTable RPC only when it receives a stale cache exception from the server.
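The client-side flow in points 3 and 4 can be sketched as a compare-and-retry loop. The following is an illustrative toy simulation, not Phoenix code: `MetadataCacheDemo`, its maps, and `serverValidate`/`refreshClientCache` are hypothetical stand-ins for the real client/server interaction.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model of LAST_DDL_TIMESTAMP validation (hypothetical names, not Phoenix classes).
public class MetadataCacheDemo {

    static class StaleMetadataCacheException extends Exception {}

    // "Server": authoritative table -> lastDdlTimestamp map.
    static final Map<String, Long> serverDdlTimestamps = new HashMap<>();

    // "Client": locally cached timestamps (UPDATE_CACHE_FREQUENCY=never, so never aged out).
    static final Map<String, Long> clientDdlTimestamps = new HashMap<>();

    // Server-side check: reject the request if the client's timestamp is stale.
    static void serverValidate(String table, long clientTs)
            throws StaleMetadataCacheException {
        if (clientTs != serverDdlTimestamps.get(table)) {
            throw new StaleMetadataCacheException();
        }
    }

    // Stand-in for the getTable RPC that refreshes the client cache.
    static void refreshClientCache(String table) {
        clientDdlTimestamps.put(table, serverDdlTimestamps.get(table));
    }

    // Client query path: validate, and on a stale-cache exception refresh once and retry.
    static boolean runQuery(String table) {
        try {
            serverValidate(table, clientDdlTimestamps.get(table));
            return true; // first attempt succeeded, no getTable RPC needed
        } catch (StaleMetadataCacheException stale) {
            refreshClientCache(table); // single getTable RPC
            try {
                serverValidate(table, clientDdlTimestamps.get(table));
                return true;
            } catch (StaleMetadataCacheException retryFailed) {
                return false;
            }
        }
    }
}
```

The key property this models is that the expensive getTable RPC happens only on the stale path; steady-state queries validate with a cheap timestamp comparison.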

@virajjasani virajjasani self-requested a review April 26, 2024 07:29
boolean isRetry) throws Throwable {
RegionServerEndpointProtos.InvalidateServerMetadataCacheRequest protoRequest =
getRequest(invalidateCacheRequests);
// TODO Do I need my own executor or can I re-use QueryServices#Executor
Contributor Author

This came up in discussion today and the recommendation is to create a separate thread pool for executing cache invalidations. I can create a subtask for the same and add the changes to this PR.
@shahrs87 @tkhurana @kadirozde
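A dedicated pool as recommended above could look like the sketch below. This is a hypothetical illustration, assuming the pool is created once (e.g. at CQSI initialization) and shared across DDL operations; the class and thread names are invented for the example.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch of a dedicated, shared pool for cache invalidation RPCs,
// created once rather than per DDL operation.
public class InvalidateCachePool {
    private static final AtomicInteger COUNTER = new AtomicInteger();

    public static ExecutorService newPool(int threads) {
        ThreadFactory factory = runnable -> {
            Thread t = new Thread(runnable,
                    "phoenix-invalidate-cache-" + COUNTER.getAndIncrement());
            t.setDaemon(true); // do not block JVM shutdown
            return t;
        };
        return Executors.newFixedThreadPool(threads, factory);
    }
}
```

Naming the threads makes invalidation work visible in thread dumps, and a bounded pool keeps a burst of DDLs from spawning unbounded threads.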

Contributor

Assuming that these new methods are only called on the server side, I think we should move this code to a separate (or existing) server-side class.

Contributor Author

Since the cache invalidation code is in CQSI and we want to avoid creating a new thread pool for every DDL operation, how do you suggest we store the executor? We could keep an executor in CQSI, but we would not want a client-side CQSI to create an executor unnecessarily; or is that okay?

Contributor Author

@kadirozde This code did start out in a separate class, but I remember @shahrs87 and @jpisaac had some reasons to move it to CQSI. I could not find which PR that was. @shahrs87, can you recall the details?

Contributor Author

This was the PR - #1748
@kadirozde Please see comments here #1748 (comment)

@@ -636,7 +636,7 @@ public MutationPlan compile(UpsertStatement upsert) throws SQLException {
// as max TS, so that the query can safely restarted and still work of a snapshot
// (so it won't see its own data in case of concurrent splits)
// see PHOENIX-4849
-            long serverTime = selectResolver.getTables().get(0).getCurrentTime();
+            long serverTime = selectResolver.getTables().get(0).getTimeStamp();
Contributor

What is the difference between getCurrentTime() and getTimeStamp()?

Contributor Author
@palashc palashc Jun 13, 2024

@tkhurana Currently both of those methods return the same value, i.e. upperBoundTimeStamp, which is either -1 or whatever is provided to the TableRef constructor. Current code:

// if UPDATE_CACHE_FREQUENCY is set, always let the server set timestamps
this.upperBoundTimeStamp = table.getUpdateCacheFrequency() != 0
        ? QueryConstants.UNSET_TIMESTAMP : upperBoundTimeStamp;
this.currentTime = this.upperBoundTimeStamp;

public long getTimeStamp() {
    return this.upperBoundTimeStamp;
}

public long getCurrentTime() {
    return this.currentTime;
}

Once we set UPDATE_CACHE_FREQUENCY to NEVER, currentTime was being set to -1, and features that rely on currentTime were breaking (for example, QueryOptimizer deciding whether a disabled index is under its usability threshold, or asyncCreatedDate during CreateIndex). The following change in TableRef keeps currentTime at whatever is provided to the TableRef constructor, so features using currentTime continue to work. There is effectively no change in UpsertCompiler; it still uses the same value, i.e. TableRef.upperBoundTimeStamp.

this.currentTime = upperBoundTimeStamp;
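The effect of that one-line change can be illustrated with a simplified, hypothetical TableRef; the class, field visibility, and constant below are stand-ins for the real Phoenix code.

```java
// Toy TableRef illustrating the fix discussed above (simplified, hypothetical).
public class TableRefDemo {
    static final long UNSET_TIMESTAMP = -1L; // stands in for QueryConstants.UNSET_TIMESTAMP

    final long upperBoundTimeStamp;
    final long currentTime;

    TableRefDemo(long updateCacheFrequency, long upperBoundTimeStamp) {
        // if UPDATE_CACHE_FREQUENCY is set, always let the server set timestamps
        this.upperBoundTimeStamp = updateCacheFrequency != 0
                ? UNSET_TIMESTAMP : upperBoundTimeStamp;
        // after the fix: keep the constructor-supplied value,
        // independent of the assignment above
        this.currentTime = upperBoundTimeStamp;
    }

    long getTimeStamp()   { return upperBoundTimeStamp; }
    long getCurrentTime() { return currentTime; }
}
```

With UPDATE_CACHE_FREQUENCY set to a nonzero value (i.e. NEVER), getTimeStamp() still reports UNSET_TIMESTAMP for the server, while getCurrentTime() retains the caller-supplied value that features like QueryOptimizer depend on.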

public static final long DEFAULT_UPDATE_CACHE_FREQUENCY
= (long) ConnectionProperty.UPDATE_CACHE_FREQUENCY.getValue("ALWAYS");
public static final boolean DEFAULT_LAST_DDL_TIMESTAMP_VALIDATION_ENABLED = false;
public static final boolean DEFAULT_PHOENIX_METADATA_INVALIDATE_CACHE_ENABLED = false;
Contributor

Where is this being used?

Contributor Author
@palashc palashc Jun 13, 2024

These are the client-side/server-side flags we would use to enable the feature.

DEFAULT_LAST_DDL_TIMESTAMP_VALIDATION_ENABLED is used in helper methods in ValidateLastDDLTimestampUtil, which the client uses to decide whether to validate timestamps, and DEFAULT_PHOENIX_METADATA_INVALIDATE_CACHE_ENABLED is used in CQSI.invalidateServerMetadataCache to decide whether to invalidate the cache on the server side.
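Such defaults are typically resolved by letting an explicit connection property win over the compiled-in constant. A minimal sketch of that pattern follows; the property key shown is illustrative only, not the actual Phoenix configuration name.

```java
import java.util.Properties;

// Hypothetical sketch of boolean feature-flag resolution: an explicit
// property wins, otherwise the compiled-in default applies.
public class FeatureFlags {
    public static final boolean DEFAULT_LAST_DDL_TIMESTAMP_VALIDATION_ENABLED = false;

    public static boolean isValidationEnabled(Properties props) {
        return Boolean.parseBoolean(props.getProperty(
                // illustrative key name, not the real Phoenix property
                "phoenix.last.ddl.timestamp.validation.enabled",
                Boolean.toString(DEFAULT_LAST_DDL_TIMESTAMP_VALIDATION_ENABLED)));
    }
}
```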

setLastQueryPlan(plan);

//verify metadata for the table/view/index in the query plan
//plan.getTableRef can be null in some cases like EXPLAIN <query>
Contributor

Nice comment

requestBuilder.addLastDDLTimestampRequests(innerBuilder);

// add all indexes of the current table
for (PTable idxPTable : tableRef.getTable().getIndexes()) {
Contributor

When validating timestamps, we validate the timestamps of all the ancestors and the current table, as well as its indexes. When the server returns StaleMetadataCacheException, are we also re-populating the client cache for all the objects that we are validating? Are we also re-populating the indexes of the current table?

Contributor Author

Yes. Currently we re-populate everything, since we use the MetaDataClient.updateCache workflow: it resolves ancestors, goes through the process of adding inherited columns and indexes from them, and also re-populates the indexes of the current table into the client cache.

/**
* Factory to instantiate InvalidateMetadataCacheControllers
*/
public class InvalidateMetadataCacheControllerFactory extends RpcControllerFactory {
Contributor

Why didn't we consider using the ServerSideRpcControllerFactory instead of creating a new factory?

Contributor Author

I am not sure about this one.
