PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong… #434

kadirozde · 2019-01-30T01:30:35Z

… timestamps

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java

phoenix-core/src/main/java/org/apache/phoenix/compile/ServerBuildIndexCompiler.java

...nix-core/src/main/java/org/apache/phoenix/mapreduce/index/PhoenixServerBuildIndexMapper.java

gokceni

LGTM. Thank you for adding extra comments, that is really helpful. Please rebase on top of my changes which should go in this week. I would also wait for Geoffrey's approval since I am new.

phoenix-core/src/main/java/org/apache/phoenix/mapreduce/util/PhoenixConfigurationUtil.java

gjacoby126 · 2019-02-07T23:25:25Z

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java

+
+        ResultSet rs = conn.createStatement().executeQuery(selectSql);
+        assertTrue (rs.next());
+        assertTrue(rs.unwrap(PhoenixResultSet.class).getCurrentRow().getValue(0).getTimestamp() >= clock1.initialTime());


@kadirozde - maybe I'm misunderstanding the test, but shouldn't this be checking against an upper bound to make sure that the index was populated with the initial timestamp of 'abc' (and in the second check 'bcd')?

Thx for noticing it. I will correct it.

phoenix-core/src/main/java/org/apache/phoenix/compile/ServerBuildIndexCompiler.java

gjacoby126 · 2019-02-07T23:44:47Z

phoenix-core/src/main/java/org/apache/phoenix/mapreduce/index/IndexTool.java

+            final Job job = Job.getInstance(configuration, jobName);
+            job.setJarByClass(IndexTool.class);
+            job.setMapOutputKeyClass(ImmutableBytesWritable.class);
+            FileOutputFormat.setOutputPath(job, outputPath);


Can dispense with the output path, I think

When I remove the code for setting the output path, there is always one other test failing. There are total of 120 tests. I tried restructuring the IndexTool code so that the output path is set only when bulk loading is enabled but did not help. So, I will leave the code as it is.

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java

gjacoby126 · 2019-02-13T22:37:21Z

phoenix-core/src/it/java/org/apache/phoenix/end2end/TableDDLPermissionsIT.java

@@ -211,23 +211,14 @@ public Void run() throws Exception {

            // we should be able to read the data from another index as well to which we have not given any access to
            // this user
-            verifyAllowed(createIndex(indexName2, phoenixTableName), unprivilegedUser);


@kadirozde - I'm curious what the purpose of these changes are. Have DDL permissions changed? If not, and your changes somehow break the existing test, is there a way to modify the test logic to keep the same test coverage?

Currently global indexes are built by using UPSERT SELECT, i..e, by reading mutations from the data table and writing the corresponding mutations to the index table. The user does not need to have the write privilege on the data table in order to build an index table. However, in the new approach, an index table is built by replaying the existing mutations back on the data table again (without actually applying them to the data table) for the purpose of reconstructing the index mutations. Such replay attempts are allowed only when the user has the write privilege. Therefore, I had to delete the scenarios that are not applicable anymore.

gjacoby126 · 2019-02-14T22:50:57Z

phoenix-core/src/main/java/org/apache/phoenix/schema/MetaDataClient.java

+
+        String dataTableFullName = SchemaUtil.getTableName(
+                tableRef.getTable().getSchemaName().getString(),
+                tableRef.getTable().getTableName().getString());


@kadirozde - curious why the refresh is needed here? The tableRef only seems to be used to get the base table name to give to the ServerBuildIndexDDLCompiler

I needed this for the initial version of the compiler where the index table was searched in the index list of the PTable for the data table. MetadataClient constructs the PTable object for the data table before the index table is created and thus the object does have a reference to the newly created index. The complier has changed and this search was eliminated. So there is no need to refresh the PTable object anymore. I will remove the refresh code.

Thanks for explaining, @kadirozde . Once the refresh code gets removed, I think this is ready to go so I'll +1 and commit.

@gjacoby126, I have removed the refresh code. Thank you for discovering this issue, helping me understand the code, and reviewing the patch!

… timestamps

gokceni reviewed Feb 4, 2019

View reviewed changes

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java Outdated Show resolved Hide resolved

gokceni reviewed Feb 4, 2019

View reviewed changes

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java Show resolved Hide resolved

gokceni reviewed Feb 4, 2019

View reviewed changes

phoenix-core/src/it/java/org/apache/phoenix/end2end/IndexBuildTimestampIT.java Outdated Show resolved Hide resolved

gokceni reviewed Feb 4, 2019

View reviewed changes

phoenix-core/src/main/java/org/apache/phoenix/compile/ServerBuildIndexCompiler.java Outdated Show resolved Hide resolved

vincentpoon reviewed Feb 5, 2019

View reviewed changes

...nix-core/src/main/java/org/apache/phoenix/mapreduce/index/PhoenixServerBuildIndexMapper.java Show resolved Hide resolved

gokceni approved these changes Feb 5, 2019

View reviewed changes

gjacoby126 requested changes Feb 7, 2019

View reviewed changes

gjacoby126 reviewed Feb 13, 2019

View reviewed changes

gjacoby126 reviewed Feb 14, 2019

View reviewed changes

PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong…

a07c7ca

… timestamps

gjacoby126 closed this Apr 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong… #434

PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong… #434

kadirozde commented Jan 30, 2019

gokceni left a comment

gjacoby126 Feb 7, 2019

kadirozde Feb 8, 2019

gjacoby126 Feb 7, 2019

kadirozde Feb 11, 2019

gjacoby126 Feb 13, 2019

kadirozde Feb 14, 2019

gjacoby126 Feb 14, 2019

kadirozde Feb 15, 2019

gjacoby126 Feb 19, 2019

kadirozde Feb 19, 2019

PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong… #434

PHOENIX-5018 Index mutations created by UPSERT SELECT will have wrong… #434

Conversation

kadirozde commented Jan 30, 2019

gokceni left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment