
NIFI-11402 - PutBigQuery fix for case sensitivity and error handling #7140

Closed
wants to merge 2 commits

Conversation

pvillard31 (Contributor)

Summary

NIFI-11402

Fix the case sensitivity issue by forcing field names to lower case.
Fix error handling by setting the error back to null once it has been processed. Otherwise, every subsequent execution would keep failing with the same error and the user would have to create a new instance of the processor.
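
As an illustration of the error handling fix, here is a minimal sketch of the failure mode, assuming the processor keeps the last append error in an instance-level AtomicReference (the class and method names below are invented for the example and are not the actual PutBigQuery code):

    import java.util.concurrent.atomic.AtomicReference;

    // Illustrative only: mimics a processor that keeps the last append error as instance state.
    class ErrorStateSketch {
        private final AtomicReference<Exception> error = new AtomicReference<>();

        // Called once per FlowFile, similar in spirit to onTrigger()
        String process(boolean appendFails) {
            if (appendFails) {
                error.set(new RuntimeException("append failed"));
            }
            if (error.get() != null) {
                error.set(null); // the fix: clear the handled error so the next FlowFile starts clean
                return "failure";
            }
            return "success";
        }
    }

Without the error.set(null) call, a later invocation with a healthy append would still see the stale error and route every subsequent FlowFile to failure, which is the behavior described above.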

Tracking

Please complete the following tracking steps prior to pull request creation.

Issue Tracking

Pull Request Tracking

  • Pull Request title starts with Apache NiFi Jira issue number, such as NIFI-00000
  • Pull Request commit message starts with Apache NiFi Jira issue number, such as NIFI-00000

Pull Request Formatting

  • Pull Request based on current revision of the main branch
  • Pull Request refers to a feature branch with one commit containing changes

Verification

Please indicate the verification steps performed prior to pull request creation.

Build

  • Build completed using mvn clean install -P contrib-check
    • JDK 11
    • JDK 17

Licensing

  • New dependencies are compatible with the Apache License 2.0 according to the License Policy
  • New dependencies are documented in applicable LICENSE and NOTICE files

Documentation

  • Documentation formatting appears as expected in rendered files

@bejancsaba (Contributor) left a comment


Thanks for the bugfix. I proposed a unit test below that could be added to PutBigQueryTest to validate the change. Other than that, it is +1 from my side.

@@ -303,6 +301,7 @@ private void finishProcessing(ProcessSession session, FlowFile flowFile, StreamW
flowFile = session.putAttribute(flowFile, BigQueryAttributes.JOB_NB_RECORDS_ATTR, isBatch() ? "0" : String.valueOf(appendSuccessCount.get() * recordBatchCount));
session.penalize(flowFile);
session.transfer(flowFile, REL_FAILURE);
error.set(null); // set error to null for next execution

Thanks for catching this. What do you think about adding a unit test covering this scenario (failing before the change and passing after it)?

    @Test
    void testNextFlowFileProcessedWhenIntermittentErrorResolved() {
        when(writeClient.createWriteStream(isA(CreateWriteStreamRequest.class))).thenReturn(writeStream);
        TableSchema myTableSchema = mockTableSchema(FIELD_1_NAME, TableFieldSchema.Type.STRING, FIELD_2_NAME, TableFieldSchema.Type.STRING);
        when(writeStream.getTableSchema()).thenReturn(myTableSchema);
        // The first append fails with an INTERNAL error, the second append succeeds
        when(streamWriter.append(isA(ProtoRows.class), isA(Long.class)))
            .thenReturn(ApiFutures.immediateFailedFuture(new StatusRuntimeException(Status.INTERNAL)))
            .thenReturn(ApiFutures.immediateFuture(AppendRowsResponse.newBuilder().setAppendResult(mock(AppendRowsResponse.AppendResult.class)).build()));

        // Disable retries so the first failure immediately routes the FlowFile to failure
        runner.setProperty(PutBigQuery.RETRY_COUNT, "0");

        runner.enqueue(csvContentWithLines(1));
        runner.enqueue(csvContentWithLines(1));
        runner.run(2);

        verify(streamWriter, times(2)).append(any(ProtoRows.class), anyLong());

        // The first FlowFile fails, but the cleared error lets the second one succeed
        runner.assertQueueEmpty();
        runner.assertTransferCount(PutBigQuery.REL_FAILURE, 1);
        runner.assertTransferCount(PutBigQuery.REL_SUCCESS, 1);
    }

@@ -434,6 +433,9 @@ private static Map<String, Object> convertMapRecord(Map<String, Object> map) {
Map<String, Object> result = new HashMap<>();
for (String key : map.keySet()) {
Object obj = map.get(key);
// BigQuery is not case sensitive on the column names but the protobuf message
// expects all column names to be lower case
key = key.toLowerCase();

This is good to know. Thanks.
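
For illustration, a small standalone sketch of the normalization performed in the hunk above (a hypothetical helper class, not the actual convertMapRecord implementation):

    import java.util.HashMap;
    import java.util.Map;

    class FieldNameSketch {
        // Lower-case the map keys so they match the lower-case field names expected by the protobuf message
        static Map<String, Object> lowerCaseKeys(Map<String, Object> map) {
            Map<String, Object> result = new HashMap<>();
            for (Map.Entry<String, Object> entry : map.entrySet()) {
                result.put(entry.getKey().toLowerCase(), entry.getValue());
            }
            return result;
        }

        public static void main(String[] args) {
            Map<String, Object> row = new HashMap<>();
            row.put("Name", "someValue");
            row.put("ZIP_Code", 12345);
            // BigQuery itself matches column names case-insensitively, but the protobuf writer
            // needs the lower-case form, e.g. {name=someValue, zip_code=12345}
            System.out.println(lowerCaseKeys(row));
        }
    }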

@pvillard31 (Contributor, Author)

Thanks for the review @bejancsaba, and thanks for providing a unit test. I definitely agree it's much better with one, so I added it to my PR. Much appreciated.

bejancsaba pushed a commit that referenced this pull request Apr 19, 2023
This closes #7140.

Signed-off-by: Csaba Bejan <bejan.csaba@gmail.com>
@bejancsaba (Contributor)

Thanks @pvillard31, this looks good. Merged and backported to support/nifi-1.x as well.
