METRON-672: SolrIndexingIntegrationTest fails intermittently #424

cestella · 2017-01-26T00:12:21Z

This failure is due to a change in default behavior when indexing was split off into a separate configuration file; in particular, the default batch size was changed from 5 to 1. This, by itself, is not a problem, but the IndexingIntegrationTest (the base class for the Solr and Elasticsearch integration tests):

submits the configs
starts the indexing topology
writes the input data

The input data may be written before the topology or the configuration has fully loaded, especially if the machine running the unit tests is under load (as with Travis). As a result, the first record may be handled with the default batch size (of 1) and written out immediately because the indexing configs haven't loaded into ZooKeeper yet. Eventually the configs load and the batch size is set to 5. Meanwhile, we've written 10 records and are expecting 10 in return, but because the first record was written out on its own and the next 5 as a batch, another 4 are left pending in the BulkMessageWriterBolt.

So, the failure scenario is as follows:

Message 1 is received before the indexing config has loaded, so the batch size is 1 and it is written out immediately
Messages 2 - 5 are received after the indexing config has loaded, so the batch size is 5 and they queue up
Message 6 is received, completing a batch of 5 (messages 2 - 6), and the batch is written out
Messages 7 - 10 are received but never make a full batch, so we time out waiting for them to be written out
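
The scenario above can be reproduced with a small simulation. This is only a sketch: BatchSimulation and its parameters are hypothetical names, not Metron's BulkMessageWriterBolt, and it assumes a writer that flushes whenever the pending count reaches the batch size in effect for the current message, with no timeout-based flush.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchSimulation {

  /**
   * Counts how many messages a batching writer flushes.
   * The first messagesBeforeConfigLoads messages see defaultBatchSize;
   * all later messages see configuredBatchSize. (Hypothetical model.)
   */
  public static int written(int totalMessages,
                            int defaultBatchSize,
                            int configuredBatchSize,
                            int messagesBeforeConfigLoads) {
    List<Integer> pending = new ArrayList<>();
    int written = 0;
    for (int msg = 1; msg <= totalMessages; msg++) {
      int batchSize = msg <= messagesBeforeConfigLoads
          ? defaultBatchSize
          : configuredBatchSize;
      pending.add(msg);
      // Flush once the pending messages fill the batch size currently in effect.
      if (pending.size() >= batchSize) {
        written += pending.size();
        pending.clear();
      }
    }
    // Anything still pending is never flushed in this model.
    return written;
  }

  public static void main(String[] args) {
    System.out.println(written(10, 1, 5, 1)); // prints 6: message 1 alone, then messages 2 - 6
    System.out.println(written(10, 1, 5, 0)); // prints 10: two full batches of 5
  }
}
```

With one message arriving before the configs load, only 6 of the 10 records are written and 4 stay pending; with the configs loaded first, all 10 are written.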

The fix is to ensure that we don't write out messages to Kafka until the configs are loaded, which is what this PR does.
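
As a rough illustration of that idea, the test can poll for the config and only flip a flag once a non-empty payload comes back. This is a minimal sketch under assumptions, not the code in this PR: ConfigGate, waitForConfig, and the Supplier standing in for the config read are all hypothetical names.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Supplier;

public class ConfigGate {
  private final AtomicBoolean isLoaded = new AtomicBoolean(false);

  /**
   * Polls configReader (a hypothetical stand-in for reading the config)
   * until it returns a non-empty payload, then marks the config as loaded.
   */
  public void waitForConfig(Supplier<byte[]> configReader, long pollMillis) {
    byte[] bytes;
    do {
      bytes = configReader.get();
      if (bytes == null || bytes.length == 0) {
        try {
          Thread.sleep(pollMillis);
        } catch (InterruptedException e) {
          // Restore the interrupt flag and stop waiting.
          Thread.currentThread().interrupt();
          throw new IllegalStateException("Interrupted while waiting for config", e);
        }
      }
    } while (bytes == null || bytes.length == 0);
    isLoaded.set(true);
  }

  public boolean isLoaded() {
    return isLoaded.get();
  }
}
```

A caller would invoke waitForConfig before writing any input data, so that the first message is never handled with the default batch size.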

cestella · 2017-01-26T00:15:26Z

Because the failure is extremely sporadic locally (1 out of every 50 times I run the test), I did the following to test this:

Built and installed the project in Maven: mvn -DskipTests clean install
Ran the metron-solr project integration tests for at least 2 hours in a row, ensuring that they don't fail. The loop reruns the build until the failure message ("vs 6") shows up in the output. So, from metron-platform/metron-solr: echo "" > /tmp/output;while [ $(cat /tmp/output | grep "vs 6" | wc -l) -lt 1 ];do mvn install >& /tmp/output;done

justinleet · 2017-01-26T01:01:39Z

...n-indexing/src/test/java/org/apache/metron/indexing/integration/IndexingIntegrationTest.java

+              isLoaded.set(true);
+              return null;
+              }
+            );
            ;


Can you kill the extra semicolon?

justinleet · 2017-01-26T01:02:01Z

...n-indexing/src/test/java/org/apache/metron/indexing/integration/IndexingIntegrationTest.java

+        }
+      }
+      while(bytes == null || bytes.length == 0);
+      return;


Drop the return, since it's a void method.

justinleet · 2017-01-26T01:03:27Z

...src/test/java/org/apache/metron/enrichment/integration/components/ConfigUploadComponent.java

@@ -38,6 +39,7 @@
  private String enrichmentConfigsPath;
  private String indexingConfigsPath;
  private String profilerConfigPath;
+  private Optional<Function<ConfigUploadComponent, Void>> postStartCallback = Optional.empty();


Could this just use Consumer instead of Function? Since the second type parameter is Void, it seems like the Function is just acting as a Consumer anyway.
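
A minimal sketch of the difference, using a StringBuilder in place of ConfigUploadComponent so it stands alone (CallbackStyles and its methods are hypothetical, for illustration only):

```java
import java.util.Optional;
import java.util.function.Consumer;
import java.util.function.Function;

public class CallbackStyles {

  // Function<T, Void>: the lambda must end with "return null;".
  public static String runWithFunction() {
    StringBuilder sb = new StringBuilder();
    Function<StringBuilder, Void> f = b -> {
      b.append("loaded");
      return null;
    };
    Optional<Function<StringBuilder, Void>> callback = Optional.of(f);
    callback.ifPresent(cb -> cb.apply(sb));
    return sb.toString();
  }

  // Consumer<T>: same behavior, no dummy return value.
  public static String runWithConsumer() {
    StringBuilder sb = new StringBuilder();
    Consumer<StringBuilder> c = b -> b.append("loaded");
    Optional<Consumer<StringBuilder>> callback = Optional.of(c);
    callback.ifPresent(cb -> cb.accept(sb));
    return sb.toString();
  }
}
```

Both variants do the same work; the Consumer version just drops the return null.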

justinleet · 2017-01-26T01:04:20Z

Thanks for taking the effort to dig into this. Great work. Other than a couple minor comments, I'm very happy with this.

cestella · 2017-01-26T13:42:22Z

@justinleet comments addressed, let me know if there's anything else.

justinleet · 2017-01-26T13:45:01Z

Thanks again for this. I'm +1 on it.

Fixing integration test.

9617e09

Merge branch 'master' into METRON-672

5e836f4

justinleet reviewed Jan 26, 2017

cestella added 2 commits January 25, 2017 20:12

Updating to react to comments.

b2103fd

fixed commit.

0f6602c

cestella closed this Jan 26, 2017

cestella reopened this Jan 26, 2017

asfgit closed this in 0219e56 Jan 26, 2017
