Refactor `CollectionTest`, `ConfigureIndexTest`, and `IndexManager` to improve integration test speed and reliability #81

austin-denoble · 2024-03-13T16:28:35Z

Problem

We've had continued issues with flaky integration tests and with tests taking a long time to run (25 to 30 minutes for the entire PR workflow in some cases). Much of this time is spent waiting for indexes and collections to move from Initializing to Ready. Specifically, CollectionTest and CollectionErrorTest can take a really long time because they set up and wait on multiple indexes and collections.

While working on the above issue, I also noticed test failures and repeated issues caused by findIndexWithDimensionAndType, with tests running against indexes created by and belonging to other test runs. The isIndexReady function also tends to be inefficient because its polling isn't consistent, and we can end up waiting minutes longer than we need to.
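To illustrate what consistent polling can look like, here is a minimal sketch of a fixed-interval wait with a hard deadline. The class name `ReadinessPoller`, its method, and its parameters are hypothetical and are not the repository's `waitUntilIndexIsReady`; the readiness check is passed in as a plain `BooleanSupplier` so the sketch does not assume anything about the Pinecone client API.

```java
import java.time.Duration;
import java.util.function.BooleanSupplier;

// Hypothetical helper for illustration only; not the repository's waitUntilIndexIsReady.
final class ReadinessPoller {
    private ReadinessPoller() {}

    /**
     * Polls readyCheck at a fixed interval until it returns true or the timeout elapses.
     * Returns the number of polls made; throws IllegalStateException on timeout or interrupt.
     */
    static int waitUntilReady(BooleanSupplier readyCheck, Duration interval, Duration timeout) {
        long deadline = System.nanoTime() + timeout.toNanos();
        int polls = 0;
        while (true) {
            polls++;
            if (readyCheck.getAsBoolean()) {
                return polls; // ready: stop immediately instead of sleeping again
            }
            if (System.nanoTime() - deadline >= 0) {
                throw new IllegalStateException("Resource not ready after " + timeout);
            }
            try {
                Thread.sleep(interval.toMillis());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt flag
                throw new IllegalStateException("Interrupted while waiting for readiness", e);
            }
        }
    }
}
```

Because the check runs before every sleep, the caller never waits more than one interval past the moment the resource becomes ready, and the deadline bounds the worst case.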

With the new Pinecone and Index/AsyncIndex class structure in place, we can refactor our integration tests around the new pattern.

Solution

Ultimately, I would like to move our tests away from the createIndexIfNotExistsDataPlane / createIndexIfNotExistsControlPlane functions and their reliance on the findIndexWithDimensionAndType function, which should be deprecated. We should come up with a pattern for sharing resources across all tests in a run, and setting up / tearing those down once. This can be worked on in subsequent PRs.

Move the CollectionErrorTest tests into CollectionTest. Share index and collection setup in @BeforeAll across the collection tests. Wait less time for the indexes created from the collection to be ready, as this specifically can take a number of minutes. Relax some of the assertions on the status of created collections and indexes, as I don't think we need to be that thorough here.
Update createIndexIfNotExistsDataPlane to return a tuple (AbstractMap.SimpleEntry<String, Pinecone>) of the indexName and the Pinecone client to make things a bit easier to work with.
Update all of the dataPlane/ tests to use pineconeClient.createIndexConnection() and pineconeClient.createAsyncIndexConnection() for managing data plane operations.
Deprecate isIndexReady in favor of waitUntilIndexIsReady. The polling is more consistent with this function, and it most likely saves us time overall.
Refactor ConfigureIndexTest to create its own index and clean up after.
Fix issue with findIndexWithDimensionAndType calling isIndexReady() on each index while iterating through the list.
Fix issue in IndexInterface.validateUpsertRequest where we were trying to call sparseValuesWithUnsignedIndices.getIndicesWithUnsigned32IntList() and sparseValuesWithUnsignedIndices.getValuesList() on a possibly null object, causing a NullPointerException.
Talked with @ssmith-pc, and I think it's standard practice in tests to not try/catch yourself unless you need to assert on the result. We should be letting errors throw to the test runner and let it handle them so we're not clobbering logs and stack traces. I've cleaned up try/catch statements which don't seem to be needed.
Run gradle integrationTest --info in pr.yml to get more detailed log output in the console for better troubleshooting of ongoing flapping.
Add assertWithRetry wrappers for specific actions which have been troublesome. Add Thread.sleep() in a few places to avoid hammering an index too quickly.
Add a new describeIndexStats() overload to IndexInterface and Index / AsyncIndex to allow calling without needing to explicitly pass null.
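The retry wrappers mentioned in the list above can be sketched roughly as follows. This is an illustration under stated assumptions, not the contents of `AssertRetry.java`: the class name `RetrySketch`, the `Check` interface, and the parameters (`maxAttempts`, `initialBackoffMs`, `multiplier`) are all hypothetical.

```java
// Hypothetical sketch of a retry-with-backoff assertion wrapper; names and
// signature are assumptions and do not mirror the repository's AssertRetry helper.
final class RetrySketch {
    private RetrySketch() {}

    @FunctionalInterface
    interface Check {
        void run() throws Exception;
    }

    /**
     * Runs the check until it passes or maxAttempts is reached, sleeping between
     * attempts with a backoff that grows by the given multiplier.
     * Returns the attempt number on which the check passed.
     */
    static int assertWithRetry(Check check, int maxAttempts, long initialBackoffMs, double multiplier) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        long backoffMs = initialBackoffMs;
        for (int attempt = 1; ; attempt++) {
            try {
                check.run();
                return attempt;
            } catch (AssertionError | Exception e) {
                if (attempt >= maxAttempts) {
                    // Keep the last failure as the cause so the stack trace is not lost.
                    throw new AssertionError("Check failed after " + attempt + " attempts", e);
                }
                try {
                    Thread.sleep(backoffMs);
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    throw new AssertionError("Interrupted while retrying", ie);
                }
                backoffMs = (long) (backoffMs * multiplier);
            }
        }
    }
}
```

A test would wrap only the troublesome call, for example `RetrySketch.assertWithRetry(() -> runUpsertAndAssert(), 4, 2000L, 2.0)`, where `runUpsertAndAssert` stands in for whatever operation tends to fail against a fresh index.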

I spent a lot of time running these locally and in CI to see how they perform. It seems like these changes improve overall reliability, although we still see a few failures, like the gRPC "no healthy upstream" error on data plane operations with fresh indexes.

The total amount of time it takes to run both sets of integration tests has been cut significantly in most cases.

Next steps would be to create a JUnit extension and possibly an IndexManagerSingleton to manage index resources across all tests directly. This would help make things more predictable and reliable when adding future tests. This would also allow us to handle concurrent integration test runs under the same API key, which is currently very difficult due to findIndexWithDimensionAndType.
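As a rough sketch of the singleton idea, the class below creates each shared index at most once per run and tears everything down in one place. Everything here is hypothetical: `SharedIndexRegistry`, its methods, and the per-run name suffix are assumptions for illustration, and the create and delete actions are passed in as callbacks rather than calling any real client API. A JUnit extension could call `getOrCreate` during setup and `tearDownAll` at the end of the run.

```java
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Consumer;

// Hypothetical sketch; not an existing class in the repository.
final class SharedIndexRegistry {
    private static final SharedIndexRegistry INSTANCE = new SharedIndexRegistry();

    // Assumption: a per-run suffix keeps concurrent runs under one API key from
    // picking up each other's indexes.
    private final String runId = UUID.randomUUID().toString().substring(0, 8);
    private final Map<String, String> indexNamesByKey = new ConcurrentHashMap<>();

    private SharedIndexRegistry() {}

    static SharedIndexRegistry getInstance() {
        return INSTANCE;
    }

    /** Returns the index name for this key, invoking creator only on first use. */
    String getOrCreate(String key, Consumer<String> creator) {
        return indexNamesByKey.computeIfAbsent(key, k -> {
            String name = k + "-" + runId;
            creator.accept(name); // e.g. create the index and wait until it is ready
            return name;
        });
    }

    /** Invokes deleter for every index created in this run, then forgets them. */
    void tearDownAll(Consumer<String> deleter) {
        for (String name : indexNamesByKey.values()) {
            deleter.accept(name);
        }
        indexNamesByKey.clear();
    }
}
```

Keying on a logical name rather than searching existing indexes by dimension and type means a test never has to guess which index belongs to its own run.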

Type of Change

Bug fix (non-breaking change which fixes an issue)
Infrastructure change (CI configs, etc)

Test Plan

Run integration tests and compare over run length and

ssmith-pc

In general these changes look good and should improve the reliability and speed of the test runs.

I added various test structure comments throughout.

rohanshah18

Thanks for going through all of the changes. Looks like the collections tests are still failing. Also, updating the CI to output more logs will add more noise, and it's not easy to tell which tests are failing. I think having the short summary of what assertions failed with line numbers is cleaner, and we can reproduce them locally, but if this is helpful to you, we can keep the extra info temporarily.

austin-denoble · 2024-03-18T21:14:06Z

Looks like the collections tests are still failing. Also, updating the CI to output more logs will add more noise, and it's not easy to tell which tests are failing. I think having the short summary of what assertions failed with line numbers is cleaner, and we can reproduce them locally, but if this is helpful to you, we can keep the extra info temporarily.

I left a comment about this, but I don't think running locally is a substitute for debugging what is happening in CI. We should be able to look at the CI logs and have a reasonably clear idea of what the issue is. You can pretty easily search through the logs for FAILED to see which specific tests are failing.

austin-denoble changed the title ~~Tweak Collection and Configure Index tests again~~ Refactor CollectionTest and ConfigureIndexTest to increase speed and reliability Mar 14, 2024

austin-denoble added 12 commits March 14, 2024 16:36

adjust collection and configure index tests a bit more to help flake, move to using waitUntilIndexReady over isIndexReady

4042692

remove reference to isIndexReady

28adbc2

bump the default wait for waitUntilIndexIsReady

dd21b23

try macos-latest to check speed

3ee9a59

back out gradle-wrapper changes, add some logging to build.gradle

a8bf971

bump total await for index ready to 5 minutes

8a3ab1e

move off index manager util for configure index test, move collection error tests into the CollectionTest file

2a551be

allow index cleanup to use the list

27599a8

remove extraneous try catch statements, retry configureIndex requests to ease server flakiness

b22b9b8

swap back to ubuntu-latest, remove logging configs from build.gradle

9db462d

add detailed logging config back for now, bump wait time in collection test

de6d6bb

add missing Disabled import for CollectionTest

5e1fef3

austin-denoble force-pushed the adenoble/tweak-collection-and-configure-index-tests branch from 47513a6 to 5e1fef3 Compare March 14, 2024 21:21

austin-denoble added 16 commits March 14, 2024 18:34

bump the backoff for the upsert retries in CollectionTest setUp

0952b6c

remove test logging

8d94583

increase default assertWithRetry backoff

d296356

clean up try catch in data plane tests, add --info to the gradle integrationTest command in pr workflow

96e6710

refactor findIndexWithDimensionAndType to avoid waiting on unrelated indexes

becb5fa

update environment for serverless index creation in indexmanager, a little clean up in CollectionTest

a32692f

continue refactoring integration tests around new Pinecone, Index, AsyncIndex architecture

b7edc27

add new signature for describeIndexStats without filter, update collection tests to wait less time and not wait at all in certain cases

838333e

more refactoring, refine some of the waiting for index stuff

af4e93b

more time for attempted collection test upsert

86e1191

more tweaking, don't wait around for indexes from collections to be ready

00052c3

tweaking pt. ?

1c1a017

more timing tweaks

e9c95f9

timing

8ea2ff1

update collection test to not assert on ready and to only validate vectors when ready

6cd5865

delete collections before indexes in collectionstest clean up

10540fb