Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] AzureRepositoriesMeteringIT multiple tests failing failing #105864

Closed
mosche opened this issue Mar 1, 2024 · 3 comments
Closed

[CI] AzureRepositoriesMeteringIT multiple tests failing failing #105864

mosche opened this issue Mar 1, 2024 · 3 comments
Assignees
Labels
:Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs medium-risk An open issue or test failure that is a medium risk to future releases Team:Distributed Meta label for distributed team >test-failure Triaged test failures from CI

Comments

@mosche
Copy link
Contributor

mosche commented Mar 1, 2024

Build scan:
https://gradle-enterprise.elastic.co/s/bjvkojojf4ekc/tests/:x-pack:plugin:repositories-metering-api:qa:azure:javaRestTest/org.elasticsearch.xpack.repositories.metering.azure.AzureRepositoriesMeteringIT/testStatsAreArchivedAfterRepositoryDeletion

Test failures
elasticsearch / periodic / 8.13 / third-party / azure
Test failures:
AzureRepositoriesMeteringIT » testClearRepositoriesStats
AzureRepositoriesMeteringIT » testStatsAreTracked
AzureRepositoriesMeteringIT » testRegisterMultipleRepositoriesAndGetStats
AzureRepositoriesMeteringIT » testDeleteThenAddRepositoryWithTheSameName
AzureRepositoriesMeteringIT » testStatsAreArchivedAfterRepositoryDeletion

Reproduction line:

./gradlew ':x-pack:plugin:repositories-metering-api:qa:azure:javaRestTest' --tests "org.elasticsearch.xpack.repositories.metering.azure.AzureRepositoriesMeteringIT.testStatsAreArchivedAfterRepositoryDeletion" -Dtests.seed=35579160B1CCDAFB -Dtests.locale=lt -Dtests.timezone=Etc/GMT+9 -Druntime.java=21

Applicable branches:
8.13

Reproduces locally?:
No

Failure history:
Failure dashboard for org.elasticsearch.xpack.repositories.metering.azure.AzureRepositoriesMeteringIT#testStatsAreArchivedAfterRepositoryDeletion

Failure excerpt:

org.elasticsearch.client.ResponseException: method [POST], host [http://[::1]:35993], URI [_snapshot/yvwzjtrfot/botvekesxg/_restore?wait_for_completion=true], status line [HTTP/1.1 500 Internal Server Error]
{"error":{"root_cause":[{"type":"snapshot_restore_exception","reason":"[yvwzjtrfot:botvekesxg/9xSEHhi-RyehlqqA0XEQ9A] cannot restore index [bqkleqgcdm] because an open index with same name already exists in the cluster. Either close or delete the existing index or restore the index under a different name by providing a rename pattern and replacement name"}],"type":"snapshot_restore_exception","reason":"[yvwzjtrfot:botvekesxg/9xSEHhi-RyehlqqA0XEQ9A] cannot restore index [bqkleqgcdm] because an open index with same name already exists in the cluster. Either close or delete the existing index or restore the index under a different name by providing a rename pattern and replacement name"},"status":500}

  at __randomizedtesting.SeedInfo.seed([35579160B1CCDAFB:417ABC912FB90BDB]:0)
  at org.elasticsearch.client.RestClient.convertResponse(RestClient.java:351)
  at org.elasticsearch.client.RestClient.performRequest(RestClient.java:317)
  at org.elasticsearch.client.RestClient.performRequest(RestClient.java:292)
  at org.elasticsearch.test.rest.ESRestTestCase.restoreSnapshot(ESRestTestCase.java:1990)
  at org.elasticsearch.xpack.repositories.metering.AbstractRepositoriesMeteringAPIRestTestCase.snapshotAndRestoreIndex(AbstractRepositoriesMeteringAPIRestTestCase.java:252)
  at org.elasticsearch.xpack.repositories.metering.AbstractRepositoriesMeteringAPIRestTestCase.snapshotAndRestoreIndex(AbstractRepositoriesMeteringAPIRestTestCase.java:214)
  at org.elasticsearch.xpack.repositories.metering.AbstractRepositoriesMeteringAPIRestTestCase.testStatsAreArchivedAfterRepositoryDeletion(AbstractRepositoriesMeteringAPIRestTestCase.java:144)
  at jdk.internal.reflect.DirectMethodHandleAccessor.invoke(DirectMethodHandleAccessor.java:103)
  at java.lang.reflect.Method.invoke(Method.java:580)
  at com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1758)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:946)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$9.evaluate(RandomizedRunner.java:982)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$10.evaluate(RandomizedRunner.java:996)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at org.junit.rules.RunRules.evaluate(RunRules.java:20)
  at org.apache.lucene.tests.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:48)
  at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
  at org.apache.lucene.tests.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:45)
  at org.apache.lucene.tests.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:60)
  at org.apache.lucene.tests.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:44)
  at org.junit.rules.RunRules.evaluate(RunRules.java:20)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:390)
  at com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:843)
  at com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:490)
  at com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:955)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:840)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:891)
  at com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:902)
  at org.elasticsearch.test.cluster.local.DefaultLocalElasticsearchCluster$1.evaluate(DefaultLocalElasticsearchCluster.java:47)
  at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
  at org.junit.rules.RunRules.evaluate(RunRules.java:20)
  at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at org.apache.lucene.tests.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:38)
  at com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:40)
  at com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:40)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at org.apache.lucene.tests.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:53)
  at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
  at org.apache.lucene.tests.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:44)
  at org.apache.lucene.tests.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:60)
  at org.apache.lucene.tests.util.TestRuleIgnoreTestSuites$1.evaluate(TestRuleIgnoreTestSuites.java:47)
  at org.junit.rules.RunRules.evaluate(RunRules.java:20)
  at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
  at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:390)
  at com.carrotsearch.randomizedtesting.ThreadLeakControl.lambda$forkTimeoutingTask$0(ThreadLeakControl.java:850)
  at java.lang.Thread.run(Thread.java:1583)

@mosche mosche added the >test-failure Triaged test failures from CI label Mar 1, 2024
@elasticsearchmachine elasticsearchmachine added blocker needs:triage Requires assignment of a team area label labels Mar 1, 2024
@mosche
Copy link
Contributor Author

mosche commented Mar 1, 2024

@mosche mosche changed the title [CI] AzureRepositoriesMeteringIT testStatsAreArchivedAfterRepositoryDeletion failing [CI] AzureRepositoriesMeteringIT multiple tests failing failing Mar 1, 2024
@brianseeders brianseeders added the :Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs label Mar 1, 2024
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

@elasticsearchmachine elasticsearchmachine added Team:Distributed Meta label for distributed team and removed needs:triage Requires assignment of a team area label labels Mar 1, 2024
@idegtiarenko idegtiarenko added medium-risk An open issue or test failure that is a medium risk to future releases and removed blocker labels Mar 12, 2024
@pxsalehi pxsalehi self-assigned this May 17, 2024
@pxsalehi
Copy link
Member

There were a lot of failures in this suite due to invalid credentials which have been change. The only relevant failures in the past two months are two: https://es-delivery-stats.elastic.dev/app/r/s/5CAv8

One seem to be an http connection timeout which I would consider an infra issue. The other one seems to be a generic problem in the suite that occasionally happens. When we take a snapshot, we remove the index. However, it seems sometimes this removal doesn't work, and restore fails because there is already an open index with the same name. I haven't been able to reproduce this, but I will open a specific issue for that one failure example so we can track this. I will close this one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs medium-risk An open issue or test failure that is a medium risk to future releases Team:Distributed Meta label for distributed team >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

5 participants