Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OAK-9686: replace Elastic RHLC with Elastic Java Client #490

Closed
wants to merge 86 commits into from

Conversation

AngelaFabregues
Copy link
Contributor

This is the elasticserach client migration approach that I'm following. Two clients share the same transport layer.

@AngelaFabregues
Copy link
Contributor Author

AngelaFabregues commented Feb 10, 2022

I've increased the max jar file from run-elastic to try to pass the checks. Now to 120MB.

angelafabreguesv and others added 22 commits February 14, 2022 13:05
…9686

# Conflicts:
#	oak-search-elastic/src/main/java/org/apache/jackrabbit/oak/plugins/index/elastic/index/ElasticIndexHelper.java
* trunk:
  OAK-9704 Fix Utils#getConnectionString
  OAK-9704 AzureBlobStoreBackend: empty string as null in boolean property
  OAK-9690 | Adding bringIndexUptoDate support for elastic index post offline reindexing (apache#492)
  OAK-9699 Document Elastic Indexes
  OAK-9701 : Additional restrictions to simplify permission setup
  OAK-9700: RevisionGC may fail with NPE
  OAK-9700: RevisionGC may fail with NPE
  OAK-9700: RevisionGC may fail with NPE
  OAK-9694: expose elastic stats in oak-index-stats and mbean (apache#498)
  Update oak-doc/src/site/markdown/query/elastic.md
  OAK-9699 Document Elastic Indexes
  OAK-9699 Document Elastic Indexes
  Update oak-doc/src/site/markdown/query/elastic.md
  Update oak-doc/src/site/markdown/query/elastic.md
  OAK-9699 Document Elastic Indexes

# Conflicts:
#	oak-search-elastic/src/main/java/org/apache/jackrabbit/oak/plugins/index/elastic/ElasticIndexStatistics.java
angelafabreguesv and others added 11 commits May 5, 2022 18:00
…bit-oak into clean-OAK-9686

* 'OAK-9686' of https://github.com/AngelaFabregues/jackrabbit-oak: (58 commits)
  OAK-9686 - Executes a single elasticsearch connection to use for all the tests.
  OAK-9686 - Fixing spellchecker
  OAK-9686 - Fixing collate boolean query.
  (minor) removed unused code + small improvements
  get rid of operations using the low level client
  ElasticIndexHelper2: added details on why we cannot still migrate to the new client
  use UTF-8 to serialize SearchRequest to strings (fixes a couple of test failures)
  fix suggest queries
  use the same version for both es clients (7.17.2)
  ElasticFunctionIndexCommonTest: reintroduced needed overrides
  (minor) style improvements in similarityQuery
  remove XContentBuilder from ElasticRequestHandler
  fixed baseQuery logic
  fixed ElasticPropertyIndexTest
  fixed range queries
  use java client for ElasticIndexStatisticsTest
  fixed similarity queries
  (minor) improved formatting of ElasticRequestHandler fixed code
  OAK-9686 - Removing unnecessary overwrittes.
  OAK-9686 - Refactoring aggregations
  ...

# Conflicts:
#	oak-search-elastic/src/test/java/org/apache/jackrabbit/oak/plugins/index/elastic/ElasticConnectionRule.java
@fabriziofortino fabriziofortino marked this pull request as ready for review May 12, 2022 13:52
@nit0906
Copy link
Contributor

nit0906 commented May 12, 2022

[INFO] --- maven-bundle-plugin:5.1.1:baseline (baseline) @ oak-search-elastic ---
[INFO] artifact org.apache.jackrabbit:oak-search-elastic: checking for updates from Nexus
[INFO] artifact org.apache.jackrabbit:oak-search-elastic: checking for updates from central
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time:  30:33 min
[INFO] Finished at: 2022-05-12T14:28:57
[INFO] ------------------------------------------------------------------------
[ERROR] GC overhead limit exceeded -> [Help 1]

The jenkins build on the PR for oak-search-elastic seems to be failing with GC overhead limit exceeded - we should probably look into this.

@fabriziofortino
Copy link
Contributor

[ERROR] GC overhead limit exceeded -> [Help 1]

The jenkins build on the PR for oak-search-elastic seems to be failing with GC overhead limit exceeded - we should probably look into this.

@nit0906 I have seen this but it's not clear what triggers it. I would expect an OOM when the tests are executed. The OOM happens later though. Can we enable full debug on jenkins to see if that helps in understanding where this comes from?

@nit0906
Copy link
Contributor

nit0906 commented May 13, 2022

@fabriziofortino

Can we enable full debug on jenkins to see if that helps in understanding where this comes from?

This seems to be some apache ci jenkins - I am not entirely sure how to change the debug logging, but it seems it runs from this jenkins file - https://github.com/apache/jackrabbit-oak/blob/trunk/Jenkinsfile

I think if we add the -X option , in the mvn clean verify command, it should log at debug level. But I think we might need to change that in the trunk branch (changing in the feature branch wouldn't work I guess since the job https://ci-builds.apache.org/job/Jackrabbit/job/oak-trunk-pr/configure here is picking from the trunk of apache repo.)

@fabriziofortino
Copy link
Contributor

@nit0906 the problem was with the bundle. I have fine-tuned the dependencies (we can get rid of all lucene dependencies when the migration to ES Java Client is complete) and the build is now successful.

fabriziofortino and others added 2 commits May 13, 2022 14:49
…-oak into OAK-9686

* 'trunk' of https://github.com/AngelaFabregues/jackrabbit-oak:
  OAK-9755 | Lucene metrics using labels (apache#565)
  change to info level
  remove duplicate
  fix Path Predicate
  OAK-9758 error out if tika dependencies are missing and improve logging to log warning instead of debug silently
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
5 participants