Modify Deployment for multi data source searchengine #448

khaledk2 · 2025-05-12T08:07:20Z

This PR contains the changes required to deploy the multi-data source search engine (ome/omero_search_engine#102).


sbesson

A few comments. Also two general process questions:

What happens if the searchengine_backup volume is created but not cloned from a previous volume? Will the playbook still initialise an empty cluster, or will it fail during the restore_elasticsearch_data command? This will be immediately relevant for the creation of prod128, as the volume will not be cloned from a previous version.
What is the process for creating a new backup of the search cluster? Does that happen automatically after the indexer is executed, or is that a separate command?

We will probably need 2 companion PRs:

one against the submission workflow with the indexing/backup commands, if they are different from the current ones
one against the deployment workflow to update the creation of new volumes and include the new volume to clone


khaledk2 · 2025-06-10T11:02:01Z

I have added two new playbooks to back up and restore the search engine data:

restore_searchengine_data.yml

This one should run after the deployment playbooks have completed successfully. It will check for the existence of the snapshot before running.

backup_searchengine_data.yml

This playbook should run just before releasing the production server.
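
As an illustration of the "check for the existence of the snapshot" step, here is a minimal Python sketch that asks Elasticsearch whether a snapshot is present through its snapshot API (`GET /_snapshot/<repository>/<snapshot>`). It is not the code used by the playbook. The base URL, the repository and snapshot names, and the injectable `opener` parameter are assumptions made for the example.

```python
import urllib.error
import urllib.request


def snapshot_exists(base_url, repository, snapshot, opener=urllib.request.urlopen):
    """Return True if Elasticsearch reports the snapshot, False on a 404.

    `repository` and `snapshot` are hypothetical names; use whatever the
    deployment actually registers.
    """
    url = f"{base_url.rstrip('/')}/_snapshot/{repository}/{snapshot}"
    try:
        with opener(url) as response:
            return response.status == 200
    except urllib.error.HTTPError as err:
        if err.code == 404:  # repository or snapshot is missing
            return False
        raise  # any other status (401, 500, ...) is a real failure
```

A restore step could then be skipped when `snapshot_exists(...)` returns `False`, so that a volume which was not cloned from a previous deployment does not cause a failure.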

sbesson · 2025-06-10T11:22:17Z

> This one should run after the deployment playbooks have completed successfully. It will check for the existence of the snapshot before running.

Shouldn't it run automatically as part of the search engine deployment then? Is the task asynchronous? If not, how long will it typically take?

> This playbook should run just before releasing the production server.

So the current process for indexing new studies is unchanged. Should the backup be executed by the person running the indexer, using the same docker run approach, rather than requiring an Ansible set-up?

khaledk2 · 2025-06-10T12:33:04Z

The search engine data restore process is asynchronous and, as of the last commit, runs automatically.
Indexing and cache updates for newly added studies are performed manually.
Backups can also be run manually using the following command:

docker run -v /data/searchengine/searchengine/:/etc/searchengine/ -v /data/searchengine/searchengine/logs/:/opt/app-root/src/logs/ --network searchengine-net openmicroscopy/omero-searchengine:0.7 backup_elasticsearch_data
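
For whoever runs the indexer, the command above could be wrapped in a small helper so that the same mounts, network and image are reused for each subcommand. The following Python sketch is only an illustration: the paths, network, image tag and the `backup_elasticsearch_data` subcommand are copied from the command above, while the function names and the dry-run default are assumptions.

```python
import shlex
import subprocess

# Values copied from the docker command quoted in this thread.
IMAGE = "openmicroscopy/omero-searchengine:0.7"
NETWORK = "searchengine-net"
VOLUMES = [
    ("/data/searchengine/searchengine/", "/etc/searchengine/"),
    ("/data/searchengine/searchengine/logs/", "/opt/app-root/src/logs/"),
]


def build_command(subcommand):
    """Assemble the docker run argument list for a search engine subcommand."""
    args = ["docker", "run"]
    for host_path, container_path in VOLUMES:
        args += ["-v", f"{host_path}:{container_path}"]
    args += ["--network", NETWORK, IMAGE, subcommand]
    return args


def run(subcommand, dry_run=True):
    """Print the command; execute it only when dry_run is False."""
    args = build_command(subcommand)
    print(shlex.join(args))
    if not dry_run:
        subprocess.run(args, check=True)
    return args
```

Calling `run("backup_elasticsearch_data")` prints the command without executing it; passing `dry_run=False` hands the same argument list to Docker.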

sbesson · 2025-06-10T13:22:23Z

> Backups can also be run manually using the following command
>
> docker run -v /data/searchengine/searchengine/:/etc/searchengine/ -v /data/searchengine/searchengine/logs/:/opt/app-root/src/logs/ --network searchengine-net openmicroscopy/omero-searchengine:0.7 backup_elasticsearch_data

@jburel @francesw @will-moore @dominikl what are your thoughts about the best place to integrate this into the IDR lifecycle?

sbesson

Let's discuss the state of this PR tomorrow at the IDR weekly meeting. I think we need to clarify two questions before spinning up prod128 with this included:

The process for creating a dump of the search engine, as raised in #448 (comment). My opinion is that this task is outside the scope of these playbooks.
Re-reading the diff, what would happen if the playbooks, and notably idr-02-services.yml, were re-run against a previously deployed environment? Would the restore command be re-executed, and what would be the expected state of the indexer?
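
To make the re-run question concrete, one possible guard is sketched below in Python. This is not what the playbooks currently do. It is a hypothetical policy in which a restore is attempted only when a snapshot is available and the cluster holds no indexed documents yet. The names `Action` and `plan_restore` and both inputs are assumptions for illustration.

```python
from enum import Enum


class Action(Enum):
    RESTORE = "restore the snapshot"
    SKIP_ALREADY_POPULATED = "skip: indices already contain data"
    SKIP_NO_SNAPSHOT = "skip: no snapshot available, start with an empty cluster"


def plan_restore(snapshot_available, indexed_documents):
    """Decide what a (re-)run of the deployment should do about the restore.

    snapshot_available: whether a snapshot was found in the backup volume.
    indexed_documents: number of documents currently held by the cluster.
    """
    if indexed_documents > 0:
        # A previous run or a manual indexing already populated the cluster;
        # restoring again could overwrite newer data.
        return Action.SKIP_ALREADY_POPULATED
    if not snapshot_available:
        # Fresh volume that was not cloned from a previous deployment.
        return Action.SKIP_NO_SNAPSHOT
    return Action.RESTORE
```

Under this policy a second run against an already deployed environment would be a no-op for the search engine data, and a fresh volume would yield an empty cluster rather than a failed restore.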

sbesson · 2025-06-16T09:49:46Z

From the earlier discussion, the agreement was to:

merge this in preparation of the deployment of prod128 with a new search engine container
following creation, immediately run the indexer and back up the database (from the state of prod127)
load new studies as usual
run the indexer for the new studies and update the backup for prod129

I will open 2 issues to capture the two concerns raised in #448 (review)

khaledk2 added 3 commits May 1, 2025 21:17

Deploy multi-data source searchengine

e281afc

Add comments to the playbooks

9635253

Add comment

bf3e653

sbesson reviewed May 12, 2025

ansible/decommission/archive-logs.yml (outdated, resolved)

ansible/idr-searchengine.yml (outdated, resolved)

khaledk2 added 2 commits May 16, 2025 17:59

Update archive-logs.yml

44d6f3f

Remove elasticsearch_backup_folder folder creation

35a7d55

khaledk2 mentioned this pull request May 28, 2025

Multi data source search ome/omero_search_engine#102

Merged

khaledk2 added 3 commits June 9, 2025 10:44

change to use openmicroscopy image

9bdb4f3

create volume for searchengine backup

664cca8

snapshot for searchengine_backup

e6103f9

sbesson requested changes Jun 9, 2025

ansible/decommission/archive-logs.yml (outdated, resolved)

ansible/group_vars/searchengine-hosts.yml (outdated, resolved)

ansible/idr-searchengine.yml (outdated, resolved)

khaledk2 added 3 commits June 10, 2025 10:45

add restore and backup searchengine data playbooks and address review… issues

09f1da5

remove comment

8717068

Verify whether the snapshot exists before restore

43b4a67

Run restore_searchengine_data automaticalley

143451a

khaledk2 mentioned this pull request Jun 11, 2025

Copy test scripts to the host and add backup script ome/omero_search_engine#112

Merged

sbesson reviewed Jun 15, 2025


sbesson merged commit 11cc96d into IDR:master Jun 16, 2025
3 checks passed

sbesson mentioned this pull request Jun 16, 2025

prod128: searchengine deployment issues #451

Open


2 participants
