Disaster recovery in Vespa #17898
Comments
For high availability, we recommend running a live Vespa instance in at least two regions/data centers. If you want more protection against catastrophic data loss, you can take periodic backups by visiting (exporting all documents from) the live instance, or keep another offline copy of the data in some other system. Many want to do the latter in any case, for data analysis and machine learning.
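To make the "backup by visiting" option concrete, here is a minimal sketch of paging through all documents with continuation tokens and writing them out as JSON lines. It assumes the `/document/v1` HTTP API is reachable and that each response carries a `documents` list plus a `continuation` token while more data remains; the endpoint, namespace, document type, and cluster name are placeholders, and the helper names (`http_fetch_page`, `visit_all`, `dump_jsonl`) are made up for illustration.

```python
import json
import urllib.parse
import urllib.request
from typing import Callable, Iterator, Optional, TextIO


def http_fetch_page(endpoint: str, namespace: str, doctype: str, cluster: str,
                    continuation: Optional[str] = None, wanted: int = 500) -> dict:
    """Fetch one page of a visit. All argument values are placeholders (assumptions)."""
    params = {"cluster": cluster, "wantedDocumentCount": str(wanted)}
    if continuation:
        params["continuation"] = continuation
    url = (f"{endpoint}/document/v1/{namespace}/{doctype}/docid?"
           f"{urllib.parse.urlencode(params)}")
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def visit_all(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[dict]:
    """Yield every document, following continuation tokens until none is returned."""
    continuation = None
    while True:
        page = fetch_page(continuation)
        yield from page.get("documents", [])
        continuation = page.get("continuation")
        if not continuation:
            break


def dump_jsonl(fetch_page: Callable[[Optional[str]], dict], out: TextIO) -> int:
    """Write one JSON document per line to `out`; return the number written."""
    count = 0
    for doc in visit_all(fetch_page):
        out.write(json.dumps(doc) + "\n")
        count += 1
    return count
```

A run against a hypothetical local instance might look like `dump_jsonl(lambda c: http_fetch_page("http://localhost:8080", "mynamespace", "mydoctype", "mycluster", c), open("backup.jsonl", "w"))`. Restoring from such a dump means feeding the documents again, so the indexing cost raised in the original question still applies.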
Does this mean setting up two different Vespa instances in two different DCs, or the same Vespa instance with nodes spanning DCs?
Two separate Vespa instances.
Gotcha, so keep calling 'visit' on the live instance and sync the delta to the non-live one?
You can do that, but it's not necessarily the most efficient way, and the copy will always lag behind the live instance.
Hi Vespa team,
What are the options for disaster recovery in Vespa? I have read about 'visit', but my understanding is that this operation produces a backup in the form of JSON. The problem is that if I want to quickly bring up a Vespa cluster from this JSON, we incur the time overhead of indexing these documents again. Do you recommend taking periodic disk index snapshots for disaster recovery? Is there a way to pause writes, force a flush to disk on all content nodes, and take a snapshot of the disk using something like rsync? We currently use rsync-based backup for a Lucene-based search engine, and the differential rsync for the delta duration is pretty quick. That relies on the assumption that the index contents on disk change incrementally rather than massively. How does Vespa Cloud manage disaster recovery in case an entire data center gets wiped out?