Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backing up the WAL #992

Closed
diptanu opened this issue Jan 16, 2024 · 6 comments
Closed

Backing up the WAL #992

diptanu opened this issue Jan 16, 2024 · 6 comments
Labels
wontfix This will not be worked on

Comments

@diptanu
Copy link

diptanu commented Jan 16, 2024

Hi @drmingdrmer do you guys back up the WAL or the snapshot somewhere in datafuse ? I am trying to figure out how to make it easy for us to stream the WAL to S3 so that if we lose the nodes we can bootstrap the cluster without losing the source of truth of the data in the state machine

Copy link

👋 Thanks for opening this issue!

Get help or engage by:

  • /help : to print help messages.
  • /assignme : to assign this issue to you.

@drmingdrmer
Copy link
Member

drmingdrmer commented Jan 17, 2024

In databend, we just export all of the log and the snapshot to a file and backup it. The following is the server side export implementation:
https://github.com/datafuselabs/databend/blob/313b40ed3ff1cfe001bc6e2f806773d9001861d0/src/meta/service/src/store/store_inner.rs#L394

Fix the above url

@diptanu
Copy link
Author

diptanu commented Jan 17, 2024

@drmingdrmer Do you do this in a continuous basis for databend or is this happens after every couple of hours?

@drmingdrmer
Copy link
Member

@drmingdrmer Do you do this in a continuous basis for databend or is this happens after every couple of hours?

It is a cronjob and is executed by schedule. It is not a incremental backup :(

@Xuanwo
Copy link
Member

Xuanwo commented Jan 28, 2024

Hi @drmingdrmer, would you be open to incorporating that logic into openraft? This way, users wouldn't have to replicate the same work.

@schreter
Copy link
Collaborator

Just my 2c: I don't think there is a generic way to back up logs. We'll also back up to S3, but that's just one of the options. We must support 5 different hyperscalers, all with different APIs. From openraft POV, the log is continuous and how you cut it for backup is your beer. In our case, once a log segment is filled, it can be backed up. Once openraft tells us to purge, it's removed locally (unless the backup didn't yet finish).

@drmingdrmer drmingdrmer added the wontfix This will not be worked on label Jan 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

4 participants