Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

s3 setup for archival storage #24

Closed
ivotron opened this issue Mar 2, 2021 · 6 comments · Fixed by #86
Closed

s3 setup for archival storage #24

ivotron opened this issue Mar 2, 2021 · 6 comments · Fixed by #86
Assignees

Comments

@ivotron
Copy link
Member

ivotron commented Mar 2, 2021

When deploying a cluster it would be great to have an option that setup infra support for archival storage feature in redpanda. Briefly this would be:

  1. s3 bucket
  2. query credentials, urls, etc...
  3. deploy redpanda.yml with archival settings from (1) and (2)

@Lazin do you have the set of configuration elements that we'll need? I thought they were in configuration.h but I don't see them in there. Maybe they haven't merged yet?

@ivotron ivotron self-assigned this Mar 2, 2021
@Lazin
Copy link

Lazin commented Mar 2, 2021

Yes, this is not merged yet. The options are:

  archival_storage_enabled: true
  archival_storage_s3_access_key: <...>
  archival_storage_s3_secret_key: <...>
  archival_storage_s3_region: eu-north-1
  archival_storage_s3_bucket: redpandabackup2
  archival_storage_upload_interval: 10000
  archival_storage_max_connections: 5

developer mode should be enabled, you also have to specify AWS security credentials, bucket should be created in advance, versioning and encryptions are not needed, upload_interval is in milliseconds (5 or 10 seconds is OK), max_connections sets number of simultaneous S3 uploads per-shard, values in 1-10 range should be fine

@ivotron
Copy link
Member Author

ivotron commented Mar 2, 2021

thanks @Lazin! that info should be good to get me started (I think)

@dotnwat
Copy link
Member

dotnwat commented Mar 9, 2021

@ivotron so we have three modes

  1. ductape local
  2. ducktape cluster
  3. punisher

We're handling (1) with an extra docker container running a local s3 endpoint. For (2) and (3) I assume these share a common method for deployment?

@Lazin
Copy link

Lazin commented Mar 9, 2021

I renamed archival_storage_upload_interval to archival_storage_reconciliation_interval_ms. It has a sensible default so there is no need to specify it explicitly anywhere. Will create PR with doc tomorrow.

@vuldin
Copy link
Member

vuldin commented Oct 17, 2022

Is this ticket tracking adding shadow indexing feature, or is this for some other (possibly more specific) issue? I didn't want to create an additional ticket if it wasn't needed.

@vuldin
Copy link
Member

vuldin commented Oct 24, 2022

Created a new ticket for SI on AWS: #87

Associated PR: #86

@WesWWagner WesWWagner linked a pull request Jan 10, 2023 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants