clustering: zot scale-out cluster #125

Open

hallyn opened this issue Aug 5, 2020 · 4 comments
hallyn (Contributor) commented Aug 5, 2020

We will want to support running a cluster of zot servers.

When a blob is uploaded, it should be distributed to all the nodes.

When fetching an image, the client should be able to fetch each layer blob from a different server to load balance.
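
A minimal client-side sketch of that layer-fetch load balancing, assuming every node holds every blob as proposed above; the replica URLs and the fetchBlob helper are hypothetical, while the /v2/<name>/blobs/<digest> path is the standard OCI distribution endpoint:

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"os"
)

// Hypothetical replica set; in the model above every node holds every blob,
// so the client can spread layer fetches across members.
var replicas = []string{
	"http://zot-0:5000",
	"http://zot-1:5000",
	"http://zot-2:5000",
}

// fetchBlob GETs a single layer blob, picking a replica round-robin by layer index.
func fetchBlob(i int, repo, digest string) error {
	url := fmt.Sprintf("%s/v2/%s/blobs/%s", replicas[i%len(replicas)], repo, digest)
	resp, err := http.Get(url)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status %s for %s", resp.Status, url)
	}
	_, err = io.Copy(os.Stdout, resp.Body) // a real client would write to its layer store
	return err
}

func main() {
	// Placeholder digests; each layer lands on a different replica.
	layers := []string{"sha256:aaaa...", "sha256:bbbb...", "sha256:cccc..."}
	for i, d := range layers {
		if err := fetchBlob(i, "myrepo", d); err != nil {
			fmt.Fprintln(os.Stderr, err)
		}
	}
}
```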

tych0 (Contributor) commented Aug 5, 2020

Might be nice to do it with redirects, so you only have to talk to any zot in the cluster instead of finding the right one.
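
A rough sketch of the redirect idea, assuming a hypothetical static membership list that all nodes agree on: any node accepts the request, maps the blob digest onto an owner (naive modulo placement here), and answers with a 307 when the owner is another node, so clients only ever need one entry point:

```go
package main

import (
	"fmt"
	"hash/fnv"
	"log"
	"net/http"
	"strings"
)

// Hypothetical static membership; self is this node's own address.
var members = []string{"http://zot-0:5000", "http://zot-1:5000", "http://zot-2:5000"}
var self = members[0]

// ownerOf maps a content-addressable digest onto one member.
func ownerOf(digest string) string {
	h := fnv.New32a()
	h.Write([]byte(digest))
	return members[int(h.Sum32()%uint32(len(members)))]
}

// blobHandler serves the blob if this node owns it, otherwise redirects.
// Path shape is /v2/<name>/blobs/<digest> per the OCI distribution spec.
func blobHandler(w http.ResponseWriter, r *http.Request) {
	parts := strings.Split(r.URL.Path, "/")
	digest := parts[len(parts)-1]
	if owner := ownerOf(digest); owner != self {
		http.Redirect(w, r, owner+r.URL.Path, http.StatusTemporaryRedirect)
		return
	}
	fmt.Fprintf(w, "serving blob %s locally\n", digest) // real storage lookup goes here
}

func main() {
	http.HandleFunc("/v2/", blobHandler)
	log.Fatal(http.ListenAndServe(":5000", nil))
}
```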

rchincha (Contributor) commented

A couple of design considerations:

  1. Is the client aware of the members of the cluster, or do we do some sort of proxying?
  2. Given unique content-addressable blobs, routing can assume the quorum to be either stable or unstable (DHT assumptions) - see the rendezvous-hashing sketch below.
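
On point 2, if the quorum is assumed unstable, something like rendezvous (highest-random-weight) hashing keeps digest-to-node routing stable under membership changes: when a node leaves, only the blobs it owned get re-routed. A minimal sketch, with node names hypothetical:

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// owner scores every (node, digest) pair and routes the digest to the
// highest-scoring node. Unlike modulo placement, removing one node only
// re-routes the digests that node owned.
func owner(nodes []string, digest string) string {
	var best string
	var bestScore uint64
	for _, n := range nodes {
		h := fnv.New64a()
		h.Write([]byte(n))
		h.Write([]byte(digest))
		if s := h.Sum64(); s > bestScore {
			best, bestScore = n, s
		}
	}
	return best
}

func main() {
	nodes := []string{"zot-0", "zot-1", "zot-2"}
	fmt.Println(owner(nodes, "sha256:abcd..."))     // deterministic: same digest, same node
	fmt.Println(owner(nodes[:2], "sha256:abcd...")) // drop zot-2: most digests keep their owner
}
```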

rchincha changed the title from "zot cluster" to "clustering: zot scale-out cluster" on Jan 29, 2021
rchincha (Contributor) commented

#2041

andaaron (Contributor) commented

Considerations on storing various data, in the context of the clustering discussion. I wrote these down a while back; some may have been addressed since.

  • We support the local file system and S3 (in the AWS case) for the image stores (responsible for storing image blobs).
  • In the case of zot sync we only support local storage, because the third-party library we use for syncing uses local storage as an intermediate destination for the copy - a note for the Kubernetes/cloud use case.
  • The information about dedupe is stored in a cache DB - on the local file system (cache.db under the root dir) or in DynamoDB in the AWS case. There are multiple such cache DBs, one per image store; these would probably need to be shared by all zot instances (see the config sketch after this list).
  • Trivy uses local disk space to store CVE information and scan results (in the folder _trivy under the root directory) - two DBs, one for Java scanning and one for the rest, the Java one being huge (hundreds of MB if I remember correctly). Right now we are not storing CVE scan results anywhere else; we rely on Trivy and an in-memory cache of the latest results.
  • zot user session authentication (for zui) uses a folder _sessions under the root directory to store session information - if we have multiple zot instances, we also need to design for the session authentication use case.
  • Right now we advertise zot with an embedded UI - in the case of a cluster, would we have multiple UIs, or a separate UI service?
  • We have a metadata DB (meta.db stored locally under the root directory, or as DynamoDB tables in the cloud case) - this DB needs to be the same for all zots (we store information on manifests, configs, download counters, and signature verification results).
  • We store certificates and private keys for signature verification locally regardless of storage type (we want to also support AWS); these files are under the root dir (folders _notation and _cosign) -> not sure if this is still applicable today for AWS.
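
For the shared cache and metadata points above, zot's existing cloud configuration already lets multiple instances point at the same S3 bucket and the same DynamoDB tables. A sketch along the lines of zot's S3 example config (bucket, region, endpoint, and table name are placeholders):

```json
{
  "distSpecVersion": "1.1.0",
  "storage": {
    "rootDirectory": "/tmp/zot",
    "dedupe": true,
    "remoteCache": true,
    "storageDriver": {
      "name": "s3",
      "region": "us-east-2",
      "bucket": "zot-storage",
      "secure": true,
      "skipverify": false
    },
    "cacheDriver": {
      "name": "dynamodb",
      "endpoint": "http://dynamodb.us-east-2.amazonaws.com",
      "region": "us-east-2",
      "cacheTablename": "BlobTable"
    }
  },
  "http": {
    "address": "0.0.0.0",
    "port": "5000"
  }
}
```

Every zot instance started with the same driver settings would then share dedupe and metadata state; the local-only pieces (_trivy, _sessions, _notation, _cosign) are the ones the list above flags as still needing a design.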
