Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question(s) #24

Closed
ghost opened this issue Nov 20, 2020 · 4 comments
Closed

Question(s) #24

ghost opened this issue Nov 20, 2020 · 4 comments

Comments

@ghost
Copy link

ghost commented Nov 20, 2020

Hello,

Thank you for making this fork available!

Since there is very little documentation available on groupcache I thought I might get some insight from someone who is actually running it in production.

Basically, I want to know what happens when one or multiple groupcache running nodes goes offline temporarily. Will the groupcache cluster be able to recover and continue operating properly?

@thrawn01
Copy link
Contributor

Groupcache doesn't provided peer discovery or keep track of peer liveliness. You will need something that can do that for you, like kubernetes, etcd, consul, memberlist, such that when a node goes down or becomes unavailable it will notify you and your application can remove that peer from the groupcache peer list. We have used https://github.com/hashicorp/memberlist and etcd in the past with great success. Others have used kubernetes for this, you will have to write that yourself with your project needs in mind.

@ghost
Copy link
Author

ghost commented Nov 20, 2020

Thank you for the tips @thrawn01 . memberlist seems an interesting project and I plan to give it a spin.

I am aware that groupcache doesn't provide peer discovery nor does it keep track of peer liveliness out of the box, however based on the loading process described here, my assumption is that groupcache should tolerate nodes going offline without issues.

When the owner of a cached entry is offline and the entry is not hot on the node requesting it, the requester attempts a RPC call to the owner node, the call fails and the requester will retrieve the data itself. Based on this description, my assumption is that I can safely take groupcache nodes offline without destabilizing the cluster.

Can you confirm this assumption?

@thrawn01
Copy link
Contributor

Yes, your assumption is correct.

@ghost
Copy link
Author

ghost commented Nov 20, 2020

Thank you @thrawn01 !

@ghost ghost closed this as completed Nov 20, 2020
This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant