rbd: don't cap the buffer size used in GetImageNames by Sanford137 · Pull Request #700 · ceph/go-ceph

Sanford137 · 2022-06-02T17:21:20Z

This removes the limit on the max buffer size GetImageNames is willing to pass to rbd_list2, which is somewhat arbitrary and is too small for large clusters. GetImageNames will continue to start with a small buffer size and retry with a larger buffer if rbd_list2 returns ERANGE (just without a cap on the max buffer size it's willing to go to).
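For anyone following along, here is a minimal standalone sketch of the grow-and-retry pattern described above. It is not the go-ceph code: `fakeList`, `listNames`, and this `errRange` are hypothetical stand-ins, with `fakeList` playing the role of rbd_list2 by reporting the size it needs when the buffer is too small.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// errRange is a hypothetical stand-in for the ERANGE error.
var errRange = errors.New("buffer too small")

// fakeList imitates a C-style listing call (an assumption, not librbd):
// it writes NUL-terminated names into buf, or returns errRange together
// with the number of bytes it needs.
func fakeList(names []string, buf []byte) (int, error) {
	need := 0
	for _, n := range names {
		need += len(n) + 1
	}
	if need > len(buf) {
		return need, errRange
	}
	off := 0
	for _, n := range names {
		off += copy(buf[off:], n)
		buf[off] = 0
		off++
	}
	return need, nil
}

// listNames starts with a small buffer and retries with whatever size
// the call asks for, with no upper cap on that size.
func listNames(names []string, initial int) ([]string, error) {
	size := initial
	for {
		buf := make([]byte, size)
		n, err := fakeList(names, buf)
		if errors.Is(err, errRange) {
			size = n // retry with the requested size
			continue
		}
		if err != nil {
			return nil, err
		}
		if n == 0 {
			return []string{}, nil
		}
		// Drop the trailing NUL, then split on the separators.
		return strings.Split(string(buf[:n-1]), "\x00"), nil
	}
}

func main() {
	names := []string{"vol-a", "vol-b", "vol-c"}
	got, err := listNames(names, 4) // deliberately too small at first
	fmt.Println(got, err)
}
```

In this sketch the second attempt always succeeds, because the fake reports the exact size it needs; the real call may behave differently if images are created between attempts.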

I considered creating a new API method in which max buffer size is configurable, but ultimately it seemed much simpler/cleaner to me to just keep the current method and kill the cap entirely. My team has experimented with a no-cap version of GetImageNames (since the capped one fails on our clusters which have many thousands of volumes), and it hasn't had any issues. However, if there is a subset of users for whom having the cap is important, happy to take another approach.

Checklist

Added tests for features and functional changes
Public functions and types are documented
Standard formatting is applied to Go code
Is this a new API? Is this new API marked PREVIEW?

phlogistonjohn · 2022-06-03T13:09:21Z

Code seems reasonable enough, but I'm a bit concerned about the potential unbounded memory use. I understand that you didn't want to change the existing function... especially since you're only changing the legacy "nautilus" version.

One thought I have is that you could move the limit value to a global variable, and then alter the limit that way. It's technically a "new api" but it's fully compatible with the existing code. But I wonder if @ansiwen would think it too hacky. :-)
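A rough sketch of what the global-variable idea could look like, purely for illustration. `MaxListBufSize`, `fetch`, and `sizeFor` are hypothetical names, not part of go-ceph, and the doubling strategy is an assumption.

```go
package main

import (
	"errors"
	"fmt"
)

// MaxListBufSize is a hypothetical package-level limit. Callers with
// large clusters could raise it before listing images.
var MaxListBufSize = 8

var errRange = errors.New("buffer too small")

// fetch is a stand-in for the real listing call; it succeeds only when
// the buffer size is at least `need` bytes.
func fetch(need, size int) error {
	if size < need {
		return errRange
	}
	return nil
}

// sizeFor doubles the buffer size from start until fetch succeeds or
// the size would exceed MaxListBufSize.
func sizeFor(need, start int) (int, error) {
	if start <= 0 {
		start = 1 // doubling zero would never grow
	}
	for size := start; size <= MaxListBufSize; size *= 2 {
		err := fetch(need, size)
		if err == nil {
			return size, nil
		}
		if !errors.Is(err, errRange) {
			return 0, err
		}
	}
	return 0, errRange
}

func main() {
	_, err := sizeFor(20, 4)
	fmt.Println(err) // the default limit is too small for 20 bytes

	MaxListBufSize = 64 // caller opts in to a larger limit
	size, err := sizeFor(20, 4)
	fmt.Println(size, err)
}
```

Existing callers would see no change in behavior, while callers that hit the limit could raise it, at the cost of mutable package-level state.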

I will think about this some more too.

Sanford137 · 2022-06-03T16:03:02Z

It's not actually just a nautilus change - there aren't any build tags on rbd_nautilus.go in the latest version of go-ceph. I'm guessing the original purpose of that file was to contain API functionality that was only available in nautilus or later, but not in luminous/mimic, which are now unsupported (and at some point the build tags were removed, since necessary librbd calls became available in all supported versions of Ceph).

I'm okay with going the global variable way too, although yeah it is a bit hacky. Let me know what you think after mulling for a few days.

phlogistonjohn · 2022-06-03T16:45:50Z

> It's not actually just a nautilus change - there aren't any build tags on rbd_nautilus.go in the latest version of go-ceph. I'm guessing the original purpose of that file was to contain API functionality that was only available in nautilus or later, but not in luminous/mimic, which are now unsupported (and at some point the build tags were removed, since necessary librbd calls became available in all supported versions of Ceph).

OK, thank you for pointing that out. I missed that.

> I'm okay with going the global variable way too, although yeah it is a bit hacky. Let me know what you think after mulling for a few days.

Will do.

Sanford137 · 2022-06-08T15:36:58Z

Any new thoughts on this?

phlogistonjohn · 2022-06-09T14:03:02Z

I was waiting for @ansiwen to return from time off to discuss this. Thanks for being patient with us.

Sanford137 · 2022-06-09T14:27:28Z

Cool, no worries!

ansiwen

In general I like the change, because I found the retry mechanism over-engineered for a buffer of a few bytes in size. So keeping it simple is good: try with 4096 bytes and then with whatever is required. (I wouldn't call that "unbound", because it's a specific request.) Speaking of simplicity: I'm not a fan of the recursion (in Go code), and would prefer to see a loop with a break. I will not block on that, though, but that would be clearly my preference.

ansiwen · 2022-06-10T15:10:30Z

rbd/rbd_nautilus.go

+	err := getErrorIfNegative(ret)
+	if err != nil {
+		if err == errRange {
+			return getImageNames(ioctx, size)


Although I'm heavily using recursion when writing ocaml code, I don't think it's idiomatic in Go code. Go doesn't guarantee tail-call optimisation (although it seems it does have it in some cases), so I think a loop or a goto is a better option for Go. It seems nit-picky, I know, because in most cases it will only get called twice, but it's more like a "Go style hygiene" for other coders that will work on the code later.

Actually, the Go compiler does not do tail-call optimization, which means, for example, that the buffers would all stay allocated in parallel:

% cat tail-call-recursion.go
package main

func f() {
	f()
}

func main() {
	f()
}
% go run ./tail-call-recursion.go
runtime: goroutine stack exceeds 1000000000-byte limit
runtime: sp=0xc0200e0390 stack=[0xc0200e0000, 0xc0400e0000]
fatal error: stack overflow

runtime stack:
runtime.throw({0x10614e2, 0x10b98a0})
	/usr/local/Cellar/go/1.17.5/libexec/src/runtime/panic.go:1198 +0x71
runtime.newstack()
	/usr/local/Cellar/go/1.17.5/libexec/src/runtime/stack.go:1088 +0x5ac
runtime.morestack()
	/usr/local/Cellar/go/1.17.5/libexec/src/runtime/asm_amd64.s:461 +0x8b

goroutine 1 [running]:
main.f()
	/Users/svanders/tmp/go/tail-call-recursion.go:3 +0x26 fp=0xc0200e03a0 sp=0xc0200e0398 pc=0x1054cc6
main.f()
	/Users/svanders/tmp/go/tail-call-recursion.go:4 +0x17 fp=0xc0200e03b0 sp=0xc0200e03a0 pc=0x1054cb7
main.f()
	/Users/svanders/tmp/go/tail-call-recursion.go:4 +0x17 fp=0xc0200e03c0 sp=0xc0200e03b0 pc=0x1054cb7
main.f()
...
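A bounded way to see the "buffers stay allocated in parallel" point, as a sketch only. The `live`/`peak` counters are a hand-rolled model of which buffers are still referenced by a stack frame, not a measurement of the garbage collector, and all names here are hypothetical.

```go
package main

import "fmt"

// live counts buffers still referenced by some frame or iteration;
// peak records the highest value live has reached.
var live, peak int

func track(delta int) {
	live += delta
	if live > peak {
		peak = live
	}
}

// recursive retries by calling itself. Without tail-call optimization,
// every outer frame keeps its buffer until the innermost call returns.
func recursive(size, need int) int {
	buf := make([]byte, size)
	track(+1)
	defer track(-1)
	if len(buf) < need {
		return recursive(size*2, need)
	}
	return len(buf)
}

// loop retries in place. The previous buffer is no longer referenced
// once the iteration ends, which track(-1) models here.
func loop(size, need int) int {
	for {
		buf := make([]byte, size)
		track(+1)
		n := len(buf)
		track(-1)
		if n >= need {
			return n
		}
		size *= 2
	}
}

// measure resets the counters, runs f from a 1-byte buffer up to at
// least 100 bytes, and returns the final size and the peak count.
func measure(f func(size, need int) int) (int, int) {
	live, peak = 0, 0
	n := f(1, 100)
	return n, peak
}

func main() {
	fmt.Println(measure(recursive)) // final size, peak live buffers
	fmt.Println(measure(loop))
}
```

Both variants end at the same buffer size, but the recursive one holds one buffer per attempt at its deepest point while the loop holds one at a time.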

ansiwen

That looks better, thanks!

Sanford137 · 2022-06-14T15:46:18Z

I think the pacific failure is a flakey test (the failure is in cephfs/admin), can you re-run it?

phlogistonjohn · 2022-06-14T16:02:21Z

@Mergifyio rebase

mergify · 2022-06-14T16:03:03Z

rebase

✅ Branch has been successfully rebased

phlogistonjohn

While I do still think it would have been a lot less work to simply bump up the WithSizes maximum, this is fine with me.

phlogistonjohn · 2022-06-14T16:07:18Z

I plan to have this land in the v0.16.0 release to be created today. This is about as last minute as we get. To use a sports metaphor, you got this one in right before the buzzer. :-D

ansiwen reviewed Jun 10, 2022

ansiwen added the no-API This PR does not include any changes to the public API of a go-ceph package label Jun 10, 2022

ansiwen approved these changes Jun 14, 2022

Sanford137 added 2 commits June 14, 2022 16:03

rbd: refactor GetImageNames to use a for loop instead of recursion

02ded3f

This is done because using a for loop is more idiomatic in Go code. Signed-off-by: Sanford Miller <smiller@digitalocean.com>

ktdreyer force-pushed the no-max-buffer-size-get-image-names branch from ec53a8b to 02ded3f Compare June 14, 2022 16:03

phlogistonjohn approved these changes Jun 14, 2022

mergify bot merged commit 133e675 into ceph:master Jun 14, 2022
