Fix VM network clone IP calculation and consolidate cleanup #6461

bduffany · 2024-04-29T18:08:23Z

Instead of using vmIdx for the clone IP, we want to use the netIdx which we have locked. The netIdx is different from the vmIdx in the case where the vmIdx is already in use.

This also restructures the cleanup work that needs to be done after calling SetupVethPair. Because the clone IP depends on the netIdx and not the vmIdx, some of the current cleanup work that is done in firecracker.go would now need to know about the netIdx. But this feels a bit like implementation details leaking out. So, in order to avoid exposing the netIdx and having to plumb it around, this PR makes it so that the cleanup func already returned by SetupVethPair now also does this cleanup work. This makes the networking API a bit simpler since more of the cleanup work is done automatically.

Related issues: N/A

server/util/networking/networking.go

maggie-lou · 2024-04-29T19:17:20Z

server/util/networking/networking.go

+	// this func that needs to be explicitly cleaned up, we append a cleanup
+	// task to this list. Cleanup work is done in the reverse order in which
+	// it's added to this list (i.e., it's a stack).
+	var cleanupStack []func(ctx context.Context) error


I find this logic a bit complex to follow. For example it takes some mental work to figure out the order the cleanup tasks are executing (i.e. "if I want to cleanup the route before I unlock the net idx, when should I add it to the slice?") I would personally find it a bit more readable to use booleans

Ex.
if shouldCleanupRoute {}
if shouldCleanupRule {}

I did it this way because it keeps the cleanup logic close to the creation logic, which gives me more peace of mind, similar to how I expect to see a defer Unlock() after a Lock(). Also, similar to defer, it provides a natural "reverse-dependency-ordering" execution order - if A depends on B then that means A will get created before B, which means A's cleanup func will also be registered before B's, which means it'll execute after B's, which is what we want.

Overall I feel like this makes the code more robust, but let me know if you feel strongly about the readability aspect. I feel like with booleans, it's another potential source of bugs, because we have to explicitly keep track of the dependencies between the resources whereas the stack naturally does the cleanup in reverse dependency order.

maggie-lou · 2024-04-29T19:18:38Z

server/util/networking/networking.go

+	// If we return an error from this func then we need to clean up any
+	// resources that were created before returning.
+	defer func() {
+		if err != nil {


When err is shadowed - i.e. hostEndpointNet, err := hostNetAllocator.Get(vmIdx) - does this still work correctly?

Yeah - in that example, err is not being declared in a new scope (shadowed), it's being re-assigned in the top-level function body scope. The declaration happens in the named return value.

I was on the fence about whether to do this vs using a wrapper func but I think ultimately the wrapper func is more error prone because we have to remember to return cleanup, err instead of return nil, err everywhere in this func.

Ah okay. I wasn't sure if a new err variable was being created, but it seems like it won't, and only the value will get reassigned (https://go.dev/ref/spec#Short_variable_declarations)

maggie-lou · 2024-04-30T16:01:52Z

server/util/networking/networking.go

+	// If we return an error from this func then we need to clean up any
+	// resources that were created before returning.
+	defer func() {
+		if err != nil {


Ah okay. I wasn't sure if a new err variable was being created, but it seems like it won't, and only the value will get reassigned (https://go.dev/ref/spec#Short_variable_declarations)

bduffany requested review from tylerwilliams, vadimberezniker and maggie-lou April 29, 2024 18:08

bduffany force-pushed the fix-vm-networking branch 3 times, most recently from eb98f8d to fb57b6d Compare April 29, 2024 18:10

bduffany changed the title ~~Fix VM network clone IP calculation and improve cleanup~~ Fix VM network clone IP calculation and consolidate cleanup Apr 29, 2024

Fix VM network clone IP calculation and improve cleanup

b64b3b9

bduffany force-pushed the fix-vm-networking branch from fb57b6d to b64b3b9 Compare April 29, 2024 18:26

maggie-lou reviewed Apr 29, 2024

View reviewed changes

bduffany added 2 commits April 29, 2024 16:18

Clarify comment

ff26026

Add clarification about cleanupStack

57cb04a

bduffany requested a review from maggie-lou April 30, 2024 15:48

maggie-lou approved these changes Apr 30, 2024

View reviewed changes

bduffany merged commit 615c78b into master Apr 30, 2024
19 checks passed

bduffany deleted the fix-vm-networking branch April 30, 2024 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix VM network clone IP calculation and consolidate cleanup #6461

Fix VM network clone IP calculation and consolidate cleanup #6461

bduffany commented Apr 29, 2024 •

edited

maggie-lou Apr 29, 2024

bduffany Apr 29, 2024 •

edited

maggie-lou Apr 29, 2024

bduffany Apr 29, 2024 •

edited

maggie-lou Apr 30, 2024

maggie-lou Apr 30, 2024

Fix VM network clone IP calculation and consolidate cleanup #6461

Fix VM network clone IP calculation and consolidate cleanup #6461

Conversation

bduffany commented Apr 29, 2024 • edited

maggie-lou Apr 29, 2024

Choose a reason for hiding this comment

bduffany Apr 29, 2024 • edited

Choose a reason for hiding this comment

maggie-lou Apr 29, 2024

Choose a reason for hiding this comment

bduffany Apr 29, 2024 • edited

Choose a reason for hiding this comment

maggie-lou Apr 30, 2024

Choose a reason for hiding this comment

maggie-lou Apr 30, 2024

Choose a reason for hiding this comment

bduffany commented Apr 29, 2024 •

edited

bduffany Apr 29, 2024 •

edited

bduffany Apr 29, 2024 •

edited